BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013377
         (444 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 277/407 (68%), Positives = 326/407 (80%), Gaps = 6/407 (1%)

Query: 25  FGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK 84
           +GFGTFGFD HHRYSDPVKG+L+VDDLP+KGS  YY+++AHRD    + GR L +  N  
Sbjct: 36  YGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRD--ILIHGRKLVSD-NTS 92

Query: 85  TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS--CVHGL 142
           TPLTF +GN+TYR +SLGFLHY NVS+G P+LS++VALDTGSDLFWLPCDC +  CV GL
Sbjct: 93  TPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGL 152

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
              SG+ IDFNIY PN SSTS  +PCN+TLC  Q +CPSA S CPYQV+YLS+GT STG 
Sbjct: 153 QFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGV 212

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           LVED+LHL TD+ QS+++D++I FGCGRVQTGSFLDGAAPNGLFGLGM   SVPS LA +
Sbjct: 213 LVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLARE 272

Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
           G   NSFSMCFG DG GRISFGD GS GQGETPF+LRQ HPTYN++IT+++VGG   + E
Sbjct: 273 GYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITKINVGGRDADLE 332

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
           FSAIFDSGTSFTYLNDPAYT ISE+FN  AKEKR +S SD+PFEYCY +S NQTN E P 
Sbjct: 333 FSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIPT 392

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           VNL M+GG  F V DPIVIV  +  G  +YCL +VKS +VNIIG+ +
Sbjct: 393 VNLVMQGGSQFNVTDPIVIVILQ-GGASIYCLAIVKSGDVNIIGQNF 438


>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
          Length = 829

 Score =  526 bits (1355), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 263/408 (64%), Positives = 314/408 (76%), Gaps = 10/408 (2%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
           C    +FGFD HHR+SDPVK IL V DLP KG+  YY  +AHRDR FR  GR LAA  + 
Sbjct: 24  CHALNSFGFDIHHRFSDPVKEILGVHDLPDKGTRLYYVVMAHRDRIFR--GRRLAAAVH- 80

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
            +PLTF   N+TY++ + GFLH+ NVSVG P LSF+VALDTGSDLFWLPC+C  CV G+ 
Sbjct: 81  HSPLTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGV- 139

Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
            S+G+ I FNIY    SSTS  V CNS LCELQ+QCPS+ S CPY+V YLS+GT +TGFL
Sbjct: 140 ESNGEKIAFNIYDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFL 199

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           VEDVLHL TD+ ++K  D+RI+FGCG+VQTG+FLDGAAPNGLFGLGM   SVPSILA +G
Sbjct: 200 VEDVLHLITDDDETKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEG 259

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
           L  NSFSMCFGSDG GRI+FGD  S  QG+TPF+LR  HPTYNIT+TQ+ VGGNA + EF
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGGNAADLEF 319

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS--DLPFEYCYVLSPNQTNFEYP 381
            AIFDSGTSFT+LNDPAY QI+ +FNS  K +R +S+S  +LPFEYCY LS N+T  E P
Sbjct: 320 HAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKT-VELP 378

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            +NLTMKGG  + V DPIV +S E  G+ L CLGV+KS+NVNIIG+ +
Sbjct: 379 -INLTMKGGDNYLVTDPIVTISGE--GVNLLCLGVLKSNNVNIIGQNF 423


>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 544

 Score =  524 bits (1350), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 261/412 (63%), Positives = 318/412 (77%), Gaps = 13/412 (3%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDD---LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
           C+  G FG D HHR+SDPV  IL + +   LP KG+  YY+A+ HRDR F   GR LA  
Sbjct: 33  CYSLGKFGLDIHHRFSDPVTEILGIGNDELLPHKGTPQYYAAMVHRDRVFH--GRRLA-- 88

Query: 81  GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
            +  TP+TF+AGN+T+++ + GFLH+ NVSVG P L F+VALDTGSDLFWLPC+C SCV 
Sbjct: 89  DDRDTPITFAAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSCVR 148

Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
           GL + +G+VID NIY  + SST   VPCNS +C+ Q QC S+GS+C Y+V YLS+ T S+
Sbjct: 149 GLKTQNGKVIDLNIYELDKSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSS 207

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           GFLVEDVLHL TD  Q+K +D++I+ GCG+VQTG FL+GAAPNGLFGLGM+  SVPSILA
Sbjct: 208 GFLVEDVLHLITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILA 267

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
            +GLI +SFSMCFGSDG+GRI+FGD GS  QG+TPF+LR++HPTYN+TITQ+ VGG A +
Sbjct: 268 QKGLISDSFSMCFGSDGSGRITFGDTGSSDQGKTPFNLRESHPTYNVTITQIIVGGYAAD 327

Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE---TSTSDLPFEYCYVLSPNQTN 377
            EF AIFDSGTSFTYLNDPAYT ISE FNSL K  R    +  SDLPFEYCY +SP+QT 
Sbjct: 328 HEFHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQT- 386

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            E P +NLTMKGG  ++V DPIV VSSE +G  L CLG+ KSDN+NIIGREY
Sbjct: 387 IEVPFLNLTMKGGDDYYVTDPIVPVSSEVEG-NLLCLGIQKSDNLNIIGREY 437


>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score =  524 bits (1349), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 257/412 (62%), Positives = 322/412 (78%), Gaps = 10/412 (2%)

Query: 23  CCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           CC+G  TFGFD HHR+SD +KG+L +DD+P+KG+  YY+ +AHRDR FR  GR LA   +
Sbjct: 26  CCYGLSTFGFDIHHRFSDQIKGMLGIDDVPQKGTPQYYAVMAHRDRVFR--GRRLAG-AD 82

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG- 141
             +PLTF+AGNDT+++ S GFLH+ NVSVG P L F+VALDTGSDLFWLPCDC+SCVHG 
Sbjct: 83  HHSPLTFAAGNDTHQIASSGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCDCISCVHGG 142

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCN-STLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
           L + +G+++ FN Y  + SSTS++V CN ST C  ++QCPSAGS C YQV YLS+ T S 
Sbjct: 143 LRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSR 202

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           GF+VEDVLHL TD+ Q+K  D+RI+FGCG+VQTG FL+GAAPNGLFGLGMD  SVPSILA
Sbjct: 203 GFVVEDVLHLITDDDQTKDADTRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILA 262

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
            +GLI NSFSMCFGSD  GRI+FGD GSP Q +TPF++R+ HPTYNITIT++ V  +  +
Sbjct: 263 REGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPFNVRKLHPTYNITITKIIVEDSVAD 322

Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCYVLSPNQTN 377
            EF AIFDSGTSFTY+NDPAYT+I E +NS  K KR +S    S++PF+YCY +S +QT 
Sbjct: 323 LEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQT- 381

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            E P +NLTMKGG  ++V DPI+ VSSE +G  L CLG+ KSD+VNIIG+ +
Sbjct: 382 IEVPFLNLTMKGGDDYYVMDPIIQVSSEEEG-DLLCLGIQKSDSVNIIGQNF 432


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  514 bits (1323), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 250/419 (59%), Positives = 316/419 (75%), Gaps = 7/419 (1%)

Query: 12  VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFR 71
           +L+++ S     C G G FGF+FHHR+SD V G+L  D LP + S  YY  +AHRDR   
Sbjct: 15  ILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL-- 72

Query: 72  LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
           +RGR LA++  D++ +TF+ GN+T R+N+LGFLHY NV+VG P+  F+VALDTGSDLFWL
Sbjct: 73  IRGRRLASE--DQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWL 130

Query: 132 PCDC-VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQV 190
           PCDC  +CV  L +  G  +D NIYSPN SSTSSKVPCNSTLC    +C S  S+CPYQ+
Sbjct: 131 PCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQI 190

Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
           RYLS+GT STG LVEDVLHL + EK SK + +RI+ GCG VQTG F DGAAPNGLFGLG+
Sbjct: 191 RYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGL 250

Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
           +  SVPS+LA +G+  NSFSMCFG DG GRISFGDKGS  Q ETP ++RQ HPTYN+T+T
Sbjct: 251 EDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVT 310

Query: 311 QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
           Q+SVGGN  + EF A+FD+GTSFTYL D  YT ISE+FNSLA +KR  + S+LPFEYCY 
Sbjct: 311 QISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYA 370

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           +SPN+ +FEYP VNLTMKGG  + V  P+++V  E     +YCL ++KS++++IIG+ +
Sbjct: 371 VSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDT--VVYCLAIMKSEDISIIGQNF 427


>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 263/408 (64%), Positives = 313/408 (76%), Gaps = 10/408 (2%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
           C    +FGFD HHR+SDPVK IL V DLP KG+  YY A+AHRDR FR  GR LAA    
Sbjct: 24  CHALHSFGFDIHHRFSDPVKEILGVHDLPDKGTRQYYVAMAHRDRIFR--GRRLAA--GY 79

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
            +PLTF   N+TY++ + GFLH+ NVSVG P LSF+VALDTGSDLFWLPC+C  CVHG+ 
Sbjct: 80  HSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVHGIG 139

Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
            S+G+ I FNIY    SSTS  V CNS+LCELQ+QCPS+ + CPY+V YLS+GT +TGFL
Sbjct: 140 LSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGTSTTGFL 199

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           VEDVLHL TD+ ++K  D+RI+FGCG+VQTG+FLDGAAPNGLFGLGM   SVPSILA +G
Sbjct: 200 VEDVLHLITDDDKTKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAKEG 259

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
           L  NSFSMCFGSDG GRI+FGD  S  QG+TPF+LR  HPTYNIT+TQ+ VG    + EF
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLEF 319

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS--DLPFEYCYVLSPNQTNFEYP 381
            AIFDSGTSFTYLNDPAY QI+ +FNS  K +R +++S  +LPFEYCY LSPNQT  E  
Sbjct: 320 HAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYELSPNQT-VELS 378

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            +NLTMKGG  + V DPIV VS E  G+ L CLGV+KS+NVNIIG+ +
Sbjct: 379 -INLTMKGGDNYLVTDPIVTVSGE--GINLLCLGVLKSNNVNIIGQNF 423


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 241/396 (60%), Positives = 308/396 (77%), Gaps = 7/396 (1%)

Query: 35  HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND 94
           HHR+SD V G+L  D LP + S  YY  +AHRDR   +RGR LA +  D++ +TFS GN+
Sbjct: 38  HHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL--IRGRRLANE--DQSLVTFSDGNE 93

Query: 95  TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
           T R+++LGFLHY NV+VG P+  F+VALDTGSDLFWLPCDC +CV  L +  G  +D NI
Sbjct: 94  TVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153

Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
           YSPN SSTS+KVPCNSTLC    +C S  S+CPYQ+RYLS+GT STG LVEDVLHL +++
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           K SK++ +R++FGCG+VQTG F DGAAPNGLFGLG++  SVPS+LA +G+  NSFSMCFG
Sbjct: 214 KSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
           +DG GRISFGDKGS  Q ETP ++RQ HPTYNIT+T++SVGGN  + EF A+FDSGTSFT
Sbjct: 274 NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFT 333

Query: 335 YLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           YL D AYT ISE+FNSLA +KR +T+ S+LPFEYCY LSPN+ +F+YP VNLTMKGG  +
Sbjct: 334 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSY 393

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            V  P+V++    K   +YCL ++K ++++IIG+ +
Sbjct: 394 PVYHPLVVIPM--KDTDVYCLAIMKIEDISIIGQNF 427


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/396 (60%), Positives = 306/396 (77%), Gaps = 7/396 (1%)

Query: 35  HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND 94
           HHR+SD V G+L  D LP + S  YY  +AHRDR   +RGR LA +  D++ +TFS GN+
Sbjct: 38  HHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL--IRGRRLANE--DQSLVTFSDGNE 93

Query: 95  TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
           T R+++LGFLHY NV+VG P+  F+VALDTGSDLFWLPCDC +CV  L +  G  +D NI
Sbjct: 94  TIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153

Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
           YSPN SSTS+KVPCNSTLC    +C S  SNCPYQ+RYLS+GT STG LVEDVLHL +++
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           K SK++ +R++ GCG+VQTG F DGAAPNGLFGLG++  SVPS+LA +G+  NSFSMCFG
Sbjct: 214 KSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
           +DG GRISFGDKGS  Q ETP ++RQ HPTYNIT+T++SV GN  + EF A+FDSGTSFT
Sbjct: 274 NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDLEFDAVFDSGTSFT 333

Query: 335 YLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           YL D AYT ISE+FNSLA +KR +T+ S+LPFEYCY LSPN+ +F+YP VNLTMKGG  +
Sbjct: 334 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSY 393

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            V  P+V++    K   +YCL ++K ++++IIG+ +
Sbjct: 394 PVYHPLVVIPM--KDTDVYCLAILKIEDISIIGQNF 427


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score =  497 bits (1279), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 250/412 (60%), Positives = 310/412 (75%), Gaps = 11/412 (2%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN- 82
           C+G  +FGFD HHR+SDPVKGIL +D++P KGS  YY A+AHRDR FR  GR LA  G+ 
Sbjct: 33  CYGSSSFGFDIHHRFSDPVKGILGIDNIPDKGSREYYVAMAHRDRVFR--GRRLADGGDV 90

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
           D+  LTFS  N TY+++  G+LH+ NVSVG PA S++VALDTGSDLFWLPC+C  CVHG+
Sbjct: 91  DQKLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTKCVHGI 150

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQC-PSAGSNCPYQVRYLSDGTMSTG 201
             S+GQ I FNIY    SSTS  V CNS+LCE + QC  S+G  CPYQV YLS+ T +TG
Sbjct: 151 QLSTGQKIAFNIYDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTG 210

Query: 202 FLVEDVLHLATD-EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           FLVEDVLHL TD + Q++  +  I+FGCG+VQTG+FLDGAAPNGLFGLGM   SVPSILA
Sbjct: 211 FLVEDVLHLITDNDDQTQHANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILA 270

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAV 319
            QGL  NSFSMCF +DG GRI+FGD  S   QG+TPF++R +H TYNIT+TQ+ VGGN+ 
Sbjct: 271 KQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSA 330

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE--TSTSDLPFEYCYVLSPNQTN 377
           + EF+AIFD+GTSFTYLN+PAY QI+++F+S  K +R   +++ DLPFEYCY L  NQT 
Sbjct: 331 DLEFNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQT- 389

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            E P +NLTMKGG  +FV DPI+       G  + CL V+KS+NVNIIG+ +
Sbjct: 390 IEVPNINLTMKGGDNYFVMDPIITSGGGNNG--VLCLAVLKSNNVNIIGQNF 439


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 232/403 (57%), Positives = 299/403 (74%), Gaps = 6/403 (1%)

Query: 27  FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
           FG+F F+ HH YS  V+ IL     P +G+  YY+A+   D +   R  G   Q  D  P
Sbjct: 55  FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDHFVHSRRLG---QVQDHRP 111

Query: 87  LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
           LTF +GN+T R++ LGFL+Y  V+VG P + ++VALDTGSDLFWLPCDCV+C+ GLN++ 
Sbjct: 112 LTFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQ 171

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
           G V +FNIYSPN SSTS +V C+S+LC    QC S    CPYQV YLSD T STG+LVED
Sbjct: 172 GPV-NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 230

Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
           +LHL T++ QSK V++RI+ GCG+ Q+G+FL  AAPNGLFGLG++  SVPSILAN GLI 
Sbjct: 231 ILHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLIS 290

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
           NSFS+CFG    GRI FGDKGSPGQ ETPF+L + HPTYN++ITQ+ VGG+  + + + I
Sbjct: 291 NSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVI 350

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           FDSGTSFTYLNDPAY+  ++ F S+ +EK+ T  SD+PFE CY LSPNQT F YP++NLT
Sbjct: 351 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 410

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           MKGGG F +N PIV++S+E K   L+CL + +SD++NIIG+ +
Sbjct: 411 MKGGGHFVINHPIVLISTESK--RLFCLAIARSDSINIIGQNF 451


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score =  483 bits (1244), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 232/403 (57%), Positives = 299/403 (74%), Gaps = 6/403 (1%)

Query: 27  FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
           FG+F F+ HH YS  V+ IL     P +G+  YY+A+   D +   R  G   Q  D  P
Sbjct: 32  FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLG---QVQDHRP 88

Query: 87  LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
           LTF +GN+T R++ LGFL+Y  V+VG P + ++VALDTGSDLFWLPCDCV+C+ GLN++ 
Sbjct: 89  LTFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQ 148

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
           G V +FNIYSPN SSTS +V C+S+LC    QC S    CPYQV YLSD T STG+LVED
Sbjct: 149 GPV-NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 207

Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
           +LHL T++ QSK V++RI+ GCG+ Q+G+FL  AAPNGLFGLG++  SVPSILAN GLI 
Sbjct: 208 ILHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLIS 267

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
           NSFS+CFG    GRI FGDKGSPGQ ETPF+L + HPTYN++ITQ+ VGG+  + + + I
Sbjct: 268 NSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVI 327

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           FDSGTSFTYLNDPAY+  ++ F S+ +EK+ T  SD+PFE CY LSPNQT F YP++NLT
Sbjct: 328 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 387

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           MKGGG F +N PIV++S+E K   L+CL + +SD++NIIG+ +
Sbjct: 388 MKGGGHFVINHPIVLISTESK--RLFCLAIARSDSINIIGQNF 428


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 226/393 (57%), Positives = 290/393 (73%), Gaps = 32/393 (8%)

Query: 63  LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGF----------------LHY 106
           +AHRDR   +RGR LA +  D++ +TFS GN+T R+++LGF                LHY
Sbjct: 1   MAHRDRL--IRGRRLANE--DQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHY 56

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            NV+VG P+  F+VALDTGSDLFWLPCDC +CV  L +  G  +D NIYSPN SSTS+KV
Sbjct: 57  ANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 116

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           PCNSTLC    +C S  S+CPYQ+RYLS+GT STG LVEDVLHL +++K SK++ +R++F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDK 286
           GCG+VQTG F DGAAPNGLFGLG++  SVPS+LA +G+  NSFSMCFG+DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236

Query: 287 GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
           GS  Q ETP ++RQ HPTYNIT+T++SVGGN  + EF A+FDSGTSFTYL D AYT ISE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296

Query: 347 TFNSLAKEKR-ETSTSDLPFEYCYVLS---------PNQTNFEYPVVNLTMKGGGPFFVN 396
           +FNSLA +KR +T+ S+LPFEYCY L          PN+ +F+YP VNLTMKGG  + V 
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVY 356

Query: 397 DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            P+V++    K   +YCL ++K ++++IIG+ +
Sbjct: 357 HPLVVIPM--KDTDVYCLAIMKIEDISIIGQNF 387


>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
 gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 220/418 (52%), Positives = 294/418 (70%), Gaps = 9/418 (2%)

Query: 13  LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAV-DDLPKKGSFAYYSALAHRDRYFR 71
           LLI +   +  C G   F F  HHR+SD  K    +  + P+KGSF YY+ALAHRD+   
Sbjct: 10  LLITIWVFSKTCKG-RVFTFKMHHRFSDSFKNWSGLTRNWPEKGSFEYYAALAHRDQM-- 66

Query: 72  LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
           LRGR L+   +    L FS GN T+R++SLGFLHYT V +G P + F+VALDTGSDLFW+
Sbjct: 67  LRGRRLS---DADASLAFSDGNSTFRISSLGFLHYTTVELGTPGVKFMVALDTGSDLFWV 123

Query: 132 PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVR 191
           PCDC  C     +S     + +IY+P  SSTS KV CN+ +C  + +C    S+CPY V 
Sbjct: 124 PCDCSRCAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVS 183

Query: 192 YLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMD 251
           Y+S  T ++G LV+DVLHL T++   + V++ ++FGCG+VQ+GSFLD AAPNGLFGLGM+
Sbjct: 184 YVSAQTSTSGILVKDVLHLTTEDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGME 243

Query: 252 KTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
           K SVPS+L+ +GLI +SFSMCFG DG GRISFGDKGSP Q ETPF++   HPTYN+T+TQ
Sbjct: 244 KISVPSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPFNVNPAHPTYNVTVTQ 303

Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
             VG   ++ EF+A+FDSGTSFTY+ DPAY+++SE F+SLA++KR      +PFEYCY +
Sbjct: 304 ARVGTMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCYDM 363

Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           SP+      P ++LTMKGG  F V DPI+++S++ +   +YCL VVKS  +NIIG+ +
Sbjct: 364 SPDANASLVPSMSLTMKGGRHFTVYDPIIVISTQNE--IVYCLAVVKSTELNIIGQNF 419


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 220/425 (51%), Positives = 290/425 (68%), Gaps = 15/425 (3%)

Query: 10  VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK-----GILAVDDLPKKGSFAYYSALA 64
           +  LL L  CC   C G   + F  HHR+S+PV+         +   P++G+  YY+ LA
Sbjct: 8   IVSLLSLWECCQ--CHGH-VYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYAELA 64

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
            RDR   LRGR L+        L FS GN T+R++SLGFLHYT V +G P + F+VALDT
Sbjct: 65  DRDRL--LRGRKLS---QIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDT 119

Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
           GSDLFW+PCDC  C    +++     D N+Y+PN SSTS KV CN++LC  + QC    S
Sbjct: 120 GSDLFWVPCDCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFS 179

Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
           NCPY V Y+S  T ++G LVEDVLHL  ++     V++ + FGCG++Q+GSFLD AAPNG
Sbjct: 180 NCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNG 239

Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT 304
           LFGLGM+K SVPS+L+ +G   +SFSMCFG DG GRISFGDKGS  Q ETPF+L  +HPT
Sbjct: 240 LFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPT 299

Query: 305 YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
           YNIT+TQV VG   ++ EF+A+FDSGTSFTYL DP YT+++E+F+S  +++R  S S +P
Sbjct: 300 YNITVTQVRVGTTVIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIP 359

Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
           FEYCY +SP+      P V+LTM GG  F V DPI+I+S++ +   +YCL VVKS  +NI
Sbjct: 360 FEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE--LVYCLAVVKSAELNI 417

Query: 425 IGREY 429
           IG+ +
Sbjct: 418 IGQNF 422


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score =  437 bits (1124), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 221/424 (52%), Positives = 291/424 (68%), Gaps = 14/424 (3%)

Query: 13  LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAH 65
           ++ILLS           F F  HHR+S+PVK             + P KGSF YY+ LAH
Sbjct: 9   IVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYYAELAH 68

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           RDR   LRGR L+   +    LTFS GN T+R++SLGFLHYT VS+G P   F+VALDTG
Sbjct: 69  RDR--ALRGRRLS---DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTG 123

Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
           SDLFW+PCDC  C     ++     + +IY+P  SSTS KV C+++LC  + +C    SN
Sbjct: 124 SDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTFSN 183

Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           CPY V Y+S  T ++G LVEDVLHL T++ + + V++ ++FGCG+VQTGSFLD AAPNGL
Sbjct: 184 CPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGL 243

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTY 305
           FGLG++K SVPSIL+ +G   +SFSMCFG DG GRISFGDKGSP Q ETPF+L   HPTY
Sbjct: 244 FGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPFNLNALHPTY 303

Query: 306 NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
           NIT+TQV VG   ++ +F+A+FDSGTSFTYL DP YT + ++F+S A++ R    S +PF
Sbjct: 304 NITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPF 363

Query: 366 EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
           E+CY +SP +     P ++LTMKGG  F V DPI+I+SS+ +   +YC+ VV+S  +NII
Sbjct: 364 EFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIIISSQSE--LIYCMAVVRSAELNII 421

Query: 426 GREY 429
           G+ +
Sbjct: 422 GQNF 425


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 223/425 (52%), Positives = 293/425 (68%), Gaps = 19/425 (4%)

Query: 13  LLILLSCCAGCCFGFGTFGFDFHHRYSDPVK--------GILAVDDLPKKGSFAYYSALA 64
           + I+ S     C G   + F  HHR+S+PV+        GI A    P+KG+  YY+ LA
Sbjct: 5   VFIIASLFLSLCHGH-VYTFTMHHRHSEPVRKWSHSTASGIPAP---PEKGTVEYYAELA 60

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
            RDR   LRGR L+ Q +D   L FS GN T+R++SLGFLHYT V +G P + F+VALDT
Sbjct: 61  DRDRL--LRGRKLS-QIDDG--LAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDT 115

Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
           GSDLFW+PCDC  C    +S+     D N+Y+PN SSTS KV CN++LC  + QC    S
Sbjct: 116 GSDLFWVPCDCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLS 175

Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
           NCPY V Y+S  T ++G LVEDVLHL  ++     V++ + FGCG++Q+GSFLD AAPNG
Sbjct: 176 NCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNG 235

Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT 304
           LFGLGM+K SVPS+L+ +G   +SFSMCFG DG GRISFGDKGS  Q ETPF+L  +HPT
Sbjct: 236 LFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPT 295

Query: 305 YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
           YNIT+TQV VG   ++ EF+A+FDSGTSFTYL DP YT+++E+F+S  +++R  S S +P
Sbjct: 296 YNITVTQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIP 355

Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
           FEYCY +SP+      P V+LTM GG  F V DPI+I+S++ +   +YCL VVK+  +NI
Sbjct: 356 FEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE--LVYCLAVVKTAELNI 413

Query: 425 IGREY 429
           IG+ +
Sbjct: 414 IGQNF 418


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 220/436 (50%), Positives = 300/436 (68%), Gaps = 14/436 (3%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK------GILAVDDLPKK 54
           M+  +  + + ++ IL+    G C G   F F+ HHR+SD VK      G  A    P K
Sbjct: 1   MSCCFFKTTLFLIPILMLLSFGSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFA--KFPPK 57

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP-LTFSAGNDTYRLNSLGFLHYTNVSVGQ 113
           GSF Y++AL  RD  + +RGR L+   ++    LTFS GN T R++SLGFLHYT V +G 
Sbjct: 58  GSFEYFNALVLRD--WLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGT 115

Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
           P + F+VALDTGSDLFW+PCDC  C     ++     + +IY+P  S+T+ KV CN++LC
Sbjct: 116 PGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC 175

Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
             + QC    S CPY V Y+S  T ++G L+EDV+HL T++K  + V++ ++FGCG+VQ+
Sbjct: 176 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQS 235

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE 293
           GSFLD AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS  Q E
Sbjct: 236 GSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEE 295

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
           TPF+L  +HP YNIT+T+V VG   ++ EF+A+FD+GTSFTYL DP YT +SE+F+S A+
Sbjct: 296 TPFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQ 355

Query: 354 EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC 413
           +KR +  S +PFEYCY +S +      P ++LTMKG   F +NDPI+++S+E  G  +YC
Sbjct: 356 DKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVISTE--GELVYC 413

Query: 414 LGVVKSDNVNIIGREY 429
           L +VKS  +NIIG+ Y
Sbjct: 414 LAIVKSSELNIIGQNY 429


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 210/403 (52%), Positives = 281/403 (69%), Gaps = 10/403 (2%)

Query: 30  FGFDFHHRYSDPVKGI---LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
           F F  HHR+SD +K +       + P KGSF YY+ LAHRD+   LRGR L    N + P
Sbjct: 28  FTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAELAHRDQM--LRGRKLY---NVEAP 82

Query: 87  LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
           L FS GN T+R++SLGFLHYT V +G P + F+VALDTGSDLFW+PCDC  C      + 
Sbjct: 83  LAFSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAY 142

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
               + +IY P  SSTS KV CN+ LC  + +C    S+CPY V Y+S  T ++G LVED
Sbjct: 143 ASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVED 202

Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
           VLHL +++   +S+ + ++FGCG+VQ+GSFL+ AAPNGLFGLGMD+ SVPSIL+ +GL  
Sbjct: 203 VLHLTSEDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTA 262

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
           +SFSMCFG DG GRISFGDKGSP Q ETPF+   +HP+YNI++TQV VG   V+ +F+A+
Sbjct: 263 DSFSMCFGHDGVGRISFGDKGSPDQEETPFNSNPSHPSYNISVTQVRVGTTLVDVDFTAL 322

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           FDSGTSFTYL +P Y  +SE F++ A++KR      +PFEYCY +SP   +   P ++LT
Sbjct: 323 FDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLT 382

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           MKG G F V DPI++++++ +   +YCL +VKS  +NIIG+ +
Sbjct: 383 MKGRGHFTVFDPIIVITTQNE--LVYCLAIVKSTELNIIGQNF 423


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 215/412 (52%), Positives = 289/412 (70%), Gaps = 10/412 (2%)

Query: 22  GCCFGFGTFGFDFHHRYSDPVKGILAVD----DLPKKGSFAYYSALAHRDRYFRLRGRGL 77
           G C G   F F+ HHR+SD VK            P KGSF Y++AL  RD  + +RGR L
Sbjct: 22  GSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFVKFPPKGSFEYFNALVLRD--WLIRGRRL 78

Query: 78  AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
           +   ++ + LTFS GN T R++SLGFLHYT V +G P + F+VALDTGSDLFW+PCDC  
Sbjct: 79  SDSESESS-LTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGK 137

Query: 138 CVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
           C     ++     + +IY+P  S+T+ KV CN++LC  + QC    S CPY V Y+S  T
Sbjct: 138 CAPTEGATYASEFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 197

Query: 198 MSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
            ++G L+EDV+HL T++K  + V++ ++FGCG+VQ+GSFLD AAPNGLFGLGM+K SVPS
Sbjct: 198 STSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPS 257

Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN 317
           +LA +GL+ +SFSMCFG DG GRISFGDKGS  Q ETPF+L  +HP YNIT+T+V VG  
Sbjct: 258 VLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTT 317

Query: 318 AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
            ++ EF+A+FD+GTSFTYL DP YT +SE+F+S A++KR +  S +PFEYCY +S +   
Sbjct: 318 LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANA 377

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
              P ++LTMKG   F +NDPI+++S+E  G  +YCL +VKS  +NIIG+ Y
Sbjct: 378 SLIPSLSLTMKGNSHFTINDPIIVISTE--GELVYCLAIVKSSELNIIGQNY 427


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  421 bits (1081), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 231/411 (56%), Positives = 278/411 (67%), Gaps = 43/411 (10%)

Query: 63  LAHRDRYFRLRGRGLAA-----QGNDKTPLTFSAGNDTYRLNSLGF-------------- 103
           +A RDR   + GR LA        N+KT LTF  GN+TYR++ LG               
Sbjct: 1   MAQRDRV--IHGRRLATSTGGDNKNNKTLLTFYYGNETYRIDGLGLRNSCVSLYSNGLFG 58

Query: 104 --LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
             LHY NVSVG P++SF+VALDTGS+L WLPCDC SCVH L S SG V D NIYSPNTSS
Sbjct: 59  YILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTV-DLNIYSPNTSS 117

Query: 162 TSSKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           TS KVPCNSTLC   ++  CPS  SNCPYQV YLS+GT +TG++V+D+LHL +D+ QSK+
Sbjct: 118 TSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISDDSQSKA 177

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           VD++I+FGCG+VQTGSFL G APNGLFGLGM   SVPS LA+ G    SFSMCF  +G G
Sbjct: 178 VDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIG 237

Query: 280 RISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLND 338
           RISFGDKGS GQGET F+  Q   + YNI+ITQ S+GG A +  +SAIFDSGTSFTYLND
Sbjct: 238 RISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLVYSAIFDSGTSFTYLND 297

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS--------------PNQTNFEYPVVN 384
           PAYT I+E+FN L KE R +ST  +PF+YCY +                NQT    P V 
Sbjct: 298 PAYTLIAESFNKLVKETRRSST-QVPFDYCYDIRSFISAQILPFSCAYANQTEPTIPAVT 356

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNI 435
           L M GG  F V DPIV+V     G  +YCLG++KS +VNIIG+ +   + I
Sbjct: 357 LVMSGGDYFNVTDPIVLVQLA-DGSAVYCLGMIKSGDVNIIGQNFMTGHRI 406


>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 525

 Score =  410 bits (1053), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 207/414 (50%), Positives = 275/414 (66%), Gaps = 21/414 (5%)

Query: 30  FGFDFHHRYSDPVKGILAV-------DDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           F F  HHR+SD +K    V       D  P KG+  YY+ LA RDR+FR  G+ L+    
Sbjct: 28  FSFKMHHRFSDQLKNWSGVSGKFTLPDSWPVKGTIEYYAQLAFRDRFFR--GQRLSEFDG 85

Query: 83  DKTPLTFSAGNDTYRLNSLGFLH-------YTNVSVGQPALSFIVALDTGSDLFWLPCDC 135
              PL FS GN ++R++SLGF         YT V +G P   F+VALDTGSDLFW+PCDC
Sbjct: 86  ---PLAFSDGNSSFRISSLGFALFDVFFFFYTTVQLGTPGTKFMVALDTGSDLFWVPCDC 142

Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSD 195
             C     S      + ++YSP  SSTS  VPCN+ LC  + QC  A  NCPY V Y+S 
Sbjct: 143 SRCAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSA 202

Query: 196 GTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
            T +TG L+ED+LHL T+ K S+ + + I+FGCG+VQ+GSFLD AAPNGLFGLGM++ SV
Sbjct: 203 ETSTTGILIEDLLHLKTEHKHSEPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 262

Query: 256 PSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVG 315
           PSIL+ +GL+ NSFSMCF  DG GRI+FGDKGS  Q ETPF+L Q HP YNIT+T + VG
Sbjct: 263 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVG 322

Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
              ++ + +A+FDSGTSF+Y  DP Y+++S +F++  ++ R      +PFEYCY +SP+ 
Sbjct: 323 TTLIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSPDA 382

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
                P ++LTMKGGGPF V DPI+++S++ +   +YCL VVKS  +NIIG+ +
Sbjct: 383 NASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKSAELNIIGQNF 434


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 217/448 (48%), Positives = 289/448 (64%), Gaps = 17/448 (3%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFG--FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFA 58
           MAS++ +    +L++ +   AG        +F FD HHR+SD +KGI   + LP+K +  
Sbjct: 1   MASTFSSGAQMLLVLSVFILAGSLRSGDAASFKFDIHHRFSDSIKGIFHSEGLPEKHTPG 60

Query: 59  YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           YY+ + HRDR   +RGR LAA   D T LTF+ GNDT  +  LGFL+Y NVSVG P+L F
Sbjct: 61  YYATMVHRDRL--VRGRRLAASDVD-TQLTFAYGNDTAFIPDLGFLYYANVSVGTPSLDF 117

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
           +VALDTGSDLFWLPC+C SC   LN+S+G     N YSPN S+TSS VPC S+LC    +
Sbjct: 118 LVALDTGSDLFWLPCECSSCFTYLNTSNGGKFMLNHYSPNDSTTSSTVPCTSSLC---NR 174

Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
           C S  + CPY++RYLS  T S G+LVEDVLHLATD+   K V+++I+FGCG VQTG F  
Sbjct: 175 CTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPVEAKITFGCGTVQTGIFAT 234

Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
            AAPNGL GLGM+K SVPS LA+QGL  NSFSMCFG+DG GRI FGD G   Q +TPF+ 
Sbjct: 235 TAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDTGPADQKQTPFNT 294

Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
              + +YN+T   ++VGG   +  F+AIFDSGTSFTYL +PAY+ I++  ++  K KR +
Sbjct: 295 MLEYQSYNVTFNVINVGGEPNDVPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRYS 354

Query: 359 STS-DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGL-------- 409
               + PFEYCY + P    F+Y  +N TMKGG  F   D  V +  +   +        
Sbjct: 355 LFGPNFPFEYCYEIPPGAKEFQYLTLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETT 414

Query: 410 YLYCLGVVKSDNVNIIGREYPIANNISL 437
           ++ CL + KS ++++IG+ +     I+ 
Sbjct: 415 HVACLAIAKSTDIDLIGQNFMTGYRITF 442


>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 553

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 215/441 (48%), Positives = 279/441 (63%), Gaps = 42/441 (9%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGIL-----AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLA 78
           C     F F  HHRYS+PVK             P+KGS  YY+ LA RDR+  LRGR L+
Sbjct: 20  CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRF--LRGRRLS 77

Query: 79  AQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC 138
                   L FS GN T+R++SLGFLHYT + +G P + F+VALDTGSDLFW+PCDC  C
Sbjct: 78  ---QFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRC 134

Query: 139 ----VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
                    S+     D ++Y+PN SSTS KV CN++LC  + QC    SNCPY V Y+S
Sbjct: 135 SATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVS 194

Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
             T ++G LVEDVLHL   +     V++ + FGCG+VQ+GSFLD AAPNGLFGLGM+K S
Sbjct: 195 AETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKIS 254

Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
           VPS+L+ +G   +SFSMCFG DG GRISFGDKGS  Q ETPF++  +HPTYNITI QV V
Sbjct: 255 VPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRV 314

Query: 315 GGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET--------------------------F 348
           G   ++ EF+A+FDSGTSFTYL DP Y+++SE+                          F
Sbjct: 315 GTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQF 374

Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
           +S  +++R    S +PF+YCY +SP+      P ++LTM GG  F V DPI+I+S++ + 
Sbjct: 375 HSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE- 433

Query: 409 LYLYCLGVVKSDNVNIIGREY 429
             +YCL VVKS  +NIIG+ +
Sbjct: 434 -LVYCLAVVKSAELNIIGQNF 453


>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 530

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 205/399 (51%), Positives = 269/399 (67%), Gaps = 5/399 (1%)

Query: 32  FDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS 90
            D HHRYS  V+G+   +   P  G+  YY+ALA  D   R R    AA G     L F+
Sbjct: 27  LDVHHRYSAAVRGLAGHLRAPPPAGTAEYYAALAGHD--LRRRSLAAAAGGGGAGNLAFA 84

Query: 91  AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
            GNDTYRLN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C    +   G  +
Sbjct: 85  DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGD-L 143

Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
            F++YSP  SSTS KVPC+S+LC+ Q  C +A ++CPY ++YLS+ T S G LVEDVL+L
Sbjct: 144 KFDMYSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYL 203

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            T+  QSK   + I+FGCG+VQ+GSFL  AAPNGL GLGMD  SVPS+LA++G+  NSFS
Sbjct: 204 TTESGQSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFS 263

Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
           MCFG DG GRI+FGD GS  Q ETP ++ + +P YNI+IT   VGG + + +FSA+ DSG
Sbjct: 264 MCFGEDGHGRINFGDTGSSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTKFSAVVDSG 323

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           TSFT L+DP YT+I+ TFN+  KE R+   + +PFEYCY +S  Q     P ++LT KGG
Sbjct: 324 TSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSIS-AQGAVNPPNISLTAKGG 382

Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
             F VN PI+ ++        YCL ++KS+ VN+IG  +
Sbjct: 383 SIFPVNGPIITITDTSSRPIAYCLAIMKSEGVNLIGENF 421


>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 529

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 207/405 (51%), Positives = 267/405 (65%), Gaps = 15/405 (3%)

Query: 32  FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA 91
            D HHRYS  V+G   +   P  G+  YY+ALA  D    LR R L+             
Sbjct: 34  LDVHHRYSATVRGWAGLRRGPSPGTAEYYAALAGHDD---LRRRSLSLAAAPAPGAGGPF 90

Query: 92  ----GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
               GNDTYRLN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C   L+S   
Sbjct: 91  AFVDGNDTYRLNQFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAP-LSSPDY 149

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
             + F++YSP  SSTS KVPC+S +C+LQ +C +A ++CPY++ YLSD T S G LVEDV
Sbjct: 150 GNLKFDVYSPRKSSTSRKVPCSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDV 209

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           ++LAT+   SK   + I+FGCG+VQTGSFL  AAPNGL GLGMD  SVPS+LA+QG+  N
Sbjct: 210 MYLATESGHSKITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAAN 269

Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIF 327
           SFSMCFG DG GRI+FGD GS  Q ETP ++ + +P YNI+I     GG   + +FSA+ 
Sbjct: 270 SFSMCFGEDGHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTKFSAVV 329

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGTSFT L+DP YT+I+  F+   KEKR  + S LPFEYCY +S ++     P ++LT 
Sbjct: 330 DSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTIS-SKGAVSPPNISLTA 388

Query: 388 KGGGPFFVNDPIVI---VSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           KGG  F V DPI+    +SS P G   YCL ++KS+ VN+IG  +
Sbjct: 389 KGGSVFPVKDPIITITDISSSPVG---YCLAIMKSEGVNLIGENF 430


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 209/399 (52%), Positives = 267/399 (66%), Gaps = 8/399 (2%)

Query: 32  FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK-TPLTFS 90
            D HHRYS       A    P  G+  YY+ALA  D    LR R L   G        F+
Sbjct: 29  LDVHHRYSA-AVRRWAAAAAPPHGTAEYYAALAGHDG---LRRRSLGVGGGGGGAEFAFA 84

Query: 91  AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
            GNDTYRLN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C   L S +   +
Sbjct: 85  DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAP-LQSPNYGSL 143

Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
            F++YSP  S+TS KVPC+S LC+LQ  C S  ++CPY ++YLSD T S+G LVEDVL+L
Sbjct: 144 KFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL 203

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            +D  QSK V + I FGCG+VQTGSFL  AAPNGL GLGMD  SVPS+LA++GL  NSFS
Sbjct: 204 TSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFS 263

Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
           MCFG DG GRI+FGD GS  Q ETP ++ + +P YNITIT ++VG  +++ EFSAI DSG
Sbjct: 264 MCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSG 323

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           TSFT L+DP YTQI+ +F++  +  R    S +PFE+CY +S N     +P V+LT KGG
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGG 381

Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
             F VNDPI+ ++        YCL ++KS+ VN+IG  +
Sbjct: 382 SIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 420


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 208/399 (52%), Positives = 267/399 (66%), Gaps = 8/399 (2%)

Query: 32  FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK-TPLTFS 90
            D HHRYS       A    P  G+  YY+ALA  D    LR R L   G        F+
Sbjct: 29  LDVHHRYSA-AVRRWAAAAAPPHGTAEYYAALAGHDG---LRRRSLGVGGGGGGAEFAFA 84

Query: 91  AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
            GNDTYRLN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C    + + G  +
Sbjct: 85  DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-L 143

Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
            F++YSP  S+TS KVPC+S LC+LQ  C S  ++CPY ++YLSD T S+G LVEDVL+L
Sbjct: 144 KFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL 203

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            +D  QSK V + I FGCG+VQTGSFL  AAPNGL GLGMD  SVPS+LA++GL  NSFS
Sbjct: 204 TSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFS 263

Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
           MCFG DG GRI+FGD GS  Q ETP ++ + +P YNITIT ++VG  +++ EFSAI DSG
Sbjct: 264 MCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSG 323

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           TSFT L+DP YTQI+ +F++  +  R    S +PFE+CY +S N     +P V+LT KGG
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGG 381

Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
             F VNDPI+ ++        YCL ++KS+ VN+IG  +
Sbjct: 382 SIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 420


>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/408 (51%), Positives = 272/408 (66%), Gaps = 16/408 (3%)

Query: 28  GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           G    +FHHR+S  V+      G       P  G FAY +ALA  DR+     R L+A G
Sbjct: 21  GAPSLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRH-----RALSAAG 75

Query: 82  NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
             + PLTFS GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C   
Sbjct: 76  G-RPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCTPP 134

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
            +S++     F  Y P+ SSTS  VPCNS  C L+K+C S  S+CPY++ Y+S  T S+G
Sbjct: 135 PSSAASAPASF--YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSG 191

Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
           FLVEDVL+L+T++   + + ++I FGCG VQTGSFLD AAPNGLFGLG+D  SVPSILA 
Sbjct: 192 FLVEDVLYLSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251

Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
           +GL  NSFSMCFG DG GRISFGD+GS  Q ETP  + Q HPTY ITIT ++VG N ++ 
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDL 311

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
           E S IFD+GTSFTYL DPAYT I++ F+S  +  R  + S +PFEYCY LS ++   + P
Sbjct: 312 EVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTP 371

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            ++L   GG  F   DP  ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 372 SISLRTVGGSLFPAIDPGQVISIQ-QHEYVYCLAIVKSTKLNIIGQNF 418


>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/408 (51%), Positives = 272/408 (66%), Gaps = 16/408 (3%)

Query: 28  GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           G    +FHHR+S  V+      G       P  G FAY +ALA  DR+     R L+A G
Sbjct: 21  GAPSLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRH-----RALSAAG 75

Query: 82  NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
             + PLTFS GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C   
Sbjct: 76  G-RPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCTPP 134

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
            +S++     F  Y P+ SSTS  VPCNS  C L+K+C S  S+CPY++ Y+S  T S+G
Sbjct: 135 PSSAASAPASF--YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSG 191

Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
           FLVEDVL+L+T++   + + ++I FGCG VQTGSFLD AAPNGLFGLG+D  SVPSILA 
Sbjct: 192 FLVEDVLYLSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251

Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
           +GL  NSFSMCFG DG GRISFGD+GS  Q ETP  + Q HPTY ITIT ++VG N ++ 
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDL 311

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
           E S IFD+GTSFTYL DPAYT I++ F+S  +  R  + S +PFEYCY LS ++   + P
Sbjct: 312 EVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTP 371

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
            ++L   GG  F   DP  ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 372 SISLRTVGGSLFPAIDPGQVISIQ-QHEYVYCLAIVKSTKLNIIGQNF 418


>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
 gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
          Length = 523

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 210/400 (52%), Positives = 269/400 (67%), Gaps = 2/400 (0%)

Query: 30  FGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
              D HHRYS  V+        P  G+  YY+ALA  D   R    G AA G     + F
Sbjct: 29  LSLDVHHRYSATVREWAGHHRAPPAGTAEYYAALARHDLRRRSLAAGPAAGGGGGGEVAF 88

Query: 90  SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV 149
           + GNDTYRLN LGFLHY  V++G P ++F+VALDTGSDLFW+PCDC++C   L S + + 
Sbjct: 89  ADGNDTYRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAP-LVSPNYRD 147

Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
           + F+ YSP  SSTS KVPC+S LC+LQ  C SA S+CPY + YLSD T STG LVEDVL+
Sbjct: 148 LKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDVLY 207

Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
           L T+  Q K V + I+FGCGR+QTGSFL  AAPNGL GLGMD  SVPS+LA++G+  NSF
Sbjct: 208 LITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLASEGVAANSF 267

Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDS 329
           SMCFG DG GRI+FGD GS  Q ETP ++ + +P YNI+IT   VG  + N  F+AI DS
Sbjct: 268 SMCFGDDGRGRINFGDTGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNTNFNAIVDS 327

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GTSFT L+DP Y++I+ +FNS  ++K     S LPFE+CY +SP + +   P ++L  KG
Sbjct: 328 GTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISP-KGSVNPPNISLMAKG 386

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           G  F VNDPI+ ++ +      YCL V+KS+ VN+IG  +
Sbjct: 387 GSIFPVNDPIITITDDASNPMAYCLAVMKSEGVNLIGENF 426


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 208/402 (51%), Positives = 261/402 (64%), Gaps = 8/402 (1%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
            +F F  HHR+SD +K I   + LP+K +  YY+A+ HRDR   L GR LA    D TPL
Sbjct: 30  ASFKFTIHHRFSDSIKEIFGSEGLPEKHTPGYYAAMVHRDRL--LHGRNLATTNGD-TPL 86

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
            FS GN+TY L+ LG L+Y NVS+G P L F+VALDTGSDLFWLPC+C  C   L     
Sbjct: 87  MFSYGNETYELSGLGNLYYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDN 146

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
                N YS N SSTS +VPC+S+LCEL  QC S  S+CPYQ  YLS+ + S G+LV+D+
Sbjct: 147 GKFWLNHYSSNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDI 206

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           LH+ATD+ Q K VD +++ GCG+VQTG F +  APNGL GLGM K SVPS LA+QGL  +
Sbjct: 207 LHMATDDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTD 266

Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIF 327
           SFSMCFG  G GRI FGD G  GQ ETPF+      +YN+TI Q+ V     N   +AI 
Sbjct: 267 SFSMCFGYYGYGRIDFGDIGPVGQRETPFN--PASLSYNVTILQIIVTNRPTNVHLTAII 324

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSG SFTYL DP Y+ I+E  ++  + +R  S SD PFEYCY LS   T F+ P +N TM
Sbjct: 325 DSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSL-ATIFQQPNLNFTM 383

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           +GG  F V    V V ++  G  L CL +VKS ++N+IG  +
Sbjct: 384 EGGRKFDVITSYVSVDTD-DGPAL-CLAIVKSTDINVIGHNF 423


>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
 gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 500

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 202/407 (49%), Positives = 267/407 (65%), Gaps = 19/407 (4%)

Query: 32  FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
            +FHHR+S P++      G       P  GS AY +ALA  DR+     R ++A G   +
Sbjct: 32  LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
           D  PLTF+ GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    
Sbjct: 87  DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
            ++SG       Y P  SSTS  VPCNS  C+LQK+C S    CPY++ Y+S GT S+GF
Sbjct: 147 TAASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 202

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           LVEDVL+L+T+    + + ++I  GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 203 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262

Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
           GL  NSFSMCFG DG GRISFGD+ S  Q ETP  + + HPTY ITI+ ++VG    + +
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 322

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
           F  IFD+GTSFTYL DPAYT I+++F++  +  R  + S +PFEYCY LS ++  F  P 
Sbjct: 323 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPD 382

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           + L    G  F V DP  ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 383 IILRTVTGSMFPVIDPGQVISIQ-EHEYVYCLAIVKSMKLNIIGQNF 428


>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 520

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 202/407 (49%), Positives = 267/407 (65%), Gaps = 17/407 (4%)

Query: 32  FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
            +FHHR+S P++      G       P  GS AY +ALA  DR+     R ++A G   +
Sbjct: 32  LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
           D  PLTF+ GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    
Sbjct: 87  DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
            ++SG       Y P  SSTS  VPCNS  C+LQK+C S    CPY++ Y+S GT S+GF
Sbjct: 147 TAASGS-FQATFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 204

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           LVEDVL+L+T+    + + ++I  GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 205 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 264

Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
           GL  NSFSMCFG DG GRISFGD+ S  Q ETP  + + HPTY ITI+ ++VG    + +
Sbjct: 265 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 324

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
           F  IFD+GTSFTYL DPAYT I+++F++  +  R  + S +PFEYCY LS ++  F  P 
Sbjct: 325 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPD 384

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           + L    G  F V DP  ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 385 IILRTVTGSMFPVIDPGQVISIQ-EHEYVYCLAIVKSMKLNIIGQNF 430


>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
 gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
          Length = 499

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 203/405 (50%), Positives = 266/405 (65%), Gaps = 15/405 (3%)

Query: 32  FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
            +FHHR+S P++      G       P  GS AY +ALA  DR+   R    A  G   T
Sbjct: 31  LEFHHRFSAPLRRWAEARGRALPGGWPAPGSAAYVAALAGHDRH---RAVSAAGGGGSGT 87

Query: 86  P-LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS 144
           P LTF+ GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C     +
Sbjct: 88  PPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATA 147

Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
           +SG       Y P  SSTS  VPCNS  C+LQK+C S    CPY++ Y+S GT S+GFLV
Sbjct: 148 ASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGFLV 203

Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
           EDVL+L+T+    + + ++I  GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +GL
Sbjct: 204 EDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGL 263

Query: 265 IPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
             NSFSMCFG DG GRISFGD+GS  Q ETP ++ Q HPTY ITI+ +++G    + +F 
Sbjct: 264 TSNSFSMCFGRDGIGRISFGDQGSSDQEETPLNINQQHPTYAITISGITIGNKPTDLDFI 323

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            IFD+GTSFTYL DPAYT I+++F++  +  R  + S +PFEYCY LS ++  F  P + 
Sbjct: 324 TIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDII 383

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           L    G  F V DP  ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 384 LRTVSGSLFPVIDPGQVISIQ-EHEYVYCLAIVKSRKLNIIGQNF 427


>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 498

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 202/407 (49%), Positives = 266/407 (65%), Gaps = 21/407 (5%)

Query: 32  FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
            +FHHR+S P++      G       P  GS AY +ALA  DR+     R ++A G   +
Sbjct: 32  LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
           D  PLTF+ GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    
Sbjct: 87  DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
            ++SG       Y P  SSTS  VPCNS  C+LQK+C S    CPY++ Y+S GT S+GF
Sbjct: 147 TAASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 202

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           LVEDVL+L+T+    + + ++I  GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 203 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262

Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
           GL  NSFSMCFG DG GRISFGD+ S  Q ETP  + + HPTY ITI+ ++VG    + +
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 322

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
           F  IFD+GTSFTYL DPAYT I+++F++  +  R  + S +PFEYCY LS  +  F  P 
Sbjct: 323 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLS--EARFPIPD 380

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           + L    G  F V DP  ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 381 IILRTVTGSMFPVIDPGQVISIQ-EHEYVYCLAIVKSMKLNIIGQNF 426


>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 510

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 203/404 (50%), Positives = 270/404 (66%), Gaps = 18/404 (4%)

Query: 32  FDFHHRYSDPVKGILAVD------DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
            +FHHR+S  ++G             P  G  AY +ALA  DR+     R LAA   D  
Sbjct: 30  LEFHHRFSARLRGWADARGHELPGGWPPPGGAAYVAALAGHDRH-----RALAAA--DHP 82

Query: 86  PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
           PLTFS GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    + +
Sbjct: 83  PLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGA 142

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
           SG     + Y P+ SSTS  VPCNS  C+ +K C S  S+CPY++ Y+S  T S+GFLVE
Sbjct: 143 SGSA---SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYKMVYVSADTSSSGFLVE 198

Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
           DVL+L+T++   + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D  SVPSILA++GL 
Sbjct: 199 DVLYLSTEDNHPQILKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLT 258

Query: 266 PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
            +SFSMCFG DG GRISFGD+GS  Q ETP  + Q HPTY ITIT ++VG   ++ EFS 
Sbjct: 259 SDSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGITVGTEPMDLEFST 318

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           IFD+GT+FTYL DPAYT I+++F++  +  R  + + +PFEYCY LS ++   + P V+ 
Sbjct: 319 IFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSSSEARIQTPGVSF 378

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
              GG  F V D   ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 379 RTVGGSLFPVIDLGQVISIQ-QHEYVYCLAIVKSTKLNIIGQNF 421


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/338 (55%), Positives = 244/338 (72%), Gaps = 3/338 (0%)

Query: 89  FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQ 148
           F+ GNDTYRLN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C    + + G 
Sbjct: 19  FADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS 78

Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
            + F++YSP  S+TS KVPC+S LC+LQ  C S  ++CPY ++YLSD T S+G LVEDVL
Sbjct: 79  -LKFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVL 137

Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
           +L +D  QSK V + I FGCG+VQTGSFL  AAPNGL GLGMD  SVPS+LA++GL  NS
Sbjct: 138 YLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANS 197

Query: 269 FSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFD 328
           FSMCFG DG GRI+FGD GS  Q ETP ++ + +P YNITIT ++VG  +++ EFSAI D
Sbjct: 198 FSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVD 257

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGTSFT L+DP YTQI+ +F++  +  R    S +PFE+CY +S N     +P V+LT K
Sbjct: 258 SGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAK 315

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           GG  F VNDPI+ ++        YCL ++KS+ VN+IG
Sbjct: 316 GGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIG 353


>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 537

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 209/419 (49%), Positives = 267/419 (63%), Gaps = 23/419 (5%)

Query: 30  FGFDFHHRYSDPVKGILAVDDLP-------KKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
            GFD HHR S  V+        P        +G+  YY+AL   DR    R RGLA +G+
Sbjct: 29  IGFDLHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLAR-RGLA-EGD 86

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
            +  LTF++GN T+RL   G LHY  V+VG P  +F+VALDTGSDLFW+PCDC  C    
Sbjct: 87  GEGLLTFASGNLTFRLE--GSLHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIA 144

Query: 143 NSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---SNCPYQVRYLSDGTM 198
           N+S  +   D   YSP  SSTS  V C   LCE    C +AG   ++CPY VRY+S  T 
Sbjct: 145 NASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTS 204

Query: 199 STGFLVEDVLHLATDEK--QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           S+G LVEDVLHL+ +     S +V + +  GCG+VQTG+FLDGAA +GL GLGMDK SVP
Sbjct: 205 SSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVP 264

Query: 257 SILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVG 315
           S+L   GL+  +SFSMCF  DG GRI+FGD G  GQ ETPF++R THPTYNI++T +SV 
Sbjct: 265 SVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISVTAMSVS 324

Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           G  V  EF+AI DSGTSFTYLNDPAYT+++  FNS  +E+R   ++ +PFEYCY L   Q
Sbjct: 325 GKEVAAEFAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYELGRGQ 384

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL----YCLGVVKSD-NVNIIGREY 429
           T    P V+LT +GG  F V  PIV++  E     +    YCL V+K+D  ++IIG+ +
Sbjct: 385 TELFVPEVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCLAVLKNDITIDIIGQNF 443


>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
          Length = 530

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 200/406 (49%), Positives = 269/406 (66%), Gaps = 13/406 (3%)

Query: 32  FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
            +FHHR+S PV      +G +     P+ GS  Y +AL   DR   L          G+ 
Sbjct: 35  LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
             PLTFS GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    +
Sbjct: 95  PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154

Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
           ++SG     + Y P+ SSTS  VPCNS  CEL+K+C S  S CPY++ Y+S  T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           VEDVL+L+T++   + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D  S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
           L  NSF+MCF  DG GRISFGD+GS  Q ETP  +   HPTY I+I++++VG +  + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEF 330

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
           S IFD+GTSFTYL DPAYT I+++F++     R  + S +PFEYCY LS ++   + P +
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSI 390

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           +L   GG  F V D   ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQ-QHEYVYCLAIVKSAKLNIIGQNF 435


>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
 gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
          Length = 530

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 200/406 (49%), Positives = 269/406 (66%), Gaps = 13/406 (3%)

Query: 32  FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
            +FHHR+S PV      +G +     P+ GS  Y +AL   DR   L          G+ 
Sbjct: 35  LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
             PLTFS GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    +
Sbjct: 95  PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154

Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
           ++SG     + Y P+ SSTS  VPCNS  CEL+K+C S  S CPY++ Y+S  T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           VEDVL+L+T++   + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D  S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
           L  NSF+MCF  DG GRISFGD+GS  Q ETP  +   HPTY I+I++++VG +  + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEF 330

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
           S IFD+GTSFTYL DPAYT I+++F++     R  + S +PFEYCY LS ++   + P +
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSI 390

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           +L   GG  F V D   ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQ-QHEYVYCLAIVKSAKLNIIGQNF 435


>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
          Length = 530

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 200/406 (49%), Positives = 269/406 (66%), Gaps = 13/406 (3%)

Query: 32  FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
            +FHHR+S PV      +G +     P+ GS  Y +AL   DR   L          G+ 
Sbjct: 35  LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
             PLTFS GN T ++++LGFLHY  V+VG P  +F+VALDTGSDLFWLPC C  C    +
Sbjct: 95  PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154

Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
           ++SG     + Y P+ SSTS  VPCNS  CEL+K+C S  S CPY++ Y+S  T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           VEDVL+L+T++   + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D  S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
           L  NSF+MCF  DG GRISFGD+GS  Q ETP  +   HPTY I+I++++VG +  + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDLEF 330

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
           S IFD+GTSFTYL DPAYT I+++F++     R  + S +PFEYCY LS ++   + P +
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSI 390

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           +L   GG  F V D   ++S + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQ-QHEYVYCLAIVKSAKLNIIGQNF 435


>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
 gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
          Length = 455

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 258/376 (68%), Gaps = 16/376 (4%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK------GILAVDDLPKK 54
           M+  +  + + ++ IL+    G C G   F F+ HHR+SD VK      G  A    P K
Sbjct: 1   MSCCFFKTTLFLIPILMLLSFGSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFA--KFPPK 57

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP-LTFSAGNDTYRLNSLGFLHYTNVSVGQ 113
           GSF Y++AL  RD  + +RGR L+   ++    LTFS GN T R++SLGFLHYT V +G 
Sbjct: 58  GSFEYFNALVLRD--WLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGT 115

Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
           P + F+VALDTGSDLFW+PCDC  C     ++     + +IY+P  S+T+ KV CN++LC
Sbjct: 116 PGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC 175

Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
             + QC    S CPY V Y+S  T ++G L+EDV+HL T++K  + V++ ++FGCG+VQ+
Sbjct: 176 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQS 235

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE 293
           GSFLD AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS  Q E
Sbjct: 236 GSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEE 295

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
           TPF+L  +HP YNIT+T+V VG   ++ EF+A+FD+GTSFTYL DP YT +SE+    A+
Sbjct: 296 TPFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQ 351

Query: 354 EKRETSTSDLPFEYCY 369
           +KR +  S +PFEYCY
Sbjct: 352 DKRHSPDSRIPFEYCY 367


>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
 gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 207/413 (50%), Positives = 272/413 (65%), Gaps = 11/413 (2%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           C   G F F+ HH +SD VK  L +DDL P+KGS  Y+  LA RDR   +RGRGLA+  N
Sbjct: 23  CEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 79

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
           ++TP+TF  GN T  ++ LGFLHY NVSVG PA  F+VALDTGSDLFWLPC+C S C+  
Sbjct: 80  EETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRD 139

Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
           L      Q    N+YSPNTSSTSS + C+   C    +C S  S+CPYQ++YLS  T +T
Sbjct: 140 LKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTT 199

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L EDVLHL T+++  + V + I+ GCG+ QTG     AA NGL GLG+   SVPSILA
Sbjct: 200 GTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILA 259

Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
              +  NSFSMCFG+  D  GRISFGDKG   Q ETP    +  PTY +++T+VSVGG+A
Sbjct: 260 KAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDA 319

Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
           V  +  A+FD+GTSFT+L +P Y  I++ F+    +KR     +LPFE+CY LSPN+T  
Sbjct: 320 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTI 379

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
            +P V +T +GG   F+ +P+ IV +E     +YCLG++KS +  +NIIG+ +
Sbjct: 380 LFPRVAMTFEGGSQMFLRNPLFIVWNEDNS-AMYCLGILKSVDFKINIIGQNF 431


>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 417

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 174/326 (53%), Positives = 232/326 (71%), Gaps = 2/326 (0%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           LHYT V +G P   F+VALDTGSDLFW+PCDC  C     S      + ++YSP  SSTS
Sbjct: 3   LHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSSTS 62

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             VPCN++LC  + QC  A  NCPY V Y+S  T +TG L+ED+LHL T+ K S+ + + 
Sbjct: 63  KTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHSEPIQAY 122

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           I+FGCG+VQ+GSFLD AAPNGLFGLGM++ SVPSIL+ +GL+ NSFSMCF  DG GRI+F
Sbjct: 123 ITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINF 182

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
           GDKGS  Q ETPF+L Q HP YNIT+T + VG   ++ + +A+FDSGTSF+Y  DP Y++
Sbjct: 183 GDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADITALFDSGTSFSYFTDPIYSK 242

Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
           +S +F++  ++ R      +PFEYCY +SP+      P ++LTMKGGGPF V DPI+++S
Sbjct: 243 LSASFHAQTRDGRHPPNPRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVYDPIIVIS 302

Query: 404 SEPKGLYLYCLGVVKSDNVNIIGREY 429
           ++ +   +YCL VVKS  +NIIG+ +
Sbjct: 303 TQNE--LIYCLAVVKSAELNIIGQNF 326


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score =  367 bits (942), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 182/333 (54%), Positives = 238/333 (71%), Gaps = 3/333 (0%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
           RLN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C    + + G  + F++YS
Sbjct: 68  RLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-LKFDVYS 126

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  S+TS KVPC+S LC+LQ  C S  ++CPY ++YLSD T S+G LVEDVL+L +D  Q
Sbjct: 127 PAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQ 186

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           SK V + I FGCG+VQTGSFL  AAPNGL GLGMD  SVPS+LA++GL  NSFSMCFG D
Sbjct: 187 SKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDD 246

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYL 336
           G GRI+FGD GS  Q ETP ++ + +P YNITIT ++VG  +++ EFSAI DSGTSFT L
Sbjct: 247 GHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSGTSFTAL 306

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
           +DP YTQI+ +F++  +  R    S +PFE+CY +S N     +P V+LT KGG  F VN
Sbjct: 307 SDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGGSIFPVN 364

Query: 397 DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           DPI+ ++        YCL ++KS+ VN+IG  +
Sbjct: 365 DPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 397


>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
 gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 543

 Score =  366 bits (940), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 200/425 (47%), Positives = 262/425 (61%), Gaps = 32/425 (7%)

Query: 29  TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           +FGFD HHR+S  V+       G LA D  P +G+  YYSAL+  DR  R       A G
Sbjct: 33  SFGFDLHHRFSPVVRRWAEARGGPLAADQWPARGTPEYYSALSRHDRARRA-----LAGG 87

Query: 82  NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC--V 139
            D   LTF+AGNDTY+    G L+Y  V +G P  +F+VALDTGSDLFW+PCDC  C  +
Sbjct: 88  ADDGLLTFAAGNDTYQS---GTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATI 144

Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTM 198
              N +         YSP  SSTS +V C++ LC  +  C +A   +CPY+V+Y+S  T 
Sbjct: 145 PSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQRNGCSAATNGSCPYEVQYVSANTS 204

Query: 199 STGFLVEDVLHLATDE----KQSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDK 252
           S+G LV+DVLHL  +        +++ + + FGCG+VQTG+FLDG   A +GL GLGM K
Sbjct: 205 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGK 264

Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
            SVPS LA  GL+  +SFSMCFG DG GR++FGD GS GQ ETPF++R  +PTYN++ T 
Sbjct: 265 VSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTS 324

Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR----ETSTSDLPFEY 367
           + VG  +V  EF+A+ DSGTSFTYL+DP YTQ++  FNS   E+R      S    PFEY
Sbjct: 325 IGVGSESVAAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEY 384

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNI 424
           CY LSPNQT    P V+LT KGG  F V  P + V         YCL ++++D    ++I
Sbjct: 385 CYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAVGYCLAIMRNDMAIGIDI 444

Query: 425 IGREY 429
           IG+ +
Sbjct: 445 IGQNF 449


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 182/335 (54%), Positives = 238/335 (71%), Gaps = 3/335 (0%)

Query: 95  TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
           T  LN  GFLHY  V++G P ++F+VALDTGSDLFW+PCDC+ C    + + G  + F++
Sbjct: 52  TADLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-LKFDV 110

Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
           YSP  S+TS KVPC+S LC+LQ  C S  ++CPY ++YLSD T S+G LVEDVL+L +D 
Sbjct: 111 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 170

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
            QSK V + I FGCG+VQTGSFL  AAPNGL GLGMD  SVPS+LA++GL  NSFSMCFG
Sbjct: 171 AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG 230

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
            DG GRI+FGD GS  Q ETP ++ + +P YNITIT ++VG  +++ EFSAI DSGTSFT
Sbjct: 231 DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSGTSFT 290

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
            L+DP YTQI+ +F++  +  R    S +PFE+CY +S N     +P V+LT KGG  F 
Sbjct: 291 ALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGGSIFP 348

Query: 395 VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           VNDPI+ ++        YCL ++KS+ VN+IG  +
Sbjct: 349 VNDPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 383


>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
          Length = 585

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 185/339 (54%), Positives = 233/339 (68%), Gaps = 12/339 (3%)

Query: 13  LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAH 65
           ++ILLS           F F  HHR+S+PVK             + P KGSF YY+ LAH
Sbjct: 9   IVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYYAELAH 68

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           RDR   LRGR L+   +    LTFS GN T+R++SLGFLHYT VS+G P   F+VALDTG
Sbjct: 69  RDR--ALRGRRLS---DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTG 123

Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
           SDLFW+PCDC  C     ++     + +IY+P  SSTS KV CN++LC  + +C    SN
Sbjct: 124 SDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTFSN 183

Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           CPY V Y+S  T ++G LVEDVLHL T++ + + V++ ++FGCG+VQTGSFLD AAPNGL
Sbjct: 184 CPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGL 243

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTY 305
           FGLG++K SVPSIL+ +G   +SFSMCFG DG GRISFGDKG P Q ETPF+L   HPTY
Sbjct: 244 FGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPFNLNALHPTY 303

Query: 306 NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQI 344
           NIT+TQV VG   ++ +F+A+FDSGTSFTYL DP YT +
Sbjct: 304 NITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNV 342



 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/97 (56%), Positives = 70/97 (72%), Gaps = 3/97 (3%)

Query: 7   NSPVCVLLILLS-CCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
           NS   ++++L+S   +  C+G GTFGFD HHR+SDPVKGIL VDDLP+K S  YY A+AH
Sbjct: 491 NSXWVLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAH 550

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLG 102
           RD  + + GR L+     K PLTFS GN+TYRL+SLG
Sbjct: 551 RD--WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLG 585


>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 545

 Score =  363 bits (933), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 199/424 (46%), Positives = 262/424 (61%), Gaps = 32/424 (7%)

Query: 30  FGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           FGFD HHR+S  V+       G LA D  P +G+  YYSAL+  DR      R   A G 
Sbjct: 36  FGFDLHHRFSPVVRRWAEARGGPLAADRWPARGTPEYYSALSRHDR-----ARRALAGGA 90

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC--VH 140
           D   LTF+AGNDTY+    G L+Y  V +G P  +F+VALDTGSDLFW+PCDC  C  + 
Sbjct: 91  DDGLLTFAAGNDTYQS---GTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIP 147

Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTMS 199
             N++         YSP  SSTS +V C++ LC  +  C +A   +CPY+V+Y+S  T S
Sbjct: 148 SANATGPDAPPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSS 207

Query: 200 TGFLVEDVLHLATDE----KQSKSVDSRISFGCGRVQTGSFLD--GAAPNGLFGLGMDKT 253
           +G LV+DVLHL  +        +++ + + FGCG+VQTG+FLD  G A +GL GLGM K 
Sbjct: 208 SGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKV 267

Query: 254 SVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
           SVPS LA  GL+  +SFSMCFG DG GR++FGD GS GQ ETPF++R  +PTYN++ T +
Sbjct: 268 SVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTSI 327

Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR----ETSTSDLPFEYC 368
            +G  +V  EF+A+ DSGTSFTYL+DP YTQ++  FNS   E+R      S    PFEYC
Sbjct: 328 GIGSESVAAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYC 387

Query: 369 YVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNII 425
           Y LSPNQT    P V+LT KGG  F V  P + V         YCL ++++D    ++II
Sbjct: 388 YRLSPNQTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMAIGIDII 447

Query: 426 GREY 429
           G+ +
Sbjct: 448 GQNF 451


>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
 gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
          Length = 541

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 203/422 (48%), Positives = 267/422 (63%), Gaps = 29/422 (6%)

Query: 29  TFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           + GFD HHR+S  V+          A  D P +GS  YYSAL+  DR    R R LA  G
Sbjct: 33  SVGFDLHHRFSPVVRQWAEARGHPFAAQDWPARGSPEYYSALSRHDRAVLSR-RALA-DG 90

Query: 82  NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
            D   +TF+AGNDT  L  +G L+Y  V VG P  +F+VALDTGSDLFW+PCDC  C   
Sbjct: 91  ADGL-VTFAAGNDT--LQYIGSLYYAVVEVGTPNATFLVALDTGSDLFWVPCDCKQCASI 147

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTMST 200
            N +         YSP  SSTS +V C++ LC+    C +A   +CPY+V+YLS  T ++
Sbjct: 148 ANVTGQPATALRPYSPRESSTSKQVTCDNALCDRPNGCSAATNGSCPYEVQYLSANTSTS 207

Query: 201 GFLVEDVLHLATDE-----KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
           G LV+DVLHL  +      +  +++ + + FGCG+VQTG+FLDGAA +GL GLG +  SV
Sbjct: 208 GVLVQDVLHLTRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDGAAFDGLMGLGRENVSV 267

Query: 256 PSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
           PS+LA+ GL+  +SFSMCFG DG GRI+FGD GS GQGETPF+ R+T   YN++ T V+V
Sbjct: 268 PSVLASSGLVASDSFSMCFGDDGVGRINFGDSGSSGQGETPFTGRRT--LYNVSFTAVNV 325

Query: 315 GGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET----STSDLPFEYCYV 370
              +V  EF+A+ DSGTSFTYL DP YT+++  FNSL +E+R      S    PFEYCY 
Sbjct: 326 ETKSVAAEFAAVIDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCYA 385

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNIIGR 427
           L PNQT    P V+LT KGG  F V  P++ V+S  + +  YCL ++K+D   N NIIG+
Sbjct: 386 LGPNQTEALIPDVSLTTKGGARFPVTQPVIGVASG-RTVVGYCLAIMKNDLGVNFNIIGQ 444

Query: 428 EY 429
            +
Sbjct: 445 NF 446


>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 531

 Score =  357 bits (915), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 193/416 (46%), Positives = 264/416 (63%), Gaps = 14/416 (3%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           C   G FGF+ HH +SD VK  L +DDL P++GS  Y+  LAHRDR   +RGRGLA+  N
Sbjct: 23  CEASGKFGFEVHHIFSDAVKQSLGLDDLVPEQGSLEYFKVLAHRDRL--IRGRGLASN-N 79

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC-VSCVHG 141
           + TP+TF  GN T  +  LG L+Y NVSVG P  SF+VALDTGSDLFWLPC+C  +C+  
Sbjct: 80  EDTPVTFDGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRD 139

Query: 142 LNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
           L      Q +  N+Y+PN S+TSS + C+   C   K+C S  S CPYQ+ Y S+ T +T
Sbjct: 140 LEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISY-SNSTGTT 198

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L++DVLHLAT+++    V + ++ GCG+ QTG F    + NG+ GLG+   SVPS+LA
Sbjct: 199 GTLLQDVLHLATEDENLTPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLA 258

Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
              +  +SFSMCFG      GRISFGDKG   Q ETPF        Y + +T VSVGG+ 
Sbjct: 259 KANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGDP 318

Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
           V     A FD+G+SFT+L +PAY  ++++F+ L ++KR     +LPFE+CY LSPN T+ 
Sbjct: 319 VGTRLFAKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATSI 378

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPK---GLYLYCLGVVKSD--NVNIIGREY 429
           E+P V +T  GG    +N+P     ++ +   G  +YCLGV+KS    +N+IG+ +
Sbjct: 379 EFPFVEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKINVIGQNF 434


>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
          Length = 519

 Score =  356 bits (913), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 206/413 (49%), Positives = 269/413 (65%), Gaps = 21/413 (5%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           C   G F F+ HH +SD VK  L +DDL P+KGS  Y+  LA RDR   +RGRGLA+  N
Sbjct: 23  CEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 79

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
           ++TP+TF  GN T  ++ LGFLHY NVSVG PA  F+VALDTGSDLFWLPC+C S C+  
Sbjct: 80  EETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRD 139

Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
           L      Q    N+YSPNTSSTSS + C+   C    +C S  S+CPYQ++YLS  T +T
Sbjct: 140 LKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTT 199

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L EDVLHL T+++  + V + I+ GCG+ QTG     AA NGL GLG+   SVPSILA
Sbjct: 200 GTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILA 259

Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
              +  NSFSMCFG+  D  GRISFGDKG   Q ETP  L  T P    ++T+VSVGG+A
Sbjct: 260 KAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETP--LLPTEP----SVTEVSVGGDA 313

Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
           V  +  A+FD+GTSFT+L +P Y  I++ F+    +KR     +LPFE+CY LSPN+T  
Sbjct: 314 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTI 373

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
            +P V +T +GG   F+ +P+ I +S      +YCLG++KS +  +NIIG+ +
Sbjct: 374 LFPRVAMTFEGGSQMFLRNPLFIDNSA-----MYCLGILKSVDFKINIIGQNF 421


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 191/419 (45%), Positives = 266/419 (63%), Gaps = 18/419 (4%)

Query: 24  CFGF------GTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRG 76
           C+GF      G FGF+ HH +SD VK  L + DL P++GS  Y+  LAHRDR   +RGRG
Sbjct: 17  CWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRL--IRGRG 74

Query: 77  LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC- 135
           LA+  ND+TP+TF  GN T  +  LG L+Y NVSVG P  SF+VALDTGSDLFWLPC+C 
Sbjct: 75  LASN-NDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG 133

Query: 136 VSCVHGLNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
            +C+  L      Q +  N+Y+PN S+TSS + C+   C   K+C S  S CPYQ+ Y S
Sbjct: 134 TTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-S 192

Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
           + T + G L++DVLHLAT+++    V + ++ GCG+ QTG F    + NG+ GLG+   S
Sbjct: 193 NSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252

Query: 255 VPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
           VPS+LA   +  NSFSMCFG      GRISFGD+G   Q ETPF        Y + I+ V
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGV 312

Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
           SV G+ V+    A FD+G+SFT+L +PAY  ++++F+ L +++R     +LPFE+CY LS
Sbjct: 313 SVAGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLS 372

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIGREY 429
           PN T  ++P+V +T  GG    +N+P     ++ +G  +YCLGV+KS    +N+IG+ +
Sbjct: 373 PNATTIQFPLVEMTFIGGSKIILNNPFFTARTQ-EGNVMYCLGVLKSVGLKINVIGQNF 430


>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 530

 Score =  355 bits (910), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 209/413 (50%), Positives = 268/413 (64%), Gaps = 11/413 (2%)

Query: 24  CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           C   G F F+ HH +SD VK  L +DDL P+KGS  Y+  LA RDR   +RGRGLA+  N
Sbjct: 24  CEASGKFSFEVHHMFSDRVKQTLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 80

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
           ++TP+TF  GN T  ++ LGFLHY NVSVG PA  F+VALDTGS+LFWLPC+C S C+  
Sbjct: 81  EETPITFMRGNRTVSIDFLGFLHYANVSVGTPATWFLVALDTGSNLFWLPCNCGSTCIRD 140

Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
           L      Q    N+YSPNTSSTSS + CN   C    QC S  S+CPYQ++YLS  T +T
Sbjct: 141 LKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKDTFTT 200

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L EDVLHL T++   K V + I+ GCGR QTG     AA NGL GLGM   SVPSILA
Sbjct: 201 GTLFEDVLHLVTEDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILA 260

Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
              +  NSFSMCFG+  D  GRISFGDKG   Q ETP    +  PTY + +T+VSVGG+ 
Sbjct: 261 KAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPTEPSPTYAVNVTEVSVGGDV 320

Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
           V  +  A+FD+GTSFT+L +P Y  I++ F+    +KR     ++PFE+CY LSPN T  
Sbjct: 321 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYDLSPNSTTI 380

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
            +P V +T +GG   F+ +P+ IV +E     +YCLG++KS +  +NIIG+ +
Sbjct: 381 LFPRVAMTFEGGSLMFLRNPLFIVWNE-DNTAMYCLGILKSVDFKINIIGQNF 432


>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score =  354 bits (908), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 185/416 (44%), Positives = 266/416 (63%), Gaps = 16/416 (3%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G+  F+ HHR+S+ VK +L    LP+ GS  YY AL HRDR     GR L +  N++T +
Sbjct: 20  GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRQLTSNNNNQTTI 74

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHGLNSSS 146
           +F+ GN T     + FLHY NV++G PA  F+VALDTGSDLFWLPC+C S CV  + +  
Sbjct: 75  SFAQGNST---EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQ 131

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
           G+ I  NIY+P+ S +SSKV CNSTLC L+ +C S  S+CPY++RYLS G+ STG LVED
Sbjct: 132 GERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVED 191

Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
           V+H++T+E +++  D+RI+FGC   Q G F +  A NG+ GL +   +VP++L   G+  
Sbjct: 192 VIHMSTEEGEAR--DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVAS 248

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
           +SFSMCFG +G G ISFGDKGS  Q ETP S   +   Y+++IT+  VG   V+ EF+A 
Sbjct: 249 DSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTAT 308

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           FDSGT+ T+L +P YT ++  F+    ++R + + D PFE+CY+++      + P V+  
Sbjct: 309 FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFE 368

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN--IIGREYPIANNISLFHN 440
           MKGG  + V  PI++  +      +YCL V+K  N +  IIG+ +    N  + H+
Sbjct: 369 MKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNF--MTNYRIVHD 422


>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  348 bits (893), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 186/432 (43%), Positives = 266/432 (61%), Gaps = 30/432 (6%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G+  F+ HHR+S+ VK +L    LP+ GS  YY AL HRDR     GR L +  N++T +
Sbjct: 30  GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRRLTSN-NNQTTI 83

Query: 88  TFSAGNDTYRLNS----------LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
           +F+ GN T  ++             +LHY NV++G PA  F+VALDTGSDLFWLPC+C S
Sbjct: 84  SFAQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNS 143

Query: 138 -CVHGLNSSSG------QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQV 190
            CV  + +  G      Q I  NIY+P+ S++SSKV CNSTLC L+ +C S  S+CPY++
Sbjct: 144 TCVRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRI 203

Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
           RYLS G+ STG LVEDV+H++T+E +++  D+RI+FGC   Q G F +  A NG+ GL M
Sbjct: 204 RYLSPGSKSTGVLVEDVIHMSTEEGEAR--DARITFGCSETQLGLFQE-VAVNGIMGLAM 260

Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
              +VP++L   G+  +SFSMCFG +G G ISFGDKGS  Q ETP     +   Y+++IT
Sbjct: 261 ADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQHETPLGGTISPLFYDVSIT 320

Query: 311 QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
           +  VG   V  +FSAIFDSGT+ T+L DP YT ++  F+    ++R  +  D  FE+CY+
Sbjct: 321 KFKVGKVTVETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYI 380

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIGRE 428
           ++      + P ++  MKGG  + V  PI++  +      +YCL V+K D  + NIIG+ 
Sbjct: 381 ITSTSDEEKLPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQDKADFNIIGQN 440

Query: 429 YPIANNISLFHN 440
           +    N  + H+
Sbjct: 441 F--MTNYRIVHD 450


>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Brachypodium distachyon]
          Length = 509

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 200/425 (47%), Positives = 260/425 (61%), Gaps = 34/425 (8%)

Query: 28  GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           G  GFD HHR+S  VK      G  A      +GS  YYSAL+  DR      R + A G
Sbjct: 7   GGVGFDLHHRFSPVVKRWAESRGRPAAAAWWPEGSPEYYSALSAHDR-----ARRVLAGG 61

Query: 82  NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
             ++ L+F+ GN T R    G LHY  V++G P  +F+VALDTGSDLFW+PCDC  C   
Sbjct: 62  KGESLLSFADGNSTTR--HAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPI 119

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
            N+S         YSP  SSTS  V C+ +LC+    C +   +CPY V+Y+S  T S+G
Sbjct: 120 ANTSE----LLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSG 175

Query: 202 FLVEDVLHLATDEKQS---------KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
            LVEDVL++      S         ++V +R+ FGCG+ QTG+FLDGAA  GL GLGMD+
Sbjct: 176 VLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDR 235

Query: 253 TSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPG-QGETPFSLRQTHPTYNITIT 310
            SVPS+LA  GL+  +SFSMCF  DG GRI+FG+    G Q ETPF + +T PTYNI++T
Sbjct: 236 VSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVT 295

Query: 311 QVSVGGN-AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
            V+V G  A+  EF+A+ DSGTSFTYLNDPAY+ ++ +FNS  +EKR   ++ +PFEYCY
Sbjct: 296 AVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCY 355

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL----YCLGVVKSD-NVNI 424
            LS  QT    P V+LT +GG  F V  P VIV+ E     +    YCL V KSD  ++I
Sbjct: 356 ALSRGQTEVLMPEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPIDI 415

Query: 425 IGREY 429
           IG+ +
Sbjct: 416 IGQNF 420


>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 530

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 192/427 (44%), Positives = 259/427 (60%), Gaps = 16/427 (3%)

Query: 13  LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFR 71
           +L+L+      C   G F F+ HH +SD VK  L  DDL P+ GS  Y+  LAHRDR+  
Sbjct: 13  MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRF-- 70

Query: 72  LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
           +RGRGLA+  N++TPLT    N T  LN LGFLHY NVS+G PA  F+VALDTGSDLFWL
Sbjct: 71  IRGRGLASN-NEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWL 129

Query: 132 PCDC-VSCVHGLNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189
           PC+C  +C+H L  +   + +  N+Y+PN S+TSS + C+   C    +C S  S CPYQ
Sbjct: 130 PCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQ 189

Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
           +  LS  T++TG L++DVLHL T+++  K V++ ++ GCG+ QTG+F    A NG+ GL 
Sbjct: 190 IA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLS 248

Query: 250 MDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307
           M + SVPS+LA   +  NSFSMCFG      GRISFGDKG   Q ETP    +T   Y +
Sbjct: 249 MKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGV 308

Query: 308 TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
            +T VSVGG  V+    A+FD+G+SFT L + AY   ++ F+ L ++KR     D PFE+
Sbjct: 309 NVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEF 368

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGP-------FFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
           CY L     N +    ++  K   P          ND    VS   +G  +YCLG++KS 
Sbjct: 369 CYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSI 428

Query: 421 NVNIIGR 427
           N+NIIG+
Sbjct: 429 NLNIIGQ 435


>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
          Length = 518

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 192/427 (44%), Positives = 259/427 (60%), Gaps = 16/427 (3%)

Query: 13  LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFR 71
           +L+L+      C   G F F+ HH +SD VK  L  DDL P+ GS  Y+  LAHRDR+  
Sbjct: 1   MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRF-- 58

Query: 72  LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
           +RGRGLA+  N++TPLT    N T  LN LGFLHY NVS+G PA  F+VALDTGSDLFWL
Sbjct: 59  IRGRGLASN-NEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWL 117

Query: 132 PCDC-VSCVHGLNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189
           PC+C  +C+H L  +   + +  N+Y+PN S+TSS + C+   C    +C S  S CPYQ
Sbjct: 118 PCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQ 177

Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
           +  LS  T++TG L++DVLHL T+++  K V++ ++ GCG+ QTG+F    A NG+ GL 
Sbjct: 178 IA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLS 236

Query: 250 MDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307
           M + SVPS+LA   +  NSFSMCFG      GRISFGDKG   Q ETP    +T   Y +
Sbjct: 237 MKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGV 296

Query: 308 TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
            +T VSVGG  V+    A+FD+G+SFT L + AY   ++ F+ L ++KR     D PFE+
Sbjct: 297 NVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEF 356

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGP-------FFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
           CY L     N +    ++  K   P          ND    VS   +G  +YCLG++KS 
Sbjct: 357 CYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSI 416

Query: 421 NVNIIGR 427
           N+NIIG+
Sbjct: 417 NLNIIGQ 423


>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
 gi|219887047|gb|ACL53898.1| unknown [Zea mays]
 gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 416

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 171/326 (52%), Positives = 222/326 (68%), Gaps = 5/326 (1%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           LHY  V+VG P  +F+VALDTGSDLFWLPC C  C     ++SG       Y P  SSTS
Sbjct: 6   LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSA---TFYIPGMSSTS 62

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             VPCNS  C+LQK+C S    CPY++ Y+S GT S+GFLVEDVL+L+T+    + + ++
Sbjct: 63  KAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQ 121

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           I  GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +GL  NSFSMCFG DG GRISF
Sbjct: 122 IMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISF 181

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
           GD+ S  Q ETP  + + HPTY ITI+ ++VG    + +F  IFD+GTSFTYL DPAYT 
Sbjct: 182 GDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFITIFDTGTSFTYLADPAYTY 241

Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
           I+++F++  +  R  + S +PFEYCY LS ++  F  P + L    G  F V DP  ++S
Sbjct: 242 ITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTGSMFPVIDPGQVIS 301

Query: 404 SEPKGLYLYCLGVVKSDNVNIIGREY 429
            + +  Y+YCL +VKS  +NIIG+ +
Sbjct: 302 IQ-EHEYVYCLAIVKSMKLNIIGQNF 326


>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
 gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
          Length = 575

 Score =  338 bits (867), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 202/425 (47%), Positives = 261/425 (61%), Gaps = 45/425 (10%)

Query: 28  GTFGFDFHHRYSDPVK----------------GILAVDDLPKKGSFAYYSALAHRDRYFR 71
           G  GF+ HHR+S  V+                  L  ++ P  GS  YYSAL   DR   
Sbjct: 28  GGIGFNLHHRFSPVVRQWMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSALLRHDRALF 87

Query: 72  LRGRGLAAQGNDK-TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
            R RGLA+  + + T LTF+ GN T RL++  +LHY  V VG P+  F+VALDTGSDLFW
Sbjct: 88  TRRRGLASAADGQSTTLTFADGNAT-RLDTYEYLHYAEVEVGTPSSKFLVALDTGSDLFW 146

Query: 131 LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---SNCP 187
           LPC+C  C    N S+       +YSP+ SSTS  VPC   LCE    C +AG   S+CP
Sbjct: 147 LPCECKLCAK--NGST-------MYSPSLSSTSKTVPCGHPLCERPDACATAGKSSSSCP 197

Query: 188 YQVRYLSDGTMSTGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           Y+V+Y+S  T S+G LVEDVLHL         K+V + I FGCG+VQTG+FL GAA  GL
Sbjct: 198 YEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAFLRGAAAGGL 257

Query: 246 FGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPF----SLRQ 300
            GLG+DK SVPS LA+ GL+  +SFSMCF  DG GRI+FGD GSP Q ETP     SL+ 
Sbjct: 258 MGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETPLIAAGSLQP 317

Query: 301 THPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           ++  YNI++  ++V   A+  EF+A+ DSGTSFTYL+DPAYT ++  FNS   E  ET  
Sbjct: 318 SY--YNISVGAITVDSKAMAVEFTAVVDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYG 375

Query: 361 SDL-PFEYCYVLSPNQTNFE-YPVVNLTMKGGGPFFVNDPIV-IVSSEPKGLYL---YCL 414
           S    FE+CY LSP QT+ +  P ++LT KGG  F +  PI+ +++S   G Y    YCL
Sbjct: 376 SGYEKFEFCYRLSPGQTSMKRLPAMSLTTKGGAVFPITWPIIPVLASTNGGPYHPIGYCL 435

Query: 415 GVVKS 419
           G++K+
Sbjct: 436 GIIKT 440


>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
           Japonica Group]
 gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
          Length = 551

 Score =  338 bits (866), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 208/428 (48%), Positives = 263/428 (61%), Gaps = 34/428 (7%)

Query: 30  FGFDFHHRYSDPVK---------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
            GFD HHRYS  V+         G+         GS  YYSAL+  D     R RGLA Q
Sbjct: 27  LGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFAR-RGLA-Q 84

Query: 81  GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
           G+    +TF+ GN T RL+  G LHY  V+VG P  +F+VALDTGSDLFW+PCDC  C  
Sbjct: 85  GDGL--VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAP 140

Query: 141 GLNSSS---GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
             N ++   G   +   YSP+ SSTS  V C S LC+    C +A S+CPY VRY    T
Sbjct: 141 LGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANT 200

Query: 198 MSTGFLVEDVLHLATDEKQSKS-----VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
            S+G LVEDVL+L  ++  + +     V + + FGCG+VQTGSFLDGAA +GL GLGM+K
Sbjct: 201 SSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEK 260

Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
            SVPSILA+ G++  NSFSMCF  DG GRI+FGD GS  Q ETPF ++ TH  YNI+IT 
Sbjct: 261 VSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITS 320

Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----ETSTSDLPFE 366
           +SVG   +   F AI DSGTSFTYLNDPAYT  +  FN+   E+R      T +   PFE
Sbjct: 321 MSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFE 380

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG----LYLYCLGVVKSD-N 421
           YCY LSP+QT  E PVV+LT  GG  F V  P+  ++++       +  YCL V+KSD  
Sbjct: 381 YCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLP 440

Query: 422 VNIIGREY 429
           ++IIG+ +
Sbjct: 441 IDIIGQNF 448


>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
          Length = 551

 Score =  337 bits (864), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 207/428 (48%), Positives = 263/428 (61%), Gaps = 34/428 (7%)

Query: 30  FGFDFHHRYSDPVK---------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
            GFD HHRYS  V+         G+         GS  YYSAL+  D     R RGLA Q
Sbjct: 27  LGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFAR-RGLA-Q 84

Query: 81  GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
           G+    +TF+ GN T RL+  G LHY  V+VG P  +F+VALDTGSDLFW+PCDC  C  
Sbjct: 85  GDGL--VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAP 140

Query: 141 GLNSSS---GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
             N ++   G   +   YSP+ SSTS  V C S LC+    C +A S+CPY VRY    T
Sbjct: 141 LGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANT 200

Query: 198 MSTGFLVEDVLHLATDEKQSKS-----VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
            S+G LVEDVL+L  ++  + +     V + + FGCG+VQTGSFLDGAA +GL GLGM+K
Sbjct: 201 SSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEK 260

Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
            SVPSILA+ G++  NSFSMCF  DG GRI+FGD GS  Q ETPF ++ TH  YNI+IT 
Sbjct: 261 VSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITS 320

Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----ETSTSDLPFE 366
           +SVG   +   F AI DSGTSFTYLNDPAYT  +  FN+   E+R      T +   PFE
Sbjct: 321 MSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFE 380

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG----LYLYCLGVVKSD-N 421
           YCY LSP+QT  E P+V+LT  GG  F V  P+  ++++       +  YCL V+KSD  
Sbjct: 381 YCYSLSPDQTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLP 440

Query: 422 VNIIGREY 429
           ++IIG+ +
Sbjct: 441 IDIIGQNF 448


>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
          Length = 335

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 178/317 (56%), Positives = 223/317 (70%), Gaps = 12/317 (3%)

Query: 33  DFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAG 92
           D HHRYS  V+   A    P  G+  YY+ALA  D    LR R LA  G     + F+ G
Sbjct: 25  DVHHRYSATVRE-WAGHRAPPAGTAEYYAALAGHD----LRRRSLAGGGE----VAFADG 75

Query: 93  NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF 152
           NDTYRLN LGFLHY  V++G P ++F+VALDTGSDLFW+PCDC++C   L S + + + F
Sbjct: 76  NDTYRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAP-LVSPNYRDLKF 134

Query: 153 NIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
           + YSP  SSTS KVPC+S LC+ Q  C SA S+CPY ++YLSD T STG LVEDVL+L T
Sbjct: 135 DTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVT 194

Query: 213 DE-KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFS 270
           +  +Q K V + I+FGCGR QTGSFL  AAPNGL GLGMD  SVPS+LA+QG+   NSFS
Sbjct: 195 EYGRQPKIVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVAAANSFS 254

Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
           MCF  DG GRI+FGD GS  Q ETP ++ + +P YNI+IT  +VG  +++ +F+AI DSG
Sbjct: 255 MCFAQDGHGRINFGDTGSSDQQETPLNMYKQNPYYNISITGATVGSKSIHTKFNAIVDSG 314

Query: 331 TSFTYLNDPAYTQISET 347
           TSFT L+DP YTQI+ +
Sbjct: 315 TSFTALSDPMYTQITSS 331


>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
          Length = 335

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 141/251 (56%), Positives = 187/251 (74%), Gaps = 4/251 (1%)

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
           +VALDTGSDLFW+PCDC  C     ++     + +IY+P  S+T+ KV CN++LC  + Q
Sbjct: 1   MVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 60

Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
           C    S CPY V Y+S  T ++G L+EDV+HL T++K  + V++ ++FGCG+VQ+GSFLD
Sbjct: 61  CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 120

Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
            AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS  Q ETPF+L
Sbjct: 121 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 180

Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
             +HP YNIT+T+V VG   ++ EF+A+FD+GTSFTYL DP YT +SE+    A++KR +
Sbjct: 181 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQDKRHS 236

Query: 359 STSDLPFEYCY 369
             S +PFEYCY
Sbjct: 237 PDSRIPFEYCY 247


>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
           vinifera]
          Length = 294

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 141/202 (69%), Positives = 166/202 (82%), Gaps = 2/202 (0%)

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
           CG+VQTGSFL+GAAPNGLFGLGM   SVPSILA +GL+ +SFSMCFG+DGTGRISFGD+G
Sbjct: 1   CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60

Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET 347
           S GQ ETPF+  ++   YNI+ITQ+SVGG + +  F AIFDSGTSFTYLNDPAYT ISE+
Sbjct: 61  SSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFDAIFDSGTSFTYLNDPAYTSISES 120

Query: 348 FNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
           FN  AK+KR +S SDLPFEYCY +S  QT  EYP+VNLTMKGG  FFV DPIVIVS +  
Sbjct: 121 FNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ-- 178

Query: 408 GLYLYCLGVVKSDNVNIIGREY 429
           G Y+YCLGVVKS ++NIIG+ +
Sbjct: 179 GGYVYCLGVVKSGDINIIGQNF 200


>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
          Length = 306

 Score =  286 bits (731), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 141/202 (69%), Positives = 166/202 (82%), Gaps = 2/202 (0%)

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
           CG+VQTGSFL+GAAPNGLFGLGM   SVPSILA +GL+ +SFSMCFG+DGTGRISFGD+G
Sbjct: 13  CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72

Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET 347
           S GQ ETPF+  ++   YNI+ITQ+SVGG + +  F AIFDSGTSFTYLNDPAYT ISE+
Sbjct: 73  SSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFDAIFDSGTSFTYLNDPAYTSISES 132

Query: 348 FNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
           FN  AK+KR +S SDLPFEYCY +S  QT  EYP+VNLTMKGG  FFV DPIVIVS +  
Sbjct: 133 FNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ-- 190

Query: 408 GLYLYCLGVVKSDNVNIIGREY 429
           G Y+YCLGVVKS ++NIIG+ +
Sbjct: 191 GGYVYCLGVVKSGDINIIGQNF 212


>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 170/416 (40%), Positives = 227/416 (54%), Gaps = 73/416 (17%)

Query: 24  CFGF------GTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRG 76
           C+GF      G FGF+ HH +SD VK  L + DL P++GS  Y+  LAHRDR   +RGRG
Sbjct: 17  CWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRL--IRGRG 74

Query: 77  LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC- 135
           LA+  ND+TP+TF  GN T  +  LG L+Y NVSVG P  SF+VALDTGSDLFWLPC+C 
Sbjct: 75  LASN-NDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG 133

Query: 136 VSCVHGLNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
            +C+  L      Q +  N+Y+PN S+TSS + C+   C   K+C S  S CPYQ+ Y S
Sbjct: 134 TTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-S 192

Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
           + T + G L++DVLHLAT+++    V + ++ GCG+ QTG F    + NG+ GLG+   S
Sbjct: 193 NSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252

Query: 255 VPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
           VPS+LA   +  NSFSMCFG      GRISFG                            
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFG---------------------------- 284

Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET-FNSLAKEKRETSTSDLPFEYCYVL 371
                                    D  YT   ET F S+A  +R     +LPFE+CY L
Sbjct: 285 -------------------------DRGYTDQEETPFISVAPRRRPVD-PELPFEFCYDL 318

Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK---GLYLYCLGVVKSDNVNI 424
           SPN T  ++P+V +T  GG    +N+P     ++ +   G  +YCLGV+KS  + I
Sbjct: 319 SPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKI 374


>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 414

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 160/404 (39%), Positives = 223/404 (55%), Gaps = 59/404 (14%)

Query: 10  VCVLLILLSCCAGC--CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHR 66
           V VLL +L  C G   C   G F F+ HH +SD VK  L   DL P+KGS  Y+  LA R
Sbjct: 7   VFVLLSVLVACWGLQRCESAGKFSFEVHHMFSDTVKQNLGFGDLVPEKGSLEYFKLLAQR 66

Query: 67  DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
           DR   +RGRGL++  N++ P+TF  GN T  ++ L                       GS
Sbjct: 67  DRL--IRGRGLSSN-NEEAPVTFILGNRTVSIDFL-----------------------GS 100

Query: 127 DLFWLPCDC-VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
           DLFWLPC+C  +C+  L        D  +                     Q  C S  S 
Sbjct: 101 DLFWLPCNCGTTCIRDLE-------DIGLS--------------------QGGCSSPASV 133

Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           CPYQ+ YL + T + G L EDVLHL T+++  + V + I+ GCG+ QTG +    A NGL
Sbjct: 134 CPYQIPYLFNTTSTRGTLFEDVLHLVTEDEGLEPVKANITLGCGQNQTGLYRKSLAVNGL 193

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP 303
            GLGM   SVPS+LA + +  NSFSMCFG+  D  GRISFGD+G   Q +TP    + +P
Sbjct: 194 LGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTDQLQTPLVPIEPNP 253

Query: 304 TYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
           TY + +T+V+VGG+ +  +  A+FD+GTSFT+L +PAY  +++ F+    +KR     ++
Sbjct: 254 TYAVNVTEVTVGGDILEIQMLALFDTGTSFTHLLEPAYGLLTKAFDDHVTDKRRPIDPEI 313

Query: 364 PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
           PFE+CY  SPN  +F++P VN+T  GG    + DP+  V +E +
Sbjct: 314 PFEFCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFTVWNEAR 357


>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 430

 Score =  271 bits (692), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 141/287 (49%), Positives = 188/287 (65%), Gaps = 12/287 (4%)

Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
            N YSPN S+TSS VPC S+LC    +C S  + CPY++RYLS  T S G+LVEDVLHLA
Sbjct: 3   LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           TD+   K V+++I+FGCG VQTG F   AAPNGL GLGM+K SVPS LA+QGL  NSFSM
Sbjct: 60  TDDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSM 119

Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGT 331
           CFG+DG GRI FGD G   Q +TPF+    + +YN+T   ++VGG   +  F+AIFDSGT
Sbjct: 120 CFGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTFNVINVGGEPNDVPFTAIFDSGT 179

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETS-TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           SFTYL +PAY+ I++  ++  K KR +    + PFEYCY + P    F+Y  +N TMKGG
Sbjct: 180 SFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFTMKGG 239

Query: 391 GPFFVNDPIVIVSSEPKGL--------YLYCLGVVKSDNVNIIGREY 429
             F   D  V +  +   +        ++ CL + KS ++++IG+ +
Sbjct: 240 DEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDIDLIGQNF 286


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 157/405 (38%), Positives = 213/405 (52%), Gaps = 16/405 (3%)

Query: 36  HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
           HR SD  +  +   V   P++GS  YY AL   D   + + R LA     K   TFS GN
Sbjct: 33  HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D      LG+L+Y  V VG PA SF+VALDTGSDLFW+PCDC+ C            D  
Sbjct: 91  D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY P  S+TS  +PC+  LC+    C +    CPY + Y S+ T S+G L+ED LHL   
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           E     V++ +  GCG+ Q+G +LDG AP+GL GLGM   SVPS LA  GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263

Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
             D +GRI FGD+G P Q  TPF  L     TY + + +  +G   +    F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
           SFT L    Y   +  F+      R     D  ++YCY  SP +   + P + LT     
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 381

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
                +PI+  + +   L  +CL V+ S + + II + + +  ++
Sbjct: 382 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 157/405 (38%), Positives = 213/405 (52%), Gaps = 16/405 (3%)

Query: 36  HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
           HR SD  +  +   V   P++GS  YY AL   D   + + R LA     K   TFS GN
Sbjct: 33  HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D      LG+L+Y  V VG PA SF+VALDTGSDLFW+PCDC+ C            D  
Sbjct: 91  D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY P  S+TS  +PC+  LC+    C +    CPY + Y S+ T S+G L+ED LHL   
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           E     V++ +  GCG+ Q+G +LDG AP+GL GLGM   SVPS LA  GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263

Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
             D +GRI FGD+G P Q  TPF  L     TY + + +  +G   +    F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
           SFT L    Y   +  F+      R     D  ++YCY  SP +   + P + LT     
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 381

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
                +PI+  + +   L  +CL V+ S + + II + + +  ++
Sbjct: 382 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 157/405 (38%), Positives = 213/405 (52%), Gaps = 16/405 (3%)

Query: 36  HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
           HR SD  +  +   V   P++GS  YY AL   D   + + R LA     K   TFS GN
Sbjct: 3   HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 60

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D      LG+L+Y  V VG PA SF+VALDTGSDLFW+PCDC+ C            D  
Sbjct: 61  D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 114

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY P  S+TS  +PC+  LC+    C +    CPY + Y S+ T S+G L+ED LHL   
Sbjct: 115 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 174

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           E     V++ +  GCG+ Q+G +LDG AP+GL GLGM   SVPS LA  GL+ NSFSMCF
Sbjct: 175 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 233

Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
             D +GRI FGD+G P Q  TPF  L     TY + + +  +G   +    F A+ DSGT
Sbjct: 234 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 293

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
           SFT L    Y   +  F+      R     D  ++YCY  SP +   + P + LT     
Sbjct: 294 SFTSLPLDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 351

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
                +PI+  + +   L  +CL V+ S + + II + + +  ++
Sbjct: 352 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 396


>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 164/415 (39%), Positives = 233/415 (56%), Gaps = 23/415 (5%)

Query: 29  TFGFDFHHRYSDPVKGILA--VDDL----PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           TF     HR+SD VK +     D L    P+K S  YY  L + D  F+ +   L  Q  
Sbjct: 35  TFSSRLIHRFSDEVKALRVSRKDSLSYSWPEKKSMDYYQILVNSD--FQRQKMKLGPQYQ 92

Query: 83  DKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
              P   S G+ T  L +  G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C   
Sbjct: 93  FLFP---SQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALDAGSDLLWVPCDCLQCAP- 148

Query: 142 LNSS--SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
           L++S  S    D N YSP+ SSTS  + C+  LCEL   C S    CPY + Y ++ T S
Sbjct: 149 LSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSS 208

Query: 200 TGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
           +G LVED+LHLA+  D   S SV + +  GCG  Q+G +LDG AP+GL GLG+ + SVPS
Sbjct: 209 SGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPS 268

Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
            LA  GLI NSFSMCF  D +GRI FGD+G   Q  TPF +L   + TY + +    VG 
Sbjct: 269 FLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGS 328

Query: 317 NAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           + +    F A+ D+GTSFT+L +  Y +I+E F+        +S +  P++YCY  S N 
Sbjct: 329 SCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATI-SSFNGYPWKYCYKSSSNH 387

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
              + P V L       F +++P+ ++    +G+  +CL +  ++ ++  IG+ +
Sbjct: 388 LT-KVPSVKLIFPLNNSFVIHNPVFMIYGI-QGITGFCLAIQPTEGDIGTIGQNF 440


>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
          Length = 263

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/220 (57%), Positives = 159/220 (72%), Gaps = 2/220 (0%)

Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
             T+E   K V + I FGCG+VQTG+FLD AAPNGLFGLGMDK SVPS+LA++G   NSF
Sbjct: 1   FKTEETIPKVVKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSF 60

Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDS 329
           SMCFGSDG GRI FGD GS  QGETPF +  +HPTYNI++  + VG ++++   SAI DS
Sbjct: 61  SMCFGSDGMGRIYFGDTGSSDQGETPFDVNHSHPTYNISLIGMEVGNSSIDVNSSAIVDS 120

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GTSFT L DP YT++SE+F++  +E R  S   +PFEYCY LS NQ +   P +NLT KG
Sbjct: 121 GTSFTCLADPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKG 180

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
           G  F +NDPI+++SSE      YCLG+VKS  +NIIG+ +
Sbjct: 181 GSQFPINDPIIVISSEQSS--FYCLGIVKSSQLNIIGQNF 218


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 156/405 (38%), Positives = 212/405 (52%), Gaps = 16/405 (3%)

Query: 36  HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
           HR SD  +  +   V   P++GS  YY AL   D   + + R LA     K   TFS GN
Sbjct: 33  HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D      LG+L+Y  V VG PA SF+VALDTGSDLFW+PCDC+ C            D  
Sbjct: 91  D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY P  S+TS  +PC+  LC+    C +    CPY + Y S+ T S+G L+ED LHL   
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           E     V++ +  GCG+ Q+G +LDG AP+GL  LGM   SVPS LA  GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCF 263

Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
             D +GRI FGD+G P Q  TPF  L     TY + + +  +G   +    F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
           SFT L    Y   +  F+      R     D  ++YCY  SP +   + P + LT     
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 381

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
                +PI+  + +   L  +CL V+ S + + II + + +  ++
Sbjct: 382 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426


>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
 gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
          Length = 492

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 161/433 (37%), Positives = 239/433 (55%), Gaps = 24/433 (5%)

Query: 12  VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALA 64
           ++L++ S          TF     HR+S   K       G +     P+K S  YY  L 
Sbjct: 2   LILVMSSFLVQNTVELATFSSRLIHRFSKEYKEVSVSRGGDVNGTWWPEKKSKEYYQILV 61

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALD 123
             D    L+ + L   G     L  S G+ T  L N  G+LHYT + +G P +SF+VALD
Sbjct: 62  SSD----LKRQKLKL-GPHYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALD 116

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQKQCPS 181
           +GSDLFW+PCDCV C   L++S    +D ++  YSP+ SSTS ++ C+  LC++   C +
Sbjct: 117 SGSDLFWVPCDCVQCAP-LSASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKN 175

Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDG 239
              +CPY + Y ++ T S+G LVED++HLA+  D+  + SV + +  GCG  Q+G +LDG
Sbjct: 176 PKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDG 235

Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SL 298
            AP+GL GLG+ + SVPS LA  GLI NSFSMCF  D +GRI FGD+G   Q   PF  L
Sbjct: 236 VAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKL 295

Query: 299 RQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
              + TY + +    VG + +    FSA+ DSGTSFT+L D  +  I+E F++     R 
Sbjct: 296 NGNYTTYIVGVEVCCVGTSCLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASR- 354

Query: 358 TSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           +S     ++YCY  S +Q   + P + L       F V +P+ ++    +G+  +CL + 
Sbjct: 355 SSFEGYSWKYCYKTS-SQDLPKIPSLRLIFPQNNSFMVQNPVFMIYGI-QGVIGFCLAIQ 412

Query: 418 KSD-NVNIIGREY 429
            +D ++  IG+ +
Sbjct: 413 PADGDIGTIGQNF 425


>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
          Length = 530

 Score =  254 bits (648), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 160/441 (36%), Positives = 230/441 (52%), Gaps = 26/441 (5%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDD-----LPKKG 55
           MA+ +  +   V+L++ SC A        F     HR+SD VK   A         P+  
Sbjct: 1   MAARFLVAMSVVVLLIESCMAA------MFSARLIHRFSDEVKAFRAARSGLSGSWPEWR 54

Query: 56  SFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQP 114
           +  YY  L   D       R     G+    L  S G+ T    N  G+LHYT + +G P
Sbjct: 55  TMEYYKMLVRSDW-----ERQKVMLGSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTP 109

Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLC 173
            +SF+VALD GSDL W+PCDC+ C     S  G +  D N YSP+ SSTS  + C+  LC
Sbjct: 110 NISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLC 169

Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRV 231
           E    C S    CPY + Y S+ T S+G L+ED+LHL +  D+  + SV + +  GCG  
Sbjct: 170 ESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMR 229

Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQ 291
           QTG +LDG AP+GL GLG+ + SVPS L+  GL+ NSFS+CF  D +GRI FGD+G   Q
Sbjct: 230 QTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQ 289

Query: 292 GETPFSLRQ-THPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFN 349
             T F      + TY + +    +G + +    F A+ DSG SFT+L D +Y  + + F+
Sbjct: 290 QTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFD 349

Query: 350 SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGL 409
                 R  S    P+EYCY  S  +   + P V L       F V++P+ +V    +G+
Sbjct: 350 KQVNATR-FSFEGYPWEYCYKSSSKEL-LKNPSVILKFALNNSFVVHNPVFVVHGY-QGV 406

Query: 410 YLYCLGVVKSD-NVNIIGREY 429
             +CL +  +D ++ I+G+ +
Sbjct: 407 VGFCLAIQPADGDIGILGQNF 427


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  254 bits (648), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 164/415 (39%), Positives = 223/415 (53%), Gaps = 21/415 (5%)

Query: 29  TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           TF     HR+SD  K       G +  D  PKK SF YY  L   D    L+ + L   G
Sbjct: 14  TFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSD----LKRQKLKL-G 68

Query: 82  NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
            +   L  S G+D   L N  G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C  
Sbjct: 69  AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAP 128

Query: 141 GLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
              S   ++  D N YSP+ SSTS  + CN  LCEL   C S+   CPY   Y S+ T S
Sbjct: 129 LSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSS 188

Query: 200 TGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
           +G L+ED LHLA  ++     SV + +  GCGR Q+G+F DGAAP+GL GLG    SVPS
Sbjct: 189 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 248

Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
           +LA  GL+ N+FS+CF  + +G I FGD+G   Q  T F  L     TY I +    VG 
Sbjct: 249 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS 308

Query: 317 NAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           +++    F A+ DSGTSFT+L    Y +I   F+      R +S    P++YCY  S +Q
Sbjct: 309 SSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATR-SSFKGSPWKYCYN-SSSQ 366

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK-SDNVNIIGREY 429
                P V L       F V++P++ + SE +   ++CL +    +   IIG+ +
Sbjct: 367 ELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNF 421


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  254 bits (648), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 161/401 (40%), Positives = 208/401 (51%), Gaps = 21/401 (5%)

Query: 36  HRYSDPVKGILAVD----DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA 91
           HR SD  +  LA        P+ GS  YY AL   D   + R   L +         FS 
Sbjct: 80  HRLSDEAR--LAAGPHGARWPRHGSGGYYRALVRSDLQRQKRKHQLLSVSEAGG--IFSP 135

Query: 92  GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID 151
           GND       G+L+YT V VG P  SF+VALDTGSDLFW+PCDC+ C            D
Sbjct: 136 GND------FGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRD 189

Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
             IY P  S+TS  +PC+  LC     C S    CPY   YL + T S+G L+ED+LHL 
Sbjct: 190 LGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLD 249

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           + E  +  V + +  GCGR Q+GS+LDG AP+GL GLGM   SVPS LA  GL+ NSFSM
Sbjct: 250 SRESHAP-VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSM 308

Query: 272 CFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDS 329
           CF  D +GRI FGD+G   Q  TPF  L   + TY + + +  VG        F A+ DS
Sbjct: 309 CFKED-SGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDS 367

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GTSFT L    Y  ++  F+      R T   D  FEYCY  SP +   + P V LT   
Sbjct: 368 GTSFTALPLNVYKAVAVEFDKQVHAPRITQ-EDASFEYCYSASPLKMP-DVPTVTLTFAA 425

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREY 429
              F   +P +++      +  +CL + KS + + IIG+ +
Sbjct: 426 NKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNF 466


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 164/415 (39%), Positives = 223/415 (53%), Gaps = 21/415 (5%)

Query: 29  TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           TF     HR+SD  K       G +  D  PKK SF YY  L   D    L+ + L   G
Sbjct: 24  TFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSD----LKRQKLKL-G 78

Query: 82  NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
            +   L  S G+D   L N  G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C  
Sbjct: 79  AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAP 138

Query: 141 GLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
              S   ++  D N YSP+ SSTS  + CN  LCEL   C S+   CPY   Y S+ T S
Sbjct: 139 LSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSS 198

Query: 200 TGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
           +G L+ED LHLA  ++     SV + +  GCGR Q+G+F DGAAP+GL GLG    SVPS
Sbjct: 199 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 258

Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
           +LA  GL+ N+FS+CF  + +G I FGD+G   Q  T F  L     TY I +    VG 
Sbjct: 259 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS 318

Query: 317 NAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           +++    F A+ DSGTSFT+L    Y +I   F+      R +S    P++YCY  S +Q
Sbjct: 319 SSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATR-SSFKGSPWKYCYN-SSSQ 376

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK-SDNVNIIGREY 429
                P V L       F V++P++ + SE +   ++CL +    +   IIG+ +
Sbjct: 377 ELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNF 431


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 163/422 (38%), Positives = 221/422 (52%), Gaps = 32/422 (7%)

Query: 29  TFGFDFHHRYSD-------PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           TF     HR SD       P  G+      P++GS  YY AL   D   + + R LA + 
Sbjct: 26  TFSSRMVHRLSDEARLEAGPRMGLW-----PQRGSGGYYRALLRSD--LQRQKRRLAGKN 78

Query: 82  N----DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
                 K   TFS GND      LG+L+Y  V VG P  SF+VALDTGSDLFW+PCDC+ 
Sbjct: 79  QLLSLSKGGSTFSPGND------LGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132

Query: 138 CVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDG 196
           C   L+S  G +  D  IY P  S+TS  +PC+  LC+    C +    C Y + Y S+ 
Sbjct: 133 CAP-LSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSEN 191

Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           T S+G L+ED LHL + E  +  V++ +  GCGR Q+G +LDG AP+GL GLGM   SVP
Sbjct: 192 TTSSGLLIEDSLHLNSREGHAP-VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVP 250

Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
           S LA  GL+ NSFSMCF  D +GRI FGD+G   Q  TPF  L     TY + + +  +G
Sbjct: 251 SFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIG 310

Query: 316 GNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
              +    F A+ DSGTSFT L    Y   +  F+      R     D  ++YCY  SP 
Sbjct: 311 HKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASR-VPYEDSTWKYCYSASPL 369

Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIAN 433
           +   + P + L       F   +PI+  + E   L  +CL V+ S + + IIG+ + +  
Sbjct: 370 EMP-DVPTIILAFAANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGY 428

Query: 434 NI 435
           ++
Sbjct: 429 HV 430


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 152/382 (39%), Positives = 211/382 (55%), Gaps = 14/382 (3%)

Query: 59  YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALS 117
           Y+ AL   D   + R  G   Q      L+ S G   +   N LG+L+YT V VG P  S
Sbjct: 60  YFRALVRSDLQRQKRRVGGKYQL-----LSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTS 114

Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQ 176
           F+VALDTGSDLFW+PCDC+ C   L+S  G +  D  IY P+ S+TS  +PC+  LC   
Sbjct: 115 FLVALDTGSDLFWVPCDCIQCAP-LSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPA 173

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
             C +    CPY + Y S+ T S+G L+ED+LHL + E  +  V++ +  GCG+ Q+GS+
Sbjct: 174 SGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAP-VNASVIIGCGKKQSGSY 232

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF 296
           L+G AP+GL GLGM   SVPS LA  GL+ NSFSMCF  D +GRI FGD+G P Q  TPF
Sbjct: 233 LEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPF 292

Query: 297 -SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
             +     TY + + +  +G        F A+ D+GTSFT L   AY  I+  F+     
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINA 352

Query: 355 KRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL 414
            R  S+ D  FEYCY   P +   + P + LT      F   +PI+  +       ++CL
Sbjct: 353 SR-ASSDDYSFEYCYSTGPLEMP-DVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCL 410

Query: 415 GVVKS-DNVNIIGREYPIANNI 435
            V+ S + V IIG+ + +  ++
Sbjct: 411 AVLPSPEPVGIIGQNFMVGYHV 432


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 152/382 (39%), Positives = 211/382 (55%), Gaps = 14/382 (3%)

Query: 59  YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALS 117
           Y+ AL   D   + R  G   Q      L+ S G   +   N LG+L+YT V VG P  S
Sbjct: 60  YFRALVRSDLQRQKRRVGGKYQL-----LSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTS 114

Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQ 176
           F+VALDTGSDLFW+PCDC+ C   L+S  G +  D  IY P+ S+TS  +PC+  LC   
Sbjct: 115 FLVALDTGSDLFWVPCDCIQCAP-LSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPA 173

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
             C +    CPY + Y S+ T S+G L+ED+LHL + E  +  V++ +  GCG+ Q+GS+
Sbjct: 174 SGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAP-VNASVIIGCGKKQSGSY 232

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF 296
           L+G AP+GL GLGM   SVPS LA  GL+ NSFSMCF  D +GRI FGD+G P Q  TPF
Sbjct: 233 LEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPF 292

Query: 297 -SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
             +     TY + + +  +G        F A+ D+GTSFT L   AY  I+  F+     
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINA 352

Query: 355 KRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL 414
            R  S+ D  FEYCY   P +   + P + LT      F   +PI+  +       ++CL
Sbjct: 353 SR-ASSDDYSFEYCYSTGPLEMP-DVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCL 410

Query: 415 GVVKS-DNVNIIGREYPIANNI 435
            V+ S + V IIG+ + +  ++
Sbjct: 411 AVLPSPEPVGIIGQNFMVGYHV 432


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 160/392 (40%), Positives = 215/392 (54%), Gaps = 20/392 (5%)

Query: 52  PKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA-GNDTYRLNSLGFLHYTNVS 110
           P++GS  YY +L   D   + R  G    G     L+FS  G      N  G+L+YT V 
Sbjct: 158 PRRGSGDYYRSLVRSDLQRQKRRLG----GGKHQLLSFSKDGGIIPTGNDFGWLYYTWVD 213

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSC--VHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           VG P  SF+VALDTGSDLFW+PCDC+ C  + G + S  +  D  IY P  S+TS  +PC
Sbjct: 214 VGTPNTSFMVALDTGSDLFWIPCDCIECAPLSGYHGSLDR--DLGIYKPAESTTSRHLPC 271

Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
           +  LC L   C +    CPY  +YL + T S+G LVED+LHL + E  +  V + +  GC
Sbjct: 272 SHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAP-VKASVIIGC 330

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGS 288
           GR Q+GS+LDG AP+GL GLGM   SVPS LA  GL+ NSFSMCF  D +GRI FGD+G 
Sbjct: 331 GRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFTKD-SGRIFFGDQGV 389

Query: 289 PGQGETPF-SLRQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISE 346
             Q  TPF  L     TY + + +  VG     +  F AI DSGTSFT L    Y  ++ 
Sbjct: 390 STQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAI 449

Query: 347 TFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSS 404
            F+      R  + +TS   F+YCY  SP     + P V LT  G   F   +P  ++  
Sbjct: 450 EFDKQVNASRLPQEATS---FDYCYSASP-LVMPDVPTVTLTFAGNKSFQPVNPTFLLHD 505

Query: 405 EPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
           E   +  +CL VV+S + + II + + +  ++
Sbjct: 506 EEGAVAGFCLAVVQSPEPIGIIAQNFLLGYHV 537


>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
          Length = 506

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 153/416 (36%), Positives = 222/416 (53%), Gaps = 39/416 (9%)

Query: 36  HRYSDPVKGILAV----DDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
           HR+SD  +  +      + LP+K S  YY  LA  D  FR +   L A+     P     
Sbjct: 31  HRFSDEGRASIRTPSSSESLPEKQSLEYYRLLAKSD--FRRQRMNLGAKFQSLVPSEGSK 88

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
           T S+GND       G+LHYT + +G P++SF+VALDTGSDL W+PC+CV C    ++  S
Sbjct: 89  TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYS 142

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
           S    D N Y+P++SSTS    C+  LC+    C S    CPY V YLS  T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVE 202

Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           D+LHL  +        S SV +R+  GCG+ Q+G +LDG AP+GL GLG  + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
             GL+ NSFS+CF  + +GRI FGD G   Q  TPF   + +  Y + +    +G + + 
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLENNSGYIVGVEACCIGNSCLK 322

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQ 375
              F+   DSG SFTYL +  Y ++     +L  ++   +TS     + +EYCY    + 
Sbjct: 323 QTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKSFEGVSWEYCY---ESS 374

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIGREY 429
              + P + L       F ++ P+ +   + +GL  +CL +  S  + +  IG+ Y
Sbjct: 375 VEPKVPAIKLKFSHNNTFVIHKPLFVF-QQSQGLVQFCLPISPSGQEGIGSIGQNY 429


>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 529

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 159/412 (38%), Positives = 227/412 (55%), Gaps = 20/412 (4%)

Query: 29  TFGFDFHHRYSDPVKGI-LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           TF     HR+S+ +K + +   D P + +  Y+  L  R+ + R +       G  +  L
Sbjct: 26  TFSVKLFHRFSEEMKPVQVQTGDWPDRRTLHYHEKLL-RNDFLRHK----INLGGARHKL 80

Query: 88  TF-SAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
            F S G+ T    N  G+LHYT + +G P+ SF+VALD GSDL W+PCDC+ C   L++S
Sbjct: 81  LFPSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCA-PLSAS 139

Query: 146 --SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQVRYLSDGTMSTGF 202
             S    D N YSP+ S +S  + C+  LC++   C  S    CPY + YLSD T S+G 
Sbjct: 140 FYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGL 199

Query: 203 LVEDVLHLATDE--KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           LVED+ HL + +    + SV + +  GCG  Q+G +LDG AP+GL GLG  ++SVPS LA
Sbjct: 200 LVEDIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLA 259

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAV 319
             GLI +SFS+CF  D +GR+ FGD+GS  Q  TPF L      TY + +    +G +  
Sbjct: 260 KSGLIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCP 319

Query: 320 NF-EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
               F+A FDSGTSFT+L   AY  I+E F+      R T     P+EYCYV S  Q   
Sbjct: 320 KVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGS-PWEYCYVPSSQQLP- 377

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
           + P + L  +    F V +P V VS   +G+  +CL +  ++  +  IG+ +
Sbjct: 378 KIPTLTLMFQQNNSFVVYNP-VFVSYNEQGVDGFCLAIQPTEGGMGTIGQNF 428


>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 511

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 152/406 (37%), Positives = 216/406 (53%), Gaps = 20/406 (4%)

Query: 36  HRYSDPVKGILAVDD-----LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS 90
           HR+SD VK   A         P+  +  YY  L   D       R     G+    L  S
Sbjct: 11  HRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVRSDWE-----RQKVMLGSKYQFLFPS 65

Query: 91  AGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV 149
            G+ T    N  G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C     S  G +
Sbjct: 66  EGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSL 125

Query: 150 -IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
             D N YSP+ SSTS  + C+  LCE    C S    CPY + Y S+ T S+G L+ED+L
Sbjct: 126 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 185

Query: 209 HLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
           HL +  D+  + SV + +  GCG  QTG +LDG AP+GL GLG+ + SVPS L+  GL+ 
Sbjct: 186 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 245

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAV-NFEFS 324
           NSFS+CF  D +GRI FGD+G   Q  T F      + TY + +    +G + +    F 
Sbjct: 246 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 305

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
           A+ DSG SFT+L D +Y  + + F+      R  S    P+EYCY  S  +   + P V 
Sbjct: 306 ALVDSGASFTFLPDESYRNVVDEFDKQVNATR-FSFEGYPWEYCYKSSSKEL-LKNPSVI 363

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
           L       F V++P+ +V    +G+  +CL +  +D ++ I+G+ +
Sbjct: 364 LKFALNNSFVVHNPVFVVHGY-QGVVGFCLAIQPADGDIGILGQNF 408


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 158/428 (36%), Positives = 222/428 (51%), Gaps = 23/428 (5%)

Query: 12  VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKG-ILAVDDLPKKGSFAYYSALAHRDRYF 70
           +LL +LS  +        F     HR+SD  +  I +    P+K SF YY  L   D   
Sbjct: 8   ILLFILSLVSEKSLA-SLFSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSIDS-- 64

Query: 71  RLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
           R +   L A+     P   S G+ T    N  G+LHYT + +G P++SF+VALD+GSDL 
Sbjct: 65  RRQKMNLGAKFQSLVP---SEGSKTISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLL 121

Query: 130 WLPCDCVSC--VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCP 187
           W+PC+CV C  +     SS    D N + P+ S+TS   PC+  LCE    C S    CP
Sbjct: 122 WIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCP 181

Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
           Y V Y S+ T S+G LVEDVLHLA     S SV +R+  GCG  Q+G FL G AP+G+ G
Sbjct: 182 YTVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMG 241

Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYN 306
           LG  + SVPS LA  GL+ NSFSMCF  + +GRI FGD G   Q  T F   +     Y 
Sbjct: 242 LGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYF 301

Query: 307 ITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
           + +    VG + +    F+ + DSG SFT+L +  Y +++   +S      +      P+
Sbjct: 302 VGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGG-PW 360

Query: 366 EYCYVLSPNQTNFE--YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
           EYCY     +T+FE   P + L       F ++ P+ ++    +GL  +CL +  S+   
Sbjct: 361 EYCY-----ETSFEPKVPAIKLKFSSNNTFVIHKPLFVLQRS-EGLVQFCLPISASEEGT 414

Query: 424 --IIGREY 429
             +IG+ Y
Sbjct: 415 GGVIGQNY 422


>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 520

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 161/409 (39%), Positives = 222/409 (54%), Gaps = 16/409 (3%)

Query: 29  TFGFDFHHRYSDPVKGILAVDD-LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           TF     HR++D +K +       P + S  YY  L   D    +  R +   G     L
Sbjct: 22  TFSARLVHRFADEMKPVRPPTGYWPDRWSMGYYRMLLTGD----ILRRKIKVGGARYQLL 77

Query: 88  TFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS- 145
             S G+ T  L N  G+LHYT + +G P+ SF+VALD GSDL W+PCDCV C   L+SS 
Sbjct: 78  FPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAP-LSSSY 136

Query: 146 -SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
            S    D N YSP+ S +S  + C+  LC+    C S+   CPY V YLS+ T S+G LV
Sbjct: 137 YSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLV 196

Query: 205 EDVLHLATDEKQSKS-VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           ED+LHL +    S S V + +  GCG  Q+G +LDG AP+GL GLG  ++SVPS LA  G
Sbjct: 197 EDILHLQSGGSLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSG 256

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF- 321
           LI +SFS+CF  D +GRI FGD+G   Q  T F  L   + TY I +    VG + +   
Sbjct: 257 LIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMT 316

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
            F    DSGTSFT+L    Y  I+E F+      R +S    P+EYCYV S +Q   + P
Sbjct: 317 SFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSR-SSFEGSPWEYCYVPS-SQELPKVP 374

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
            + LT +    F V DP+ +     +G+  +CL +  ++ ++  IG+ +
Sbjct: 375 SLTLTFQQNNSFVVYDPVFVFYGN-EGVIGFCLAIQPTEGDMGTIGQNF 422


>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 530

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 162/417 (38%), Positives = 216/417 (51%), Gaps = 23/417 (5%)

Query: 29  TFGFDFHHRYSDPVKGILA------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           TF     HR+SD  K  L       V   PK+GS  Y+  L + D     +   L +Q  
Sbjct: 24  TFSSRIIHRFSDEAKVHLRNNGGENVQSWPKRGSSEYFRLLLNSD--LTRQKMKLGSQDQ 81

Query: 83  DKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
              P   S G+ T    N   +LHYT + +G P +SF+VALDTGSD+FW+PCDC+ C   
Sbjct: 82  SFYP---SEGSKTLSFGNDFVWLHYTWIDIGTPNVSFLVALDTGSDMFWVPCDCIECAP- 137

Query: 142 LNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
           L+++    +D   N YSP+ SS+S  +PC   LC     C      CPY   Y SD T S
Sbjct: 138 LSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSS 197

Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
           +GFL+ED LHLA++     S+ + +  GCGR Q+G FL+GAAPNG+ GLG    SVP++L
Sbjct: 198 SGFLIEDKLHLASNNATKNSIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALL 257

Query: 260 ANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-TPFSLRQTH-PTYNITITQVSVGGN 317
           A  GLI NS S+C    G+GRI FGD+G   Q   TPF L       Y + + +  VG  
Sbjct: 258 AKAGLIRNSISICLNEKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSF 317

Query: 318 AVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
                EF A  D+GTSFTYL    Y  +   F       R TS     F  CY  S  ++
Sbjct: 318 CYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRES 377

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI-IGREYPIA 432
           N  +P +  T      F + +P + +  E   +   CL VV+SD+  I IGR+Y IA
Sbjct: 378 N-NFPPMKFTFSKNQSFIIQNPFISMDQEDTTI---CLAVVQSDDELITIGRKYTIA 430


>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 532

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 152/416 (36%), Positives = 229/416 (55%), Gaps = 22/416 (5%)

Query: 29  TFGFDFHHRYSDPVKGILAVDD--------LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
           TF     HR+S+ +K + A            P+KGS  YY  L   D  FR +   L ++
Sbjct: 23  TFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGD--FRRQKMKLGSR 80

Query: 81  GNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
                P   S G+ T  L N  G+LHYT + +G P++SF+VALD GSDL W+PC+C+ C 
Sbjct: 81  FQLLFP---SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCA 137

Query: 140 HGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
               S  G +  D N Y P++SSTS  + C+  LC+  + C S   +CPY + Y+++ T 
Sbjct: 138 PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTS 197

Query: 199 STGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           S+G L++DVLHL++  + S   ++ + +  GCG  Q+G +L G AP+GLFGLG+ + SV 
Sbjct: 198 SSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVL 257

Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
           S LA + L+ NSFS+CF  DG+GRI FGD+G   Q  T F  L   + TY + +    + 
Sbjct: 258 SSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIE 317

Query: 316 GNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
            + +    F A+ DSGTSFTYL + AY  I   F+         S    P++YCY +S +
Sbjct: 318 NSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISAD 377

Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
               + P V L       F V+DP+  +  + +GL  +C  ++ +D ++ I+G+ Y
Sbjct: 378 AMP-KVPSVTLLFPLNNSFVVHDPVFPIYGD-QGLAGFCFAILPADGDIGILGQNY 431


>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 521

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 159/409 (38%), Positives = 221/409 (54%), Gaps = 16/409 (3%)

Query: 29  TFGFDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           TF     HR++D +K +       P + S  YY  L   D    +  R +   G     L
Sbjct: 23  TFSARLVHRFADEMKPVRPPTGYWPDQRSMRYYQMLLTGD----ILRRKIKVGGTRYQLL 78

Query: 88  TFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS- 145
             S G+ T  L N  G+LHYT + +G P+ SF+VALD GSDL W+PCDCV C   L+SS 
Sbjct: 79  FPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAP-LSSSY 137

Query: 146 -SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
            S    D N YSP+ S +S  + C+  LC+    C S+   CPY V YLS+ T S+G LV
Sbjct: 138 YSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLV 197

Query: 205 EDVLHLATDEKQSKS-VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           ED+LHL +    S S V + +  GCG  Q+G +LDG AP+GL GLG  ++SVPS LA  G
Sbjct: 198 EDILHLQSGGTLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSG 257

Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF- 321
           LI  SFS+CF  D +GR+ FGD+G   Q  T F  L   + TY I +    +G + +   
Sbjct: 258 LIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKMT 317

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
            F A  DSGTSFT+L    Y  I+E F+      R +S    P+EYCYV S +Q   + P
Sbjct: 318 SFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSR-SSFEGSPWEYCYVPS-SQDLPKVP 375

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
              L  +    F V DP+ +     +G+  +CL ++ ++ ++  IG+ +
Sbjct: 376 SFTLMFQRNNSFVVYDPVFVFYGN-EGVIGFCLAILPTEGDMGTIGQNF 423


>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
 gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
 gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
 gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
          Length = 528

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 156/424 (36%), Positives = 222/424 (52%), Gaps = 41/424 (9%)

Query: 30  FGFDFHHRYSDP----VKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
           F     HR+SD     +K   + D LP K S  YY  LA  D  FR +   L A+     
Sbjct: 25  FSSRLIHRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAESD--FRRQRMNLGAKVQSLV 82

Query: 86  P----LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
           P     T S+GND       G+LHYT + +G P++SF+VALDTGS+L W+PC+CV C   
Sbjct: 83  PSEGSKTISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPL 136

Query: 142 LNS--SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
            ++  SS    D N Y+P++SSTS    C+  LC+    C S    CPY V YLS  T S
Sbjct: 137 TSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSS 196

Query: 200 TGFLVEDVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
           +G LVED+LHL  +        S SV +R+  GCG+ Q+G +LDG AP+GL GLG  + S
Sbjct: 197 SGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEIS 256

Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQV 312
           VPS L+  GL+ NSFS+CF  + +GRI FGD G   Q  TPF       +  Y + +   
Sbjct: 257 VPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEAC 316

Query: 313 SVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEY 367
            +G + +    F+   DSG SFTYL +  Y ++     +L  ++   +TS     + +EY
Sbjct: 317 CIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKNFEGVSWEY 371

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNII 425
           CY  S      + P + L       F ++ P+ +   + +GL  +CL +  S  + +  I
Sbjct: 372 CYESSAEP---KVPAIKLKFSHNNTFVIHKPLFVF-QQSQGLVQFCLPISPSGQEGIGSI 427

Query: 426 GREY 429
           G+ Y
Sbjct: 428 GQNY 431


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  244 bits (622), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 160/421 (38%), Positives = 221/421 (52%), Gaps = 27/421 (6%)

Query: 29  TFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           TF     HR+SD  K I        + D  PK+ SF Y+  L   D       R     G
Sbjct: 27  TFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDL-----KRQRMKLG 81

Query: 82  NDKTPLTF-SAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
           + K  L F S G+      N L +LHYT + +G P +SF+VALD GSDL W+PCDC+ C 
Sbjct: 82  SQKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCA 141

Query: 140 HGLNSSSGQV---IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS-D 195
             L++S   +    D + YSP+ SSTS  + C+  LCE    C +    CPY   Y   +
Sbjct: 142 -PLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFE 200

Query: 196 GTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
            T S GFLVED LHLA+  D    K + + +  GCGR Q GSF DGAAP+G+ GLG    
Sbjct: 201 NTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDI 260

Query: 254 SVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQV 312
           SVPS+LA  GLI N FS+CF  + +GRI FGD+G   Q  TPF  ++ T+  Y + +   
Sbjct: 261 SVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESY 320

Query: 313 SVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
            VG + +    F A+ DSG+SFTYL    Y ++   F+     KR  S  D  ++YCY  
Sbjct: 321 CVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKR-ISFQDGLWDYCYNA 379

Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREYP 430
           S +Q   + P + L       F V++P   +    +G  ++CL +  +D +  IIG+ + 
Sbjct: 380 S-SQELHDIPAIQLKFPRNQNFVVHNPTYSIPHH-QGFTMFCLSLQPTDGSYGIIGQNFM 437

Query: 431 I 431
           I
Sbjct: 438 I 438


>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
          Length = 426

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 148/403 (36%), Positives = 222/403 (55%), Gaps = 52/403 (12%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G+  F+ HHR+S+ VK +L    LP+ GS  YY AL HRDR     GR L +  N++T +
Sbjct: 20  GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRQLTSNNNNQTTI 74

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
           +F+ GN T  ++    L+  N++   P L F +               V C   L     
Sbjct: 75  SFAQGNSTEEIS----LYDKNLA---PPLYFHLT------------QAVICFGYL----- 110

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
                          +  +P    +  L K +C S  S+CPY++RYLS G+ STG LVED
Sbjct: 111 ---------------AIAIPLVYGVWRLTKARCISPVSDCPYRIRYLSPGSKSTGVLVED 155

Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
           V+H++T+E +++  D+RI+FG    Q G F +  A NG+ GL +   +VP++L   G+  
Sbjct: 156 VIHMSTEEGEAR--DARITFG--ESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVAS 210

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
           +SFSMCFG +G G ISFGDKGS  Q ETP S   +   Y+++IT+  VG   V+ EF+A 
Sbjct: 211 DSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTAT 270

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           FDSGT+ T+L +P YT ++  F+    ++R + + D PFE+CY+++      + P V+  
Sbjct: 271 FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFE 330

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN--IIGR 427
           MKGG  + V  PI++  +      +YCL V+K  N +  IIGR
Sbjct: 331 MKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGR 373


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 160/423 (37%), Positives = 221/423 (52%), Gaps = 36/423 (8%)

Query: 29  TFGFDFHHRYSDPVKGIL-------AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           TF     HR+S+  K +L       +    P K SF Y   L   D   + +   L AQ 
Sbjct: 23  TFSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLLLDND--LKRQKMKLGAQN 80

Query: 82  NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
               P   S G+ T+   N L +LHYT + +G P +SF+VALD GSDL W+PCDC+ C  
Sbjct: 81  QLLFP---SLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCA- 136

Query: 141 GLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
            L++S  + +D ++  Y P+ S+TS  + CN  LCEL   C +    CPY   Y    T 
Sbjct: 137 PLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTS 196

Query: 199 STGFLVEDVLHLATDEKQSKSVDSRIS----FGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
           S+GFLVED+LHLA+    S S   R+      GCGR QTG +LDGAAP+G+ GLG    S
Sbjct: 197 SSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSIS 256

Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVS 313
           VPS+LA  GLI  SFS+CF  +G+G I FGD+G   Q  TP    Q  +  Y I +    
Sbjct: 257 VPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYC 316

Query: 314 VGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
           VG + +    F A+ DSG SFTYL    Y +I   F+     +R +S    P+ YCY  S
Sbjct: 317 VGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGG-PWNYCYNTS 375

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSS-----EPKGLYLYCLGVVKSD-NVNIIG 426
             Q +   P + L+      F +N  ++I +S     + +   ++CL +  +D N  IIG
Sbjct: 376 SKQLD-NVPAMRLS------FLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNYGIIG 428

Query: 427 REY 429
           + Y
Sbjct: 429 QNY 431


>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 529

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 153/416 (36%), Positives = 222/416 (53%), Gaps = 39/416 (9%)

Query: 36  HRYSDP----VKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
           HR+SD     +K   + + LP+K S AYY  LA  D  FR +   L A+     P     
Sbjct: 31  HRFSDEGRASIKTPSSSESLPEKQSLAYYRLLAKSD--FRRQRMNLGAKFQSLVPSEGSK 88

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
           T S+GND       G+LHYT + +G P++SF+VALDTGSDL W+PC+CV C    ++  S
Sbjct: 89  TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYS 142

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
           S    D N Y+P++SS+S    C+  LC     C S    C Y V+YLS  T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCDSPKEQCTYTVKYLSGNTSSSGLLVE 202

Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           D+LHL  +        S SV +R+  GCG+ Q+G +LDG AP+GL GLG  + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
             GL+ NSFS+CF  + +GRI FGD G   Q   PF   + +  Y + +    +G + + 
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAPFLQLENNSGYIVGVEACCIGNSCLK 322

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQ 375
              F+   DSG SFTYL +  Y ++     +L  ++   +TS     + +EYCY    + 
Sbjct: 323 QTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKSFEGVSWEYCY---ESS 374

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI--IGREY 429
              + P + L       F ++ P+ +   + +GL  +CL +  S+   I  IG+ Y
Sbjct: 375 VEPKVPAIKLKFSHNNTFVIHKPLFVF-QQSQGLVQFCLPISPSEQEGIGSIGQNY 429


>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like, partial [Cucumis sativus]
          Length = 408

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 145/385 (37%), Positives = 211/385 (54%), Gaps = 20/385 (5%)

Query: 29  TFGFDFHHRYSDPVKGILAVDD--------LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
           TF     HR+S+ +K + A            P+KGS  YY  L   D  FR +   L ++
Sbjct: 23  TFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGD--FRRQKMKLGSR 80

Query: 81  GNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
                P   S G+ T  L N  G+LHYT + +G P++SF+VALD GSDL W+PC+C+ C 
Sbjct: 81  FQLLFP---SEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCA 137

Query: 140 HGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
               S  G +  D N Y P++SSTS  + C+  LC+  + C S   +CPY + Y+++ T 
Sbjct: 138 PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTS 197

Query: 199 STGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           S+G L++DVLHL++  + S   ++ + +  GCG  Q+G +L G AP+GLFGLG+ + SV 
Sbjct: 198 SSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVL 257

Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
           S LA + L+ NSFS+CF  DG+GRI FGD+G   Q  T F  L   + TY + +    + 
Sbjct: 258 SSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIE 317

Query: 316 GNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
            + +    F A+ DSGTSFTYL + AY  I   F+         S    P++YCY +S +
Sbjct: 318 NSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISAD 377

Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPI 399
               + P V L       F V+DP+
Sbjct: 378 AMP-KVPSVTLLFPLNNSFVVHDPV 401


>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 525

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 152/419 (36%), Positives = 219/419 (52%), Gaps = 29/419 (6%)

Query: 28  GTFGFDFHHRYSDPVKGILA---------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLA 78
            TF     HR+S+  K  LA         +   P++ S  Y+  L   D   R R R   
Sbjct: 23  ATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSD-VARQRMR--- 78

Query: 79  AQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
             G+    L  S G  T+   N+L +LHYT + +G P +SF+VALD GSD+ W+PCDC+ 
Sbjct: 79  -LGSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIE 137

Query: 138 CVHGLNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSD 195
           C   L++ +  V+D   N Y P+ S+TS  +PC   LC++   C  +   CPY+V+Y S 
Sbjct: 138 CA-SLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPYEVQYASA 196

Query: 196 GTMSTGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
            T S+G++ ED LHL +D K ++  SV + I  GCGR QTG +L GA P+G+ GLG    
Sbjct: 197 NTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNI 256

Query: 254 SVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVS 313
           SVPS+LA  GLI NSFS+C   + +GRI FGD+G   Q  TPF        Y + +    
Sbjct: 257 SVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPF---LPIIAYMVGVESFC 313

Query: 314 VGGNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
           VG   +    F A+ DSG+SFT+L +  Y ++   F+      R    S   +EYCY  S
Sbjct: 314 VGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSS--WEYCYNAS 371

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVI-VSSEPKGLYLYCLGVVKS-DNVNIIGREY 429
            +Q     P + L       F + +PI    +S+ +   ++CL V  S D+   IG+ +
Sbjct: 372 -SQELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDYAAIGQNF 429


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 151/456 (33%), Positives = 229/456 (50%), Gaps = 27/456 (5%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFG---FGTFGFDFHHRYSDPV-------KGILAVDD 50
           MA++ R+  V   L+++ CC               D  H++S           G+    D
Sbjct: 1   MATTVRSRGV---LVMVHCCVLWMLATTFANALRMDLFHKFSKQAIEAMRSRNGMDYAQD 57

Query: 51  LPKKGSFAYYSALAHRD--RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTN 108
            P +G+  + + L   D  R+ R   R LAA   D+  L    GN T +L   G LHY+ 
Sbjct: 58  WPTEGTIEFQTMLRDHDVARHTRTARRILAASSMDQYVLI--QGNATEQLFG-GGLHYSY 114

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVH-GLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           + +G P + F+V LDTGSDL W+PC+C SC      S   +    N Y+P+ SST+  V 
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174

Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           C+  LCE+   C +    CPY++ Y+S  T ++G L ED ++    E     V   +  G
Sbjct: 175 CSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYF-MRESGGNPVKLPVYLG 233

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
           CG+VQTGS L GAAPNGL GLG    SVP+ LA+ G + +SFS+C    G+G ++FGD+G
Sbjct: 234 CGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEG 293

Query: 288 SPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQIS 345
              Q  TP   +      TY + I  ++VG   +     A+FD+GTSFTYL+   Y Q  
Sbjct: 294 PAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYLSKTVYPQFV 353

Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
           + +++     +        ++ CY  S   TNF+ PVV+L + GG    V   +  +  +
Sbjct: 354 QAYDAQMSLPKWNDPRFSKWDLCYQTS--NTNFQVPVVSLALSGGNSLDVVSGLKSIVDD 411

Query: 406 PKGLYLYCLGVVKS-DNVNIIGREYPIANNISLFHN 440
              +   C+ V+ S   ++IIG+ +    N S+ +N
Sbjct: 412 NNAMIAVCVTVMDSGAGLSIIGQNF--MTNYSITYN 445


>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 880

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 151/411 (36%), Positives = 218/411 (53%), Gaps = 30/411 (7%)

Query: 29  TFGFDFHHRYSDPVKGILA---------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA 79
           TF     HR+S+  K  LA         +   P++ S  Y+  L   D   R R R L +
Sbjct: 24  TFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSD-VTRQRMR-LGS 81

Query: 80  QGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
           Q     P  F  G      N+L +LHYT + +G P +SF+VALD GSD+ W+PCDC+ C 
Sbjct: 82  QYEMLYP--FEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECA 139

Query: 140 HGLNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
             L++ +  V+D   N Y P+ S+TS  +PC   LC++   C  +   CPY V+Y S  T
Sbjct: 140 -SLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANT 198

Query: 198 MSTGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
            S+G++ ED LHL ++ K ++  SV + I  GCGR QTG +L GA P+G+ GLG    SV
Sbjct: 199 SSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISV 258

Query: 256 PSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSV 314
           PS+LA  GLI NSFS+CF  + +GRI FGD+G   Q  TPF  +      Y + +    V
Sbjct: 259 PSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCV 318

Query: 315 GGNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL---PFEYCYV 370
           G   +    F A+ DSG+SFT+L +  Y ++   F+     K+  +TS +    +EYCY 
Sbjct: 319 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFD-----KQVNATSIVLQNSWEYCYN 373

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
            S +Q     P +NL       + + +PI I     +   ++CL V  SD+
Sbjct: 374 AS-SQELISIPPLNLAFSRNQTYLIQNPIFI-DPASQEYTIFCLPVSPSDD 422


>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 523

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 153/436 (35%), Positives = 227/436 (52%), Gaps = 29/436 (6%)

Query: 11  CVLLILL--SCCAGCCFGFGTFGFDFHHRYSDPVK--------GILAVDDLPKKGSFAYY 60
           C LL+L   S    C     T   +  HR+SD  K        G ++    P   S  Y+
Sbjct: 4   CALLLLFIASLFVNCSLAL-TLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLKYF 62

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFI 119
             L   D    L+ R L   G+    L  S G+      N   +LHYT + +G P++ F+
Sbjct: 63  QMLMDYD----LKRRRLNI-GSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFL 117

Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQK 177
           VALD GSDL W+PCDC+ C   L+++   V+D ++  Y+P  SSTS  + C   LC    
Sbjct: 118 VALDVGSDLLWVPCDCIQCA-PLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWST 176

Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS--VDSRISFGCGRVQTGS 235
            C SA   C Y+  Y SD T ++GF++ED L L +  K      + + + FGCGR Q+GS
Sbjct: 177 TCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGS 236

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP 295
           +LDGAAP+G+ GLG    SVP++LA +GL+ N+FS+CF ++G+GRI FGD G   Q  T 
Sbjct: 237 YLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQ 296

Query: 296 F-SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
           F  L      Y I +    VG + +    F A+ DSG+SFTYL    Y +I   F+   K
Sbjct: 297 FLPLFGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVK 356

Query: 354 -EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY 412
                    +LP+ YCY +S    +F  P + L        F++DP+ ++ +  +G  ++
Sbjct: 357 VNATRIVLRELPWNYCYNIS-TLVSFNIPSMQLVFPLNQ-IFIHDPVYVLPAN-QGYKVF 413

Query: 413 CLGVVKSD-NVNIIGR 427
           CL + ++D +  +IG+
Sbjct: 414 CLTLEETDEDYGVIGQ 429


>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
 gi|255630909|gb|ACU15817.1| unknown [Glycine max]
          Length = 244

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 99/168 (58%), Positives = 123/168 (73%), Gaps = 5/168 (2%)

Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
           MCFG DG GRI+FGD GSP Q +TPF++R+ HPTYNITITQ+ V  +  + EF AIFDSG
Sbjct: 1   MCFGPDGAGRITFGDTGSPDQRKTPFNVRKLHPTYNITITQIVVEDSVADLEFHAIFDSG 60

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           TSFTY+NDPAYT++ E +NS  K  R +S    S++PFEYCY +S NQT  E P +NLTM
Sbjct: 61  TSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQT-IEVPFLNLTM 119

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNI 435
           KGG  ++V DPIV V SE +G  L CLG+ KSD+VNIIG+ + I   I
Sbjct: 120 KGGDDYYVMDPIVQVFSEEEG-DLLCLGIQKSDSVNIIGQNFMIGYKI 166


>gi|359496801|ref|XP_003635339.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 151

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 88/143 (61%), Positives = 110/143 (76%), Gaps = 2/143 (1%)

Query: 10  VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
           V V++++    +  C+G GTFGFD HHR+SDPVKGIL VDDLP+K S  YY A+AHRD  
Sbjct: 10  VLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAHRD-- 67

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
           + + GR L+     K PLTFS GN+TYRL+SLG+LHY NVS+G P+L F+VALDTGSDLF
Sbjct: 68  WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLGYLHYANVSLGTPSLWFLVALDTGSDLF 127

Query: 130 WLPCDCVSCVHGLNSSSGQVIDF 152
           WLPCDC SC+ GLN++SG+V  F
Sbjct: 128 WLPCDCTSCIKGLNTTSGKVCYF 150


>gi|297739018|emb|CBI28370.3| unnamed protein product [Vitis vinifera]
          Length = 150

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 88/143 (61%), Positives = 110/143 (76%), Gaps = 2/143 (1%)

Query: 10  VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
           V V++++    +  C+G GTFGFD HHR+SDPVKGIL VDDLP+K S  YY A+AHRD  
Sbjct: 10  VLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAHRD-- 67

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
           + + GR L+     K PLTFS GN+TYRL+SLG+LHY NVS+G P+L F+VALDTGSDLF
Sbjct: 68  WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLGYLHYANVSLGTPSLWFLVALDTGSDLF 127

Query: 130 WLPCDCVSCVHGLNSSSGQVIDF 152
           WLPCDC SC+ GLN++SG+V  F
Sbjct: 128 WLPCDCTSCIKGLNTTSGKVCYF 150


>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
 gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
          Length = 307

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 101/197 (51%), Positives = 128/197 (64%), Gaps = 11/197 (5%)

Query: 244 GLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTH 302
            L GLGM+K SVPSILA+ G++  NSFSMCF  DG GRI+FGD GS  Q ETPF ++ TH
Sbjct: 8   ALMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTH 67

Query: 303 PTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----E 357
             YNI+IT +SVG   +   F AI DSGTSFTYLNDPAYT  +  FN+   E+R      
Sbjct: 68  SYYNISITSMSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGS 127

Query: 358 TSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG----LYLYC 413
           T +   PFEYCY LSP+QT  E PVV+LT  GG  F V  P+  ++++       +  YC
Sbjct: 128 TRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYC 187

Query: 414 LGVVKSD-NVNIIGREY 429
           L V+KSD  ++IIG+ +
Sbjct: 188 LAVIKSDLPIDIIGQNF 204


>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
          Length = 378

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 108/291 (37%), Positives = 153/291 (52%), Gaps = 6/291 (2%)

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
           Q  D  IY P  S+TS  +PC+  LC+    C +    CPY + Y S+ T S+G L+ED 
Sbjct: 2   QDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDT 61

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           LHL   E     V++ +  GCG+ Q+G +LDG AP+GL GLGM   SVPS LA  GL+ N
Sbjct: 62  LHLNYREDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQN 120

Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSA 325
           SFSMCF  D +GRI FGD+G P Q  TPF  L     TY + + +  +G   +    F A
Sbjct: 121 SFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKA 180

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           + DSGTSFT L    Y   +  F+      R     D  ++YCY  SP +   + P + L
Sbjct: 181 LVDSGTSFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITL 238

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
           T          +PI+  + +   L  +CL V+ S + + II + + +  ++
Sbjct: 239 TFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 289


>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
 gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
          Length = 260

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 108/163 (66%), Gaps = 5/163 (3%)

Query: 271 MCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFD 328
           MCFG+  D  GRISFGDKG   Q ETP    +  PTY +++T+VSVGG+AV  +  A+FD
Sbjct: 1   MCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQLLALFD 60

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           +GTSFT+L +P Y  I++ F+    +KR     +LPFE+CY LSPN+T   +P V +T +
Sbjct: 61  TGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFE 120

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
           GG   F+ +P+ IV +E     +YCLG++KS +  +NIIG+ +
Sbjct: 121 GGSQMFLRNPLFIVWNEDNSA-MYCLGILKSVDFKINIIGQNF 162


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/388 (29%), Positives = 181/388 (46%), Gaps = 38/388 (9%)

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
           S L  RD    LR R +    +     +     D +++     L+YT V +G P + F V
Sbjct: 41  SQLRARDE---LRHRRMLQSSSGVVDFSVQGTFDPFQVG----LYYTKVQLGTPPVEFNV 93

Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
            +DTGSD+ W+ C+  SC +G   +SG  I  N + P +SSTSS + C+   C   KQ  
Sbjct: 94  QIDTGSDVLWVSCN--SC-NGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGKQSS 150

Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
              C S  + C Y  +Y  DG+ ++G+ V D++HL T  + S + +S   + FGC   QT
Sbjct: 151 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPVVFGCSNQQT 209

Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
           G       A +G+FG G  + SV S L++QG+ P  FS C   D  G G +  G+   P 
Sbjct: 210 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEIVEPN 269

Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
              T  SL    P YN+ +  +SV G  +  + S          I DSGT+  YL + AY
Sbjct: 270 IVYT--SLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAY 327

Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPIV 400
                   +   +   T  S      CY+++ + T+  +P V+L   GG    +     +
Sbjct: 328 DPFVSAITAAIPQSVRTVVSR--GNQCYLITSSVTDV-FPQVSLNFAGGASMILRPQDYL 384

Query: 401 IVSSEPKGLYLYCLGV--VKSDNVNIIG 426
           I  +   G  ++C+G   ++   + I+G
Sbjct: 385 IQQNSIGGAAVWCIGFQKIQGQGITILG 412


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 114/388 (29%), Positives = 177/388 (45%), Gaps = 38/388 (9%)

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
           S L  RD    LR R +    N     +     D +++     L+YT V +G P + F V
Sbjct: 38  SQLRARDA---LRHRRMLQSSNGVVDFSVQGTFDPFQVG----LYYTKVQLGTPPVEFNV 90

Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
            +DTGSD+ W+ C+  S   G   +SG  I  N + P +SSTSS + C+   C    Q  
Sbjct: 91  QIDTGSDVLWVSCNSCS---GCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSS 147

Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
              C S  + C Y  +Y  DG+ ++G+ V D++HL T  + S + +S   + FGC   QT
Sbjct: 148 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQT 206

Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
           G       A +G+FG G  + SV S L++QG+ P  FS C   D  G G +  G+   P 
Sbjct: 207 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN 266

Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
              T  SL    P YN+ +  ++V G  +  + S          I DSGT+  YL + AY
Sbjct: 267 IVYT--SLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAY 324

Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPIV 400
                   +   +   T  S      CY+++ + T   +P V+L   GG    +     +
Sbjct: 325 DPFVSAITASIPQSVHTVVSR--GNQCYLITSSVTEV-FPQVSLNFAGGASMILRPQDYL 381

Query: 401 IVSSEPKGLYLYCLGV--VKSDNVNIIG 426
           I  +   G  ++C+G   ++   + I+G
Sbjct: 382 IQQNSIGGAAVWCIGFQKIQGQGITILG 409


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 124/406 (30%), Positives = 176/406 (43%), Gaps = 36/406 (8%)

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPL--TFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           S L  RDR    R    +  G    P+  TF      +   S   L+YT + +G P   F
Sbjct: 44  SQLKERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDF 103

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
            V +DTGSD+ W+ C   S  +G   SSG  I  N + P +S T+S + C+   C L  Q
Sbjct: 104 YVQIDTGSDVLWVSC---SSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQ 160

Query: 179 -----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS--KSVDSRISFGCGRV 231
                C +  + C Y  +Y  DG+ ++G+ V D+LH  T    S  K+  + I FGC  +
Sbjct: 161 SSDSVCAAQNNQCGYTFQY-GDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTL 219

Query: 232 QTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGS 288
           QTG       A +G+FG G    SV S LA+QG+ P  FS C   D  G G +  G+   
Sbjct: 220 QTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVE 279

Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDP 339
           P    TP  L  + P YN+ +  + V G  +  + S          I DSGT+  YL + 
Sbjct: 280 PNIVYTP--LVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEA 337

Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG-GPFFVNDP 398
           AY        S          S      CY L+ +  N  +P V+L   GG     +   
Sbjct: 338 AYDPFISAITSTVSPSVSPYLSK--GNQCY-LTSSSINDVFPQVSLNFAGGTSMILIPQD 394

Query: 399 IVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
            +I  S   G  L+C+G  K     I G+E  I  ++ L    + Y
Sbjct: 395 YLIQQSSINGAALWCVGFQK-----IQGQEITILGDLVLKDKIFVY 435


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 160/336 (47%), Gaps = 29/336 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P + F V +DTGSD+ W+ C+  S   G   +SG  I  N + P +SSTS
Sbjct: 24  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCS---GCPQTSGLQIQLNFFDPGSSSTS 80

Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C+   C    Q     C S  + C Y  +Y  DG+ ++G+ V D++HL T  + S 
Sbjct: 81  SMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSV 139

Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           + +S   + FGC   QTG       A +G+FG G  + SV S L++QG+ P  FS C   
Sbjct: 140 TTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKG 199

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           D  G G +  G+   P    T  SL    P YN+ +  ++V G  +  + S         
Sbjct: 200 DSSGGGILVLGEIVEPNIVYT--SLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRG 257

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+  YL + AY        +   +   T+ S      CY+++ + T   +P V+
Sbjct: 258 TIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSR--GNQCYLITSSVTEV-FPQVS 314

Query: 385 LTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKS 419
           L   GG    +     +I  +   G  ++C+G  KS
Sbjct: 315 LNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKS 350


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 166/346 (47%), Gaps = 33/346 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   F V +DTGSD+ W+   C SC +G   +SG  I  N + P +S T+
Sbjct: 80  LYYTKIRLGSPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           + V C+   C    Q   +G +     C Y  +Y  DG+ ++GF V DVL        S 
Sbjct: 137 TPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195

Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
             +S   + FGC   QTG  +    A +G+FG G    SV S LA+QGL P  FS C   
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKG 255

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           +  G G +  G+   P    TP  L  + P YN+ +  +SV G A+    S         
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313

Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I D+GT+  YL++ AY    E   N++++  R   +       CYV++ +  +  +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVIATSVADI-FPPV 369

Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGV--VKSDNVNIIG 426
           +L   GG   F+N    +I  +   G  ++C+G   +++  + I+G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 119/388 (30%), Positives = 175/388 (45%), Gaps = 37/388 (9%)

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
           S L  RDR     GR L + G            D + +     L+YT + +G P   F V
Sbjct: 14  SKLKERDRV--RHGRMLQSSGVGVVDFPVQGTFDPFLVG----LYYTRLQLGTPPRDFYV 67

Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
            +DTGSD+ W+ C   SC +G   +SG  I  N + P +S T+S + C+   C L  Q  
Sbjct: 68  QIDTGSDVLWVSCG--SC-NGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSS 124

Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
              C +  + C Y  +Y  DG+ ++G+ V D+LH  T    S   +S   I FGC  +QT
Sbjct: 125 DSVCSAQNNLCGYNFQY-GDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQT 183

Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
           G       A +G+FG G    SV S LA+QG+ P +FS C   D  G G +  G+   P 
Sbjct: 184 GDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEIVEPN 243

Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
              TP  L  + P YN+ +  +SV G  +  + S          I DSGT+  YL + AY
Sbjct: 244 IVYTP--LVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAY 301

Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF-FVNDPIV 400
                   S+         S     +CY++S +  N  +P V+L   GG     +    +
Sbjct: 302 DPFISAITSIVSPSVRPYLSK--GNHCYLIS-SSINDIFPQVSLNFAGGASMILIPQDYL 358

Query: 401 IVSSEPKGLYLYCLGV--VKSDNVNIIG 426
           I  S   G  L+C+G   ++   + I+G
Sbjct: 359 IQQSSIGGAALWCIGFQKIQGQGITILG 386


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 166/346 (47%), Gaps = 33/346 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   F V +DTGSD+ W+   C SC +G   +SG  I  N + P +S T+
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C+   C    Q   +G +     C Y  +Y  DG+ ++GF V DVL        S 
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195

Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
             +S   + FGC   QTG  +    A +G+FG G    SV S LA+QG+ P  FS C   
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           +  G G +  G+   P    TP  L  + P YN+ +  +SV G A+    S         
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313

Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I D+GT+  YL++ AY    E   N++++  R   +       CYV++ +  +  +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITTSVGDI-FPPV 369

Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGV--VKSDNVNIIG 426
           +L   GG   F+N    +I  +   G  ++C+G   +++  + I+G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/346 (30%), Positives = 166/346 (47%), Gaps = 33/346 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   F V +DTGSD+ W+   C SC +G   +SG  I  N + P +S T+
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C+   C    Q   +G +     C Y  +Y  DG+ ++GF V DVL        S 
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195

Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
             +S   + FGC   QTG  +    A +G+FG G    SV S LA+QG+ P  FS C   
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           +  G G +  G+   P    TP  L  + P YN+ +  +SV G A+    S         
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313

Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I D+GT+  YL++ AY    E   N++++  R   +       CYV++ +  +  +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITTSVGDI-FPPV 369

Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGV--VKSDNVNIIG 426
           +L   GG   F+N    +I  +   G  ++C+G   +++  + I+G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 115/377 (30%), Positives = 179/377 (47%), Gaps = 35/377 (9%)

Query: 63  LAHRDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
           +AH     R+R GR L + G     + FS  + TY    +G L+YT V +G P   F V 
Sbjct: 45  IAHLRSRDRVRHGRMLQSSGG---VIDFSV-SGTYDPFLVG-LYYTRVQLGNPPKDFYVQ 99

Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--- 178
           +DTGSD+ W+ C+  SC +G  ++SG  I  N + P +S+T+S V C+  +C L  Q   
Sbjct: 100 IDTGSDVLWVSCN--SC-NGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSD 156

Query: 179 --CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTG 234
             C    + C Y  +Y  DG+ ++G+ V D++HL    D   + +  + + FGC   QTG
Sbjct: 157 SACFGQSNQCAYVFQY-GDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTG 215

Query: 235 SFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPGQ 291
                  A +G+FG G    SV S L+++G+ P  FS C   D  G G +  G+   P  
Sbjct: 216 DLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPNV 275

Query: 292 GETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFSAIFDSGTSFTYLNDPAYT 342
             TP  L  + P YN+ +  +SV G          A +     I DSGT+  YL + AY 
Sbjct: 276 VYTP--LVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYN 333

Query: 343 QISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPIVI 401
                  ++  +  ++    L    CYV S + ++  +P V+L   GG    +     +I
Sbjct: 334 AFVVAVTNIVSQSTQSVV--LKGNRCYVTSSSVSDI-FPQVSLNFAGGASLVLGAQDYLI 390

Query: 402 VSSEPKGLYLYCLGVVK 418
             +   G  ++C+G  K
Sbjct: 391 QQNSVGGTTVWCIGFQK 407


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 121/405 (29%), Positives = 185/405 (45%), Gaps = 47/405 (11%)

Query: 51  LPKKGSFAYYSALAHRDRYFRLRGRGL-----AAQGNDKTPLTFSAGNDTYRLNSLGFLH 105
           LP KG    +  L  RD     R RGL     A  G    P+  SA  + Y +     L+
Sbjct: 38  LPHKGVPVEH--LKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSA--NPYMVG----LY 89

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +T V +G PA  + V +DTGSD+ W+ C  C  C     +SSG  I    ++P++SSTSS
Sbjct: 90  FTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGC----PTSSGLNIQLEFFNPDSSSTSS 145

Query: 165 KVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
           ++PC+   C    Q   A         S C Y   Y  DG+ ++GF V D ++  T    
Sbjct: 146 RIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTY-GDGSGTSGFYVSDTMYFDTVMGN 204

Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +Q+ +  + + FGC   Q+G  +    A +G+FG G  + SV S L + G+ P +FS C 
Sbjct: 205 EQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL 264

Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
            GSD G G +  G+   PG   TP  L  + P YN+ +  ++V G  +  + S       
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVFTP--LVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT+  YL D AY       N++A     +  S +       ++ +  +  +P 
Sbjct: 323 QGTIVDSGTTLVYLVDGAYDPF---INAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPT 379

Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             L  KGG    V  +  ++         L+C+G  +S  + I+G
Sbjct: 380 ATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILG 424


>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 217

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 77/183 (42%), Positives = 97/183 (53%), Gaps = 10/183 (5%)

Query: 36  HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
           HR SD  +  +   V   P++GS  YY AL   D   + + R LA     K   TFS GN
Sbjct: 33  HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D      LG+L+Y  V VG PA SF+VALDTGSDLFW+PCDC+ C            D  
Sbjct: 91  D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY P  S+TS  +PC+  LC+    C +    CPY + Y S+ T S+G L+ED LHL   
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204

Query: 214 EKQ 216
           E  
Sbjct: 205 EDH 207


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 169/366 (46%), Gaps = 39/366 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L++T V +G P   F V +DTGSD+ W+ C   S  +G   +SG  I    + P +S+T+
Sbjct: 83  LYFTRVQLGSPPKDFYVQIDTGSDVLWVSC---SSCNGCPVTSGLQIPLTFFDPGSSTTA 139

Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS- 217
           + V C+   C    Q     C S  + C Y  +Y  DG+ ++G+ V D++HL T    S 
Sbjct: 140 ALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQY-GDGSGTSGYYVADLMHLDTLLLSSG 198

Query: 218 ------KSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
                 ++ DS +SF C  +QTG       A +G+FG G  + SV S LA+QG+ P  FS
Sbjct: 199 ELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFS 258

Query: 271 MCFGSD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--- 325
            C   D  G G +  G+   P    TP  L  + P YN+ +  +SV G  +  + S    
Sbjct: 259 HCLKGDDSGGGVLVLGEIVEPNIVYTP--LVPSQPHYNLYLQSISVAGQTLAIDPSVFGA 316

Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
                 I DSGT+  YL + AY        S+      T  S      CY+++ +  N  
Sbjct: 317 SSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSK--GNQCYLVT-SSVNDV 373

Query: 380 YPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLF 438
           +P V+L   GG    +N    ++  +   G  ++C+G  K+      G++  I  ++ L 
Sbjct: 374 FPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTP-----GQQITILGDLVLK 428

Query: 439 HNCYSY 444
              + Y
Sbjct: 429 DKIFVY 434


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 115/381 (30%), Positives = 176/381 (46%), Gaps = 47/381 (12%)

Query: 59  YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           YY  L   D+  RLR R L     +      S  +DT+       L+YT + +G P   F
Sbjct: 12  YYRTLREHDQR-RLR-RILP----EVVAFPISGDDDTFTTG----LYYTRIYLGTPPQQF 61

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
            V +DTGSD+ W+  +CV C +    +S   +  +I+ P  S++ + + C    C L   
Sbjct: 62  YVHVDTGSDVAWV--NCVPCTN-CKRASNVALPISIFDPEKSTSKTSISCTDEECYLASN 118

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKSVDSRISFGCGRVQT 233
            +C     +CPY   Y  DG+ + G+L+ DVL    + +    + S  +R++FGCG  QT
Sbjct: 119 SKCSFNSMSCPYSTLY-GDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQT 177

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPGQ 291
           G++L     +GL G G  + S+PS L+ Q +  N F+ C   D  G+G +  G    PG 
Sbjct: 178 GTWLT----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGL 233

Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVN----FEFS----AIFDSGTSFTYLNDPAYTQ 343
             TP   +Q+H  YN+ +  + V G  V     F+ S     I DSGT+ TYL  PAY Q
Sbjct: 234 VYTPIVPKQSH--YNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQ 291

Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
                   AK +    +  LP  + +  +       +P V L   GG    ++ P   + 
Sbjct: 292 FQ------AKVRDCMRSGVLPVAFQFFCT---IEGYFPNVTLYFAGGAAMLLS-PSSYLY 341

Query: 404 SE--PKGLYLYCLGVVKSDNV 422
            E    GL  YC   ++S +V
Sbjct: 342 KEMLTTGLSAYCFSWLESTSV 362


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 150/313 (47%), Gaps = 30/313 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   F V +DTGSD+ W+   C SC +G   +SG  I  N + P +S T+
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C+   C    Q   +G +     C Y  +Y  DG+ ++GF V DVL        S 
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195

Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
             +S   + FGC   QTG  +    A +G+FG G    SV S LA+QG+ P  FS C   
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           +  G G +  G+   P    TP  L  + P YN+ +  +SV G A+    S         
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313

Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I D+GT+  YL++ AY    E   N++++  R   +       CYV++ +  +  +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITTSVGDI-FPPV 369

Query: 384 NLTMKGGGPFFVN 396
           +L   GG   F+N
Sbjct: 370 SLNFAGGASMFLN 382


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 112/409 (27%), Positives = 180/409 (44%), Gaps = 52/409 (12%)

Query: 56  SFAYYSALAHRDRYFRLRGRGLAA---QGNDK--------------TPLTFSAGNDTYRL 98
           S  Y ++L H +R F L   GL     +  D+                 +    +D Y +
Sbjct: 4   SAVYCASLLHLERAFPLNNHGLELHQLRARDRLRHARLLQGFVGGVVDFSVQGSSDPYLV 63

Query: 99  NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
                L++T V +G P   F V +DTGSD+ W+ C+ C +C      +SG  I  N +  
Sbjct: 64  G----LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPR----TSGLGIQLNFFDS 115

Query: 158 NTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
           ++SST+ +V C+  +C         QC S    C Y  +Y  DG+ ++G+ V D L+   
Sbjct: 116 SSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQY-GDGSGTSGYYVSDTLYFDA 174

Query: 213 DEKQS--KSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
              QS   +  + I FGC   Q+G       A +G+FG G  + SV S L+ +G+ P  F
Sbjct: 175 ILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVF 234

Query: 270 SMCFGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-- 325
           S C   DG+  G +  G+   PG   +P  L  + P YN+ +  ++V G  +  + +A  
Sbjct: 235 SHCLKGDGSGGGILVLGEILEPGIVYSP--LVPSQPHYNLNLLSIAVNGQLLPIDPAAFA 292

Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
                  I DSGT+  YL   AY       N++        TS      CY++S + +  
Sbjct: 293 TSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSK--GNQCYLVSTSVSQM 350

Query: 379 EYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +P+ +    GG    +  +  +I      G  ++C+G  K   V I+G
Sbjct: 351 -FPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILG 398


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 149/338 (44%), Gaps = 37/338 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT + +G P + + V +DTGSD+ WL C  C SCV      S   I    Y P+ SST
Sbjct: 36  LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPS---IKLTTYDPSRSST 92

Query: 163 SSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
              + C  + C       +  C SAG  C Y   Y  DG+ + G+ ++DV+        +
Sbjct: 93  DGALSCRDSNCGAALGSNEVSCTSAGY-CAYSTTY-GDGSSTQGYFIQDVMTFQEIHNNT 150

Query: 218 K-SVDSRISFGCGRVQTGSFL-DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           + +  + + FGCG  Q+G+ L    A +GL G G    S+PS LA+ G + N F+ C   
Sbjct: 151 QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQG 210

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
           D  G G I  G    P    TP   R     Y + +  ++V G  V    S         
Sbjct: 211 DNQGGGTIVIGSVSEPNISYTPIVSRN---HYAVGMQNIAVNGRNVTTPASFDTTSTSAG 267

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  YL DPAYTQ     ++       + +  L   +C + +      ++P V
Sbjct: 268 GVIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFSSHSQCLQLAWCSLQA------DFPTV 321

Query: 384 NLTMKGGGPFFVNDPIVIVSSEP--KGLYLYCLGVVKS 419
            L    G    +  P   + S+P   G   YC+G  KS
Sbjct: 322 KLFFDAGAVMNLT-PRNYLYSQPLQNGQAAYCMGWQKS 358


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 165/363 (45%), Gaps = 37/363 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P   + V +DTGSD+ W+ C  C  C     SSSG  I    ++P+TSST
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145

Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
           SSK+PC+   C    Q   A       S C Y   Y  DG+ ++G+ V D ++  T    
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGN 204

Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS C 
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264

Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
            GSD G G +  G+   PG   TP  L  + P YN+ +  + V G  +  + S       
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT+  YL D AY        +       +  S      C+V S +  +  +P 
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCFVTS-SSVDSSFPT 379

Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNC 441
           V+L   GG    V  +  ++  +      L+C+G  ++      G++  I  ++ L    
Sbjct: 380 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ-----GQQITILGDLVLKDKI 434

Query: 442 YSY 444
           + Y
Sbjct: 435 FVY 437


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 81/254 (31%), Positives = 128/254 (50%), Gaps = 24/254 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  F V +DTGSD+ W+ C  C+ C          +++   Y  + SST
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-----PRKSDLVELTPYDVDASST 138

Query: 163 SSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDEKQSK 218
           +  V C+   C    Q+    +GS C Y + Y  DG+ + G+LV+DV+H  L T  +Q+ 
Sbjct: 139 AKSVSCSDNFCSYVNQRSECHSGSTCQYVIMY-GDGSSTNGYLVKDVVHLDLVTGNRQTG 197

Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD 276
           S +  I FGCG  Q+G   +  AA +G+ G G   +S  S LA+QG +  SF+ C   ++
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IF 327
           G G  + G+  SP    TP   +  H  Y++ +  + VG + +    +A         I 
Sbjct: 258 GGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLELSSNAFDSGDDKGVII 315

Query: 328 DSGTSFTYLNDPAY 341
           DSGT+  YL D  Y
Sbjct: 316 DSGTTLVYLPDAVY 329


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 164/363 (45%), Gaps = 37/363 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P   + V +DTGSD+ W+ C  C  C     SSSG  I    ++P+TSST
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145

Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDE 214
           SSK+PC+   C    Q   A       S C Y   Y  DG+ ++G+ V D ++       
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDSVMGN 204

Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS C 
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264

Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
            GSD G G +  G+   PG   TP  L  + P YN+ +  + V G  +  + S       
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT+  YL D AY        +       +  S      C+V S +  +  +P 
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCFVTS-SSVDSSFPT 379

Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNC 441
           V+L   GG    V  +  ++  +      L+C+G  ++      G++  I  ++ L    
Sbjct: 380 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ-----GQQITILGDLVLKDKI 434

Query: 442 YSY 444
           + Y
Sbjct: 435 FVY 437


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/254 (31%), Positives = 126/254 (49%), Gaps = 24/254 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  F V +DTGSD+ W+ C  C+ C          +++   Y  + SST
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-----PRKSDLVELTPYDADASST 138

Query: 163 SSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDEKQSK 218
           +  V C+   C    Q+    +GS C Y + Y  DG+ + G+LV DV+H  L T  +Q+ 
Sbjct: 139 AKSVSCSDNFCSYVNQRSECHSGSTCQYVILY-GDGSSTNGYLVRDVVHLDLVTGNRQTG 197

Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD 276
           S +  I FGCG  Q+G   +  AA +G+ G G   +S  S LA+QG +  SF+ C   ++
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IF 327
           G G  + G+  SP    TP   +  H  Y++ +  + VG + +     A         I 
Sbjct: 258 GGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLQLSSDAFDSGDDKGVII 315

Query: 328 DSGTSFTYLNDPAY 341
           DSGT+  YL D  Y
Sbjct: 316 DSGTTLVYLPDAVY 329


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 107/362 (29%), Positives = 164/362 (45%), Gaps = 37/362 (10%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T V +G P   + V +DTGSD+ W+ C  C  C     SSSG  I    ++P+TSSTS
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSSTS 172

Query: 164 SKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEK 215
           SK+PC+   C    Q   A       S C Y   Y  DG+ ++G+ V D ++  T    +
Sbjct: 173 SKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGNE 231

Query: 216 QSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
           Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS C  
Sbjct: 232 QTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 291

Query: 274 GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------- 324
           GSD G G +  G+   PG   TP  L  + P YN+ +  + V G  +  + S        
Sbjct: 292 GSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQ 349

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  YL D AY        +       +  S      C+V S +  +  +P V
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCFVTS-SSVDSSFPTV 406

Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCY 442
           +L   GG    V  +  ++  +      L+C+G  ++      G++  I  ++ L    +
Sbjct: 407 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ-----GQQITILGDLVLKDKIF 461

Query: 443 SY 444
            Y
Sbjct: 462 VY 463


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/382 (28%), Positives = 169/382 (44%), Gaps = 38/382 (9%)

Query: 62  ALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
           AL  RDR     GR L          +    +D Y +     L++T V +G PA  F V 
Sbjct: 46  ALRARDR--ARHGRILQGVVGGVVDFSVQGTSDPYFVG----LYFTKVKLGSPAKEFYVQ 99

Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
           +DTGSD+ W+ C  C +C H    SSG  I+ + +    SST++ V C   +C    Q  
Sbjct: 100 IDTGSDILWINCITCSNCPH----SSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTA 155

Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT---DEKQSKSVDSRISFGCGRVQ 232
              C S  + C Y  +Y  DG+ +TG+ V D ++  T    +    +  S I FGC   Q
Sbjct: 156 TSECSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQ 214

Query: 233 TGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSP 289
           +G       A +G+FG G    SV S L+++G+ P  FS C   G +G G +  G+   P
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274

Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
               +P  L  + P YN+ +  ++V G  +  + +          I DSGT+  YL   A
Sbjct: 275 SIVYSP--LVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEA 332

Query: 341 YTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPI 399
           Y    +   +   +  +   S      CY++S N     +P V+L   GG    +N +  
Sbjct: 333 YNPFVKAITAAVSQFSKPIISK--GNQCYLVS-NSVGDIFPQVSLNFMGGASMVLNPEHY 389

Query: 400 VIVSSEPKGLYLYCLGVVKSDN 421
           ++      G  ++C+G  K + 
Sbjct: 390 LMHYGFLDGAAMWCIGFQKVEQ 411


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 140/298 (46%), Gaps = 37/298 (12%)

Query: 67  DRYFRLRG---RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           D Y  LR    R L     +      S  ND + +     L+YT +S+G P   F V +D
Sbjct: 4   DHYHTLRKHDQRRLRRMLPEVVSFPISGDNDIFAMG----LYYTRISLGTPPQQFYVDVD 59

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCEL---QKQ 178
           TGS++ W+ C  C  C H     SG V +  + + P  S+T   + C    C +   + Q
Sbjct: 60  TGSNVAWVKCAPCTGCEH-----SGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQ 114

Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKSVDSRISFGCGRVQTGS 235
           C     +CPY + Y  DG+ + G+ + DV     + +D   +KS  +R+ FGCG  QTGS
Sbjct: 115 CSPERLSCPYSLLY-GDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGS 173

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDKGSPGQGE 293
           +    + +GL G G    S+P+ LA Q +  N F+ C   D +GR  +  G    P    
Sbjct: 174 W----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVY 229

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAV------NFEFS--AIFDSGTSFTYLNDPAYTQ 343
           TP    + H  YN+ +  + + G  V      + E++   I DSGT+ TYL  PAY +
Sbjct: 230 TPMVFGEDH--YNVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYDE 285


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 116/383 (30%), Positives = 177/383 (46%), Gaps = 47/383 (12%)

Query: 41  PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
           P++    +D+L +       S L  RDR    R     GR  +  G    P+  S+  D 
Sbjct: 43  PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94

Query: 96  YRLNS-LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
           Y + S +  L++T V +G P   F V +DTGSD+ W+ C  C +C H    SSG  ID +
Sbjct: 95  YLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLH 150

Query: 154 IYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
            +    S T+  V C+  +C         QC S  + C Y  RY  DG+ ++G+ + D  
Sbjct: 151 FFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTF 208

Query: 209 HLATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLI 265
           +      +S   +S   I FGC   Q+G       A +G+FG G  K SV S L+++G+ 
Sbjct: 209 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 268

Query: 266 PNSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NA 318
           P  FS C   DG+G   F  G+   PG   +P  L  + P YN+ +  + V G     +A
Sbjct: 269 PPVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDA 326

Query: 319 VNFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSP 373
             FE S     I D+GT+ TYL   AY       N+++    +  T  +   E CY++S 
Sbjct: 327 AVFEASNTRGTIVDTGTTLTYLVKEAYDLF---LNAISNSVSQLVTPIISNGEQCYLVST 383

Query: 374 NQTNFEYPVVNLTMKGGGPFFVN 396
           + ++  +P V+L   GG    + 
Sbjct: 384 SISDM-FPSVSLNFAGGASMMLR 405


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 104/300 (34%), Positives = 135/300 (45%), Gaps = 38/300 (12%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           AH DR    RGR LAA      PL    GN    L S   L+YT V +G PA  F V +D
Sbjct: 43  AHDDRR---RGRFLAAI---DVPL---GGNG---LPSSTGLYYTKVGLGSPAKEFYVQVD 90

Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
           TGSD+ W+ C  C +C       SG  +D  +Y PN S TS+ VPC    C      P +
Sbjct: 91  TGSDILWVNCAGCTAC----PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPIS 146

Query: 183 G----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF 236
           G     +CPY + Y  DG+ ++G  V D L     +    +K  +S + FGCG  Q+GS 
Sbjct: 147 GCKQDMSCPYSITY-GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSL 205

Query: 237 LDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGE 293
              +  A +G+ G G   +SV S LA  G +   FS C  S  G G  S G    P    
Sbjct: 206 SSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNT 265

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAYTQI 344
           TP   R  H  YN+ +  + V G  +               I DSGT+  YL    Y Q+
Sbjct: 266 TPLVPRMAH--YNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL 323


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 162/345 (46%), Gaps = 32/345 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P   F V +DTGSD+ W+ C  C +C      +SG  I  N +   +SST
Sbjct: 80  LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQ----TSGLGIQLNYFDTTSSST 135

Query: 163 SSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           +  VPC+  +C  Q      QCP   + C Y  +Y  DG+ ++G+ V D  +      +S
Sbjct: 136 ARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQY-GDGSGTSGYYVSDTFYFDAVLGES 194

Query: 218 KSVDSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
              +S   I FGC   Q+G       A +G+FG G  + SV S L++ G+ P  FS C  
Sbjct: 195 LIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLK 254

Query: 274 GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
           G D G G +  G+   PG   +P  L  + P YN+ +  ++V G  +  + +A       
Sbjct: 255 GEDSGGGILVLGEILEPGIVYSP--LVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNR 312

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I D+GT+  YL + AY        + A  +  T T +     CY++S N  +  +P V
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITA-AVSQLATPTIN-KGNQCYLVS-NSVSEVFPPV 369

Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
           +    GG    +  +  ++  +   G  L+C+G  K    + I+G
Sbjct: 370 SFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILG 414


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 91/259 (35%), Positives = 128/259 (49%), Gaps = 28/259 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P   + V +DTGSD+ W+ C  C  C     SSSG  I    ++P+TSST
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145

Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
           SSK+PC+   C    Q   A       S C Y   Y  DG+ ++G+ V D ++  T    
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGN 204

Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS C 
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264

Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
            GSD G G +  G+   PG   TP  L  + P YN+ +  + V G  +  + S       
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322

Query: 325 --AIFDSGTSFTYLNDPAY 341
              I DSGT+  YL D AY
Sbjct: 323 QGTIVDSGTTLAYLADGAY 341


>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1388

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/352 (29%), Positives = 159/352 (45%), Gaps = 54/352 (15%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L++T + VG P  S+ + +DTGSDL W+ CD  C+SC  G +          +Y P  S+
Sbjct: 191 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHV---------LYKPTRSN 241

Query: 162 TSSKVPCNSTLC-ELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
             S V     LC ++QK   +   +     C Y+++Y +D + S G LV D LHL T   
Sbjct: 242 VVSSV---DALCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTTNG 297

Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
               ++  + FGCG  Q G  L+     +G+ GL   K S+P  LA++GLI N    C  
Sbjct: 298 SKTKLN--VVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLS 355

Query: 275 SDGT--GRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA----- 325
           +DG   G +  GD   P  G    P +   T   Y   I  ++ G   + F+  +     
Sbjct: 356 NDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSKVGKM 415

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN- 384
           +FDSG+S+TY    AY  +  + N ++        SD     C+     Q NF    V  
Sbjct: 416 VFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICW-----QANFPIKSVKD 470

Query: 385 -------LTMKGGGPFFVNDPIVIVSSEPKGLYL------YCLGVVKSDNVN 423
                  LT++ G  +++   +  +S  P+G  +       CLG++   NVN
Sbjct: 471 VKDYFKTLTLRFGSKWWILSTLFQIS--PEGYLIISNKGHVCLGILDGSNVN 520


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 104/356 (29%), Positives = 158/356 (44%), Gaps = 37/356 (10%)

Query: 62  ALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
           AL  RDR     GR L          +    +D Y +     L++T V +G PA  F V 
Sbjct: 46  ALRARDR--ARHGRILQGVVGGVVDFSVQGTSDPYFVG----LYFTKVKLGSPAKDFYVQ 99

Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
           +DTGSD+ W+ C  C +C H    SSG  I+ + +    SST++ V C   +C    Q  
Sbjct: 100 IDTGSDILWINCITCSNCPH----SSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTA 155

Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT---DEKQSKSVDSRISFGCGRVQ 232
              C S  + C Y  +Y  DG+ +TG+ V D ++  T    +    +  S I FGC   Q
Sbjct: 156 TSGCSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQ 214

Query: 233 TGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSP 289
           +G       A +G+FG G    SV S L+++G+ P  FS C   G +G G +  G+   P
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274

Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
               +P  L  + P YN+ +  ++V G  +  + +          I DSGT+  YL   A
Sbjct: 275 SIVYSP--LVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEA 332

Query: 341 YTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
           Y    +   +   +  +   S      CY++S N     +P V+L   GG    +N
Sbjct: 333 YNPFVDAITAAVSQFSKPIISK--GNQCYLVS-NSVGDIFPQVSLNFMGGASMVLN 385


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 168/373 (45%), Gaps = 55/373 (14%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G PA  F V +DTGSD+ W+ C  C  C     +SSG  I    ++P++SST
Sbjct: 88  LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 143

Query: 163 SSKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
           +S++ C+   C    Q   A         S C Y   Y  DG+ ++G+ V D +   T  
Sbjct: 144 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 202

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
             +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS 
Sbjct: 203 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 262

Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
           C  GSD G G +  G+   PG   TP  L  + P YN+ +  ++V G  +  + S     
Sbjct: 263 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 320

Query: 325 ----AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
                I DSGT+  YL D AY          +S +  SL  +  +          C++ S
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS 370

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPI 431
            +  +  +P V L   GG    V  +  ++  +      L+C+G  ++      G+E  I
Sbjct: 371 -SSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITI 424

Query: 432 ANNISLFHNCYSY 444
             ++ L    + Y
Sbjct: 425 LGDLVLKDKIFVY 437


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 168/373 (45%), Gaps = 55/373 (14%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G PA  F V +DTGSD+ W+ C  C  C     +SSG  I    ++P++SST
Sbjct: 90  LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 145

Query: 163 SSKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
           +S++ C+   C    Q   A         S C Y   Y  DG+ ++G+ V D +   T  
Sbjct: 146 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 204

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
             +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS 
Sbjct: 205 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 264

Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
           C  GSD G G +  G+   PG   TP  L  + P YN+ +  ++V G  +  + S     
Sbjct: 265 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 322

Query: 325 ----AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
                I DSGT+  YL D AY          +S +  SL  +  +          C++ S
Sbjct: 323 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS 372

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPI 431
            +  +  +P V L   GG    V  +  ++  +      L+C+G  ++      G+E  I
Sbjct: 373 -SSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITI 426

Query: 432 ANNISLFHNCYSY 444
             ++ L    + Y
Sbjct: 427 LGDLVLKDKIFVY 439


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 170/373 (45%), Gaps = 55/373 (14%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G PA  F V +DTGSD+ W+ C  C  C     +SSG  I    ++P++SST
Sbjct: 4   LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 59

Query: 163 SSKVPCNSTLCELQKQ-----CPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
           +S++ C+   C    Q     C ++ S    C Y   Y  DG+ ++G+ V D +   T  
Sbjct: 60  ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 118

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
             +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS 
Sbjct: 119 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 178

Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
           C  GSD G G +  G+   PG   TP  L  + P YN+ +  ++V G  +  + S     
Sbjct: 179 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 236

Query: 325 ----AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
                I DSGT+  YL D AY          +S +  SL  +  +          C++ S
Sbjct: 237 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS 286

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPI 431
            +  +  +P V L   GG    V  +  ++  +      L+C+G  ++      G+E  I
Sbjct: 287 -SSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITI 340

Query: 432 ANNISLFHNCYSY 444
             ++ L    + Y
Sbjct: 341 LGDLVLKDKIFVY 353


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 165/382 (43%), Gaps = 54/382 (14%)

Query: 10  VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
           V V L+LLS C     GF    F+  H++               KG     +AL   D  
Sbjct: 7   VLVGLLLLSFCLP---GFCNLVFEVQHKF---------------KGRERSLNALKSHD-- 46

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
            R  GR L+        +    G + +   +   L+Y  + +G P   F V +DTGSD+ 
Sbjct: 47  VRRHGRLLSV-------IDLELGGNGHPAET--GLYYARIGIGSPPNDFHVQVDTGSDIL 97

Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN---- 185
           W+  +CV C +    S   V D  +Y+P +SSTS+ + C+   C      P  G      
Sbjct: 98  WV--NCVGCSNCPKKSDIGV-DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLL 154

Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA-AP 242
           C Y+V Y  DG+ + G+ V D + L  A    ++   +  I FGCG  Q+G     + A 
Sbjct: 155 CQYKVIY-GDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEAL 213

Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
           +G+ G G   +S+ S LA  G +   F+ C  S  G G  + G+   P    TP    Q 
Sbjct: 214 DGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQA 273

Query: 302 HPTYNITITQVSVGGNAVN-----FEFS----AIFDSGTSFTYLNDPAYTQISETFNSLA 352
           H  YN+ +  V VG  A++     FE S    AI DSGT+  YL D  Y  + E     A
Sbjct: 274 H--YNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPDSIYLPLMEKILG-A 330

Query: 353 KEKRETSTSDLPFEYCYVLSPN 374
           +   +  T D  F  C+V   N
Sbjct: 331 QPDLKLRTVDDQFT-CFVFDKN 351


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 121/428 (28%), Positives = 193/428 (45%), Gaps = 57/428 (13%)

Query: 41  PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
           P++    +D+L +       S L  RDR    R     GR  +  G    P+  S+  D 
Sbjct: 43  PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94

Query: 96  YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
           Y +     L++T V +G P   F V +DTGSD+ W+ C  C +C H    SSG  ID + 
Sbjct: 95  YLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHF 146

Query: 155 YSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
           +    S T+  V C+  +C         QC S  + C Y  RY  DG+ ++G+ + D  +
Sbjct: 147 FDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFY 204

Query: 210 LATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
                 +S   +S   I FGC   Q+G       A +G+FG G  K SV S L+++G+ P
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264

Query: 267 NSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAV 319
             FS C   DG+G   F  G+   PG   +P  L  + P YN+ +  + V G     +A 
Sbjct: 265 PVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDAA 322

Query: 320 NFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPN 374
            FE S     I D+GT+ TYL   AY       N+++    +  T  +   E CY++S +
Sbjct: 323 VFEASNTRGTIVDTGTTLTYLVKEAYDLF---LNAISNSVSQLVTPIISNGEQCYLVSTS 379

Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY----LYCLGVVKSDNVNIIGREYP 430
            ++  +P V+L   GG    +     +      G+Y    ++C+G  K+     I  +  
Sbjct: 380 ISDM-FPSVSLNFAGGASMMLRPQDYLFH---YGIYDGASMWCIGFQKAPEEQTILGDLV 435

Query: 431 IANNISLF 438
           + + + ++
Sbjct: 436 LKDKVFVY 443


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 175/382 (45%), Gaps = 50/382 (13%)

Query: 41  PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
           P++    +D+L +       S L  RDR    R     GR  +  G    P+  S+  D 
Sbjct: 43  PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94

Query: 96  YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
           Y +     L++T V +G P   F V +DTGSD+ W+ C  C +C H    SSG  ID + 
Sbjct: 95  YLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHF 146

Query: 155 YSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
           +    S T+  V C+  +C         QC S  + C Y  RY  DG+ ++G+ + D  +
Sbjct: 147 FDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFY 204

Query: 210 LATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
                 +S   +S   I FGC   Q+G       A +G+FG G  K SV S L+++G+ P
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264

Query: 267 NSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAV 319
             FS C   DG+G   F  G+   PG   +P  L  + P YN+ +  + V G     +A 
Sbjct: 265 PVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDAA 322

Query: 320 NFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPN 374
            FE S     I D+GT+ TYL   AY       N+++    +  T  +   E CY++S +
Sbjct: 323 VFEASNTRGTIVDTGTTLTYLVKEAYDLF---LNAISNSVSQLVTPIISNGEQCYLVSTS 379

Query: 375 QTNFEYPVVNLTMKGGGPFFVN 396
            ++  +P V+L   GG    + 
Sbjct: 380 ISDM-FPSVSLNFAGGASMMLR 400


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 112/362 (30%), Positives = 167/362 (46%), Gaps = 44/362 (12%)

Query: 61  SALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPA 115
           S L  RDR    R     GR  +  G    P+  S+  D Y +     L++T V +G P 
Sbjct: 57  SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DPYLVG----LYFTKVKLGSPP 110

Query: 116 LSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE 174
             F V +DTGSD+ W+ C  C +C H    SSG  ID + +    S T+  V C+  +C 
Sbjct: 111 TEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHFFDAPGSFTAGSVTCSDPICS 166

Query: 175 -----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFG 227
                   QC S  + C Y  RY  DG+ ++G+ + D  +      +S   +S   I FG
Sbjct: 167 SVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFG 224

Query: 228 CGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF--G 284
           C   Q+G       A +G+FG G  K SV S L+++G+ P  FS C   DG+G   F  G
Sbjct: 225 CSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLG 284

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS----AIFDSGTSFTY 335
           +   PG   +P  L  + P YN+ +  + V G     +A  FE S     I D+GT+ TY
Sbjct: 285 EILVPGMVYSP--LLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTY 342

Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
           L   AY       N+++    +  T  +   E CY++S + ++  +P V+L   GG    
Sbjct: 343 LVKEAYDPF---LNAISNSVSQLVTLIISNGEQCYLVSTSISDM-FPPVSLNFAGGASMM 398

Query: 395 VN 396
           + 
Sbjct: 399 LR 400


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 103/351 (29%), Positives = 158/351 (45%), Gaps = 46/351 (13%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           L++T V +G PA  F V +DTGSD+ W+   PCD      G   SSG  I+ N++    S
Sbjct: 83  LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD------GCPDSSGLGIELNLFDTTKS 136

Query: 161 STSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDE 214
           S++  +PC   +C        QC +   +C Y   Y  D + ++GF V D +H  +   E
Sbjct: 137 SSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHY-RDRSGTSGFYVTDSMHFDILLGE 195

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
               +  + I FGC   Q G       A +G+FG G  + SV S L+++G+ P  FS C 
Sbjct: 196 STIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255

Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG----NAVNFEFS--- 324
             G +G G +  G+   P    +P  L  + P Y + +  +++ G    N   F  S   
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSP--LIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAG 313

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  YL +  Y  I     S   +    + S      C+ +S +  +  +PV+
Sbjct: 314 ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISR--GSQCFRVSMSVADI-FPVL 370

Query: 384 NLTMKGGGPFFVN-------DPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
               +G     V        D IV    EP    L+C+G  K+ D +NI+G
Sbjct: 371 RFNFEGIASMVVTPEEYLQFDSIV---REPA---LWCIGFQKAEDGLNILG 415


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 165/382 (43%), Gaps = 54/382 (14%)

Query: 10  VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
           V V L+LLS C     GF    F+  H++               KG     +AL   D  
Sbjct: 7   VLVGLLLLSFCLP---GFCNLVFEVQHKF---------------KGRERSLNALKSHD-- 46

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
            R  GR L+        +    G + +   +   L+Y  + +G P   F V +DTGSD+ 
Sbjct: 47  VRRHGRLLSV-------IDLELGGNGHPAET--GLYYARIGIGSPPNDFHVQVDTGSDIL 97

Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN---- 185
           W+  +CV C +    S   V D  +Y+P +SSTS+ + C+   C      P  G      
Sbjct: 98  WV--NCVGCSNCPKKSDIGV-DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLL 154

Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA-AP 242
           C Y+V Y  DG+ + G+ V D + L  A    ++   +  I FGCG  Q+G     + A 
Sbjct: 155 CQYKVIY-GDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEAL 213

Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
           +G+ G G   +S+ S LA  G +   F+ C  S  G G  + G+   P    TP    Q 
Sbjct: 214 DGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLXNTPVVPNQA 273

Query: 302 HPTYNITITQVSVGGNAVN-----FEFS----AIFDSGTSFTYLNDPAYTQISETFNSLA 352
           H  YN+ +  V VG  A++     FE S    AI DSGT+  YL +  Y  + E     A
Sbjct: 274 H--YNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPESIYLPLMEKILG-A 330

Query: 353 KEKRETSTSDLPFEYCYVLSPN 374
           +   +  T D  F  C+V   N
Sbjct: 331 QPDLKLRTVDDQFT-CFVFDKN 351


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 95/301 (31%), Positives = 139/301 (46%), Gaps = 32/301 (10%)

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
           + L  RDR  R  GR L   G      +    +D Y +     L++T V +G PA  F V
Sbjct: 32  TTLKARDRA-RHGGRILQDGGGGILDFSVQGTSDPYLVG----LYFTKVKMGSPAKEFYV 86

Query: 121 ALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ--- 176
            +DTGSD+ WL C+ C +C      SSG  ID N +   +SST++ V C+  +C      
Sbjct: 87  QIDTGSDILWLNCNTCNNC----PKSSGLGIDLNYFDTASSSTAALVSCSDPVCSYAVQT 142

Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS--KSVDSRISFGCGRVQ 232
              QC S  + C Y  +Y  DG+ ++G+ V D ++      QS   +  S + FGC   Q
Sbjct: 143 ATSQCSSQANQCSYTFQY-GDGSGTSGYYVYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQ 201

Query: 233 TGSFL-DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSP 289
           +G       A +G+FG G    SV S +++QG+ P  FS C    G+  G +  G+   P
Sbjct: 202 SGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGSGGGILVLGEILEP 261

Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
               TP    Q H  YN+ +  ++V G  +  +            I DSGT+  YL   A
Sbjct: 262 NIVYTPLVPLQPH--YNLNLQSIAVNGQILPIDQDVFATGNNRGTIVDSGTTLAYLVQEA 319

Query: 341 Y 341
           Y
Sbjct: 320 Y 320


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 98/311 (31%), Positives = 146/311 (46%), Gaps = 27/311 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P   F V +DTGSD+ W+ C   SC +G   +SG  I  N + P +SSTS
Sbjct: 76  LYYTKVKLGTPPREFYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPRSSSTS 132

Query: 164 SKVP-----CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S +      C S +      C S  + C Y  +Y  DG+ ++G+ V D++H A   + + 
Sbjct: 133 SLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQY-GDGSGTSGYYVSDLMHFAGIFEGTL 191

Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           + +S  S  FGC  +QTG       A +G+FG G    SV S L+ QG+ P  FS C   
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKG 251

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           D  G G +  G+   P    +P  L Q+ P YN+ +  +SV G  V    +         
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNNRG 309

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+  YL + AY        +L  +   +  S      CY+++ +     +P V+
Sbjct: 310 TIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSR--GNQCYLITTSSNVDIFPQVS 367

Query: 385 LTMKGGGPFFV 395
           L   GG    +
Sbjct: 368 LNFAGGASLVL 378


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/259 (32%), Positives = 127/259 (49%), Gaps = 25/259 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   + V +DTGSD+ W+  +C+SC       SG  ++  +Y P  SST 
Sbjct: 32  LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 88

Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
           SKV C+   C      L   C ++   C Y V Y  DG+ +TG+ V D+L     + + Q
Sbjct: 89  SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 146

Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++  +S ++FGCG  Q G       A +G+ G G   TS+ S L+  G +   F+ C  +
Sbjct: 147 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 206

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
            +G G  + G+   P    TP  L    P YN+ +  + VGG A+           +   
Sbjct: 207 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 264

Query: 326 IFDSGTSFTYLNDPAYTQI 344
           I DSGT+ TYL +  Y +I
Sbjct: 265 IIDSGTTLTYLPEIVYKEI 283


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 131/273 (47%), Gaps = 27/273 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   + V +DTGSD+ W+  +C+SC       SG  ++  +Y P  SST 
Sbjct: 88  LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 144

Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
           SKV C+   C      L   C ++   C Y V Y  DG+ +TG+ V D+L     + + Q
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 202

Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++  +S ++FGCG  Q G       A +G+ G G   TS+ S L+  G +   F+ C  +
Sbjct: 203 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 262

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
            +G G  + G+   P    TP  L    P YN+ +  + VGG A+           +   
Sbjct: 263 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 320

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
           I DSGT+ TYL +  Y +I       AK K  T
Sbjct: 321 IIDSGTTLTYLPEIVYKEI--MLAVFAKHKDIT 351


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 87/273 (31%), Positives = 131/273 (47%), Gaps = 27/273 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   + V +DTGSD+ W+  +C+SC       SG  ++  +Y P  SST 
Sbjct: 3   LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 59

Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
           SKV C+   C      L   C ++   C Y V Y  DG+ +TG+ V D+L     + + Q
Sbjct: 60  SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 117

Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++  +S ++FGCG  Q G       A +G+ G G   TS+ S L+  G +   F+ C  +
Sbjct: 118 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 177

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
            +G G  + G+   P    TP  L    P YN+ +  + VGG A+           +   
Sbjct: 178 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 235

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
           I DSGT+ TYL +  Y +I       AK K  T
Sbjct: 236 IIDSGTTLTYLPEIVYKEI--MLAVFAKHKDIT 266


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 103/351 (29%), Positives = 158/351 (45%), Gaps = 43/351 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           L++T V +G PA  F V +DTGSD+ W+   PCD      G   SSG  I+ N++    S
Sbjct: 83  LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD------GCPDSSGLGIELNLFDTTKS 136

Query: 161 STSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDE 214
           S++  +PC   +C        QC +   +C Y   Y  D + ++GF V D +H  +   E
Sbjct: 137 SSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHY-RDRSGTSGFYVTDSMHFDILLGE 195

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
               +  + I FGC   Q G       A +G+FG G  + SV S L+++G+ P  FS C 
Sbjct: 196 STIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255

Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG----NAVNFEFS--- 324
             G +G G +  G+   P    +P  L  + P Y + +  +++ G    N   F  S   
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSP--LIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAG 313

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  YL +  Y  I     S   +    + S      C+ +S +  +  +PV+
Sbjct: 314 ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISR--GSQCFRVSMSVADI-FPVL 370

Query: 384 NLTMKGGGPFFVN-------DPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
               +G     V        D IV   S  K   L+C+G  K+ D +NI+G
Sbjct: 371 RFNFEGIASMVVTPEEYLQFDSIV---SCYKFASLWCIGFQKAEDGLNILG 418


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 83/259 (32%), Positives = 119/259 (45%), Gaps = 25/259 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT + +G P   + V +DTGSD+ W+ C  C  C H     SG  +D  +Y P  SST
Sbjct: 85  LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPH----KSGLGLDLTLYDPKASST 140

Query: 163 SSKVPCNSTLCE--LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
            S V C+   C      + P  G+N  C Y V Y  DG+ + G  V D L     T + Q
Sbjct: 141 GSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTY-GDGSSTIGSFVTDALQFDQVTRDGQ 199

Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++  ++ + FGCG  Q G       A +G+ G G   TS+ S L   G +   F+ C  +
Sbjct: 200 TQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDT 259

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
             G G  S GD   P    TP  L    P YN+ +  + VGG  +           +   
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTP--LVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGT 317

Query: 326 IFDSGTSFTYLNDPAYTQI 344
           I DSGT+ TYL +  + ++
Sbjct: 318 IIDSGTTLTYLPELVFKEV 336


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/261 (33%), Positives = 124/261 (47%), Gaps = 30/261 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P  ++ + +DTGSDL W+ C  C+ C     + S   I    Y    S++
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC----PAFSDLKIPIVPYDVKASAS 90

Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           SSKVPC+   C L  Q   +G N    C Y  +Y  DG+ + G+LVEDVLH   +   + 
Sbjct: 91  SSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMVNATAT- 148

Query: 219 SVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
                + FGCG  Q+G       A +G+ G G    S  S LA QG  PN F+ C   G 
Sbjct: 149 -----VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------AI 326
            G G +  G+   P    TP     +H  YN+ +  +SV    +  +   FS       I
Sbjct: 204 RGGGILVLGNVIEPDIQYTPLVPYMSH--YNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261

Query: 327 FDSGTSFTYLNDPAYTQISET 347
           FDSGT+  YL D AY   ++ 
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQA 282


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 163/361 (45%), Gaps = 33/361 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P     V +DTGSD+ W+ C   SC +G   +SG  I  N + P +SSTS
Sbjct: 76  LYYTKVKLGTPPRELYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPGSSSTS 132

Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C    C    Q     C    + C Y  +Y  DG+ ++G+ V D++H A+  + + 
Sbjct: 133 SLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTL 191

Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           + +S  S  FGC  +QTG       A +G+FG G    SV S L++QG+ P  FS C   
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
           D  G G +  G+   P    +P  L  + P YN+ +  +SV G  V    S         
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNNRG 309

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+  YL + AY        ++  +   +  S      CY+++ +     +P V+
Sbjct: 310 TIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSR--GNQCYLITTSSNVDIFPQVS 367

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGL-YLYCLGVVKSDNVNIIGREYPIANNISLFHNCYS 443
           L   GG    +     ++     G   ++C+G  K     I G+   I  ++ L    + 
Sbjct: 368 LNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQK-----ISGQSITILGDLVLKDKIFV 422

Query: 444 Y 444
           Y
Sbjct: 423 Y 423


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/283 (31%), Positives = 126/283 (44%), Gaps = 26/283 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT V +G P   F V +DTGSD+ W+ C  C  C H     SG  +D  +Y P  SST
Sbjct: 87  LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPH----KSGLGLDLTLYDPKASST 142

Query: 163 SSKVPCNSTLCE--LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
            S V C+   C      + P   +N  C Y V Y  DG+ + G  V D L     T + Q
Sbjct: 143 GSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTY-GDGSSTVGSFVNDALQFDQVTGDGQ 201

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++  ++ + FGCG  Q G     + A +G+ G G   TS+ S LA  G +   F+ C  +
Sbjct: 202 TQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDT 261

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
             G G  + GD   P    TP  L    P YN+ +  + VGG  +           +   
Sbjct: 262 IKGGGIFAIGDVVQPKVKTTP--LVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRGT 319

Query: 326 IFDSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEY 367
           I DSGT+ TYL +  + ++    FN             L FEY
Sbjct: 320 IIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEY 362


>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
          Length = 313

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 116/222 (52%), Gaps = 18/222 (8%)

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           S SV +R+  GCG+ Q+G +LDG AP+GL GLG  + SVPS L+  GL+ NSFS+CF  +
Sbjct: 4   SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 63

Query: 277 GTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSF 333
            +GRI FGD G   Q  TPF       +  Y + +    +G + +    F+   DSG SF
Sbjct: 64  DSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 123

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           TYL +  Y ++     +L  ++   +TS     + +EYCY  S      + P + L    
Sbjct: 124 TYLPEEIYRKV-----ALEIDRHINATSKNFEGVSWEYCYESSAEP---KVPAIKLKFSH 175

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIGREY 429
              F ++ P+ +   + +GL  +CL +  S  + +  IG+ Y
Sbjct: 176 NNTFVIHKPLFVF-QQSQGLVQFCLPISPSGQEGIGSIGQNY 216


>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1336

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/351 (28%), Positives = 156/351 (44%), Gaps = 52/351 (14%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L++T + VG P  S+ + +DTGSDL W+ CD  C SC  G +           Y P  S+
Sbjct: 193 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQ---------YKPTRSN 243

Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             S V  +S   ++QK   +   +     C Y+++Y +D + S G LV D LHL T    
Sbjct: 244 VVSSV--DSLCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTTNGS 300

Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
              ++  + FGCG  Q G  L+  A  +G+ GL   K S+P  LA++GLI N    C  +
Sbjct: 301 KTKLN--VVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSN 358

Query: 276 DGT--GRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----I 326
           DG   G +  GD   P  G    P +   T   Y   I  ++ G   + F+  +      
Sbjct: 359 DGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVF 418

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN-- 384
           FDSG+S+TY    AY  +  + N ++        SD     C+     Q NF+   +   
Sbjct: 419 FDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICW-----QANFQIRSIKDV 473

Query: 385 ------LTMKGGGPFFVNDPIVIVSSEPKGLYL------YCLGVVKSDNVN 423
                 LT++ G  +++   +  +   P+G  +       CLG++    VN
Sbjct: 474 KDYFKTLTLRFGSKWWILSTLFQIP--PEGYLIISNKGHVCLGILDGSKVN 522


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 91/285 (31%), Positives = 127/285 (44%), Gaps = 34/285 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  + +G PA  + + +DTGSDL WL CD  C SC  G +          +Y P  + 
Sbjct: 30  LYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPH---------GLYDPKRAR 80

Query: 162 TSSKVPCNSTLC-ELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
               V C    C ++Q+     C      C Y+V Y+ DG+ + G LVED + L      
Sbjct: 81  V---VDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYV-DGSSTMGILVEDTITLVL--TN 134

Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
                +R   GCG  Q G+     A  +G+ GL   K S+PS LA +G+  N    C   
Sbjct: 135 GTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAG 194

Query: 274 GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------A 325
           GS+G G + FGD   P  G   TP   R     Y   +  +  GG  +  E +      A
Sbjct: 195 GSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGA 254

Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCY 369
           +FDSGTSFTYL   AYT + S       +   E   +D    +C+
Sbjct: 255 MFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCW 299


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 118/401 (29%), Positives = 190/401 (47%), Gaps = 43/401 (10%)

Query: 51  LPKKGSFAYYSALAHRDRYFRLRG-RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNV 109
           +P  G     +AL  RDR    R  RG+A    D     FS    T   NS+G L+YT V
Sbjct: 30  IPPTGHRVEVAALKARDRARHARMLRGVAGGVVD-----FSV-QGTSDPNSVG-LYYTKV 82

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
            +G P   F V +DTGSD+ W+ C+ C +C      SS   I+ N +    SST++ +PC
Sbjct: 83  KMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQ----SSQLGIELNFFDTVGSSTAALIPC 138

Query: 169 NSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           +  +C  + Q     C    + C Y  +Y  DG+ ++G+ V D ++ +    Q  +V+S 
Sbjct: 139 SDPICTSRVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFSLIMGQPPAVNSS 197

Query: 224 --ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGT 278
             I FGC   Q+G       A +G+FG G    SV S L+++G+ P  FS C     DG 
Sbjct: 198 ATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGG 257

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS-----AIFD 328
           G +  G+   P    +P  L  + P YN+ +  ++V G     N   F  S      I D
Sbjct: 258 GVLVLGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRGGTIVD 315

Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
            GT+  YL   AY  +    N ++++  R+T++       CY++S +  +  +P V+L  
Sbjct: 316 CGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDI-FPSVSLNF 371

Query: 388 KGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
           +GG    +  +  ++ +    G  ++C+G  K  +  +I+G
Sbjct: 372 EGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILG 412


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 88/261 (33%), Positives = 123/261 (47%), Gaps = 30/261 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P  ++ + +DTGSDL W+ C  C+ C     + S   I    Y    S++
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC----PAFSDLKIPIVPYDVKASAS 90

Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           SSKVPC+   C L  Q   +G N    C Y  +Y  DG+ + G+LVEDVLH   +   + 
Sbjct: 91  SSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMVNATAT- 148

Query: 219 SVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
                + FGCG  Q+G       A +G+ G G    S  S LA QG  PN F+ C   G 
Sbjct: 149 -----VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------AI 326
            G G +  G+   P    TP      H  YN+ +  +SV    +  +   FS       I
Sbjct: 204 RGGGILVLGNVIEPDIQYTPLVPYMYH--YNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261

Query: 327 FDSGTSFTYLNDPAYTQISET 347
           FDSGT+  YL D AY   ++ 
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQA 282


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 166/374 (44%), Gaps = 53/374 (14%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G F F+  H+++   K    + +L    SF +   LA+ D                  PL
Sbjct: 26  GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 65

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
               G D+ R +S+G L++T + +G P   + V +DTGSD+ W+ C  C  C   + +  
Sbjct: 66  ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 117

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
           G  I  ++Y    SSTS  V C    C    Q  + G+   C Y V Y  DG+ S G  V
Sbjct: 118 G--IPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFV 174

Query: 205 EDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
           +D + L   T   ++  +   + FGCG+ Q+G      +A +G+ G G   TSV S LA 
Sbjct: 175 KDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAA 234

Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
            G +   FS C  + +G G  + G+  SP    TP    Q H  YN+ +  + V G  ++
Sbjct: 235 GGSVKRIFSHCLDNMNGGGIFAIGEVESPVVKTTPLVPNQVH--YNVILKGMDVDGEPID 292

Query: 321 F---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
                     +   I DSGT+  YL    Y  + E     AK++ +       F  C+  
Sbjct: 293 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFA-CFSF 349

Query: 372 SPNQTNFEYPVVNL 385
           + N T+  +PVVNL
Sbjct: 350 TSN-TDKAFPVVNL 362


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 137/316 (43%), Gaps = 42/316 (13%)

Query: 51  LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGF-----LH 105
            P+ GS       AH       RGR LAA      PL             LG      L+
Sbjct: 38  FPRLGSKGGGDITAHLTHDSNRRGRLLAAA---DVPL-----------GGLGLPTDTGLY 83

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           YT + +G P   + V +DTGSD+ W+  +C+SC +     S   ID  +Y P  SS+ S 
Sbjct: 84  YTEIEIGTPPKQYHVQVDTGSDILWV--NCISC-NKCPRKSDLGIDLRLYDPKGSSSGST 140

Query: 166 VPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKS 219
           V C+   C      + P    N  C Y V Y  DG+ +TG+ V D L     + + Q++ 
Sbjct: 141 VSCDQKFCAATYGGKLPGCAKNIPCEYSVMY-GDGSSTTGYFVSDSLQYNQVSGDGQTRH 199

Query: 220 VDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DG 277
            ++ + FGCG  Q G       A +G+ G G   TS+ S LA  G +   FS C  +  G
Sbjct: 200 ANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKG 259

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFD 328
            G  + GD   P    TP  L    P YN+ +  ++VGG  +           +   I D
Sbjct: 260 GGIFAIGDVVQPKVKSTP--LVPDMPHYNVNLESINVGGTTLQLPSHMFETGEKKGTIID 317

Query: 329 SGTSFTYLNDPAYTQI 344
           SGT+ TYL +  Y  +
Sbjct: 318 SGTTLTYLPELVYKDV 333


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 92/283 (32%), Positives = 130/283 (45%), Gaps = 26/283 (9%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +YT + +G P   F V +DTGSD+ W+  +CVSC     + SG  ID  +Y P  SS+ S
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWV--NCVSC-DKCPTKSGLGIDLALYDPKGSSSGS 143

Query: 165 KVPCNSTLCELQ----KQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
            V C++  C       ++ P  +AG  C Y+  Y  DG+ + G  V D L     +   Q
Sbjct: 144 AVSCDNKFCAATYGSGEKLPGCTAGKPCEYRAEY-GDGSSTAGSFVSDSLQYNQLSGNAQ 202

Query: 217 SKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++   + + FGCG  Q G       A +G+ G G   TS  S LA+ G +   FS C  +
Sbjct: 203 TRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDT 262

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS----A 325
             G G  + G+   P    TP     +H  YN+ +  + V GNA+      FE S     
Sbjct: 263 IKGGGIFAIGEVVQPKVKSTPLLPNMSH--YNVNLQSIDVAGNALQLPPHIFETSEKRGT 320

Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEY 367
           I DSGT+ TYL +  Y  I +  F         T    L FEY
Sbjct: 321 IIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEY 363


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 166/374 (44%), Gaps = 53/374 (14%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G F F+  H+++   K    + +L    SF +   LA+ D                  PL
Sbjct: 27  GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 66

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
               G D+ R +S+G L++T + +G P   + V +DTGSD+ W+ C  C  C   + +  
Sbjct: 67  ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 118

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
           G  I  ++Y   TSSTS  V C    C    Q  + G+   C Y V Y  DG+ S G  +
Sbjct: 119 G--IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFI 175

Query: 205 ED--VLHLATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
           +D   L   T   ++  +   + FGCG+ Q+G      +A +G+ G G   TS+ S LA 
Sbjct: 176 KDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 235

Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
            G     FS C  + +G G  + G+  SP    TP    Q H  YN+ +  + V G+ + 
Sbjct: 236 GGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDGDPID 293

Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
                   N +   I DSGT+  YL    Y  + E     AK++ +       F  C+  
Sbjct: 294 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFA-CFSF 350

Query: 372 SPNQTNFEYPVVNL 385
           + N T+  +PVVNL
Sbjct: 351 TSN-TDKAFPVVNL 363


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 166/374 (44%), Gaps = 53/374 (14%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G F F+  H+++   K    + +L    SF +   LA+ D                  PL
Sbjct: 23  GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 62

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
               G D+ R +S+G L++T + +G P   + V +DTGSD+ W+ C  C  C   + +  
Sbjct: 63  ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 114

Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
           G  I  ++Y   TSSTS  V C    C    Q  + G+   C Y V Y  DG+ S G  +
Sbjct: 115 G--IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFI 171

Query: 205 ED--VLHLATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
           +D   L   T   ++  +   + FGCG+ Q+G      +A +G+ G G   TS+ S LA 
Sbjct: 172 KDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 231

Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
            G     FS C  + +G G  + G+  SP    TP    Q H  YN+ +  + V G+ + 
Sbjct: 232 GGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDGDPID 289

Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
                   N +   I DSGT+  YL    Y  + E     AK++ +       F  C+  
Sbjct: 290 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFA-CFSF 346

Query: 372 SPNQTNFEYPVVNL 385
           + N T+  +PVVNL
Sbjct: 347 TSN-TDKAFPVVNL 359


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 148/315 (46%), Gaps = 31/315 (9%)

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D YR+     L++T V +G P   F V +DTGSD+ W+ C   SC +G   SSG  I  N
Sbjct: 76  DPYRVG----LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCG--SC-NGCPQSSGLHIPLN 128

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
            + P +SST+S + C+   C L  Q     C S G+ C Y  +Y  DG+ ++G+ V D+L
Sbjct: 129 FFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQY-GDGSGTSGYYVSDLL 187

Query: 209 HL-ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
           +  A       +  + I FGC   QTG       A +G+FG G    SV S +++QG+ P
Sbjct: 188 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 247

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN--AVNFEFS 324
             FS C   DG G           +      L  + P YN+ +  +SV G   A++ E  
Sbjct: 248 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVF 307

Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE--YCYVLSPNQ 375
           A       I DSGT+  YL + AY    + F S   E    S   L  +   CY+++ + 
Sbjct: 308 ATSTNRGTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSV 363

Query: 376 TNFEYPVVNLTMKGG 390
               +P V+L   GG
Sbjct: 364 KGI-FPTVSLNFAGG 377


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 148/315 (46%), Gaps = 31/315 (9%)

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
           D YR+     L++T V +G P   F V +DTGSD+ W+ C   SC +G   SSG  I  N
Sbjct: 61  DPYRVG----LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCG--SC-NGCPQSSGLHIPLN 113

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
            + P +SST+S + C+   C L  Q     C S G+ C Y  +Y  DG+ ++G+ V D+L
Sbjct: 114 FFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQY-GDGSGTSGYYVSDLL 172

Query: 209 HL-ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
           +  A       +  + I FGC   QTG       A +G+FG G    SV S +++QG+ P
Sbjct: 173 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 232

Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN--AVNFEFS 324
             FS C   DG G           +      L  + P YN+ +  +SV G   A++ E  
Sbjct: 233 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVF 292

Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE--YCYVLSPNQ 375
           A       I DSGT+  YL + AY    + F S   E    S   L  +   CY+++ + 
Sbjct: 293 ATSTNRGTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSV 348

Query: 376 TNFEYPVVNLTMKGG 390
               +P V+L   GG
Sbjct: 349 KGI-FPTVSLNFAGG 362


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 172/378 (45%), Gaps = 60/378 (15%)

Query: 90  SAGNDTYRLNSLG-----FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGL 142
           S GN + R +  G      L+Y  + +G P   + + +DTGSDL W  CD  C +C  G 
Sbjct: 20  SVGNHSVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGP 79

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQK----QCPSAGSNCPYQVRYLSDGT 197
           +          +Y+P  +     V C+  +C ++Q+    +C S    C Y+V Y +DG+
Sbjct: 80  H---------GLYNPKKAKV---VDCHLPVCAQIQQGGSYECNSDVKQCDYEVEY-ADGS 126

Query: 198 MSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVP 256
            + G LVED L +         + ++   GCG  Q G+     A+ +G+ GL   K ++P
Sbjct: 127 STMGVLVEDTLTVRL--TNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALP 184

Query: 257 SILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQV 312
           + LA +G+I N    C   GS+G G + FGD+  P  G   TP   +     Y   +  +
Sbjct: 185 AQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSI 244

Query: 313 SVGGNAVNFE---------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
             GG+++             S +FDSGTSFTYL   AY  +       +   R  S + L
Sbjct: 245 RYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTL 304

Query: 364 PFEYCYV-LSPNQ--TNFEYPVVNLTMK-GGGPFFVNDPIVIVSSEPKGLYL------YC 413
           P  YC+   SP Q  T+       LT+  GG  +F  D  + +S  P+G  +       C
Sbjct: 305 P--YCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLS--PQGYLIVSTQGNVC 360

Query: 414 LGVVKS-----DNVNIIG 426
           LG++ +     +  NIIG
Sbjct: 361 LGILDASGASLEVTNIIG 378


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 149/310 (48%), Gaps = 37/310 (11%)

Query: 68  RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
           RY RL+G   A  + +D+  LT  AG D     T R +  G L+Y  + +G PA S+ V 
Sbjct: 38  RYPRLQGSLSALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96

Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
           +DTGSD+ W+ C  C  C     S+ G  I+  +Y+ + S +   V C+   C      P
Sbjct: 97  VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152

Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
            +G     +CPY   Y  DG+ + G+ V+DV+    +A D K +++ +  + FGCG  Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210

Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
           G  LD +   A +G+ G G   +S+ S LA+ G +   F+ C  G +G G  + G    P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269

Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
               TP    Q H  YN+ +T V VG   +N             AI DSGT+  YL +  
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSGTTLAYLPEII 327

Query: 341 YTQISETFNS 350
           Y  + +   S
Sbjct: 328 YEPLVKKITS 337


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/395 (26%), Positives = 173/395 (43%), Gaps = 43/395 (10%)

Query: 63  LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL------NSLGF-LHYTNVSVGQPA 115
           L HR     LR R     G  +       G   +R+      ++LG+ L+ T V +G P 
Sbjct: 37  LNHRVEIDTLRARDRVRHG--RILRASVGGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPP 94

Query: 116 LSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE 174
             F V +DTGSD+ W+ C+ C +C      SSG  I+ N +    SST++ VPC+  +C 
Sbjct: 95  REFTVQIDTGSDILWINCNTCSNC----PKSSGLGIELNFFDTVGSSTAALVPCSDPMCA 150

Query: 175 -----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD----SRIS 225
                   QC    + C Y  +Y  DG+ ++G  V D ++      QS   +    + I 
Sbjct: 151 SAIQGAAAQCSPQVNQCSYTFQY-EDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIV 209

Query: 226 FGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--IS 282
           FGC   Q+G       A +G+ G G  + SV S L+++G+ P  FS C   DG G   + 
Sbjct: 210 FGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGGGILV 269

Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSF 333
            G+   P    +P  L  + P YN+ +  ++V G  ++          +   I DSGT+ 
Sbjct: 270 LGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKRGTIIDSGTTL 327

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           +YL   AY  +    ++   +   +  S      CY L     +  +P V+   +GG   
Sbjct: 328 SYLVQEAYDPLVNAVDTAVSQFATSFISK--GSQCY-LVLTSIDDSFPTVSFNFEGGASM 384

Query: 394 FVNDPIVIVSSE-PKGLYLYCLGVVK-SDNVNIIG 426
            +     +++     G  ++C+G  K  + V I+G
Sbjct: 385 DLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILG 419


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/298 (32%), Positives = 140/298 (46%), Gaps = 32/298 (10%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   V +G P   F +  DTGSDL W  C+   C           +D     P  S++  
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCE--PCAKTCYKQKEPRLD-----PTKSTSYK 185

Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            + C+S  C+L      + C S    C YQV+Y  DG+ S GF   + L L+     S +
Sbjct: 186 NISCSSAFCKLLDTEGGESCSSP--TCLYQVQY-GDGSYSIGFFATETLTLS-----SSN 237

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           V     FGCG+  +G F  GAA  GL GLG  K S+PS  A +     S+ +   S   G
Sbjct: 238 VFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKG 294

Query: 280 RISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTS 332
            +SFG + S     TP S   ++ P Y + IT++SVGGN ++ + S       + DSGT 
Sbjct: 295 YLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTV 354

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            T L   AY+ +S  F  L  +   T    + F+ CY  S N+T  + P V ++ KGG
Sbjct: 355 ITRLPSTAYSALSSAFQKLMTDYPSTDGYSI-FDTCYDFSKNET-IKIPKVGVSFKGG 410


>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
 gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
          Length = 410

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 158/357 (44%), Gaps = 52/357 (14%)

Query: 100 SLGFLHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIY 155
            +G L+YT + VG+P     + + +DTGS+L W+ CD  C SC  G N          +Y
Sbjct: 25  QMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN---------QLY 75

Query: 156 SP---NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
            P   N   +S          +L + C +    C Y++ Y +D + S G L +D  HL  
Sbjct: 76  KPRKDNLVRSSEAFCVEVQRNQLTEHCENC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL 133

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                   +S I FGCG  Q G  L+     +G+ GL   K S+PS LA++G+I N    
Sbjct: 134 --HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGH 191

Query: 272 CFGSD--GTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
           C  SD  G G I  G    P  G T  P         Y + +T++S G   ++ +     
Sbjct: 192 CLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGR 251

Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               +FD+G+S+TY  + AY+Q+  +   ++  +     SD     C+     +TNF + 
Sbjct: 252 VGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICW---RAKTNFPFS 308

Query: 382 VVN--------LTMKGGGPFFVNDPIVIVSSEPKGLYL-------YCLGVVKSDNVN 423
            ++        +T++ G  + +    +++  E    YL        CLG++   +V+
Sbjct: 309 SLSDVKKFFRPITLQIGSKWLIISRKLLIQPED---YLIISNKGNVCLGILDGSSVH 362


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 99/354 (27%), Positives = 154/354 (43%), Gaps = 54/354 (15%)

Query: 104 LHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSP-- 157
           L+YT + VG+P     + + +DTGSDL W+ CD  C SC  G N          +Y P  
Sbjct: 197 LYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGAN---------QLYKPRK 247

Query: 158 -NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            N   +S          +L + C S    C Y++ Y +D + S G L +D  HL      
Sbjct: 248 DNLVRSSEPFCVEVQRNQLTEHCESC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL--HN 303

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
               +S I FGCG  Q G  L+     +G+ GL   K S+PS LA++G+I N    C  S
Sbjct: 304 GSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 363

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHP---TYNITITQVSVGGNAVNFEFS------ 324
           D  G G I  G    P  G T   +   HP    Y + +T++S G   ++ +        
Sbjct: 364 DLNGEGYIFMGSDLVPSHGMTWVPMLH-HPHLEVYQMQVTKMSYGNAMLSLDGENGRVGK 422

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ--------T 376
            +FD+G+S+TY  + AY+Q+  +   ++  +     SD     C+    N          
Sbjct: 423 VLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDVK 482

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL-------YCLGVVKSDNVN 423
            F  P+   T++ G  + +    +++  E    YL        CLG++   NV+
Sbjct: 483 KFFRPI---TLQIGSKWLIISKKLLIQPED---YLIISNKGNVCLGILDGSNVH 530


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 160/344 (46%), Gaps = 30/344 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L++T V +G P + F V +DTGSD+ W+ C+  SC +G   SSG  I  N +  ++SS+S
Sbjct: 78  LYFTKVKLGTPPMEFTVQIDTGSDILWVNCN--SC-NGCPRSSGLGIQLNFFDASSSSSS 134

Query: 164 SKVP-----CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S V      CNS       QC +  + C Y  +Y  DG+ ++G+ V + ++      QS 
Sbjct: 135 SLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQY-GDGSGTSGYYVSESMYFDMVMGQSM 193

Query: 219 SVDSRIS--FGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
             +S  S  FGC   Q+G       A +G+FG G    SV S L+ +G+ P  FS C   
Sbjct: 194 IANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKG 253

Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFS 324
           +G   G +  G+   PG   +P  L  + P YN+ +  +SV G          A +    
Sbjct: 254 EGNGGGILVLGEVLEPGIVYSP--LVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRG 311

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+  YL + AYT       +   +    + S      CY++S +     +P+V+
Sbjct: 312 TIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISK--GNQCYLVSTSVGEI-FPLVS 368

Query: 385 LTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
           L   G     +  +  ++      G  L+C+G  K  + V I+G
Sbjct: 369 LNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILG 412


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 156/345 (45%), Gaps = 32/345 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T V +G P   F V +DTGSD+ W+ C+ C +C      +SG  I  N +  ++SST
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPR----TSGLGIQLNFFDSSSSST 120

Query: 163 SSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           +  V C+  +C         QC    + C Y  +Y  DG+ ++G+ V D L+      +S
Sbjct: 121 AGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQY-EDGSGTSGYYVSDTLYFDAILGES 179

Query: 218 KSVDSR--ISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
             V+S   I FGC   Q+G   +   A +G+FG G  + SV S L+  G+ P  FS C  
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239

Query: 275 SD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
            +  G G +  G+   PG   +P  L  + P YN+ +  ++V G  +  + S        
Sbjct: 240 GEGIGGGILVLGEILEPGMVYSP--LVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  YL   AY       N +         S      CY++S + +   +P+ 
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISK--GNQCYLVSTSVSQM-FPLA 354

Query: 384 NLTMKGGGPFFVN--DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           +    GG    +   D ++       G  ++C+G  K   V I+G
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILG 399


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 82/265 (30%), Positives = 124/265 (46%), Gaps = 24/265 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L++T + +G P   + V +DTGSD+ W+  +C+SC       SG  +D   Y P  SS+ 
Sbjct: 83  LYFTEIKLGTPPKRYYVQVDTGSDILWV--NCISC-EKCPRKSGLGLDLTFYDPKASSSG 139

Query: 164 SKVPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
           S V C+   C      + P   +N  C Y V Y  DG+ +TGF V D L     T + Q+
Sbjct: 140 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMY-GDGSSTTGFFVTDALQFDQVTGDGQT 198

Query: 218 KSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
           +  ++ ++FGCG  Q G       A +G+ G G   TS+ S LA  G +   F+ C  + 
Sbjct: 199 QPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTI 258

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAI 326
            G G  + G+   P    TP  L    P YN+ +  + VGG  +               I
Sbjct: 259 KGGGIFAIGNVVQPKVKTTP--LVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTI 316

Query: 327 FDSGTSFTYLNDPAYTQI-SETFNS 350
            DSGT+ TYL +  + ++ +  FN 
Sbjct: 317 IDSGTTLTYLPELVFKEVMAAIFNK 341


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 108/405 (26%), Positives = 172/405 (42%), Gaps = 39/405 (9%)

Query: 59  YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           ++  L   DR     GR L    N     T     D Y    +  L+YT + +G P   F
Sbjct: 5   HFEMLKAHDR--ARHGRSL----NTIVDFTLQGTADPY----VAGLYYTRIELGTPPRPF 54

Query: 119 IVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC---- 173
            V +DTGSD+ W+ C  C +C      +SG  +  N + P  SST+S + C  + C    
Sbjct: 55  YVQIDTGSDILWVNCKPCNACPL----TSGLGVALNFFDPRGSSTASPLSCIDSKCVSSN 110

Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRV 231
           ++ +   +    C Y   Y  DG+ + G+ V D    +   ++  + +  ++I+FGC   
Sbjct: 111 QISESVCTTDRYCGYSFEY-GDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYN 169

Query: 232 QTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD-GTGRISFGDKGS 288
           Q+G       A +G+FG G +  SV S L +QGL P  FS C  G+D G G +  G+   
Sbjct: 170 QSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGILVLGEITE 229

Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---------FSAIFDSGTSFTYLNDP 339
           PG   TP    Q H  YN+ +  ++V G  ++ +            I D GT+  YL + 
Sbjct: 230 PGMVYTPIVPSQPH--YNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLAYLAEE 287

Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI 399
           AY     T   +A   + T    L    C+ L+ +  +  +P V L  +G          
Sbjct: 288 AYEPFVNTI--IAAVSQSTQPFMLKGNPCF-LTVHSIDEIFPSVTLYFEGAPMDLKPKDY 344

Query: 400 VIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
           +I    P    ++C+G  KS        +  I  ++ L    + Y
Sbjct: 345 LIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVY 389


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 132/278 (47%), Gaps = 30/278 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTS 160
           L+YT +S+G P   + + +DTGS   W+ CD   C SC  G +          +Y P  +
Sbjct: 159 LYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHP---------LYRP--A 207

Query: 161 STSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            T+  +P +  LCE  Q + P   + C Y++ Y +DG+ S G  V D +    ++ + ++
Sbjct: 208 RTADALPASDPLCEGAQHENP---NQCDYEISY-ADGSSSMGVYVRDSMQFVGEDGEREN 263

Query: 220 VDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
            D  I FGCG  Q G  L+     +G+ GL     S+P+ LA++G+I N+F  C  +D +
Sbjct: 264 AD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPS 321

Query: 279 GR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE---FSAIFDSG 330
           G    +  GD   P  G T   +R           + Q++ G   +N +      +FD+G
Sbjct: 322 GAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTG 381

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
           +++TY  D A T++  +    A  +     SD    +C
Sbjct: 382 STYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFC 419


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 148/310 (47%), Gaps = 37/310 (11%)

Query: 68  RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
           RY RL+G   A  + +D+  LT  AG D     T R +  G L+Y  + +G PA S+ V 
Sbjct: 38  RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96

Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
           +DTGSD+ W+ C  C  C     S+ G  I+  +Y+ + S +   V C+   C      P
Sbjct: 97  VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152

Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
            +G     +CPY   Y  DG+ + G+ V+DV+    +A D K +++ +  + FGCG  Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210

Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
           G  LD +   A +G+ G G   +S+ S LA+ G +   F+ C  G +G G  + G    P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269

Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
               TP    Q H  YN+ +T V VG   +              AI DSGT+  YL +  
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEII 327

Query: 341 YTQISETFNS 350
           Y  + +   S
Sbjct: 328 YEPLVKKITS 337


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 107/460 (23%), Positives = 185/460 (40%), Gaps = 63/460 (13%)

Query: 6   RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
           R + V  L++++      C   G + F+  H+++   + + A+     +      SA+  
Sbjct: 9   RLATVLSLVVIVELGFVVCLSNGNYVFNVQHKFAGKERSLSALKQHDARRHRRILSAVD- 67

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
                 L G G  A+                       L++  + +G P   + V +DTG
Sbjct: 68  ----LPLGGNGHPAEAG---------------------LYFAKIGLGNPPKDYYVQVDTG 102

Query: 126 SDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
           SD+ W+ C +C  C     + S   +   +Y P +S++++++ C+   C         G 
Sbjct: 103 SDILWVNCANCDKC----PTKSDLGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQGC 158

Query: 185 N----CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-L 237
                C Y V Y  DG+ + GF V+D L     T   Q+ S +  + FGCG  Q+G    
Sbjct: 159 TKDLPCQYSVVY-GDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGT 217

Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPF 296
              A +G+ G G   +S+ S LA  G +   F+ C  +  G G  + G+  SP    TP 
Sbjct: 218 SSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGIFAIGEVVSPKVNTTPM 277

Query: 297 SLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQISET 347
              Q H  YN+ + ++ VGGN +               I DSGT+  YL +  Y  +   
Sbjct: 278 VPNQPH--YNVVMKEIEVGGNVLELPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESM--- 332

Query: 348 FNSLAKEKRETSTSDLPFEY-CYVLSPNQTNFEYPVVNLTMKGGGPFFVN--DPIVIVSS 404
              +  E+       +  ++ C+  + N  N  +PVV     G     VN  D +  +  
Sbjct: 333 MTKIVSEQPGLKLHTVEEQFTCFQYTGN-VNEGFPVVKFHFNGSLSLTVNPHDYLFQIHE 391

Query: 405 EPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
           E     ++C G   S   +  GR+  +  ++ L +    Y
Sbjct: 392 E-----VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLY 426


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 127/283 (44%), Gaps = 32/283 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L++T++ VG P   + + +DTGSDL W+ CD  C SC  G N          +Y P   +
Sbjct: 100 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP---------LYKPKKGN 150

Query: 162 TSSKVPCNSTLC-ELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
               VP   +LC E+Q+   +        C Y++ Y +D + S G L  D LHL      
Sbjct: 151 L---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGS 206

Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
              +   I FGC   Q G  L+  A  +G+ GL   K S+PS LA+Q +I N    C  S
Sbjct: 207 LTKLG--IMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 264

Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTH-PTYNITITQVSVGGNAVNF------EFSAI 326
           D T  G +  GD   P  G     +  +H P Y+  I ++S G   ++           +
Sbjct: 265 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 324

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           FD+G+S+TY    AY  +  +   ++ E      SD     C+
Sbjct: 325 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCW 367


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 138/303 (45%), Gaps = 37/303 (12%)

Query: 73  RGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLP 132
           RGR L+A       + F+ G +   L ++  L++T + +G P+  + V +DTGSD+ W+ 
Sbjct: 46  RGRILSA-------VDFNLGGNG--LPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVN 96

Query: 133 C-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC----ELQKQCPSAGSNCP 187
           C +C  C       S   I   +Y P  S TS  V C    C    E +     A + CP
Sbjct: 97  CVECTRCPR----KSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCP 152

Query: 188 YQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRVQTGSFLDGA--APN 243
           Y + Y  DG+ +TG+ V+D L  +       + + +S I FGCG  Q+G+F   +  A +
Sbjct: 153 YSISY-GDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALD 211

Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-GTGRISFGDKGSPGQGETPFSLRQTH 302
           G+ G G   +SV S LA  G +   FS C  ++ G G  S G+   P    TP      H
Sbjct: 212 GIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAH 271

Query: 303 PTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
             YN+ +  + V G+ +               + DSGT+  YL    Y Q+      LAK
Sbjct: 272 --YNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKV--LAK 327

Query: 354 EKR 356
           + R
Sbjct: 328 QPR 330


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/332 (31%), Positives = 155/332 (46%), Gaps = 38/332 (11%)

Query: 82  NDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DC 135
           +D+  L   AG D       R + LG L+Y  + +G P   + V +DTGSD+ W+ C  C
Sbjct: 51  DDQRQLRILAGVDLPLGGIGRPDILG-LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQC 109

Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQ-KQCP--SAGSNCPYQVR 191
             C     SS G  ID  +Y+ N S T   VPC+   C E+   Q P  +A  +CPY   
Sbjct: 110 RECPK--TSSLG--IDLTLYNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEI 165

Query: 192 YLSDGTMSTGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFG 247
           Y  DG+ + G+ V+DV+  A  + + ++ + +  + FGCG  Q+G     +  A +G+ G
Sbjct: 166 Y-GDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILG 224

Query: 248 LGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN 306
            G   +S+ S LA  G +   F+ C  G++G G    G    P    TP    Q H  YN
Sbjct: 225 FGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGGIFVIGHVVQPKVNMTPLIPNQPH--YN 282

Query: 307 ITITQVSVGGNAVN-----FEF----SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
           + +T V VG   ++     FE      AI DSGT+  YL +  Y  +     S   + + 
Sbjct: 283 VNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKV 342

Query: 358 TSTSD--LPFEYCYVLS---PNQT-NFEYPVV 383
            +  D    F+Y   L    PN T +FE  V+
Sbjct: 343 HTVRDEYTCFQYSDSLDDGFPNVTFHFENSVI 374


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/311 (31%), Positives = 149/311 (47%), Gaps = 37/311 (11%)

Query: 68  RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
           RY RL+G   A  + +D+  LT  AG D     T R +  G L+Y  + +G PA S+ V 
Sbjct: 38  RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96

Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
           +DTGSD+ W+ C  C  C     S+ G  I+  +Y+ + S +   V C+   C      P
Sbjct: 97  VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152

Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
            +G     +CPY   Y  DG+ + G+ V+DV+    +A D K +++ +  + FGCG  Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210

Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
           G  LD +   A +G+ G G   +S+ S LA+ G +   F+ C  G +G G  + G    P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269

Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
               TP    Q H  YN+ +T V VG   +              AI DSGT+  YL +  
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEII 327

Query: 341 YTQISETFNSL 351
           Y  + +   +L
Sbjct: 328 YEPLVKKEPAL 338


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 127/283 (44%), Gaps = 32/283 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L++T++ VG P   + + +DTGSDL W+ CD  C SC  G N          +Y P   +
Sbjct: 313 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP---------LYKPKKGN 363

Query: 162 TSSKVPCNSTLC-ELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
               VP   +LC E+Q+   +        C Y++ Y +D + S G L  D LHL      
Sbjct: 364 L---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGS 419

Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
              +   I FGC   Q G  L+  A  +G+ GL   K S+PS LA+Q +I N    C  S
Sbjct: 420 LTKLG--IMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 477

Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTH-PTYNITITQVSVGGNAVNF------EFSAI 326
           D T  G +  GD   P  G     +  +H P Y+  I ++S G   ++           +
Sbjct: 478 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 537

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           FD+G+S+TY    AY  +  +   ++ E      SD     C+
Sbjct: 538 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCW 580


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/317 (29%), Positives = 142/317 (44%), Gaps = 32/317 (10%)

Query: 70  FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
           F  + R LAA + +D +  L   AG D     T R  ++G L+Y  + +G PA  + V +
Sbjct: 57  FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQV 115

Query: 123 DTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS 181
           DTGSD+ W+ C  C  C     SS G  ++  +Y    S T   V C+   C      P 
Sbjct: 116 DTGSDIMWVNCIQCNECPK--KSSLG--MELTLYDIKESLTGKLVSCDQDFCYAINGGPP 171

Query: 182 ----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGS 235
               A  +C Y   Y +DG+ S G+ V D++     + + ++ S +  + FGC   Q+G 
Sbjct: 172 SYCIANMSCSYTEIY-ADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGD 230

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGET 294
                A +G+ G G   TS+ S LA+ G +   F+ C  G +G G  + G    P    T
Sbjct: 231 LSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290

Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQIS 345
           P    QTH  YN+ +  V VGG  +N          +   I DSGT+  YL +  Y Q+ 
Sbjct: 291 PLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 348

Query: 346 ETFNSLAKEKRETSTSD 362
               S   + +  +  D
Sbjct: 349 SKIFSWQSDLKVHTIHD 365


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 117/260 (45%), Gaps = 26/260 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T + +G P   + V +DTGSD+ W+ C  C  C       S   ID  +Y P  S T
Sbjct: 69  LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPR----KSDLGIDLTLYDPKGSET 124

Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQ 216
           S  + C+   C      P  G      CPY + Y  DG+ +TG+ V+D L  +   D  +
Sbjct: 125 SELISCDQEFCSATYDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNHVNDNLR 183

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           +   +S I FGCG VQ+G+    +  A +G+ G G   +SV S LA  G +   FS C  
Sbjct: 184 TAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLD 243

Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
           +  G G  + G+   P    TP   R  H  YN+ +  + V  + +              
Sbjct: 244 NIRGGGIFAIGEVVEPKVSTTPLVPRMAH--YNVVLKSIEVDTDILQLPSDIFDSGNGKG 301

Query: 325 AIFDSGTSFTYLNDPAYTQI 344
            I DSGT+  YL    Y ++
Sbjct: 302 TIIDSGTTLAYLPAIVYDEL 321


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/340 (30%), Positives = 150/340 (44%), Gaps = 45/340 (13%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L++T V +G P   +IV +DTGSD+ W+ C   S   G    S   I   +Y P  SST+
Sbjct: 1   LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCS---GCPRKSALNIPLTMYDPRESSTT 57

Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
           S V C+  LC       + QC  A +NC Y   Y  DG+ S G+ V D +          
Sbjct: 58  SLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSY-GDGSTSEGYYVRDAMQYNVISSNGL 116

Query: 218 KSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
            +  S++ FGC   QTG       A +G+ G G  + SVP+ LA Q  IP  FS C   +
Sbjct: 117 ANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--E 174

Query: 277 GTGR----ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFSA---- 325
           G  R    +  G    PG   TP      H  YN+ +  +SV  N +     +FS+    
Sbjct: 175 GEKRGGGILVIGGIAEPGMTYTPLVPDSVH--YNVVLRGISVNSNRLPIDAEDFSSTNDT 232

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY------CYVLSPNQTN 377
             I DSGT+  Y    AY       N   +  RE +TS  P         C+++S   ++
Sbjct: 233 GVIMDSGTTLAYFPSGAY-------NVFVQAIRE-ATSATPVRVQGMDTQCFLVSGRLSD 284

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIV-SSEPKGLY-LYCLG 415
             +P V L  +GG      D  ++   + P G   ++C+G
Sbjct: 285 L-FPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIG 323


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/334 (29%), Positives = 152/334 (45%), Gaps = 32/334 (9%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           R++S+G L++T + +G P   + V +DTGSD+ W+ C  C  C    N +       +++
Sbjct: 67  RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLN----FRLSLF 121

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
             N SSTS KV C+   C    Q  S      C Y + Y +D + S G  + D+L L   
Sbjct: 122 DMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY-ADESTSDGKFIRDMLTLEQV 180

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
           T + ++  +   + FGCG  Q+G   +G +A +G+ G G   TSV S LA  G     FS
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFS 240

Query: 271 MCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
            C  +  G G  + G   SP    TP    Q H  YN+ +  + V G +++   S     
Sbjct: 241 HCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTSLDLPRSIVRNG 298

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  Y     Y  + ET   LA++  +    +  F+ C+  S N  +  +P V
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEETFQ-CFSFSTN-VDEAFPPV 354

Query: 384 NLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLG 415
           +   +      V  +D +  +  E     LYC G
Sbjct: 355 SFEFEDSVKLTVYPHDYLFTLEEE-----LYCFG 383


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/285 (29%), Positives = 129/285 (45%), Gaps = 35/285 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L++T + VG P   + + +DT SDL W+ CD  C SC  G N+         +Y P   +
Sbjct: 207 LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANA---------LYKPRRDN 257

Query: 162 TSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             +  P +S   EL +    AG       C Y++ Y +D + S G L  D LHL      
Sbjct: 258 IVT--PKDSLCVELHRN-QKAGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTM--AN 311

Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
             S + + +FGC   Q G  L+     +G+ GL   K S+PS LAN+G+I N    C  +
Sbjct: 312 GSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLAN 371

Query: 276 D--GTGRISFGDKGSP--GQGETPFSLRQTHPTYNITITQ-------VSVGGNAVNFEFS 324
           D  G G +  GD   P  G    P     +  +Y   I +       +S+GG        
Sbjct: 372 DVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVR-R 430

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
            +FDSG+S+TY    AY+++  +   ++ E     TSD    +C+
Sbjct: 431 IVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCW 475


>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 508

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 83/274 (30%), Positives = 132/274 (48%), Gaps = 25/274 (9%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++++G PA  + + +DTGS L W+ CD  C +C  G +       + NI  P  S  
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKE-NIVPPRDSHC 187

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             ++  N   C+  KQ       C Y++ Y +D + S G L  D + L T + + +++D 
Sbjct: 188 -QELQGNQNYCDTCKQ-------CDYEIAY-ADRSSSAGVLARDNMELITADGERENMD- 237

Query: 223 RISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTG 279
            + FGC   Q G  L   A+ +G+ GL     S+P+ LA QG+I N F  C  +D  G+ 
Sbjct: 238 -LVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSA 296

Query: 280 RISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDSGTS 332
            +  GD   P  G T   +R      Y+  + +V+ G   +N    A      IFDSG+S
Sbjct: 297 YMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSS 356

Query: 333 FTYLNDPAYTQISETFNSLAKE-KRETSTSDLPF 365
           +TY     YT +  +  +++    R+ S   LPF
Sbjct: 357 YTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPF 390


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 115/372 (30%), Positives = 163/372 (43%), Gaps = 55/372 (14%)

Query: 73  RGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLP 132
           RGR LA +G D     FS G     L+  G L++T V +G P   +IV +DTGSD+ W+ 
Sbjct: 5   RGRFLA-EGVD-----FSLGGTADPLS--GGLYFTQVGLGNPVKHYIVQVDTGSDVLWVN 56

Query: 133 CD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNC 186
           C  C  C       S   I   +Y P  SST+S V C+  LC       + QC    +NC
Sbjct: 57  CRPCSGCPR----KSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNC 112

Query: 187 PYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNG 244
            Y   Y  DG+ S G+ V D +           +  S++ FGC   QTG       A +G
Sbjct: 113 EYIFSY-GDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDG 171

Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR----ISFGDKGSPGQGETPFSLRQ 300
           + G G  + SVP+ LA Q  IP  FS C   +G  R    +  G    PG   TP     
Sbjct: 172 IIGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEKRGGGILVIGGIAEPGMTYTPLVPDS 229

Query: 301 THPTYNITITQVSVGGNAVNF---EFSA------IFDSGTSFTYLNDPAYTQISETFNSL 351
            H  YN+ +  +SV  N +     +FS+      I DSGT+  Y    AY       N  
Sbjct: 230 VH--YNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAY-------NVF 280

Query: 352 AKEKRETSTSDLPFEY------CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV-SS 404
            +  RE +TS  P         C+++S   ++  +P V L  +GG      D  ++   +
Sbjct: 281 VQAIRE-ATSATPVRVQGMDTQCFLVSGRLSDL-FPNVTLNFEGGAMELQPDNYLMWGGT 338

Query: 405 EPKGLY-LYCLG 415
            P G   ++C+G
Sbjct: 339 APTGTTDVWCIG 350


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/314 (30%), Positives = 144/314 (45%), Gaps = 33/314 (10%)

Query: 70  FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
           F  + R LAA + +D +  L   AG D     T R  ++G L+Y  + +G PA  + V +
Sbjct: 57  FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQV 115

Query: 123 DTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS 181
           DTGSD+ W+ C  C  C     SS G  ++  +Y    S T   V C+   C      P 
Sbjct: 116 DTGSDIMWVNCIQCNECPK--KSSLG--MELTLYDIKESLTGKLVSCDQDFCYAINGGPP 171

Query: 182 ----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGS 235
               A  +C Y   Y +DG+ S G+ V D++     + + ++ S +  + FGC   Q+G 
Sbjct: 172 SYCIANMSCSYTEIY-ADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGD 230

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGET 294
                A +G+ G G   TS+ S LA+ G +   F+ C  G +G G  + G    P    T
Sbjct: 231 LSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290

Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQ-I 344
           P    QTH  YN+ +  V VGG  +N          +   I DSGT+  YL +  Y Q +
Sbjct: 291 PLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 348

Query: 345 SETFNSLAKEKRET 358
           S+ F+  +  K  T
Sbjct: 349 SKIFSWQSDLKVHT 362


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 138/305 (45%), Gaps = 31/305 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T + +G P   + V +DTGSD+ W+ C +C  C       S   ID  +Y P  S T
Sbjct: 69  LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPR----KSDLGIDLTLYDPKGSET 124

Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQ 216
           S  V C+   C      P  G      CPY + Y  DG+ +TG+ V+D L  +      +
Sbjct: 125 SDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNRINGNLR 183

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           +   +S I FGCG VQ+G+    +  A +G+ G G   +SV S LA  G +   FS C  
Sbjct: 184 TSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD 243

Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
           +  G G  + G+   P    TP   R  H  YN+ +  + V  + +              
Sbjct: 244 NVRGGGIFAIGEVVEPKVSTTPLVPRMAH--YNVVLKSIEVDTDILQLPSDIFDSVNGKG 301

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            + DSGT+  YL D  Y ++ +    LA++   +    +  F  C++ + N  +  +PVV
Sbjct: 302 TVIDSGTTLAYLPDIVYDELIQKV--LARQPGLKLYLVEQQFR-CFLYTGN-VDRGFPVV 357

Query: 384 NLTMK 388
            L  K
Sbjct: 358 KLHFK 362


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 98/334 (29%), Positives = 152/334 (45%), Gaps = 32/334 (9%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           R++S+G L++T + +G P   + V +DTGSD+ W+ C  C  C    N +       +++
Sbjct: 67  RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLN----FRLSLF 121

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
             N SSTS KV C+   C    Q  S      C Y + Y +D + S G  + D+L L   
Sbjct: 122 DMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY-ADESTSDGKFIRDMLTLEQV 180

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
           T + ++  +   + FGCG  Q+G   +G +A +G+ G G   TSV S LA  G     FS
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFS 240

Query: 271 MCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
            C  +  G G  + G   SP    TP    Q H  YN+ +  + V G +++   S     
Sbjct: 241 HCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTSLDLPRSIVRNG 298

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+  Y     Y  + ET   LA++  +    +  F+ C+  S N  +  +P V
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEETFQ-CFSFSTN-VDEAFPPV 354

Query: 384 NLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLG 415
           +   +      V  +D +  +  E     LYC G
Sbjct: 355 SFEFEDSVKLTVYPHDYLFTLEEE-----LYCFG 383


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 156/343 (45%), Gaps = 36/343 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P   F V +DTGSD+ W+   C SC +G   +S   I  + + P  SS++
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 139

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           S V C+   C    Q  S  S    C Y  +Y  DG+ ++GF + D +   T    + ++
Sbjct: 140 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGFYISDFMSFDTVITSTLAI 198

Query: 221 DSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
           +S     FGC  +QTG       A +G+FGLG    SV S LA QGL P  FS C   D 
Sbjct: 199 NSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258

Query: 277 -GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
            G G +  G    P    TP  L  + P YN+ +  ++V G  +  + S          I
Sbjct: 259 SGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316

Query: 327 FDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEY----CYVLSPNQTNFEYP 381
            D+GT+  YL D AY+  I    N++++  R       P  Y    C+ ++    +  +P
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAIANAVSQYGR-------PITYESYQCFEITAGDVDV-FP 368

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
            V+L+  GG    +     +      G  ++C+G  +  +  I
Sbjct: 369 EVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRI 411


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 113/407 (27%), Positives = 170/407 (41%), Gaps = 59/407 (14%)

Query: 25  FGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH-RDRYFRLRGRGLAAQGND 83
           F  G F F   H+++   K                   L H +    R   R LA+    
Sbjct: 20  FASGNFVFKVQHKFAGKEK------------------KLEHFKSHDTRRHSRMLAS---- 57

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
              +    G D+ R++S+G L++T + +G P   + V +DTGSD+ W+ C  C  C    
Sbjct: 58  ---IDLPLGGDS-RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKT 112

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMST 200
           N +       +++  N SSTS KV C+   C    Q  S      C Y + Y +D + S 
Sbjct: 113 NLN----FHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVY-ADESTSE 167

Query: 201 GFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPS 257
           G  + D L L   T + Q+  +   + FGCG  Q+G      +A +G+ G G   TSV S
Sbjct: 168 GNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLS 227

Query: 258 ILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG 316
            LA  G     FS C  +  G G  + G   SP    TP    Q H  YN+ +  + V G
Sbjct: 228 QLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDG 285

Query: 317 NAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
            A++   S       I DSGT+  Y     Y  + ET   LA++  +    +  F+ C+ 
Sbjct: 286 TALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEDTFQ-CFS 342

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLG 415
            S N  +  +P V+   +      V  +D +  +  E     LYC G
Sbjct: 343 FSEN-VDVAFPPVSFEFEDSVKLTVYPHDYLFTLEKE-----LYCFG 383


>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
          Length = 538

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 131/277 (47%), Gaps = 31/277 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ +G P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPEKPNV 209

Query: 163 SSKVPCNSTLC-ELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
              VP   + C ELQ  +        C Y++ Y +D + S G L  D + L T + + ++
Sbjct: 210 ---VPPRDSYCQELQGNQNYGDTSKQCDYEITY-ADRSSSMGILARDNMQLITADGEREN 265

Query: 220 VDSRISFGCGRVQTGSFLDGAA-PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           +D    FGCG  Q G+ L   A  +G+ GL     S+P+ LA+QG+I N F  C  +D +
Sbjct: 266 LD--FVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323

Query: 279 --GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDS 329
             G +  GD   P  G T   +R      Y+  + +V+ G   +N    A      IFDS
Sbjct: 324 NGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDS 383

Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPF 365
           G+S+TYL    YT  I+   +      ++ S   LPF
Sbjct: 384 GSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPF 420


>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 425

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 123/283 (43%), Gaps = 32/283 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYSPN 158
           L+  ++++G P   + + +DTGSDL W+ CD     C  C    +          +Y PN
Sbjct: 61  LYTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKD---------KLYKPN 111

Query: 159 TSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
                  V C+  +C        L + C      C Y V+Y +D   + G LV D +H+ 
Sbjct: 112 GKQV---VKCSDPICVATQSTHVLGQICSKQSPPCVYNVQY-ADHASTLGVLVRDYMHIG 167

Query: 212 TDEKQSKSVDSRISFGCGRVQ--TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
           +    +K  D  ++FGCG  Q  +G     + P G+ GLG  KTS+ S L + G I N  
Sbjct: 168 SPSSSTK--DPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVL 225

Query: 270 SMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAI 326
             C  ++G G +  GDK  P  G   TP         YN     +   G     +    I
Sbjct: 226 GHCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFFNGKPTPAKGLQII 285

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           FDSG+S+TY + P YT ++   N+  K K  +   D     C+
Sbjct: 286 FDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICW 328


>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
 gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 538

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 131/277 (47%), Gaps = 31/277 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ +G P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPEKPNV 209

Query: 163 SSKVPCNSTLC-ELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
              VP   + C ELQ  +        C Y++ Y +D + S G L  D + L T + + ++
Sbjct: 210 ---VPPRDSYCQELQGNQNYGDTSKQCDYEITY-ADRSSSMGILARDNMQLITADGEREN 265

Query: 220 VDSRISFGCGRVQTGSFLDGAA-PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           +D    FGCG  Q G+ L   A  +G+ GL     S+P+ LA+QG+I N F  C  +D +
Sbjct: 266 LD--FVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323

Query: 279 --GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDS 329
             G +  GD   P  G T   +R      Y+  + +V+ G   +N    A      IFDS
Sbjct: 324 NGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDS 383

Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPF 365
           G+S+TYL    YT  I+   +      ++ S   LPF
Sbjct: 384 GSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPF 420


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 95/339 (28%), Positives = 160/339 (47%), Gaps = 35/339 (10%)

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G     F V +DTGSD+ W+ C+ C +C      SS   I+ N +    SST++ +PC+ 
Sbjct: 75  GXXXXXFNVQIDTGSDILWVNCNTCSNCPQ----SSQLGIELNFFDTVGSSTAALIPCSD 130

Query: 171 TLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR-- 223
            +C         +C    + C Y  +Y  DG+ ++G+ V D ++      Q  +V+S   
Sbjct: 131 LICTSGVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFNLIMGQPPAVNSTAT 189

Query: 224 ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GR 280
           I FGC   Q+G       A +G+FG G    SV S L++QG+ P  FS C   DG   G 
Sbjct: 190 IVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249

Query: 281 ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS-----AIFDSG 330
           +  G+   P    +P  L  + P YN+ +  ++V G     N   F  S      I D G
Sbjct: 250 LVLGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCG 307

Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           T+  YL   AY  +    N ++++  R+T++       CY++S +  +  +P+V+L  +G
Sbjct: 308 TTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDI-FPLVSLNFEG 363

Query: 390 GGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
           G    +  +  ++ +    G  ++C+G  K  +  +I+G
Sbjct: 364 GASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILG 402


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 154/344 (44%), Gaps = 41/344 (11%)

Query: 101 LGFLH------YTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
           LG+ H      YT + +G P  +F V +DTGS + ++PC DC  C  G +++        
Sbjct: 3   LGYRHTRHSYFYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHC--GKHTA-------E 53

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
            + P+ S+T+ K+ C   LC       +  ++  Y  R  ++ + S G+++ED       
Sbjct: 54  WFDPDKSTTAKKLACGDPLCNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDS 113

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +        R+ FGC   +TG      A +G+ G+G +  +  S L  + +I + FS+CF
Sbjct: 114 DSPV-----RLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCF 167

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTH---PTYNITITQVSVGGNAVNFE-------F 323
           G    G +  GD   P    T ++   TH     YN+ +  ++V G  + F+       +
Sbjct: 168 GYPKDGILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGY 227

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY---CYVLSPNQ---TN 377
             + DSGT+FTYL   A+  +++      ++K   ST     +Y   C+  +P+Q    +
Sbjct: 228 GTVLDSGTTFTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLD 287

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
             +P       GG    +     +  S+P     YCLG+  + N
Sbjct: 288 KYFPPAEFVFGGGAKLTLPPLRYLFLSKPAE---YCLGIFDNGN 328


>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
 gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
          Length = 420

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ + T +++GQP   + + LDTGSDL WL CD  CV C+   +          +Y P 
Sbjct: 35  LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 83

Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
              +S  +PCN  LC+       ++C +    C Y+V Y +DG  S G LV DV   + +
Sbjct: 84  ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVF--SMN 136

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             Q   +  R++ GCG  Q          +G+ GLG  K S+ S L +QG + N    C 
Sbjct: 137 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 196

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
            S G G + FGD    S     TP S R+    Y+  +  ++  GG     +    +FDS
Sbjct: 197 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 255

Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
           G+S+TY N  AY  ++     E      KE R+  T  L ++
Sbjct: 256 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 297


>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 401

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ + T +++GQP   + + LDTGSDL WL CD  CV C+   +          +Y P 
Sbjct: 54  LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 102

Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
              +S  +PCN  LC+       ++C +    C Y+V Y +DG  S G LV DV  +  +
Sbjct: 103 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 155

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             Q   +  R++ GCG  Q          +G+ GLG  K S+ S L +QG + N    C 
Sbjct: 156 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 215

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
            S G G + FGD    S     TP S R+    Y+  +  ++  GG     +    +FDS
Sbjct: 216 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 274

Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
           G+S+TY N  AY  ++     E      KE R+  T  L ++
Sbjct: 275 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 316


>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ + T +++GQP   + + LDTGSDL WL CD  CV C+   +          +Y P 
Sbjct: 57  LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 105

Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
              +S  +PCN  LC+       ++C +    C Y+V Y +DG  S G LV DV  +  +
Sbjct: 106 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 158

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             Q   +  R++ GCG  Q          +G+ GLG  K S+ S L +QG + N    C 
Sbjct: 159 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 218

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
            S G G + FGD    S     TP S R+    Y+  +  ++  GG     +    +FDS
Sbjct: 219 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 277

Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
           G+S+TY N  AY  ++     E      KE R+  T  L ++
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 319


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ + T +++GQP   + + LDTGSDL WL CD  CV C+   +          +Y P 
Sbjct: 45  LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 93

Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
              +S  +PCN  LC+       ++C +    C Y+V Y +DG  S G LV DV  +  +
Sbjct: 94  ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 146

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             Q   +  R++ GCG  Q          +G+ GLG  K S+ S L +QG + N    C 
Sbjct: 147 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 206

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
            S G G + FGD    S     TP S R+    Y+  +  ++  GG     +    +FDS
Sbjct: 207 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 265

Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
           G+S+TY N  AY  ++     E      KE R+  T  L ++
Sbjct: 266 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 307


>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
 gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
 gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
 gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
 gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
          Length = 583

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/353 (26%), Positives = 156/353 (44%), Gaps = 52/353 (14%)

Query: 104 LHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSP-- 157
           L+YT + VG+P     + + +DTGS+L W+ CD  C SC  G N          +Y P  
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN---------QLYKPRK 252

Query: 158 -NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            N   +S          +L + C +    C Y++ Y +D + S G L +D  HL      
Sbjct: 253 DNLVRSSEAFCVEVQRNQLTEHCENC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL--HN 308

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
               +S I FGCG  Q G  L+     +G+ GL   K S+PS LA++G+I N    C  S
Sbjct: 309 GSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 368

Query: 276 D--GTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFEFS------A 325
           D  G G I  G    P  G T  P         Y + +T++S G   ++ +         
Sbjct: 369 DLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKV 428

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN- 384
           +FD+G+S+TY  + AY+Q+  +   ++  +     SD     C+     +TNF +  ++ 
Sbjct: 429 LFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICW---RAKTNFPFSSLSD 485

Query: 385 -------LTMKGGGPFFVNDPIVIVSSEPKGLYL-------YCLGVVKSDNVN 423
                  +T++ G  + +    +++  E    YL        CLG++   +V+
Sbjct: 486 VKKFFRPITLQIGSKWLIISRKLLIQPED---YLIISNKGNVCLGILDGSSVH 535


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  + V +DTGSD+ W+ C  C  C     + S   +D  +Y    S+T
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 209

Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
           S  V C+   C L     P    G  C Y V Y  DG+ +TG+ V+D +     +   Q+
Sbjct: 210 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 268

Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
              +  + FGCG  Q+G     + A +G+ G G   +S+ S LA+ G +   FS C  + 
Sbjct: 269 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 328

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
           DG G  + G+   P    TP    Q H  YN+ + ++ VGG+ ++    A         I
Sbjct: 329 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
            DSGT+  Y     Y  + E   S   + R
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 416


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  + V +DTGSD+ W+ C  C  C     + S   +D  +Y    S+T
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 209

Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
           S  V C+   C L     P    G  C Y V Y  DG+ +TG+ V+D +     +   Q+
Sbjct: 210 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 268

Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
              +  + FGCG  Q+G     + A +G+ G G   +S+ S LA+ G +   FS C  + 
Sbjct: 269 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 328

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
           DG G  + G+   P    TP    Q H  YN+ + ++ VGG+ ++    A         I
Sbjct: 329 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
            DSGT+  Y     Y  + E   S   + R
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 416


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  + V +DTGSD+ W+ C  C  C     + S   +D  +Y    S+T
Sbjct: 73  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 128

Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
           S  V C+   C L     P    G  C Y V Y  DG+ +TG+ V+D +     +   Q+
Sbjct: 129 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 187

Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
              +  + FGCG  Q+G     + A +G+ G G   +S+ S LA+ G +   FS C  + 
Sbjct: 188 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 247

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
           DG G  + G+   P    TP    Q H  YN+ + ++ VGG+ ++    A         I
Sbjct: 248 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 305

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
            DSGT+  Y     Y  + E   S   + R
Sbjct: 306 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 335


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 158/348 (45%), Gaps = 38/348 (10%)

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           R R     GR L          T    +D Y +     L++T V +G P   F V +DTG
Sbjct: 51  RARDQARHGRLLRGVVGGVVDFTVYGTSDPYLVG----LYFTKVKLGSPPREFNVQIDTG 106

Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP-----CNSTLCELQKQC 179
           SD+ W+ C+ C  C      +SG  I+ + + P++SST+S V      C S +     +C
Sbjct: 107 SDILWVTCNSCNDCPR----TSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAEC 162

Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS--RISFGCGRVQTGSFL 237
               + C Y   Y  DG+ +TG+ V D+L+  T    S   +S   I FGC   Q+G   
Sbjct: 163 SPQSNQCSYSFHY-GDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLT 221

Query: 238 D-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGET 294
               A +G+FG G    SV S L++ G+ P  FS C     DG G++  G+   P    +
Sbjct: 222 KVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGGGKLVLGEILEPNIIYS 281

Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAYTQIS 345
           P    Q+H  YN+ +  +SV G  +  + +          I DSGT+ TYL + AY    
Sbjct: 282 PLVPSQSH--YNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAY---- 335

Query: 346 ETFNSLAKEKRETSTSDLPFE--YCYVLSPNQTNFEYPVVNLTMKGGG 391
           + F S       +ST+ +  +   CY++S +     +P V+L   GG 
Sbjct: 336 DPFVSAITATVSSSTTPVLSKGNQCYLVSTSVDEI-FPPVSLNFAGGA 382


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 113/463 (24%), Positives = 187/463 (40%), Gaps = 68/463 (14%)

Query: 6   RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVD--DLPKKGSFAYYSAL 63
           R   V +++ L      CC       F    ++  P + + A+   D  ++G F     L
Sbjct: 4   RERLVRLVVSLFVVVQLCCHANANMVFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDL 63

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A       L G G                    R  S G L+YT + +G     + V +D
Sbjct: 64  A-------LGGNG--------------------RPTSTG-LYYTKIGLGPN--DYYVQVD 93

Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
           TGSD  W+ C  C +C       SG  ++  +Y PN+S TS  VPC+   C      P +
Sbjct: 94  TGSDTLWVNCVGCTTC----PKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGPIS 149

Query: 183 G----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV--DSRISFGCGRVQTGSF 236
           G     +CPY + Y  DG+ ++G  ++D L         ++V  ++ + FGCG  Q+G+ 
Sbjct: 150 GCKKDMSCPYSITY-GDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTL 208

Query: 237 --LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGE 293
                 + +G+ G G   +SV S LA  G +   FS C  + +G G  + G+   P    
Sbjct: 209 SSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPKVKT 268

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQI 344
           TP   R  H  YN+ +  + V G+ +               I DSGT+  YL    Y Q+
Sbjct: 269 TPLVPRMAH--YNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQL 326

Query: 345 SETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF--FVNDPIVI 401
            E   +LA+    E    +  F   +       +  +P V  T + G     + +D +  
Sbjct: 327 LE--KTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFP 384

Query: 402 VSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
              +     ++C+G  KS      G++  +  ++ L +  + Y
Sbjct: 385 FKED-----MWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIY 422


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 83/285 (29%), Positives = 129/285 (45%), Gaps = 28/285 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T + +G P+  + V +DTGSD+ W+ C  C SC       SG  ID  +Y P  S++
Sbjct: 88  LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPR----KSGLGIDLTLYDPTASAS 143

Query: 163 SSKVPCNSTLCELQKQC---PSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
           S  V C    C         PS  +N  C Y + Y  DG+ +TGF V D L     + + 
Sbjct: 144 SKTVTCGQEFCATATNGGVPPSCAANSPCQYSITY-GDGSSTTGFFVADFLQYDQVSGDG 202

Query: 216 QSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           Q+   ++ ++FGCG    G+      A +G+ G G   +S+ S L + G +   FS C  
Sbjct: 203 QTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLD 262

Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------EF 323
           + +G G  + G+   P    TP  L    P YN+ +  + VGG+ +              
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTP--LVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSR 320

Query: 324 SAIFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEY 367
             I DSGT+  YL +  Y  + S  F++      +     L F+Y
Sbjct: 321 GTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQY 365


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 156/343 (45%), Gaps = 36/343 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P   F V +DTGSD+ W+   C SC +G   +S   I  + + P  SS++
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 139

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           S V C+   C    Q  S  S    C Y  +Y  DG+ ++G+ + D +   T    + ++
Sbjct: 140 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGYYISDFMSFDTVITSTLAI 198

Query: 221 DSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
           +S     FGC  +Q+G       A +G+FGLG    SV S LA QGL P  FS C   D 
Sbjct: 199 NSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258

Query: 277 -GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
            G G +  G    P    TP  L  + P YN+ +  ++V G  +  + S          I
Sbjct: 259 SGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316

Query: 327 FDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEY----CYVLSPNQTNFEYP 381
            D+GT+  YL D AY+  I    N++++  R       P  Y    C+ ++    +  +P
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAVANAVSQYGR-------PITYESYQCFEITAGDVDV-FP 368

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
            V+L+  GG    +     +      G  ++C+G  +  +  I
Sbjct: 369 QVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRI 411


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 51/383 (13%)

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
           G   + SAL   D   R  GR LAA      PL  S       L +   L++T + +G P
Sbjct: 51  GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99

Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
           A  + V +DTGSD+ W+  +CVSC  G    S   I+  +Y P  S +   V C+   C 
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156

Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
                +   C S  S C Y + Y  DG+ + GF V D L     + + Q+   ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214

Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
           CG    G       A +G+ G G   +S+ S LA  G +   F+ C  + +G G  + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274

Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYL 336
              P    TP  L    P YN+ +  + VGG A+               I DSGT+  Y+
Sbjct: 275 VVQPKVKTTP--LVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYV 332

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
            +  Y  +   F  +  + ++ S   L    C+  S    +  +P V    +G       
Sbjct: 333 PEGVYKAL---FAMVFDKHQDISVQTLQDFSCFQYS-GSVDDGFPEVTFHFEG------- 381

Query: 397 DPIVIVSSE----PKGLYLYCLG 415
           D  +IVS        G  LYC+G
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMG 404


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 88/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ + T +++GQP   + + LDTGSDL WL CD  CV C+   +          +Y P 
Sbjct: 57  LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 105

Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
              +S  +PCN  LC+       ++C +    C Y+V Y +DG  S G LV DV  +  +
Sbjct: 106 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 158

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             +   +  R++ GCG  Q          +G+ GLG  K S+ S L +QG + N    C 
Sbjct: 159 YTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 218

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
            S G G + FGD    S     TP S R+    Y+  +  ++  GG     +    +FDS
Sbjct: 219 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 277

Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
           G+S+TY N  AY  ++     E      KE R+  T  L ++
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 319


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 81/268 (30%), Positives = 131/268 (48%), Gaps = 29/268 (10%)

Query: 95  TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
           T R +S+G L+Y  + +G P+  + + +DTG+D+ W+ C  C  C     + S   +D  
Sbjct: 64  TGRPDSVG-LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKEC----PTRSNLGMDLT 118

Query: 154 IYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDV 207
           +Y+   SS+   VPC+  LC+     L   C S  ++ CPY   Y  DG+ + G+ V+DV
Sbjct: 119 LYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIY-GDGSSTAGYFVKDV 177

Query: 208 LHL--ATDEKQSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           +     + + ++ S +  + FGCG  Q+G  S+ +  A +G+ G G    S+ S L++ G
Sbjct: 178 VLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSG 237

Query: 264 LIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
            +   F+ C  G +G G  + G    P    TP  L    P Y++ +T + VG   +N  
Sbjct: 238 KVKKMFAHCLNGVNGGGIFAIGHVVQPTVNTTP--LLPDQPHYSVNMTAIQVGHTFLNLS 295

Query: 323 FSA---------IFDSGTSFTYLNDPAY 341
             A         I DSGT+  YL D  Y
Sbjct: 296 TDASEQRDSKGTIIDSGTTLAYLPDGIY 323


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 92/329 (27%), Positives = 145/329 (44%), Gaps = 34/329 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC +CV C +  +           + P  SST   
Sbjct: 91  TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPR---------FQPELSSTYQP 141

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN+  C     C   G  C Y+ RY ++ + S+G L EDV+      K+S+ V  R  
Sbjct: 142 VKCNAD-C----NCDENGVQCTYERRY-AEMSTSSGVLAEDVMSFG---KESELVPQRAV 192

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  +++G      A +G+ GLG    SV   L  +G++ NSFS+C+G    G G +  
Sbjct: 193 FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G   SP       S     P YNI + ++ V G  +         ++ AI DSGT++ Y 
Sbjct: 252 GGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYF 311

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE----YPVVNLTMKGGGP 392
            + AY    +         ++ S  D  F+        +   E    +P V++    G  
Sbjct: 312 PEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
             ++ P   +    K    YCLG+ K+ N
Sbjct: 372 ISLS-PENYLFRHTKVSGAYCLGIFKNGN 399


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 91/329 (27%), Positives = 144/329 (43%), Gaps = 34/329 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC +CV C +  +           + P  SST   
Sbjct: 91  TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPR---------FQPELSSTYQP 141

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN+        C   G  C Y+ RY ++ + S+G L EDV+      K+S+ V  R  
Sbjct: 142 VKCNADC-----NCDENGVQCTYERRY-AEMSTSSGVLAEDVMSFG---KESELVPQRAV 192

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  +++G      A +G+ GLG    SV   L  +G++ NSFS+C+G    G G +  
Sbjct: 193 FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G   SP       S     P YNI + ++ V G  +         ++ AI DSGT++ Y 
Sbjct: 252 GGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYF 311

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE----YPVVNLTMKGGGP 392
            + AY    +         ++ S  D  F+        +   E    +P V++    G  
Sbjct: 312 PEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
             ++ P   +    K    YCLG+ K+ N
Sbjct: 372 ISLS-PENYLFRHTKVSGAYCLGIFKNGN 399


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 51/383 (13%)

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
           G   + SAL   D   R  GR LAA      PL  S       L +   L++T + +G P
Sbjct: 51  GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99

Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
           A  + V +DTGSD+ W+  +CVSC  G    S   I+  +Y P  S +   V C+   C 
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156

Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
                +   C S  S C Y + Y  DG+ + GF V D L     + + Q+   ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214

Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
           CG    G       A +G+ G G   +S+ S LA  G +   F+ C  + +G G  + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274

Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYL 336
              P    TP  L    P YN+ +  + VGG A+               I DSGT+  Y+
Sbjct: 275 VVQPKVKTTP--LVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYV 332

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
            +  Y  +   F  +  + ++ S   L    C+  S    +  +P V    +G       
Sbjct: 333 PEGVYKAL---FAMVFDKHQDISVQTLQDFSCFQYS-GSVDDGFPEVTFHFEG------- 381

Query: 397 DPIVIVSSE----PKGLYLYCLG 415
           D  +IVS        G  LYC+G
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMG 404


>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 406

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 75/254 (29%), Positives = 125/254 (49%), Gaps = 30/254 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTS 160
           L+YT +S+G P   + + +DTGS   W+ CD   C SC  G +          +Y P  +
Sbjct: 159 LYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHP---------LYRP--A 207

Query: 161 STSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            T+  +P +  LCE  Q + P   + C Y++ Y +DG+ S G  V D +    ++ + ++
Sbjct: 208 RTADALPASDPLCEGAQHENP---NQCDYEISY-ADGSSSMGVYVRDSMQFVGEDGEREN 263

Query: 220 VDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
            D  I FGCG  Q G  L+     +G+ GL     S+P+ LA++G+I N+F  C  +D +
Sbjct: 264 AD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPS 321

Query: 279 GR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE---FSAIFDSG 330
           G    +  GD   P  G T   +R           + Q++ G   +N +      +FD+G
Sbjct: 322 GAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTG 381

Query: 331 TSFTYLNDPAYTQI 344
           +++TY  D A T++
Sbjct: 382 STYTYFPDEALTRL 395


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 88/261 (33%), Positives = 123/261 (47%), Gaps = 23/261 (8%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +Y ++S+GQP   + +  DTGSDL WL CD  CV C    +          +Y PN
Sbjct: 64  LGY-YYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHP---------LYRPN 113

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            +    K P  ++L     +C      C Y+V Y +DG  S G LV+DV  L  +     
Sbjct: 114 NNLVICKDPMCASLHPPGYKCEHP-EQCDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGL 169

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
            +  R++ GCG  Q         P +G+ GLG  K+S+ S L +QG+I N    C  S G
Sbjct: 170 RLAPRLALGCGYDQIPG--QSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRG 227

Query: 278 TGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFT 334
            G + FGD    S     TP  LR  H  Y+    ++ +GG    F+     FDSG+S+T
Sbjct: 228 GGFLFFGDDLYDSSRVVWTPM-LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYT 286

Query: 335 YLNDPAYTQISETFNSLAKEK 355
           YLN  AY  +         EK
Sbjct: 287 YLNSLAYQALVHLVRKELSEK 307


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 129/282 (45%), Gaps = 43/282 (15%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  + +G PA  + + +DTGSDL WL CD  C SC  G +          +Y P  + 
Sbjct: 22  LYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPH---------GLYDPKKAR 72

Query: 162 TSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH-LATDEK 215
               V C   LC L +Q     C      C Y V Y +DG+ + G L+ED +  L T+  
Sbjct: 73  L---VDCRVPLCALVQQGGSYACGGPVRQCDYDVEY-ADGSSTMGVLMEDTITLLLTNGT 128

Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
           +SK+       GCG  Q G+     A+ +G+ GL   K S+PS LA +G++ N    C  
Sbjct: 129 RSKTT---AIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLA 185

Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------AI 326
            GS+G G + FGD   P  G T   +     T NI       GG + + +         +
Sbjct: 186 GGSNGGGYLFFGDSLVPALGMTWTPIMGKSITGNI-------GGKSGDADDKTGDIGGVM 238

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPF 365
           FDSGTSFTYL   AY  +        ++    R  + + LPF
Sbjct: 239 FDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPF 280


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 51/383 (13%)

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
           G   + SAL   D   R  GR LAA      PL  S       L +   L++T + +G P
Sbjct: 51  GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99

Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
           A  + V +DTGSD+ W+  +CVSC  G    S   I+  +Y P  S +   V C+   C 
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156

Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
                +   C S  S C Y + Y  DG+ + GF V D L     + + Q+   ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214

Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
           CG    G       A +G+ G G   +S+ S LA  G +   F+ C  + +G G  + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274

Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYL 336
              P    TP  L    P YN+ +  + VGG A+               I DSGT+  Y+
Sbjct: 275 VVQPKVKTTP--LVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYV 332

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
            +  Y  +   F  +  + ++ S   L    C+  S    +  +P V    +G       
Sbjct: 333 PEGVYKAL---FAMVFDKHQDISVQTLQDFSCFQYS-GSVDDGFPEVTFHFEG------- 381

Query: 397 DPIVIVSSE----PKGLYLYCLG 415
           D  +IVS        G  LYC+G
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMG 404


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 133/296 (44%), Gaps = 36/296 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L++T + +G P   + V +DTGSD+ W+  +C+SC       SG  +D   Y P  SS+ 
Sbjct: 86  LYFTEIKLGTPPKRYYVQVDTGSDILWV--NCISCSK-CPRKSGLGLDLTFYDPKASSSG 142

Query: 164 SKVPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
           S V C+   C      + P   +N  C Y V Y  DG+ +TGF + D L     T + Q+
Sbjct: 143 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMY-GDGSSTTGFFITDALQFDQVTGDGQT 201

Query: 218 KSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
           +  ++ I+FGCG  Q G   +   A +G+ G G   TS+ S LA  G     F+ C  + 
Sbjct: 202 QPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTI 261

Query: 276 DGTGRISFGDKGSP----------GQGETPFSL----RQTHPTYNITITQVSVGGNAVNF 321
            G G  + G+   P          G    P  L      + P YN+ +  + VGG  +  
Sbjct: 262 KGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQL 321

Query: 322 ---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD-LPFEY 367
                    +   I DSGT+ TYL +  + Q+ +   S  ++    +  D L F+Y
Sbjct: 322 PAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQY 377


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 149/331 (45%), Gaps = 35/331 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++T + +G PA S+ V +DTGSD+ W+ C  C +C       SG  I+  +Y P+ SS+
Sbjct: 80  LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPR----KSGLGIELTLYDPSGSSS 135

Query: 163 SSKVPCNSTLCELQKQ--CPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
            + V C    C        PS    + C Y + Y  DG+ +TGF V D L     +   Q
Sbjct: 136 GTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISY-GDGSSTTGFFVTDFLQYNQVSGNSQ 194

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           +   ++ I+FGCG    G     + A +G+ G G   +S+ S LA  G +   F+ C  +
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDT 254

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------A 325
            +G G  + GD   P    TP  L    P YN+ +  + VGG  +    +          
Sbjct: 255 INGGGIFAIGDVVQPKVSTTP--LVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGT 312

Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
           I DSGT+  YL    Y  I S+ F   A+       +D  F+ C+  S    +  +P++ 
Sbjct: 313 IIDSGTTLAYLPGVVYNAIMSKVF---AQYGDMPLKNDQDFQ-CFRYS-GSVDDGFPIIT 367

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLG 415
              +GG P  ++    +  +      LYC+G
Sbjct: 368 FHFEGGLPLNIHPHDYLFQNGE----LYCMG 394


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 134/284 (47%), Gaps = 32/284 (11%)

Query: 82  NDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DC 135
           +D+  L   AG D     + R +++G L+Y  V +G P+  + V +DTGSD+ W+ C  C
Sbjct: 59  DDRRQLRILAGVDLPLGGSGRPDTVG-LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQC 117

Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP----SAGSNCPYQVR 191
             C     SS G  ++  +Y+   S +   VPC+   C      P    +A  +CPY   
Sbjct: 118 RECPR--TSSLG--MELTLYNIKDSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEI 173

Query: 192 YLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFG 247
           Y  DG+ + G+ V+DV+     + + Q+ S +  + FGCG  Q+G        A +G+ G
Sbjct: 174 Y-GDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILG 232

Query: 248 LGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN 306
            G   +S+ S LA    +   F+ C  G +G G  + G    P    TP    Q H  YN
Sbjct: 233 FGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIGHVVQPKVNMTPLIPNQPH--YN 290

Query: 307 ITITQVSVGGNAVNF---EFS------AIFDSGTSFTYLNDPAY 341
           + +T V VG + ++    EF       AI DSGT+  YL +  Y
Sbjct: 291 VNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTTLAYLPEIVY 334


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 86/290 (29%), Positives = 131/290 (45%), Gaps = 29/290 (10%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           Y  + +G PA  F V +DTGS + ++PC       G N           + P  SST+S+
Sbjct: 79  YATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDA------AFDPEASSTASR 132

Query: 166 VPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           + C S  C     +C  +   C Y  R  ++ + S+G L+EDVL L           + I
Sbjct: 133 ISCTSPKCSCGSPRCGCSTQQCTY-TRSYAEQSSSSGILLEDVLAL-----HDGLPGAPI 186

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISF 283
            FGC   +TG      A +GLFGLG    SV + L   G+I + FS+CFG  +G G +  
Sbjct: 187 IFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGALLL 245

Query: 284 GDKGSPGQ---GETPFSLRQTHP-TYNITITQVSVGGNAVNFE-------FSAIFDSGTS 332
           GD   PG      TP     THP  YN+ +  ++V G  +          +  + DSGT+
Sbjct: 246 GDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGYGTVLDSGTT 305

Query: 333 FTYLNDPAYTQISETFN--SLAKEKRETSTSDLPF-EYCYVLSPNQTNFE 379
           FTY+  P +   +      +L+   +     D  F + C+  +P+  + E
Sbjct: 306 FTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLE 355


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 153/362 (42%), Gaps = 36/362 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT + +G     + V +DTGSD  W+ C  C +C       SG  +D  +Y PN S T
Sbjct: 75  LYYTKIGLGPK--DYYVQVDTGSDTLWVNCVGCTAC----PKKSGLGMDLTLYDPNLSKT 128

Query: 163 SSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S  VPC+   C    + Q    + G +CPY + Y  DG+ ++G  ++D L         +
Sbjct: 129 SKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITY-GDGSTTSGSYIKDDLTFDRVVGDLR 187

Query: 219 SV--DSRISFGCGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           +V  ++ + FGCG  Q+G+       + +G+ G G   +SV S LA  G +   FS C  
Sbjct: 188 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLD 247

Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
           S  G G  + G+   P    TP  L Q    YN+ +  + V G+ +              
Sbjct: 248 SISGGGIFAIGEVVQPKVKTTP--LLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRG 305

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+  YL    Y Q+ E   +     +     D  F   +       +  +P V 
Sbjct: 306 TIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVED-QFTCFHYSDEESVDDLFPTVK 364

Query: 385 LTMKGGGPF--FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCY 442
            T + G     +  D + +   +     ++C+G  KS      G+E  +  ++ L +   
Sbjct: 365 FTFEEGLTLTTYPRDYLFLFKED-----MWCVGWQKSMAQTKDGKELILLGDLVLANKLV 419

Query: 443 SY 444
            Y
Sbjct: 420 VY 421


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 168/406 (41%), Gaps = 72/406 (17%)

Query: 12  VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
           +L++L +   GC    G F      R   P  G         +G   + +AL   D  R+
Sbjct: 14  LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
            RL G    A G    P       DT        L+YT + +G P   + V +DTGSD+ 
Sbjct: 62  GRLLGAVDLALGGVGLP------TDT-------GLYYTRIEIGSPPKGYYVQVDTGSDIL 108

Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ------CPSAG 183
           W+  +C+ C  G  + SG  I+   Y P  S T+  V C    C           CPS  
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
           S C +++ Y  DG+ +TGF V D +     +   Q+ + ++ I+FGCG  Q G  L  + 
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221

Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
            A +G+ G G   +S+ S LA    +   F+ C  +  G G  + G+   P    TP   
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVP 281

Query: 299 RQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAY-TQISETF 348
             TH  YN+ +  +SVGG  +    S          I DSGT+  YL    Y T ++  F
Sbjct: 282 NVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVF 339

Query: 349 NSLAKEKRETSTSDLPFE-----YCYVLSPNQTNFEYPVVNLTMKG 389
           +            DLP        C+  S    +  +PV+  + KG
Sbjct: 340 DKY---------QDLPLHNYQDFVCFQFS-GSIDDGFPVITFSFKG 375


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +S+G P   + + +DTGSDL WL CD  CVSC           +   +Y P  + 
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC---------SKVPHPLYRPTKNK 107

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VPC   +C         + +C S    C Y+++Y   G+ S G LV D   L    
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161

Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             S  V   ++FGCG   Q GS  + +A +G+ GLG    S+ S L   G+  N    C 
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
            + G G + FGD   P    T  P +   +   Y+     +  GG  +       +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
           +SFTY +   Y  + +     L+K  +E     LP 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +S+G P   + + +DTGSDL WL CD  CVSC           +   +Y P  + 
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VPC   +C         + +C S    C Y+++Y   G+ S G LV D   L    
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161

Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             S  V   ++FGCG   Q GS  + +A +G+ GLG    S+ S L   G+  N    C 
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
            + G G + FGD   P    T  P +   +   Y+     +  GG  +       +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
           +SFTY +   Y  + +     L+K  +E     LP 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +S+G P   + + +DTGSDL WL CD  CVSC           +   +Y P  + 
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VPC   +C         + +C S    C Y+++Y   G+ S G LV D   L    
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161

Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             S  V   ++FGCG   Q GS  + +A +G+ GLG    S+ S L   G+  N    C 
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
            + G G + FGD   P    T  P +   +   Y+     +  GG  +       +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
           +SFTY +   Y  + +     L+K  +E     LP 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +S+G P   + + +DTGSDL WL CD  CVSC           +   +Y P  + 
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VPC   +C         + +C S    C Y+++Y   G+ S G LV D   L    
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161

Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             S  V   ++FGCG   Q GS  + +A +G+ GLG    S+ S L   G+  N    C 
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
            + G G + FGD   P    T  P +   +   Y+     +  GG  +       +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
           +SFTY +   Y  + +     L+K  +E     LP 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317


>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/260 (33%), Positives = 120/260 (46%), Gaps = 21/260 (8%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +Y ++S+GQP   + +   TGSDL WL CD  CV C    +          +Y PN
Sbjct: 64  LGY-YYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHX---------LYRPN 113

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            +    K P  + L     +C      C Y+V Y +DG  S G LV+DV  L  +     
Sbjct: 114 NNLVICKDPMCAXLHPPGYKCEHP-EQCDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGL 169

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
            +  R++ GCG  Q          +G+ GLG  K+S+ S L +QG+I N    C  S G 
Sbjct: 170 RLAPRLALGCGYDQIPG-XSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGG 228

Query: 279 GRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTY 335
           G + FGD    S     TP  LR  H  Y+    ++ +GG    F+     FDSG+S+TY
Sbjct: 229 GFLFFGDDLYDSSRVVWTPM-LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTY 287

Query: 336 LNDPAYTQISETFNSLAKEK 355
           LN  AY  +         EK
Sbjct: 288 LNSLAYQALVHLVRKELSEK 307


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 82/260 (31%), Positives = 117/260 (45%), Gaps = 40/260 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ VG P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 250

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP    LC+     Q  C +    C Y++ Y +D + S G L  D +H+ T     +
Sbjct: 251 EKIVPPKDLLCQELQGNQNYCETC-KQCDYEIEY-ADRSSSMGVLARDDMHIITTNGGRE 308

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
            +D    FGC   Q G  L   A  +G+ GL     S+PS LANQG+I N F  C   D 
Sbjct: 309 KLD--FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDP 366

Query: 277 -GTGRISFGDKGSPGQGETPFSLR---------QTHPTY--NITITQVSVGGNAVNFEFS 324
            G G +  GD   P  G T   +R         +    Y  +  ++     GN+V     
Sbjct: 367 NGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQ---- 422

Query: 325 AIFDSGTSFTYLNDPAYTQI 344
            IFDSG+S+TYL D  Y  +
Sbjct: 423 VIFDSGSSYTYLPDEIYKNL 442


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 82/260 (31%), Positives = 117/260 (45%), Gaps = 40/260 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ VG P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 251

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP    LC+     Q  C +    C Y++ Y +D + S G L  D +H+ T     +
Sbjct: 252 EKIVPPKDLLCQELQGNQNYCETC-KQCDYEIEY-ADRSSSMGVLARDDMHIITTNGGRE 309

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
            +D    FGC   Q G  L   A  +G+ GL     S+PS LANQG+I N F  C   D 
Sbjct: 310 KLD--FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDP 367

Query: 277 -GTGRISFGDKGSPGQGETPFSLR---------QTHPTY--NITITQVSVGGNAVNFEFS 324
            G G +  GD   P  G T   +R         +    Y  +  ++     GN+V     
Sbjct: 368 NGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQ---- 423

Query: 325 AIFDSGTSFTYLNDPAYTQI 344
            IFDSG+S+TYL D  Y  +
Sbjct: 424 VIFDSGSSYTYLPDEIYKNL 443


>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 395

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 81/278 (29%), Positives = 125/278 (44%), Gaps = 33/278 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHG----LNSSSGQVIDFNIYSPN 158
           +YT++++G P   + + +DTGSD  W+ CD  C +C  G       + G+++        
Sbjct: 16  YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVH------P 69

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
                 ++  N   CE  KQ       C Y++ Y +D + S G L  D + L T + + K
Sbjct: 70  RDPLCEELQGNQNYCETCKQ-------CDYEITY-ADRSSSKGVLARDNMQLTTADGEMK 121

Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
           +VD    FGC   Q G  LD   + +G+ GL     S+ + LAN G+I N F  C  +D 
Sbjct: 122 NVD--FVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDP 179

Query: 278 T--GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
           +  G +  GD   P  G T   +R      Y+  + +V+ G   +N    A      IFD
Sbjct: 180 SSGGYMFLGDDYVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFD 239

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKE-KRETSTSDLPF 365
           SG+S+TY     YT +       +    R+ S   LPF
Sbjct: 240 SGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPF 277


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 131/280 (46%), Gaps = 31/280 (11%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           LG+ + T +++GQP   + + LDTGSDL WL CD   CVH L +         +Y P   
Sbjct: 54  LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEAPH------PLYQP--- 102

Query: 161 STSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
            ++  +PCN  LC+        +C +    C Y+V Y +DG  S G LV DV  L  +  
Sbjct: 103 -SNDLIPCNDPLCKALHFNGNHRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSL--NYT 157

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           +   +  R++ GCG  Q          +G+ GLG  K S+ S L +QG + N    C  S
Sbjct: 158 KGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSS 217

Query: 276 DGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDSGT 331
            G G + FG+    S     TP + R+    Y+  +  ++  GG     +    +FDSG+
Sbjct: 218 LGGGILFFGNDLYDSSRVSWTPMA-RENSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGS 276

Query: 332 SFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
           S+TY N  AY  ++     E      KE R+  T  L ++
Sbjct: 277 SYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 316


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 159/365 (43%), Gaps = 44/365 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P+  + V +DTGSD+ W+  +C+ C  G  ++SG  I+   Y P  S T+
Sbjct: 84  LYYTQIEIGSPSKGYYVQVDTGSDILWV--NCIRC-DGCPTTSGLGIELTQYDPAGSGTT 140

Query: 164 SKVPCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
             V C+   C       L   CPS  S C +++ Y  DG+ +TGF V D +     +   
Sbjct: 141 --VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAY-GDGSSTTGFYVSDSVQYNQVSGNG 197

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           Q+   ++ I+FGCG  Q G  L  +  A +G+ G G   +S+ S LA    +   F+ C 
Sbjct: 198 QTTPSNASITFGCG-AQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256

Query: 274 GS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------- 324
            +  G G  + G+   P    TP     TH  YN+ +  +SVGG  +    S        
Sbjct: 257 DTVHGGGIFAIGNVVQPKVKTTPLVQNVTH--YNVNLQGISVGGATLQLPSSTFDSGDSK 314

Query: 325 -AIFDSGTSFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
             I DSGT+  YL    Y    T + + +  LA    +          C+  S    +  
Sbjct: 315 GTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQDFV-------CFQFS-GSIDDG 366

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFH 439
           +PVV  + +G     V     +  +E     LYC+G +        G++  +  ++ L +
Sbjct: 367 FPVVTFSFEGEITLNVYPHDYLFQNEND---LYCMGFLDGGVQTKDGKDMVLLGDLVLSN 423

Query: 440 NCYSY 444
               Y
Sbjct: 424 KLVVY 428


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT++ +G PA+ + V LDTGS  FW+    C  C H     S  +     Y P +S +
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
           S +V C+ T+C  +  C +    CPY   Y +DG ++ G L  D+LH        Q++  
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195

Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
            + ++FGCG  Q+GS  + A A +G+ G G    +  S LA  G     FS C  S +G 
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
           G  + G+   P    TP  ++     + + +  ++V G  +    +            DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314

Query: 330 GTSFTYLNDPAYTQI 344
           G++  YL +  Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT++ +G PA+ + V LDTGS  FW+    C  C H     S  +     Y P +S +
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
           S +V C+ T+C  +  C +    CPY   Y +DG ++ G L  D+LH        Q++  
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171

Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
            + ++FGCG  Q+GS  + A A +G+ G G    +  S LA  G     FS C  S +G 
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 231

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
           G  + G+   P    TP  ++     + + +  ++V G  +    +            DS
Sbjct: 232 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 290

Query: 330 GTSFTYLNDPAYTQI 344
           G++  YL +  Y+++
Sbjct: 291 GSTLVYLPEIIYSEL 305


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 112/406 (27%), Positives = 168/406 (41%), Gaps = 72/406 (17%)

Query: 12  VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
           +L++L +   GC    G F      R   P  G         +G   + +AL   D  R+
Sbjct: 14  LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
            RL G    A G    P       DT        L+YT + +G P   + V +DTGSD+ 
Sbjct: 62  GRLLGAVDLALGGVGLP------TDT-------GLYYTRIEIGSPPKGYYVQVDTGSDIL 108

Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ------CPSAG 183
           W+  +C+ C  G  + SG  I+   Y P  S T+  V C    C           CPS  
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
           S C +++ Y  DG+ +TGF V D +     +   Q+ + ++ I+FGCG  Q G  L  + 
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221

Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
            A +G+ G G   +S+ S LA    +   F+ C  +  G G  + G+   P    TP   
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVP 281

Query: 299 RQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAY-TQISETF 348
             TH  YN+ +  +SVGG  +    S          I DSGT+  YL    Y T ++  F
Sbjct: 282 NVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVF 339

Query: 349 NSLAKEKRETSTSDLPFE-----YCYVLSPNQTNFEYPVVNLTMKG 389
           +            DLP        C+  S    +  +PV+  + +G
Sbjct: 340 DKY---------QDLPLHNYQDFVCFQFS-GSIDDGFPVITFSFEG 375


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT++ +G PA+ + V LDTGS  FW+    C  C H     S  +     Y P +S +
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
           S +V C+ T+C  +  C +    CPY   Y +DG ++ G L  D+LH        Q++  
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195

Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
            + ++FGCG  Q+GS  + A A +G+ G G    +  S LA  G     FS C  S +G 
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
           G  + G+   P    TP  ++     + + +  ++V G  +    +            DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314

Query: 330 GTSFTYLNDPAYTQI 344
           G++  YL +  Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT++ +G PA+ + V LDTGS  FW+    C  C H     S  +     Y P +S +
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
           S +V C+ T+C  +  C +    CPY   Y +DG ++ G L  D+LH        Q++  
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195

Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
            + ++FGCG  Q+GS  + A A +G+ G G    +  S LA  G     FS C  S +G 
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
           G  + G+   P    TP  ++     + + +  ++V G  +    +            DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314

Query: 330 GTSFTYLNDPAYTQI 344
           G++  YL +  Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL-PCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT++ +G PA+ + V LDTGS  FW+    C  C H     S  +     Y P +S +
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
           S +V C+ T+C  +  C +    CPY   Y +DG ++ G L  D+LH        Q++  
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171

Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
            + ++FGCG  Q+GS  + A A +G+ G G    +  S LA  G     FS C  S +G 
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 231

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
           G  + G+   P    TP  ++     + + +  ++V G  +    +            DS
Sbjct: 232 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 290

Query: 330 GTSFTYLNDPAYTQI 344
           G++  YL +  Y+++
Sbjct: 291 GSTLVYLPEIIYSEL 305


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 87/296 (29%), Positives = 142/296 (47%), Gaps = 32/296 (10%)

Query: 71  RLRGRGLAA-QGND-KTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           + + R L+A + +D +  L+  AG D     + R +++G L+Y  + +G P  ++ + +D
Sbjct: 43  KYQDRSLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVG-LYYAKIGIGTPPKNYYLQVD 101

Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQK 177
           TGSD+ W+ C  C  C     + S   +D  +Y    SS+   VPC+   C+     L  
Sbjct: 102 TGSDIMWVNCIQCKEC----PTRSSLGMDLTLYDIKESSSGKLVPCDQEFCKEINGGLLT 157

Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRVQTG- 234
            C +A  +CPY   Y  DG+ + G+ V+D++     + + ++ S +  I FGCG  Q+G 
Sbjct: 158 GC-TANISCPYLEIY-GDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGARQSGD 215

Query: 235 -SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQG 292
            S  +  A +G+ G G   +S+ S LA+ G +   F+ C  G +G G  + G    P   
Sbjct: 216 LSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAIGHVVQPKVN 275

Query: 293 ETPFSLRQTHPTYNITITQV-------SVGGNAVNFEFSAIFDSGTSFTYLNDPAY 341
            TP    Q H + N+T  QV       S   +A       I DSGT+  YL +  Y
Sbjct: 276 MTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYLPEGIY 331


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 90/284 (31%), Positives = 127/284 (44%), Gaps = 47/284 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +   +Y P  +  
Sbjct: 57  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPLYRPTKNKL 107

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
              VPC +++C          K+C +    C YQ++Y +D   S G LV D   L    K
Sbjct: 108 ---VPCANSICTALHSGSSPNKKC-TTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRNK 162

Query: 216 QSKSVDSRISFGCG-RVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
              +V   +SFGCG   Q G   +GAAP   +GL GLG    S+ S L  QG+  N    
Sbjct: 163 S--NVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGH 218

Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE-------- 322
           C  + G G + FGD   P    T  S+ R T   Y       S G   + F+        
Sbjct: 219 CLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNY------YSPGSATLYFDRRSLSTKP 272

Query: 323 FSAIFDSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
              +FDSG+++TY +  P    IS    SL+K  ++ S   LP 
Sbjct: 273 MEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPL 316


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 35/335 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC DC  C        G+  D   + P+ SST   
Sbjct: 90  TRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHC--------GKHQDPR-FQPDESSTYHP 140

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C   G NC Y+ RY ++ + S+G L ED++       QS+ V  R  
Sbjct: 141 VKCN-----MDCNCDHDGVNCVYERRY-AEMSSSSGVLGEDIISFGN---QSEVVPQRAV 191

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
           FGC  V+TG      A +G+ GLG  + S+   L ++ +I +SFS+C+G    G  +   
Sbjct: 192 FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVL 250

Query: 286 KGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
            G P   +  FS    +  P YNI + ++ V G  +         +   + DSGT++ YL
Sbjct: 251 GGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYL 310

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
            + A+    +     +   ++    D  + + C+       +Q +  +P V++    G  
Sbjct: 311 PEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQK 370

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
             +  P   +    K    YCLG+ ++ D+  ++G
Sbjct: 371 LSLT-PENYLFQHTKVHGAYCLGIFRNGDSTTLLG 404


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 126/262 (48%), Gaps = 25/262 (9%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSSTSS 164
           T + +G P+  F + +D+GS + ++PC          S S  +I+ +   + P+ SST S
Sbjct: 94  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            V CN     +   C +  S C Y+ +Y ++ + S+G L ED++      K+S+    R 
Sbjct: 154 PVKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRA 204

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
            FGC   +TG      A +G+ GLG  + S+   L  +G+I +SFS+C+G    G  +  
Sbjct: 205 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 263

Query: 285 DKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTY 335
             G P   +  FS       P YNI + ++ V G A+       N +   + DSGT++ Y
Sbjct: 264 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 323

Query: 336 LNDPAYT----QISETFNSLAK 353
           L + A+      ++   NSL K
Sbjct: 324 LPEQAFVAFKDAVTNKVNSLKK 345


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 126/262 (48%), Gaps = 25/262 (9%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSSTSS 164
           T + +G P+  F + +D+GS + ++PC          S S  +I+ +   + P+ SST S
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            V CN     +   C +  S C Y+ +Y ++ + S+G L ED++      K+S+    R 
Sbjct: 153 PVKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRA 203

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
            FGC   +TG      A +G+ GLG  + S+   L  +G+I +SFS+C+G    G  +  
Sbjct: 204 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 262

Query: 285 DKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTY 335
             G P   +  FS       P YNI + ++ V G A+       N +   + DSGT++ Y
Sbjct: 263 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 322

Query: 336 LNDPAYT----QISETFNSLAK 353
           L + A+      ++   NSL K
Sbjct: 323 LPEQAFVAFKDAVTNKVNSLKK 344


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 124/275 (45%), Gaps = 36/275 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ VG P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 194 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 241

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP    LC+     Q  C +    C Y++ Y +D + S G L +D +H+       +
Sbjct: 242 EKIVPPRDLLCQELQGDQNYCATC-KQCDYEIEY-ADRSSSMGVLAKDDMHMIATNGGRE 299

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
            +D    FGC   Q G  L   A  +G+ GL     S+PS LA+QG+I N F  C   + 
Sbjct: 300 KLD--FVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEP 357

Query: 277 -GTGRISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNFEFSA------IFD 328
            G G +  GD   P  G T   +R      Y+    +V+ G   +     A      IFD
Sbjct: 358 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417

Query: 329 SGTSFTYLNDPAY----TQISETFNSLAKEKRETS 359
           SG+S+TYL D  Y    T I   + S  ++  +T+
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTT 452


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 114/427 (26%), Positives = 170/427 (39%), Gaps = 40/427 (9%)

Query: 18  SCCAGCCFGFGTFGFDFHHRYS--DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGR 75
           +C A    G G F  DF HR S   P +        P     A   A A R     + GR
Sbjct: 21  TCTASAAAGEGGFSVDFIHRDSARSPYR-------HPALSPHARALAAARRSLRGEVLGR 73

Query: 76  GLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC 135
             +       P++ + G    ++ +  F +   V+VG P    +   DTGSDL W+ C  
Sbjct: 74  SYSGASPAAAPVSAADGGVESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNC-- 131

Query: 136 VSCVHGLNSSSGQVIDFN-----IYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQ 189
                  +SS G + D +     ++ P  SST S++ C S  C+   Q    A S C YQ
Sbjct: 132 -------SSSGGGLADADAGGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQ 184

Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
             Y  DG+ + G L  +         + +    R++FGC     G+F      +GL GLG
Sbjct: 185 YSY-GDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLG 239

Query: 250 MDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFGDKG---SPGQGETPFSLRQTH 302
               S+ S L     I    S C    + ++ +  ++FG +     PG   TP       
Sbjct: 240 AGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVD 299

Query: 303 PTYNITITQVSVGGNAVNFEFSAIF-DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
             Y + +  V+VGG  V    S I  DSGT+ T+L+      +        K +R     
Sbjct: 300 SYYTVALESVAVGGQEVATHDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPE 359

Query: 362 DLPFEYCY-VLSPNQT-NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
            L  + CY V   ++T NF  P V L   GG    +         +   L L  + V +S
Sbjct: 360 QL-LQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSES 418

Query: 420 DNVNIIG 426
             V+I+G
Sbjct: 419 QPVSILG 425


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 80/256 (31%), Positives = 112/256 (43%), Gaps = 32/256 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ VG P   + + +DTGSDL W+ CD  C +C  G +          +Y P     
Sbjct: 187 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPTKEKI 237

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP    LC+     Q  C +    C Y++ Y +D + S G L  D +HL       +
Sbjct: 238 ---VPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHLIATNGGRE 292

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
            +D    FGC   Q G  L   A  +G+ GL     S+PS LA+ G+I N F  C   + 
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQ 350

Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
            G G +  GD   P  G T  S+R      Y+     V  G   +     A      IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFD 410

Query: 329 SGTSFTYLNDPAYTQI 344
           SG+S+TYL D  Y  +
Sbjct: 411 SGSSYTYLPDEIYENL 426


>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 440

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 89/272 (32%), Positives = 122/272 (44%), Gaps = 28/272 (10%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           +GF + T +++G P   + + +DTGSDL WL CD  C  C    +          +Y P 
Sbjct: 82  VGFYNVT-INIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 130

Query: 159 TSSTSSKVPCNSTLCELQKQCPS----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TD 213
              ++  VPC   LC    Q  +        C Y+V Y +D   S G LV DV  L  T+
Sbjct: 131 ---SNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEY-ADHYSSLGVLVNDVYVLNFTN 186

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             Q K    R++ GCG  Q          +G+ GLG  K+S+ S L  QGL+ N    C 
Sbjct: 187 GVQLKV---RMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCL 243

Query: 274 GSDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGT 331
            + G G I FGD   S     TP S R  +  Y+    ++ +GG    F    A+FD+G+
Sbjct: 244 SAQGGGYIFFGDVYDSSRLAWTPMSSRD-YKHYSAGAAELVLGGKRTGFGNLLAVFDAGS 302

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
           S+TY N  AY    E      KE  E  T  L
Sbjct: 303 SYTYFNSNAYQLTKELAGKPIKEAPEDQTLPL 334


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 126/284 (44%), Gaps = 47/284 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +   +Y P  +  
Sbjct: 57  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPLYRPTKNKL 107

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
              VPC +++C          K+C +    C YQ++Y +D   S G LV D   L    K
Sbjct: 108 ---VPCANSICTALHSGSSPNKKC-TTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRNK 162

Query: 216 QSKSVDSRISFGCG-RVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
              +V   +SFGCG   Q G   +GAAP   +GL GLG    S+ S L  QG+  N    
Sbjct: 163 S--NVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGH 218

Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE-------- 322
           C  + G G + FGD   P    T   + R T   Y       S G   + F+        
Sbjct: 219 CLSTSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNY------YSPGSATLYFDRRSLSTKP 272

Query: 323 FSAIFDSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
              +FDSG+++TY +  P    IS    SL+K  ++ S   LP 
Sbjct: 273 MEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPL 316


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 147/329 (44%), Gaps = 34/329 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P  +F + +DTGS L ++PC  C  C        G+  D N + P+ SST   
Sbjct: 94  TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQC--------GKHQDPN-FQPDWSSTYQP 144

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + C+     ++  C S   +C Y  +Y ++ + S+G L ED++      KQS+    R  
Sbjct: 145 LKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSFG---KQSELKPQRTV 195

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L  +G+I NSFS+C+G    G G +  
Sbjct: 196 FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVL 254

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P       S       YNI + ++ + G  +         ++  I DSGT++ YL
Sbjct: 255 GGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
            +PA+    +         +     D  + + C+       +Q +  +P V+L    G  
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNR 374

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
             ++ P   +    K    YCLG+ +++N
Sbjct: 375 LSLS-PENYLFQHSKAHGAYCLGIFQNEN 402


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 147/329 (44%), Gaps = 34/329 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P  +F + +DTGS L ++PC  C  C        G+  D N + P+ SST   
Sbjct: 94  TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQC--------GKHQDPN-FQPDWSSTYQP 144

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + C+     ++  C S   +C Y  +Y ++ + S+G L ED++      KQS+    R  
Sbjct: 145 LKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSFG---KQSELKPQRTV 195

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L  +G+I NSFS+C+G    G G +  
Sbjct: 196 FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVL 254

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P       S       YNI + ++ + G  +         ++  I DSGT++ YL
Sbjct: 255 GGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
            +PA+    +         +     D  + + C+       +Q +  +P V+L    G  
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNR 374

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
             ++ P   +    K    YCLG+ +++N
Sbjct: 375 LSLS-PENYLFQHSKAHGAYCLGIFQNEN 402


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/306 (32%), Positives = 138/306 (45%), Gaps = 37/306 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  VSVG P     + +DTGSD+ WL C  CVSC H  +          ++ P  SST 
Sbjct: 37  YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCD---------EVFDPYKSSTY 87

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S + CNS  C         G+ C YQV Y  DG+ STG    D + L +     + V ++
Sbjct: 88  STLGCNSRQCLNLDVGGCVGNKCLYQVDY-GDGSFSTGEFATDAVSLNSTSGGGQVVLNK 146

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           I  GCG    G F+  A   GL        S P+ + ++      FS C     +D T R
Sbjct: 147 IPLGCGHDNEGYFVGAAGLLGLG---KGPLSFPNQINSEN--GGRFSYCLTGRDTDSTER 201

Query: 281 IS--FGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
            S  FGD   P  G   TP +      T Y + +T +SVGG+ +    SA          
Sbjct: 202 SSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGG 261

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGTS T L + AY  + E F +   +   T+   L F+ CY LS + ++ + P V 
Sbjct: 262 VIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSL-FDTCYNLS-DLSSVDVPTVT 319

Query: 385 LTMKGG 390
           L  +GG
Sbjct: 320 LHFQGG 325


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 82/281 (29%), Positives = 122/281 (43%), Gaps = 43/281 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +   +Y P  +S 
Sbjct: 54  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANSL 104

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
              VPC + LC           +CPS    C YQ++Y +D   S G L+ D   L     
Sbjct: 105 ---VPCANALCTALHSGHGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDNFSLPM--- 156

Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +S ++   ++FGCG  Q    +    AA +G+ GLG    S+ S L  QG+  N    C 
Sbjct: 157 RSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE--------FSA 325
            ++G G + FGD         P S     P   I+    S G   + F+           
Sbjct: 217 STNGGGFLFFGDD------IVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEV 270

Query: 326 IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF 365
           +FDSG+++TY     Y  +     S L+K  ++ S   LP 
Sbjct: 271 VFDSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPL 311


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 76/256 (29%), Positives = 122/256 (47%), Gaps = 24/256 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+Y  + +G P  ++ + +DTGSD+ W+ C  C  C     + S   +D  +Y    SS+
Sbjct: 84  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKEC----PTRSNLGMDLTLYDIKESSS 139

Query: 163 SSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEK 215
              VPC+   C+     L   C +A  +CPY   Y  DG+ + G+ V+D++     + + 
Sbjct: 140 GKFVPCDQEFCKEINGGLLTGC-TANISCPYLEIY-GDGSSTAGYFVKDIVLYDQVSGDL 197

Query: 216 QSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           ++ S +  I FGCG  Q+G  S  +  A  G+ G G   +S+ S LA+ G +   F+ C 
Sbjct: 198 KTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL 257

Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------A 325
            G +G G  + G    P    TP    Q H + N+T  QV     +++ + S        
Sbjct: 258 NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGT 317

Query: 326 IFDSGTSFTYLNDPAY 341
           I DSGT+  YL +  Y
Sbjct: 318 IIDSGTTLAYLPEGIY 333


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 94/339 (27%), Positives = 149/339 (43%), Gaps = 44/339 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ VG P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 238

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP   +LC+     Q  C +    C Y++ Y +D + S G L +D +HL       +
Sbjct: 239 EKIVPPRDSLCQELQGDQNYCETC-KQCDYEIEY-ADRSSSMGVLAKDDMHLIATNGGRE 296

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
            +D    FGC   Q G  L   A  +G+ GL     S+PS LA++G+I N F  C    +
Sbjct: 297 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354

Query: 276 DGTGRISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNF--EFSAIFDSGTS 332
           +G G +  GD   P  G T   +R      Y+    +V+ G   ++       IFDSG+S
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
           +TYL +  Y  + +     +    + S SD     C+    +  +F  P   L +  G  
Sbjct: 415 YTYLPEEMYKNLIDAIKEDSPSFVQDS-SDTTLPLCWKADFSVRSFFKP---LNLHFGRR 470

Query: 393 FF--------VNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
           +F        V D  +I+S +       CLG++    +N
Sbjct: 471 WFVVPKTFTIVPDDYLIISDKGN----VCLGLLNGTEIN 505


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 110/414 (26%), Positives = 168/414 (40%), Gaps = 32/414 (7%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
           G F  DF HR  D  +   A   LP        +  + R       GR +        P+
Sbjct: 28  GGFSVDFIHR--DSARSPFAQPSLPPHARALAAARRSLRGAAL---GRYVGGASPAPGPV 82

Query: 88  TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
             + G    ++ +  F +   V+VG P    +   DTGSDL W+  +C S   G  +S G
Sbjct: 83  PEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWV--NCSSNGGGGGASDG 140

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVED 206
            V    ++ P+ S+T S + C S  C+   Q    A S C YQ  Y  DG+ + G L  +
Sbjct: 141 AV----VFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAY-GDGSRTIGVLSTE 195

Query: 207 VLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
                 A    + +    R+SFGC     GSF      +GL GLG    S+ S L     
Sbjct: 196 TFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAAR 251

Query: 265 IPNSFSMCF-----GSDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGG 316
           I   FS C       ++ +  +SFG +     PG   TP    +    Y + +  V+V G
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAG 311

Query: 317 NAVNFEFSA--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
             V    S+  I DSGT+ T+L+      +        +  R      L  + CY +   
Sbjct: 312 QDVASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQL-LQLCYDVQGK 370

Query: 375 QTNFEYPVVNLTMK-GGGPFFVNDPIVIVSSEPKG-LYLYCLGVVKSDNVNIIG 426
               ++ + ++T++ GGG      P    S   +G L L  + V +S  V+I+G
Sbjct: 371 SQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILG 424


>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 83/278 (29%), Positives = 121/278 (43%), Gaps = 28/278 (10%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
           GF + T + VGQP   + +  DTGSDL WL CD  C  C   L+          +Y P  
Sbjct: 55  GFYNVT-LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP-- 102

Query: 160 SSTSSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
             ++  VPC   LC      +  +C +    C Y+V Y +DG  S G LV DV  L  + 
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEY-ADGGSSLGVLVRDVFPL--NL 156

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
                +  R++ GCG  Q          +G+ GLG    S+ S L NQG++ N    CF 
Sbjct: 157 TNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE-FSAIFDSGTS 332
           S G G + FGD            + + +P  Y+    ++   G +        +FDSG+S
Sbjct: 217 SKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 276

Query: 333 FTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
           +TY N  AY  ++   N  LA +    +  D     C+
Sbjct: 277 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCW 314


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 131/284 (46%), Gaps = 27/284 (9%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P   + V +DTGSD+ W+ C  C +C       S   I+ ++YSP++SST
Sbjct: 73  LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNC----PKKSDLGIELSLYSPSSSST 128

Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVED--VLHLATDEKQ 216
           S++V CN   C      P  G      C Y+V Y  DG+ + G+ V D  VL   T   Q
Sbjct: 129 SNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAY-GDGSSTAGYFVRDHVVLDRVTGNFQ 187

Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           + S +  I FGCG  Q+G      AA +G+ G G   +S+ S LA+ G +   F+ C  +
Sbjct: 188 TTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDN 247

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN---------FEFSA 325
            +G G  + G+   P    TP   +Q H  YN+ +  + V    +N              
Sbjct: 248 INGGGIFAIGEVVQPKVRTTPLVPQQAH--YNVFMKAIEVDNEVLNLPTDVFDTDLRKGT 305

Query: 326 IFDSGTSFTYLNDPAYT-QISETFNSLAKEKRETSTSDLP-FEY 367
           I DSGT+  Y  D  Y   IS+ F   +  K  T       FEY
Sbjct: 306 IIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFEY 349


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 99/310 (31%), Positives = 139/310 (44%), Gaps = 42/310 (13%)

Query: 97  RLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNI 154
           R  SLG  +Y  +V +G PA  + V  DTGSDL W+ C  C  C    +          +
Sbjct: 140 RGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP---------L 190

Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           + P+ SST + V C +  C EL     S+ S C Y+V+Y  D + + G LV D L L+  
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSAS 249

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN---SFS 270
           +     V     FGCG    G F      +GLFGLG +K S+PS    QG  P+    F+
Sbjct: 250 DTLPGFV-----FGCGDQNAGLF---GQVDGLFGLGREKVSLPS----QG-APSYGPGFT 296

Query: 271 MCFGSDGTGR--ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF------- 321
            C  S  +GR  +S G         T  +   T   Y I +  + VGG A+         
Sbjct: 297 YCLPSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAA 356

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
               + DSGT  T L   AY  +   F  S+A+ K+  + S L  + CY  + ++T  + 
Sbjct: 357 AGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL--DTCYDFTGHRTA-QI 413

Query: 381 PVVNLTMKGG 390
           P V L   GG
Sbjct: 414 PTVELAFAGG 423


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 39/312 (12%)

Query: 55  GSFAYYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGND-----TYRLNSLGFLHYTN 108
           G F+     A R+R    L+   ++ Q      L F AG D     + R +++G L+Y  
Sbjct: 38  GVFSVKYKYAGRERSLSTLKAHDISRQ------LRFLAGVDIPLGGSGRPDAVG-LYYAK 90

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           + +G P+  + V +DTGSD+ W+ C  C  C     SS G  ++   Y    S+T   V 
Sbjct: 91  IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPR--TSSLG--MELTPYDLEESTTGKLVS 146

Query: 168 CNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVD 221
           C+   C      P +G     +CPY ++   DG+ + G+ V+D +     + + ++ + +
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205

Query: 222 SRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGT 278
             I FGCG  Q+G        A +G+ G G   +S+ S LA+   +   F+ C  G++G 
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSA------IFDS 329
           G  + G    P    TP    Q H  YN+ +T V VG   +N     F A      I DS
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFEAGDRKGTIIDS 323

Query: 330 GTSFTYLNDPAY 341
           GT+  YL +  Y
Sbjct: 324 GTTLAYLPELIY 335


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 98/307 (31%), Positives = 138/307 (44%), Gaps = 42/307 (13%)

Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
           SLG  +Y  +V +G PA  + V  DTGSDL W+ C  C  C    +          ++ P
Sbjct: 143 SLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP---------LFDP 193

Query: 158 NTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           + SST + V C +  C EL     S+ S C Y+V+Y  D + + G LV D L L+  +  
Sbjct: 194 SLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSASDTL 252

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN---SFSMCF 273
              V     FGCG    G F      +GLFGLG +K S+PS    QG  P+    F+ C 
Sbjct: 253 PGFV-----FGCGDQNAGLF---GQVDGLFGLGREKVSLPS----QG-APSYGPGFTYCL 299

Query: 274 GSDGTGR--ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFS 324
            S  +GR  +S G         T  +   T   Y I +  + VGG A+            
Sbjct: 300 PSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG 359

Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            + DSGT  T L   AY  +   F  S+A+ K+  + S L  + CY  + ++T  + P V
Sbjct: 360 TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL--DTCYDFTGHRTA-QIPTV 416

Query: 384 NLTMKGG 390
            L   GG
Sbjct: 417 ELAFAGG 423


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 80/269 (29%), Positives = 121/269 (44%), Gaps = 17/269 (6%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            ++++G+   +F   +D+GSDL W+ CD   C H             +Y PN ++ +   
Sbjct: 57  VSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPRE------QLYKPNNNALNCFE 109

Query: 167 P-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           P C S        C SA   C Y++ Y   G+ S G LV D  H+            RI+
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGS-SLGVLVND--HVPLKLTNGSLAAPRIA 166

Query: 226 FGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
           FGCG     S  D + P  G+ GLG  + S  S L++ G++ N    C   +G G + FG
Sbjct: 167 FGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFG 225

Query: 285 DKGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYLNDPAY 341
           D+  P  G T  S+        Y+    +V  GG A    + + +FDSG+S+TY N  AY
Sbjct: 226 DEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAY 285

Query: 342 TQI-SETFNSLAKEKRETSTSDLPFEYCY 369
             I +   N+L  +  E +  D     C+
Sbjct: 286 NSILALVKNNLRGKPLEDAPEDKSLPVCW 314


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 84/278 (30%), Positives = 125/278 (44%), Gaps = 39/278 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +    Y P  +  
Sbjct: 73  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPWYKPTKNKI 123

Query: 163 SSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
              VPC ++LC      K+C +    C YQ++Y +D   S G L+ D   L+   + S +
Sbjct: 124 ---VPCAASLCTSLTPNKKC-AVPQQCDYQIKY-TDKASSLGVLIADNFTLSL--RNSST 176

Query: 220 VDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
           V + ++FGCG  Q    +    AA +GL GLG    S+ S L  QG+  N    CF ++G
Sbjct: 177 VRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNG 236

Query: 278 TGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE--------FSAIFD 328
            G + FGD   P    T   + R T   Y       S G   + F+           +FD
Sbjct: 237 GGFLFFGDDIVPTSRVTWVPMARTTSGNY------YSPGSGTLYFDRRSLGMKPMEVVFD 290

Query: 329 SGTSFTYL-NDPAYTQISETFNSLAKEKRETSTSDLPF 365
           SG+++ Y   +P    +S     L+K  +E S   LP 
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPL 328


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 109/360 (30%), Positives = 155/360 (43%), Gaps = 58/360 (16%)

Query: 99  NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
           N  G  H   +SVG P L+F   +DTGSDL W  C  C +      +         +Y P
Sbjct: 91  NGAGAYHMI-LSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTP--------LYDP 141

Query: 158 NTSSTSSKVPCNSTLCELQKQCPSA-----GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
             SST SK+PC S LC+     PSA      + C Y  RY      + G+L  D L +  
Sbjct: 142 ARSSTFSKLPCASPLCQ---ALPSAFRACNATGCVYDYRYAVG--FTAGYLAADTLAIGD 196

Query: 213 DEKQSKSVDS--RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            +    +  S   ++FGC     G  +DGA  +G+ GLG    S  S+L+  G+    FS
Sbjct: 197 GDGDGDASSSFAGVAFGCSTANGGD-MDGA--SGIVGLGR---SALSLLSQIGV--GRFS 248

Query: 271 MCFGSD---GTGRISF-------GDK-GSPGQGETPFSLRQTHPTYNITITQVSVGGNAV 319
            C  SD   G   I F       GDK  S      P + R+  P Y + +T ++VG   +
Sbjct: 249 YCLRSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDL 308

Query: 320 -----NFEFSA------IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEY 367
                 F F+A      I DSGT+FTYL +  YT + + F S  A      S +   F+ 
Sbjct: 309 PVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDL 368

Query: 368 CYVLSPNQTNFEYPVVNLTMK-GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           C+      T    PV  L  +  GG  +         +  +G  + CL V+ +  V++IG
Sbjct: 369 CFEAGAADT----PVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIG 424


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 151/347 (43%), Gaps = 44/347 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   + +G PA  + V  DTGSD  W+ C+ CV   +             ++ 
Sbjct: 179 RALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQE--------KLFD 230

Query: 157 PNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
           P  SST + + C +  C     K C  +G +C Y V+Y  DG+ S GF   D L L+   
Sbjct: 231 PARSSTDANISCAAPACSDLYTKGC--SGGHCLYGVQY-GDGSYSIGFFAMDTLTLS--- 284

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
             S        FGCG    G F + A   GL GLG  KTS+P    ++      F+ CF 
Sbjct: 285 --SYDAIKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDK--YGGVFAHCFP 337

Query: 274 -GSDGTGRISFGDKGSPG---QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---- 325
             S GTG + FG   SP    +  TP  +      Y + +T + VGG  ++   S     
Sbjct: 338 ARSSGTGYLDFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA 397

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
             I DSGT  T L   AY+ +   F S +A    + + +    + CY  +   +    P 
Sbjct: 398 GTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFT-GMSQVAIPT 456

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV---KSDNVNIIG 426
           V+L  +GG    V+   +I ++    +   CLG     + D+V I+G
Sbjct: 457 VSLLFQGGASLDVDASGIIYAAS---VSQACLGFAANEEDDDVGIVG 500


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 82/276 (29%), Positives = 115/276 (41%), Gaps = 29/276 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +S+G P   + + +DTGSDL WL CD  CVSC           +   +Y P  + 
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC---------NKVPHPLYRPTKNK 107

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VPC   LC         + +C S    C Y+++Y   G+ S G L+ D    A   
Sbjct: 108 I---VPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGS-SLGVLLTD--SFAVRL 161

Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             S  V   ++FGCG   Q GS  + A  +G+ GLG    S+ S L   G+  N    C 
Sbjct: 162 ANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL 221

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
              G G + FGD   P    T  P         Y+     +  GG ++       + DSG
Sbjct: 222 SIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281

Query: 331 TSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF 365
           +SFTY     Y  +     S L+K  +E     LP 
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPL 317


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 145/342 (42%), Gaps = 34/342 (9%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
           ++LG  +Y   + +G PA  + V  DTGSD  W+ C+ CV   +             ++ 
Sbjct: 154 SALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQE--------KLFD 205

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + + C +  C        +G +C Y V+Y  DG+ S GF   D L L+     
Sbjct: 206 PARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLS----- 259

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
           S        FGCG    G + + A   GL GLG  KTS+P    ++      F+ CF   
Sbjct: 260 SYDAIKGFRFGCGERNEGLYGEAA---GLLGLGRGKTSLPVQAYDK--YGGVFAHCFPAR 314

Query: 275 SDGTGRISFGDKGSP---GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
           S GTG + FG    P    +  TP  +      Y + +T + VGG  ++   S       
Sbjct: 315 SSGTGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGT 374

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVN 384
           I DSGT  T L   AY+ +   F S   E+       L   + CY  +   +    P V+
Sbjct: 375 IVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFT-GMSEVAIPTVS 433

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           L  +GG    V+   +I ++      L   G  + D+V I+G
Sbjct: 434 LLFQGGASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVG 475


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 94/307 (30%), Positives = 142/307 (46%), Gaps = 38/307 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            SL  L Y   V +G PA++  +++DTGSD+ W+ C  C  C   ++S         ++ 
Sbjct: 124 TSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDS---------LFD 174

Query: 157 PNTSSTSSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
           P+ SST S   C+S  C    + Q+    + S C Y V Y+ DG+ +TG    D L L +
Sbjct: 175 PSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYV-DGSSTTGTYSSDTLTLGS 233

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
           +  +         FGC + ++G F D    +GL GLG D  S+ S  A  G    +FS C
Sbjct: 234 NAIKG------FQFGCSQSESGGFSD--QTDGLMGLGGDAQSLVSQTA--GTFGKAFSYC 283

Query: 273 F--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVN-----FEF 323
                  +G ++ G     G  +TP  LR T  PT Y + +  + VGG  +N     F  
Sbjct: 284 LPPTPGSSGFLTLGAASRSGFVKTPM-LRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSA 342

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            ++ DSGT  T L   AY+ +S  F +  K+      S +  + C+  S  Q++   P V
Sbjct: 343 GSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGI-LDTCFDFS-GQSSVSIPSV 400

Query: 384 NLTMKGG 390
            L   GG
Sbjct: 401 ALVFSGG 407


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 39/312 (12%)

Query: 55  GSFAYYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGND-----TYRLNSLGFLHYTN 108
           G F+     A R+R    L+   ++ Q      L F AG D     + R +++G L+Y  
Sbjct: 38  GIFSVKYKYAGRERSLSTLKAHDISRQ------LRFLAGIDIPLGGSGRPDAVG-LYYAK 90

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           + +G P+  + V +DTGSD+ W+ C  C  C     SS G  ++   Y    S+T   V 
Sbjct: 91  IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPR--TSSLG--MELTPYDLEESTTGKLVS 146

Query: 168 CNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVD 221
           C+   C      P +G     +CPY ++   DG+ + G+ V+D +     + + ++ + +
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205

Query: 222 SRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGT 278
             I FGCG  Q+G        A +G+ G G   +S+ S LA+   +   F+ C  G++G 
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSA------IFDS 329
           G  + G    P    TP    Q H  YN+ +T V VG   +N     F A      I DS
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFEAGDRKGTIIDS 323

Query: 330 GTSFTYLNDPAY 341
           GT+  YL +  Y
Sbjct: 324 GTTLAYLPELIY 335


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 152/330 (46%), Gaps = 36/330 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P +SST   
Sbjct: 86  TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPESSSTYQP 136

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V C      +   C S    C Y+ +Y ++ + S+G L ED++       QS+    R  
Sbjct: 137 VKCT-----IDCNCDSDRMQCVYERQY-AEMSTSSGVLGEDLISFGN---QSELAPQRAV 187

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L ++ +I +SFS+C+G    G G +  
Sbjct: 188 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVL 246

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYL 336
           G    P      +S     P YNI + ++ V G       N  + +   + DSGT++ YL
Sbjct: 247 GGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 306

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
            + A+    +      +  ++ S  D  + + C+    +  +Q +  +PVV++  + G  
Sbjct: 307 PEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQK 366

Query: 393 FFVN-DPIVIVSSEPKGLYLYCLGVVKSDN 421
           + ++ +  +   S+ +G   YCLGV ++ N
Sbjct: 367 YTLSPENYMFRHSKVRG--AYCLGVFQNGN 394


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 75/261 (28%), Positives = 124/261 (47%), Gaps = 33/261 (12%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P+  F + +D+GS + ++PC  C  C +  +           + P+ SST S 
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPR---------FQPDLSSTYSP 143

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C +  S C Y+ +Y ++ + S+G L ED++      K+S+    R  
Sbjct: 144 VKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRAV 194

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
           FGC   +TG      A +G+ GLG  + S+   L  +G+I +SFS+C+G    G  +   
Sbjct: 195 FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253

Query: 286 KGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTYL 336
            G P   +  FS       P YNI + ++ V G A+       N +   + DSGT++ YL
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313

Query: 337 NDPAYT----QISETFNSLAK 353
            + A+      ++   NSL K
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKK 334


>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/222 (30%), Positives = 106/222 (47%), Gaps = 18/222 (8%)

Query: 99  NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSP 157
           N +  ++YT + +G P   F V +DTGSD+ W+ C  CV C          + +   + P
Sbjct: 76  NPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGC---------PLQNVTFFDP 126

Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
             SS++ K+ C+   C       S  S   Y+V Y SDG+ ++G+ + D++   T    +
Sbjct: 127 GASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEY-SDGSFTSGYYISDLISFETVMSSN 185

Query: 218 KSVDSR--ISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
            +V S     FGC  +  G   L   + +G+ GLG  +  V S L++Q L P  FS+C  
Sbjct: 186 LTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLS 245

Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
            G +G G I  G+   P    TP    QTH  YN+ +   +V
Sbjct: 246 GGQEGGGVIILGENRLPNTVYTPLVRSQTH--YNVNLKTFAV 285


>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
          Length = 290

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 112/221 (50%), Gaps = 16/221 (7%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P     V +DTGSD+ W+ C   SC +G   +SG  I  N + P +SSTS
Sbjct: 76  LYYTKVKLGTPPRELYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPGSSSTS 132

Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C    C    Q     C    + C Y  +Y  DG+ ++G+ V D++H A+  + + 
Sbjct: 133 SLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTL 191

Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           + +S  S  FGC  +QTG       A +G+FG G    SV S L++QG+ P  FS C   
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251

Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
           D  G G +  G+   P    +P  L  + P YN+ +  +SV
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVPSQPHYNLNLQSISV 290


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 34/335 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +D+GS + ++PC   SC    N    +      + P+ SST S V
Sbjct: 87  TRLYIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 138

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            C++        C S  S C Y+ +Y ++ + S+G L ED++   T   +S+    R  F
Sbjct: 139 KCSADCT-----CDSDKSQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 189

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
           GC   +TG      A +G+ GLG  + S+   L ++G+I +SFSMC+G    G G +  G
Sbjct: 190 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 248

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
              +P       S     P YNI + ++ V G A+  +          + DSGT++ YL 
Sbjct: 249 AMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLP 308

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
           + A+    +   S  +  ++    D  + + C+     + +Q +  +P V++   G G  
Sbjct: 309 EQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVF-GDGQK 367

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
               P   +    K    YCLGV ++  D   ++G
Sbjct: 368 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 402


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 151/346 (43%), Gaps = 42/346 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   + +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 175 RALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQE--------KLFD 226

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 227 PARSSTYANVSCAAPACSDLYTRGCSGGHCLYSVQY-GDGSYSIGFFAMDTLTLSSYDAV 285

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 286 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 335

Query: 275 SDGTGRISFGDKGSP---GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
           S GTG + FG  GSP   G  +T   L    PT Y + +T + VGG  ++   S      
Sbjct: 336 SSGTGYLDFG-PGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAG 394

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I DSGT  T L   AY+ +   F S +A    + + +    + CY  +   +    P V
Sbjct: 395 TIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFT-GMSEVAIPKV 453

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
           +L  +GG    VN   ++ ++    L   CLG   +   D+V I+G
Sbjct: 454 SLLFQGGAYLDVNASGIMYAAS---LSQVCLGFAANEDDDDVGIVG 496


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 142/329 (43%), Gaps = 34/329 (10%)

Query: 73  RGRGLAAQGNDKTPLTFSAGNDTYR-LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
           R R +AA+ N  +  + +   D    L+  G  +  ++SVG P   F    DTGSDL W+
Sbjct: 22  RVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWV 81

Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
             + C  C  G            I+ P  SST  ++ C+S LC EL   C    S C Y 
Sbjct: 82  QSEPCTGCSGG-----------TIFDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYS 130

Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
             Y S  T   G    D + L T    S+   S  + GCG V +G   DG   +GL GLG
Sbjct: 131 YEYGSGET--EGEFARDTISLGTTSDGSQKFPS-FAVGCGMVNSG--FDGV--DGLVGLG 183

Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDK----GSPGQGETPFSLRQT 301
               S+ S L+    I + FS C         +  + FG      G+  Q         T
Sbjct: 184 QGPVSLTSQLS--AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDT 241

Query: 302 HPTYNI-TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           +PTY + T+  ++V G  +    + I DSGT+ TY+    Y ++     S+    R    
Sbjct: 242 YPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPR-VDG 300

Query: 361 SDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           S +  + CY  S N+ N+++P + + + G
Sbjct: 301 SSMGLDLCYDRSSNR-NYKFPALTIRLAG 328


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 90/297 (30%), Positives = 138/297 (46%), Gaps = 47/297 (15%)

Query: 93  NDTYRL-NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
           N++Y    S G+  +   + +G P    +V +DTGSDL W+  + C +C    +      
Sbjct: 11  NESYEFPESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADP----- 65

Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
               I+ P+ SST +K+ C+S+ C   L  Q  SA +NC Y   Y  DG+++ G+  ++ 
Sbjct: 66  ----IFDPSKSSTYNKIACSSSACADLLGTQTCSAAANCIYAYGY-GDGSVTRGYFSKET 120

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           +  ATD     +    + FG     TG+F D     G+ GLG    S+PS L +  ++ N
Sbjct: 121 I-TATD-----TAGEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGN 171

Query: 268 SFSMCF------GSDGTGRISFGDKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGN 317
            FS C       GS+ T  + FGD   P  GE   TP      HPT Y I +  +SVGG+
Sbjct: 172 KFSYCLVDWLSAGSE-TSTMYFGDAAVP-SGEVQYTPIVPNADHPTYYYIAVQGISVGGS 229

Query: 318 AVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
            ++ + S            I DSGT+ TYL    +  +   + S  +    TS + L
Sbjct: 230 LLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATGL 286


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 151/345 (43%), Gaps = 36/345 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT + +G P     V +DTGSD+ W+ C  C SC+    S    +   +IY+ + SST
Sbjct: 82  LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCL----SKQDIIPPLSIYNLSASST 137

Query: 163 SSKVPCNSTLCELQK-QCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           SS   C+  LC  ++  C  +G+N  C Y   Y  D + S G  V D +H       + +
Sbjct: 138 SSVSSCSDPLCTGEEVVCSRSGNNSACAYVSSY-QDKSASVGAYVRDDMHYVLHGGNATT 196

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
             SRI FGC    TGS+      +G+ G G+   +VP+ +A Q  +   FS C G +  G
Sbjct: 197 --SRIFFGCATNITGSW----PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHG 250

Query: 278 TGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNF---EFS--------- 324
            G + FG+  +P   E  F+ L      YN+ +  +SV    +     EFS         
Sbjct: 251 GGILEFGE--APNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNT 308

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+F  L   A   + +   SL   K       L  E  Y+ S       +P V
Sbjct: 309 GVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGL--ECFYLKSGLTMETSFPNV 366

Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
            LT  GG    +  D  ++++   K    YC     +D + I G 
Sbjct: 367 TLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADGLTIFGE 411


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/329 (26%), Positives = 145/329 (44%), Gaps = 34/329 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P+ SST   
Sbjct: 79  TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQC--------GKHQDPR-FQPDLSSTYRP 129

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN + C     C   G  C Y+ RY ++ + S+G + EDV+    +   S+    R  
Sbjct: 130 VKCNPS-C----NCDDEGKQCTYERRY-AEMSSSSGVIAEDVVSFGNE---SELKPQRAV 180

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG  + SV   L ++G+I +SFS+C+G    G G +  
Sbjct: 181 FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVL 239

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P       S     P YNI + ++ V G  +         +   + DSGT++ Y 
Sbjct: 240 GQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYF 299

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPNQTNF---EYPVVNLTMKGGGP 392
            + A+  + +      +  ++    D  + + C+  +  + +     +P VN+   G G 
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVF-GSGQ 358

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
                P   +    K    YCLG+ ++ N
Sbjct: 359 KLSLSPENYLFRHTKVSGAYCLGIFQNGN 387


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 57/357 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  ++VG P    +V +DTGSDL WL   CV C H     +       +Y P +SST  
Sbjct: 88  YFAVINVGDPPTRALVVIDTGSDLIWL--QCVPCRHCYRQVT------PLYDPRSSSTHR 139

Query: 165 KVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           ++PC S  C        C +    C Y V Y  DG+ S+G L  D L    D        
Sbjct: 140 RIPCASPRCRDVLRYPGCDARTGGCVYMVVY-GDGSASSGDLATDRLVFPDDTHVHN--- 195

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG------S 275
             ++ GCG    G  L+ AA  GL G+G  + S P+ LA      + FS C G       
Sbjct: 196 --VTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRLSRAQ 248

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
           +G+  + FG   +P    T F+  +T+P     Y + +   SVGG  V    +A      
Sbjct: 249 NGSSYLVFGR--TPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNP 306

Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCYVLSPN- 374
                  + DSGT+ +     AY  + + F+S A      R+ +T    F+ CY L  N 
Sbjct: 307 ATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNG 366

Query: 375 --QTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
                   P + L   GG    +     ++ V    +  Y +CLG+  +D+ +N++G
Sbjct: 367 APAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTY-FCLGLQAADDGLNVLG 422


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 84/273 (30%), Positives = 120/273 (43%), Gaps = 32/273 (11%)

Query: 105 HYTNV-SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           HY+ + ++G P  +F + +DTGSDL W+ CD  C  C   L+          +Y P    
Sbjct: 67  HYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDK---------LYKPK--- 114

Query: 162 TSSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            +++VPC S+LC+      C      C Y+V Y   G+ S G L+ D   L  +      
Sbjct: 115 -NNRVPCASSLCQAIQNNNCDIPTEQCDYEVEYADLGS-SLGVLLSDYFPLRLNN--GSL 170

Query: 220 VDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           +  RI+FGCG  Q   +L   +P    G+ GLG  K S+ S L   G+  N    CF   
Sbjct: 171 LQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRV 228

Query: 277 GTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
             G + FGD   P  G   TP     +   Y+    ++  GG     +    IFDSG+S+
Sbjct: 229 TGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSY 288

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
           TY N   Y  I    N + K+       D P E
Sbjct: 289 TYFNAQVYQSI---LNLVRKDLSGMPLKDAPEE 318


>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Glycine max]
          Length = 454

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 134/309 (43%), Gaps = 38/309 (12%)

Query: 78  AAQGNDKTPLTFSAGNDTYRLNSLGFL----------HYT-NVSVGQPALSFIVALDTGS 126
           A   N K P T  + N+ +RL+S              HYT ++++G P   + + +D+GS
Sbjct: 26  AQPRNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGS 85

Query: 127 DLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQKQC 179
           DL W+ CD  C  C    +          +Y PN     + V C   LC      +   C
Sbjct: 86  DLTWVQCDAPCKGCTKPRD---------QLYKPN----HNLVQCVDQLCSEVHLSMAYNC 132

Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
           PS    C Y+V Y   G+ S G LV D  ++         V  R++FGCG  Q  S  + 
Sbjct: 133 PSPDDPCDYEVEYADHGS-SLGVLVRD--YIPFQFTNGSVVRPRVAFGCGYDQKYSGSNS 189

Query: 240 A-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
             A +G+ GLG  + S+ S L + GLI N    C  + G G + FGD   P  G    S+
Sbjct: 190 PPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSM 249

Query: 299 RQTHPTYNITI--TQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
             +    + +    ++   G A   +    IFDSG+S+TY N  AY  + +      K K
Sbjct: 250 LSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGK 309

Query: 356 RETSTSDLP 364
           +    +D P
Sbjct: 310 QLKRATDDP 318


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/298 (28%), Positives = 128/298 (42%), Gaps = 35/298 (11%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P  +    +DTGSD+ WL C  C  C               I++P+ SS+   +PC
Sbjct: 92  SVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTP---------IFNPSKSSSYKNIPC 142

Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           +S LC+  +       N C Y + + SD + S G L  + L L +    S S    +  G
Sbjct: 143 SSNLCQSVRYTSCNKQNSCEYTINF-SDQSYSQGELSVETLTLDSTTGHSVSFPKTV-IG 200

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGRIS 282
           CG    G F      +G+ GLG+   S+ + L +   I   FS C       S+ T +++
Sbjct: 201 CGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLN 256

Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF-------SAIFDSGTS 332
           FGD       G   TPF  +     Y +T+   SVG   + FE        + I DSGT+
Sbjct: 257 FGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTT 316

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            T L    YT +      L K  R    + L    CY ++ +Q  +++P++    KG 
Sbjct: 317 LTLLPSHVYTNLESAVAQLVKLDRVDDPNQL-LNLCYSITSDQ--YDFPIITAHFKGA 371


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 142/329 (43%), Gaps = 34/329 (10%)

Query: 73  RGRGLAAQGNDKTPLTFSAGNDTYR-LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
           R R +AA+ N  +  + +   D    L+  G  +  ++SVG P   F    DTGSDL W+
Sbjct: 22  RVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWV 81

Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
             + C  C  G            I+ P  SST  ++ C+S LC EL   C    S C Y 
Sbjct: 82  QSEPCTGCSGG-----------TIFDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYS 130

Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
             Y S  T   G    D + L T    S+   S  + GCG V +G   DG   +GL GLG
Sbjct: 131 YEYGSGET--EGEFARDTISLGTTSGGSQKFPS-FAVGCGMVNSG--FDGV--DGLVGLG 183

Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDK----GSPGQGETPFSLRQT 301
               S+ S L+    I + FS C         +  + FG      G+  Q         T
Sbjct: 184 QGPVSLTSQLS--AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDT 241

Query: 302 HPTYNI-TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           +PTY + T+  ++V G  +    + I DSGT+ TY+    Y ++     S+    R    
Sbjct: 242 YPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPR-VDG 300

Query: 361 SDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           S +  + CY  S N+ N+++P + + + G
Sbjct: 301 SSMGLDLCYDRSSNR-NYKFPALTIRLAG 328


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 100/341 (29%), Positives = 146/341 (42%), Gaps = 35/341 (10%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 176 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 227

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P +SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 228 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 286

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P  +   G     F+ C    
Sbjct: 287 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPAR 336

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
           S GTG + FG    P    TP  L    PT Y + +T + VGG  +      F+A   I 
Sbjct: 337 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 395

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK--RETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           DSGT  T L   AY+ +   F +    +  R+ +   L  + CY  +   +    P V+L
Sbjct: 396 DSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL-LDTCYDFT-GMSQVAIPTVSL 453

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             +GG    V+   ++ +     + L   G     +V I+G
Sbjct: 454 LFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVG 494


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 145/338 (42%), Gaps = 42/338 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   V +G PA  + V  DTGSD  W+   C  CV       G + D     P  SST +
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWV--QCRPCVVKCYKQKGPLFD-----PAKSSTYA 215

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            V C  + C         G +C Y V+Y  DG+ + GF  +D L +A D  +        
Sbjct: 216 NVSCTDSACADLDTNGCTGGHCLYAVQY-GDGSYTVGFFAQDTLTIAHDAIKG------F 268

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRIS 282
            FGCG    G F   A   GL GLG  KTS+     N+     +F+ C  +   GTG + 
Sbjct: 269 RFGCGEKNNGLFGKTA---GLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGTGYLD 323

Query: 283 FGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFT 334
           FG  GS G     TP    +    Y + +T + VGG  V    S       + DSGT  T
Sbjct: 324 FG-PGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVIT 382

Query: 335 YLNDPAYTQISETFNS--LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
            L   AYT +S  F+   LA+  ++     +  + CY  +   ++ E P V+L  +GG  
Sbjct: 383 RLPATAYTALSSAFDKVMLARGYKKAPGYSI-LDTCYDFT-GLSDVELPTVSLVFQGGAC 440

Query: 393 FFVN-DPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
             V+   IV   SE +     CL    +   ++V I+G
Sbjct: 441 LDVDVSGIVYAISEAQ----VCLAFASNGDDESVAIVG 474


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 101/316 (31%), Positives = 136/316 (43%), Gaps = 36/316 (11%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           F +   VS+G P +S  V +DTGSD+ W+   PC   +C    NS   Q+ D     P  
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPAC----NSQRDQLFD-----PAK 191

Query: 160 SSTSSKVPCNSTLC-ELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           SST S VPC +  C EL+  +   +GS C Y V Y  DG+ +TG    D L LA      
Sbjct: 192 SSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSY-GDGSNTTGVYGSDTLALAPGNTVG 250

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
             +     FGCG  Q G F   A  +GL  LG    S+ S  A  G     FS C  S  
Sbjct: 251 TFL-----FGCGHAQAGMF---AGIDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQ 300

Query: 277 -GTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFD 328
              G ++ G   S  G   T        PT Y + +T +SVGG  V    SA     + D
Sbjct: 301 SAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVD 360

Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           +GT  T L   AY  +   F  ++A     ++ ++   + CY  S        P V LT 
Sbjct: 361 TGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFS-RYGVVTLPTVALTF 419

Query: 388 KGGGPFFVNDPIVIVS 403
            GG    +  P ++ S
Sbjct: 420 SGGATLALEAPGILSS 435


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 150/366 (40%), Gaps = 57/366 (15%)

Query: 87  LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSS 145
           +  +AG    R  S    +     +G P  + +VA+D  +D  W+PC  C+ C  G +S 
Sbjct: 86  VPIAAGRQILRTPS----YVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSP 141

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCE----LQKQCPSA-GSNCPYQVRYLSDGTMST 200
           S        + P  SST   V C +  C         CP+  G++C + + Y S    + 
Sbjct: 142 S--------FDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA- 192

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSIL 259
             L +D L L +D   +   D   +FGC RV TGS      P GL G G    S +    
Sbjct: 193 -VLGQDALSL-SDSNGAAVPDDHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPLSFLSQTK 249

Query: 260 ANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVS 313
           A  G I   FS C      S+ +G +  G  G P + +T   L   H P+ Y + +  V 
Sbjct: 250 ATYGSI---FSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVR 306

Query: 314 VGGNAVNFEFSA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
           V G AV    SA            I D+GT FT L+ PAY  +   F      +R  S  
Sbjct: 307 VNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAF------RRGVSAP 360

Query: 362 DLP----FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
             P    F+ CY ++  ++    P V     GG    + +  V++SS   G+    +   
Sbjct: 361 AAPALGGFDTCYYVNGTKS---VPAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAAG 417

Query: 418 KSDNVN 423
            SD VN
Sbjct: 418 PSDGVN 423


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 79/269 (29%), Positives = 120/269 (44%), Gaps = 17/269 (6%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            ++++G+   +F   +D+GSDL W+ CD   C H             +Y PN ++ +   
Sbjct: 57  VSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPRE------QLYKPNNNALNCFE 109

Query: 167 P-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           P C S        C SA   C Y++ Y   G+ S G LV D  H+            RI+
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGS-SLGVLVND--HVPLKLTNGSLAAPRIA 166

Query: 226 FGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
           FGCG     S  D + P  G+ GLG  + S  S L++ G++ N    C   +G G + FG
Sbjct: 167 FGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFG 225

Query: 285 DKGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYLNDPAY 341
           D+  P  G T  S+        Y+    +V   G A    + + +FDSG+S+TY N  AY
Sbjct: 226 DEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAY 285

Query: 342 TQI-SETFNSLAKEKRETSTSDLPFEYCY 369
             I +   N+L  +  E +  D     C+
Sbjct: 286 NSILALVKNNLRGKPLEDAPEDKSLPVCW 314


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 101/316 (31%), Positives = 136/316 (43%), Gaps = 36/316 (11%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           F +   VS+G P +S  V +DTGSD+ W+   PC   +C    NS   Q+ D     P  
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPAC----NSQRDQLFD-----PAK 191

Query: 160 SSTSSKVPCNSTLC-ELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           SST S VPC +  C EL+  +   +GS C Y V Y  DG+ +TG    D L LA      
Sbjct: 192 SSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSY-GDGSNTTGVYGSDTLALAPGNTVG 250

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
             +     FGCG  Q G F   A  +GL  LG    S+ S  A  G     FS C  S  
Sbjct: 251 TFL-----FGCGHAQAGMF---AGIDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQ 300

Query: 277 -GTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFD 328
              G ++ G   S  G   T        PT Y + +T +SVGG  V    SA     + D
Sbjct: 301 SAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVD 360

Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           +GT  T L   AY  +   F  ++A     ++ ++   + CY  S        P V LT 
Sbjct: 361 TGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDFS-RYGVVTLPTVALTF 419

Query: 388 KGGGPFFVNDPIVIVS 403
            GG    +  P ++ S
Sbjct: 420 SGGATLALEAPGILSS 435


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 100/341 (29%), Positives = 146/341 (42%), Gaps = 35/341 (10%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 223

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P +SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 224 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P  +   G     F+ C    
Sbjct: 283 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPAR 332

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
           S GTG + FG    P    TP  L    PT Y + +T + VGG  +      F+A   I 
Sbjct: 333 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 391

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK--RETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           DSGT  T L   AY+ +   F +    +  R+ +   L  + CY  +   +    P V+L
Sbjct: 392 DSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL-LDTCYDFT-GMSQVAIPTVSL 449

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             +GG    V+   ++ +     + L   G     +V I+G
Sbjct: 450 LFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVG 490


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 100/341 (29%), Positives = 146/341 (42%), Gaps = 35/341 (10%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 224

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P +SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 225 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P  +   G     F+ C    
Sbjct: 284 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPPR 333

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
           S GTG + FG    P    TP  L    PT Y + +T + VGG  +      F+A   I 
Sbjct: 334 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 392

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK--RETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           DSGT  T L   AY+ +   F +    +  R+ +   L  + CY  +   +    P V+L
Sbjct: 393 DSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL-LDTCYDFT-GMSQVAIPTVSL 450

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             +GG    V+   ++ +     + L   G     +V I+G
Sbjct: 451 LFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVG 491


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 84/293 (28%), Positives = 131/293 (44%), Gaps = 30/293 (10%)

Query: 67  DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTN-VSVGQPALSFIVALDTG 125
           DR F  RGR L             +   T   + L   +YT+ V +G P   F + +DTG
Sbjct: 12  DRRFERRGRKLE-----------ESARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTG 60

Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFN--IYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
           S + ++PC  C  C H   S S   +      + P  SS+  K+ C S+ C +   C S 
Sbjct: 61  STVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQKIGCRSSDC-ITGLCDSN 119

Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
              C Y+ R  ++ + S G L +D+L      +    +   +SFGC   ++G      A 
Sbjct: 120 SHQCKYE-RMYAEMSTSKGVLGKDLLDFGPASRLQSQL---LSFGCETAESGDLYLQVA- 174

Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQ 300
           +G+ GLG    S+   L   G I +SFS+C+G   +G G +  G   +P       S  +
Sbjct: 175 DGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPR 234

Query: 301 THPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
               YN+ +T++ V G       N  N +F  I DSGT++ YL D A+   ++
Sbjct: 235 RSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSGTTYAYLPDRAFEAFTD 287


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 147/330 (44%), Gaps = 36/330 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C    +           + P +SST   
Sbjct: 85  TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FDPESSSTYKP 135

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + CN     +   C S G  C Y+ +Y ++ + S+G L EDV+       QS+ +  R  
Sbjct: 136 IKCN-----IDCICDSDGVQCVYERQY-AEMSTSSGVLGEDVISFGN---QSELIPQRAV 186

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  ++TG      A +G+ GLG    S+   L  +G I +SFS+C+G    G G +  
Sbjct: 187 FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P      +S     P YN+ + ++ V G  +          + A+ DSGT++ YL
Sbjct: 246 GGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYL 305

Query: 337 NDPAYT----QISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
              A++     I +  +SL K +  + +  D+ F      +   +N ++P V++  + G 
Sbjct: 306 PAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSN-KFPTVDMVFENGQ 364

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
              +  P        K    YCLG+ ++ N
Sbjct: 365 KLSLT-PENYFFRHSKVHGAYCLGIFENGN 393


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 166/374 (44%), Gaps = 47/374 (12%)

Query: 34  FHHRYSD----PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
            HHR+      P K + +++D   +      +A   R     ++  G  A G +++ +T 
Sbjct: 61  LHHRHGPCSPLPTKKMPSLEDRLHRDQL--RAAYIKRKFSGDVKKDGQGAGGVEQSHVTV 118

Query: 90  SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQ 148
                T  LN+L +L    V +G PA +  V +D+GSD+ W+ C  C+ C   ++     
Sbjct: 119 PTTLGT-SLNTLEYL--ITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDP---- 171

Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLV 204
                ++ P+ SST S   C+S  C    Q    C S+ S C Y VRY +DG+ +TG   
Sbjct: 172 -----LFDPSLSSTYSPFSCSSAACAQLGQDGNGC-SSSSQCQYIVRY-ADGSSTTGTYS 224

Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
            D L L ++        S   FGC  V++G F D    +GL GLG    S+ S  A  G 
Sbjct: 225 SDTLALGSN------TISNFQFGCSHVESG-FND--LTDGLMGLGGGAPSLASQTA--GT 273

Query: 265 IPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN- 320
              +FS C       +G ++ G  G+ G  +TP       PT Y + +  + VGG  ++ 
Sbjct: 274 FGTAFSYCLPPTPSSSGFLTLG-AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSI 332

Query: 321 ----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
               F    + DSGT  T L   AY+ +S  F +  K+ R      +  + C+  S  Q+
Sbjct: 333 PTSVFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSI-MDTCFDFS-GQS 390

Query: 377 NFEYPVVNLTMKGG 390
           +   P V L   GG
Sbjct: 391 SVRLPSVALVFSGG 404


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 138/308 (44%), Gaps = 55/308 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +    ++G PA   +VALDT +D  W+PC  CV C   +           ++ P+ SS+S
Sbjct: 91  YIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV-----------LFDPSKSSSS 139

Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
             + C++  C   KQ P    +AG +C + + Y   G+     L +D L LA D  +S  
Sbjct: 140 RNLQCDAPQC---KQAPNPTCTAGKSCGFNMTY--GGSTIEASLTQDTLTLANDVIKS-- 192

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
                +FGC    TG+ L      GL GLG    S+  I   Q L  ++FS C      S
Sbjct: 193 ----YTFGCISKATGTSLPA---QGLMGLGRGPLSL--ISQTQNLYMSTFSYCLPNSKSS 243

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
           + +G +  G K  P + +T   L+    +  Y + +  + VG   V+   SA        
Sbjct: 244 NFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTG 303

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
              IFDSGT FT L +PAY  +   F    K    TS     F+ CY   V+ P+ T F 
Sbjct: 304 AGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLGG--FDTCYSGSVVYPSVT-FM 360

Query: 380 YPVVNLTM 387
           +  +N+T+
Sbjct: 361 FAGMNVTL 368


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 77/279 (27%), Positives = 123/279 (44%), Gaps = 34/279 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  + V +DTGSD+ W+ C  C  C     + S   +D  +Y    S+T
Sbjct: 77  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 132

Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
           S  V C+   C L     P    G  C Y V Y  DG+ +TG+ V+D +     +   Q+
Sbjct: 133 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 191

Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
              +  + FGCG  Q+G     + A +G+ G G   +S+ S LA+ G +   FS C  + 
Sbjct: 192 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 251

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQ---------THPTYNITITQVSVGGNAVNFEFSA- 325
           DG G  + G+   P   +  F L           +   YN+ + ++ VGG+ ++    A 
Sbjct: 252 DGGGIFAIGEVVEP---KVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAF 308

Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
                   I DSGT+  Y     Y  + E   S   + R
Sbjct: 309 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLR 347


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 147/330 (44%), Gaps = 36/330 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C    +           + P +SST   
Sbjct: 85  TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FDPESSSTYKP 135

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + CN     +   C S G  C Y+ +Y ++ + S+G L EDV+       QS+ +  R  
Sbjct: 136 IKCN-----IDCICDSDGVQCVYERQY-AEMSTSSGVLGEDVISFGN---QSELIPQRAV 186

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  ++TG      A +G+ GLG    S+   L  +G I +SFS+C+G    G G +  
Sbjct: 187 FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P      +S     P YN+ + ++ V G  +          + A+ DSGT++ YL
Sbjct: 246 GGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYL 305

Query: 337 NDPAYT----QISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
              A++     I +  +SL K +  + +  D+ F      +   +N ++P V++  + G 
Sbjct: 306 PAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSN-KFPTVDMVFENGQ 364

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
              +  P        K    YCLG+ ++ N
Sbjct: 365 KLSLT-PENYFFRHSKVHGAYCLGIFENGN 393


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 34/335 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +D+GS + ++PC   SC    N    +      + P+ SST S V
Sbjct: 90  TRLHIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 141

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            CN     +   C S  + C Y+ +Y ++ + S+G L ED++   T   +S+    R  F
Sbjct: 142 KCN-----VDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 192

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
           GC   +TG      A +G+ GLG  + S+   L ++G+I +SFSMC+G    G G +  G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
              +P       S     P YNI + ++ V G A+  +          + DSGT++ YL 
Sbjct: 252 AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLP 311

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
           + A+    +  +S     ++    D  + + C+     + +Q +  +P V++   G G  
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVF-GNGQK 370

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
               P   +    K    YCLGV ++  D   ++G
Sbjct: 371 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 405


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 146/329 (44%), Gaps = 34/329 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P +SST   
Sbjct: 114 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPESSSTYQP 164

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V C      +   C      C Y+ +Y ++ + S+G L EDV+       QS+    R  
Sbjct: 165 VKCT-----IDCNCDGDRMQCVYERQY-AEMSTSSGVLGEDVISFG---NQSELAPQRAV 215

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L ++ +I +SFS+C+G    G G +  
Sbjct: 216 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVL 274

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYL 336
           G    P      +S     P YNI + ++ V G       N  + +   + DSGT++ YL
Sbjct: 275 GGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 334

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPN---QTNFEYPVVNLTMKGGGP 392
            + A+    +      +  ++ S  D  + + C+  + N   Q +  +PVV++   G G 
Sbjct: 335 PEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVF-GNGH 393

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
            +   P   +    K    YCLG+ ++ N
Sbjct: 394 KYSLSPENYMFRHSKVRGAYCLGIFQNGN 422


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 77/256 (30%), Positives = 112/256 (43%), Gaps = 32/256 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ +G P   + + +DTGSDL W+ CD  C +C  G +          +Y P   + 
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 234

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP    LC+     Q  C +    C Y++ Y +D + S G L  D +H+       +
Sbjct: 235 EKIVPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHMIATNGGRE 292

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
            +D    FGC   Q G  L   A  +G+ GL     S PS LA+ G+I N F  C   + 
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQ 350

Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
            G G +  GD   P  G T  S+R      Y+     V  G   +     A      IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410

Query: 329 SGTSFTYLNDPAYTQI 344
           SG+S+TYL +  Y  +
Sbjct: 411 SGSSYTYLPNEIYENL 426


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 34/335 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +D+GS + ++PC   SC    N    +      + P+ SST S V
Sbjct: 90  TRLHIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 141

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            CN     +   C S  + C Y+ +Y ++ + S+G L ED++   T   +S+    R  F
Sbjct: 142 KCN-----VDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 192

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
           GC   +TG      A +G+ GLG  + S+   L ++G+I +SFSMC+G    G G +  G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
              +P       S     P YNI + ++ V G A+  +          + DSGT++ YL 
Sbjct: 252 AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLP 311

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
           + A+    +  +S     ++    D  + + C+     + +Q +  +P V++   G G  
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVF-GNGQK 370

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
               P   +    K    YCLGV ++  D   ++G
Sbjct: 371 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 405


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 150/347 (43%), Gaps = 44/347 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + + C +  C        +G NC Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 225 PARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333

Query: 275 SDGTGRISFGDKGSPGQG----ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
           S GTG + FG  GSP        TP  L    PT Y + +T + VGG  ++   S     
Sbjct: 334 SSGTGYLDFG-PGSPAAAGARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTA 391

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
             I DSGT  T L   AY+ +   F S +A    + + +    + CY  +   +    P 
Sbjct: 392 GTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFT-GMSQVAIPT 450

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNIIG 426
           V+L  +GG    V+   ++ ++    +   CLG   ++   +V I+G
Sbjct: 451 VSLLFQGGARLDVDASGIMYAAS---VSQVCLGFAANEDGGDVGIVG 494


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 87/311 (27%), Positives = 138/311 (44%), Gaps = 43/311 (13%)

Query: 51  LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVS 110
           LP   S+   S LA   R    RG G  A  N +  L      + Y        + T + 
Sbjct: 47  LPLTRSYPNASRLAASSR----RGLGDGAHPNARMRLHDDLLTNGY--------YTTRLY 94

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           +G P   F + +D+GS + ++PC   SC    N    +      + P+ SS+ S V CN 
Sbjct: 95  IGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSSYSPVKCN- 145

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
               +   C S    C Y+ +Y ++ + S+G L ED++      ++S+    R  FGC  
Sbjct: 146 ----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKPQRAVFGCEN 197

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
            +TG      A +G+ GLG  + S+   L  +G+I +SFS+C+G    G  +    G P 
Sbjct: 198 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPA 256

Query: 291 QGETPFS----LRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTYLNDP 339
             +  FS    LR   P YNI + ++ V G A+       N +   + DSGT++ YL + 
Sbjct: 257 PSDMVFSHSDPLRS--PYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQ 314

Query: 340 AYTQISETFNS 350
           A+    +   S
Sbjct: 315 AFVAFKDAVTS 325


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 122/283 (43%), Gaps = 46/283 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +   +Y P  +  
Sbjct: 53  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL 103

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
              VPC + LC           +CPS    C YQ++Y +D   S G L+ D   L     
Sbjct: 104 ---VPCANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM--- 155

Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +S ++   ++FGCG  Q    +    AA +G+ GLG    S+ S L  QG+  N    C 
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------F 323
            ++G G + FGD   P    T  P + R +   Y       S G   + F+         
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPM 268

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
             +FDSG+++TY     Y  +       L+K  ++ S   LP 
Sbjct: 269 EVVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 311


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 81/283 (28%), Positives = 122/283 (43%), Gaps = 46/283 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +   +Y P  +  
Sbjct: 53  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL 103

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
              VPC + LC           +CPS    C YQ++Y +D   S G L+ D   L     
Sbjct: 104 ---VPCANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM--- 155

Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +S ++   ++FGCG  Q    +    AA +G+ GLG    S+ S L  QG+  N    C 
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215

Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------F 323
            ++G G + FGD   P    T  P + R +   Y       S G   + F+         
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPM 268

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
             +FDSG+++TY     Y  +       L+K  ++ S   LP 
Sbjct: 269 EVVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 311


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 145/350 (41%), Gaps = 52/350 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +V +G P   F   +DTGSDL W  C  C+ CV        Q   +  + P  S++ 
Sbjct: 88  YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVE-------QPTPY--FEPAKSTSY 138

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + +PC+S +C          + C YQ  Y  D   S G L  +     T+   ++    R
Sbjct: 139 ASLPCSSAMCNALYSPLCFQNACVYQAFY-GDSASSAGVLANETFTFGTNS--TRVAVPR 195

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           +SFGCG +  G+  +G   +G+ G G    S+ S L +       FS C   F S  T R
Sbjct: 196 VSFGCGNMNAGTLFNG---SGMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSR 247

Query: 281 ISFGDKGS----------PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
           + FG   +          P Q  TPF +    PT Y + +T +SV G+ +  + S     
Sbjct: 248 LYFGAYATLNSTNTSSSGPVQ-STPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 306

Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                   I DSGT+ T+L  PAY  +   F +     R  +T    F+ C+   P    
Sbjct: 307 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 366

Query: 378 F-EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               P + L   G       +  +++      L   CL ++ SD+ +IIG
Sbjct: 367 MVTLPEMVLHFDGADMELPLENYMVMDGGTGNL---CLAMLPSDDGSIIG 413


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 145/350 (41%), Gaps = 52/350 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +V +G P   F   +DTGSDL W  C  C+ CV        Q   +  + P  S++ 
Sbjct: 85  YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVE-------QPTPY--FEPAKSTSY 135

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + +PC+S +C          + C YQ  Y  D   S G L  +     T+   ++    R
Sbjct: 136 ASLPCSSAMCNALYSPLCFQNACVYQAFY-GDSASSAGVLANETFTFGTNS--TRVAVPR 192

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           +SFGCG +  G+  +G   +G+ G G    S+ S L +       FS C   F S  T R
Sbjct: 193 VSFGCGNMNAGTLFNG---SGMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSR 244

Query: 281 ISFGDKGS----------PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
           + FG   +          P Q  TPF +    PT Y + +T +SV G+ +  + S     
Sbjct: 245 LYFGAYATLNSTNTSSSGPVQ-STPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 303

Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                   I DSGT+ T+L  PAY  +   F +     R  +T    F+ C+   P    
Sbjct: 304 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 363

Query: 378 F-EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               P + L   G       +  +++      L   CL ++ SD+ +IIG
Sbjct: 364 MVTLPEMVLHFDGADMELPLENYMVMDGGTGNL---CLAMLPSDDGSIIG 410


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 160/371 (43%), Gaps = 61/371 (16%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYS 156
           L   G  +Y  + VG PA+  ++ +DTGSD+ W+ C  C  CV  L            ++
Sbjct: 132 LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPP---------FN 182

Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
           P  SS+  K+PC S+ C      ++  C  +G  C + ++Y  DG++S+G L  + +   
Sbjct: 183 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQY-GDGSLSSGLLAMETIAGN 241

Query: 212 T----DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           T    D +  K   S I+ GC  +       GA+  GL G+     S PS L+++     
Sbjct: 242 TPNFGDGEPVKL--SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YAR 295

Query: 268 SFSMCFGS-----DGTGRISFG--DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
            FS CF       + +G + FG  D  SP    TP       P+ ++    V + G +V 
Sbjct: 296 KFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVD 355

Query: 320 ---------NFEFS-------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
                    NF+          I DSGT+FTYL  PA+  +   F  LA+        D 
Sbjct: 356 ESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDN 413

Query: 364 P-FEYCYVLSPNQTNFE---YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVV 417
             F  CY ++      E    P + L  +GG    +  N  ++ VSS  +   L CL  +
Sbjct: 414 SGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTL-CLAFL 472

Query: 418 KSDNV--NIIG 426
            S ++  NIIG
Sbjct: 473 MSGDIPFNIIG 483


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 89/300 (29%), Positives = 129/300 (43%), Gaps = 37/300 (12%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P        DTGSD+ WL C+ C  C +             I++P+ SS+   +PC
Sbjct: 92  SVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTP---------IFNPSKSSSYKNIPC 142

Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           +S LC   +    +  N C Y++ Y  D + S G L  D L L +      S   +I  G
Sbjct: 143 SSKLCHSVRDTSCSDQNSCQYKISY-GDSSHSQGDLSVDTLSLESTSGSPVSF-PKIVIG 200

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------GSDGTGRI 281
           CG    G+F  G A +G+ GLG    S+ + L +   I   FS C        S+ +  +
Sbjct: 201 CGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256

Query: 282 SFGDKG-SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIFDSG 330
           SFGD     G G     L +  P  Y +T+   SVG   V F         E + I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           T+ T +    YT +      L K  R     +  F  CY L  N+  +++P++ +  KG 
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDR-VDDPNQQFSLCYSLKSNE--YDFPIITVHFKGA 373


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 149/346 (43%), Gaps = 38/346 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT + +G P     V +DTGSD+ W+ C  C SC+    S    +   +IY+ + SST
Sbjct: 82  LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCL----SKQDIIPPLSIYNLSASST 137

Query: 163 SSKVPCNSTLCE-LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           SS   C+  LC   Q  C  +GSN  C Y + Y  D + S G  V+D +H     +   +
Sbjct: 138 SSVSSCSDPLCTGEQAVCSRSGSNSACAYGISY-QDKSTSIGAYVKDDMHYVL--QGGNA 194

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
             S I FGC    TGS+      +G+ G G    +VP+ +A Q  +   FS C G +  G
Sbjct: 195 TTSHIFFGCAINITGSW----PADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHG 250

Query: 278 TGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAV---NFEFS--------- 324
            G + FG++  P   E  F+ L      YN+ +  +SV    +   + EFS         
Sbjct: 251 GGILEFGEE--PNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNET 308

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT-NFEYPV 382
             I DSGTSF  L   A   +     +L   K       L    C+ L    T    +P 
Sbjct: 309 GVIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQ---CFYLKSGLTVETSFPN 365

Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
           V LT  GG    +  D  +++    K    YC     +D + I G 
Sbjct: 366 VTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADGLTIFGE 411


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 137/308 (44%), Gaps = 35/308 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT + +G P   + V +DTGSD+ W+  + +SC  G  + SG  I+   Y P  S T+
Sbjct: 84  LYYTRIEIGSPPKGYYVQVDTGSDILWV--NGISC-DGCPTRSGLGIELTQYDPAGSGTT 140

Query: 164 SKVPCNSTLC-------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDE 214
             V C    C        +   CPSA S C +++ Y  DG+ +TGF V D +     +  
Sbjct: 141 --VGCEQEFCVANSAASGVPPACPSAASPCQFRITY-GDGSSTTGFYVTDFVQYNQVSGN 197

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
            Q+   +  I+FGCG  Q G  L  +  A +G+ G G    S+ S LA    +   F+ C
Sbjct: 198 GQTTPSNVSITFGCG-AQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHC 256

Query: 273 FGS-DGTGRISFGDKGSPG-QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------ 324
             +  G G  + G+   P     TP     TH  YN+ +  +SVGG  +    S      
Sbjct: 257 LDTVRGGGIFAIGNVVQPPIVKTTPLVPNATH--YNVNLQGISVGGATLQLPTSTFDSGD 314

Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               I DSGT+  YL    Y  +     ++  +  + +  +     C+  S    + E+P
Sbjct: 315 SKGTIIDSGTTLAYLPREVYRTL---LTAVFDKHPDLAVRNYEDFICFQFS-GSLDEEFP 370

Query: 382 VVNLTMKG 389
           V+  + +G
Sbjct: 371 VITFSFEG 378


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 152/347 (43%), Gaps = 44/347 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE--------KLFD 223

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 224 PARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 283 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 332

Query: 275 SDGTGRISFGDKGSPGQG----ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
           S GTG + FG  GSP        TP  L    PT Y + +T + VGG  ++   S     
Sbjct: 333 SSGTGYLDFG-PGSPAAAGARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATA 390

Query: 326 --IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
             I DSGT  T L  PAY+ +   F +++A    + + +    + CY  +   +    P 
Sbjct: 391 GTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFT-GMSQVAIPT 449

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNIIG 426
           V+L  +GG    V+   ++ ++    +   CLG   ++   +V I+G
Sbjct: 450 VSLLFQGGAILDVDASGIMYAAS---VSQVCLGFAANEDGGDVGIVG 493


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 136/304 (44%), Gaps = 39/304 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T++ +G PA   +V LDTGSD  W+ C  C  C     +         ++ P+ SST 
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEA---------LFDPSKSSTY 184

Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S + C+S  C+      K   S+   CPY++ Y +D + + G L  D L L+  +     
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKKCPYEITY-ADDSYTVGNLARDTLTLSPTDAVPGF 243

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DG 277
           V     FGCG    GSF      +GL GLG  K S+ S +A +      FS C  S    
Sbjct: 244 V-----FGCGHNNAGSF---GEIDGLLGLGRGKASLSSQVAAR--YGAGFSYCLPSSPSA 293

Query: 278 TGRISFG--DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-------IF 327
           TG +SF      +P   +    +   HP+ Y + +T ++V G A+    S        I 
Sbjct: 294 TGYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTII 353

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+F+ L   AY  +  +  S     +   +S + F+ CY L+ ++T    P V L  
Sbjct: 354 DSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTI-FDTCYDLTGHET-VRIPSVALVF 411

Query: 388 KGGG 391
             G 
Sbjct: 412 ADGA 415


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 115/389 (29%), Positives = 157/389 (40%), Gaps = 53/389 (13%)

Query: 36  HRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDT 95
           HR+  P   +   DD P       +   A  D   R+     A  G D   ++  A    
Sbjct: 24  HRHG-PCSPLQTPDDAPSDADLLEHDQ-ARVDSIHRMIANETAVVGQD---VSLPA---- 74

Query: 96  YRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVID 151
            R  S+G  +Y  +V +G PA    V  DTGSDL W+   PC    C H  +        
Sbjct: 75  ERGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDP------- 127

Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQ-CPSAGSN--CPYQVRYLSDGTMSTGFLVEDVL 208
             +++P++SST S V C    C   +Q C S+  +  CPY+V Y  D + + G L  D L
Sbjct: 128 --LFAPSSSSTFSAVRCGEPECPRARQSCSSSPGDDRCPYEVVY-GDKSRTVGHLGNDTL 184

Query: 209 HLATDEKQSKSVDSR-----ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
            L T    + S ++        FGCG   TG F      +GLFGLG  K S+ S  A  G
Sbjct: 185 TLGTTPSTNASENNSNKLPGFVFGCGENNTGLF---GKADGLFGLGRGKVSLSSQAA--G 239

Query: 264 LIPNSFSMCF---GSDGTGRISFGDKG-SPGQGE-TPFSLRQTHPT-YNITITQVSVGGN 317
                FS C     S+  G +S G    +P     TP   R   P+ Y + +  + V G 
Sbjct: 240 KYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGR 299

Query: 318 AVN-------FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEY 367
           A+        +    I DSGT  T L   AY+ +   F S   +   KR    S L   Y
Sbjct: 300 AIKVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCY 359

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
            +    N T    P V L   GG    V+
Sbjct: 360 DFTAHANAT-VSIPAVALVFAGGATISVD 387


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 157/369 (42%), Gaps = 53/369 (14%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
              +G P    ++ +DT S+L W+     SC    N S  +V  FN   P  SS+    P
Sbjct: 2   QTKIGTPPREVLLLVDTASELTWV--QGTSCT---NCSPTKVPPFN---PGLSSSFISEP 53

Query: 168 CNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           C S++C        Q  C  +  +C +QV YL DG+ + G +  ++  L + +  + ++ 
Sbjct: 54  CTSSVCLGRSKLGFQSACNRSTGSCSFQVAYL-DGSEAYGVIAREIFSLQSWDGAASTLG 112

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL--IPNSFSMCFGS---- 275
             I FGC        +D ++  G  GL     S P+ + ++    + + FS CF +    
Sbjct: 113 DVI-FGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEH 169

Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSAI-- 326
            + +G I FGD G P       SL Q  P       Y + +  +SVGG  ++   SA   
Sbjct: 170 LNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKI 229

Query: 327 ---------FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                    FDSGT+ ++L +PA+T + E F         TS SD   E CY ++     
Sbjct: 230 DRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDAR 289

Query: 378 F-EYPVVNLTMKGGGPFFVNDPIVIVS-SEPKGLYLYCL-----GVVKSDNVNIIG---- 426
               P+V L  K      + +  V V  +    +   CL     G V    VN+IG    
Sbjct: 290 LPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQ 349

Query: 427 REYPIANNI 435
           ++Y I +++
Sbjct: 350 QDYLIEHDL 358


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 132/308 (42%), Gaps = 37/308 (12%)

Query: 96  YRLNSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFN 153
           +R   LG  +Y  +V +G P    +V  DTGSDL W+ C  C +C    +          
Sbjct: 178 HRGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDP--------- 228

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++ P+ S+T S VPC +  C     C S    C Y+V Y  D + + G L  D L L   
Sbjct: 229 LFDPSQSTTYSAVPCGAQECLDSGTCSSG--KCRYEVVY-GDMSQTDGNLARDTLTLGPS 285

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             Q +       FGCG   TG F      +GLFGLG D+ S+ S  A +      FS C 
Sbjct: 286 SDQLQG----FVFGCGDDDTGLF---GRADGLFGLGRDRVSLASQAAAR--YGAGFSYCL 336

Query: 274 GSD--GTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA- 325
            S     G +S G   +P   + T    R   P+ Y + +  + V G  V      F A 
Sbjct: 337 PSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP 396

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKE-KRETSTSDLPFEYCYVLSPNQTNFEYPV 382
             + DSGT  T L   AY+ +  +F    +  KR  + S L  + CY  +  +T  + P 
Sbjct: 397 GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSIL--DTCYDFT-GRTKVQIPS 453

Query: 383 VNLTMKGG 390
           V L   GG
Sbjct: 454 VALLFDGG 461


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 119/462 (25%), Positives = 188/462 (40%), Gaps = 64/462 (13%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYY 60
           MASS  +  + +LL+L    +   F      +    R  +     +++  +   G++  +
Sbjct: 1   MASSASHMIIVILLVL--AVSSALFSPAASTWRSLDRRPEKNGFRVSLRHVDSGGNYTKF 58

Query: 61  SALAHRDRYFRLRGRGLAAQGNDKTPLT---FSAGNDTYRLNSLGFLHYTNVSVGQPALS 117
             L    +  RLR + L+A+     P       AGN  + +N         +++G PA +
Sbjct: 59  ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGEFLMN---------LAIGTPAET 109

Query: 118 FIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ 176
           +   +DTGSDL W  C  C  C               I+ P  SS+ SK+PC+S LC + 
Sbjct: 110 YSAIMDTGSDLIWTQCKPCKVCFDQPTP---------IFDPEKSSSFSKLPCSSDLC-VA 159

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG-S 235
               S    C Y+  Y  D + + G L  +            SV S+I FGCG    G +
Sbjct: 160 LPISSCSDGCEYRYSY-GDHSSTQGVLATETFTFG-----DASV-SKIGFGCGEDNRGRA 212

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQ 291
           +  GA   GL GLG    S+ S L     +P  FS C      S G   +  G + +   
Sbjct: 213 YSQGA---GLVGLGRGPLSLISQLG----VPK-FSYCLTSIDDSKGISTLLVGSEATVKS 264

Query: 292 G-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLND 338
              TP     + P+ Y +++  +SVG   +  E S            I DSGT+ TYL D
Sbjct: 265 AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKD 324

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
            A+  + + F S  K   + S S    E C+ L P+ +  + P +    +G       + 
Sbjct: 325 SAFAALKKEFISQMKLDVDASGST-ELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN 383

Query: 399 IVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
            +I   E   L + CL +  S  ++I G       NI + H+
Sbjct: 384 YII---EDSALRVICLTMGSSSGMSIFGNFQ--QQNIVVLHD 420


>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 452

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/272 (29%), Positives = 121/272 (44%), Gaps = 28/272 (10%)

Query: 105 HYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           HYT ++++G P   + + +D+GSDL W+ CD  C  C    +          +Y PN   
Sbjct: 63  HYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRD---------QLYKPN--- 110

Query: 162 TSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             + V C   LC      ++  C S    C Y+V Y   G+ S G LV D  ++      
Sbjct: 111 -HNLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYADHGS-SLGVLVRD--YIPFQFTN 166

Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
              V  R++FGCG  Q  S  +   A +G+ GLG  + S+ S L + GLI N    C  +
Sbjct: 167 GSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSA 226

Query: 276 DGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTS 332
            G G + FGD   P  G    S+    +   Y+    ++   G A   +    IFDSG+S
Sbjct: 227 RGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLELIFDSGSS 286

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
           +TY N  AY  + +      K K+    +D P
Sbjct: 287 YTYFNSQAYQAVVDLVTQDLKGKQLKRATDDP 318


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 138/308 (44%), Gaps = 55/308 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +    ++G PA + +VALDT +D  W+PC  CV C   +           ++ P+ SS+S
Sbjct: 88  YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136

Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
             + C +  C   KQ P    +   +C + + Y   G+    +L +D L LATD      
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSAIEAYLTQDTLTLATD------ 185

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
           V    +FGC    +G+ L      GL GLG    S+  I  +Q L  ++FS C      S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
           + +G +  G K  P + +T   L+    +  Y + +  + VG   V+   SA        
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
              IFDSGT +T L +PAY  +   F    K    TS     F+ CY   V+ P+ T F 
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGG--FDTCYSGSVVFPSVT-FM 357

Query: 380 YPVVNLTM 387
           +  +N+T+
Sbjct: 358 FAGMNVTL 365


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 112/393 (28%), Positives = 164/393 (41%), Gaps = 67/393 (17%)

Query: 65  HRDRYFRLRGRGL-AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           HR      R  G+ A  G     +   AGN  + ++         V++G PALS+   +D
Sbjct: 68  HRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMD---------VAIGTPALSYAAIVD 118

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPS 181
           TGSDL W  C  CV C               ++ P++SST + VPC+S LC +L     +
Sbjct: 119 TGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATVPCSSALCSDLPTSTCT 169

Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGA 240
           + S C Y   Y  D + + G L  +   L  ++K+   V    +FGCG    G  F  GA
Sbjct: 170 SASKCGYTYTY-GDASSTQGVLASETFTLGKEKKKLPGV----AFGCGDTNEGDGFTQGA 224

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKG----------- 287
              GL GLG    S+ S L   GL  + FS C  S  DG G+      G           
Sbjct: 225 ---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDGDGKSPLLLGGSAAAISESAAT 276

Query: 288 SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTY 335
           +P Q  TP     + P+ Y +++T ++VG   +    SA           I DSGTS TY
Sbjct: 277 APVQ-TTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITY 335

Query: 336 LNDPAYTQISETFNSLAKEKRET-STSDLPFEYCYVLSPNQTN-FEYPVVNLTMKGGGPF 393
           L    Y  + + F  +A+    T   S++  + C+       +  + P + L   GG   
Sbjct: 336 LELQGYRALKKAF--VAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADL 393

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +     +V     G    CL V  S  ++IIG
Sbjct: 394 DLPAENYMVLDSASG--ALCLTVAPSRGLSIIG 424


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 111/408 (27%), Positives = 168/408 (41%), Gaps = 62/408 (15%)

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP---LTFSAGNDTYRLNSLGFLHYTNVSV 111
           G++  +  L    +  RLR + L+A+     P       AGN  + +N         +++
Sbjct: 53  GNYTKFERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGEFLMN---------LAI 103

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G PA ++   +DTGSDL W  C  C  C               I+ P  SS+ SK+PC+S
Sbjct: 104 GTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTP---------IFDPEKSSSFSKLPCSS 154

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
            LC +     S    C Y+  Y  D + + G L  +            SV S+I FGCG 
Sbjct: 155 DLC-VALPISSCSDGCEYRYSY-GDHSSTQGVLATETFTFG-----DASV-SKIGFGCGE 206

Query: 231 VQTG-SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGD 285
              G ++  GA   GL GLG    S+ S L     +P  FS C      S G   +  G 
Sbjct: 207 DNRGRAYSQGA---GLVGLGRGPLSLISQLG----VPK-FSYCLTSIDDSKGISTLLVGS 258

Query: 286 KGSPGQG-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
           + +      TP     + P+ Y +++  +SVG   +  E S            I DSGT+
Sbjct: 259 EATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTT 318

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
            TYL D A+  + + F S  K   + S S    E C+ L P+ +  E P +    +G   
Sbjct: 319 ITYLKDNAFAALKKEFISQMKLDVDASGST-ELELCFTLPPDGSPVEVPQLVFHFEGVDL 377

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
               +  +I   E   L + CL +  S  ++I G       NI + H+
Sbjct: 378 KLPKENYII---EDSALRVICLTMGSSSGMSIFGNFQ--QQNIVVLHD 420


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 156/367 (42%), Gaps = 52/367 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           +GF + T +++GQP   + + +DTGSDL WL CD  C  C    +          +Y P 
Sbjct: 74  VGFYNVT-LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 122

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY---LSDGTMSTGFLVEDVLHLA-TDE 214
              ++  VPC  +LC       +     P+Q  Y    +D   S G L+ DV  L  T+ 
Sbjct: 123 ---SNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNG 179

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
            Q K    R++ GCG  Q          +G+ GLG  KTS+ S L +QGL+ N    C  
Sbjct: 180 VQLKV---RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLS 236

Query: 275 SDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTS 332
           + G G I FGD   S     TP S R           ++  GG         A+FD+G+S
Sbjct: 237 AQGGGYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSS 296

Query: 333 FTYLNDPAYTQI-----SETFNSLAKEKRETSTSDL------PFEYCYVLSPNQTNFEYP 381
           +TY N  AY  +      E+     KE  +  T  L      PF   Y +   +  F+  
Sbjct: 297 YTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEV---RKYFKPI 353

Query: 382 VVNLTMKGGGPFFVNDP---IVIVSSEPKGLYLYCLGVVKSDNV-----NIIGREYPIAN 433
           V++ T  G        P    +I+S+    +   CLG++    V     N+IG +  + N
Sbjct: 354 VLSFTSNGRSKAQFEMPPEAYLIISN----MGNVCLGILNGSEVGMGDLNLIG-DISMLN 408

Query: 434 NISLFHN 440
            + +F N
Sbjct: 409 KVMVFDN 415


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/268 (29%), Positives = 124/268 (46%), Gaps = 35/268 (13%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+YT + VG+P   + + +DTGSDL W+ CD  C SC  G +          +Y P   +
Sbjct: 198 LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSP---------LYKPRREN 248

Query: 162 TSSKVPCNSTLC-ELQK-----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
               V    +LC E+Q+     QC +A   C Y+V+Y +D + S G LV+D   L     
Sbjct: 249 V---VSFKDSLCMEVQRNYDGDQC-AACQQCNYEVQY-ADQSSSLGVLVKDEFTLRFSNG 303

Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
               +++   FGC   Q G  L+  +  +G+ GL   K S+PS LA++G+I N    C  
Sbjct: 304 SLTKLNA--IFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLT 361

Query: 275 SD--GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEF------S 324
            D  G G +  GD   P  G    ++  +     Y   + ++  G   ++ +        
Sbjct: 362 GDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQ 421

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLA 352
            +FDSG+S+TY    AY Q+      ++
Sbjct: 422 VVFDSGSSYTYFTKEAYYQLVANLEEVS 449


>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
          Length = 427

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 41/277 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +Y  +++G P   F + +DTGSDL W+ CD  C  C                Y PN
Sbjct: 64  LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTK--------------YKPN 108

Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
            ++    +PC+  LC        + C      C Y++ Y SD   S G LV D + L   
Sbjct: 109 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 162

Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                 ++ R++FGCG   Q           G+ GLG  K  + + L + G+  N    C
Sbjct: 163 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 221

Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
               G G +S GD+  P  G T  SL    P+ N       +     + G   +N     
Sbjct: 222 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 277

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           +FDSG+S+TY N  AY  I +        K  T T D
Sbjct: 278 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 314


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 157/348 (45%), Gaps = 59/348 (16%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           Y  + +G PA  F V +DTGS + ++PC   SC      + G       + P +SS+S+ 
Sbjct: 63  YATLHLGTPARQFAVIVDTGSTITYVPC--ASC----GRNCGPHHKDAAFDPASSSSSAV 116

Query: 166 VPCNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + C+S  C   +  P  G      C YQ  Y ++ + S G LV D L L     +  +V+
Sbjct: 117 IGCDSDKCICGR--PPCGCSEKRECTYQRTY-AEQSSSAGLLVSDQLQL-----RDGAVE 168

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR 280
             + FGC   +TG   +  A +G+ GLG  + S+ + LA  G+I + F++CFGS +G G 
Sbjct: 169 --VVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGA 225

Query: 281 ISFGDKGSPGQGETPFSLRQT-------HPT-YNITITQVSVGGNAVNFE-------FSA 325
           +  GD  +    E   +L+ T       HP  Y++ +  + VGG  +  +       +  
Sbjct: 226 LMLGDVDA---AEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGT 282

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEK----------RETSTSDLPFEYCYVLSP-- 373
           + DSGT+FTYL   A+    E  ++ A E           +E S +    + C+  +P  
Sbjct: 283 VLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQF-HDICFGGAPHA 341

Query: 374 ---NQTNFE--YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGV 416
              +Q+  E  +PV  L     G      P+  +      +  YCLGV
Sbjct: 342 GHADQSKLEKVFPVFELQF-ADGVRLRTGPLNYLFMHTGEMGAYCLGV 388


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 147/337 (43%), Gaps = 37/337 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +D+GS + ++PC DC  C        G+  D   + P  SST   
Sbjct: 96  TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPELSSTYQP 146

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C      C Y+  Y ++ + S G L ED++       +S+    R  
Sbjct: 147 VKCN-----MDCNCDDDKEQCVYEREY-AEHSSSKGVLGEDLISFGN---ESQLTPQRAV 197

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L ++GLI NSF +C+G    G G +  
Sbjct: 198 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMIL 256

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P       S     P YNI +T + V G  ++        E  A+ DSGT++ YL
Sbjct: 257 GGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAYL 316

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPNQTNFE----YPVVNLTMKGGG 391
            D A+    E         ++    D  F + C++++ +    E    +P V +  K G 
Sbjct: 317 PDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQ 376

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
            + ++ P   +    K    YCLGV  +  D+  ++G
Sbjct: 377 SWLLS-PENYMFRHSKVHGAYCLGVFPNGKDHTTLLG 412


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 118/257 (45%), Gaps = 39/257 (15%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P  SS+   
Sbjct: 82  TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSSSYKA 132

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + CN   C     C   G  C Y+ RY ++ + S+G L ED++       +S+    R  
Sbjct: 133 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGN---ESQLTPQRAV 183

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG  K SV   L ++G+I + FS+C+G    G G +  
Sbjct: 184 FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 242

Query: 284 GDKGSPGQGET-----PFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
           G K SP  G       PF      P YNI + Q+ V G ++       N +   + DSGT
Sbjct: 243 G-KISPPAGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 297

Query: 332 SFTYLNDPAYTQISETF 348
           ++ Y    A+  I +  
Sbjct: 298 TYAYFPKEAFIAIKDAI 314


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/336 (29%), Positives = 144/336 (42%), Gaps = 44/336 (13%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            V +G PA  + V  DTGSD  W+ C  CV   +             ++ P  SST + V
Sbjct: 166 TVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEP--------LFDPAKSSTYANV 217

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            C  + C         G +C Y V+Y  DG+ + GF  +D L +A D  +         F
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQY-GDGSYTVGFFAQDTLTIAHDAIKG------FRF 270

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFG 284
           GCG    G F   A   GL GLG  KTS+     N+     +F+ C  +   GTG + FG
Sbjct: 271 GCGEKNNGLFGKTA---GLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGTGYLDFG 325

Query: 285 DKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYL 336
             GS G     TP    +    Y + +T + VGG  V    S       + DSGT  T L
Sbjct: 326 -PGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRL 384

Query: 337 NDPAYTQISETFNS--LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
              AYT +S  F+   LA+  ++     +  + CY  +   ++ E P V+L  +GG    
Sbjct: 385 PATAYTALSSAFDKVMLARGYKKAPGYSI-LDTCYDFT-GLSDVELPTVSLVFQGGACLD 442

Query: 395 VN-DPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
           V+   IV   SE +     CL    +   ++V I+G
Sbjct: 443 VDVSGIVYAISEAQ----VCLAFASNGDDESVAIVG 474


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/275 (28%), Positives = 114/275 (41%), Gaps = 28/275 (10%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +++G P   + + +DTGSDL WL CD  C SC           +   +Y P  + 
Sbjct: 65  LYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTKNK 115

Query: 162 TSSKVPCNSTLC-------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VPC   LC         + +C S    C Y ++Y   G+ STG LV D   L    
Sbjct: 116 L---VPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGS-STGVLVNDSFALRL-- 169

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
                V   ++FGCG  Q  S  + +  +G+ GLG    S+ S     G+  N    C  
Sbjct: 170 ANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCLS 229

Query: 275 SDGTGRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFDSGT 331
             G G + FGD   P Q    TP         Y+     +  G  ++  + +  +FDSG+
Sbjct: 230 LRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDSGS 289

Query: 332 SFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
           SFTY     Y  +       L++  +E S   LP 
Sbjct: 290 SFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPL 324


>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
 gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
          Length = 603

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 117/274 (42%), Gaps = 33/274 (12%)

Query: 114 PALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNST 171
           P   + +  DTGSDL W+ CD  C SC  G N+          Y P   +    VP    
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANA---------WYKPRRGNI---VPPKDL 246

Query: 172 LCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           LC   ++   AG       C Y++ Y +D + S G L  D L L         ++    F
Sbjct: 247 LCMEVQRNQKAGYCETCDQCDYEIEY-ADHSSSMGVLATDKLLLMVANGSLTKLN--FIF 303

Query: 227 GCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISF 283
           GC   Q G  L      +G+ GL   K S+PS LA+QG+I N    C  +D  G G +  
Sbjct: 304 GCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFL 363

Query: 284 GDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNF------EFSAIFDSGTSFTY 335
           GD   P  G    P     +   Y+  + +++ G + ++           +FDSG+S+TY
Sbjct: 364 GDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTY 423

Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
               AY+++  + N ++      STSD     C+
Sbjct: 424 FPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCW 457


>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
 gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 466

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 36/277 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +Y  +++G P   F + +DTGSDL W+ CD  C  C                Y PN
Sbjct: 64  LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 113

Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
            ++    +PC+  LC        + C      C Y++ Y SD   S G LV D + L   
Sbjct: 114 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 167

Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                 ++ R++FGCG   Q           G+ GLG  K  + + L + G+  N    C
Sbjct: 168 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 226

Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
               G G +S GD+  P  G T  SL    P+ N       +     + G   +N     
Sbjct: 227 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 282

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           +FDSG+S+TY N  AY  I +        K  T T D
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 86/300 (28%), Positives = 129/300 (43%), Gaps = 32/300 (10%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           + +G PA  F V  DTGSD  W+ C  CV+  +             +++P  S+T + + 
Sbjct: 169 IRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEP--------LFTPTKSATYANIS 220

Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           C S+ C        +G +C Y V+Y  DG+ + GF  +D L L  D  +         FG
Sbjct: 221 CTSSYCSDLDTRGCSGGHCLYAVQY-GDGSYTVGFYAQDTLTLGYDTVKD------FRFG 273

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
           CG    G F   A   GL GLG  KTSVP    ++      F+ C    S GTG + FG 
Sbjct: 274 CGEKNRGLFGKAA---GLMGLGRGKTSVPVQAYDK--YSGVFAYCIPATSSGTGFLDFGP 328

Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNF------EFSAIFDSGTSFTYLN 337
                     TP  +      Y + +T + VGG+ ++       +  A+ DSGT  T L 
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388

Query: 338 DPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
             AY  +   F   +     +T+ +    + CY L+  Q +   P V+L  +GG    V+
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVD 448


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 145/320 (45%), Gaps = 59/320 (18%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            + +G P   F   +DTGSDL W+ C  C  C    +          IY P+ SST +K 
Sbjct: 7   EIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDP---------IYDPSASSTFAKT 57

Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            C+++ C+      C S+   C Y  +Y  D + + G    + L L +    SK+     
Sbjct: 58  SCSTSSCQSLPASGCSSSAKTCIYGYQY-GDSSSTQGDFALETLTLRSSGGSSKAFP-NF 115

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
            FGCGR+ +GSF  GAA  G+ GLG  K S+ + L +   I N FS C       S  T 
Sbjct: 116 QFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSKTS 170

Query: 280 RISFGDKGSPGQGE-----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--------- 325
            + FG   S G G       P S R T+  Y + +  +SVGG  ++    A         
Sbjct: 171 PLIFGSSASTGSGAISTPIIPNSGRSTY--YFVGLEGISVGGKQLSLATRAIDFLSVRSK 228

Query: 326 ---------------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
                          IFDSGT+ T L+D  Y+++   F +S++    + S+S   F+ CY
Sbjct: 229 KKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSG--FDLCY 286

Query: 370 VLSPNQTNFEYPVVNLTMKG 389
            +S ++ NF++P + L  KG
Sbjct: 287 DVSKSK-NFKFPALTLAFKG 305


>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
 gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
          Length = 424

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 139/346 (40%), Gaps = 39/346 (11%)

Query: 104 LHYTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
           L Y +VS  +G P   F + +DTGSDL W+ CD  C  C   L+         ++Y P  
Sbjct: 64  LGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLH---------HLYKPRN 114

Query: 160 SSTSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           +  S   P C++       QC SA   C Y+++Y  +G+ S G LV D   L        
Sbjct: 115 NLLSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGS-SLGVLVTDYFPLRL--MNGS 171

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
            +  +++FGCG  Q         P  G+ GLG  KTS+ S L   G++ N    C    G
Sbjct: 172 FLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKG 231

Query: 278 TGRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-IFDSGTSFT 334
            G + FG    P  G    P S +     Y     ++  GG     +    IFDSG+S+T
Sbjct: 232 GGFLFFGQDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSYT 291

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF- 393
           Y N   Y     T N + KE       D P E    +    T   +  VN       PF 
Sbjct: 292 YFNAQVY---QSTLNLIRKELSGKPLRDAPEEKALAICWKGTK-RFKSVNEVKSYFKPFA 347

Query: 394 --FVNDPIVIVSSEPKGLYL------YCLGVVKSD-----NVNIIG 426
             F     V +   P+   +       CLG++        N N+IG
Sbjct: 348 LSFTKAKSVQLQIPPEDYLIVTNDGNVCLGILNGSEVGLGNFNVIG 393


>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 432

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 36/277 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +Y  +++G P   F + +DTGSDL W+ CD  C  C                Y PN
Sbjct: 64  LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 113

Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
            ++    +PC+  LC        + C      C Y++ Y SD   S G LV D + L   
Sbjct: 114 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 167

Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                 ++ R++FGCG   Q           G+ GLG  K  + + L + G+  N    C
Sbjct: 168 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 226

Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
               G G +S GD+  P  G T  SL    P+ N       +     + G   +N     
Sbjct: 227 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 282

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           +FDSG+S+TY N  AY  I +        K  T T D
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 167/391 (42%), Gaps = 46/391 (11%)

Query: 51  LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVS 110
           LP   S+   S LA   R    RG G  A  N +  L      + Y        + T + 
Sbjct: 47  LPLTRSYPNASRLAASLR----RGLGDGAHPNARMRLHDDLLTNGY--------YTTRLY 94

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           +G P   F + +D+GS + ++PC   SC    N    +      + P+ SS+ S V CN 
Sbjct: 95  IGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSSYSPVKCN- 145

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
               +   C S    C Y+ +Y ++ + S+G L ED++      ++S+    R  FGC  
Sbjct: 146 ----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKAQRAVFGCEN 197

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
            +TG      A +G+ GLG  + S+   L  +G+I +SFS+C+G    G  +    G P 
Sbjct: 198 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPT 256

Query: 291 QGETPFSLRQ--THPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAY 341
             +  FS       P YNI + ++ V G A+  +          + DSGT++ YL + A+
Sbjct: 257 PSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAF 316

Query: 342 TQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPFFVND 397
               +   S     ++    D  + + C+     + ++ +  +P V++   G G      
Sbjct: 317 MAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVF-GNGQKLSLT 375

Query: 398 PIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
           P   +    K    YCLGV ++  D   ++G
Sbjct: 376 PENYLFRHSKVDGAYCLGVFQNGKDPTTLLG 406


>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
          Length = 165

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/127 (41%), Positives = 69/127 (54%), Gaps = 12/127 (9%)

Query: 29  TFGFDFHHRYSDPVKGI------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
           ++    +H++S+ VK        L  D  P +GS  YY AL H D      GR LA    
Sbjct: 27  SYSLQMYHKFSNEVKEWMTWRHGLDTDGWPVEGSNEYYKALYHHDS--ARHGRKLA---- 80

Query: 83  DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
           D   LTF  GN+T  +  LGFL Y+ V VG P ++  VALDTGSD+FW+PCDC +C    
Sbjct: 81  DHPSLTFLEGNETVEIPQLGFLFYSMVQVGTPNVTLFVALDTGSDVFWVPCDCQACAPTS 140

Query: 143 NSSSGQV 149
            +S G V
Sbjct: 141 AASYGLV 147


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 125/288 (43%), Gaps = 38/288 (13%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  + VG P+  + + +D+GS+L W+ CD  C+SC  G +          +Y     S
Sbjct: 78  LYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHP---------LYKLKKGS 128

Query: 162 TSSKVPCNSTLCELQK-------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
               VP    LC   +           A   C Y V Y +D   S GFLV D +      
Sbjct: 129 L---VPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAY-ADHGYSEGFLVRDSVRALLTN 184

Query: 215 KQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           K   + +S   FGCG  Q  S  +  A  +G+ GLG    S+PS  A QGLI N    C 
Sbjct: 185 KTVLTANS--VFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCI 242

Query: 274 ---GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--- 325
              G DG G + FGD    +      P   R +   Y +   Q++ G   ++ +      
Sbjct: 243 FGAGRDG-GYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKL 301

Query: 326 ---IFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
              IFDSG+++TY  + AY   +S    +L+ ++ E  +SD     C+
Sbjct: 302 GGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCW 349


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 159/371 (42%), Gaps = 61/371 (16%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYS 156
           L   G  +Y  + +G PA+  ++ +DTGSD+ W+ C  C  CV  L            ++
Sbjct: 131 LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPP---------FN 181

Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
           P  SS+  K+PC S+ C      ++  C  +G  C + ++Y  DG++S+G L  + +   
Sbjct: 182 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQY-GDGSLSSGLLAMETIAGN 240

Query: 212 T----DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           T    D +  K   S I+ GC  +       GA+  GL G+     S PS L+++     
Sbjct: 241 TPNFGDGEPVKL--SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YAR 294

Query: 268 SFSMCFGS-----DGTGRISFG--DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
            FS CF       + +G + FG  D  SP    TP       P+ ++    V + G +V 
Sbjct: 295 KFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVD 354

Query: 320 ---------NFEFS-------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
                    NF+          I DSGT+FTYL  PA+  +   F  LA+        D 
Sbjct: 355 ESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDN 412

Query: 364 P-FEYCYVLSPNQTNFE---YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVV 417
             F  CY ++      E    P + L  +GG    +  N  ++ VSS  +   L CL   
Sbjct: 413 SGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTL-CLAFQ 471

Query: 418 KSDNV--NIIG 426
            S ++  NIIG
Sbjct: 472 MSGDIPFNIIG 482


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 99/314 (31%), Positives = 134/314 (42%), Gaps = 36/314 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN-IYSPNTSSTS 163
           +  +V +G PA    V  DTGSDL W+ C       G  SS G     + +++P+ SST 
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-------GPCSSGGCYKQQDPLFAPSDSSTF 206

Query: 164 SKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV- 220
           S V C +  C  ++ C  +  +  CPY+V Y  D + + G L  D L L T    + S  
Sbjct: 207 SAVRCGARECRARQSCGGSPGDDRCPYEVVY-GDKSRTQGHLGNDTLTLGTMAPANASAE 265

Query: 221 -DSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
            D+++    FGCG   TG F      +GLFGLG  K S+ S  A  G     FS C    
Sbjct: 266 NDNKLPGFVFGCGENNTGLF---GQADGLFGLGRGKVSLSSQAA--GKFGEGFSYCLPSS 320

Query: 274 GSDGTGRISFGDK-GSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFE-----FSA 325
            S   G +S G    +P   + TP   R T P+ Y + +  + V G A+           
Sbjct: 321 SSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPL 380

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYPV 382
           I DSGT  T L   AY  +   F S   +   KR    S L   Y +    N T    P 
Sbjct: 381 IVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANAT-VSIPA 439

Query: 383 VNLTMKGGGPFFVN 396
           V L   GG    V+
Sbjct: 440 VALVFAGGATISVD 453


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 146/337 (43%), Gaps = 37/337 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +D+GS + ++PC DC  C        G+  D   + P  SST   
Sbjct: 95  TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPEMSSTYQP 145

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C      C Y+  Y ++ + S G L ED++       +S+    R  
Sbjct: 146 VKCN-----MDCNCDDDREQCVYEREY-AEHSSSKGVLGEDLISFGN---ESQLTPQRAV 196

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L ++GLI NSF +C+G    G G +  
Sbjct: 197 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMIL 255

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P       S     P YNI +T + V G  ++        E  A+ DSGT++ YL
Sbjct: 256 GGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYL 315

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPNQTNFE----YPVVNLTMKGGG 391
            D A+    E         ++    D  F + C+ ++ +    E    +P V +  K G 
Sbjct: 316 PDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQ 375

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
            + ++ P   +    K    YCLGV  +  D+  ++G
Sbjct: 376 SWLLS-PENYMFRHSKVHGAYCLGVFPNGKDHTTLLG 411


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 137/308 (44%), Gaps = 55/308 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +    ++G PA   +VALDT +D  W+PC  CV C   +           ++ P+ SS+S
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136

Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
             + C +  C   KQ P    +   +C + + Y   G+    +L +D L LA+D      
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------ 185

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
           V    +FGC    +G+ L      GL GLG    S+  I  +Q L  ++FS C      S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
           + +G +  G K  P + +T   L+    +  Y + +  + VG   V+   SA        
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
              IFDSGT +T L +PAY  +   F    K    TS     F+ CY   V+ P+ T F 
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG--FDTCYSGSVVFPSVT-FM 357

Query: 380 YPVVNLTM 387
           +  +N+T+
Sbjct: 358 FAGMNVTL 365


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 137/308 (44%), Gaps = 55/308 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +    ++G PA   +VALDT +D  W+PC  CV C   +           ++ P+ SS+S
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136

Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
             + C +  C   KQ P    +   +C + + Y   G+    +L +D L LA+D      
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------ 185

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
           V    +FGC    +G+ L      GL GLG    S+  I  +Q L  ++FS C      S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
           + +G +  G K  P + +T   L+    +  Y + +  + VG   V+   SA        
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
              IFDSGT +T L +PAY  +   F    K    TS     F+ CY   V+ P+ T F 
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG--FDTCYSGSVVFPSVT-FM 357

Query: 380 YPVVNLTM 387
           +  +N+T+
Sbjct: 358 FAGMNVTL 365


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 96/341 (28%), Positives = 152/341 (44%), Gaps = 59/341 (17%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L   N S+GQPA   +  +DTGS++ W+ C  C  C       +G ++D     P+ SST
Sbjct: 98  LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQ----QNGPLLD-----PSKSST 148

Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + +PC +T+C      PSA  N    C Y + Y + G  S G L  + L   + ++   
Sbjct: 149 YASLPCTNTMCHY---APSAYCNRLNQCGYNLSY-ATGLSSAGVLATEQLIFHSSDEGVN 204

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
           +V S + FGC   + G + D     G+FGLG   TS  + + ++      FS C G+   
Sbjct: 205 AVPS-VVFGCSH-ENGDYKDRRF-TGVFGLGKGITSFVTRMGSK------FSYCLGNIAD 255

Query: 277 ---GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------EF 323
              G  ++ FG+K +     TP  +   H  Y +T+  +SVG   ++           E 
Sbjct: 256 PHYGYNQLVFGEKANFEGYSTPLKVVNGH--YYVTLEGISVGEKRLDIDSTAFSMKGNEK 313

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEY----CYVLSPNQTNF 378
           SA+ DSGT+ T+L + A       F +L  E R+     L PF      CY  + +Q   
Sbjct: 314 SALIDSGTALTWLAESA-------FRALDNEVRQLLDGVLMPFWRGSFACYKGTVSQDLI 366

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
            +PVV     GG    ++   +   + P  L   C+ V ++
Sbjct: 367 GFPVVTFHFSGGADLDLDTESMFYQATPDIL---CIAVRQA 404


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 101/346 (29%), Positives = 148/346 (42%), Gaps = 53/346 (15%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           ++S+G PAL++   +DTGSDL W    C  CV   N S+       ++ P++SST S +P
Sbjct: 121 DMSIGTPALAYAAIVDTGSDLVW--TQCKPCVECFNQST------PVFDPSSSSTYSTLP 172

Query: 168 CNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           C+S+LC       C SA  +C Y   Y  D + + G L  +   LA      K+    ++
Sbjct: 173 CSSSLCSDLPTSTCTSAAKDCGYTYTY-GDASSTQGVLAAETFTLA------KTKLPGVA 225

Query: 226 FGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR--- 280
           FGCG    G  F  GA   GL GLG    S+ S L   GL    FS C  S D T +   
Sbjct: 226 FGCGDTNEGDGFTQGA---GLVGLGRGPLSLVSQL---GL--GKFSYCLTSLDDTSKSPL 277

Query: 281 -------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------- 325
                  IS     +     TP     + P+ Y +T+  ++VG   +    SA       
Sbjct: 278 LLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDG 337

Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT-NFEY 380
               I DSGTS TYL    Y  + + F +  K      ++ +  + C+    +   + E 
Sbjct: 338 TGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSA-VGLDLCFKAPASGVDDVEV 396

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P + L   GG    +     +V     G    CL V+ S  ++IIG
Sbjct: 397 PKLVLHFDGGADLDLPAENYMVLDSASG--ALCLTVMGSRGLSIIG 440


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 86/305 (28%), Positives = 127/305 (41%), Gaps = 37/305 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L + N SVGQP +     +DTGS L W+ C  C  C      SS  +I   +++P  SST
Sbjct: 67  LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHC------SSNHMIH-PVFNPALSST 119

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             +  C+   C        + + C Y+  Y+S GT S G L ++ L   T    +  V  
Sbjct: 120 FVECSCDDRFCRYAPNGHCSSNKCVYEQVYIS-GTGSKGVLAKERLTFTTPNGNT-VVTQ 177

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDG 277
            I+FGCG  + G  L+     G+ GLG   TS+   L ++      FS C G     + G
Sbjct: 178 PIAFGCGH-ENGEQLESEF-TGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNYG 229

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------FSAIF 327
             ++  G+        TP      +  Y + +  +SVG   +N E             I 
Sbjct: 230 YNQLVLGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVIL 289

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETST-SDLPFEYCYVLSPNQTNFEYPVVNLT 386
           D+GT +T+L D AY ++     S+   K E     D     CY    N+    +PVV   
Sbjct: 290 DTGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDF---LCYHGRVNEELIGFPVVTFH 346

Query: 387 MKGGG 391
             GG 
Sbjct: 347 FAGGA 351


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 104/335 (31%), Positives = 149/335 (44%), Gaps = 43/335 (12%)

Query: 71  RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
           RL  RG+  +    T L   +G       S+G   Y   V +G P   F +  DTGSD+ 
Sbjct: 91  RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 143

Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
           W  C+ CV   +               +P+TS++   + C+S LC+L        + C S
Sbjct: 144 WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 195

Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
             S C YQV+Y  DG+ S GF   + L L+     S +V     FGCG+   G F   A 
Sbjct: 196 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 247

Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
             GL      K ++PS  A       S+ +   S   G +S G + S     TP S    
Sbjct: 248 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 304

Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
           + P Y + IT +SVGG  ++ + SA     + DSGT  T L+  AY+++S  F +L  + 
Sbjct: 305 STPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 364

Query: 356 RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
             TS   + F+ CY  S   T    P V +T KGG
Sbjct: 365 PSTSGYSI-FDTCYDFSKYDT-VRIPKVGVTFKGG 397


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 125/305 (40%), Gaps = 33/305 (10%)

Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG  +Y  +V +G P    +V  DTGSDL W+ C  C  C    +          ++ P+
Sbjct: 133 LGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDP---------LFDPS 183

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            S+T S VPC +  C        +   C Y+V Y  D + + G L  D L L      S 
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVY-GDMSQTDGNLARDTLTLGPSSSSSS 242

Query: 219 SVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
           S       FGCG   TG F      +GLFGLG D+ S+ S  A +      FS C  S  
Sbjct: 243 SDQLQEFVFGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAK--YGAGFSYCLPSSS 297

Query: 278 T--GRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFD 328
           T  G +S G    P    T    R   P+ Y + +  + V G  V    +       + D
Sbjct: 298 TAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVID 357

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           SGT  T L   AY  +  +F  L +    KR  + S L  + CY  +  +   + P V L
Sbjct: 358 SGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSIL--DTCYDFT-GRNKVQIPSVAL 414

Query: 386 TMKGG 390
              GG
Sbjct: 415 LFDGG 419


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 150/337 (44%), Gaps = 43/337 (12%)

Query: 69  YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSD 127
           + RL  RG+  +    T L   +G       S+G   Y   V +G P   F +  DTGSD
Sbjct: 101 HARLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSD 153

Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQC 179
           + W  C+ CV   +               +P+TS++   + C+S LC+L        + C
Sbjct: 154 ITWTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSC 205

Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
            S  S C YQV+Y  DG+ S GF   + L L+     S +V     FGCG+   G F   
Sbjct: 206 SS--STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGA 257

Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR 299
           A   GL      K ++PS  A       S+ +   S   G +S G + S     TP S  
Sbjct: 258 AGLLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSAD 314

Query: 300 -QTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAK 353
             + P Y + IT +SVGG  ++ + SA     + DSGT  T L+  AY+++S  F +L  
Sbjct: 315 FDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 374

Query: 354 EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           +   TS   + F+ CY  S   T    P V +T KGG
Sbjct: 375 DYPSTSGYSI-FDTCYDFSKYDT-VRIPKVGVTFKGG 409


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 159/361 (44%), Gaps = 40/361 (11%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           +GF + T +++GQP   + + +DTGS+L WL CD  C  C    +          +Y P+
Sbjct: 71  VGFYNVT-LNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHP---------LYKPS 120

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQS 217
                 K P  ++L           + C Y+++Y +D   + G L+ DV  L  T+  Q 
Sbjct: 121 NDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKY-ADQYSTLGVLLNDVYLLNFTNGVQL 179

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
           K    R++ GCG  Q  S       +G+ GLG  K S+ S L +QGL+ N    C  S G
Sbjct: 180 KV---RMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRG 236

Query: 278 TGRISFGDK-GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTY 335
            G I FG+   S     TP S   +   Y+    ++  GG        + IFD+G+S+TY
Sbjct: 237 GGYIFFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTY 296

Query: 336 LNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY-----VLSPNQTNFEYPVVNLTMKG 389
            N  AY  +    N  L ++  + +  D     C+       S N+    +  + L+   
Sbjct: 297 FNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTN 356

Query: 390 GG---PFFVNDP--IVIVSSEPKGLYLYCLGVVKSDNV-----NIIGREYPIANNISLFH 439
           GG   P F   P   +I+S+    +   CLG++    V     N+IG +  + + + +F 
Sbjct: 357 GGRVKPQFEIPPEAYLIISN----MGNVCLGILNGPEVGLGELNLIG-DISMLDKVMVFD 411

Query: 440 N 440
           N
Sbjct: 412 N 412


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 119/270 (44%), Gaps = 32/270 (11%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           +G PA  + + +DTGSDL WL CD  C SC           +   +Y P  +     VPC
Sbjct: 1   IGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL---VPC 48

Query: 169 NSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            + LC           +CPS    C YQ++Y +D   S G L+ D   L     +S ++ 
Sbjct: 49  ANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM---RSSNIR 103

Query: 222 SRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
             ++FGCG  Q    +    AA +G+ GLG    S+ S L  QG+  N    C  ++G G
Sbjct: 104 PGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 163

Query: 280 RISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
            + FGD   P    T  P + R +   Y+     +     ++  +    +FDSG+++TY 
Sbjct: 164 FLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYF 223

Query: 337 N-DPAYTQISETFNSLAKEKRETSTSDLPF 365
              P    +S     L+K  ++ S   LP 
Sbjct: 224 TAQPYQAVVSALKGGLSKSLKQVSDPTLPL 253


>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 72/251 (28%), Positives = 121/251 (48%), Gaps = 25/251 (9%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           Y  +++G+PA  + + +DTGS+L WL  +C   VHG      +      Y+P  +  + K
Sbjct: 39  YATLNIGEPAKPYFLDVDTGSNLTWL--ECHHPVHGCKGCHPRP-PHPYYTP--ADGNLK 93

Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           V C S LC   ++     P    N    C Y+++Y++    S G L  D++ +   +K+ 
Sbjct: 94  VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
                RI+FGCG  Q        +P +G+ GLGM K  + + L    +I  N    C  S
Sbjct: 151 -----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSS 205

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
            G G +  GD   P +G T   +R++   Y+  + +V +    +  N  F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265

Query: 334 TYLNDPAYTQI 344
           T++    Y +I
Sbjct: 266 THVPAQIYNEI 276


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 86/355 (24%), Positives = 141/355 (39%), Gaps = 58/355 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++T + VG PA  F V +DTGS+L W     V+C +       +     ++  + S +  
Sbjct: 84  YFTEIRVGTPAKKFRVVVDTGSELTW-----VNCRYRARGKDNR----RVFRADESKSFK 134

Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            V C +  C++          CP+  + C Y  RY +DG+ + G   ++ + +     + 
Sbjct: 135 TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRM 193

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----- 272
             +   +  GC    TG    GA  +G+ GL     S  S   +  L    FS C     
Sbjct: 194 ARLPGHL-IGCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDHL 248

Query: 273 ----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
                     FGS  + + +F       +  TP  L +  P Y I +  +S+G + ++  
Sbjct: 249 SNKNVSNYLIFGSSRSTKTAF-------RRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 301

Query: 323 FSA---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
                       I DSGTS T L D AY Q+         E +      +P EYC+  + 
Sbjct: 302 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 361

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
                + P +   +KGG  F  +    +V + P    + CLG V +     N+IG
Sbjct: 362 GFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG---VKCLGFVSAGTPATNVIG 413


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 104/335 (31%), Positives = 149/335 (44%), Gaps = 43/335 (12%)

Query: 71  RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
           RL  RG+  +    T L   +G       S+G   Y   V +G P   F +  DTGSD+ 
Sbjct: 43  RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95

Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
           W  C+ CV   +               +P+TS++   + C+S LC+L        + C S
Sbjct: 96  WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 147

Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
             S C YQV+Y  DG+ S GF   + L L+     S +V     FGCG+   G F   A 
Sbjct: 148 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 199

Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
             GL      K ++PS  A       S+ +   S   G +S G + S     TP S    
Sbjct: 200 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 256

Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
           + P Y + IT +SVGG  ++ + SA     + DSGT  T L+  AY+++S  F +L  + 
Sbjct: 257 STPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 316

Query: 356 RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
             TS   + F+ CY  S   T    P V +T KGG
Sbjct: 317 PSTSGYSI-FDTCYDFSKYDT-VRIPKVGVTFKGG 349


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 167/381 (43%), Gaps = 41/381 (10%)

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           +DR   +R +  A+    K  ++  A N    L++  ++   ++ +G PA   +V LDTG
Sbjct: 103 QDRVDAIRRKVTASSNKPKGGVSLLA-NWGKSLSTTNYV--ASLRLGTPATELVVELDTG 159

Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQK 177
           SD  W+ C  C  C    +          ++ P  SST S VPC +  C+        + 
Sbjct: 160 SDQSWVQCKPCADCYEQRDP---------VFDPTASSTYSAVPCGARECQELASSSSSRN 210

Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS-VDSRISFGCGRVQTGSF 236
                  NCPY+V Y  D + + G L  D L L+     S +       FGCG    G+F
Sbjct: 211 CSSDNNKNCPYEVSY-DDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTF 269

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGE- 293
            +    +GL GLG+ K S+PS +A +     +FS C  S     G +SFG   +    + 
Sbjct: 270 GE---VDGLLGLGLGKASLPSQVAAR--YGAAFSYCLPSSPSAAGYLSFGGAAARANAQF 324

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------IFDSGTSFTYLNDPAYTQISE 346
           T     Q   +Y + +T + V G A+    SA       I DSGT+F+ L   AY  +  
Sbjct: 325 TEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRS 384

Query: 347 TFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
           +F S + + + + + S   F+ CY  + ++T    P V L    G    ++   V+ +  
Sbjct: 385 SFRSAMGRYRYKRAPSSPIFDTCYDFTGHET-VRIPAVELVFADGATVHLHPSGVLYTW- 442

Query: 406 PKGLYLYCLGVVKSDNVNIIG 426
              +   CL  V + ++ I+G
Sbjct: 443 -NDVAQTCLAFVPNHDLGILG 462


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 101/235 (42%), Gaps = 25/235 (10%)

Query: 128 LFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---- 183
           +F L   C +C       SG  +D  +Y PN S TS+ VPC    C      P +G    
Sbjct: 26  VFLLQLGCTAC----PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQD 81

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
            +CPY + Y  DG+ ++G  V D L     +    +K  +S + FGCG  Q+GS    + 
Sbjct: 82  MSCPYSITY-GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSD 140

Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
            A +G+ G G   +SV S LA  G +   FS C  S  G G  S G    P    TP   
Sbjct: 141 EALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVP 200

Query: 299 RQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAYTQI 344
           R  H  YN+ +  + V G  +               I DSGT+  YL    Y Q+
Sbjct: 201 RMAH--YNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL 253


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 155/353 (43%), Gaps = 45/353 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  + +G P   + + LDTGS L WL C  CV   H         +D  ++ P+ S+T 
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCH-------SQVD-PLFEPSASNTY 171

Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             + C+S+ C L K        C ++G  C Y   Y  D + S G+L  D+L L      
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGV-CVYTASY-GDASYSMGYLSRDLLTLTP---- 225

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
           S+++ S  ++GCG+   G F   A   G+ GL  DK S+ + L+ +     +FS C    
Sbjct: 226 SQTLPS-FTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPK--YGYAFSYCLPTS 279

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTY------NITITQVSVGGNAVNFEFSAIF 327
            S G G +S G         TP      +P+        IT+    VG  A  ++   I 
Sbjct: 280 TSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTII 339

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT  T L    Y  + E F  +   + E + +    + C+  S    +   P + +  
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGA-PEIRMIF 398

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
           +GG    +  P +++ ++ KG  + CL    S+ + IIG    + Y IA ++S
Sbjct: 399 QGGADLSLRAPNILIEAD-KG--IACLAFASSNQIAIIGNHQQQTYNIAYDVS 448


>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
          Length = 566

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 82/254 (32%), Positives = 115/254 (45%), Gaps = 40/254 (15%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+YT V +G P   F V +DTGSD+ W+   C SC +G   +S   I  + + P  SS++
Sbjct: 131 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 187

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           S V C+   C    Q  S  S    C Y  +Y  DG+ ++G+ + D              
Sbjct: 188 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGYYISD-------------- 232

Query: 221 DSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
                F C  +Q+G       A +G+FGLG    SV S LA QGL P  FS C   D  G
Sbjct: 233 -----FMCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG 287

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFD 328
            G +  G    P    TP  L  + P YN+ +  ++V G  +  + S          I D
Sbjct: 288 GGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIID 345

Query: 329 SGTSFTYLNDPAYT 342
           +GT+  YL D AY+
Sbjct: 346 TGTTLAYLPDEAYS 359


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 76/256 (29%), Positives = 111/256 (43%), Gaps = 32/256 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +YT++ +G P   + + +DTGSDL W+ CD  C +   G +          +Y P   + 
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHP---------LYKP---AK 234

Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              VP    LC+     Q  C +    C Y++ Y +D + S G L  D +H+       +
Sbjct: 235 EKIVPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHMIATNGGRE 292

Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
            +D    FGC   Q G  L   A  +G+ GL     S PS LA+ G+I N F  C   + 
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQ 350

Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
            G G +  GD   P  G T  S+R      Y+     V  G   +     A      IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410

Query: 329 SGTSFTYLNDPAYTQI 344
           SG+S+TYL +  Y  +
Sbjct: 411 SGSSYTYLPNEIYENL 426


>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 440

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 155/367 (42%), Gaps = 52/367 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           +GF + T +++GQP   + + +DTGSDL WL CD  C  C    +          +Y P 
Sbjct: 76  VGFYNVT-LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 124

Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY---LSDGTMSTGFLVEDVLHLA-TDE 214
              ++  VPC   LC       +     P+Q  Y    +D   S G L+ DV  L  T+ 
Sbjct: 125 ---SNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNG 181

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
            Q K    R++ GCG  Q          +G+ GLG  KTS+ S L +QGL+ N    C  
Sbjct: 182 VQLKV---RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLS 238

Query: 275 SDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTS 332
           + G G I FGD   S     TP S R           ++  GG         A+FD+G+S
Sbjct: 239 AQGGGYIFFGDVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSS 298

Query: 333 FTYLNDPAYTQI-----SETFNSLAKEKRETSTSDL------PFEYCYVLSPNQTNFEYP 381
           +TY N  AY  +      E+     KE  +  T  L      PF   Y +   +  F+  
Sbjct: 299 YTYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEV---RKYFKPI 355

Query: 382 VVNLTMKGGGPF---FVNDPIVIVSSEPKGLYLYCLGVVKSDNV-----NIIGREYPIAN 433
           V++ T  G        + +  +IVS+        CLG++    V     N+IG +  + N
Sbjct: 356 VLSFTSNGRSKAQFEMLPEAYLIVSNMGN----VCLGILNGSEVGMGDLNLIG-DISMLN 410

Query: 434 NISLFHN 440
            + +F N
Sbjct: 411 KVMVFDN 417


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 137/332 (41%), Gaps = 39/332 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L   N SVGQP +  +  +DTGS L W+ C  C  C      SS  +I   +++P  SST
Sbjct: 95  LFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHC------SSDHMIH-PVFNPALSST 147

Query: 163 SSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             +  C+   C          SN C Y+  Y+S GT S G L ++ L   T    +  V 
Sbjct: 148 FVECSCDDRFCRYAPNGHCGSSNKCVYEQVYIS-GTGSKGVLAKERLTFTTPNGNT-VVT 205

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SD 276
             I+FGCG  + G  L+     G+ GLG   TS+   L ++      FS C G     + 
Sbjct: 206 QPIAFGCG-YENGEQLESHF-TGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNY 257

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------FSAI 326
           G  ++  G+        TP      +  Y + +  +SVG   +N E             I
Sbjct: 258 GYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVI 317

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETST-SDLPFEYCYVLSPNQTNFEYPVVNL 385
            DSGT +T+L D AY ++     S+   K E     D     CY    ++    +PVV  
Sbjct: 318 LDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDF---LCYHGRVSEELIGFPVVTF 374

Query: 386 TMKGGGPFFVNDPIVIVS-SEPKGLYLYCLGV 416
              GG    +    +    SEP    ++C+ V
Sbjct: 375 HFAGGAELAMEATSMFYPLSEPNTFNVFCMSV 406


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +++G P   + + +D+GSDL WL CD  C SC           +   +Y P  S 
Sbjct: 65  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 115

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
               VPC   LC         + +C S    C Y ++Y   G+ STG L+ D   L L  
Sbjct: 116 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 171

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                 SV    +FGCG  Q     D ++P +G+ GLG    S+ S L  +G+  N    
Sbjct: 172 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 227

Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
           C    G G + FGD   P Q    TP +       Y+     +  G  ++    +  +FD
Sbjct: 228 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 287

Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
           SG+SFTY     Y  +     + L++   E   + LP 
Sbjct: 288 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 325


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +++G P   + + +D+GSDL WL CD  C SC           +   +Y P  S 
Sbjct: 56  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 106

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
               VPC   LC         + +C S    C Y ++Y   G+ STG L+ D   L L  
Sbjct: 107 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 162

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                 SV    +FGCG  Q     D ++P +G+ GLG    S+ S L  +G+  N    
Sbjct: 163 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 218

Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
           C    G G + FGD   P Q    TP +       Y+     +  G  ++    +  +FD
Sbjct: 219 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 278

Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
           SG+SFTY     Y  +     + L++   E   + LP 
Sbjct: 279 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 316


>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 123/276 (44%), Gaps = 25/276 (9%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           Y  +++G+PA  + + +DTGS+L WL  +C   VHG      +      Y+P  +  + K
Sbjct: 39  YATLNIGEPAKPYFLDVDTGSNLTWL--ECHHPVHGCKGCHPRP-PHPYYTP--ADGNLK 93

Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           V C S LC   ++     P    N    C Y+++Y++    S G L  D++ +   +K+ 
Sbjct: 94  VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
                RI+FGCG  Q        +P +G+ GLGM K    + L    +I  N    C  S
Sbjct: 151 -----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSS 205

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
            G G +  GD   P +G T   +R++   Y+  + +V +    +  N  F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           T++    Y +I         E             C+
Sbjct: 266 THVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCW 301


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 148/336 (44%), Gaps = 36/336 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P+ SST   
Sbjct: 83  TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPDLSSTYQP 133

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V C      L   C +    C Y+ +Y ++ + S+G L EDV+       QS+    R  
Sbjct: 134 VKCT-----LDCNCDNDRMQCVYERQY-AEMSTSSGVLGEDVVSFGN---QSELAPQRAV 184

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC  V+TG      A +G+ GLG    S+   L ++ ++ +SFS+C+G    G G +  
Sbjct: 185 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVL 243

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G    P       S     P YNI + ++ V G  +         +  ++ DSGT++ YL
Sbjct: 244 GGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYL 303

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
            + A+    E      +   + S  D  + + C+    +  +Q +  +PVV++   G G 
Sbjct: 304 PEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIF-GNGH 362

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
            +   P   +    K    YCLG+ ++  D   ++G
Sbjct: 363 KYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLG 398


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 118/279 (42%), Gaps = 34/279 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +++G P   + + +D+GSDL WL CD  C SC           +   +Y P  S 
Sbjct: 63  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 113

Query: 162 TSSKVPCNSTLCEL--------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLA 211
               VPC   LC          + +C S    C Y ++Y   G+ STG LV D   L L 
Sbjct: 114 L---VPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGS-STGVLVNDSFALRLT 169

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFS 270
                  SV    +FGCG  Q     D ++P +G+ GLG    S+ S L  +G+  N   
Sbjct: 170 NGSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVG 225

Query: 271 MCFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIF 327
            C    G G + FGD   P Q    TP +       Y+     +  G  ++    +  +F
Sbjct: 226 HCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVF 285

Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
           DSG+SFTY     Y  +     + L++   E   + LP 
Sbjct: 286 DSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 324


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 88/300 (29%), Positives = 126/300 (42%), Gaps = 37/300 (12%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P        DTGSD+ WL C+ C  C +             I++P+ SS+   +PC
Sbjct: 92  SVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTP---------IFNPSKSSSYKNIPC 142

Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
            S LC   +    +  N C Y++ Y  D + S G L  D L L +      S    +  G
Sbjct: 143 LSKLCHSVRDTSCSDQNSCQYKISY-GDSSHSQGDLSVDTLSLESTSGSPVSFPKTV-IG 200

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------GSDGTGRI 281
           CG    G+F  G A +G+ GLG    S+ + L +   I   FS C        S+ +  +
Sbjct: 201 CGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256

Query: 282 SFGDKG-SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIFDSG 330
           SFGD     G G     L +  P  Y +T+   SVG   V F         E + I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           T+ T +    YT +      L K  R     +  F  CY L  N+  +++P++    KG 
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDR-VDDPNQQFSLCYSLKSNE--YDFPIITAHFKGA 373


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/355 (24%), Positives = 141/355 (39%), Gaps = 58/355 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++T + VG PA  F V +DTGS+L W     V+C +       +     ++  + S +  
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTW-----VNCRYRARGKDNR----RVFRADESKSFK 156

Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            V C +  C++          CP+  + C Y  RY +DG+ + G   ++ + +     + 
Sbjct: 157 TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRM 215

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----- 272
             +   +  GC    TG    GA  +G+ GL     S  S   +  L    FS C     
Sbjct: 216 ARLPGHL-IGCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDHL 270

Query: 273 ----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
                     FGS  + + +F       +  TP  L +  P Y I +  +S+G + ++  
Sbjct: 271 SNKNVSNYLIFGSSRSTKTAF-------RRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 323

Query: 323 FSA---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
                       I DSGTS T L D AY Q+         E +      +P EYC+  + 
Sbjct: 324 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 383

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
                + P +   +KGG  F  +    +V + P    + CLG V +     N+IG
Sbjct: 384 GFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG---VKCLGFVSAGTPATNVIG 435


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 149/362 (41%), Gaps = 44/362 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 223

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 224 PARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 283 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 332

Query: 275 SDGTGRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------I 326
           S GTG + FG  GSP      TP  +      Y + +T + VGG  +    S       I
Sbjct: 333 STGTGYLDFG-AGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTI 391

Query: 327 FDSGTSFTYLNDPAYTQISETFNSL--AKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            DSGT  T L   AY+ +   F +   A+  ++     L  + CY  +   +    P V+
Sbjct: 392 VDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSL-LDTCYDFA-GMSQVAIPTVS 449

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIANNIS 436
           L  +GG    V+   ++ ++    + L          +G+V +  +   G  Y I   + 
Sbjct: 450 LLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVV 509

Query: 437 LF 438
            F
Sbjct: 510 SF 511


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/282 (29%), Positives = 122/282 (43%), Gaps = 43/282 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  +++G PA  + + +DTGSDL WL CD  C SC           +   +Y P   + 
Sbjct: 52  YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSC---------NKVPHPLYKP---TK 99

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           +  VPC +++C          K+C +    C YQ++Y +D   S G LV D   L    +
Sbjct: 100 NKLVPCAASICTTLHSAQSPNKKC-AVPQQCDYQIKY-TDSASSLGVLVTDNFTLPL--R 155

Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
            S SV    +FGCG  Q    + +  A  +GL GLG    S+ S L   G+  N    C 
Sbjct: 156 NSSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCL 215

Query: 274 GSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE--------FS 324
            ++G G + FGD   P    T   + R T   Y       S G   + F+          
Sbjct: 216 STNGGGFLFFGDNVVPTSRATWVPMVRSTSGNY------YSPGSGTLYFDRRSLGVKPME 269

Query: 325 AIFDSGTSFTYL-NDPAYTQISETFNSLAKEKRETSTSDLPF 365
            +FDSG+++TY    P    +S     L+K  ++ S   LP 
Sbjct: 270 VVFDSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPL 311


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           L+Y  +++G P   + + +D+GSDL WL CD  C SC           +   +Y P  S 
Sbjct: 65  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 115

Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
               VPC   LC         + +C S    C Y ++Y   G+ STG L+ D   L L  
Sbjct: 116 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 171

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                 SV    +FGCG  Q     D ++P +G+ GLG    S+ S L  +G+  N    
Sbjct: 172 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 227

Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
           C    G G + FGD   P Q    TP +       Y+     +  G  ++    +  +FD
Sbjct: 228 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 287

Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
           SG+SFTY     Y  +     + L++   E   + LP 
Sbjct: 288 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 325


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 142/334 (42%), Gaps = 38/334 (11%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IY 155
           L++L F+    V  G PA ++ V  DTGSD+ W+   C+ C       SG     +  I+
Sbjct: 130 LDTLEFV--VTVGFGTPAQTYTVIFDTGSDVSWI--QCLPC-------SGHCYKQHDPIF 178

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
            P  S+T S VPC    C        +   C Y+V Y  DG+ S G L  + L L +   
Sbjct: 179 DPTKSATYSVVPCGHPQCAAADGSKCSNGTCLYKVEY-GDGSSSAGVLSHETLSLTSTRA 237

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
                    +FGCG+   G F D    +GL GLG  + S+ S  A       +FS C  S
Sbjct: 238 LPG-----FAFGCGQTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPS 287

Query: 276 DGT--GRISFGDKGSPGQGETPFSL---RQTHPT-YNITITQVSVGGNAVNF------EF 323
           D T  G ++ G        +  ++    +Q +P+ Y + +  + +GG  +        + 
Sbjct: 288 DNTTHGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDD 347

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
               DSGT  TYL   AYT + + F     + +     D PF+ CY  +  Q+    P V
Sbjct: 348 GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYD-PFDTCYDFT-GQSAIFIPAV 405

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           +     G  F ++   +++  +     + CLG V
Sbjct: 406 SFKFSDGSVFDLSFFGILIFPDDTAPAIGCLGFV 439


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 97/343 (28%), Positives = 150/343 (43%), Gaps = 60/343 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G PA + ++A+DT +D  W+PC  CV C               +++   S+T 
Sbjct: 96  YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC------------SSTVFNNVKSTTF 143

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             V C +  C+        GS C + + Y S    +   L +DV+ LATD   S      
Sbjct: 144 KTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDSIPS------ 195

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
            +FGC    TGS +    P GL GLG    S+ S    Q L  ++FS C  S    + +G
Sbjct: 196 YTFGCLTEATGSSIP---PQGLLGLGRGPMSLLS--QTQNLYQSTFSYCLPSFRSLNFSG 250

Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
            +  G  G P + +T   L+    +  Y + +  + VG   V+   SA           I
Sbjct: 251 SLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTI 310

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVV 383
           FDSGT FT L  PAYT + + F    +    T TS   F+ CY   +++P  T F +  +
Sbjct: 311 FDSGTVFTRLVAPAYTAVRDAFRK--RVGNATVTSLGGFDTCYTSPIVAPTIT-FMFSGM 367

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNII 425
           N+T+         D ++I S+      + CL +  + DNVN +
Sbjct: 368 NVTLPP-------DNLLIHSTASS---ITCLAMAAAPDNVNSV 400


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/346 (28%), Positives = 143/346 (41%), Gaps = 46/346 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   V +G P   F V +DTGSDL W+ C      +  N +        ++ PNTS++ +
Sbjct: 13  YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDA--------LFLPNTSTSFT 64

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           K+ C S LC          + C Y   Y  DG+++TG  V D + +     Q + V    
Sbjct: 65  KLACGSALCNGLPFPMCNQTTCVYWYSY-GDGSLTTGDFVYDTITMDGINGQKQQV-PNF 122

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
           +FGCG    GSF   A  +G+ GLG    S  S L  + +    FS C          T 
Sbjct: 123 AFGCGHDNEGSF---AGADGILGLGQGPLSFHSQL--KSVYNGKFSYCLVDWLAPPTQTS 177

Query: 280 RISFGDKGSPGQGET---PFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
            + FGD   P   +    P       PT Y + +  +SVG N +N   +           
Sbjct: 178 PLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG 237

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNS--LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
            IFDSGT+ T L + AY ++    N+  +A  ++    S L  + C    P       P 
Sbjct: 238 TIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRL--DLCLSGFPKDQLPTVPA 295

Query: 383 VNLTMKGGGPFF--VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           +    +GG       N  I + SS+      YC  +  S +VNIIG
Sbjct: 296 MTFHFEGGDMVLPPSNYFIYLESSQS-----YCFAMTSSPDVNIIG 336


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/344 (27%), Positives = 140/344 (40%), Gaps = 43/344 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  NV +G P     +  DTGSDL W  C  CV   +             I+ P+TS T 
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQP--------IFDPSTSKTY 205

Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C S  C   K         + SNC Y ++Y  D + + GF  +D L L  ++    
Sbjct: 206 SNISCTSAACSSLKSATGNSPGCSSSNCVYGIQY-GDSSFTIGFFAKDKLTLTQND---- 260

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
            V     FGCG+   G F   A   GL GLG D  S+    A +      FS C  +   
Sbjct: 261 -VFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314

Query: 277 GTGRISFGD----KGSP----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE------ 322
             G ++FG+    K S     G   TPF+  Q    Y I +  +SVGG A++        
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN 374

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT  T L   AY  +   F      K  T+ +    + CY LS N T+   P 
Sbjct: 375 AGTIIDSGTVITRLPSTAYGSLKSAFKQFM-SKYPTAPALSLLDTCYDLS-NYTSISIPK 432

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++    G     ++   +++++    + L   G    D++ I G
Sbjct: 433 ISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFG 476


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 154/373 (41%), Gaps = 37/373 (9%)

Query: 67  DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
           DR F  RGRGL               +D   L + G+ + + V +G PA  F + +DTGS
Sbjct: 71  DRRFERRGRGLVEDAR------MVLHDD---LLTKGY-YTSRVFIGTPAQEFALIVDTGS 120

Query: 127 DLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
            + ++PC  C  C H       Q      + P+ SS+   V CNS  C + K C +    
Sbjct: 121 TVTYVPCSSCTHCGHH------QACFDPRFKPDNSSSYQTVSCNSPDC-ITKMCDARVHQ 173

Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           C Y+ R  ++ + S G L +D+L        S+     + FGC   +TG      A +G+
Sbjct: 174 CKYE-RVYAEMSSSKGVLGKDLLGFGNG---SRLQPHPLLFGCETAETGDLYLQHA-DGI 228

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHP 303
            GLG    S+   L   G + +SFS+C+G   +G G +  G    P       S      
Sbjct: 229 MGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGAIPPPPAMVFAKSDPNRSN 288

Query: 304 TYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
            YN+ ++++ V G ++N            + DSGT++ YL D A+    +         +
Sbjct: 289 YYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQ 348

Query: 357 ETSTSDLPF-EYCYVLSPNQTNF---EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY 412
                D  + + C+  + + +      +P V+    G    F+  P   +    K    Y
Sbjct: 349 AVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLA-PENYLFKHTKVPGAY 407

Query: 413 CLGVVKSDNVNII 425
           CLG  K+ +   +
Sbjct: 408 CLGFFKNQDATTL 420


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/298 (29%), Positives = 131/298 (43%), Gaps = 40/298 (13%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           V +G PA  F V  DTGSD  W+ C  CV+  +             ++ P  S+T + + 
Sbjct: 100 VRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEP--------LFDPTKSATYANIS 151

Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           C+S+ C        +G +C Y ++Y  DG+ + GF  +D L LA D  ++        FG
Sbjct: 152 CSSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYDTIKN------FRFG 204

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
           CG    G F   A   GL GLG  KTS+P    ++      F+ C    S GTG +  G 
Sbjct: 205 CGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLDLG- 258

Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLN 337
            G+P      TP  + +    Y + +T + VGG+ +    S       + DSGT  T L 
Sbjct: 259 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 318

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQ-TNFEYPVVNLTMKGG 390
             AY  +   F+   K  +    S  P     + CY L+ ++  +   P V+L  +GG
Sbjct: 319 PSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGG 373


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 98/338 (28%), Positives = 144/338 (42%), Gaps = 49/338 (14%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           S+G P++  +   DTGSDL WL C  C +C            +  ++ P  SST   VPC
Sbjct: 93  SLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQ---------EAPLFDPTQSSTYVDVPC 143

Query: 169 NSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSR 223
            S  C L    Q++C S+   C Y  +Y +D + + G L  D +   +T   Q  +   +
Sbjct: 144 ESQPCTLFPQNQRECGSS-KQCIYLHQYGTD-SFTIGRLGYDTISFSSTGMGQGGATFPK 201

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
             FGC      +F      NG  GLG    S+ S L +Q  I + FS C   F S  TG+
Sbjct: 202 SVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGK 259

Query: 281 ISFGDKGSPGQ-GETPFSLRQTHPTYNI-TITQVSVGGNAV---NFEFSAIFDSGTSFTY 335
           + FG      +   TPF +  ++P+Y +  +  ++VG   V       + I DS    T+
Sbjct: 260 LKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTH 319

Query: 336 LNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
           L    YT  IS    ++  E  E + +  PFEYC     N TN  +P       G     
Sbjct: 320 LEQGIYTDFISSVKEAINVEVAEDAPT--PFEYCVR---NPTNLNFPEFVFHFTGAD--- 371

Query: 395 VNDPIVIVSSEPKGLY------LYCLGVVKSDNVNIIG 426
                  V   PK ++      L C+ VV S  ++I G
Sbjct: 372 -------VVLGPKNMFIALDNNLVCMTVVPSKGISIFG 402


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/310 (29%), Positives = 134/310 (43%), Gaps = 43/310 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P     + +DTGSD+ WL C  C +C    ++         +++P++SS+ 
Sbjct: 16  YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDA---------LFNPSSSSSF 66

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C+S+LC          + C YQ  Y  DG+ + G LV D + L       + V + 
Sbjct: 67  KVLDCSSSLCLNLDVMGCLSNKCLYQADY-GDGSFTMGELVTDNVVLDDAFGPGQVVLTN 125

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           I  GCG    G+F   A   G+ GLG    S P+ L       N FS C     SD   +
Sbjct: 126 IPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNL--DASTRNIFSYCLPDRESDPNHK 180

Query: 281 --ISFGDKGSP--GQGETPFSLRQTHPT----YNITITQVSVGGNAVN------FEFSA- 325
             + FGD   P    G   F  +  +P     Y + IT +SVGGN +       F+  + 
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFE 379
                IFDSGT+ T L   AYT + + F   A     TS +D   F+ CY  +    +  
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFR--AATMHLTSAADFKIFDTCYDFT-GMNSIS 297

Query: 380 YPVVNLTMKG 389
            P V    +G
Sbjct: 298 VPTVTFHFQG 307


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 119/265 (44%), Gaps = 27/265 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +D+GS + ++PC   SC    N    +      + P+ SS+ S V
Sbjct: 90  TRLYIGTPPQEFALIVDSGSTVTYVPCS--SCEQCGNHQDPR------FQPDLSSSYSPV 141

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            CN     +   C S    C Y+ +Y ++ + S+G L ED++      ++S+       F
Sbjct: 142 KCN-----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKPQHAIF 192

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
           GC   +TG      A +G+ GLG  + S+   L  +G+I +SFS+C+G    G G +  G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 251

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
              +P       S     P YNI + ++ V G A+  E          + DSGT++ YL 
Sbjct: 252 GMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLP 311

Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
           + A+    E   S     ++    D
Sbjct: 312 EQAFVAFKEAVTSKVHSLKKIRGPD 336


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 87/327 (26%), Positives = 140/327 (42%), Gaps = 34/327 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P +SST   
Sbjct: 90  TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQC--------GKHQDPR-FQPESSSTYKP 140

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + CN + C     C   G  C Y+ RY ++ + S+G L EDVL       +S+    R  
Sbjct: 141 MQCNPS-C----NCDDEGKQCTYERRY-AEMSSSSGLLAEDVLSFGN---ESELTPQRAI 191

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISF 283
           FGC  V+TG      A +G+ GLG    SV   L  + ++ NSFS+C+G      G +  
Sbjct: 192 FGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVL 250

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
           G+   P       S       YNI + ++ V G  +         +   + DSGT++ YL
Sbjct: 251 GNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYL 310

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
            + A+    +      K  ++    D  + + C+       +Q +  +P VN+   G G 
Sbjct: 311 PEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVF-GNGQ 369

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS 419
                P   +    K    YCLG+ ++
Sbjct: 370 KLSLSPENYLFRHTKVSGAYCLGIFQN 396


>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
 gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
          Length = 649

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 77/261 (29%), Positives = 121/261 (46%), Gaps = 50/261 (19%)

Query: 105 HYTNVSVGQPA-LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y N+++G P+  +F V +DTGS L ++PC   +C      + G   D            
Sbjct: 112 YYANIALGDPSPRTFQVIVDTGSTLTYVPC--ATCAKCGTHTGGTRFD------------ 157

Query: 164 SKVPCNSTLCELQKQCPSAG-------------SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
              P    L   +KQC +AG             + C Y  R  ++G+  +G LV D +H 
Sbjct: 158 ---PTGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYS-RTYAEGSGVSGDLVRDKMHF 213

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK-TSVPSILANQGLIPNSF 269
             D   + +    + FGC   ++G+  D  A +GL GLG ++  S+P+ LA+   +P  F
Sbjct: 214 GGDIAPATNGTLDVVFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADTHGLPRVF 272

Query: 270 SMCFGS-DGTGRISFGDKGSPGQGETP------FSLRQTHPTYNITIT-QVSVGGNAV-- 319
           S+CFGS +G G +SFG    P    TP        + + HP Y +  T  + +G  AV  
Sbjct: 273 SLCFGSFEGGGALSFGRL--PATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVAT 330

Query: 320 ----NFEFSAIFDSGTSFTYL 336
                  +  + DSGT+FTY+
Sbjct: 331 PSDLAVGYGTVMDSGTTFTYV 351


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 147/351 (41%), Gaps = 51/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P L +   +DTGSDL W  C  CV C       + Q   +  + P  S+T 
Sbjct: 92  YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLC-------ADQPTPY--FRPARSATY 142

Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             VPC S LC  L        S C YQ  Y  D   + G L  +          SK + S
Sbjct: 143 RLVPCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTFGA-ANSSKVMVS 200

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
            ++FGCG + +G   +    +G+ GLG    S+ S L      P+ FS C   F S    
Sbjct: 201 DVAFGCGNINSGQLANS---SGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPS 252

Query: 280 RISFG-----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------A 318
           R++FG             GSP Q  TP  +    P+ Y +++  +S+G           A
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQ-STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311

Query: 319 VNFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
           +N + +     DSGTS T+L   AY  +     S+ +    T+ +++  E C+   P  +
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPS 371

Query: 377 -NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
                P + L   GG    V     ++     G    CL +++S +  IIG
Sbjct: 372 VAVTVPDMELHFDGGANMTVPPENYMLIDGATG--FLCLAMIRSGDATIIG 420


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 109/424 (25%), Positives = 172/424 (40%), Gaps = 70/424 (16%)

Query: 31  GFDFHHRYSDPVKGILAVDDLPKKG-SFAYYS----ALAHRDRYFRLRGRGLAAQGNDKT 85
           G   HH    P  G+  V +    G +   Y     A+   +R  R     L +    +T
Sbjct: 28  GTLLHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRSINAMLQSSSGIET 87

Query: 86  PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNS 144
           P+   AG+  Y +N         V++G PA S    +DTGSDL W  C+ C  C      
Sbjct: 88  PVY--AGSGEYLMN---------VAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTP 136

Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG--SNCPYQVRYLSDGTMSTGF 202
                    I++P  SS+ S +PC S  C+     PS    ++C Y   Y  DG+ + G+
Sbjct: 137 ---------IFNPQDSSSFSTLPCESQYCQ---DLPSESCYNDCQYTYGY-GDGSSTQGY 183

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA-- 260
           +  +     T      S    I+FGCG    G F  G    GL G+G    S+PS L   
Sbjct: 184 MATETFTFET------SSVPNIAFGCGEDNQG-FGQGNGA-GLIGMGWGPLSLPSQLGVG 235

Query: 261 ------NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
                       +  ++  GS  +G      +GSP       SL  T+  Y IT+  ++V
Sbjct: 236 QFSYCMTSSGSSSPSTLALGSAASGV----PEGSPSTTLIHSSLNPTY--YYITLQGITV 289

Query: 315 GGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSD 362
           GG+ +    S            I DSGT+ TYL   AY  +++ F + +     + S+S 
Sbjct: 290 GGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSG 349

Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
           L    C+ L  + +  + P +++   GG      + ++I  +E  G+    +G      +
Sbjct: 350 L--STCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLISPAE--GVICLAMGSSSQQGI 405

Query: 423 NIIG 426
           +I G
Sbjct: 406 SIFG 409


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 147/351 (41%), Gaps = 51/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P L +   +DTGSDL W  C  CV C       + Q   +  + P  S+T 
Sbjct: 92  YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLC-------ADQPTPY--FRPARSATY 142

Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             VPC S LC  L        S C YQ  Y  D   + G L  +          SK + S
Sbjct: 143 RLVPCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTFGA-ANSSKVMVS 200

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
            ++FGCG + +G   +    +G+ GLG    S+ S L      P+ FS C   F S    
Sbjct: 201 DVAFGCGNINSGQLANS---SGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPS 252

Query: 280 RISFG-----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------A 318
           R++FG             GSP Q  TP  +    P+ Y +++  +S+G           A
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQ-STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311

Query: 319 VNFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
           +N + +     DSGTS T+L   AY  +     S+ +    T+ +++  E C+   P  +
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPS 371

Query: 377 -NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
                P + L   GG    V     ++     G    CL +++S +  IIG
Sbjct: 372 VAVTVPDMELHFDGGANMTVPPENYMLIDGATG--FLCLAMIRSGDATIIG 420


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 154/368 (41%), Gaps = 58/368 (15%)

Query: 86  PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
           P T  +  DT         +   V +G PA++  + +DTGSD+ W+ C         NS+
Sbjct: 117 PTTLGSALDTME-------YVITVGIGSPAVTQTMMIDTGSDVSWVRC---------NST 160

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFL 203
            G      ++ P+ S+T +   C+S  C          SN  C Y+V+Y  DG+ +TG  
Sbjct: 161 DG----LTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQY-GDGSNTTGTY 215

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
             D L L+  +  +        FGC   +     DG   +GL GLG D  S+ S  A   
Sbjct: 216 SSDTLALSASDTVTD-----FHFGCSHHEED--FDGEKIDGLMGLGGDAQSLVSQTA--A 266

Query: 264 LIPNSFSMCF--GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPT-YNITITQVSVGGNA 318
               SFS C    +  +G ++FG       G   TP       PT Y + +  +SVGG  
Sbjct: 267 TYGKSFSYCLPPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTP 326

Query: 319 VNFEFS-----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLS 372
           +  + S     ++ DSGT  T+L   AY+ +S  F S     R    + L   + CY  +
Sbjct: 327 LGIQPSVLSNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFT 386

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY-CLGVVKSDNVNIIG----R 427
               N   P V+L + GG          +V  +  G+ +  CL    +   +IIG    R
Sbjct: 387 -GLVNVSIPAVSLVLDGG---------AVVDLDGNGIMIQDCLAFAATSGDSIIGNVQQR 436

Query: 428 EYPIANNI 435
            + + +++
Sbjct: 437 TFEVLHDV 444


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 107/420 (25%), Positives = 169/420 (40%), Gaps = 67/420 (15%)

Query: 34  FHHRYSDPVKGI-LAVDDLPKKGSFAYYS----ALAHRDRYFRLRGRGLAAQGNDKTPLT 88
            HH    P  G+ + ++ +    +   Y     A+   +R  R     L +    +TP+ 
Sbjct: 31  LHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVY 90

Query: 89  FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
             AG+  Y +N         V++G P  SF   +DTGSDL W  C+ C  C         
Sbjct: 91  --AGDGEYLMN---------VAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTP--- 136

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
                 I++P  SS+ S +PC S  C+         + C Y   Y  DG+ + G++  + 
Sbjct: 137 ------IFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGY-GDGSTTQGYMATET 189

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
               T      S    I+FGCG    G F  G    GL G+G    S+PS L        
Sbjct: 190 FTFET------SSVPNIAFGCGEDNQG-FGQGNGA-GLIGMGWGPLSLPSQLGV-----G 236

Query: 268 SFSMC---FGSDGTGRISFGD------KGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
            FS C   +GS     ++ G       +GSP       SL  T+  Y IT+  ++VGG+ 
Sbjct: 237 QFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTY--YYITLQGITVGGDN 294

Query: 319 VNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE 366
           +    S            I DSGT+ TYL   AY  +++ F + +     + S+S L   
Sbjct: 295 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGL--S 352

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            C+    + +  + P +++   GG        I+I  +E  G+    +G      ++I G
Sbjct: 353 TCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAE--GVICLAMGSSSQLGISIFG 410


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 127/307 (41%), Gaps = 39/307 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++    VG P+  F++  DTGSDL W+ C   C S  +  N  + ++    ++  N SS+
Sbjct: 83  YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSS 141

Query: 163 SSKVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
              +PC + +C+++         CP+  + C Y  RY SDG+ + GF   + + +   E 
Sbjct: 142 FKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEG 200

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           +   + + +  GC     G     A  +G+ GLG  K S     A +      FS C   
Sbjct: 201 RKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVD 255

Query: 274 ---GSDGTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
                + +  ++FG   S          T   L   +  Y + +  +S+GG  +      
Sbjct: 256 HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315

Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
                    I DSG+S T+L +PAY  +         + R+      P EYC+    N T
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF----NST 371

Query: 377 NFEYPVV 383
            FE  +V
Sbjct: 372 GFEESLV 378


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 92/347 (26%), Positives = 145/347 (41%), Gaps = 45/347 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++ +G P   +   LDTGSDL W  C  C+ CV        Q   F  + P  S + 
Sbjct: 89  YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVD-------QPTPF--FDPAQSPSY 139

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           +K+PCNS +C          + C YQ  Y  D   + G L  +     T++  ++    R
Sbjct: 140 AKLPCNSPMCNALYYPLCYRNVCVYQYFY-GDSANTAGVLSNETFTFGTND--TRVTVPR 196

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG--------LIPNSFSMCFGS 275
           I+FGCG +  GS  +G+   G+ G G    S+ S L +          + P    + FG+
Sbjct: 197 IAFGCGNLNAGSLFNGS---GMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGA 253

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---------- 324
             T   +    G P Q  TPF +    PT Y + +T +SVGG  +  + S          
Sbjct: 254 YATLNSTSASTGEPVQ-STPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGT 312

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLA--KEKRETSTSDLPFEYCYVLSPNQTNF-E 379
              I DSG++ TYL   AY  + + F           TS +D+  + C+V  P       
Sbjct: 313 GGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADV-LDTCFVWPPPPRKIVT 371

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            P +    +G       +  +++  +   L   CL +  SD+ +IIG
Sbjct: 372 MPELAFHFEGANMELPLENYMLIDGDTGNL---CLAIAASDDGSIIG 415


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 151/348 (43%), Gaps = 46/348 (13%)

Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
           S+G  +Y T + +G P  ++++ +D+GS L WL   C  C    +  +G      +Y P 
Sbjct: 102 SVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWL--QCAPCAVSCHPQAGP-----LYDPR 154

Query: 159 TSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLAT 212
            SST + VPC++  C ELQ     PS+ S    C YQ  Y  DG+ S G+L +D + L+ 
Sbjct: 155 ASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASY-GDGSFSFGYLSKDTVSLS- 212

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
               S        +GCG+   G F   A   GL GL  +K S+ S LA    + NSF+ C
Sbjct: 213 ----SSGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--VGNSFAYC 263

Query: 273 F---GSDGTGRISFG---DKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
                +   G +SFG   D  +PG+    +  S       Y +++  +SV G+ +    S
Sbjct: 264 LPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSS 323

Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
                  I DSGT  T L  P YT +S+   +        + S L  + C+         
Sbjct: 324 EYGSLPTIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSIL--QTCF--KGQVAKL 379

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             P VN+   GG    +    V+V          CL    +D+  IIG
Sbjct: 380 PVPAVNMAFAGGATLRLTPGNVLVDVNET---TTCLAFAPTDSTAIIG 424


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/256 (29%), Positives = 116/256 (45%), Gaps = 39/256 (15%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P  S++   
Sbjct: 78  TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSTSYQA 128

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + CN         C   G  C Y+ RY ++ + S+G L ED++    + + S     R  
Sbjct: 129 LKCNPDC-----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGNESQLSPQ---RAV 179

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC   +TG      A +G+ GLG  K SV   L ++G+I + FS+C+G    G G +  
Sbjct: 180 FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 238

Query: 284 GDKGSPGQG-----ETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
           G K SP  G       PF      P YNI + Q+ V G ++       N +   + DSGT
Sbjct: 239 G-KISPPPGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 293

Query: 332 SFTYLNDPAYTQISET 347
           ++ Y    A+  I + 
Sbjct: 294 TYAYFPKEAFIAIKDA 309


>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
          Length = 423

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 78/282 (27%), Positives = 124/282 (43%), Gaps = 23/282 (8%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSC--VHGL--NSSSGQVIDFNIYSPNT 159
           +  +++G PA  + + +DTGS L WL CD  C++C   H L      G  +   +Y P  
Sbjct: 39  FVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFVPHGLYKPEL 98

Query: 160 --SSTSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             +   ++  C     +L+K       N C Y ++Y+  G  S G L+ D   L      
Sbjct: 99  KYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGT 156

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
           +    + I+FGCG  Q  +  +   P NG+ GLG  K ++ S L +QG+I  +    C  
Sbjct: 157 N---PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCIS 213

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
           S G G + FGD   P  G T   + + H  Y+     +    N+          IFDSG 
Sbjct: 214 SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGA 273

Query: 332 SFTYLN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
           ++TY    P +  +S   ++L+KE +   E    D     C+
Sbjct: 274 TYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 315


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 76/256 (29%), Positives = 117/256 (45%), Gaps = 39/256 (15%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C        G+  D   + P  S++   
Sbjct: 78  TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSTSYQA 128

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           + CN   C     C   G  C Y+ RY ++ + S+G L ED++    + + S     R  
Sbjct: 129 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGNESQLSPQ---RAV 179

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC   +TG      A +G+ GLG  K SV   L ++G+I + FS+C+G    G G +  
Sbjct: 180 FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 238

Query: 284 GDKGSPGQG-----ETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
           G K SP  G       PF      P YNI + Q+ V G ++       N +   + DSGT
Sbjct: 239 G-KISPPPGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 293

Query: 332 SFTYLNDPAYTQISET 347
           ++ Y    A+  I + 
Sbjct: 294 TYAYFPKEAFIAIKDA 309


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 94/340 (27%), Positives = 146/340 (42%), Gaps = 45/340 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V VG+PA    + LDTGSD+ WL C  C  C    +          +Y P+ S++ 
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDP---------VYDPSVSTSY 213

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + V C+S  C       C ++  +C Y+V Y  DG+ + G    + L L      S    
Sbjct: 214 ATVGCDSPRCRDLDAAACRNSTGSCLYEVAY-GDGSYTVGDFATETLTLGDSAPVSN--- 269

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F+  A    L G  +   S PS ++       +FS C     S  +
Sbjct: 270 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 319

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
             + FGD   P          +T+  Y + ++ +SVGG A++   SA           I 
Sbjct: 320 STLQFGDSEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIV 379

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L   AY  + E F    +     S   L F+ CY L+  +++ + P V L  
Sbjct: 380 DSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSL-FDTCYDLA-GRSSVQVPAVALWF 437

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
           +GGG   +     ++  +  G   YCL     S  V+IIG
Sbjct: 438 EGGGELKLPAKNYLIPVDAAG--TYCLAFAGTSGPVSIIG 475


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 87/298 (29%), Positives = 131/298 (43%), Gaps = 40/298 (13%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           V +G PA  F V  DTGSD  W+ C  CV+  +             ++ P  S+T + + 
Sbjct: 165 VRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEP--------LFDPTKSATYANIS 216

Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           C+S+ C        +G +C Y ++Y  DG+ + GF  +D L LA D  ++        FG
Sbjct: 217 CSSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYDTIKN------FRFG 269

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
           CG    G F   A   GL GLG  KTS+P    ++      F+ C    S GTG +  G 
Sbjct: 270 CGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLDLG- 323

Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLN 337
            G+P      TP  + +    Y + +T + VGG+ +    S       + DSGT  T L 
Sbjct: 324 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 383

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQ-TNFEYPVVNLTMKGG 390
             AY  +   F+   K  +    S  P     + CY L+ ++  +   P V+L  +GG
Sbjct: 384 PSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGG 438


>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 73/251 (29%), Positives = 119/251 (47%), Gaps = 25/251 (9%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           Y  +++G+PA  + + +DTGS+L WL  +C   VHG      +      Y+P  +    K
Sbjct: 39  YATLNIGEPAKPYFLDVDTGSNLTWL--ECHPPVHGCKGCHPRP-PHPYYTP--ADGKLK 93

Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           V C S LC   ++     P    N    C Y+++Y++    S G L  D++ +   +K+ 
Sbjct: 94  VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
                RI+FGCG  Q        +P NG+ GLGM K    + L    +I  N    C  S
Sbjct: 151 -----RIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHCLSS 205

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
            G G +  GD   P +G T   +R++   Y+  + +V +    +  N  F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265

Query: 334 TYLNDPAYTQI 344
           T++    Y +I
Sbjct: 266 THVPAQIYNEI 276


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 132/324 (40%), Gaps = 38/324 (11%)

Query: 80  QGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVS 137
           Q   K  +T  A     R  SLG  +Y  ++ +G PA    V  DTGSDL W+ C  C  
Sbjct: 124 QARGKKGVTLPA----QRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSD 179

Query: 138 CVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDG 196
           C    +          ++ P  SST S VPC S  C+ L  +  S    C Y+V Y  D 
Sbjct: 180 CYEQKDP---------LFDPARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVY-GDQ 229

Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           + + G L  D L L   +     V     FGCG   TG F  G A +GL GLG +K S+ 
Sbjct: 230 SQTDGALARDTLTLTQSD-----VLPGFVFGCGEQDTGLF--GRA-DGLVGLGREKVSLS 281

Query: 257 SILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVS 313
           S  A++      FS C  S     G +S G         T    R   P+ Y + +  V 
Sbjct: 282 SQAASK--YGAGFSYCLPSSPSAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVK 339

Query: 314 VGGNAVNFE---FSA---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE 366
           V G  V      FSA   + DSGT  T L    Y  +   F  S+ +   + + +    +
Sbjct: 340 VAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILD 399

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGG 390
            CY  +   T    P V L   GG
Sbjct: 400 TCYDFT-GHTTVRIPSVALVFAGG 422


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 108/408 (26%), Positives = 163/408 (39%), Gaps = 62/408 (15%)

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQG---NDKTPLTFSAGNDTYRLNSLGFLHYTNVSV 111
           G++  +  L    +  +LR + L+A+             AGN  + +          +++
Sbjct: 53  GNYTKFERLQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMK---------LAI 103

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G PA ++   +DTGSDL W  C  C  C               I+ P  SS+ SK+PC+S
Sbjct: 104 GTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTP---------IFDPKKSSSFSKLPCSS 154

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
            LC       S    C Y   Y  D + + G L  +            SV S+I FGCG 
Sbjct: 155 DLCA-ALPISSCSDGCEYLYSY-GDYSSTQGVLATETFAFG-----DASV-SKIGFGCGE 206

Query: 231 VQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGD 285
              GS F  GA   GL GLG    S+ S L         FS C      S G   +  G 
Sbjct: 207 DNDGSGFSQGA---GLVGLGRGPLSLISQLGEP-----KFSYCLTSMDDSKGISSLLVGS 258

Query: 286 KGSPGQG-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
           + +      TP     + P+ Y +++  +SVG   +  E S            I DSGT+
Sbjct: 259 EATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTT 318

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
            TYL D A+  + + F S  K   + S S    + C+ L P+ +  + P +    +G   
Sbjct: 319 ITYLEDSAFAALKKEFISQLKLDVDESGST-GLDLCFTLPPDASTVDVPQLVFHFEGADL 377

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
               +  +I  S   GL + CL +  S  ++I G       NI + H+
Sbjct: 378 KLPAENYIIADS---GLGVICLTMGSSSGMSIFGNFQ--QQNIVVLHD 420


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 148/343 (43%), Gaps = 48/343 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V +G PA    + LDTGSD+ W+ C  C  C    +          ++ P+ S++ 
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASY 216

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + V C+S  C       C +A   C Y+V Y  DG+ + G    + L L           
Sbjct: 217 AAVSCDSQRCRDLDTAACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVGN--- 272

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F+  A    L G  +   S PS ++      ++FS C     S   
Sbjct: 273 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAA 322

Query: 279 GRISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA----------- 325
             + FGD  +     T   +R  +T   Y + ++ +SVGG  ++   SA           
Sbjct: 323 STLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGG 382

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+ T L   AY  + + F   A     TS   L F+ CY LS ++T+ E P V+
Sbjct: 383 VIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVS 440

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
           L  +GGG   +     ++  +  G   YCL    ++  V+IIG
Sbjct: 441 LRFEGGGALRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 481


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 136/344 (39%), Gaps = 43/344 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  NV +G P     +  DTGSDL W  C  CV   +             I+ P+ S T 
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQP--------IFDPSASKTY 205

Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + C ST C   K         + SNC Y ++Y  D + + GF  +D L L  ++    
Sbjct: 206 SNISCTSTACSGLKSATGNSPGCSSSNCVYGIQY-GDSSFTVGFFAKDTLTLTQND---- 260

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
            V     FGCG+   G F   A   GL GLG D  S+    A +      FS C  +   
Sbjct: 261 -VFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314

Query: 277 GTGRISFGDKGSPGQGE--------TPFSLRQTHPTYNITITQVSVGGNAVNFE------ 322
             G ++FG+       +        TPF+  Q    Y I +  +SVGG A++        
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT  T L    Y  +  TF      K  T+ +    + CY LS N T+   P 
Sbjct: 375 AGTIIDSGTVITRLPSTVYGSLKSTFKQFM-SKYPTAPALSLLDTCYDLS-NYTSISIPK 432

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++    G     +    +++++    + L   G    D + I G
Sbjct: 433 ISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFG 476


>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
 gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 405

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 113/273 (41%), Gaps = 30/273 (10%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           + +G P  +F   +DTGSDL W+ CD  C  C    N           Y P      + +
Sbjct: 53  MQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQ---------YKPK----GNII 99

Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           PC++ +C       +  CP+    C Y+V+Y   G+ S G LV D   L         + 
Sbjct: 100 PCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGS-SMGALVTDQFPLKL--VNGSFMQ 156

Query: 222 SRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
             ++FGCG  Q+  S     A  G+ GLG  K  + + L + GL  N    C  S G G 
Sbjct: 157 PPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGF 216

Query: 281 ISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
           + FGD   P  G   TP   +  H  Y      +   G     +    IFD+G+S+TY N
Sbjct: 217 LFFGDNLVPSIGVAWTPLLSQDNH--YTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFN 274

Query: 338 DPAY-TQISETFNSLAKEKRETSTSDLPFEYCY 369
             AY T I+   N L     + +  D     C+
Sbjct: 275 SKAYQTIINLIGNDLKVSPLKVAKEDKTLPICW 307


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 95/341 (27%), Positives = 142/341 (41%), Gaps = 41/341 (12%)

Query: 41  PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNS 100
           PV G     ++P    F     L  R + F++R     + G  K   T    +    +  
Sbjct: 82  PVTGAPKTINVPSTAEFLLQDQL--RVKSFQVRLSMNPSSGVFKEMQTTIPAS----IVP 135

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
            G  +   V +G P   F ++ DTGSDL W  C+   C+ G    +    D     P TS
Sbjct: 136 TGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCE--PCLGGCFPQNQPKFD-----PTTS 188

Query: 161 STSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++   V C+S  C+L        + C S  + C Y ++Y S  T+  GFL  + L +A+ 
Sbjct: 189 TSYKNVSCSSEFCKLIAEGNYPAQDCIS--NTCLYGIQYGSGYTI--GFLATETLAIASS 244

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +     V     FGC     G+F       GL GLG    ++PS   N+    N FS C 
Sbjct: 245 D-----VFKNFLFGCSEESRGTF---NGTTGLLGLGRSPIALPSQTTNK--YKNLFSYCL 294

Query: 274 GS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---AIFD 328
            +    TG +SFG + S     TP S +     Y +    +SV G  +    S    I D
Sbjct: 295 PASPSSTGHLSFGVEVSQAAKSTPISPKLKQ-LYGLNTVGISVRGRELPINGSISRTIID 353

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           SGT+FT+L  P Y+ +   F  +      T+ +   F+ CY
Sbjct: 354 SGTTFTFLPSPTYSALGSAFREMMANYTLTNGTS-SFQPCY 393


>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
 gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
          Length = 433

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 87/280 (31%), Positives = 130/280 (46%), Gaps = 38/280 (13%)

Query: 106 YTNV--SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           Y NV  S+GQPA  + + +DTGSDL WL CD  C  C+   +          +Y P    
Sbjct: 70  YYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHP---------LYRP---- 116

Query: 162 TSSKVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           +++ V C   LC    Q P   +      C Y+V Y +DG  S G LV+DV  L  +   
Sbjct: 117 SNNLVICEDPLCA-SLQPPGVHNCQDPDQCDYEVEY-ADGGSSLGVLVKDVFVL--NFTN 172

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
            K ++  ++ GCG  Q    L G +    +G+ GLG   +S+PS L++QGL+ N    C 
Sbjct: 173 GKRLNPLLALGCGYDQ----LPGRSNHPLDGILGLGRGISSIPSQLSSQGLVSNVIGHCL 228

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
              G G + FG+    S G   TP S R     Y+    ++   G +        +FDSG
Sbjct: 229 SGRGGGFLFFGEDIYDSSGVTWTPMS-RDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSG 287

Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
           +S+TYLN  AY  +  +    L+++    +  D     C+
Sbjct: 288 SSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCW 327


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 95/310 (30%), Positives = 136/310 (43%), Gaps = 43/310 (13%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           L++L ++    VS+G PA++  V +DTGSD+ W+ C          + +G  + F+   P
Sbjct: 120 LDTLAYV--ITVSIGTPAMTQAVMIDTGSDVSWVHCHA-------RAGAGSSLFFD---P 167

Query: 158 NTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
             SST +   C+S  C   E +    S  S C Y VRY  DG+ +TG    D L L + E
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRY-GDGSNTTGTYGSDTLALNSTE 226

Query: 215 KQSKSVDSRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS-FSMC 272
           K          FGC      G  LD    +GL GLG      PS+++       S FS C
Sbjct: 227 KVEN-----FQFGCSETSDPGEGLDEDQTDGLMGLG---GGAPSLVSQTAATYGSAFSYC 278

Query: 273 F--GSDGTGRISFG-DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVN-----FEF 323
               +  +G ++ G   G+ G   TP    +  PT+   I Q ++VGG+ V      F  
Sbjct: 279 LPATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA 338

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAK---EKRETSTSDLPFEYCYVLSPNQTNFEY 380
            +I DSGT  T L   AY+ +S  F +  +     R  S  D  F++       Q N   
Sbjct: 339 GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFT-----GQDNVSI 393

Query: 381 PVVNLTMKGG 390
           P V L   GG
Sbjct: 394 PAVELVFSGG 403


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 88/322 (27%), Positives = 136/322 (42%), Gaps = 45/322 (13%)

Query: 69  YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDL 128
           + R + R   +Q +D++P T +  ++      + F       +G P +      DTGSDL
Sbjct: 62  FARSKRRLRLSQNDDRSPGTITIPDEPITEYLMRFY------IGTPPVERFAIADTGSDL 115

Query: 129 FWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QKQCPSAG 183
            W+ C  C  CV           +  ++ P  SST   VPC+S  C L    Q+ C    
Sbjct: 116 IWVQCAPCEKCVPQ---------NAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKS 166

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
             C YQ  Y  D T+ +G L  + ++  +     K    +++FGC      +  +     
Sbjct: 167 GQCYYQYIY-GDHTLVSGILGFESINFGSKNNAIKF--PKLTFGCTFSNNDTVDESKRNM 223

Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGD----KGSPGQGETPF 296
           GL GLG+   S+ S L  Q  I   FS CF    S+ T ++ FG+    K   G   TP 
Sbjct: 224 GLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPL 281

Query: 297 SLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNS 350
            ++   P+ Y + +  VS+G   V    S      + DSGTSFT L    Y +    F +
Sbjct: 282 IIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNK----FVA 337

Query: 351 LAKEKRETSTSDLP---FEYCY 369
           L KE        +P   + +C+
Sbjct: 338 LVKEVYGVEAVKIPPLVYNFCF 359


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 137/344 (39%), Gaps = 42/344 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   V +G P   F V +DTGSDL W+ C      +  N S        ++ PNTS++ +
Sbjct: 3   YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDS--------LFIPNTSTSFT 54

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           K+ C + LC          + C Y   Y  DG++STG  V D + +     Q + V    
Sbjct: 55  KLACGTELCNGLPYPMCNQTTCVYWYSY-GDGSLSTGDFVYDTITMDGINGQKQQV-PNF 112

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
           +FGCG    GSF   A  +G+ GLG    S PS L    +    FS C          T 
Sbjct: 113 AFGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTS 167

Query: 280 RISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA---------- 325
            + FGD   P      +    T+P     Y + +  +SVGG  +N   +A          
Sbjct: 168 PLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAG 227

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            IFDSGT+ T L    + ++    N+   +    S      + C            P + 
Sbjct: 228 TIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMT 287

Query: 385 LTMKGGGPFF--VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              +GG       N  I + SS+      YC  +V S +V IIG
Sbjct: 288 FHFEGGDMELPPSNYFIFLESSQS-----YCFSMVSSPDVTIIG 326


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 125/302 (41%), Gaps = 39/302 (12%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
            VG P+  F++  DTGSDL W+ C   C S  +  N  + ++    ++  N SS+   +P
Sbjct: 88  KVGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSSFKTIP 146

Query: 168 CNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           C + +C+++         CP+  + C Y  RY SDG+ + GF   + + +   E +   +
Sbjct: 147 CLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEGRKMKL 205

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
            + +  GC     G     A  +G+ GLG  K S     A +      FS C        
Sbjct: 206 HN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVDHLSHK 260

Query: 276 DGTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA----- 325
           + +  ++FG   S          T   L   +  Y + +  +S+GG  +           
Sbjct: 261 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 320

Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               I DSG+S T+L +PAY  +         + R+      P EYC+    N T FE  
Sbjct: 321 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF----NSTGFEES 376

Query: 382 VV 383
           +V
Sbjct: 377 LV 378


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 46/354 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  V +G PA  + + +DTGS L WL C  CV   H        V    ++ P+ S T 
Sbjct: 13  YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCH--------VQADPLFDPSASKTY 64

Query: 164 SKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             + C S+ C            C ++ + C Y   Y  D + S G+L +D+L LA  +  
Sbjct: 65  KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASY-GDSSYSMGYLSQDLLTLAPSQTL 123

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
              V     +GCG+   G F   A   G+ GLG +K S+   ++++     +FS C  + 
Sbjct: 124 PGFV-----YGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR 173

Query: 277 GTGR-ISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSAIF 327
           G G  +S G     G     TP +    +P+ Y + +T ++VGG A+      +    I 
Sbjct: 174 GGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTII 233

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE-YPVVNLT 386
           DSGT  T L    YT   + F  +   K   +      + C+    N  + +  P V L 
Sbjct: 234 DSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCF--KGNLKDMQSVPEVRLI 291

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
            +GG    +  P+ ++    +G  L CL    ++ V IIG    + + +A++IS
Sbjct: 292 FQGGADLNLR-PVNVLLQVDEG--LTCLAFAGNNGVAIIGNHQQQTFKVAHDIS 342


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 136/311 (43%), Gaps = 44/311 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
           +   VS G PA+  +V +DTGSD+ WL C           SSGQ       +Y P+ SST
Sbjct: 79  YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK--------PCSSGQCFPQKDPLYDPSHSST 130

Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            S VPC S +C+          C ++G  C + + Y +DGT + G   +D L LA     
Sbjct: 131 YSAVPCASDVCKKLAADAYGSGC-TSGKQCGFAISY-ADGTSTVGAYSQDKLTLAPG--- 185

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
             ++     FGCG    G        +G+ GLG  +    S+ A  G +   FS C  S 
Sbjct: 186 --AIVQNFYFGCGH---GKHAVRGLFDGVLGLGRLRE---SLGARYGGV---FSYCLPSV 234

Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
            +  G ++ G   +P G   TP       PT++ +T+  ++VGG  ++   SA     I 
Sbjct: 235 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIV 294

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT  T L   AY  +   F    +  R     DL  + CY L+    N   P + LT 
Sbjct: 295 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL--DTCYNLT-GYKNVVVPKIALTF 351

Query: 388 KGGGPFFVNDP 398
            GG    ++ P
Sbjct: 352 TGGATINLDVP 362


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/306 (30%), Positives = 133/306 (43%), Gaps = 37/306 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  +SVG P     + +DTGSD+ WL C  CV+C H  ++         I+ P  SST 
Sbjct: 58  YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDA---------IFDPYKSSTY 108

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S + C++  C          + C YQV Y  DG+ +TG    D + L +     + V ++
Sbjct: 109 STLGCSTRQCLNLDIGTCQANKCLYQVDY-GDGSFTTGEFGTDDVSLNSTSGVGQVVLNK 167

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
           I  GCG    G F+  A    L GLG    S P+ +  Q      FS C     T     
Sbjct: 168 IPLGCGHDNEGYFVGAAG---LLGLGKGPLSFPNQVDPQN--GGRFSYCLTDRETDSTEG 222

Query: 279 GRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
             + FG+   P  G   TP       PT Y + +T +SVGG  +    SA          
Sbjct: 223 SSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGG 282

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGTS T L + AY  + + F +   +   T+   L F+ CY LS    + + P V 
Sbjct: 283 VIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSL-FDTCYDLS-GLASVDVPTVT 340

Query: 385 LTMKGG 390
           L  +GG
Sbjct: 341 LHFQGG 346


>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 467

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 112/277 (40%), Gaps = 36/277 (12%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +Y  +++G P   F + +DTGSDL W+ CD  C  C                Y PN
Sbjct: 65  LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 114

Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
            ++    +PC+  LC        + C      C Y++ Y SD   S G LV D   L   
Sbjct: 115 HNT----LPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGY-SDHASSIGALVTDEFPLKL- 168

Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                 ++  ++FGCG   Q           G+ GLG  K  + + L + G+  N    C
Sbjct: 169 -ANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHC 227

Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
               G G +S GD+  P  G T  SL     + N       +     + G   +N     
Sbjct: 228 LSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGIN----V 283

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           +FDSG+S+TY N  AY  I +        K  T T D
Sbjct: 284 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 320


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 76/301 (25%), Positives = 125/301 (41%), Gaps = 39/301 (12%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           VG P+  F++  DTGSDL W+ C   C S  +  N  + ++    ++  N SS+   +PC
Sbjct: 18  VGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSSFKTIPC 76

Query: 169 NSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            + +C+++         CP+  + C Y  RY SDG+ + GF   + + +   E +   + 
Sbjct: 77  LTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEGRKMKLH 135

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
           + +  GC     G     A  +G+ GLG  K S     A +      FS C        +
Sbjct: 136 N-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVDHLSHKN 190

Query: 277 GTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
            +  ++FG   S          T   L   +  Y + +  +S+GG  +            
Sbjct: 191 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKGA 250

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSG+S T+L +PAY  +         + R+      P EYC+    N T FE  +
Sbjct: 251 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF----NSTGFEESL 306

Query: 383 V 383
           V
Sbjct: 307 V 307


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 149/343 (43%), Gaps = 48/343 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V +G PA    + LDTGSD+ W+ C  C  C    +          ++ P+ S++ 
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASY 219

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + V C+S  C       C +A   C Y+V Y  DG+ + G    + L L      +    
Sbjct: 220 AAVSCDSPRCRDLDTAACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVTN--- 275

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F+  A    L G  +   S PS ++      ++FS C     S   
Sbjct: 276 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAA 325

Query: 279 GRISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA----------- 325
             + FG  G+     T   +R  +T   Y + ++ +SVGG A++   SA           
Sbjct: 326 STLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGG 385

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+ T L   AY  + + F         TS   L F+ CY LS ++T+ E P V+
Sbjct: 386 VIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVS 443

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
           L  +GGG   +     ++  +  G   YCL    ++  V+IIG
Sbjct: 444 LRFEGGGALRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 484


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 136/311 (43%), Gaps = 44/311 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
           +   VS G PA+  +V +DTGSD+ WL C           SSGQ       +Y P+ SST
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK--------PCSSGQCFPQKDPLYDPSHSST 164

Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            S VPC S +C+          C ++G  C + + Y +DGT + G   +D L LA     
Sbjct: 165 YSAVPCASDVCKKLAADAYGSGC-TSGKQCGFAISY-ADGTSTVGAYSQDKLTLAPG--- 219

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
             ++     FGCG    G        +G+ GLG  +    S+ A  G +   FS C  S 
Sbjct: 220 --AIVQNFYFGCGH---GKHAVRGLFDGVLGLGRLRE---SLGARYGGV---FSYCLPSV 268

Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
            +  G ++ G   +P G   TP       PT++ +T+  ++VGG  ++   SA     I 
Sbjct: 269 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIV 328

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT  T L   AY  +   F    +  R     DL  + CY L+    N   P + LT 
Sbjct: 329 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL--DTCYNLT-GYKNVVVPKIALTF 385

Query: 388 KGGGPFFVNDP 398
            GG    ++ P
Sbjct: 386 TGGATINLDVP 396


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 91/307 (29%), Positives = 130/307 (42%), Gaps = 46/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  N+S+G PA  F   +DTGSDL W    C  C    N S+       I++P  SS+ S
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST------PIFNPQGSSSFS 146

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            +PC+S LC+  +    + ++C Y   Y  DG+ + G +  + L        S S+   I
Sbjct: 147 TLPCSSQLCQALQSPTCSNNSCQYTYGY-GDGSETQGSMGTETLTFG-----SVSIP-NI 199

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRI 281
           +FGCG    G F  G    GL G+G    S+PS L         FS C    GS  +  +
Sbjct: 200 TFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTSSTL 252

Query: 282 SFGD------KGSPGQGETPFSLRQTHPTYNITITQVSVGG------------NAVNFEF 323
             G        GSP    T     Q    Y IT+  +SVG             N+ N   
Sbjct: 253 LLGSLANSVTAGSP--NTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT+ TY  D AY  + + F S         +S   F+ C+ +  +Q+N + P  
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSS-GFDLCFQMPSDQSNLQIPTF 369

Query: 384 NLTMKGG 390
            +   GG
Sbjct: 370 VMHFDGG 376


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 115/256 (44%), Gaps = 37/256 (14%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P  +F + +DTGS + ++PC  C  C    +           + P  SST   
Sbjct: 92  TRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FEPELSSTYQP 142

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C +    C Y+ +Y ++ + S+G L ED++       QS+ V  R  
Sbjct: 143 VSCN-----IDCTCDNERKQCVYERQY-AEMSSSSGVLGEDIISFG---NQSELVPQRAI 193

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
           FGC   +TG      A +G+ GLG    S+   L  +G+I +SFS+C+G    G G +  
Sbjct: 194 FGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMIL 252

Query: 284 GDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTS 332
           G    P    +     ++ P     YNI +  + V G  ++ + S        + DSGT+
Sbjct: 253 GGISPP----SGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTT 308

Query: 333 FTYLNDPAYTQISETF 348
           + YL + A+T   +  
Sbjct: 309 YAYLPEAAFTAFKDAM 324


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 131/305 (42%), Gaps = 42/305 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  N+S+G PA  F   +DTGSDL W    C  C    N S+       I++P  SS+ S
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST------PIFNPQGSSSFS 146

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            +PC+S LC+  +    + ++C Y   Y  DG+ + G +  + L        S S+   I
Sbjct: 147 TLPCSSQLCQALQSPTCSNNSCQYTYGY-GDGSETQGSMGTETLTFG-----SVSIP-NI 199

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRI 281
           +FGCG    G F  G    GL G+G    S+PS L         FS C    GS  +  +
Sbjct: 200 TFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSNSSTL 252

Query: 282 ---SFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAVNFEFSA 325
              S  +  + G   T        PT Y IT+  +SVG             N+ N     
Sbjct: 253 LLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGI 312

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT+ TY  D AY  + + F S         +S   F+ C+ +  +Q+N + P   +
Sbjct: 313 IIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSS-GFDLCFQMPSDQSNLQIPTFVM 371

Query: 386 TMKGG 390
              GG
Sbjct: 372 HFDGG 376


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 156/376 (41%), Gaps = 31/376 (8%)

Query: 56  SFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPA 115
           +F   + +  RD+  R++        N  T   F+           G  +   V +G P 
Sbjct: 84  TFPSAAEILRRDQ-LRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPK 142

Query: 116 LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
             F +  DTGSDL W  C+   C  G    + +  D    +   + + S  PC S   E 
Sbjct: 143 KDFSLLFDTGSDLTWTQCE--PCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKES 200

Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
            + C S+ S C Y V+Y +  T+  GFL  + L +   +     V      GCG    G 
Sbjct: 201 AQGCSSSNS-CLYGVKYGTGYTV--GFLATETLTITPSD-----VFENFVIGCGERNGGR 252

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE 293
           F   A   GL GLG    ++PS  ++     N FS C    S  TG +SFG   S     
Sbjct: 253 FSGTA---GLLGLGRSPVALPSQTSST--YKNLFSYCLPASSSSTGHLSFGGGVSQAAKF 307

Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISET 347
           TP +  +    Y + ++ +SVGG  +  + S       I DSGT+ TYL   A++ +S  
Sbjct: 308 TPIT-SKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSA 366

Query: 348 FNSLAKEKRETS-TSDLPFEYCYVLSPNQT-NFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
           F  +      T  TS L  + CY  S +   N   P +++  +GG    ++D  + +++ 
Sbjct: 367 FQEMMTNYTLTKGTSGL--QPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAAN 424

Query: 406 PKGLYLYCLGVVKSDN 421
             GL   CL    + N
Sbjct: 425 --GLEEVCLAFKDNGN 438


>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +  ++++G P   + + +DTGSDL W+ CD  C  C    N          +Y PN
Sbjct: 61  LGY-YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRN---------RLYKPN 110

Query: 159 TSSTSSKVPCNSTLCELQKQCPS---AGSN--CPYQVRYLSDGTMSTGFLVEDVLHLA-T 212
                + V C   LC+  +  P+   AG N  C Y+V Y   G+ S G L+ D + L  T
Sbjct: 111 ----GNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGS-SLGVLLRDNIPLKFT 165

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           +   ++ +   ++FGCG  Q     +  A+  G+ GLG  KTS+ S L + GLI N    
Sbjct: 166 NGSLARPI---LAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGH 222

Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITI---------TQVSVGGNAVNFE 322
           C    G G + FGD+  P  G     L Q+  T +               SV G      
Sbjct: 223 CLSERGGGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKG------ 276

Query: 323 FSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
              IFDSG+S+TY N  A+   ++   N L  +    +T D     C+
Sbjct: 277 LQLIFDSGSSYTYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICW 324


>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 453

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/272 (30%), Positives = 121/272 (44%), Gaps = 32/272 (11%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
            ++ +G P   + + +D+GSDL WL CD  CVSC    +           Y PN      
Sbjct: 70  VSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPP---------YKPN----KG 116

Query: 165 KVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            + CN  +C       +  C ++   C Y+V Y   G+ S G LV D+  L        +
Sbjct: 117 PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS-SLGVLVHDIFSLQLTNGTLAA 175

Query: 220 VDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
              R++FGCG  Q  S+    AP   +G+ GLG  K+S+ + L + GLI +    C    
Sbjct: 176 --PRLAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGR 231

Query: 277 GTGRISFGDKGS--PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
           G G +  GD  S  PG   TP S +     Y +    +   G     +    +FDSG+S+
Sbjct: 232 GGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSY 291

Query: 334 TYLNDPAY-TQISETFNSLAKEKRETSTSDLP 364
           TY N  AY T +S     L  + +ET+   LP
Sbjct: 292 TYFNAQAYKTTLSLVRKYLNGKLKETADESLP 323


>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
          Length = 654

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/386 (26%), Positives = 156/386 (40%), Gaps = 58/386 (15%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
           H DRY R     +     +  PL    G            HYT V  G P     V  DT
Sbjct: 38  HPDRYARRLN--IEEDAPEIVPLHLGLGT-----------HYTWVYAGTPPQRASVIADT 84

Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-KQCPSAG 183
           GS L   PC   S   G  S + Q      +  + SST   V C+      Q K+C    
Sbjct: 85  GSGLMAFPC---SGCDGCGSHTDQP-----FQADNSSTLIHVTCSQQQSHFQCKECTEKS 136

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTGSFLD 238
             C     Y+ +G+     +VEDV++L       DE       +   FGC   +TG F+ 
Sbjct: 137 DTCAISQSYM-EGSSWKASVVEDVVYLGGESSFHDEAMRDRYGTHFQFGCQSSETGLFVT 195

Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPG-QGETPF 296
             A +G+ GL    T + + L  +  IP N FS+CF  +G G +S G+  +   +GE  +
Sbjct: 196 QVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSLCFTENG-GTMSVGEPNTKAHRGEISY 253

Query: 297 SL----RQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISE 346
           +     R     YN+ +  + +GG ++N +  A      I DSGT+ +YL      +  +
Sbjct: 254 AKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYTRGHYIVDSGTTDSYLPRAMKNEFLQ 313

Query: 347 TFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEP 406
            F  +A    +  TS      C+  + N+     P + L M+  G     +  VI+   P
Sbjct: 314 VFKEVAGRDYQVGTS------CHGYT-NEDLASLPKIQLVMEAYGD---ENGEVIIDIPP 363

Query: 407 KGLYL-----YCLGVVKSDNV-NIIG 426
           +   L     YC  +  S+N   +IG
Sbjct: 364 EQYLLHNDNSYCGSIYLSENAGGVIG 389


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 152/362 (41%), Gaps = 57/362 (15%)

Query: 62  ALAHRDRYFRLRGRGL---AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           +L+ R R  R R + +   A++ N   P       D+         +   V +G PA+S 
Sbjct: 81  SLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSLE-------YVVTVGLGTPAVSQ 133

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQK 177
           ++ +DTGSDL W+   C  C    NS++       ++ P+ SST + +PCN+  C +L +
Sbjct: 134 VLLIDTGSDLSWV--QCAPC----NSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTR 187

Query: 178 -----QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
                 C S    G+ C Y + Y  DG+ +TG    + L +A              FGCG
Sbjct: 188 DGYGSDCTSGSGGGAQCGYAITY-GDGSQTTGVYSNETLTMAPGVTVKD-----FHFGCG 241

Query: 230 RVQTGSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISF 283
             Q G       PN    GL GLG    S+  ++    +   +FS C    +D  G ++ 
Sbjct: 242 HDQDG-------PNDKYDGLLGLGGAPESL--VVQTSSVYGGAFSYCLPAANDQAGFLAL 292

Query: 284 GDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYL 336
           G   +   G   TP  +R+    Y + +T ++VGG  ++   SA     I DSGT  T L
Sbjct: 293 GAPVNDASGFVFTPM-VREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTEL 351

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
              AY  +   F             +L  + CY  +   +N   P V LT  GG    ++
Sbjct: 352 QHTAYAALQAAFRKAMAAYPLLPNGEL--DTCYNFT-GHSNVTVPRVALTFSGGATVDLD 408

Query: 397 DP 398
            P
Sbjct: 409 VP 410


>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
          Length = 410

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/278 (26%), Positives = 121/278 (43%), Gaps = 28/278 (10%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
           +  +++G PA  + + +DTGS L WL CD  C++C           +   +Y P    + 
Sbjct: 39  FVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89

Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             ++  C     +L+K       N C Y ++Y+  G  S G L+ D   L      +   
Sbjct: 90  KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144

Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
            + I+FGCG  Q  +  +   P NG+ GLG  K ++ S L +QG+I  +    C  S G 
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTY 335
           G + FGD   P  G T   + + H  Y+     +    N+          IFDSG ++TY
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTY 264

Query: 336 LN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
               P +  +S   ++L+KE +   E    D     C+
Sbjct: 265 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 302


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 70/269 (26%), Positives = 115/269 (42%), Gaps = 53/269 (19%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L++  + +G P+  + V +DTGSD+ W+ C  C  C     + S   I   +Y P +S +
Sbjct: 26  LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKC----PTKSDLGIKLTLYDPASSVS 81

Query: 163 SSKVPCNSTLC---------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
           +++V C+   C         + +K+ P     C Y V Y  DG+ + G+ V D +     
Sbjct: 82  ATRVSCDDDFCTSTYNGLLPDCKKELP-----CQYNVVY-GDGSSTAGYFVSDAVQFERV 135

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           T   Q+   +  ++FGCG  Q+G            GLG    ++  IL        +F+ 
Sbjct: 136 TGNLQTGLSNGTVTFGCGAQQSG------------GLGTSGEALDGILG-------AFAH 176

Query: 272 CFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------- 321
           C  + +G G  + G+  SP    TP    Q H  YN+ + ++ VGG  +           
Sbjct: 177 CLDNVNGGGIFAIGELVSPKVNTTPMVPNQAH--YNVYMKEIEVGGTVLELPTDVFDSGD 234

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNS 350
               I DSGT+  YL +  Y  +     S
Sbjct: 235 RRGTIIDSGTTLAYLPEVVYDSMMNEIRS 263


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 158/357 (44%), Gaps = 53/357 (14%)

Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
           SLG  +Y  ++ +G P    ++  DTGSDL W  C           S+ +  D     P 
Sbjct: 128 SLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC-----------SAAETFD-----PT 171

Query: 159 TSSTSSKVPCNSTLCELQKQC---PS--AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
            S++ + V C++ LC         PS  A S C Y ++Y  DG+ S GFL ++ L + + 
Sbjct: 172 KSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQY-GDGSYSIGFLGKERLTIGST 230

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +     + +   FGCG+   G F   A   GL GLG DK SV S  A +      FS C 
Sbjct: 231 D-----IFNNFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQLFSYCL 280

Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----- 325
             S  TG +SFG   S     TP S   + P+  YN+ +T ++VGG  +    S      
Sbjct: 281 PSSSSTGFLSFGSSQSKSAKFTPLS---SGPSSFYNLDLTGITVGGQKLAIPLSVFSTAG 337

Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I DSGT  T L   AY+ +   F  ++A        S L  + CY  S  +T  + P +
Sbjct: 338 TIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSIL--DTCYDFSKYKT-IKVPKI 394

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
            ++  GG    V+   + V++  K + L   G   + +  I G    R + +  ++S
Sbjct: 395 VISFSGGVDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVS 451


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/258 (29%), Positives = 114/258 (44%), Gaps = 30/258 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++   ++GQP   + +  DTGSDL WL CD  C+ C    +          +Y P     
Sbjct: 67  YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHP---------LYQPTNDLV 117

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             K P  ++L     +C      C Y+V Y +DG  S G LV D+     +         
Sbjct: 118 VCKDPICASLHPDNYRCDDP-DQCDYEVEY-ADGGSSIGVLVNDLF--PVNLTSGMRARP 173

Query: 223 RISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           R++ GCG  Q    L G A    +G+ GLG   +S+ + L++QGL+ N    CF   G G
Sbjct: 174 RLTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGG 229

Query: 280 RISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
            + FGD    S     TP S R     Y     ++ + G +   +    +FDSG+S+TY 
Sbjct: 230 YLFFGDDIYDSSKVIWTPMS-RDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYF 288

Query: 337 NDPAYTQISETFNSLAKE 354
           N    TQ  +T  S  K+
Sbjct: 289 N----TQTYQTLLSFIKK 302


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 143/351 (40%), Gaps = 51/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   ++VG P +  ++ALDT SDL WL C  C  C       SG V D     P  S++ 
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 188

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
            ++  N+  C+   +     +    C Y V Y  DG+ + G  +E+ L  A   +     
Sbjct: 189 REMSFNAADCQALGRSGGGDAKRGTCVYTVGY-GDGSTTVGDFIEETLTFAGGVRL---- 243

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGT 278
             RIS GCG    G F  GA   G+ GLG    S P+ + + G    +FS C      G 
Sbjct: 244 -PRISIGCGHDNKGLF--GAPAAGILGLGRGLMSFPNQIDHNG----TFSYCLVDFLSGP 296

Query: 279 GRIS----FGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV----------- 319
           G +S    FG      SP    TP  L    PT Y + +T +SVGG  V           
Sbjct: 297 GSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLD 356

Query: 320 --NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCYVLSPNQ 375
                   I DSGT+ T L  PAYT   + F ++A +  + S       F+ CY +    
Sbjct: 357 PYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRG 416

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              + P V++   G     +     ++  +  G   +        +V+IIG
Sbjct: 417 MK-KVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIG 466


>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 81/268 (30%), Positives = 121/268 (45%), Gaps = 24/268 (8%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
            ++ +G P   + + +D+GSDL WL CD  CVSC    +           Y PN    + 
Sbjct: 37  VSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPP---------YKPNKGPITC 87

Query: 165 KVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             P C++     +  C ++   C Y+V Y   G+ S G LV D+  L        +   R
Sbjct: 88  NDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS-SLGVLVHDIFSLQLTNGTLAA--PR 144

Query: 224 ISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
           ++FGCG  Q  S+    AP   +G+ GLG  K+S+ + L + GLI +    C    G G 
Sbjct: 145 LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGF 202

Query: 281 ISFGDKGS--PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
           +  GD  S  PG   TP S +     Y +    +   G     +    +FDSG+S+TY N
Sbjct: 203 LFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFN 262

Query: 338 DPAY-TQISETFNSLAKEKRETSTSDLP 364
             AY T +S     L  + +ET+   LP
Sbjct: 263 AQAYKTTLSLVRKYLNGKLKETADESLP 290


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/265 (30%), Positives = 117/265 (44%), Gaps = 35/265 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +VS+G P + ++   DTGSDL W  C  C+ C   L           I++P  S++ 
Sbjct: 92  YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRP---------IFNPLKSTSF 142

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           S VPCN+  C       C   G  C Y   Y  D T S G L  + + +      S SV 
Sbjct: 143 SHVPCNTQTCHAVDDGHCGVQGV-CDYSYTY-GDRTYSKGDLGFEKITIG-----SSSVK 195

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGT 278
           S I  GCG   +G F      +G+ GLG  + S+ S ++    I   FS C     S   
Sbjct: 196 SVI--GCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN 250

Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----AIFDSGT 331
           G+I+FG+      PG   TP   + T   Y IT+  +S+ GN  +  F+     I DSGT
Sbjct: 251 GKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI-GNERHMAFAKQGNVIIDSGT 309

Query: 332 SFTYLNDPAYTQISETFNSLAKEKR 356
           + T L    Y  +  +   + K KR
Sbjct: 310 TLTILPKELYDGVVSSLLKVVKAKR 334


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 159/389 (40%), Gaps = 65/389 (16%)

Query: 43  KGILAVDDLPKKGSFA--YYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGNDTYRLN 99
           KG  A D   KK SFA    S  A  D   R   GR + ++G   +  T+  G     ++
Sbjct: 68  KGSSATDK--KKPSFAERLRSDRARADHILRKASGRRMMSEGGGASIPTYLGG----FVD 121

Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYS 156
           SL ++    + +G PA+   V +DTGSDL W+   PC+   C    +          ++ 
Sbjct: 122 SLEYV--VTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDP---------LFD 170

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAG-------------SNCPYQVRYLSDGTMSTGFL 203
           P+ SST + +PC S  C   KQ P  G               C Y + Y  +G ++ G  
Sbjct: 171 PSKSSTFATIPCASDAC---KQLPVDGYDNGCTNNTSGMPPQCGYAIEY-GNGAITEGVY 226

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
             + L L      S +V     FGCG  Q G +      +GL GLG    S+ S  A+  
Sbjct: 227 STETLALG-----SSAVVKSFRFGCGSDQHGPYDKF---DGLLGLGGAPESLVSQTAS-- 276

Query: 264 LIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHP-------TYNITITQVSV 314
           +   +FS C    + G G ++ G   S     + F     H         Y +T+T +SV
Sbjct: 277 VYGGAFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISV 336

Query: 315 GGNAVN-----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           GG A++     F    I DSGT  T +   AY  +   F S   E      +D   + CY
Sbjct: 337 GGKALDIPPAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCY 396

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
             + + T    P V LT  GG    ++ P
Sbjct: 397 NFTGHGT-VTVPKVALTFVGGATVDLDVP 424


>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 418

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 79/282 (28%), Positives = 118/282 (41%), Gaps = 36/282 (12%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYSPN 158
           ++  ++++G P   + + +DTGSDL W+ CD     C  C          +    +Y PN
Sbjct: 61  IYTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCT---------LPKDKLYKPN 111

Query: 159 TSSTSSKVPCNSTLCE--------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
            +     V C+  +C           ++C      C Y+V Y +D   STG L  D +H+
Sbjct: 112 GNQL---VKCSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEY-ADNAESTGALARDYMHI 167

Query: 211 ATDEKQSKSVDSRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
            +    S S    + FGCG  Q         +  G+ GLG  K S+ S L + G I N  
Sbjct: 168 GS---PSGSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVL 224

Query: 270 SMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAI 326
             C  ++G G +  GDK  P  G   TP         Y+     +   G     +    I
Sbjct: 225 GHCLSAEGGGYLFLGDKFIPSSGIFWTPIIQSSLEKHYSTGPVDLFFNGKPTPAKGLQII 284

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPF 365
           FDSG+S+TY +   YT ++   N+  K K   RET    LP 
Sbjct: 285 FDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKDPSLPI 326


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 144/351 (41%), Gaps = 55/351 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   + +G PA  +   LDTGSDL W  C  C+ CV        Q   +  + P  SST 
Sbjct: 92  YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPANSSTY 142

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C            C YQ  Y  D   + G L  +     T++  ++    R
Sbjct: 143 RSLGCSAPACNALYYPLCYQKTCVYQYFY-GDSASTAGVLANETFTFGTND--TRVTLPR 199

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           ISFGCG +  GS  +G+   G+ G G    S+ S L +       FS C   F S    R
Sbjct: 200 ISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSR 251

Query: 281 ISFG------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-------- 325
           + FG         +     TPF +    PT Y + +T +SVGGN +  + +         
Sbjct: 252 LYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG 311

Query: 326 ----IFDSGTSFTYLNDPAYTQISETF----NSLAK--EKRETSTSDLPFEYCYVLSPNQ 375
               I DSGT+ TYL +PAY  + E F    NS     +  ETS  D  F++     P +
Sbjct: 312 TGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWP---PPPR 368

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +   P + L   G          ++V     GL   CL +  S + +IIG
Sbjct: 369 QSVTLPQLVLHFDGADWELPLQNYMLVDPSTGGL---CLAMATSSDGSIIG 416


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/335 (27%), Positives = 141/335 (42%), Gaps = 42/335 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  NV +G P     +  DTGS L W  C  C +C   +           ++ P  S++ 
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV----------PVFDPTKSASF 181

Query: 164 SKVPCNSTLCELQKQ-CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKS 219
             +PC+S LC+  +Q C S    C Y   Y+ D + STG L  + +   HL  D K    
Sbjct: 182 KGLPCSSKLCQSIRQGCSSP--KCTYLTAYV-DNSSSTGTLATETISFSHLKYDFKN--- 235

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
               I  GC    +G  L     +G+ GL     S+ S  AN  +    FS C  S    
Sbjct: 236 ----ILIGCSDQVSGESL---GESGIMGLNRSPISLASQTAN--IYDKLFSYCIPSTPGS 286

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFSAIFDSGTS 332
           TG ++FG K       +P S       Y+I +T +SVGG     +A  F+ ++  DSG  
Sbjct: 287 TGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAV 346

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
            T L   AY+ +   F  + K        D   + CY  S N +    P +++  +GG  
Sbjct: 347 LTRLPPKAYSALRSVFREMMKGYPLLDQDDF-LDTCYDFS-NYSTVAIPSISVFFEGGVE 404

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
             ++  +  +  +  G  +YCL   +  D V+I G
Sbjct: 405 MDID--VSGIMWQVPGSKVYCLAFAELDDEVSIFG 437


>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 413

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 38/357 (10%)

Query: 105 HYTNV-SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           H+T + ++G P+  F + +DTGSDL W+ CD  C+ C    +          +Y P+ ++
Sbjct: 52  HFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDM---------LYRPHNNA 102

Query: 162 TSSKVPCNSTLCELQKQC-PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
            S + P  + L  L K    +    C Y+V Y   G+ S G LV+D++ +       K +
Sbjct: 103 VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYADHGS-SVGVLVKDLVPMRL--TNGKRI 159

Query: 221 DSRISFGCGRVQ-TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
              + FGCG  Q  G      +  G+ GL   K ++ S L++ G + N    C    G G
Sbjct: 160 SPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGG 219

Query: 280 RISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYL 336
            + FG    P  G   TP  LR +   Y+    +V   G AV     +  FDSG+S+TY 
Sbjct: 220 FLFFGGDVVPSSGMSWTPI-LRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSYTYF 278

Query: 337 NDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV-VNLTMKGGGPFF 394
           N   Y  I +   N L     + ++ D   E C+        FE  V V    K     F
Sbjct: 279 NSQVYRAIEKLLKNDLKGNPLKLASDDKTLELCW---KGPKPFESVVDVRNFFKPLAMSF 335

Query: 395 VNDPIVIVSSEPKGLYL------YCLGVVKSD-----NVNIIGREYPIANNISLFHN 440
            N   V     P+   +       CLG++        NVNIIG +  + N I ++ N
Sbjct: 336 KNSKNVQFQIPPEAYLIISEFGNVCLGILDGSKEGMGNVNIIG-DISMLNKIVVYDN 391


>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Cucumis sativus]
          Length = 418

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 83/278 (29%), Positives = 120/278 (43%), Gaps = 28/278 (10%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
           GF + T + VGQP   + +  DTGSDL WL CD  C  C   L+          +Y P  
Sbjct: 55  GFYNVT-LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP-- 102

Query: 160 SSTSSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
             ++  VPC   LC      +  +C +    C Y+V Y +DG  S G LV DV  L  + 
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEY-ADGGSSLGVLVRDVFPL--NL 156

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
                +  R++ GCG  Q          +G+ GLG    S+ S L NQG++ N    CF 
Sbjct: 157 TNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE-FSAIFDSGTS 332
           S G G   FGD            + + +P  Y+    ++   G +        +FDSG+S
Sbjct: 217 SKGGGYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 276

Query: 333 FTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
           +TY N  AY  ++   N  LA +    +  D     C+
Sbjct: 277 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCW 314


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 148/377 (39%), Gaps = 59/377 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF---NIYSPNTSS 161
           ++    VG PA  F++  DTGSDL W+ C       G  +SS          ++ P  S 
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKC------RGRRASSPDASPLASPRVFRPANSK 163

Query: 162 TSSKVPCNSTLCELQ-----KQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLAT 212
           + + +PC+S  C+         C SAG+     C Y  RY  D + + G +  D   +A 
Sbjct: 164 SWAPIPCSSDTCKSYVPFSLANC-SAGTTPPAPCGYDYRY-KDKSSARGVVGTDAATIAL 221

Query: 213 DEKQS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
               S  K+    +  GC     G     +  +G+  LG    S  S  A +      FS
Sbjct: 222 SGSGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFS 277

Query: 271 MCF-----GSDGTGRISFGDKGSP-GQGETPFSL-RQTHPTYNITITQVSVGGNAVNFEF 323
            C        + T  ++FG  G+      TP  L  Q  P Y +T+  VSV G A+N   
Sbjct: 278 YCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPA 337

Query: 324 S---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSP 373
                     AI DSGTS T L  PAY  +    +  LA+  R T     PFEYCY  + 
Sbjct: 338 EVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTMD---PFEYCYNWTA 394

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIGR---- 427
            +     P + +   G           ++ + P    + C+G+ +     V++IG     
Sbjct: 395 TRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPG---VKCIGLQEGVWPGVSVIGNILQQ 451

Query: 428 ----EYPIANNISLFHN 440
               E+ +AN    F  
Sbjct: 452 EHLWEFDLANRWLRFQE 468


>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 410

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 110/273 (40%), Gaps = 30/273 (10%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           + +G P  +F   +DTGSD+ W+ CD  C  C          +     Y P  ++    V
Sbjct: 58  LQIGNPPKAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKLQYKPKGNT----V 104

Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           PC+  +C         QCP+    C Y+V Y   G+ S G LV D            ++ 
Sbjct: 105 PCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGS-SMGALVID--QFPFKLLNGSAMQ 161

Query: 222 SRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
            R++FGCG  Q+  S     A  G+ GLG  K  + + L + GL  N    C  S G G 
Sbjct: 162 PRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGY 221

Query: 281 ISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
           + FGD   P  G   TP      H  Y     ++   G     +    IFD+G+S+TY N
Sbjct: 222 LFFGDTLIPSLGVAWTPLLPPDNH--YTTGPAELLFNGKPTGLKGLKLIFDTGSSYTYFN 279

Query: 338 DPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
              Y  I     N L     + +  D     C+
Sbjct: 280 SKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICW 312


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 101/343 (29%), Positives = 152/343 (44%), Gaps = 40/343 (11%)

Query: 99  NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
            S+G  +Y T + +G PA S+ + +DTGS L WL   C  CV   +   G      +Y P
Sbjct: 127 TSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGP-----LYDP 179

Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGS---NCPYQVRYLSDGTMSTGFLVEDVLHLA 211
             SST + VPC+++ C ELQ     PSA S    C YQ  Y  D + S G+L  D +   
Sbjct: 180 RASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASY-GDSSFSVGYLSRDTVSFG 238

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           +             +GCG+   G F   A   GL GL  +K S+   LA    +  SFS 
Sbjct: 239 SGSYP------NFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287

Query: 272 CFGSDG-TGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFSA- 325
           C  +   TG +S G   S     TP +      + Y +T++ +SVGG+ +     E+S+ 
Sbjct: 288 CLPTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSL 347

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT  T L    YT +S+   + A    +++ +    + C+    +Q     P V
Sbjct: 348 PTIIDSGTVITRLPTAVYTALSKAVAA-AMVGVQSAPAFSILDTCFQGQASQ--LRVPAV 404

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +   GG    +    V++  +       CL    +D+  IIG
Sbjct: 405 AMAFAGGATLKLATQNVLIDVDDS---TTCLAFAPTDSTTIIG 444


>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 146

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/105 (47%), Positives = 61/105 (58%), Gaps = 10/105 (9%)

Query: 36  HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
           HR SD  +  +   V   P++GS  YY AL   D   + + R LA     K   TFS GN
Sbjct: 33  HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC 138
           D      LG+L+Y  V VG PA SF+VALDTGSDLFW+PCDC+ C
Sbjct: 91  D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQC 129


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 147/371 (39%), Gaps = 75/371 (20%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ VG P     + LDTGSDL W+ CD C  C     S          Y P  SST 
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSH---------YYPKDSSTY 221

Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT----- 212
             + C    C+L       + C +    CPY   Y +DG+ +TG    +   +       
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDY-ADGSNTTGDFASETFTVNLTWPNG 280

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
            EK  + VD  + FGCG    G F  GA+  GL GLG    S PS +  Q +  +SFS C
Sbjct: 281 KEKFKQVVD--VMFGCGHWNKG-FFYGAS--GLLGLGRGPISFPSQI--QSIYGHSFSYC 333

Query: 273 F-----GSDGTGRISFGDKGS-------------PGQGETPFSLRQTHPTYNITITQVSV 314
                  +  + ++ FG+                 G+ ETP         Y + I  + V
Sbjct: 334 LTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGE-ETP-----DETFYYLQIKSIMV 387

Query: 315 GGNAVN-----FEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
           GG  ++     + +S+           I DSG++ T+  D AY  I E F    K  ++ 
Sbjct: 388 GGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIK-LQQI 446

Query: 359 STSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK 418
           +  D     CY +S      E P   +    GG +           EP    + CL ++K
Sbjct: 447 AADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE--VICLAIMK 504

Query: 419 SDN---VNIIG 426
           + N   + IIG
Sbjct: 505 TPNHSHLTIIG 515


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 143/369 (38%), Gaps = 60/369 (16%)

Query: 77  LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCV 136
           L+    D  PL    G   Y +           S+G P        DTGSDL W  CD  
Sbjct: 81  LSNNDTDTVPLRMDGGGGAYDME---------FSIGTPPQKLTALADTGSDLIWTKCD-- 129

Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK-----QCPSAGSNCPYQVR 191
                           + Y PN SST +++PC+  LC   +     +C + G+ C Y+  
Sbjct: 130 ------AGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYA 183

Query: 192 YL--SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
           Y    D   + GFL  +   L  D          + FGC     G + +GA   GL GLG
Sbjct: 184 YGLGDDPDFTQGFLGSETFTLGGDAVPG------VGFGCTTALEGDYGEGA---GLVGLG 234

Query: 250 MDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDKGS---PGQGETPFSLRQTHPT 304
                 P  L +Q L   +F  C  +D +    + FG   +    G G     L  +   
Sbjct: 235 RG----PLSLVSQ-LDAGTFMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTF 289

Query: 305 YNITITQVSVGGNAV---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
           Y + +  +++G             +FDSGT+ TYL +PAYT+    F S     + TS +
Sbjct: 290 YAVNLRSITIGSATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLS-----QTTSLT 344

Query: 362 DLP----FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
            +     FE CY   P+      P + L   GG    +     +V  +     + C  V 
Sbjct: 345 PVEGRYGFEACYE-KPDSARL-IPAMVLHFDGGADMALPVANYVVEVDDG---VVCWVVQ 399

Query: 418 KSDNVNIIG 426
           +S +++IIG
Sbjct: 400 RSPSLSIIG 408


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 150/351 (42%), Gaps = 58/351 (16%)

Query: 105 HYTNVSVGQPA-LSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           ++ ++ +G P    FI+  DTGSDL W+ C+  C SC    N   G+V     +  N SS
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKP-NPHPGRV-----FRANDSS 172

Query: 162 TSSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
           +   +PC+S  C+++ Q       CP+  + C +  RYL+       F  E V     D 
Sbjct: 173 SFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDH 232

Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           K+ +  D  I  GC    T SF +    P+G+ GLG  K S+   LA   +  N FS C 
Sbjct: 233 KKIRLFDVLI--GC----TESFNETNGFPDGVMGLGYRKHSLALRLAE--IFGNKFSYCL 284

Query: 274 -----GSDGTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS- 324
                 S+    +SFGD      P    T   L   +  Y + ++ +SVGG+ ++     
Sbjct: 285 VDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDI 344

Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF--EYCYVLSPN 374
                    I DSGTS T L   AY ++ +    +  + ++    +LP    +C+     
Sbjct: 345 WNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCF----E 400

Query: 375 QTNFEYPVVN--LTMKGGGPFF---VNDPIVIVSSEPKGLYLYCLGVVKSD 420
              F+   V   L     G  F   V   I+ V+   K     CLG++K+D
Sbjct: 401 DKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIK-----CLGIIKAD 446


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 112/426 (26%), Positives = 172/426 (40%), Gaps = 61/426 (14%)

Query: 35  HHRYS-DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ---GNDKTPLTFS 90
           HH +S  P        D       A  S+L  R  ++RL     +A+      K  +  S
Sbjct: 74  HHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVS 133

Query: 91  AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
           +G    RL +L ++    +  G+      V +DT S+L W+ C  C SC    +   G +
Sbjct: 134 SGA---RLRTLNYVATVGLGGGEA----TVIVDTASELTWVQCAPCESC----HDQQGPL 182

Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAG------------SNCPYQVRYLSDG 196
            D     P++S + + VPC+S  C+ LQ+Q  +              + C Y + Y  DG
Sbjct: 183 FD-----PSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSY-RDG 236

Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           + S G L  D L LA      + +D  + FGCG    G    G   +GL GLG  + S+ 
Sbjct: 237 SYSRGVLAHDRLSLA-----GEVIDGFV-FGCGTSNQGPPFGGT--SGLMGLGRSQLSLV 288

Query: 257 SILANQ--GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ---------THPTY 305
           S   +Q  G+      +   SD +G +  GD  S  +  TP                P Y
Sbjct: 289 SQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFY 348

Query: 306 NITITQVSVGGNAVN---FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
            + +T ++VGG  V    F   AI DSGT  T L    Y  +   F S   E  +     
Sbjct: 349 LVNLTGITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFS 408

Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI--VSSEPKGLYLYCLGVVKSD 420
           +  + C+ ++      + P + L   GG    V+   V+  VSS+   + L    +   D
Sbjct: 409 I-LDTCFNMT-GLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSED 466

Query: 421 NVNIIG 426
             +IIG
Sbjct: 467 ETSIIG 472


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 149/338 (44%), Gaps = 42/338 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T V VG PA S+ + LDTGSD+ W+ C  C  C    +          I++P  SS+ 
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDP---------IFTPAASSSY 209

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S + C+S  C   +        C YQV Y  DG+ + G  V + +        S +V+S 
Sbjct: 210 SPLTCDSQQCNSLQMSSCRNGQCRYQVNY-GDGSFTFGDFVTETMSFGG----SGTVNS- 263

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           I+ GCG    G F+  A         +     P  L +Q L   SFS C  +  +   S 
Sbjct: 264 IALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTSQ-LKATSFSYCLVNRDSAASST 315

Query: 284 GDKGSPGQGETPFS--LRQTHPT--YNITITQVSVGGNAVNF-----------EFSAIFD 328
            D  S   G++  +  L+ +     Y + ++ +SVGG  +             +   I D
Sbjct: 316 LDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVD 375

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
            GT+ T L   AY  + ++F S+++  R TS   L F+ CY LS  Q++ + P V+    
Sbjct: 376 CGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVAL-FDTCYDLS-GQSSVKVPTVSFHFD 433

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           GG  + +     ++  +  G Y +      S +++IIG
Sbjct: 434 GGKSWDLPAANYLIPVDSAGTYCFAFAPTTS-SLSIIG 470


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 135/349 (38%), Gaps = 52/349 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  V VG PA  F +  DTGS+L W+ C   +   GL           ++ P  S + +
Sbjct: 91  YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-----------VFRPEASKSWA 139

Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            VPC+S  C+L        C S+ S C Y  RY      + G +  D   +A    +   
Sbjct: 140 PVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQ 199

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----G 274
           +   +  GC     G        +G+  LG  K S  S  A +     SFS C       
Sbjct: 200 LQD-VVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASRAAAR--FGGSFSYCLVDHLAP 254

Query: 275 SDGTGRISFGDKGSPGQ------GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
            + TG ++FG    PGQ       +T   L    P Y + +  V V G A++        
Sbjct: 255 RNATGYLAFG----PGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP 310

Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY-VLSPNQTNFE 379
                I DSGT+ T L  PAY  +      L     +      PFE+CY   +P     E
Sbjct: 311 KSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCYNWTAPRPGAPE 368

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
            P + +   G           ++  +P    + C+G+ + +   V++IG
Sbjct: 369 IPKLAVQFTGCARLEPPAKSYVIDVKPG---VKCIGLQEGEWPGVSVIG 414


>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
 gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
          Length = 297

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 77/251 (30%), Positives = 112/251 (44%), Gaps = 25/251 (9%)

Query: 55  GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
           G   + SAL   D   R  GR LAA      PL  S       L +   L++T + +G P
Sbjct: 51  GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99

Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
           A  + V +DTGSD+ W+  +CVSC  G    S   I+  +Y P  S +   V C+   C 
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156

Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
                +   C S  S C Y + Y  DG+ + GF V D L     + + Q+   ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214

Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
           CG    G       A +G+ G G   +S+ S LA  G +   F+ C  + +G G  + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274

Query: 286 KGSPGQGETPF 296
              P    TP 
Sbjct: 275 VVQPKVKTTPL 285


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 124/297 (41%), Gaps = 40/297 (13%)

Query: 71  RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
           R+ GRG     + K        N  Y + +  ++     S+G P ++  + +DTGSDL W
Sbjct: 105 RVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYV--VTASLGTPGMAQTLEVDTGSDLSW 162

Query: 131 L---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----LQKQCPSAG 183
           +   PC   SC    +          ++ P  SS+ + VPC  + C         C +A 
Sbjct: 163 VQCKPCAAPSCYRQKDP---------LFDPAQSSSYAAVPCGRSACAGLGIYASACSAA- 212

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
             C Y V Y  DG+ +TG    D L LA +      +     FGCG  Q+G    G   +
Sbjct: 213 -QCGYVVSY-GDGSNTTGVYSSDTLTLAANATVQGFL-----FGCGHAQSGGLFTGI--D 263

Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKG--SPGQGETPFSLR 299
           GL G G ++ S+  +    G     FS C    S  TG ++ G     +PG   T     
Sbjct: 264 GLLGFGREQPSL--VQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPS 321

Query: 300 QTHPTYNIT-ITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNS 350
              PTY +  +T +SVGG  ++   SA     + D+GT  T L   AY  +   F S
Sbjct: 322 PNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPAAYAALRSAFRS 378


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 125/306 (40%), Gaps = 48/306 (15%)

Query: 87  LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHG 141
           + F  G D +         Y  +++G+PA  + + +DTGS+L W+ C      C +C   
Sbjct: 26  MVFKLGGDVHPTGHF----YVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTC--- 78

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLS 194
                   +   +Y P        VPC   LC+         K C      C YQ+ Y +
Sbjct: 79  ------NKVPHPLYRPK-----KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINY-A 126

Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGM 250
           DGT S G L+ D   L T   ++      I+FGCG  Q       A      +G+ GLG 
Sbjct: 127 DGTTSLGVLLLDKFSLPTGSARN------IAFGCGYDQMQGPKKKAPEKVPVDGILGLGR 180

Query: 251 DKTSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPGQGET---PFSLRQTHPTYN 306
               + S L + G +  N    C  S G G +  G++  P         + + +    Y+
Sbjct: 181 GSVDLVSQLKHSGAVSKNVIGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYS 240

Query: 307 ITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEK-RETSTSDL 363
                + +G N +  + F AIFDSG+++TYL +  + Q+      SL K   +  S +D 
Sbjct: 241 PGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDT 300

Query: 364 PFEYCY 369
               C+
Sbjct: 301 RLHLCW 306


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 130/305 (42%), Gaps = 47/305 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           + T +S+G PA  F V  DTGSDL W+ C  C +C +  +          I+ P  SS+ 
Sbjct: 40  YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDP---------IFDPEGSSSY 90

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + + C  TLC+   +K C     NC Y   Y  DG+ + G L  + + L + + + K   
Sbjct: 91  TTMSCGDTLCDSLPRKSC---SPNCDYSYGY-GDGSGTRGTLSSETVTLTSTQGE-KLAA 145

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
             I+FGCG +  GSF D +   GL GLG    S  S L +  L  + FS C         
Sbjct: 146 KNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPS 200

Query: 277 GTGRISFGDKGSPGQG----ETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
            T  + FGD+ S           F+    +P     Y + +  +S+ G A+     +   
Sbjct: 201 KTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDI 260

Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                   IFDSGT+ T L D  Y  +     S      E   S    + CY +S ++ +
Sbjct: 261 KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFP-EIDGSSAGLDLCYDVSGSKAS 319

Query: 378 FEYPV 382
           ++  +
Sbjct: 320 YKKKI 324


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 171/403 (42%), Gaps = 58/403 (14%)

Query: 59  YYSALAHRDRYFRLRG--RGLAAQGNDKTPLTFSAGNDTYRLNSLGF--LHYT-NVSVGQ 113
           +Y+ +  RDR+ R+R   R L A     T  T  A     RL  L F  L Y   + +G 
Sbjct: 78  HYTGILRRDRH-RVRSIYRRLTAAETTTTTTTIPA-----RLG-LAFQSLEYVVTIGIGT 130

Query: 114 PALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           P  +F V  DTGSDL W   LPC   SC               ++ P+ SST   VPC++
Sbjct: 131 PPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEP---------LFDPSKSSTYVDVPCSA 181

Query: 171 TLCELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
             C +   +Q     ++C Y V+Y  D + + G L E+   L+     + +  + + FGC
Sbjct: 182 PECHIGGVQQTRCGATSCEYSVKY-GDESETHGSLAEETFTLSPPSPLAPAA-TGVVFGC 239

Query: 229 GRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNS----FSMCFGSDG--TGRI 281
                  F D G    GL GLG   +   SIL+      NS    FS C    G  TG +
Sbjct: 240 SHEYISVFNDTGMGVAGLLGLGRGDS---SILSQTRRSINSGGGVFSYCLPPRGSSTGYL 296

Query: 282 SFGDKGSPGQGE------TPF--SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFD 328
           + G   +  Q +      TP   ++ Q    Y + +  VSV G AV+     F   A+ D
Sbjct: 297 TIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAVID 356

Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SGT  T++   AY  + + F   +   K     S    + CY ++  Q     P V L  
Sbjct: 357 SGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVT-GQDVVTAPRVALEF 415

Query: 388 KGGGPFFVNDP--IVIVSSEP---KGLYLYCLGVVKSDNVNII 425
            GG    V+    ++++ +E    + L L CL  + +++  ++
Sbjct: 416 GGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLV 458


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 129/305 (42%), Gaps = 42/305 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++  V +G P     +  DTGSDL W  C+    SC    ++         I+ P+ S++
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDA---------IFDPSKSTS 195

Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDE 214
            S + C STLC         +  C ++   C Y ++Y  D + S G+   + L + ATD 
Sbjct: 196 YSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQY-GDSSFSVGYFSRERLSVTATD- 253

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
                +     FGCG+   G F   A   GL GLG    S   +     +    FS C  
Sbjct: 254 -----IVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAVYRKIFSYCLP 303

Query: 274 -GSDGTGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------A 325
             S  TGR+SFG   +     TPFS + +    Y + IT +SVGG  +    S      A
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT  T L   AYT +   F      K  ++      + CY LS  +  F  P ++ 
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFRQ-GMSKYPSAGELSILDTCYDLSGYEV-FSIPKIDF 421

Query: 386 TMKGG 390
           +  GG
Sbjct: 422 SFAGG 426


>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
          Length = 320

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 85/298 (28%), Positives = 128/298 (42%), Gaps = 45/298 (15%)

Query: 12  VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
           +L++L +   GC    G F      R   P  G         +G   + +AL   D  R+
Sbjct: 14  LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
            RL G    A G    P       DT        L+YT + +G P   + V +DTGSD+ 
Sbjct: 62  GRLLGAVDLALGGVGLP------TDTG-------LYYTRIEIGSPPKGYYVQVDTGSDIL 108

Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG 183
           W+  +C+ C  G  + SG  I+   Y P  S T+  V C    C       +   CPS  
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
           S C +++ Y  DG+ +TGF V D +     +   Q+ + ++ I+FGCG  Q G  L  + 
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221

Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPF 296
            A +G+ G G   +S+ S LA    +   F+ C  +  G G  + G+   P    TP 
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPL 279


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 140/357 (39%), Gaps = 59/357 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG P    +V +DTGSDL WL C  C  C   +           +Y P  S T 
Sbjct: 92  YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTP---------LYDPRNSKTH 142

Query: 164 SKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
            ++PC S  C        C +    C Y V Y  DG+ S+G L  D L L  D +     
Sbjct: 143 RRIPCASPQCRGVLRYPGCDARTGGCVYMVVY-GDGSASSGDLATDTLVLPDDTRVHN-- 199

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG------ 274
              ++ GCG    G     A   GL G G  + S P+ LA      + FS C G      
Sbjct: 200 ---VTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVFSYCLGDRMSRA 251

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG-------------N 317
            + +  + FG   +P    T F+  +T+P     Y + +   SVGG             N
Sbjct: 252 RNSSSYLVFGR--TPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALN 309

Query: 318 AVNFEFSAIFDSGTSFTYLNDPAYTQISETF--NSLAKEKRETSTSDLPFEYCYVLSPN- 374
                   + DSGT+ +     AY  + + F  ++ A   R        F+ CY +  N 
Sbjct: 310 PATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNG 369

Query: 375 -QTNFEYPVVNLTMKGGGPFFV---NDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
             T    P + L         +   N  I +V  + +    +CLG+  +D+ +N++G
Sbjct: 370 PGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRR--TYFCLGLQAADDGLNVLG 424


>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
          Length = 410

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 74/278 (26%), Positives = 120/278 (43%), Gaps = 28/278 (10%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
           +  +++  PA  + + +DTGS L WL CD  C++C           +   +Y P    + 
Sbjct: 39  FVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89

Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             ++  C     +L+K       N C Y ++Y+  G  S G L+ D   L      +   
Sbjct: 90  KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144

Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
            + I+FGCG  Q  +  +   P NG+ GLG  K ++ S L +QG+I  +    C  S G 
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTY 335
           G + FGD   P  G T   + + H  Y+     +    N+          IFDSG ++TY
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTY 264

Query: 336 LN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
               P +  +S   ++L+KE +   E    D     C+
Sbjct: 265 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 302


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/366 (27%), Positives = 153/366 (41%), Gaps = 48/366 (13%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 225 PARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333

Query: 275 SDGTGRISFGD---KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
           S GTG + FG      +  +  TP  L +  PT Y + +T + VGG  ++   S      
Sbjct: 334 STGTGYLDFGAGSLAAARARLTTPM-LTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYP 381
            I DSGT  T L   AY+ +   F +       K+  + S L  + CY  +   +    P
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLL--DTCYDFT-GMSQVAIP 449

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIAN 433
            V+L  +GG    V+   ++ ++    + L          +G+V +  +   G  Y I  
Sbjct: 450 TVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGK 509

Query: 434 NISLFH 439
            +  F+
Sbjct: 510 KVVGFY 515


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 87/346 (25%), Positives = 147/346 (42%), Gaps = 47/346 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G  + SV   L       + FS C       
Sbjct: 109 ----FTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
             F S  TG  S G K +  + +  +    + R+    + + +T +SV G  +    S  
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220

Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + 
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DM 277

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 PAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 154/397 (38%), Gaps = 75/397 (18%)

Query: 63  LAHRDR----YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           LA  DR    +   RGR  AA+      +  S+G  T         ++    VG PA  F
Sbjct: 46  LARMDRERMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQ-----YFVRFRVGTPAQPF 100

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF-------NIYSPNTSSTSSKVPCNST 171
           ++  DTGSDL W+ C   +     +  +   +           + P+ S T + +PC+S 
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIPCSSA 160

Query: 172 LCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR-IS 225
            C          C +  + C Y  RY  DG+ + G +  D   +A   + ++    R + 
Sbjct: 161 TCRESLPFSLAACATPANPCAYDYRY-KDGSAARGTVGVDSATIALSGRAARKAKLRGVV 219

Query: 226 FGCGRVQTG-SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
            GC     G SFL   A +G+  LG    S  S  A++      FS C        + T 
Sbjct: 220 LGCTTSYNGQSFL---ASDGVLSLGYSNISFASRAASR--FGGRFSYCLVDHLAPRNATS 274

Query: 280 RISFGDKGS-----PGQG---------------------ETPFSL-RQTHPTYNITITQV 312
            ++FG   +     P +G                     +TP  L  +T P Y +T+  V
Sbjct: 275 YLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGV 334

Query: 313 SVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSD 362
           SV G  +    +         AI DSGTS T L  PAY  +    +  LA   R T    
Sbjct: 335 SVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMD-- 392

Query: 363 LPFEYCY-VLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
            PF+YCY   SP+ ++   P+  L +   G   +  P
Sbjct: 393 -PFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPP 428


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/351 (28%), Positives = 152/351 (43%), Gaps = 59/351 (16%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           LN+L ++    VS+G PA++  + +DTGSD+ WL C                    +Y P
Sbjct: 126 LNTLEYV--ITVSIGSPAVAXTMFIDTGSDVSWLRCKS-----------------RLYDP 166

Query: 158 NTSSTSSKVPCNSTLC-ELQKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
            TSST +   C++  C +L ++    S+GS C Y V+Y  DG+ +TG    D L LA   
Sbjct: 167 GTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKY-GDGSNTTGTYGSDTLTLA--- 222

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSMCF 273
             S+ + S   FGC  V+ G   D    +GL GLG D  S V    A  G   ++FS C 
Sbjct: 223 GTSEPLISGFQFGCSAVEHGFEEDNT--DGLMGLGGDAQSFVSQTAATYG---SAFSYCL 277

Query: 274 GS--DGTGRISFGDKGSPGQGETP----FSLRQTHPTYNITITQVSVGGNAVN-----FE 322
               + +G ++ G   S              +Q    Y + +  +SVGG  +      F 
Sbjct: 278 PPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS 337

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPN--QTNFE 379
             +I DSGT  T L   AY  +S  F + +A+ + + +      + C+  + +    NF 
Sbjct: 338 AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFT 397

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL-YCLGVVKSDN---VNIIG 426
            P V L + GG          +V   P G+    CL    +D+     IIG
Sbjct: 398 VPSVALVLDGG---------AVVDLHPNGIVQDGCLAFAATDDDGRTGIIG 439


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 120/470 (25%), Positives = 184/470 (39%), Gaps = 79/470 (16%)

Query: 8   SPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYS-ALAHR 66
           SP+ +L++L S C     G    GF    R S   + + +   +  + S A  S +L HR
Sbjct: 3   SPLLLLVVLCSYCCYIALGGNEHGFAVVQRRSYDSETVCSASKVNLEPSSATVSMSLVHR 62

Query: 67  D--------------------RYFRLRGRGLAAQGNDKTPLTFSAGND------TYRLNS 100
                                R  R R   + +Q +    +  ++  D      T     
Sbjct: 63  YGPCAPSQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDDAAVTIPTRL 122

Query: 101 LGFL----HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
            GF+    +   +  G P++  ++ +DTGSD+ W+   C  C    NS+        ++ 
Sbjct: 123 GGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWV--QCTPC----NSTKCYPQKDPLFD 176

Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
           P+ SST + + CN+  C          C S G+ C Y V Y +DG+ S G    + L LA
Sbjct: 177 PSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEY-ADGSHSRGVYSNETLTLA 235

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                         FGCGR Q G        +GL GLG    S+  ++    +   +FS 
Sbjct: 236 PGITVED-----FHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSY 285

Query: 272 CFGSDGTGRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
           C  +  +    F   GSP  G       TP      + T Y +T+T +SVGG  ++   S
Sbjct: 286 CLPALNS-EAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQS 344

Query: 325 A-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
           A     I DSGT  T L + AY  +        K      + D  F+ CY  +   +N  
Sbjct: 345 AFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD--FDTCYNFT-GYSNIT 401

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
            P V  T  GG    ++ P  I+ ++       CL   +S   D + IIG
Sbjct: 402 VPRVAFTFSGGATIDLDVPNGILVND-------CLAFQESGPDDGLGIIG 444


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 90/302 (29%), Positives = 135/302 (44%), Gaps = 42/302 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V +G+P+    + LDTGSD+ W+ C  C  C H  +          I+ P +S++ 
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADP---------IFEPASSTSY 194

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S + C++  C+         + C Y+V Y  DG+ + G  V + + L      S SVD+ 
Sbjct: 195 SPLSCDTKQCQSLDVSECRNNTCLYEVSY-GDGSYTVGDFVTETITLG-----SASVDN- 247

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A    L GLG  K S PS +       +SFS C     SD    
Sbjct: 248 VAIGCGHNNEGLFIGAAG---LLGLGGGKLSFPSQIN-----ASSFSYCLVDRDSDSAST 299

Query: 281 ISFGDKGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVN-----FEFSA------IFD 328
           + F     P     P    R+    Y + +T +SVGG  ++     FE         I D
Sbjct: 300 LEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIID 359

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGT+ T L   AY  + + F    K+   TS   L F+ CY LS  +T+ E P V   + 
Sbjct: 360 SGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVAL-FDTCYDLS-RKTSVEVPTVTFHLA 417

Query: 389 GG 390
           GG
Sbjct: 418 GG 419


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 75/271 (27%), Positives = 115/271 (42%), Gaps = 43/271 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ +GQP  S ++  DTGSDL W+ C  C +C H   ++        ++ P  SST 
Sbjct: 83  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPAT--------VFFPRHSSTF 134

Query: 164 SKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           S   C   +C L  +   A         S CPY+  Y +DG++++G    +   L T   
Sbjct: 135 SPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGY-ADGSLTSGLFARETTSLKTSSG 193

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
           +   + S ++FGCG   +G  + G +    NG+ GLG    S  S L  +    N FS C
Sbjct: 194 KEAKLKS-VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYC 250

Query: 273 -----FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
                     T  +  GD G        TP       PT Y + +  V V G  +  + S
Sbjct: 251 LMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPS 310

Query: 325 -----------AIFDSGTSFTYLNDPAYTQI 344
                       + DSGT+  +L DPAY  +
Sbjct: 311 IWEIDDSGNGGTVMDSGTTLAFLADPAYRLV 341


>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
          Length = 393

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/280 (31%), Positives = 128/280 (45%), Gaps = 39/280 (13%)

Query: 106 YTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           Y NV+  +GQP+  + + +DTGSDL WL CD  CV C    +             P    
Sbjct: 33  YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH-------------PYYRP 79

Query: 162 TSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEK 215
            ++ VPC   +C+        +C + G  C Y+V Y +DG  S G LV D  +L  T EK
Sbjct: 80  RNNLVPCMDPICQSLHSNGDHRCENPG-QCDYEVEY-ADGGSSFGVLVTDTFNLNFTSEK 137

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP--NGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +   +   ++ GCG  Q   F  G+    +G+ GLG  K+S+ S L++ GL+ N    C 
Sbjct: 138 RHSPL---LALGCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCL 191

Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
              G G + FGD    S     TP S    H  Y+  + +++  G    F+     FDSG
Sbjct: 192 SGHGGGFLFFGDDLYDSSRVAWTPMSPDAKH--YSPGLAELTFDGKTTGFKNLLTTFDSG 249

Query: 331 TSFTYLNDPAYT-QISETFNSLAKEKRETSTSDLPFEYCY 369
            S+TYLN  AY   IS     L+ +    +  D     C+
Sbjct: 250 ASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCW 289


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 121/313 (38%), Gaps = 42/313 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLN----SSSGQVIDFNIYSPNT 159
           +Y  + VG P       +DTGSD+ W  C  C  C    N    SS        +Y P  
Sbjct: 88  YYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPEL 147

Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S T+S   C+  LC     C    ++C Y + Y  D + STG    DV+HL        S
Sbjct: 148 SITASPATCSDPLCSEGGSCRGNNNSCAYDISY-EDTSSSTGIYFRDVVHLG----HKAS 202

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDG 277
           +++ +  GC    +G +      +G+ G G  K SVP+ LA Q    N F  C     +G
Sbjct: 203 LNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGEKEG 258

Query: 278 TGRISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSA------ 325
            G +  G     P    TP  +      YN+ +  +SV   A+      FE++A      
Sbjct: 259 GGILVLGKNDEFPEMVYTP--MLANDIVYNVKLVSLSVNSKALPIEASEFEYNATVGNGG 316

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE------YCYVLSPNQTNF 378
            I DSGTS       A     +     A  K  T+    P E      +  +   N    
Sbjct: 317 TIIDSGTSSATFPSKALALFVK-----AVSKFTTAIPTAPLESSGSPCFISISDRNSVEV 371

Query: 379 EYPVVNLTMKGGG 391
           ++P V L   GG 
Sbjct: 372 DFPNVTLKFDGGA 384


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 149/392 (38%), Gaps = 82/392 (20%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-------CVSCVHGLNSSSGQ--------- 148
           ++    VG PA  F++  DTGSDL W+ C          +   G N   G          
Sbjct: 55  YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114

Query: 149 ----VIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMS 199
                    ++ P+ S T + +PC+S  C          CP+ GS C Y+ RY  DG+ +
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRY-KDGSAA 173

Query: 200 TGFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTG-SFLDGAAPNGLFGLGMDKT 253
            G +  D   +A       +KQ ++    +  GC    TG SFL   A +G+  LG    
Sbjct: 174 RGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSNV 230

Query: 254 SVPSILANQGLIPNSFSMCF-----GSDGTGRISF-----------------GDKGSPGQ 291
           S  S  A +      FS C        + T  ++F                 G   +PG 
Sbjct: 231 SFASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGA 288

Query: 292 GETPFSL-RQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAY 341
            +TP  L  +  P Y + +  VSV G  +              AI DSGTS T L  PAY
Sbjct: 289 RQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSPAY 348

Query: 342 TQISETFNSLAKEKRETSTSDL-PFEYCY----VLSPNQTNFEYPVVNLTMKGGGPFFVN 396
             +     +L K+        + PF+YCY     L+        P + +   G       
Sbjct: 349 RAV---VAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPP 405

Query: 397 DPIVIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
               ++ + P    + C+G+ + D   V++IG
Sbjct: 406 PKSYVIDAAPG---VKCIGLQEGDWPGVSVIG 434


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/352 (23%), Positives = 140/352 (39%), Gaps = 46/352 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++T V VG PA  F V +DTGS+L W+ C             G+V +  ++    S +  
Sbjct: 88  YFTEVRVGTPAKKFRVVVDTGSELTWVNC------RYRGRGKGKVKNRRVFRAEESKSFK 141

Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            V C +  C++          CP+  + C Y  RY +DG+ + G   ++ + +     + 
Sbjct: 142 TVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRK 200

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
             +   +  GC    +G    GA  +G+ GL     S  S   +  L     S C     
Sbjct: 201 ARLRGLL-VGCSSSFSGQSFQGA--DGVLGLAFSDFSFTSTATS--LFGAKLSYCLVDHL 255

Query: 278 TGR-----ISFG-------DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
           + +     + FG        K +PG+  TP  L    P Y I I  +S+G + ++     
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGR-TTPLDLTLIPPFYAINIIGISIGDDMLDIPTQV 314

Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
                    I DSGTS T L + AY  +         E +      +P EYC+  +    
Sbjct: 315 WDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFN 374

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
             + P +   +KGG  F  +    +V + P    + CLG + +     N++G
Sbjct: 375 ESKLPQLTFHLKGGARFEPHRKSYLVDAAPG---VKCLGFMSAGTPATNVVG 423


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 129/309 (41%), Gaps = 34/309 (11%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IY 155
            SLG L +   V  G PA ++ +  DTGSD+ W+   C+ C       SG     +  I+
Sbjct: 113 TSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI--QCLPC-------SGHCYKQHDPIF 163

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
            P  S+T S VPC    C       S+   C Y+V+Y  DG+ + G L  + L L     
Sbjct: 164 DPTKSATYSAVPCGHPQCAAAGGKCSSNGTCLYKVQY-GDGSSTAGVLSHETLSLT---- 218

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
            S       +FGCG    G F D    +GL GLG  + S+ S  A       S+ +   +
Sbjct: 219 -SARALPGFAFGCGETNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN 274

Query: 276 DGTGRISFGD----KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFS 324
              G ++ G      GS G   T    +Q +P+ Y + +  + VGG  +           
Sbjct: 275 TSHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG 334

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            + DSGT  TYL   AYT + + F     + +     D PF+ CY  +  Q     P+V+
Sbjct: 335 TLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYD-PFDTCYDFA-GQNAIFMPLVS 392

Query: 385 LTMKGGGPF 393
                G  F
Sbjct: 393 FKFSDGSSF 401


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/305 (27%), Positives = 121/305 (39%), Gaps = 47/305 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           ++    VG PA  F++  DTGSDL W+ C      +     +SS+        + P  S 
Sbjct: 95  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154

Query: 162 TSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA----- 211
           T + +PC S  C          CP+ GS C Y  RY  DG+ + G +  +   +A     
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSSSS 213

Query: 212 --TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
             +  K  K+    +  GC    TG   +  A +G+  LG    S  S  A++      F
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFASHAASR--FGGRF 269

Query: 270 SMCF-----GSDGTGRISFGDKGS----------PGQGETPFSL-RQTHPTYNITITQVS 313
           S C        + T  ++FG   +          PG  +TP  L  +  P Y+++I  +S
Sbjct: 270 SYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAIS 329

Query: 314 VGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
           V G  +               I DSGTS T L  PAY  +        K  R    +  P
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGK--KLARFPRVAMDP 387

Query: 365 FEYCY 369
           FEYCY
Sbjct: 388 FEYCY 392


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/306 (27%), Positives = 136/306 (44%), Gaps = 49/306 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           + T +S+G PA  F V  DTGSDL W+ C  C +C +  +          I+ P  SS+ 
Sbjct: 40  YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDP---------IFDPEGSSSY 90

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + + C  TLC+   +K C     +C Y   Y  DG+ + G L  + + L + + + K   
Sbjct: 91  TTMSCGDTLCDSLPRKSC---SPDCDYSYGY-GDGSGTRGTLSSETVTLTSTQGE-KLAA 145

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
             I+FGCG +  GSF D +   GL GLG    S  S L +  L  + FS C         
Sbjct: 146 KNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPS 200

Query: 277 GTGRISFGDKGSPGQG----ETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
            T  + FGD+ S           F+    +P     Y + +  +S+ G A+     +   
Sbjct: 201 KTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDI 260

Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQT 376
                   IFDSGT+ T L D  Y  +     S ++  K + S++ L  + CY +S ++ 
Sbjct: 261 KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGL--DLCYDVSGSKA 318

Query: 377 NFEYPV 382
           +++  +
Sbjct: 319 SYKMKI 324


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 152/357 (42%), Gaps = 56/357 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   ++VG PA+  ++ALDT SDL WL C  C  C       SG V D     P  S++ 
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 184

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDG----TMSTGFLVEDVLHLATDEKQ 216
            ++  ++  C+   +     +    C Y V+Y  DG    + S G LVE+ L  A   +Q
Sbjct: 185 GEMNYDAPDCQALGRSGGGDAKRGTCIYTVQY-GDGHGSTSTSVGDLVEETLTFAGGVRQ 243

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
           +      +S GCG    G F  GA   G+ GLG  + S+P  +A  G    SFS C    
Sbjct: 244 AY-----LSIGCGHDNKGLF--GAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDF 295

Query: 274 ----GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV------ 319
               GS  +  ++FG      SP    TP  L Q  PT Y + +  VSVGG  V      
Sbjct: 296 ISGPGSP-SSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTER 354

Query: 320 -------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCY 369
                        I DSGT+ T L  PAY    + F + A    + ST   S L F+ CY
Sbjct: 355 DLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGL-FDTCY 413

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +   +   + P V++   GG    +     ++  + +G   +        +V++IG
Sbjct: 414 TVG-GRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIG 469


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 84/296 (28%), Positives = 121/296 (40%), Gaps = 33/296 (11%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G P +  +   DTGSDL W+ C  C SC               ++ P  SST     C 
Sbjct: 96  IGTPPVERLATADTGSDLIWVQCSPCASCFPQSTP---------LFQPLKSSTFMPTTCR 146

Query: 170 STLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           S  C L    QK C  +G  C Y  +Y    + S G L  + L   +             
Sbjct: 147 SQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSF 205

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRIS 282
           FGCG     +        G+ GLG    S+ S + +Q  I + FS C    GS  T ++ 
Sbjct: 206 FGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLK 263

Query: 283 FGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV---NFEFSAIFDSGTSFTY 335
           FG++      G   TP  ++   PTY  + +  V+V    V   + + + I DSGT  TY
Sbjct: 264 FGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTY 323

Query: 336 LNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           L +  Y   + +   SLA E  +   S LPF  C+   P + NF +P +     G 
Sbjct: 324 LGESFYYNFAASLQESLAVELVQDVLSPLPF--CF---PYRDNFVFPEIAFQFTGA 374


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 131/303 (43%), Gaps = 39/303 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     +  DTGSDL W  C  CV   +             I++P+ S++ 
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 184

Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
             V C+S  C  L     +AG    SNC Y ++Y  D + S GFL +D   L + +    
Sbjct: 185 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKDKFTLTSSD---- 239

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
            V   + FGCG    G F   A   GL GLG DK S PS  A        FS C  S   
Sbjct: 240 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 293

Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
            TG ++FG  G S     TP S +      Y + I  ++VGG  +      FS   A+ D
Sbjct: 294 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 353

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTM 387
           SGT  T L   AY  +  +F   AK  +  +TS +   + C+ LS  +T    P V  + 
Sbjct: 354 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGFKT-VTIPKVAFSF 410

Query: 388 KGG 390
            GG
Sbjct: 411 SGG 413


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/324 (29%), Positives = 137/324 (42%), Gaps = 67/324 (20%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +  +G P   F + +D+GSDL W+ C  C+ C            D  +Y+P+ SST 
Sbjct: 65  YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCY---------AQDTPLYAPSNSSTF 115

Query: 164 SKVPCNSTLCELQKQC---------PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
           + VPC S  C L             P A   C Y+ RY +D ++S G             
Sbjct: 116 NPVPCLSPECLLIPATEGFPCDFHYPGA---CAYEYRY-ADTSLSKGVFA---------- 161

Query: 215 KQSKSVDS----RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            +S +VD     +++FGCGR   GSF   AA  G+ GLG    S  S +       N F+
Sbjct: 162 YESATVDDVRIDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFA 216

Query: 271 MCF-----GSDGTGRISFGDKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGNAVNF 321
            C       +  +  + FGD+      +   TP      +PT Y + I +V VGG ++  
Sbjct: 217 YCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPI 276

Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY- 369
             SA           IFDSGT+ TY   PAY  I   F+   +  R  S   L  + C  
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQGL--DLCVD 334

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPF 393
           V   +Q +F  P   + + GG  F
Sbjct: 335 VTGVDQPSF--PSFTIVLGGGAVF 356


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/346 (27%), Positives = 143/346 (41%), Gaps = 48/346 (13%)

Query: 60  YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
           +S+L+H DR      R L+            +     R  + G +   +  +G P + ++
Sbjct: 46  FSSLSHYDRLANAFRRSLS-----------RSAALLNRAATSGAVGLQSSIIGTPPVDYL 94

Query: 120 VALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
              DTGSDL W  C  C+ C   L           I++P  S++ S VPCN+  C     
Sbjct: 95  GIADTGSDLTWAQCLPCLKCYQQLRP---------IFNPLKSTSFSHVPCNTQTCHAVDD 145

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
             C   G  C Y   Y  D T S G L  + + +      S SV S I  GCG   +G F
Sbjct: 146 GHCGVQGV-CDYSYTY-GDRTYSKGDLGFEKITIG-----SSSVKSVI--GCGHASSGGF 196

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGDKG---SPG 290
                 +G+ GLG  + S+ S ++    I   FS C     S   G+I+FG       PG
Sbjct: 197 ---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPG 253

Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----AIFDSGTSFTYLNDPAYTQISE 346
              TP   + T   Y IT+  +S+ GN  +  F+     I DSGT+ ++L    Y  +  
Sbjct: 254 VVSTPLISKNTVTYYYITLEAISI-GNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVS 312

Query: 347 TFNSLAKEKRETSTSDLPFEYCYVLSPN-QTNFEYPVVNLTMKGGG 391
           +   + K KR     +  ++ C+    N  T+   P++     GG 
Sbjct: 313 SLLKVVKAKRVKDPGNF-WDLCFDDGINVATSSGIPIITAQFSGGA 357


>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
 gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
 gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
          Length = 410

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 118/282 (41%), Gaps = 36/282 (12%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +++G PA S+ + +DTGS L WL CD  C +C          ++   +Y P      
Sbjct: 39  FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC---------NIVPHVLYKPTPKKL- 88

Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             V C  +LC          K+C S    C Y ++Y+   +M  G LV D   L+     
Sbjct: 89  --VTCADSLCTDLYTDLGKPKRCGSQ-KQCDYVIQYVDSSSM--GVLVIDRFSLSASNGT 143

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
           + +    I+FGCG  Q     +   P + + GL   K ++ S L +QG+I  +    C  
Sbjct: 144 NPTT---IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS 200

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
           S G G + FGD   P  G T   + + H  Y+     +    N+        + IFDSG 
Sbjct: 201 SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGA 260

Query: 332 SFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
           ++TY     Y    + +  T NS  K   E +  D     C+
Sbjct: 261 TYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCW 302


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 144/351 (41%), Gaps = 52/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA   ++ LDTGSD+ WL C  C  C       SGQV D     P  S + 
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYE----QSGQVFD-----PRRSRSY 190

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + V C + LC       C    S C YQV Y  DG+++ G    + L  A   +      
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLRRSACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 244

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
           +R++ GCG    G F+  A   GL        S P+ ++ +     SFS C         
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRR--YGRSFSYCLVDRTSSAN 299

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV---------- 319
            +  +  ++FG         + F+    +P     Y + +  +SVGG  V          
Sbjct: 300 TASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRL 359

Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
              +     I DSGTS T L  PAY+ + + F   A   R +      F+ CY LS  + 
Sbjct: 360 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKV 419

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
             + P V++   GG    +     ++  + KG   +C     +D  V+IIG
Sbjct: 420 -VKVPTVSMHFAGGAEAALPPENYLIPVDSKG--TFCFAFAGTDGGVSIIG 467


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/284 (28%), Positives = 123/284 (43%), Gaps = 39/284 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           +   + +G P++  +   DTGSDL W+   PCD   C            +  +Y P  SS
Sbjct: 96  YLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCF---------AQNTPLYDPLNSS 146

Query: 162 TSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           T + +PC+S  C      Q  C   G +C Y   Y  D + S G L  D + L   +   
Sbjct: 147 TFTLLPCDSQPCTQLPYSQYVCSDYG-DCIYAYTY-GDNSYSYGGLSSDSIRLMLLQLH- 203

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
              +S+I FGCG     +        G+ GLG    S+ S L ++  I + FS C   F 
Sbjct: 204 --YNSKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFS 259

Query: 275 SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV---NFEFSAIFD 328
           S+   ++ FG+       G   TP  ++   P Y + +  ++VG   V     + + I D
Sbjct: 260 SNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIID 319

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCY 369
           SG++ TYL +  Y +    F SL KE     E      PF++C+
Sbjct: 320 SGSTLTYLEESFYNE----FVSLVKETVAVEEDQYIPYPFDFCF 359


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 88/346 (25%), Positives = 135/346 (39%), Gaps = 45/346 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P    ++ +DTGSD+ WL C  CV C   L+          +Y P  SST 
Sbjct: 99  YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSP---------LYDPRGSSTY 149

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           ++ PC+   C   + C      C Y++ Y  D + ++G L  D L  + D          
Sbjct: 150 AQTPCSPPQCRNPQTCDGTTGGCGYRIVY-GDASSTSGNLATDRLVFSNDTSVGN----- 203

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGT 278
           ++ GCG    G F   A   GL G+     S  + +A+       F+ C G        +
Sbjct: 204 VTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRSGSSS 258

Query: 279 GRISFGDKG--SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV----NFEFS------- 324
             + FG      P    TP       P+ Y + +   SVGG  V    N   S       
Sbjct: 259 SYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGR 318

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYVLSPNQTNFEY 380
              + DSGTS T     AY  + + F++ A +   R+       F+ CY L       + 
Sbjct: 319 GGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLR-GVAVADA 377

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P V L   GG    +     +V  E    + + L     D +++IG
Sbjct: 378 PGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIG 423


>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 547

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/196 (32%), Positives = 94/196 (47%), Gaps = 17/196 (8%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           LG+ +YT +++G P  +    LDTGS L   PC    C     S +G      ++ P  S
Sbjct: 78  LGY-YYTYLTIGTPGQTVSGILDTGSTLPAFPCS--GCTRCGPSKTG------MFKPELS 128

Query: 161 STSSKVPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           STSS   C+   C      C      C Y +RYL +G+ ++GFL ED+L +      +  
Sbjct: 129 STSSTFGCSDARCFCGANSCSCNNEQCGYSIRYL-EGSSTSGFLAEDMLAVGDGGPAANF 187

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           V     FGC + ++G  L     +G+FG+G    S+   L  QG+I ++FSMCFG+   G
Sbjct: 188 V-----FGCAQSESG-LLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREG 241

Query: 280 RISFGDKGSPGQGETP 295
            +  G+   P     P
Sbjct: 242 VLLLGNVALPADAPAP 257


>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
          Length = 775

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 119/282 (42%), Gaps = 36/282 (12%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +++G PA S+ + +DTGS L WL CD  C +C          ++   +Y P   +  
Sbjct: 404 FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC---------NIVPHVLYKP---TPK 451

Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             V C  +LC          K+C S    C Y ++Y+   +M  G LV D   L+     
Sbjct: 452 KLVTCADSLCTDLYTDLGKPKRCGSQ-KQCDYVIQYVDSSSM--GVLVIDRFSLSASNGT 508

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
           + +    I+FGCG  Q     +   P + + GL   K ++ S L +QG+I  +    C  
Sbjct: 509 NPTT---IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS 565

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
           S G G + FGD   P  G T   + + H  Y+     +    N+        + IFDSG 
Sbjct: 566 SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGA 625

Query: 332 SFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
           ++TY     Y    + +  T NS  K   E +  D     C+
Sbjct: 626 TYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCW 667



 Score = 42.0 bits (97), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 47/167 (28%), Positives = 71/167 (42%), Gaps = 27/167 (16%)

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ-TGSFLDGAAP 242
           + C Y+++Y +DG  + G L+ D   L        +    + FGCG  Q  G      +P
Sbjct: 27  TQCDYEIKY-ADGASTIGALIVDQFSLP-----RIATRPNLPFGCGYNQGIGENFQQTSP 80

Query: 243 -NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ 300
            NG+ GL   K S  S L   G+I  +    C  S G G +  GD    G G    +L  
Sbjct: 81  VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGD----GDG----NLVL 132

Query: 301 THPTY------NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAY 341
            H  Y       +   + S+G N ++     +FDSG+++TY     Y
Sbjct: 133 LHANYYSPGSATLYFDRHSLGMNPMD----VVFDSGSTYTYFTAQPY 175


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/337 (27%), Positives = 138/337 (40%), Gaps = 43/337 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           H   + +G P +     +DTGSDL W+ C  C+ C   +           ++ P  SST 
Sbjct: 68  HLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKP---------MFDPLKSSTY 118

Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
           + + C+S LC +L     S    C Y   Y  D +++ G L +D     ++  +  S+ S
Sbjct: 119 NNISCDSPLCHKLDTGVCSPEKRCNYTYGY-GDNSLTKGVLAQDTATFTSNTGKPVSL-S 176

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA--------NQGLIPNSFSMCFG 274
           R  FGCG   TG F D     GL GLG   TS+ S +         +Q L+P    +   
Sbjct: 177 RFLFGCGHNNTGGFNDHEM--GLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKIS 234

Query: 275 SDGTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSVGG-----NAVNFEFSA 325
           S    R+SFG KGS   G     TP   R+   +Y +T+  +SV       N+   + + 
Sbjct: 235 S----RMSFG-KGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKANM 289

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           + DSGT    L    Y ++     +    K  T    L  + CY     QTN + P +  
Sbjct: 290 LVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYR---TQTNLKGPTLTF 346

Query: 386 TMKGGGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDN 421
              G        PI   +   P+   ++CL +    N
Sbjct: 347 HFVGANVLLT--PIQTFIPPTPQTKGIFCLAIYNRTN 381


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 143/332 (43%), Gaps = 58/332 (17%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS- 164
            N+S+GQP++  +V +DTGSD+ W+ C+ C +C + L           ++ P+ SST S 
Sbjct: 103 VNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGL---------LFDPSMSSTFSP 153

Query: 165 --KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             K PC    C+            P+ + Y+ + + S  F  + ++   TDE  S+  D 
Sbjct: 154 LCKTPCGFKGCKCDP--------IPFTISYVDNSSASGTFGRDILVFETTDEGTSQISD- 204

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---- 278
            +  GCG      F      NG+ GL     + P+ LA Q  I   FS C G+       
Sbjct: 205 -VIIGCG--HNIGFNSDPGYNGILGL----NNGPNSLATQ--IGRKFSYCIGNLADPYYN 255

Query: 279 -GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AI 326
             ++  G+        TPF +   H  Y +T+  +SVG   ++     FE         I
Sbjct: 256 YNQLRLGEGADLEGYSTPFEVY--HGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVI 313

Query: 327 FDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV-- 383
            DSGT+ TYL D A+  + +E  N L    R+    + P++ CY    ++    +PVV  
Sbjct: 314 LDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTF 373

Query: 384 ------NLTMKGGGPFFVNDPIVIVSSEPKGL 409
                 +L +  G  F   D I  ++  P  +
Sbjct: 374 HFVDGADLALDTGSFFSQRDDIFCMTVSPASI 405


>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
          Length = 411

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 120/279 (43%), Gaps = 29/279 (10%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
           +  +++  PA  + + +DTGS L WL CD  C++C           +   +Y P    + 
Sbjct: 39  FVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89

Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             ++  C     +L+K       N C Y ++Y+  G  S G L+ D   L      +   
Sbjct: 90  KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144

Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
            + I+FGCG  Q  +  +   P NG+ GLG  K ++ S L +QG+I  +    C  S G 
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN----FEFSAIFDSGTSFT 334
           G + FGD   P  G T   + + H  Y+     +    N  +         IFDSG ++T
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYT 264

Query: 335 YLN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
           Y    P +  +S   ++L+KE +   E    D     C+
Sbjct: 265 YFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 303


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 115/427 (26%), Positives = 167/427 (39%), Gaps = 74/427 (17%)

Query: 29  TFGFDFHHRYSDPVKGILAVDDLP---KKGSFAYYSALAHRDRYFRLRGRGLAA----QG 81
           T GF    R+ D  K +  ++ +    K+G          + R  +L    LAA      
Sbjct: 44  TNGFRVMLRHVDSGKNLTKLERVQHGIKRG----------KSRLQKLNAMVLAASSTPDS 93

Query: 82  NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVH 140
            D+      AGN  Y +          +++G P +S+   LDTGSDL W  C  C  C  
Sbjct: 94  EDQLEAPIHAGNGEYLIE---------LAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYK 144

Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTM 198
                        I+ P  SS+ SKV C S+LC      PS+     C Y   Y  D +M
Sbjct: 145 QPTP---------IFDPKKSSSFSKVSCGSSLCS---ALPSSTCSDGCEYVYSY-GDYSM 191

Query: 199 STGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI 258
           + G L  +       + ++K     I FGCG    G   + A+  GL GLG    S+ S 
Sbjct: 192 TQGVLATETFTFG--KSKNKVSVHNIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQ 247

Query: 259 LANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-------TPFSLRQTHPT-YNITIT 310
           L  Q      FS C       + S    GS G+ +       TP       P+ Y +++ 
Sbjct: 248 LKEQ-----RFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLE 302

Query: 311 QVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
            +SVG   ++ E S            I DSGT+ TY+   AY  + + F S  K   +  
Sbjct: 303 AISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALD-K 361

Query: 360 TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
           TS    + C+ L    T  E P +    KGG      +  +I  S    L + CL +  S
Sbjct: 362 TSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDSN---LGVACLAMGAS 418

Query: 420 DNVNIIG 426
             ++I G
Sbjct: 419 SGMSIFG 425


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/297 (30%), Positives = 127/297 (42%), Gaps = 34/297 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   + +G P     +  DTGSDL W  C+ C+   +     S +   FN   P++SST 
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY-----SQKEPKFN---PSSSSTY 183

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             V C+S +CE  + C  + SNC Y + Y  D + + GFL ++   L   +     V   
Sbjct: 184 QNVSCSSPMCEDAESC--SASNCVYSIGY-GDKSFTQGFLAKEKFTLTNSD-----VLED 235

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           + FGCG    G F   A   GL    +   +  +   N     N FS C   F S+ TG 
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN-----NIFSYCLPSFTSNSTGH 290

Query: 281 ISFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
           ++FG  G S     TP S   +   Y I I  +SVG   +      FS   AI DSGT F
Sbjct: 291 LTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 350

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           T L    Y ++   F       + TS   L F+ CY  +   T   YP +  +  GG
Sbjct: 351 TRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCYDFTGLDT-VTYPTIAFSFAGG 405


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/340 (26%), Positives = 144/340 (42%), Gaps = 45/340 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V VG PA    + LDTGSD+ W+ C  C  C    +          ++ P+ S++ 
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSTSY 213

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + V C++  C       C ++   C Y+V Y  DG+ + G    + L L      S    
Sbjct: 214 ASVACDNPRCHDLDAAACRNSTGACLYEVAY-GDGSYTVGDFATETLTLGDSAPVSS--- 269

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F+  A    L G  +   S PS ++       +FS C     S  +
Sbjct: 270 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 319

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
             + FGD              +T   Y + ++ +SVGG  ++   SA           I 
Sbjct: 320 STLQFGDAADAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIV 379

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L   AY  + + F    +    TS   L F+ CY LS ++T+ E P V+L  
Sbjct: 380 DSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVSLRF 437

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
            GGG   +     ++  +  G   YCL    ++  V+IIG
Sbjct: 438 AGGGELRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 475


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 105/345 (30%), Positives = 151/345 (43%), Gaps = 46/345 (13%)

Query: 99  NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
            SL  L Y   V +G P  S  + +DTGSD+ W+ C   S  H             ++ P
Sbjct: 126 TSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 177

Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++SST S   C+S  C    Q    C S  S C Y V Y  DG+ +TG    D L L ++
Sbjct: 178 SSSSTYSPFSCSSAACAQLGQEGNGCSS--SQCQYTVTY-GDGSSTTGTYSSDTLALGSN 234

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             +      +  FGC  V++G F D    +GL GLG    S+ S  A  G    +FS C 
Sbjct: 235 AVR------KFQFGCSNVESG-FND--QTDGLMGLGGGAQSLVSQTA--GTFGAAFSYCL 283

Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA 325
              S  +G ++ G  G+ G  +TP       PT Y + I  + VGG  ++     F    
Sbjct: 284 PATSSSSGFLTLG-AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGT 342

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT  T L   AY+ +S  F +  K+      S +  + C+  S  Q++   P V L
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGI-LDTCFDFS-GQSSVSIPTVAL 400

Query: 386 TMKGGGPF-FVNDPIVIVSSEPKGLYLYCLG-VVKSDN--VNIIG 426
              GG      +D I++ +S      + CL     SD+  + IIG
Sbjct: 401 VFSGGAVVDIASDGIMLQTSNS----ILCLAFAANSDDSSLGIIG 441


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/344 (27%), Positives = 145/344 (42%), Gaps = 52/344 (15%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           ++S+G PA+++   +DTGSDL W  C    CV   N S+       ++ P++SST + +P
Sbjct: 105 DMSIGTPAVAYAAIIDTGSDLVWTQCK--PCVECFNQST------PVFDPSSSSTYAALP 156

Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           C+STLC          + C Y   Y  D + + G L  +   LA      K+    ++FG
Sbjct: 157 CSSTLCSDLPSSKCTSAKCGYTYTY-GDSSSTQGVLAAETFTLA------KTKLPDVAFG 209

Query: 228 CGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR----- 280
           CG    G  F  GA   GL GLG    S+ S L   GL  N FS C  S D T +     
Sbjct: 210 CGDTNEGDGFTQGA---GLVGLGRGPLSLVSQL---GL--NKFSYCLTSLDDTSKSPLLL 261

Query: 281 -----ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
                IS     +     TP     + P+ Y + +  ++VG   +    SA         
Sbjct: 262 GSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTG 321

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FEYPV 382
             I DSGTS TYL    Y  + + F +  K       S +  + C+    +  +  E P 
Sbjct: 322 GVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADG-SGIGLDTCFEAPASGVDQVEVPK 380

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           +   + G       +  +++ S    L   CL V+ S  ++IIG
Sbjct: 381 LVFHLDGADLDLPAENYMVLDSGSGAL---CLTVMGSRGLSIIG 421


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 100/341 (29%), Positives = 144/341 (42%), Gaps = 48/341 (14%)

Query: 66  RDRYFRLRGRGLAAQGNDKTP----LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
           + R  ++ G G+  +   K P    +    GN           +   V +G P   F + 
Sbjct: 103 QARLSKISGHGIFEEMVTKLPAQSGIAIGTGN-----------YVVTVGLGTPKEDFTLV 151

Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QK 177
            DTGS + W  C    C+        Q  D     P  S++ + V C+S  C L    ++
Sbjct: 152 FDTGSGITWTQCQ--PCLGSCYPQKEQKFD-----PTKSTSYNNVSCSSASCNLLPTSER 204

Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
            C ++ S C YQ+ Y  D + S GF   + L ++     S  V +   FGCG+   G F 
Sbjct: 205 GCSASNSTCLYQIIY-GDQSYSQGFFATETLTIS-----SSDVFTNFLFGCGQSNNGLFG 258

Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETP 295
             A   GL GL     S+PS  A +      FS C  S    TG ++FG K S   G TP
Sbjct: 259 QAA---GLLGLSSSSVSLPSQTAEK--YQKQFSYCLPSTPSSTGYLNFGGKVSQTAGFTP 313

Query: 296 FSLRQTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFN 349
            S       Y I I  +SV G+ +  + S      AI DSGT  T L   AY  + E F+
Sbjct: 314 IS-PAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLPPTAYKALKEAFD 372

Query: 350 SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
                  +T+  +L  + CY  S N T   +P V+++ KGG
Sbjct: 373 EKMSNYPKTNGDEL-LDTCYDFS-NYTTVSFPKVSVSFKGG 411


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 110/424 (25%), Positives = 167/424 (39%), Gaps = 73/424 (17%)

Query: 35  HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR---GRGLAAQ---GNDKTPLT 88
           H RY   ++ +LA D+               R   F+LR    R  AA    G+ + PLT
Sbjct: 133 HDRY---LRRLLAADE--------------SRANSFQLRIRNDRAAAASTQSGSAEVPLT 175

Query: 89  FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
             +G     LN +  +     S G PA +  V +DTGSDL W+ C  C +C    +    
Sbjct: 176 --SGIRFQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDP--- 230

Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMS 199
                 ++ P  S+T + V CN++ C    +        C      C Y + Y  DG+ S
Sbjct: 231 ------LFDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAY-GDGSFS 283

Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
            G L  D + L        S+D  + FGCG    G F       GL GLG  + S+ S  
Sbjct: 284 RGVLATDTVALG-----GASLDGFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQT 334

Query: 260 ANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQT------HPTYNITI 309
           A +      FS C       D +G +S G   S  +  TP +  +        P Y + +
Sbjct: 335 ALR--YGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNV 392

Query: 310 TQVSVGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLP 364
           T  +VGG A+  +     + + DSGT  T L    Y  +   F    A     T+     
Sbjct: 393 TGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSI 452

Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKSDNV 422
            + CY L+      + P++ L ++GG    V+    + +V  +   + L    +   D  
Sbjct: 453 LDTCYDLT-GHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQT 511

Query: 423 NIIG 426
            IIG
Sbjct: 512 PIIG 515


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 86/346 (24%), Positives = 147/346 (42%), Gaps = 47/346 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G  + SV   L       + FS C       
Sbjct: 109 ----FTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
             F S  TG  S G K +  + +  +    + R+    + + +T +SV G  +    S  
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220

Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + 
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DM 277

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 PAISLHFDDGARFDLGRHGVFVERSVQEQDVWCLAFAPTESVSIIG 323


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 149/401 (37%), Gaps = 60/401 (14%)

Query: 65  HRDRYFR------LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
           HR  Y R       RGR  A  G     +  S+G  T         ++    VG PA  F
Sbjct: 60  HRHAYIRSQLASSRRGRRAAEVGASAFAMPLSSGAYTGTGQ-----YFVRFRVGTPAQPF 114

Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-- 176
           ++  DTGSDL W+ C       G  + S       ++    S + + + C+S  C     
Sbjct: 115 VLVADTGSDLTWVKCRGAGAAAGTGAGSPA----RVFRTAASKSWAPIACSSDTCTSYVP 170

Query: 177 ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR---------- 223
                C S  S C Y  RY  DG+ + G +  D   +A      +               
Sbjct: 171 FSLANCSSPASPCAYDYRY-RDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQG 229

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGT 278
           +  GC     G     +  +G+  LG    S  S  A +      FS C        + T
Sbjct: 230 VVLGCAATYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFSYCLVDHLAPRNAT 285

Query: 279 GRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFEFS---------AIFD 328
             ++FG   +    +TP  L R+  P Y +T+  V V G A++             AI D
Sbjct: 286 SYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDRNGGAILD 345

Query: 329 SGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SGTS T L  PAY  +    +  LA   R T     PFEYCY  + +    E P + +  
Sbjct: 346 SGTSLTILATPAYRAVVTALSKHLAGLPRVTMD---PFEYCYNWT-DAGALEIPKMEVHF 401

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
            G           ++ + P    + C+GV +     V++IG
Sbjct: 402 AGSARLEPPAKSYVIDAAPG---VKCIGVQEGSWPGVSVIG 439


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 92/365 (25%), Positives = 148/365 (40%), Gaps = 54/365 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++    VG PA  F++  DTGSDL W+ C    S    L+ +         + P  S T 
Sbjct: 97  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156

Query: 164 SKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           + + C S  C          CP+ GS C Y  RY  DG+ + G +  +   +A   ++ +
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSGREER 215

Query: 219 SVDSR-ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
               + +  GC    TG   +  A +G+  LG    S  S  A++      FS C     
Sbjct: 216 KAKLKGLVLGCSSSYTGPSFE--ASDGVLSLGYSGISFASHAASR--FGGRFSYCLVDHL 271

Query: 274 -GSDGTGRISFGDK---GSPGQG------------ETPFSL-RQTHPTYNITITQVSVGG 316
              + T  ++FG      SP               +TP  L R+  P Y++++  +SV G
Sbjct: 272 SPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331

Query: 317 NAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFE 366
             +    +          I DSGTS T L  PAY  +    +  LA   R T     PFE
Sbjct: 332 EFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMD---PFE 388

Query: 367 YCY-VLSPNQTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKS--DN 421
           YCY   SP+  + +  V  + +   G   +  P    ++ + P    + C+G+ +     
Sbjct: 389 YCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPG---VKCIGLQEGPWPG 445

Query: 422 VNIIG 426
           +++IG
Sbjct: 446 ISVIG 450


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/326 (29%), Positives = 129/326 (39%), Gaps = 55/326 (16%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           +VS+G P     V LDTGS L W+PC         +SS   +    ++ P  SS+S  V 
Sbjct: 94  SVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVG 153

Query: 168 CNSTLCEL-----QKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           C +  C          C S G+N       PY V Y S  T  +G L+ D L L+     
Sbjct: 154 CRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGST--SGLLISDTLRLSPSSSS 211

Query: 217 SKSVDSR-ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S     R  + GC  V          P+GL G G    SVPS L     +P  FS C   
Sbjct: 212 SAPAPFRNFAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLK----VPK-FSYCLLS 261

Query: 274 -----GSDGTGRISFGDKGSP-GQGETPFSL------RQTHPTYNI----TITQVSVGGN 317
                 S  +G +  GD   P G+ +T            + P Y++     +T +SVGG 
Sbjct: 262 RRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGK 321

Query: 318 AVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSL--AKEKRETSTSD-LPF 365
            VN             AI DSGT+FTYL+   +  ++    S    +  R     D L  
Sbjct: 322 PVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL 381

Query: 366 EYCYVLSPNQTN-FEYPVVNLTMKGG 390
             C+ L P      E P + L  KGG
Sbjct: 382 RPCFALPPGPGGAMELPDLELKFKGG 407


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 106/393 (26%), Positives = 165/393 (41%), Gaps = 54/393 (13%)

Query: 28  GTFGFDFHHRY----SDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
           G      HHR+    + P      ++D+ ++      +A   R +Y  + G     +G+D
Sbjct: 55  GVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQL--RAAYITR-KYSGVNGSAGDVEGSD 111

Query: 84  KT-PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
            T P T     DT         +   V +G PA++  + +DTGSD+ W+ C   S  H  
Sbjct: 112 VTVPTTLGTSLDTLE-------YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQ 164

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
             S        ++ P++SST S   C S  C   +Q   + S C Y V+Y  DG+  +G 
Sbjct: 165 ADS--------LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKY-GDGSTGSGT 215

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
              D L L +      S      FGC + ++G+ L       +   G  ++     LA Q
Sbjct: 216 YSSDTLALGS------STVENFQFGCSQSESGNLLQDQTAGLMGLGGGAES-----LATQ 264

Query: 263 --GLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSV 314
             G    +FS C     GS  +G ++ G   S    +TP  LR T  P+ Y + +  + V
Sbjct: 265 TAGTFGKAFSYCLPPTPGS--SGFLTLGASTSGFVVKTPM-LRSTQVPSYYGVLLQAIRV 321

Query: 315 GGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           GG  +N   SA     I DSGT  T L   AY+ +S  F +  K+        + F+ C+
Sbjct: 322 GGRQLNIPASAFSAGSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGI-FDTCF 380

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPF-FVNDPIVI 401
             S  Q++   P V L   GG      +D I++
Sbjct: 381 DFS-GQSSVSIPTVALVFSGGAVVDLASDGIIL 412


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 91/340 (26%), Positives = 144/340 (42%), Gaps = 45/340 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V VG PA    + LDTGSD+ W+ C  C  C    +          ++ P+ S++ 
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSTSY 217

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + V C++  C       C ++   C Y+V Y  DG+ + G    + L L      S    
Sbjct: 218 ASVACDNPRCHDLDAAACRNSTGACLYEVAY-GDGSYTVGDFATETLTLGDSAPVSS--- 273

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F+  A    L G  +   S PS ++       +FS C     S  +
Sbjct: 274 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 323

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
             + FGD              +T   Y + ++ +SVGG  ++   SA           I 
Sbjct: 324 STLQFGDAADAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIV 383

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L   AY  + + F    +    TS   L F+ CY LS ++T+ E P V+L  
Sbjct: 384 DSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVSLRF 441

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
            GGG   +     ++  +  G   YCL    ++  V+IIG
Sbjct: 442 AGGGELRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 479


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/328 (26%), Positives = 137/328 (41%), Gaps = 32/328 (9%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +DTGS + ++PC   +C H     S Q   F    P  S T   V
Sbjct: 95  TRLWIGTPPQRFALIVDTGSTVTYVPCS--TCKH---CGSHQDPKFR---PEASETYQPV 146

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            C       Q  C      C Y+ RY ++ + S+G L EDV+       QS+    R  F
Sbjct: 147 KCT-----WQCNCDDDRKQCTYERRY-AEMSTSSGVLGEDVVSFGN---QSELSPQRAIF 197

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
           GC   +TG   +  A +G+ GLG    S+   L  + +I ++FS+C+G    G G +  G
Sbjct: 198 GCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLG 256

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLN 337
               P       S     P YNI + ++ V G  ++        +   + DSGT++ YL 
Sbjct: 257 GISPPADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCY---VLSPNQTNFEYPVVNLTMKGGGPF 393
           + A+              +  S  D  + + C+    ++ +Q +  +PVV +   G G  
Sbjct: 317 ESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVF-GNGHK 375

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
               P   +    K    YCLGV  + N
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFSNGN 403


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 101/343 (29%), Positives = 142/343 (41%), Gaps = 59/343 (17%)

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G PA + ++ +DTGSDL W+ C  C  C   +++         I+ P  SS+   +PC S
Sbjct: 144 GTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDA---------IFEPKQSSSYKTLPCLS 194

Query: 171 TLC-EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
             C EL        P     C Y++ Y  DG+ S G   ++ L L +D  Q+       +
Sbjct: 195 ATCTELITSESNPTPCLLGGCVYEINY-GDGSSSQGDFSQETLTLGSDSFQN------FA 247

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRI 281
           FGCG   TG F      +GL GLG +  S PS   ++      F+ C      S  TG  
Sbjct: 248 FGCGHTNTGLF---KGSSGLLGLGQNSLSFPS--QSKSKYGGQFAYCLPDFGSSTSTGSF 302

Query: 282 SFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGN------AVNFEFSAIFDSGTSF 333
           S G    P     TP      +PT Y + +  +SVGG+      AV    S I DSGT  
Sbjct: 303 SVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTVI 362

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLP-------FEYCYVLSPNQTNFEYPVVNLT 386
           T L   AY  +  +F S         T DLP        + CY LS   +    P +   
Sbjct: 363 TRLLPQAYNALKTSFRS--------KTRDLPSAKPFSILDTCYDLS-RHSQVRIPTITFH 413

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
            +      V+D  ++V  +  G  + CL    +   D  NIIG
Sbjct: 414 FQNNADVAVSDVGILVPVQNGGSQV-CLAFASASQMDGFNIIG 455


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 146/351 (41%), Gaps = 51/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  V VG PA + ++ LDTGSD+ WL   C  C H   + SG+V D     P  S + +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 173

Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            V C + +C       C    ++C YQV Y  DG+++ G    + L  A   +       
Sbjct: 174 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 227

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
           R++ GCG    G F+   A +GL GLG  + S PS +A       SFS C          
Sbjct: 228 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRP 282

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
            S  +  ++FG           F+    +P     Y + +   SVGG             
Sbjct: 283 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 342

Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
           N        I DSGTS T L  P Y  + + F + A   R +      F+ CY LS  + 
Sbjct: 343 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 402

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
             + P V++ + GG    +     ++  +  G   +C  +  +D  V+IIG
Sbjct: 403 -VKVPTVSMHLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 450


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 151/366 (41%), Gaps = 48/366 (13%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G PA  + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 171 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE--------KLFD 222

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 223 PVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 281

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 282 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 331

Query: 275 SDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
           S GTG + FG            TP  L    PT Y I +T + VGG  ++   S      
Sbjct: 332 STGTGYLDFGAGSPAAASARLTTPM-LTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAG 390

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYP 381
            I DSGT  T L  PAY+ +   F +       K+  + S L  + CY  +   +    P
Sbjct: 391 TIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLL--DTCYDFT-GMSQVAIP 447

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIAN 433
            V+L  +GG    V+   ++ ++    + L          +G+V +  +   G  Y I  
Sbjct: 448 TVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGK 507

Query: 434 NISLFH 439
            +  F+
Sbjct: 508 KVVGFY 513


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/269 (32%), Positives = 119/269 (44%), Gaps = 52/269 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P+   ++ +DTGSDL WL C  C  C     +  GQV D     P  SST 
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136

Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            +VPC+S  C   +   C S   AG  C Y V Y  DG+ STG L  D L  A D     
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGDLATDKLAFAND----- 190

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           +  + ++ GCGR   G F D AA  GL G+G  K S+ + +A      + F  C G D T
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGSVFEYCLG-DRT 244

Query: 279 GR------ISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
            R      + FG   +P    T F+   ++P     Y + +   SVGG  V    +A   
Sbjct: 245 SRSTRSSYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLA 302

Query: 326 ----------IFDSGTSFTYLNDPAYTQI 344
                     + DSGT+ +     AY  +
Sbjct: 303 LDTATGRGGVVVDSGTAISRFARDAYAAL 331


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 143/351 (40%), Gaps = 52/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA   ++ LDTGSD+ WL C  C  C       SGQV D     P  S + 
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYD----QSGQVFD-----PRRSRSY 192

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             V C++ LC       C      C YQV Y  DG+++ G    + L  A   +      
Sbjct: 193 GAVGCSAPLCRRLDSGGCDLRRKACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 246

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
           +RI+ GCG    G F+  A   GL        S P+ ++ +     SFS C         
Sbjct: 247 ARIALGCGHDNEGLFVAAAGLLGLG---RGSLSFPAQISRR--YGRSFSYCLVDRTSSAN 301

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV---------- 319
            +  +  ++FG           F+    +P     Y + +  +SVGG  V          
Sbjct: 302 PASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRL 361

Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
              +     I DSGTS T L  PAY+ + + F + A   R +      F+ CY LS  + 
Sbjct: 362 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKV 421

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
             + P V++   GG    +     ++  + KG   +C     +D  V+IIG
Sbjct: 422 -VKVPTVSMHFAGGAEAALPPENYLIPVDSKG--TFCFAFAGTDGGVSIIG 469


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 152/363 (41%), Gaps = 52/363 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V VGQPA  F + LDTGSD+ WL C  C  C    +          I+ P +SS+ 
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPRSSSSF 205

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + +PC S  C+  +      S C YQV Y  DG+ + G  V + L        +  + + 
Sbjct: 206 ASLPCESQQCQALETSGCRASKCLYQVSY-GDGSFTVGEFVTETLTFG-----NSGMIND 259

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A   GL G  +  TS         +  +SFS C     S  +  
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGPLSLTS--------QMKASSFSYCLVDRDSSSSSD 311

Query: 281 ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIFD 328
           + F           P        T Y + +T +SVGG  ++     F+         I D
Sbjct: 312 LEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVD 371

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGT+ T L   AY  + + F S     ++T+   L F+ CY LS +Q+    P V+    
Sbjct: 372 SGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL-FDTCYDLS-SQSRVTIPTVSFEFA 429

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNISLF-- 438
           GG    +     ++  +  G + +      S +++IIG          Y +AN++  F  
Sbjct: 430 GGKSLQLPPKNYLIPVDSVGTFCFAFAPTTS-SLSIIGNVQQQGTRVHYDLANSVVGFSP 488

Query: 439 HNC 441
           H C
Sbjct: 489 HKC 491


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 112/425 (26%), Positives = 165/425 (38%), Gaps = 67/425 (15%)

Query: 27  FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA---QGND 83
           + T GF    R+ D  K +  ++ +        +     + R  RL    LAA      D
Sbjct: 43  YPTKGFRVMLRHVDSGKNLTKLERV-------QHGIKRGKSRLQRLNAMVLAASTLDSED 95

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
           +      AGN  Y +          +++G P +S+   LDTGSDL W  C  C  C    
Sbjct: 96  QLEAPIHAGNGEYLME---------LAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQP 146

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMST 200
                      I+ P  SS+ SKV C S+LC      PS+     C Y   Y  D +M+ 
Sbjct: 147 TP---------IFDPKKSSSFSKVSCGSSLCS---AVPSSTCSDGCEYVYSY-GDYSMTQ 193

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L  +       + ++K     I FGCG    G   + A+  GL GLG    S+ S L 
Sbjct: 194 GVLATETFTFG--KSKNKVSVHNIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQLK 249

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-------TPFSLRQTHPT-YNITITQV 312
                   FS C       + S    GS G+ +       TP       P+ Y +++  +
Sbjct: 250 EP-----RFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGI 304

Query: 313 SVGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
           SVG   ++ E S            I DSGT+ TY+   A+  + + F S  K   +  TS
Sbjct: 305 SVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLD-KTS 363

Query: 362 DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
               + C+ L    T  E P +    KGG      +  +I  S    L + CL +  S  
Sbjct: 364 STGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSN---LGVACLAMGASSG 420

Query: 422 VNIIG 426
           ++I G
Sbjct: 421 MSIFG 425


>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 320

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 83/160 (51%), Gaps = 15/160 (9%)

Query: 195 DGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMD 251
           DG+ + G+LV+DV+HL   T  +Q+ S +  I FGCG  Q+G   +  AA +G+ G G  
Sbjct: 4   DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63

Query: 252 KTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
            +S  S LA+QG +  SF+ C   ++G G  + G+  SP    TP   +  H  Y++ + 
Sbjct: 64  NSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLN 121

Query: 311 QVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
            + VG + +    +A         I DSGT+  YL D  Y
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVY 161


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 140/345 (40%), Gaps = 42/345 (12%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN-SLGFLHYT-NVSVGQPALSFIVAL 122
            R  Y + R  G AA           A      L  S+G L Y   VS+G PA++  + +
Sbjct: 89  RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 148

Query: 123 DTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
           DTGSD+ W+   PC    C    +          ++ P  SS+ S VPC +  C      
Sbjct: 149 DTGSDVSWVQCKPCPSPPCYSQRDP---------LFDPTRSSSYSAVPCAAASCSQLALY 199

Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
              C  +G  C Y V Y  DG+ +TG    D L L         +     FGCG  Q G 
Sbjct: 200 SNGC--SGGQCGYVVSY-GDGSTTTGVYSSDTLTLTGSNALKGFL-----FGCGHAQQGL 251

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGS-PGQG 292
           F   A  +GL GLG    S+ S  ++       FS C     +  G IS G   S  G  
Sbjct: 252 F---AGVDGLLGLGRQGQSLVSQASST--YGGVFSYCLPPTQNSVGYISLGGPSSTAGFS 306

Query: 293 ETPFSLRQTHPTYNIT-ITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDPAYTQISE 346
            TP       PTY I  +  +SVGG  ++ + S     A+ D+GT  T L   AY+ +  
Sbjct: 307 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRS 366

Query: 347 TFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            F  ++A     ++ +    + CY  +   T    P +++   GG
Sbjct: 367 AFRAAMAPYGYPSAPATGILDTCYDFTRYGT-VTLPTISIAFGGG 410


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 92/346 (26%), Positives = 147/346 (42%), Gaps = 51/346 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V +G P     + +DTGSD+ W+ C  C SC    ++         ++ P  SS+ 
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDA---------VFDPRASSSF 64

Query: 164 SKVPCNSTLCELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            ++ C++  C+L   K C S  + C YQV Y  DG+ + G L  D   +      S+   
Sbjct: 65  RRLSCSTPQCKLLDVKACASTDNRCLYQVSY-GDGSFTVGDLASDSFSV------SRGRT 117

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
           S + FGCG    G F+  A      GLG  K S PS L+++      FS C      G  
Sbjct: 118 SPVVFGCGHDNEGLFVGAAGLL---GLGAGKLSFPSQLSSR-----KFSYCLVSRDNGVR 169

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------- 325
            +  + FGD   P      ++    +P     Y   ++ +S+GG  ++   +A       
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                I DSGTS T L   AYT + + F S  ++    +   L F+ CY  S   T+   
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL-FDTCYDFSA-LTSVTI 287

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P V+   +GG    +     +V  +  G + +       D ++IIG
Sbjct: 288 PTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLD-LSIIG 332


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 130/303 (42%), Gaps = 39/303 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     +  DTGSDL W  C  CV   +             I++P+ S++ 
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 155

Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
             V C+S  C  L     +AG    SNC Y ++Y  D + S GFL ++   L   +    
Sbjct: 156 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTNSD---- 210

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
            V   + FGCG    G F   A   GL GLG DK S PS  A        FS C  S   
Sbjct: 211 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 264

Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
            TG ++FG  G S     TP S +      Y + I  ++VGG  +      FS   A+ D
Sbjct: 265 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 324

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTM 387
           SGT  T L   AY  +  +F   AK  +  +TS +   + C+ LS  +T    P V  + 
Sbjct: 325 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGFKT-VTIPKVAFSF 381

Query: 388 KGG 390
            GG
Sbjct: 382 SGG 384


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 143/356 (40%), Gaps = 58/356 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA   ++ LDTGSD+ W+ C  C  C       SG V D     P  SS+ 
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYE----QSGPVFD-----PRRSSSY 179

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             V C + LC       C      C YQV Y  DG+++ G  V + L  A   +      
Sbjct: 180 GAVGCGAALCRRLDSGGCDLRRGACMYQVAY-GDGSVTAGDFVTETLTFAGGARV----- 233

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
           +R++ GCG    G F+  A   GL        S P+ ++ +     SFS C         
Sbjct: 234 ARVALGCGHDNEGLFVAAAGLLGLG---RGGLSFPTQISRR--YGRSFSYCLVDRTSSGA 288

Query: 274 ----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------ 319
               GS  +  +SFG  GS G     F+    +P     Y + +  +SVGG  V      
Sbjct: 289 GAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES 347

Query: 320 -------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
                        I DSGTS T L   +Y+ + + F + A      S      F+ CY L
Sbjct: 348 DLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDL 407

Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
              +   + P V++   GG    +     ++  + +G   +C     +D  V+IIG
Sbjct: 408 GGRRV-VKVPTVSMHFAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 460


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 146/351 (41%), Gaps = 51/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  V VG PA + ++ LDTGSD+ WL   C  C H   + SG+V D     P  S + +
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 179

Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            V C + +C       C    ++C YQV Y  DG+++ G    + L  A   +       
Sbjct: 180 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 233

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
           R++ GCG    G F+   A +GL GLG  + S PS +A       SFS C          
Sbjct: 234 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRP 288

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
            S  +  ++FG           F+    +P     Y + +   SVGG             
Sbjct: 289 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 348

Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
           N        I DSGTS T L  P Y  + + F + A   R +      F+ CY LS  + 
Sbjct: 349 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 408

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
             + P V++ + GG    +     ++  +  G   +C  +  +D  V+IIG
Sbjct: 409 -VKVPTVSMHLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 456


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 140/345 (40%), Gaps = 42/345 (12%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN-SLGFLHYT-NVSVGQPALSFIVAL 122
            R  Y + R  G AA           A      L  S+G L Y   VS+G PA++  + +
Sbjct: 100 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 159

Query: 123 DTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
           DTGSD+ W+   PC    C    +          ++ P  SS+ S VPC +  C      
Sbjct: 160 DTGSDVSWVQCKPCPSPPCYSQRDP---------LFDPTRSSSYSAVPCAAASCSQLALY 210

Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
              C  +G  C Y V Y  DG+ +TG    D L L         +     FGCG  Q G 
Sbjct: 211 SNGC--SGGQCGYVVSY-GDGSTTTGVYSSDTLTLTGSNALKGFL-----FGCGHAQQGL 262

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGS-PGQG 292
           F   A  +GL GLG    S+ S  ++       FS C     +  G IS G   S  G  
Sbjct: 263 F---AGVDGLLGLGRQGQSLVSQASST--YGGVFSYCLPPTQNSVGYISLGGPSSTAGFS 317

Query: 293 ETPFSLRQTHPTYNIT-ITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDPAYTQISE 346
            TP       PTY I  +  +SVGG  ++ + S     A+ D+GT  T L   AY+ +  
Sbjct: 318 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRS 377

Query: 347 TFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            F  ++A     ++ +    + CY  +   T    P +++   GG
Sbjct: 378 AFRAAMAPYGYPSAPATGILDTCYDFTRYGT-VTLPTISIAFGGG 421


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 27/249 (10%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           +G P   F + +DTGS + ++PC+  SC    N    +      + P+ S T   V CN 
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCN--SCDQCGNHQDPK------FQPDLSDTYHPVKCNP 53

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
                   C +    C Y+ +Y ++ + S+G L ED++        S+    R  FGC  
Sbjct: 54  DCT-----CDTENDQCTYERQY-AEMSSSSGILGEDLVSFG---NMSELKPQRAVFGCEN 104

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGS 288
            +TG      A +G+ GLG    S+   L  +G+I +SFS+C+G    G G +  G    
Sbjct: 105 AETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISP 163

Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
           P       S     P YNI +  + V G  ++        +   I DSGT++ YL + A+
Sbjct: 164 PSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAF 223

Query: 342 TQISETFNS 350
               +   S
Sbjct: 224 LPFIQAITS 232


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 94/338 (27%), Positives = 143/338 (42%), Gaps = 43/338 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V VGQP+  F + LDTGSD+ WL C  C  C    +          I+ P  SS+ 
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDP---------IFDPTASSSY 207

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + + C++  C+  +        C YQV Y  DG+ + G  V + +        + SV+ R
Sbjct: 208 NPLTCDAQQCQDLEMSACRNGKCLYQVSY-GDGSFTVGEYVTETVSFG-----AGSVN-R 260

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F+  A   GL G  +  TS         +   SFS C     +G+ S 
Sbjct: 261 VAIGCGHDNEGLFVGSAGLLGLGGGPLSLTS--------QIKATSFSYCLVDRDSGKSST 312

Query: 284 GDKGSPGQGET---PFSLRQTHPT-YNITITQVSVGGNAVNF---EFS--------AIFD 328
            +  SP  G++   P    Q   T Y + +T VSVGG  V      F+         I D
Sbjct: 313 LEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVD 372

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGT+ T L   AY  + + F       R      L F+ CY LS  Q+    P V+    
Sbjct: 373 SGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVAL-FDTCYDLSSLQS-VRVPTVSFHFS 430

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           G   + +     ++  +  G Y +      S +++IIG
Sbjct: 431 GDRAWALPAKNYLIPVDGAGTYCFAFAPTTS-SMSIIG 467


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 27/249 (10%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           +G P   F + +DTGS + ++PC+  SC    N    +      + P+ S T   V CN 
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCN--SCDQCGNHQDPK------FQPDLSDTYHPVKCNP 53

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
                   C +    C Y+ +Y ++ + S+G L ED++        S+    R  FGC  
Sbjct: 54  DCT-----CDTENDQCTYERQY-AEMSSSSGILGEDLVSFG---NMSELKPQRAVFGCEN 104

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGS 288
            +TG      A +G+ GLG    S+   L  +G+I +SFS+C+G    G G +  G    
Sbjct: 105 AETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISP 163

Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
           P       S     P YNI +  + V G  ++        +   I DSGT++ YL + A+
Sbjct: 164 PSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAF 223

Query: 342 TQISETFNS 350
               +   S
Sbjct: 224 LPFIQAITS 232


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 140/350 (40%), Gaps = 56/350 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +  ++++G P L     LDTGSDL W  CD  C  C               +Y+P  S+T
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142

Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + V C S +C+ LQ    +C    + C Y   Y  DGT + G L  +   L +D     
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
                ++FGCG    GS  +    +GL G+G    S+ S L         FS CF     
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRGPLSLVSQLGV-----TRFSYCFTPFNA 248

Query: 276 --------DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
                     + R+S   K +P         R+    Y +++  ++VG   +  + +   
Sbjct: 249 TAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFR 308

Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
                    I DSGT+FT L + A+  ++    S  +     S + L    C+  +  + 
Sbjct: 309 LTPMGDGGVIIDSGTTFTALEESAFVALARALASRVRLPL-ASGAHLGLSLCFAAASPEA 367

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             E P + L   G       +  V+   E +   + CLG+V +  ++++G
Sbjct: 368 -VEVPRLVLHFDGADMELRRESYVV---EDRSAGVACLGMVSARGMSVLG 413


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 140/350 (40%), Gaps = 56/350 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +  ++++G P L     LDTGSDL W  CD  C  C               +Y+P  S+T
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142

Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + V C S +C+ LQ    +C    + C Y   Y  DGT + G L  +   L +D     
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
                ++FGCG    GS  +    +GL G+G    S+ S L         FS CF     
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRGPLSLVSQLGV-----TRFSYCFTPFNA 248

Query: 276 --------DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
                     + R+S   K +P         R+    Y +++  ++VG   +  + +   
Sbjct: 249 TAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFR 308

Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
                    I DSGT+FT L + A+  ++    S  +     S + L    C+  +  + 
Sbjct: 309 LTPMGDGGVIIDSGTTFTALEERAFVALARALASRVRLPL-ASGAHLGLSLCFAAASPEA 367

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             E P + L   G       +  V+   E +   + CLG+V +  ++++G
Sbjct: 368 -VEVPRLVLHFDGADMELRRESYVV---EDRSAGVACLGMVSARGMSVLG 413


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 152/363 (41%), Gaps = 52/363 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V VGQPA  F + LDTGSD+ WL C  C  C    +          I+ P +SS+ 
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPRSSSSF 205

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + +PC S  C+  +      S C YQV Y  DG+ + G  V + L        +  + + 
Sbjct: 206 ASLPCESQQCQALETSGCRASKCLYQVSY-GDGSFTVGEFVIETLTFG-----NSGMINN 259

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A   GL G  +  TS         +  +SFS C     S  +  
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGSLSLTS--------QMKASSFSYCLVDRDSSSSSD 311

Query: 281 ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIFD 328
           + F           P        T Y + +T +SVGG  ++     F+         I D
Sbjct: 312 LEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVD 371

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGT+ T L   AY  + + F S     ++T+   L F+ CY LS +Q+    P V+    
Sbjct: 372 SGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL-FDTCYDLS-SQSRVTIPTVSFEFA 429

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNISLF-- 438
           GG    +     ++  +  G + +      S +++IIG          Y +AN++  F  
Sbjct: 430 GGKSLQLPPKNYLIPVDSVGTFCFAFAPTTS-SLSIIGNVQQQGTRVHYDLANSVVGFSP 488

Query: 439 HNC 441
           H C
Sbjct: 489 HKC 491


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 87/271 (32%), Positives = 110/271 (40%), Gaps = 47/271 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     + LDT  D  W+PC DC  C                +SPNTSST 
Sbjct: 99  YVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSS------------PTFSPNTSSTY 146

Query: 164 SKVPCNSTLCELQK--QCPSAG-SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           + + C+   C   +   CP+ G + C +   Y  D + S   L +D L LA D   S   
Sbjct: 147 ASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFS-AMLSQDSLGLAVDTLPS--- 202

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG-LIPNSFSMCFGSDG-- 277
               SFGC    +GS L    P GL GLG       S+L+  G L    FS CF S    
Sbjct: 203 ---YSFGCVNAVSGSTLP---PQGLLGLGRGPM---SLLSQSGSLYSGVFSYCFPSFKSY 253

Query: 278 --TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFE 322
             +G +  G  G P    T   LR  H PT Y + +T VSVG   V           N  
Sbjct: 254 YFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTG 313

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
              I DSGT  T   +P Y  I + F    K
Sbjct: 314 AGTIIDSGTVITRFVEPVYAAIRDEFRKQVK 344


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 86/346 (24%), Positives = 146/346 (42%), Gaps = 47/346 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + I+ +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                SFGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
             F S  TG  S G K +  + +  +    + R+    + + +T +SV G  +    S  
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220

Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + 
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DM 277

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 PAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 93/346 (26%), Positives = 149/346 (43%), Gaps = 51/346 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V +G P     + +DTGSD+ W+ C  C SC    ++         ++ P  SS+ 
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDA---------VFDPRASSSF 64

Query: 164 SKVPCNSTLCELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            ++ C++  C+L   K C S  + C YQV Y  DG+ + G L  D   +      S+   
Sbjct: 65  RRLSCSTPQCKLLDVKACASTDNRCLYQVSY-GDGSFTVGDLASDSFLV------SRGRT 117

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
           S + FGCG    G F+  A      GLG  K S PS L+++      FS C      G  
Sbjct: 118 SPVVFGCGHDNEGLFVGAAGLL---GLGAGKLSFPSQLSSR-----KFSYCLVSRDNGVR 169

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN-----FEFSA-- 325
            +  + FGD   P      ++    +P     Y   ++ +S+GG  ++     F+ S+  
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                I DSGTS T L   AYT + + F S  ++    +   L F+ CY  S   T+   
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL-FDTCYDFSA-LTSVTI 287

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P V+   +GG    +     +V  +  G + +       D ++IIG
Sbjct: 288 PTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLD-LSIIG 332


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 93/309 (30%), Positives = 132/309 (42%), Gaps = 38/309 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNI 154
           +SL  L Y  +V +G PA++  V +DTGSD+ W+   PC    C     + +G + D   
Sbjct: 120 SSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCY----AQTGALFD--- 172

Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
             P  SST   V C +  C +L++Q   C +    C Y V+Y  DG+ + G    D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL 229

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
           +      K       FGC  V++G F D    +GL GLG    S+ S  A      NSFS
Sbjct: 230 SGASDAVKG----FQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTA--AAYGNSFS 280

Query: 271 MCF----GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----F 321
            C     GS G   +  G   S          RQ    Y   +  ++VGG  +      F
Sbjct: 281 YCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVF 340

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
              ++ DSGT  T L   AY+ +S  F +  K+ R      +  + C+  +  QT    P
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI-LDTCFDFA-GQTQISIP 398

Query: 382 VVNLTMKGG 390
            V L   GG
Sbjct: 399 TVALVFSGG 407


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 151/360 (41%), Gaps = 54/360 (15%)

Query: 95  TYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVI 150
           T+  +S+  L Y   + +G PA+  IV +DTGSDL W+   PC    C    +       
Sbjct: 107 TFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDP------ 160

Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQ------CPS-AGSNCPYQVRYLSDGTMSTGFL 203
              ++ P++SS+ + VPC+S  C           C S A + C Y + Y +  T +TG  
Sbjct: 161 ---LFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRAT-TTGVY 216

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
             + L L     +   V +   FGCG  Q G +      +GL GLG    S+ S  ++Q 
Sbjct: 217 STETLTL-----KPGVVVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQF 268

Query: 264 LIPNSFSMCFGSDGTGRISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVG 315
             P S+ +   S G G ++ G          + G   TP     + PT Y +T+T +SVG
Sbjct: 269 GGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVG 328

Query: 316 GNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD-LPFEYCY 369
           G  +    SA     + DSGT  T L   AY  +   F S   E R    S+    + CY
Sbjct: 329 GAPLAVPPSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCY 388

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL---GVVKSDNVNIIG 426
             +   TN   P + LT  GG    +  P  +       L   CL   G    D + IIG
Sbjct: 389 DFT-GHTNVTVPTIALTFSGGATIDLATPAGV-------LVDGCLAFAGAGTDDTIGIIG 440


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 146/351 (41%), Gaps = 51/351 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  V VG PA + ++ LDTGSD+ WL   C  C H   + SG+V D     P  S + +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 173

Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            V C + +C       C    ++C YQV Y  DG+++ G    + L  A   +       
Sbjct: 174 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 227

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
           R++ GCG    G F+   A +GL GLG  + S P+ +A       SFS C          
Sbjct: 228 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSSVRP 282

Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
            S  +  ++FG           F+    +P     Y + +   SVGG             
Sbjct: 283 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 342

Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
           N        I DSGTS T L  P Y  + + F + A   R +      F+ CY LS  + 
Sbjct: 343 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 402

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
             + P V++ + GG    +     ++  +  G   +C  +  +D  V+IIG
Sbjct: 403 -VKVPTVSMHLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 450


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 101/352 (28%), Positives = 148/352 (42%), Gaps = 54/352 (15%)

Query: 65  HRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
            R +Y + R     GR  + +  D T L   +G+     N     ++  V +G P     
Sbjct: 96  ERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSAN-----YFVVVGLGTPKRDLS 150

Query: 120 VALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
           +  DTGSDL W  C+ C  SC    ++         I+ P+ SS+   + C S+LC    
Sbjct: 151 LVFDTGSDLTWTQCEPCAGSCYKQQDA---------IFDPSKSSSYINITCTSSLCTQLT 201

Query: 175 ---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCGR 230
              ++ +C S+ + C Y ++Y  D + S GFL ++ L + ATD      VD  + FGCG+
Sbjct: 202 SAGIKSRCSSSTTACIYGIQY-GDKSTSVGFLSQERLTITATD-----IVDDFL-FGCGQ 254

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGS 288
              G F   A   GL GLG    S   +     +    FS C  S  +  G ++FG   +
Sbjct: 255 DNEGLFSGSA---GLIGLGRHPISF--VQQTSSIYNKIFSYCLPSTSSSLGHLTFGASAA 309

Query: 289 PGQG--ETPFSLRQTHPT-YNITITQVSVGGNAV----NFEFSA---IFDSGTSFTYLND 338
                  TP S      T Y + I  +SVGG  +    +  FSA   I DSGT  T L  
Sbjct: 310 TNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAP 369

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            AY  +   F     EK   +  D  F+ CY  S  +     P ++    GG
Sbjct: 370 TAYAALRSAFRQ-GMEKYPVANEDGLFDTCYDFSGYK-EISVPKIDFEFAGG 419


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/344 (25%), Positives = 143/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           + T+V +G PA + IV +DTGS + W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSSGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 98/332 (29%), Positives = 138/332 (41%), Gaps = 64/332 (19%)

Query: 78  AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV 136
           AA G+ +TPL   +G   Y +           S+G P        DTGSDL W  C  C 
Sbjct: 64  AASGSAQTPLQLDSGGGAYDMT---------FSIGTPPQELSALADTGSDLIWAKCGACT 114

Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRY-- 192
            CV   + S         Y PN SS+ SK+PC+ +LC      QC + G+ C Y+  Y  
Sbjct: 115 RCVPQGSPS---------YYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGL 165

Query: 193 LSDGTMST-GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMD 251
            SD    T G+L  +   L +D          I FGC  +  G +  G+         + 
Sbjct: 166 ASDPHHYTQGYLGSETFTLGSDAVPG------IGFGCTTMSEGGYGSGSG-------LVG 212

Query: 252 KTSVPSILANQGLIPNSFSMCFGSDG--TGRISFGDKGSPGQG--ETPFSLRQTHPTYNI 307
               P  L +Q L   +FS C  SD   T  + FG     G G   TP  LR +   Y +
Sbjct: 213 LGRGPLSLVSQ-LNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQSTPL-LRTSTYYYTV 270

Query: 308 TITQVSVGGNAVNFEFSA--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP- 364
            +  +S+G        S+  IFDSGT+  +L +PAYT        LAKE   + T++L  
Sbjct: 271 NLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYT--------LAKEAVLSQTTNLTM 322

Query: 365 ------FEYCYVLSPNQTNFEYPVVNLTMKGG 390
                 +E C+      +   +P + L   GG
Sbjct: 323 ASGRDGYEVCF----QTSGAVFPSMVLHFDGG 350


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 130/303 (42%), Gaps = 39/303 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     +  DTGSDL W  C  CV   +             I++P+ S++ 
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 183

Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
             V C+S  C  L     +AG    SNC Y ++Y  D + S GFL ++   L   +    
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTNSD---- 238

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
            V   + FGCG    G F   A   GL GLG DK S PS  A        FS C  S   
Sbjct: 239 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 292

Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
            TG ++FG  G S     TP S +      Y + I  ++VGG  +      FS   A+ D
Sbjct: 293 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 352

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTM 387
           SGT  T L   AY  +  +F   AK  +  +TS +   + C+ LS  +T    P V  + 
Sbjct: 353 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGFKT-VTIPKVAFSF 409

Query: 388 KGG 390
            GG
Sbjct: 410 SGG 412


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/269 (27%), Positives = 111/269 (41%), Gaps = 34/269 (12%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +++G P  +F   +DTGSDL W+ CD  C  C    +          +Y P     ++ V
Sbjct: 58  LNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRD---------KLYKPK----NNLV 104

Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           PC+++LC+         C +    C Y++ Y   G+ S G L+ D   L         + 
Sbjct: 105 PCSNSLCQAVSTGENYHCDAPDDQCDYEIEYADLGS-SIGVLLSDSFPLRL--SNGTLLQ 161

Query: 222 SRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
            +++FGCG  Q    L    P    G+ GLG  K S+ S L   G+  N    CF     
Sbjct: 162 PKMAFGCGYDQ--KHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARG 219

Query: 279 GRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTY 335
           G + FGD   P      TP     +   Y+    ++  GG     +    IFDSG+S+TY
Sbjct: 220 GFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTY 279

Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLP 364
            N   Y  I    N + K+       D P
Sbjct: 280 FNAQVYQSI---LNLVRKDLAGKPLKDAP 305


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 140/324 (43%), Gaps = 34/324 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +DTGS + ++PC   +C H      G+  D   + P+ S T   V
Sbjct: 91  TRLWIGTPPQRFALIVDTGSTVTYVPCS--TCEH-----CGRHQDPK-FQPDLSETYQPV 142

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
            C    C     C    + C Y  +Y ++ + S+G L EDV+        S+    R  F
Sbjct: 143 KCTPD-C----NCDGDTNQCMYDRQY-AEMSSSSGVLGEDVVSFG---NLSELAPQRAVF 193

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
           GC   +TG      A +G+ GLG    S+   L ++ +I +SFS+C+G    G G +  G
Sbjct: 194 GCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG 252

Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLN 337
               P       S     P YNI + ++ V G  +         +   + DSGT++ YL 
Sbjct: 253 GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLP 312

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
           + A+              ++ +  D  + + C+    +  +Q    +PVV++  + G   
Sbjct: 313 ETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKL 372

Query: 394 FVN-DPIVIVSSEPKGLYLYCLGV 416
            ++ +  +   S+ +G   YCLGV
Sbjct: 373 SLSPENYLFRHSKVRG--AYCLGV 394


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/296 (30%), Positives = 126/296 (42%), Gaps = 34/296 (11%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   + +G P     +  DTGSDL W  C+ C+   +     S +   FN   P++SST 
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY-----SQKEPKFN---PSSSSTY 183

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             V C+S +CE  + C  + SNC Y + Y  D + + GFL ++   L   +     V   
Sbjct: 184 QNVSCSSPMCEDAESC--SASNCVYSIVY-GDKSFTQGFLAKEKFTLTNSD-----VLED 235

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           + FGCG    G F   A   GL    +   +  +   N     N FS C   F S+ TG 
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN-----NIFSYCLPSFTSNSTGH 290

Query: 281 ISFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
           ++FG  G S     TP S   +   Y I I  +SVG   +      FS   AI DSGT F
Sbjct: 291 LTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 350

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           T L    Y ++   F       + TS   L F+ CY  +   T   YP +  +  G
Sbjct: 351 TRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCYDFTGLDT-VTYPTIAFSFAG 404


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 148/341 (43%), Gaps = 54/341 (15%)

Query: 117 SFIVALDTGSDLFWLPCD-CVSC---VHGLNSSSGQVIDFNIYSPNTSSTSSKVPC---- 168
           ++ + +DTGS   ++PC  C  C    HG             Y  + S    ++ C    
Sbjct: 50  TYDLIVDTGSARTYVPCKGCARCGEHAHGY------------YDYDRSMEFERLDCGEAS 97

Query: 169 NSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           ++TLCE  ++  C S G  C Y V Y ++G+ S G++V D + L        ++ + ++F
Sbjct: 98  DATLCEETMKGTCQSDG-RCSYVVSY-AEGSSSRGYVVRDRVRLG-----EGTLSAMLAF 150

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG----TG 279
           GC   +T +  +  A +GLFG G    +V + LA+ GLI N FS C   FG++G     G
Sbjct: 151 GCEEAETNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLG 209

Query: 280 RISFGDKGSPGQGETPFSLRQTHPTY-NITITQVSVGGNAVNF--EFSAIFDSGTSFTYL 336
           R  FG   +P    TP      +P + N+  +   +G + +     ++   DSGT+FT++
Sbjct: 210 RFDFG-ADAPALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFV 268

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEY---CYVLSPNQTNFE---------YPVVN 384
               +       ++ A +      +    +Y   CY +S    N           +P + 
Sbjct: 269 PRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLT 328

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
           +  +GG    +     + + E      +C+G+  + N  I+
Sbjct: 329 IAYEGGVSLTLGPENYLFAHETNSA-AFCVGIFANPNNQIL 368


>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
 gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
          Length = 149

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/133 (41%), Positives = 69/133 (51%), Gaps = 25/133 (18%)

Query: 29  TFGFDFHHRYSD-------PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
           TF     HR SD       P  G+      P++GS  YY AL   D   + + R LA + 
Sbjct: 26  TFSSRMVHRLSDEARLEAGPRMGLW-----PQRGSGGYYRALLRSD--LQRQKRRLAGKN 78

Query: 82  N----DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
                 K   TFS GND      LG+L+Y  V VG P  SF+VALDTGSDLFW+PCDC+ 
Sbjct: 79  QLLSLSKGGSTFSPGND------LGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132

Query: 138 CVHGLNSSSGQVI 150
           C   L+S  G ++
Sbjct: 133 CAP-LSSYRGNLV 144


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 164/373 (43%), Gaps = 57/373 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  + VG PA  F + +DTGSDL W+ C+  +     NSSS        Y  ++SS+  
Sbjct: 59  YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTT--ANSSSPPA---PWYDKSSSSSYR 113

Query: 165 KVPCNSTLCE-----LQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           ++PC    C+     +   C  ++ S C Y   Y SD + +TG L  + + + + ++  K
Sbjct: 114 EIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGY-SDQSRTTGILAYETISMKSRKRSGK 172

Query: 219 SVDSR---------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
              +          ++ GC R   G+   GA+  G+ GLG    S+ +   +  L    F
Sbjct: 173 RAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIF 229

Query: 270 SMCF-----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
           S C      GS+ +  +  G         TP        + Y + +T V+V G  V+   
Sbjct: 230 SYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 289

Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCY 369
           S+            IFDSGT+ +YL +PAY+++    N+     R     ++P  FE CY
Sbjct: 290 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR---AQEIPEGFELCY 346

Query: 370 VLSPNQTNFE--YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVVKSDNV--N 423
               N T  E   P + +  +GG    +  N+ +V+V+   + + L  +      N+  N
Sbjct: 347 ----NVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGN 402

Query: 424 IIGREYPIANNIS 436
           ++ +++ I  +++
Sbjct: 403 LLQQDHHIEYDLA 415


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/341 (26%), Positives = 147/341 (43%), Gaps = 42/341 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  N+S+G P +  +   DTGSDL W  C+ C  C    +          ++ P  SST 
Sbjct: 86  YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSP---------LFDPKESSTY 136

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            KV C+S+ C   +   C +  + C Y + Y  D + + G +  D + + +  ++  S+ 
Sbjct: 137 RKVSCSSSQCRALEDASCSTDENTCSYTITY-GDNSYTKGDVAVDTVTMGSSGRRPVSLR 195

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG- 277
           + I  GCG   TG+F    A +G+ GLG   TS+ S L     I   FS C   F S+  
Sbjct: 196 NMI-IGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETG 250

Query: 278 -TGRISFGDKG-SPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNF--------EFSA 325
            T +I+FG  G   G G    S+ +  P   Y + +  +SVG   + F        E + 
Sbjct: 251 LTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNI 310

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           + DSGT+ T L    Y ++     S  K +R     D     CY    + ++F+ P + +
Sbjct: 311 VIDSGTTLTLLPSNFYYELESVVASTIKAER-VQDPDGILSLCY---RDSSSFKVPDITV 366

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             KGG     N    +  SE     + C     ++ + I G
Sbjct: 367 HFKGGDVKLGNLNTFVAVSED----VSCFAFAANEQLTIFG 403


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 147/345 (42%), Gaps = 50/345 (14%)

Query: 105 HYTN-VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +YT+ V +G P   F + +DTGS + ++PC   SC H  N    +      +SP  SS+ 
Sbjct: 34  YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCS--SCTHCGNHQDPR------FSPALSSSY 85

Query: 164 SKVPCNST----LCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
             + C S      C+  ++         YQ +Y    T S+G L +DV+  +     S  
Sbjct: 86  KPLECGSECSTGFCDGSRK---------YQRQYAEKST-SSGVLGKDVIGFS---NSSDL 132

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDG 277
              R+ FGC   +TG   D  A +G+ GLG    S+   L  +  + + FS+C+G   +G
Sbjct: 133 GGQRLVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEG 191

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSG 330
            G +  G    P       S     P YN+ +  + VGG+ +         ++  + DSG
Sbjct: 192 GGAMILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSG 251

Query: 331 TSFTYLNDPAYTQISETFNSLAKEK----RETSTSDLPF-EYCYV-LSPNQTNFE--YPV 382
           T++ Y    A+    + F S  KE+    +E    D  F + CY     N +N    +P 
Sbjct: 252 TTYAYFPGAAF----QAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPS 307

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
           V+    G G      P   +    K    YCLGV ++ D   ++G
Sbjct: 308 VDFVF-GDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLG 351


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/337 (27%), Positives = 137/337 (40%), Gaps = 34/337 (10%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P        DTGSDL W  C+ CV   +            +I+ P+TS + 
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQRE--------HIFDPSTSLSY 198

Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S V C+S  CE  +         + S C Y +RY  DG+ S GF   + L L      S 
Sbjct: 199 SNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRY-GDGSYSIGFFAREKLSLT-----ST 252

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
            V +   FGCG+   G F   A   GL GL  +  S+ S  A +     S+ +   S  T
Sbjct: 253 DVFNNFQFGCGQNNRGLFGGTA---GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309

Query: 279 GRISF--GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFDS 329
           G +SF  GD  S     TP  +   +P+ Y + +  +SVG   +    S       I DS
Sbjct: 310 GYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDS 369

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GT  + L    Y+ + + F  L  +        +  + CY LS  +T  + P + L   G
Sbjct: 370 GTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSI-LDTCYDLSKYKT-VKVPKIILYFSG 427

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           G    +    +I   +   + L   G    D V IIG
Sbjct: 428 GAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIG 464


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/299 (27%), Positives = 124/299 (41%), Gaps = 35/299 (11%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            +S+G P    +V + TGSDL W+PC     C H          D   + P  SST   V
Sbjct: 101 KISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHN--------CDLRFFDPMESSTYKNV 152

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           PC+S  C++        S+C Y        +   G L  D L L +   +S  +     F
Sbjct: 153 PCDSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFML-PNTGF 211

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISF 283
            CG    G +       G+ GLG    S+ + +++  LI   FS C   + S+ T ++SF
Sbjct: 212 ICGNRIGGDY----PGVGILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSF 265

Query: 284 GDKGSPGQGETPFSLR--QTHPTYNITIT---------QVSVGGNAVNFEFSAI-FDSGT 331
           GDK     G   FS R   T   Y+ T++          +S GG   ++  + +  DSGT
Sbjct: 266 GDKAVV-SGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGT 324

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            FTY  +  Y+Q+        +++            CY  SP   +F  P + +  +GG
Sbjct: 325 MFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYSP---DFSPPTITMHFEGG 380


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 118/269 (43%), Gaps = 52/269 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P+   ++ +DTGSDL WL C  C  C     +  GQV D     P  SST 
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136

Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            +VPC+S  C   +   C S   AG  C Y V Y  DG+ STG L  D L  A D     
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGELATDKLAFAND----- 190

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           +  + ++ GCGR   G F D AA  GL G+   K S+ + +A      + F  C G D T
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGSVFEYCLG-DRT 244

Query: 279 GR------ISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
            R      + FG   +P    T F+   ++P     Y + +   SVGG  V    +A   
Sbjct: 245 SRSTRSSYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLA 302

Query: 326 ----------IFDSGTSFTYLNDPAYTQI 344
                     + DSGT+ +     AY  +
Sbjct: 303 LDTATGRGGVVVDSGTAISRFARDAYAAL 331


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 147/349 (42%), Gaps = 55/349 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   +++G P  SF V +DTGSDL W+ C  C  C        G   D     P+ S + 
Sbjct: 39  YLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQ----QPGPKFD-----PSKSRSF 89

Query: 164 SKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            K  C   LC +     K C  A + C YQ  Y  D + + G L  + + L  +   ++S
Sbjct: 90  RKAACTDNLCNVSALPLKAC--AANVCQYQYTY-GDQSNTNGDLAFETISL-NNGAGTQS 145

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD 276
           V +  +FGCG    G+F   A   GL GLG    S+ S L++     N FS C     S 
Sbjct: 146 VPN-FAFGCGTQNLGTFAGAA---GLVGLGQGPLSLNSQLSHT--FANKFSYCLVSLNSL 199

Query: 277 GTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
               ++FG   +    + T   +   HPT Y + +  + VGG  +N   S          
Sbjct: 200 SASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGR 259

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS---DLPFEYCYVLSPNQTN-- 377
              I DSGT+ T L  PAY+ +   + S     R   ++   DL F    V +P+  +  
Sbjct: 260 GGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMV 319

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           F++   +  M+G   F      V+V +    L   CL +  S   +IIG
Sbjct: 320 FKFQGADFQMRGENLF------VLVDTSATTL---CLAMGGSQGFSIIG 359


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 126/309 (40%), Gaps = 41/309 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +   V  G P  +  V  DTGS++ W+ C    VSC               ++ P  SST
Sbjct: 16  YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEP---------LFDPTLSST 66

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
              + C S  C        +GS C Y V Y  DG+ + GFL  +   LA     + +V +
Sbjct: 67  YRNISCTSAACTGLSSRGCSGSTCVYGVTY-GDGSSTVGFLATETFTLA-----AGNVFN 120

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGR 280
              FGCG+   G F   A   GL GLG    S+ S LA    + N FS C    S  TG 
Sbjct: 121 NFIFGCGQNNQGLFTGAA---GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGY 175

Query: 281 ISFGDK-GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGTS 332
           ++ G+   +P  G T        PT Y I +  +SVGG  +            I DSGT 
Sbjct: 176 LNIGNPLRTP--GYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTV 233

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT------NFEYPVVNLT 386
            T L   AY  +   F +   +    + + +  + CY  S   T         Y  +++T
Sbjct: 234 ITRLPPTAYGALRTAFRAAMTQYTRAAAASI-LDTCYDFSRTTTVTFPTIKLHYTGLDVT 292

Query: 387 MKGGGPFFV 395
           + G G F+V
Sbjct: 293 IPGAGVFYV 301


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/277 (28%), Positives = 125/277 (45%), Gaps = 28/277 (10%)

Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQC 179
           +DTGSD+ W+ CD C  C    +S         ++ P  S+T   +PCNST+C +LQ   
Sbjct: 5   IDTGSDITWIQCDPCPQCYKQQDS---------LFQPAGSATYKPLPCNSTMCQQLQSFS 55

Query: 180 PSA-GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
            S   S+C Y V Y  D + + G    + L L +D+    SV    +FGCG    G F +
Sbjct: 56  HSCLNSSCNYMVSY-GDKSTTRGDFALETLTLRSDDTILVSV-PNFAFGCGHANKGLF-N 112

Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE- 293
           GAA  GL GLG      P+           FS C  S      +G + FG+         
Sbjct: 113 GAA--GLMGLGKSSIGFPA--QTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVR 168

Query: 294 -TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSL 351
            TP     + P+ Y +++T ++VG   +    + + DSGT  +     AY ++ + F  +
Sbjct: 169 FTPLVDSSSGPSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQI 228

Query: 352 AKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
                +T+ S  PF+ C+ +S    +   P++ L  +
Sbjct: 229 LP-GLQTAVSVAPFDTCFRVS-TVDDINIPLITLHFR 263


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/339 (27%), Positives = 136/339 (40%), Gaps = 43/339 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T V VG PA  F + LDTGSD+ WL C  C  C    +          I+ P  SST 
Sbjct: 20  YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPTASSTY 70

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + V C S  C   +        C YQV Y  DG+ + G    + +        S SV + 
Sbjct: 71  APVTCQSQQCSSLEMSSCRSGQCLYQVNY-GDGSYTFGDFATESVSFG----NSGSVKN- 124

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F+  A         +     P  L NQ L   SFS C  +  +   S 
Sbjct: 125 VALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTNQ-LKATSFSYCLVNRDSAGSST 176

Query: 284 GDKGSPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
            D  S   G    +      R+    Y + ++ +SVGG  V+   S            I 
Sbjct: 177 LDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 236

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D GT+ T L   AY  + + F  + +  + TS   L F+ CY LS  Q +   P V+   
Sbjct: 237 DCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL-FDTCYDLS-GQASVRVPTVSFHF 294

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             G  + +     ++  +  G Y +      S +++IIG
Sbjct: 295 ADGKSWNLPAANYLIPVDSAGTYCFAFAPTTS-SLSIIG 332


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/306 (29%), Positives = 130/306 (42%), Gaps = 38/306 (12%)

Query: 105 HY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           HY   +S+G P        DTGSDL W  C  C +C    N          ++ P  S+T
Sbjct: 71  HYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNP---------MFDPQKSTT 121

Query: 163 SSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
              + C+S LC +L     S    C Y   Y S   ++ G L ++ + L++ + +S  + 
Sbjct: 122 YRNISCDSKLCHKLDTGVCSPQKRCNYTYAYAS-AAITRGVLAQETITLSSTKGKSVPLK 180

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD-- 276
             I FGCG   TG F D     G+ GLG    S+ S + +       FS C   F +D  
Sbjct: 181 G-IVFGCGHNNTGGFNDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPFHTDVS 236

Query: 277 GTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSV-------GGNAVNFEFSA 325
            + ++SFG KGS   G+    TP   +Q    Y +T+  +SV        G++ N E   
Sbjct: 237 VSSKMSFG-KGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGN 295

Query: 326 IF-DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
           +F DSGT  T L    Y Q+     S    K  T   DL  + CY     + N   PV+ 
Sbjct: 296 MFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYR---TKNNLRGPVLT 352

Query: 385 LTMKGG 390
              +G 
Sbjct: 353 AHFEGA 358


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 135/326 (41%), Gaps = 54/326 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ +G P  + ++  DTGSDL W+ C  C +C H    S+        +    S+T 
Sbjct: 86  YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA--------FFARHSTTY 137

Query: 164 SKVPCNSTLCELQKQCPSAGSN-------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           S + C S  C+L         N       C YQ  Y +D + +TGF  ++ L L T   +
Sbjct: 138 SAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTY-ADSSTTTGFFSKEALTLNTSTGK 196

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
            K ++  +SFGCG   +G  L GA+     G+ GLG    S  S L  +    + FS C 
Sbjct: 197 VKKLNG-LSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--FGSKFSYCL 253

Query: 274 GS-------------DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAV 319
                           G   ++   KG      TP  +    PT Y I I  V V G  +
Sbjct: 254 MDYTLSPPPTSFLTIGGAQNVAVSKKGI--MSFTPLLINPLSPTFYYIAIKGVYVNGVKL 311

Query: 320 NFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEY 367
               S            I DSGT+ T++ +PAYT+I + F    + K  +     P F+ 
Sbjct: 312 PINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKK--RVKLPSPAEPTPGFDL 369

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPF 393
           C  +S   T    P ++  + GG  F
Sbjct: 370 CMNVS-GVTRPALPRMSFNLAGGSVF 394


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 101/364 (27%), Positives = 159/364 (43%), Gaps = 51/364 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  + +G PA  F + +DTGS L WL C  CV   H        V    I++P+TS T 
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCH--------VQVDPIFTPSTSKTY 164

Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             +PC+S+ C   K        C +A   C Y+  Y  D + S G+L +DVL L   E  
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLTPSEAP 223

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           S    S   +GCG+   G F      +G+ GL  DK S+   L+ +    N+FS C  S 
Sbjct: 224 S----SGFVYGCGQDNQGLF---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCLPSS 274

Query: 277 G--------TGRISFGDKG--SPGQGETPFSLRQTHPT-YNITITQVSVGG-----NAVN 320
                    +G +S G     S     TP    Q  P+ Y + +T ++V G     +A +
Sbjct: 275 FSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASS 334

Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
           +    I DSGT  T L    Y  + ++F  +  +K   +      + C+  S  + +   
Sbjct: 335 YNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMS-TV 393

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG----REYPIANNI 435
           P + +  +GG    +     +V  E KG    CL +  S N ++IIG    + + +A ++
Sbjct: 394 PEIQIIFRGGAGLELKAHNSLVEIE-KG--TTCLAIAASSNPISIIGNYQQQTFKVAYDV 450

Query: 436 SLFH 439
           + F 
Sbjct: 451 ANFK 454


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 146/354 (41%), Gaps = 62/354 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  + +G P   + + +DTGS   WL C  C    H        + +  +++P+ S T 
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCH--------IQEDPVFNPSASKTY 154

Query: 164 SKVPCN---------STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
             VPC+         +TL E    C    + C Y+  Y  D + S G+L +DVL L   +
Sbjct: 155 KTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLTPSQ 211

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
             S  V     +GCG+   G F      +G+ GL  ++ S+ S L+  G   N+FS C  
Sbjct: 212 TLSSFV-----YGCGQDNQGLF---GRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261

Query: 275 SDGTGRISFGDKGSPGQGE----------------TPFSLRQTHPT-YNITITQVSVGGN 317
           +      SF    SP +G                 TP      +P+ Y I +  ++V G 
Sbjct: 262 T------SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGR 315

Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
                A +++   I DSGT  T L  P YT +   + ++  +K + +      + C+  S
Sbjct: 316 PLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGS 375

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               +   P + +  KGG    +     +V  E     + CL +  S ++ IIG
Sbjct: 376 LAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG---ITCLAMAGSSSIAIIG 426


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/339 (27%), Positives = 136/339 (40%), Gaps = 43/339 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T V VG PA  F + LDTGSD+ WL C  C  C    +          I+ P  SST 
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPTASSTY 211

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + V C S  C   +        C YQV Y  DG+ + G    + +        S SV + 
Sbjct: 212 APVTCQSQQCSSLEMSSCRSGQCLYQVNY-GDGSYTFGDFATESVSFG----NSGSVKN- 265

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F+  A         +     P  L NQ L   SFS C  +  +   S 
Sbjct: 266 VALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTNQ-LKATSFSYCLVNRDSAGSST 317

Query: 284 GDKGSPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
            D  S   G    +      R+    Y + ++ +SVGG  V+   S            I 
Sbjct: 318 LDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 377

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D GT+ T L   AY  + + F  + +  + TS   L F+ CY LS  Q +   P V+   
Sbjct: 378 DCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL-FDTCYDLS-GQASVRVPTVSFHF 435

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             G  + +     ++  +  G Y +      S +++IIG
Sbjct: 436 ADGKSWNLPAANYLIPVDSAGTYCFAFAPTTS-SLSIIG 473


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 126/313 (40%), Gaps = 52/313 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           +  NVS+G P    +   DTGSDL W  C    DC + V  L            + P TS
Sbjct: 90  YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------------FDPKTS 137

Query: 161 STSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           ST   V C+S+ C   E Q  C +  + C Y + Y  D + + G +  D L L + + + 
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRP 196

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-- 273
             +   I  GCG    G+F      N      +     P  L  Q    I   FS C   
Sbjct: 197 MQL-KNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP 249

Query: 274 ---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA- 325
                D T +I+FG        G   TP   + +  T Y +T+  +SVG   + +  S  
Sbjct: 250 LTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS 309

Query: 326 -------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                  I DSGT+ T L    Y+++ +   +S+  EK++   S L    CY  +    +
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL--CYSAT---GD 364

Query: 378 FEYPVVNLTMKGG 390
            + PV+ +   G 
Sbjct: 365 LKVPVITMHFDGA 377


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 146/354 (41%), Gaps = 62/354 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  + +G P   + + +DTGS   WL C  C    H        + +  +++P+ S T 
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCH--------IQEDPVFNPSASKTY 154

Query: 164 SKVPCN---------STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
             VPC+         +TL E    C    + C Y+  Y  D + S G+L +DVL L   +
Sbjct: 155 KTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLTPSQ 211

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
             S  V     +GCG+   G F      +G+ GL  ++ S+ S L+  G   N+FS C  
Sbjct: 212 TLSSFV-----YGCGQDNQGLF---GRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261

Query: 275 SDGTGRISFGDKGSPGQGE----------------TPFSLRQTHPT-YNITITQVSVGGN 317
           +      SF    SP +G                 TP      +P+ Y I +  ++V G 
Sbjct: 262 T------SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGR 315

Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
                A +++   I DSGT  T L  P YT +   + ++  +K + +      + C+  S
Sbjct: 316 PLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGS 375

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               +   P + +  KGG    +     +V  E     + CL +  S ++ IIG
Sbjct: 376 LAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG---ITCLAMAGSSSIAIIG 426


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 126/313 (40%), Gaps = 52/313 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           +  NVS+G P    +   DTGSDL W  C    DC + V  L            + P TS
Sbjct: 90  YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------------FDPKTS 137

Query: 161 STSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           ST   V C+S+ C   E Q  C +  + C Y + Y  D + + G +  D L L + + + 
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRP 196

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-- 273
             +   I  GCG    G+F      N      +     P  L  Q    I   FS C   
Sbjct: 197 MQL-KNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP 249

Query: 274 ---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA- 325
                D T +I+FG        G   TP   + +  T Y +T+  +SVG   + +  S  
Sbjct: 250 LTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS 309

Query: 326 -------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                  I DSGT+ T L    Y+++ +   +S+  EK++   S L    CY  +    +
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL--CYSAT---GD 364

Query: 378 FEYPVVNLTMKGG 390
            + PV+ +   G 
Sbjct: 365 LKVPVITMHFDGA 377


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/354 (23%), Positives = 139/354 (39%), Gaps = 62/354 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  + VG P   F +  DTGSDL W+ C            +G      ++ P TS + +
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWVKC------------AGASPPGRVFRPKTSRSWA 163

Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            +PC+S  C+L        C S  S C Y  RY      + G +  +   +A    +   
Sbjct: 164 PIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQ 223

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----G 274
           +   +  GC     G     A  +G+  LG  K S  +  A +     SFS C       
Sbjct: 224 LKD-VVLGCSSSHDGQSFRSA--DGVLSLGNAKISFATQAAAR--FGGSFSYCLVDHLAP 278

Query: 275 SDGTGRISFGDKGSPGQ------GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
            + TG ++FG    PGQ       +T   L    P Y + +  + V G A++        
Sbjct: 279 RNATGYLAFG----PGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDA 334

Query: 325 ----AIFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
                I DSG + T L  PAY  +    S+  + + K       S  PFE+CY  +  + 
Sbjct: 335 KSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPK------VSFPPFEHCYNWTARRP 388

Query: 377 NFEYPVVNLTMKGGGPFFVNDPI--VIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
                +  L ++  G   +  P    ++  +P    + C+GV + +   +++IG
Sbjct: 389 GAPEIIPKLAVQFAGSARLEPPAKSYVIDVKPG---VKCIGVQEGEWPGLSVIG 439


>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
 gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
          Length = 376

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 88/281 (31%), Positives = 128/281 (45%), Gaps = 40/281 (14%)

Query: 106 YTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           Y NV+  +GQP+  + + +DTGSDL WL CD  CV C    +             P    
Sbjct: 19  YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH-------------PYYRP 65

Query: 162 TSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEK 215
            ++ VPC   +C+        +C + G  C Y+V Y +DG  S G LV D  +L  T EK
Sbjct: 66  RNNLVPCMDPICQSLHSNGDHRCENPG-QCDYEVEY-ADGGSSFGVLVRDTFNLNFTSEK 123

Query: 216 QSKSVDSRISFG-CGRVQTGSFLDGAAP--NGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
           +   +   ++ G CG  Q   F  G+    +G+ GLG  K+S+ S L++ GL+ N    C
Sbjct: 124 RHSPL---LALGLCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHC 177

Query: 273 FGSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDS 329
               G G + FGD    S     TP S    H  Y+  + +++  G    F+     FDS
Sbjct: 178 LSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKH--YSPGLAELTFDGKTTGFKNLLTTFDS 235

Query: 330 GTSFTYLNDPAYT-QISETFNSLAKEKRETSTSDLPFEYCY 369
           G S+TYLN  AY   IS     L+ +    +  D     C+
Sbjct: 236 GASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCW 276


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 127/307 (41%), Gaps = 44/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ WL C  C  C    +          I++P  S + 
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDP---------IFNPYKSKSF 160

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + +PC+S LC       C +    C YQV Y  DG+ +TG    + L    ++       
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTRRHTCLYQVSY-GDGSFTTGDFATETLTFRGNKI------ 213

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
           ++++ GCG    G F+  A   GL    +   S   I  N     + FS C      S  
Sbjct: 214 AKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFN-----HKFSYCLVDRSASSK 268

Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
              + FGD         TP        T Y + +  +SVGG  V       F+  +    
Sbjct: 269 PSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNG 328

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGTS T L  PAYT + + F   A+  +      L F+ CY LS  Q++ + P V
Sbjct: 329 GVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSL-FDTCYDLS-GQSSVKVPTV 386

Query: 384 NLTMKGG 390
            L  +G 
Sbjct: 387 VLHFRGA 393


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/266 (29%), Positives = 115/266 (43%), Gaps = 32/266 (12%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P       +DTGSD+ WL C  C  C +             I+ P+ S+T   +P 
Sbjct: 91  SVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTT---------RIFDPSKSNTYKILPF 141

Query: 169 NSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           +ST C+  +    +  N   C Y + Y  DG+ S G L  + L L +    S     R  
Sbjct: 142 SSTTCQSVEDTSCSSDNRKMCEYTI-YYGDGSYSQGDLSVETLTLGSTNGSSVKF-RRTV 199

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNSFSMCFG--SDGTGRIS 282
            GCGR  T SF +G + +G+ GLG    S+ + L  +   I   FS C    S+ + +++
Sbjct: 200 IGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLN 257

Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSG 330
           FGD       G   TP         Y +T+   SVG N + F  S+         I DSG
Sbjct: 258 FGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSG 317

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR 356
           T+ T L +  Y+++      L +  R
Sbjct: 318 TTLTLLPNDIYSKLESAVADLVELDR 343


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 129/307 (42%), Gaps = 39/307 (12%)

Query: 105 HY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           HY   VS+G P        DTGSDL W  C  C  C    N          I+ P  S++
Sbjct: 24  HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNP---------IFDPQKSTS 74

Query: 163 SSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
              + C+S LC +L     S   +C Y   Y S   ++ G L ++ + L++ + +S  + 
Sbjct: 75  YRNISCDSKLCHKLDTGVCSPQKHCNYTYAYAS-AAITQGVLAQETITLSSTKGESVPLK 133

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD-- 276
             I FGCG   TG F D     G+ GLG    S  S + +       FS C   F +D  
Sbjct: 134 G-IVFGCGHNNTGGFNDREM--GIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPFHTDVS 189

Query: 277 GTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
            + ++S G KGS   G+    TP   +Q    Y +T+  +SVG   ++F  S+       
Sbjct: 190 VSSKMSLG-KGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKG 248

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
               DSGT  T L    Y ++     S    K  T+  DL  + CY     + N   PV+
Sbjct: 249 NVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYR---TKNNLRGPVL 305

Query: 384 NLTMKGG 390
               +GG
Sbjct: 306 TAHFEGG 312


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 146/350 (41%), Gaps = 51/350 (14%)

Query: 65  HRDRYF--RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVA 121
            R  Y   R+ GRG     + K     +     +  N +G L+Y   VS+G P ++  + 
Sbjct: 98  RRAEYILRRVSGRGTPQLWDSKAEAATATVPANWGFN-IGTLNYVVTVSLGTPGVAQTLE 156

Query: 122 LDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---- 174
           +DTGSDL W+   PC   +C    +          ++ P  SS+ + VPC   +C     
Sbjct: 157 VDTGSDLSWVQCTPCAAPACYSQKDP---------LFDPAQSSSYAAVPCGGPVCGGLGI 207

Query: 175 LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
               C +A   C Y V Y  DG+ +TG    D L L+ ++           FGCG  Q+G
Sbjct: 208 YASSCSAA--QCGYVVSY-GDGSKTTGVYSSDTLTLSPNDAVRG-----FFFGCGHAQSG 259

Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQG 292
              +    +GL GLG ++ S+  +    G     FS C  +    TG ++ G  G  G  
Sbjct: 260 FTGN----DGLLGLGREEASL--VEQTAGTYGGVFSYCLPTRPSTTGYLTLG--GPSGAA 311

Query: 293 ETPFSLRQ--THPT----YNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLNDPAY 341
              FS  Q  + P     Y + +T +SVGG  ++     F    + D+GT  T L   AY
Sbjct: 312 PPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPPTAY 371

Query: 342 TQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
             +   F S +A     ++ +    + CY  S   T    P V LT  GG
Sbjct: 372 AALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGT-VTLPNVALTFSGG 420


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 151/379 (39%), Gaps = 54/379 (14%)

Query: 34  FHHRYSDPVKGI-LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS-- 90
            +HR+   V G  + ++ +    +   +  L         R + L A  N  + +  S  
Sbjct: 30  LNHRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVY 89

Query: 91  AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
           AG+  Y +N         +S+G PA  F   +DTGSDL W    C  C    N S+    
Sbjct: 90  AGDGEYLMN---------LSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST---- 134

Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
              I++P  SS+ S +PC+S LC+       + + C Y   Y  DG+ + G +  + L  
Sbjct: 135 --PIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGY-GDGSETQGSMGTETLTF 191

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
                 S S+   I+FGCG    G F  G    GL G+G    S+PS L         FS
Sbjct: 192 G-----SVSIP-NITFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFS 238

Query: 271 MCFGSDGTGRI------SFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
            C    G+         S  +  + G   T        PT Y IT+  +SVG   +  + 
Sbjct: 239 YCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDP 298

Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
           SA            I DSGT+ TY  + AY  + + F S         +S   F+ C+  
Sbjct: 299 SAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSS-GFDLCFQT 357

Query: 372 SPNQTNFEYPVVNLTMKGG 390
             + +N + P   +   GG
Sbjct: 358 PSDPSNLQIPTFVMHFDGG 376


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 152/359 (42%), Gaps = 65/359 (18%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           N+S+G P L F V +DTGS+L W  C  C  C         +     +  P  SST S++
Sbjct: 94  NISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFP-------RPTPAPVLQPARSSTFSRL 146

Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           PCN + C+      + +  +A + C Y   Y S  T   G+L  + L +           
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTVG------DGTF 198

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----G 277
            +++FGC    T + +D ++  G+ GLG    S+ S LA        FS C  SD    G
Sbjct: 199 PKVAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAV-----GRFSYCLRSDMADGG 248

Query: 278 TGRISFGDKGSPGQG---------ETPFSLRQTHPTYNIT-----ITQVSVGGNAVNFEF 323
              I FG      +G         + P+  R TH   N+T      T++ V G+   F  
Sbjct: 249 ASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQ 308

Query: 324 SA-----IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF--EYCYVLSPNQ 375
           +      I DSGT+ TYL    Y  + + F S +A   + T  S  P+  + CY  S   
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGG 368

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPI----VIVSSEPKG-LYLYCLGVVKSDN---VNIIG 426
                 V  L ++  G    N P+      V ++ +G + + CL V+ + +   ++IIG
Sbjct: 369 GGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIG 427


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/344 (25%), Positives = 142/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           + T+V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGRHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 67/252 (26%), Positives = 109/252 (43%), Gaps = 29/252 (11%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +DTGS + ++PC  C  C    +           + P+ SST   
Sbjct: 15  TRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPK---------FQPDLSSTYQS 65

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C      C Y+ +Y ++ + S+G L ED++        S     R  
Sbjct: 66  VKCN-----IDCNCDDEKQQCVYERQY-AEMSTSSGVLGEDIISFGN---LSALAPQRAV 116

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
           FGC  ++TG      A +G+ G+G    S+   L ++G+I +SFS+C+G  G G  +   
Sbjct: 117 FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVL 175

Query: 286 KGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
            G        FS       P YNI + ++ V G  +         +   I DSGT++ YL
Sbjct: 176 GGISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYL 235

Query: 337 NDPAYTQISETF 348
            + A+    +  
Sbjct: 236 PEAAFVSFKDAI 247


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/345 (25%), Positives = 145/345 (42%), Gaps = 45/345 (13%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           L+  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T 
Sbjct: 81  LYVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTC 130

Query: 164 SKVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           +KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K   
Sbjct: 131 AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG 189

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------ 272
                 SFGC     G+   G   +GL G+G    SV   L       + FS C      
Sbjct: 190 -----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKS 240

Query: 273 ---FGSDGTGRISFGDKGSPGQGE-TPFSLRQTH-PTYNITITQVSVGGNAVNFEFS--- 324
              F S  TG  S G   +      T    R+ +   + + +T +SV G  +    S   
Sbjct: 241 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS 300

Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               +FDSG+  +Y+ D A + +S+    L  ++   +  +     CY +       + P
Sbjct: 301 RKGVVFDSGSELSYIPDRALSVLSQRIRELLLKR--GAAEEESERNCYDMRSVDEG-DMP 357

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 358 AISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 402


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 85/324 (26%), Positives = 136/324 (41%), Gaps = 32/324 (9%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           +G P   F + +DTGS + ++PC   +C H     S Q   F    P  S T   V C  
Sbjct: 99  IGTPPQRFALIVDTGSTVTYVPCS--TCRH---CGSHQDPKFR---PEDSETYQPVKCT- 149

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
                Q  C +    C Y+ RY ++ + S+G L EDV+       Q++    R  FGC  
Sbjct: 150 ----WQCNCDNDRKQCTYERRY-AEMSTSSGALGEDVVSFGN---QTELSPQRAIFGCEN 201

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
            +TG   +  A +G+ GLG    S+   L  + +I +SFS+C+G  G G  +    G   
Sbjct: 202 DETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISP 260

Query: 291 QGETPFSLRQ--THPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
             +  F+       P YNI + ++ V G  ++        +   + DSGT++ YL + A+
Sbjct: 261 PADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPESAF 320

Query: 342 TQISETFNSLAKEKRETSTSDLPF-EYCY---VLSPNQTNFEYPVVNLTMKGGGPFFVND 397
                         +  S  D  + + C+    +  +Q +  +PVV +   G G      
Sbjct: 321 LAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVF-GNGHKLSLS 379

Query: 398 PIVIVSSEPKGLYLYCLGVVKSDN 421
           P   +    K    YCLGV  + N
Sbjct: 380 PENYLFRHSKVRGAYCLGVFSNGN 403


>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
 gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
          Length = 478

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/345 (25%), Positives = 153/345 (44%), Gaps = 42/345 (12%)

Query: 115 ALSFIVALDTGSDLFWLPCD-CVSC-VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
           A +F + +DTGS   +LPC  C SC  H     +G+  D++      S+  S+V C S  
Sbjct: 44  AQTFELIVDTGSSRTYLPCKGCASCGAH----EAGRYYDYD-----ASADFSRVEC-SAC 93

Query: 173 CELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ 232
             +  +C ++G  C Y V YL +G+ S G+LV DV+ L          ++ + FGC   +
Sbjct: 94  AGIGGKCGTSGV-CRYDVHYL-EGSGSEGYLVRDVVSLG-----GSVGNATVVFGCEERE 146

Query: 233 TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------GSDGTGRISFGD 285
            GS    +A +GLFG G    ++ + LA+  +I + FSMC        G    G ++ G+
Sbjct: 147 LGSIKQQSA-DGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGN 205

Query: 286 ----KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--IFDSGTSFTYLNDP 339
                 +P    TP  +  +   Y +T T  ++G + V        I DSGTS+TY+   
Sbjct: 206 FDFGADAPALVYTP--MVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVPGN 263

Query: 340 AYTQISETFNSLAKEKRETSTS------DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
            + +  +     A+E      +      DL F     L  +  +  +P + +   G    
Sbjct: 264 MHARFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYFPALKIEYHGSARL 323

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLF 438
            ++ P   +    K    +C+G+++ D+  I+  +  + N  + F
Sbjct: 324 TLS-PETYLYWHQKNASAFCVGILEHDDNRILLGQITMRNTFTEF 367


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/272 (27%), Positives = 110/272 (40%), Gaps = 41/272 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +VSVG P     + LDTGSDL W    C  C+      +  V+D     P  SST +
Sbjct: 90  YLMHVSVGTPPRPVALTLDTGSDLVW--TQCAPCLDCFEQGAAPVLD-----PAASSTHA 142

Query: 165 KVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            +PC++ LC         G      +C Y   Y  D +++ G L  D      D+     
Sbjct: 143 ALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHY-GDRSLTVGQLATDSFTFGGDDNAGGL 201

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---- 275
              R++FGCG +  G F   A   G+ G G  + S+PS L        SFS CF S    
Sbjct: 202 AARRVTFGCGHINKGIF--QANETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFDT 254

Query: 276 DGTGRISFGDKGSP----------GQGETPFSLRQ-THPT-YNITITQVSVGGNAV---- 319
             +  ++ G   +           G   T   ++  + P+ Y + +  +SVGG  V    
Sbjct: 255 KSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314

Query: 320 -NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
                S I DSG S T L +  Y  +   F S
Sbjct: 315 SRLRSSTIIDSGASITTLPEDVYEAVKAEFVS 346


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/303 (28%), Positives = 130/303 (42%), Gaps = 43/303 (14%)

Query: 112 GQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNST 171
           G PA+  ++ +DTGSDL W+ C         NSS+       ++ P+ SST + VPC S 
Sbjct: 129 GTPAVPQVLLIDTGSDLSWVQC------QPCNSSTCYPQKDPVFDPSASSTYAPVPCGSE 182

Query: 172 LCE------LQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            C           C    S  S C Y ++Y  +G  + G    + L L+    ++ +V +
Sbjct: 183 ACRDLDPDSYANGCTNSSSGASLCQYGIQY-GNGDTTVGVYSTETLTLS---PEAATVVN 238

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF--GSDGT 278
             SFGCG VQ G F             +     P  L +Q  G    +FS C   G+   
Sbjct: 239 NFSFGCGLVQKGVFDLFDG-------LLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTA 291

Query: 279 GRISFGDKGSPGQGE-----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFD 328
           G ++ G   + G        TP  + +T   Y + +T +SVGG  ++ E +      I D
Sbjct: 292 GFLALGAPATGGNNTAGFQFTPLQVVETT-FYLVKLTGISVGGKQLDIEPTVFAGGMIID 350

Query: 329 SGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SGT  T L + AY+ +   F S ++         D   + CY  + N TN   P V LT 
Sbjct: 351 SGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGN-TNVTVPTVALTF 409

Query: 388 KGG 390
           +GG
Sbjct: 410 EGG 412


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 93/376 (24%), Positives = 154/376 (40%), Gaps = 56/376 (14%)

Query: 84  KTPLTFSAGNDTYRLNS----LGF-------LHYTNVSVGQPALSFIVALDTGSDLFWLP 132
           + PL     N T RL++    +G+       L+  +V +G PA + IV +DTGS   W+ 
Sbjct: 50  RIPLFRYISNKTSRLSTQAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVF 109

Query: 133 CDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS-----NCP 187
           C+C  C H          +   +  + S+T +KV C +++C L    P         +CP
Sbjct: 110 CECDGC-H---------TNPRTFLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCP 159

Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
           ++V Y  DG+ S G L +D L  +  +K         +FGC     G+   G   +GL G
Sbjct: 160 FRVSY-QDGSASYGILYQDTLTFSDVQKIPS-----FTFGCNLDSFGANEFGNV-DGLLG 212

Query: 248 LGMDKTSVPSILANQGLIPNSFSMC---------FGSDGTGRISFGDKGSPGQGE--TPF 296
           +G    SV   L       + FS C         F S  TG  S G   +          
Sbjct: 213 MGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMV 269

Query: 297 SLRQTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNS 350
           + R+    + + +  +SV G  +    S       +FDSG+  +Y+ D A + +S+    
Sbjct: 270 ARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELSYIPDRALSVLSQRIRE 329

Query: 351 LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY 410
           L    R  +  +     CY +       + P ++L    G  F +    V V    +   
Sbjct: 330 LL--LRRGAAEEESERNCYDMRSVDEG-DMPAISLHFDDGARFDLGSHGVFVERSVQEQD 386

Query: 411 LYCLGVVKSDNVNIIG 426
           ++CL    +++V+IIG
Sbjct: 387 VWCLAFAPTESVSIIG 402


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 144/360 (40%), Gaps = 54/360 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ +G P  S ++  DTGSDL W+ C  C +C H   SS+        + P  SS+ 
Sbjct: 88  YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSA--------FLPRHSSSF 139

Query: 164 SKVPCNSTLCELQKQCPSAGSN-------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           S   C    C L    P    N       C +   Y +DG++S+GF  ++   L +    
Sbjct: 140 SPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSY-ADGSLSSGFFSKETTTLKSLSGS 198

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSMC- 272
              +   +SFGCG   +G  + GA  N   G+ GLG    S  S L  +    N FS C 
Sbjct: 199 EIHLKG-LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKFSYCL 255

Query: 273 -----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGG---- 316
                      F   G G  S     +     TP  +    PT Y ITI  +++ G    
Sbjct: 256 MDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLP 315

Query: 317 -NAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEY 367
            N   +E         + DSGT+ TYL   AY    E   S+ +  +  + ++L   F+ 
Sbjct: 316 INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAY---EEVLKSVRRRVKLPNAAELTPGFDL 372

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
           C   S        P +   + GGG  F   P        +G+    +  V+S N  ++IG
Sbjct: 373 CVNASGESRRPSLPRLRFRL-GGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIG 431


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 162/373 (43%), Gaps = 57/373 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  + VG PA  F + +DTGSDL W+ C+  +     NSSS        Y  ++SS+  
Sbjct: 27  YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTT--ANSSSPPA---PWYDKSSSSSYR 81

Query: 165 KVPCNSTLC-----ELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           ++PC    C      +   C   + S C Y   Y SD + +TG L  + + + + ++  K
Sbjct: 82  EIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGY-SDQSRTTGILAYETISMKSRKRSGK 140

Query: 219 SVDSR---------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
              +          ++ GC R   G+   GA+  G+ GLG    S+ +   +  L    F
Sbjct: 141 RAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIF 197

Query: 270 SMCF-----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
           S C      GS+ +  +  G         TP        + Y + +T V+V G  V+   
Sbjct: 198 SYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257

Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCY 369
           S+            IFDSGT+ +YL +PAY+++    N+     R     ++P  FE CY
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR---AQEIPEGFELCY 314

Query: 370 VLSPNQTNFE--YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVVKSDNV--N 423
               N T  E   P + +  +GG    +  N+ +V+V+   + + L  +      N+  N
Sbjct: 315 ----NVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGN 370

Query: 424 IIGREYPIANNIS 436
           ++ +++ I  +++
Sbjct: 371 LLQQDHHIEYDLA 383


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 92/329 (27%), Positives = 140/329 (42%), Gaps = 58/329 (17%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS-- 164
           N+S+GQP +  +V +DTGSD+ W+ C  C +C + L           ++ P+ SST S  
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGL---------LFDPSMSSTFSPL 154

Query: 165 -KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
            K PC+   C       S     P+ V Y  + T S  F  + V+   TDE  S+  D  
Sbjct: 155 CKTPCDFKGC-------SRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPD-- 205

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
           + FGCG    G   D    NG+ GL     + P  LA +  I   FS C G         
Sbjct: 206 VLFGCGH-NIGQDTD-PGHNGILGL----NNGPDSLATK--IGQKFSYCIGDLADPYYNY 257

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AIF 327
            ++  G+        TPF +      Y +T+  +SVG   ++     FE         I 
Sbjct: 258 HQLILGEGADLEGYSTPFEVHNGF--YYVTMEGISVGEKRLDIAPETFEMKKNRTGGVII 315

Query: 328 DSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           D+G++ T+L D  +  +S E  N L    R+T+    P+  C+  S ++    +PVV   
Sbjct: 316 DTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFH 375

Query: 387 MKGG-------GPFF--VNDPIVIVSSEP 406
              G       G FF  +ND +  ++  P
Sbjct: 376 FADGADLALDSGSFFNQLNDNVFCMTVGP 404


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/303 (25%), Positives = 124/303 (40%), Gaps = 43/303 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V +G P     + +D+GSD+ W+ C  C+ C    +          ++ P TS+T 
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADP---------LFDPATSATF 177

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           S VPC S +C   +   C  +G  C Y+V Y  DG+ + G L  + L L     +     
Sbjct: 178 SAVPCGSAVCRTLRTSGCGDSG-GCDYEVSY-GDGSYTKGALALETLTLGGTAVEG---- 231

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRI 281
             ++ GCG    G F+  A   GL GLG    S+   L        +FS C  S G G +
Sbjct: 232 --VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAGSL 284

Query: 282 SFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEF-----------SAIF 327
             G   +  +G    P       P+ Y + ++ + VG   +  +              + 
Sbjct: 285 VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVM 344

Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           D+GT+ T L   AY  + + F  ++    R    S L  + CY LS   T+   P V+  
Sbjct: 345 DTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLL--DTCYDLS-GYTSVRVPTVSFY 401

Query: 387 MKG 389
             G
Sbjct: 402 FDG 404


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 137/335 (40%), Gaps = 59/335 (17%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G P +  +   DT SDL W+ C  C +C            D  ++ P+ SST + + C+
Sbjct: 96  IGTPPVERLAIADTASDLIWVQCSPCETCFPQ---------DTPLFEPHKSSTFANLSCD 146

Query: 170 STLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           S  C       CP  G+ C Y   Y  DG+ + G L  + +H  +   Q+ +    I FG
Sbjct: 147 SQPCTSSNIYYCPLVGNLCLYTNTY-GDGSSTKGVLCTESIHFGS---QTVTFPKTI-FG 201

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISFG 284
           CG              G+ GLG    S+ S L +Q  I + FS C   F S  T ++ FG
Sbjct: 202 CGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLKFG 259

Query: 285 -DKGSPGQG--ETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFS------AIFDSGTSFT 334
            D    G G   TP  +   +P+Y  + +  +++G   +    +       I D GT  T
Sbjct: 260 NDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLT 319

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           YL    Y      F +L +E    S +      PF++C+   PNQ N  +P +     G 
Sbjct: 320 YLEVNFY----HNFVTLLREALGISETKDDIPYPFDFCF---PNQANITFPKIVFQFTGA 372

Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
             F            PK L+       + D++N+I
Sbjct: 373 KVFL----------SPKNLFF------RFDDLNMI 391


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/302 (29%), Positives = 132/302 (43%), Gaps = 38/302 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
           +   +++G P LS  +ALDTGSD+ W  C+ CV SC     +          + P  SS+
Sbjct: 45  YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTK---------FDPRKSSS 95

Query: 163 SSKVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
              V C+S+ C +      A     S C Y+V+Y  DG+ S GF   + L ++  +    
Sbjct: 96  YKNVSCSSSSCRIITDSGGARGCVSSTCIYKVQY-GDGSYSVGFFATEKLTISPSD---- 150

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGS 275
            V S   FGCG+   G F   A        G+ +  +   L       N F+ C   F S
Sbjct: 151 -VISNFLFGCGQQNAGRFGRIAGLL-----GLGRGKLSLALQTSEKYNNLFTYCLPSFSS 204

Query: 276 DGTGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------AIFD 328
             TG ++ G +       TP S   +  P Y I I  +SVGG+ +  + S      AI D
Sbjct: 205 SSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIID 264

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGT  T L    Y+ +S  F  L K+  +T    +  + CY  S N++    P ++   K
Sbjct: 265 SGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSI-LDTCYDFSGNES-ISVPRISFFFK 322

Query: 389 GG 390
           GG
Sbjct: 323 GG 324


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 142/341 (41%), Gaps = 43/341 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + +G P     + LDTGSD+ W+ C+ C  C    +          I++P++S + 
Sbjct: 8   YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP---------IFNPSSSVSF 58

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S V C+S +C         G  C Y+V Y  DG+ + G    + L   T   Q+      
Sbjct: 59  STVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLTFGTTSIQN------ 111

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A   GL    +   S P+ L  Q     +FS C     S+ +G 
Sbjct: 112 VAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSESSGT 166

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
           + FG +  P G   TP       PT Y +++  +SVGG  ++      F           
Sbjct: 167 LEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGI 226

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT+ T L   AY  + + F +  +         + F+ CY LS  Q+    P V  
Sbjct: 227 IIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI-FDTCYDLSALQS-VSIPAVGF 284

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               G  F +     ++  +  G + +      S N++I+G
Sbjct: 285 HFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADS-NLSIMG 324


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 155/346 (44%), Gaps = 45/346 (13%)

Query: 99  NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
            S+G  +Y T + +G P+ S+ + +DTGS L WL   C  CV   +   G + D     P
Sbjct: 127 TSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGPLFD-----P 179

Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLA 211
             SST + V C+++ C ELQ     PSA S    C YQ  Y  D + S G+L  D +   
Sbjct: 180 RASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASY-GDSSFSVGYLSTDTVSFG 238

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           +    S        +GCG+   G F   A   GL GL  +K S+   LA    +  SFS 
Sbjct: 239 STSYPS------FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287

Query: 272 CFGSDG-TGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFS 324
           C  +   TG +S G   + G     TP +      + Y IT++ +SVGG+ +     E+S
Sbjct: 288 CLPTAASTGYLSIGPYNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYS 346

Query: 325 A---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
           +   I DSGT  T L    +T +S+    ++A  +R  + S L  + C+    +Q     
Sbjct: 347 SLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSIL--DTCFEGQASQ--LRV 402

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P V +   GG    +    V++  +       CL    +D+  IIG
Sbjct: 403 PTVVMAFAGGASMKLTTRNVLIDVDDS---TTCLAFAPTDSTAIIG 445


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/348 (27%), Positives = 149/348 (42%), Gaps = 47/348 (13%)

Query: 99  NSLGFLHYTNVSVGQPALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIY 155
           N   FL   N+S+G P +  ++ +DTGSDL W   LPC C            Q I F  +
Sbjct: 84  NPAAFL--ANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYP----------QTIPF--F 129

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
            P+ SST     C S    + +      + NC Y +RY  D + + G L ++ L   T +
Sbjct: 130 HPSRSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRY-RDFSNTRGILAKEKLTFQTSD 188

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           +   S    I FGCG+  +G        +G+ GLG    S+  +  N G   + FS CFG
Sbjct: 189 EGLIS-KPNIVFGCGQDNSGF----TQYSGVLGLGPGTFSI--VTRNFG---SKFSYCFG 238

Query: 275 S--DGTGRISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
           S  D T   +F   G+  + E   TP  + Q    Y + +  +S+G   ++ E       
Sbjct: 239 SLIDPTYPHNFLILGNGARIEGDPTPLQIFQDR--YYLDLQAISLGEKLLDIEPGIFQRY 296

Query: 323 ---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
                 + D+G S T L   AY  +SE  + L  E  R     +    +CY  +     +
Sbjct: 297 RSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLY 356

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +PVV     GG    ++   + VSSE    +   + +   D++++IG
Sbjct: 357 GFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIG 404


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 86/344 (25%), Positives = 142/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           + T+V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGIHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 102/400 (25%), Positives = 154/400 (38%), Gaps = 92/400 (23%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSS------------- 146
           ++    VG PA  F++  DTGSDL W+ C     D  +  +G  + +             
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166

Query: 147 -GQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMST 200
                   ++ P+ S T + +PC+S  C          CP+ GS C Y  RY  DG+ + 
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRY-KDGSAAR 225

Query: 201 GFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTG-SFLDGAAPNGLFGLGMDKTS 254
           G +  D   +A       +KQ ++    +  GC    TG SFL   A +G+  LG    S
Sbjct: 226 GTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFL---ASDGVLSLGYSNIS 282

Query: 255 VPSILANQGLIPNSFSMCF-----GSDGTGRISFGDKGSPGQGETPFS------------ 297
             S  A +      FS C        + T  ++FG   +P    +P S            
Sbjct: 283 FASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGP--NPAVSSSPPSKTACAGGGSPAA 338

Query: 298 -------LRQT--------HPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSF 333
                   RQT         P Y +T+  +SV G  +              AI DSGTS 
Sbjct: 339 APPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKGGGAILDSGTSL 398

Query: 334 TYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV--NLTMKGG 390
           T L  PAY  +    N  LA   R T     PF+YCY  +   T  +  V    L +   
Sbjct: 399 TVLVSPAYRAVVAALNKKLAGLPRVTMD---PFDYCYNWTSPSTGEDLTVAMPELAVHFA 455

Query: 391 GPFFVNDPI--VIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
           G   +  P    ++ + P    + C+G+ + +   V++IG
Sbjct: 456 GSARLQPPAKSYVIDAAPG---VKCIGLQEGEWPGVSVIG 492


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 88/341 (25%), Positives = 142/341 (41%), Gaps = 43/341 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + +G P     + LDTGSD+ W+ C+ C  C    +          I++P++S + 
Sbjct: 154 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP---------IFNPSSSVSF 204

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S V C+S +C         G  C Y+V Y  DG+ + G    + L   T   Q+      
Sbjct: 205 STVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLTFGTTSIQN------ 257

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A   GL    +   S P+ L  Q     +FS C     S+ +G 
Sbjct: 258 VAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSESSGT 312

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
           + FG +  P G   TP       PT Y +++  +SVGG  ++      F           
Sbjct: 313 LEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGI 372

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT+ T L   AY  + + F +  +         + F+ CY LS  Q+    P V  
Sbjct: 373 IIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI-FDTCYDLSALQS-VSIPAVGF 430

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               G  F +     ++  +  G + +      S N++I+G
Sbjct: 431 HFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADS-NLSIMG 470


>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
 gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
          Length = 107

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 42/70 (60%), Positives = 47/70 (67%), Gaps = 3/70 (4%)

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDK 286
           CG   TGSFLDG A NGL GLG +K SV  +L   GL+  +SFSMCF  D  GRI+FGD 
Sbjct: 20  CG--PTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGRINFGDA 77

Query: 287 GSPGQGETPF 296
           G  GQGE PF
Sbjct: 78  GIRGQGEMPF 87


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 133/309 (43%), Gaps = 38/309 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNI 154
           +SL  L Y  +V +G PA++  V +DTGSD+ W+   PC    C    ++ +G + D   
Sbjct: 120 SSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPC----HAQTGALFD--- 172

Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
             P  SST   V C +  C +L++Q   C +    C Y V+Y  DG+ + G    D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL 229

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
           +      K       FGC  +++G F D    +GL GLG    S+ S  A      NSFS
Sbjct: 230 SGASDAVKG----FQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTA--AAYGNSFS 280

Query: 271 MCF----GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----F 321
            C     GS G   +  G   S          +Q    Y   +  ++VGG  +      F
Sbjct: 281 YCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVF 340

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
              ++ DSGT  T L   AY+ +S  F +  K+ R      +  + C+  +  QT    P
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI-LDTCFDFA-GQTQISIP 398

Query: 382 VVNLTMKGG 390
            V L   GG
Sbjct: 399 TVALVFSGG 407


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 91/312 (29%), Positives = 124/312 (39%), Gaps = 41/312 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   V +G PA+   + LDTGS L W+   C  C    NSS        ++ PNTSS+ S
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWV--QCKPC----NSSQCYPQRLPLFDPNTSSSYS 182

Query: 165 KVPCNSTLCELQKQ------CPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            VPC+S  C           C S G   C Y++ Y S G    G    D L L       
Sbjct: 183 PVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGS-GATPAGEYSTDALTLG-----P 236

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS---FSMCFG 274
            ++  R  FGCG  Q     D A  +G+ GLG     +P  LA Q         FS C  
Sbjct: 237 GAIVKRFHFGCGHHQQRGKFDMA--DGVLGLG----RLPQSLAWQASARRGGGVFSHCLP 290

Query: 275 SDGTGRISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS 324
             G     F   G+P        TP       P  Y +  T +SV G  ++     F   
Sbjct: 291 PTGVS-TGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREG 349

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT  + L + AYT +   F S A  +   +      + C+  +    N   P V+
Sbjct: 350 VITDSGTVLSALQETAYTALRTAFRS-AMAEYPLAPPVGHLDTCFNFT-GYDNVTVPTVS 407

Query: 385 LTMKGGGPFFVN 396
           LT +GG    ++
Sbjct: 408 LTFRGGATVHLD 419


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 136/329 (41%), Gaps = 45/329 (13%)

Query: 45  ILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND--TYRLNSLG 102
           ++ +  L    S  Y  AL H D    L    L  +   ++ L   +G D  + RL+S+ 
Sbjct: 15  LVLLTSLAVSASSGYRLALTHVDSKIGLTKTELMRRAAHRSRLRALSGYDANSPRLHSVQ 74

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
             +   +++G P + F+   DTGSDL W  C  C  C            D  +Y P+ SS
Sbjct: 75  VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASS 125

Query: 162 TSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           T S VPC+S  C      + C +  S C Y   Y SDG  S G L  + L L +      
Sbjct: 126 TFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSY-SDGAYSAGILGTETLTLGSSVPGQA 184

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FG 274
              S ++FGCG    G  L+     G  GLG       S+LA  G+    FS C    F 
Sbjct: 185 VSVSDVAFGCGTDNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFN 236

Query: 275 SDGTGRISFGDKG--SPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEF 323
           S        G     +PG G    TP      +P+ Y +++  +++G   +      F+ 
Sbjct: 237 STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDL 296

Query: 324 SA------IFDSGTSFTYLNDPAYTQISE 346
            A      + DSGT+F+ L +  +  + +
Sbjct: 297 HANSTGGMVVDSGTTFSILPESGFRVVVD 325


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 63/290 (21%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ +GQP  S ++  DTGSDL W+ C  C +C H   ++        ++ P  SST 
Sbjct: 84  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPAT--------VFFPRHSSTF 135

Query: 164 SKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           S   C   +C L  +   A         S C Y+  Y +DG++++G    +   L T   
Sbjct: 136 SPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGY-ADGSLTSGLFARETTSLKTSSG 194

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
           +   + S ++FGCG   +G  + G +    NG+ GLG    S  S L  +    N FS C
Sbjct: 195 KEARLKS-VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYC 251

Query: 273 F-----------------GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSV 314
                             G DG  ++ F          TP       PT Y + +  V V
Sbjct: 252 LMDYTLSPPPTSYLIIGNGGDGISKLFF----------TPLLTNPLSPTFYYVKLKSVFV 301

Query: 315 GGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
            G  +  + S            + DSGT+  +L +PAY  +        K
Sbjct: 302 NGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK 351


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 81/287 (28%), Positives = 127/287 (44%), Gaps = 41/287 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ W+ C+ C  C   ++          I++P+ S++ 
Sbjct: 197 YFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDP---------IFNPSLSASF 247

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S + CNS +C         G  C Y+V Y  DG+ + G    ++L   T   ++      
Sbjct: 248 STLGCNSAVCSYLDAYNCHGGGCLYKVSY-GDGSYTIGSFATEMLTFGTTSVRN------ 300

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGR 280
           ++ GCG    G F+  A    L GLG    S PS L  Q     +FS C     S+ +G 
Sbjct: 301 VAIGCGHDNAGLFVGAAG---LLGLGAGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGT 355

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
           + FG +  P G   TP     + PT Y + +  +SVGG  ++      F           
Sbjct: 356 LEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGF 415

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
           I DSGT+ T L  P Y  + + F +  ++  +     + F+ CY LS
Sbjct: 416 IVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSI-FDTCYDLS 461


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 93/346 (26%), Positives = 142/346 (41%), Gaps = 29/346 (8%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
           ++ S  F +   V++G P  S +   DTGSDL W     V C  G N +S        + 
Sbjct: 93  KVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVW-----VKCKKGNNDTSSAAAPTTQFD 147

Query: 157 PNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           P+ SST  +V C +  CE L +     GSNC Y   Y  DG+ +TG L  +         
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAY-GDGSNTTGVLSTETFTFDDGGS 206

Query: 216 QSKSVDSR---ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                  R   + FGC     GSF      +GL GLG    S+ + L     +   FS C
Sbjct: 207 GRSPRQVRVGGVKFGCSTATAGSF----PADGLVGLGGGAVSLVTQLGGATSLGRRFSYC 262

Query: 273 F---GSDGTGRISFG---DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
                 + +  ++FG   D   PG   TP         Y + +  V VG   V    S+ 
Sbjct: 263 LVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSR 322

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPV 382
            I DSGT+ T+L DP+   +    + L++        + D   + CY ++  +      +
Sbjct: 323 IIVDSGTTLTFL-DPSL--LGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESI 379

Query: 383 VNLTMK-GGGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +LT++ GGG      P    V+ +   L L  +   +   V+I+G
Sbjct: 380 PDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILG 425


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 123/299 (41%), Gaps = 54/299 (18%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           S+G P       +DTGSDL WL C+ C  C   +           I+ P+ SS+   +PC
Sbjct: 93  SIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITP---------IFDPSLSSSYQNIPC 143

Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
            S  C   +      ++C   VR         G+L  + L L +    S S   +   GC
Sbjct: 144 LSDTCHSMRT-----TSC--DVR---------GYLSVETLTLDSTTGYSVSF-PKTMIGC 186

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGD 285
           G   TG+F      +G+ GLG    S+PS L     I   FS C G    + T +++FGD
Sbjct: 187 GYRNTGTF--HGPSSGIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGD 242

Query: 286 KG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAIFDSGTSFT 334
                  G   TP   +     Y +T+   SVG   + F        E + + DSGT+FT
Sbjct: 243 AAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFT 302

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCYVLSPNQTNFEYPVVNLTMKGG 390
           +L    Y +    F S   E       + P   F+ CY ++ +   FE P++    KG 
Sbjct: 303 FLPYDVYYR----FESAVAEYINLEHVEDPNGTFKLCYNVAYH--GFEAPLITAHFKGA 355


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 96/341 (28%), Positives = 142/341 (41%), Gaps = 45/341 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG PA S  +  DTGSD+ WL C  C  C    +          I++P+ SS+ 
Sbjct: 81  YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDP---------IFNPSLSSSF 131

Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             + C S++C +L+ +  S  + C YQV Y  DG+ + G    + L       +S     
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSY-GDGSFTVGDFSTETLSFGEHAVRS----- 185

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTG 279
            ++ GCGR   G F   A    L GLG    S PS         + FS C     S    
Sbjct: 186 -VAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAA 239

Query: 280 RISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-----------I 326
            + FG    P +      L  R+    Y + + ++ V G+ VN    A           I
Sbjct: 240 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 299

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
            DSGT+ + L  PAYT + + F SL         S   F+ CY LS  +T    P V L 
Sbjct: 300 VDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGIS--LFDTCYDLSSMKTA-TLPAVVLD 356

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV-NIIG 426
             GG    +    ++V+ + +G   YCL     +   +IIG
Sbjct: 357 FDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIG 395


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 89/371 (23%), Positives = 140/371 (37%), Gaps = 64/371 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++    VG PA  F++  DTGSDL W+ C   +     NSS         + P  S T +
Sbjct: 94  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAA----NSSESGSGSGRAFRPEDSRTWA 149

Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            + C S  C          CP+ GS C Y  RY  DG+ + G +  +   +A   +  + 
Sbjct: 150 PISCASDTCTKSLPFSLATCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSGRGREE 208

Query: 220 VDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
             +++     GC    TG   +    +G+  LG    S  S  A++      FS C    
Sbjct: 209 RKAKLKGLVLGCTSSYTGPSFE--VSDGVLSLGYSDVSFASHAASR--FAGRFSYCLVDH 264

Query: 274 --GSDGTGRISFGDK-----------------------GSPGQGETPFSL-RQTHPTYNI 307
               + T  ++FG                           P   +TP  L R+  P Y++
Sbjct: 265 LSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDV 324

Query: 308 TITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRE 357
            +  VSV G  +    +          I DSGTS T L  PAY  +    +  LA   R 
Sbjct: 325 AVKAVSVAGQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRV 384

Query: 358 TSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           T     PFEYCY  +    +   P + +   G           ++ + P    + C+G+ 
Sbjct: 385 TMD---PFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPG---VKCIGLQ 438

Query: 418 KS--DNVNIIG 426
           +     +++IG
Sbjct: 439 EGPWPGISVIG 449


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           + T+V +G P+ + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSRGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 139/318 (43%), Gaps = 39/318 (12%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           +SVG P    I   DTGSD+ W  C+ C +C            D  +++P+ S+T  KV 
Sbjct: 89  LSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQ---------DLPMFNPSKSTTYRKVS 139

Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           C+S +C    +    S   +C Y + Y  D + S G    D L + +   +  +   R +
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISY-GDNSHSQGDFAVDTLTMGSTSGRVVAF-PRTA 197

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD--GTGR 280
            GCG    GSF   A  +G+ GLG+   S+   + +   +   FS C    G+D  G+ +
Sbjct: 198 IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNK 253

Query: 281 ISFGDKGS---PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIF 327
           ++FG   +    G   TP  +     + Y++ +  VSVG N   +         + + I 
Sbjct: 254 LNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIII 313

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L    Y   ++  ++    +R    +    EYC+  + +  +++ P + +  
Sbjct: 314 DSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF-LEYCFETTTD--DYKVPFIAMHF 370

Query: 388 KGGGPFFVNDPIVIVSSE 405
           +G       + ++I  S+
Sbjct: 371 EGANLRLQRENVLIRVSD 388


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 137/342 (40%), Gaps = 55/342 (16%)

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
           + P+T  A     RL +L ++    +  G+      V +DT S+L W+ C+     H   
Sbjct: 99  QVPVTSGA-----RLRTLNYVATVGIGGGEAT----VIVDTASELTWVQCEPCDACHDQQ 149

Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--------QCPSAGSNCPYQVRYLSD 195
                     ++ P++S + + VPCNS+ C+  +         C    + C Y + Y  D
Sbjct: 150 EP--------LFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSY-RD 200

Query: 196 GTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
           G+ S G L  D L LA ++ Q         FGCG    G F      +GL GLG  + S+
Sbjct: 201 GSYSRGVLAHDRLSLAGEDIQG------FVFGCGTSNQGPF---GGTSGLMGLGRSQLSL 251

Query: 256 PSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPTYN 306
            S   +Q      FS C     S  +G +  GD  S  +  TP       S     P Y 
Sbjct: 252 ISQTMDQ--FGGVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYL 309

Query: 307 ITITQVSVGGNAVNFE-FS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
             +T ++VGG  V    FS      AI DSGT  T L    Y  +   F S   E  + +
Sbjct: 310 ANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAA 369

Query: 360 TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI 401
              +  + C+ L+      + P + L   GG    V+   V+
Sbjct: 370 PFSI-LDTCFDLT-GLREVQVPSLKLVFDGGAEVEVDSKGVL 409


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 142/359 (39%), Gaps = 41/359 (11%)

Query: 50  DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNV 109
           D PK   +      + R R    R     +   D + +  S  +    +   G  +  N+
Sbjct: 39  DSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMNL 98

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           S+G P    +   DTGS+L W  C  C  C   ++          ++ P  SST   V C
Sbjct: 99  SLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDP---------LFDPKASSTYKDVSC 149

Query: 169 NSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           +S+ C   E Q  C +    C Y V Y +DG+ + G    D L L + + +   + + I 
Sbjct: 150 SSSQCTALENQASCSTEDKTCSYLVSY-ADGSYTMGKFAVDTLTLGSTDNRPVQLKNII- 207

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG-LIPNSFSMCF--GSDGTGRIS 282
            GCG+    +F      N   G+        S++   G  I   FS C    +D T +I+
Sbjct: 208 IGCGQNNAVTFR-----NKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKIN 262

Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSAIFDSGTSFT 334
           FG       PG   TP  ++     Y +T+  +SVG   +     N + + + DSGT+ T
Sbjct: 263 FGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLT 322

Query: 335 YLNDPAYTQISETFNSLA---KEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
            L    Y +I     SL    K K E   S L    CY  +    +   PV+ +  +G 
Sbjct: 323 LLPVKYYIEIENAVASLINADKSKDERIGSSL----CYNAT---ADLNIPVITMHFEGA 374


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 82/297 (27%), Positives = 123/297 (41%), Gaps = 54/297 (18%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           SVG P        DTGSD+ WL C+   C    N ++ +      + P+ SST   +PC+
Sbjct: 92  SVGTPPFKLYGIADTGSDIVWLQCE--PCKECYNQTTPK------FKPSKSSTYKNIPCS 143

Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
           S LC+  +Q                      G L  D L L +      S    +  GCG
Sbjct: 144 SDLCKSGQQ----------------------GNLSVDTLTLESSTGHPISFPKTV-IGCG 180

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGRISFG 284
              T SF +GA+ +G+ GLG    S+ + L +   I   FS C       S+ T +++FG
Sbjct: 181 TDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPVESNTTSKLNFG 236

Query: 285 DKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--------IFDSGTSF 333
           D       G   TP   +     Y +T+   SVG   + FE S+        I DSGT+ 
Sbjct: 237 DTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTL 296

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           T +    Y  +      L K KR    + L F  CY ++ +   +++P++    KG 
Sbjct: 297 TVIPTDVYNNLESAVLELVKLKRVNDPTRL-FNLCYSVTSD--GYDFPIITTHFKGA 350


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 96/341 (28%), Positives = 142/341 (41%), Gaps = 45/341 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG PA S  +  DTGSD+ WL C  C  C    +          I++P+ SS+ 
Sbjct: 14  YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDP---------IFNPSLSSSF 64

Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             + C S++C +L+ +  S  + C YQV Y  DG+ + G    + L       +S     
Sbjct: 65  KPLACASSICGKLKIKGCSRKNKCMYQVSY-GDGSFTVGDFSTETLSFGEHAVRS----- 118

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTG 279
            ++ GCGR   G F   A    L GLG    S PS         + FS C     S    
Sbjct: 119 -VAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAA 172

Query: 280 RISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-----------I 326
            + FG    P +      L  R+    Y + + ++ V G+ VN    A           I
Sbjct: 173 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 232

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
            DSGT+ + L  PAYT + + F SL         S   F+ CY LS  +T    P V L 
Sbjct: 233 VDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL--FDTCYDLSSMKTA-TLPAVVLD 289

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV-NIIG 426
             GG    +    ++V+ + +G   YCL     +   +IIG
Sbjct: 290 FDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIG 328


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTTWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSRGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSASWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 152/362 (41%), Gaps = 60/362 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   ++VG PA+  ++ALDT SDL WL C  C  C       SG V D     P  S++ 
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 191

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDG------TMSTGFLVEDVLHLATDE 214
            ++  ++  C+   +     +    C Y V Y  DG      + S G LVE+ L  A   
Sbjct: 192 GEMNYDAPDCQALGRSGGGDAKRGTCIYTVLY-GDGDGHGSTSTSVGDLVEETLTFAGGV 250

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
           +Q+      +S GCG    G F  GA   G+ GL   + S+P  +A  G    SFS C  
Sbjct: 251 RQAY-----LSIGCGHDNKGLF--GAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLV 302

Query: 274 ------GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV---- 319
                 GS  +  ++FG      SP    TP  L Q  PT Y + +  VSVGG  V    
Sbjct: 303 DFISGPGSP-SSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVT 361

Query: 320 ---------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEY 367
                          I DSGT+ T L  PAYT   + F + A    + ST   S L F+ 
Sbjct: 362 ERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGL-FDT 420

Query: 368 CYVLSPN---QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
           CY +      +   + P V++   GG    +     +++ + +G   +        +V++
Sbjct: 421 CYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSV 480

Query: 425 IG 426
           IG
Sbjct: 481 IG 482


>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
 gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
          Length = 379

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/276 (30%), Positives = 123/276 (44%), Gaps = 44/276 (15%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYS 156
           GF + T +++GQP+  + + +DTGSDL WL CD     C    H              Y 
Sbjct: 18  GFYNVT-LNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPH------------PYYK 64

Query: 157 PNTSSTSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDE 214
           P+ +  + K P C S      ++C + G  C Y+V Y +DG  S G LV+D  +L  T E
Sbjct: 65  PSNNLVACKDPICQSLHTGGDQRCENPG-QCDYEVEY-ADGGSSLGVLVKDAFNLNFTSE 122

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
           K+   + +    G  ++  G++      +G+ GLG  K S+ S L+  GL+ N    C  
Sbjct: 123 KRQSPLLALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCL- 178

Query: 275 SDGTGRISFGDK------GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIF 327
              +GR             S     TP S    H  Y+    +++  G    F+     F
Sbjct: 179 ---SGRGGGFLFFGDDLYDSSRVAWTPMSPNAKH--YSPGFAELTFDGKTTGFKNLIVAF 233

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
           DSG S+TYLN    +Q+ +   SL   KRE ST  L
Sbjct: 234 DSGASYTYLN----SQVYQGLISLI--KRELSTKPL 263


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 92/352 (26%), Positives = 139/352 (39%), Gaps = 53/352 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P    ++ LDTGSD+ WL C  C  C       SGQ+ D     P  S + 
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCY----DQSGQMFD-----PRASHSY 197

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             V C + LC       C      C YQV Y  DG+++ G    + L  A+  +      
Sbjct: 198 GAVDCAAPLCRRLDSGGCDLRRKACLYQVAY-GDGSVTAGDFATETLTFASGARV----- 251

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
            R++ GCG    G F+  A   GL        S PS ++ +     SFS C         
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRR--FGRSFSYCLVDRTSSSA 306

Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV--------- 319
             +  +  ++FG           F+    +P     Y + +  +SVGG  V         
Sbjct: 307 SATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLR 366

Query: 320 ----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
                     I DSGTS T L  PAY  + + F + A   R +      F+ CY LS  +
Sbjct: 367 LDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLK 426

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
              + P V++   GG    +     ++  + +G   +C     +D  V+IIG
Sbjct: 427 V-VKVPTVSMHFAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 475


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 108/401 (26%), Positives = 161/401 (40%), Gaps = 62/401 (15%)

Query: 62  ALAHRDRYFRLRGRGLAAQGND----KTPLTFSAGNDT----YRLNSLGFLHYT-NVSVG 112
           +LA R R  R R   +  +        T L+ +AG  T    +  +S+  L Y   + +G
Sbjct: 39  SLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIG 98

Query: 113 QPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
            PA+   V +DTGSDL W+   PC    C    +          ++ P++SS+ + VPC+
Sbjct: 99  TPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDP---------LFDPSSSSSYASVPCD 149

Query: 170 STLCELQKQCP----------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S  C                  A + C Y + Y +  T +TG    + L L     +   
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT-TTGVYSTETLTL-----KPGV 203

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           V +   FGCG  Q G +      +GL GLG    S+ S  ++Q   P S+ +   S G G
Sbjct: 204 VVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 260

Query: 280 RISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----I 326
            ++ G          + G   TP     + PT Y +T+T +SVGG  +    SA     +
Sbjct: 261 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 320

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNL 385
            DSGT  T L   AY  +   F S   E R    S+    + CY  +    N   P ++L
Sbjct: 321 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFT-GHANVTVPTISL 379

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           T  GG    +  P  +       L   CL    +   N IG
Sbjct: 380 TFSGGATIDLAAPAGV-------LVDGCLAFAGAGTDNAIG 413


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 148/375 (39%), Gaps = 57/375 (15%)

Query: 82  NDKTPLTFSAGNDTYRLNS---------LGFLHY-TNVSVGQPALSFIVALDTGSDLFWL 131
           ND+    +S  N TY   S         +G  +Y      G PA + ++ +DTGSD+ W+
Sbjct: 105 NDRLNTIWSKNNGTYSTMSNLPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWI 164

Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
            C  C  C   ++          I+ P  SS+   + C S+ C EL          C Y+
Sbjct: 165 QCKPCSDCYSQVDP---------IFEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCVYE 215

Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
           + Y  DG+ S G   ++ L L +D   S       +FGCG   TG F   A   GL GLG
Sbjct: 216 INY-GDGSRSQGDFSQETLTLGSDSFPS------FAFGCGHTNTGLFKGSA---GLLGLG 265

Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT 304
               S PS    +      FS C      S  TG  S G    P      P      +P+
Sbjct: 266 RTALSFPS--QTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPS 323

Query: 305 -YNITITQVSVGGN------AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
            Y + +  +SVGG       AV      I DSGT  T L   AY  +  +F S    K  
Sbjct: 324 FYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLVPQAYDALKTSFRS----KTR 379

Query: 358 TSTSDLPF---EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL 414
              S  PF   + CY LS + +    P +    +      V+   ++ + +  G  + CL
Sbjct: 380 NLPSAKPFSILDTCYDLS-SYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQV-CL 437

Query: 415 GVV---KSDNVNIIG 426
                 +S + NIIG
Sbjct: 438 AFASASQSISTNIIG 452


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                SFGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + ++    + + +T +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 86/355 (24%), Positives = 147/355 (41%), Gaps = 43/355 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  + +G P   + + LDTGS L WL C  C    H             +Y P+ S T 
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADP--------LYDPSVSKTY 176

Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            K+ C S  C   K        C +  + C Y   Y  D + S G+L +D+L L + +  
Sbjct: 177 KKLSCASVECSRLKAATLNDPLCETDSNACLYTASY-GDTSFSIGYLSQDLLTLTSSQTL 235

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
                 + ++GCG+   G F   A   G+ GL  DK S+ + L+ +    ++FS C  + 
Sbjct: 236 -----PQFTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTK--YGHAFSYCLPTA 285

Query: 277 GTGRISFGDKG----SPGQGE-TPFSLRQTHPT-YNITITQVSVGGN-----AVNFEFSA 325
            +G    G       SP   + TP      +P+ Y + +T ++V G      A  +    
Sbjct: 286 NSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT 345

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           + DSGT  T L    Y  + + F  +   K   + +    + C+  S    +   P + +
Sbjct: 346 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSIS-AVPEIKM 404

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
             +GG    +  P +++ ++     L   G   ++ + IIG    + Y IA ++S
Sbjct: 405 IFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 459


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 158/374 (42%), Gaps = 52/374 (13%)

Query: 34  FHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ--GNDKTPLTFSA 91
            HHRY DP   +      P K        L  R R  +LR   +  +  G      + +A
Sbjct: 59  LHHRY-DPCSPV------PSK----KVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAA 107

Query: 92  GNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
              T    SL  L Y   V +G PA++  +++DTGSD+ W+ C  C  C   ++S     
Sbjct: 108 TVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDS----- 162

Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVE 205
               ++ P++SST S   C+S  C    Q         S C Y V Y   G  S+     
Sbjct: 163 ----LFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNY---GDSSSTTGTY 215

Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
               L        S  +   FGC + ++G F D    +GL GLG    S+ S  A  G  
Sbjct: 216 SSDTLTL----GSSAMTDFQFGCSQSESGGFNDQT--DGLMGLGGGAQSLASQTA--GTF 267

Query: 266 PNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PTYNITITQ-VSVGGNAVN- 320
             +FS C    S  +G ++ G  GS G  +TP  LR T  PTY + + + + VG   +N 
Sbjct: 268 GTAFSYCLPPTSGSSGFLTLG-TGSSGFVKTPM-LRSTQIPTYYVVLLESIKVGSQQLNL 325

Query: 321 ----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
               F   ++ DSGT  T L   AY+ +S  F +  ++    + S +  + C+  S  Q+
Sbjct: 326 PTSVFSAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGI-LDTCFDFS-GQS 383

Query: 377 NFEYPVVNLTMKGG 390
           +   P V L   GG
Sbjct: 384 SISIPTVTLVFSGG 397


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 102/424 (24%), Positives = 163/424 (38%), Gaps = 64/424 (15%)

Query: 29  TFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---NDKT 85
           + GF    ++ D VK +   + L +            ++R  RL    LAA      D+ 
Sbjct: 48  SHGFRVRLKHVDHVKNLTRFERLRR-------GVARGKNRLHRLNAMVLAAANATVGDQV 100

Query: 86  PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
                AGN  + +          +++G P  SF   +DTGSDL W    C  C    + S
Sbjct: 101 KAPVVAGNGEFLMK---------LAIGSPPRSFSAIMDTGSDLIW--TQCKPCQQCFDQS 149

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
           +       I+ P  SS+  K+ C+S LC        +   C Y   Y  D + + G L  
Sbjct: 150 T------PIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTY-GDSSSTQGVLAF 202

Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGL 264
           +        +   S+   + FGCG    G  F  GA   GL GLG    S+ S L  Q  
Sbjct: 203 ETFTFGDSTEDQISIPG-LGFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKEQKF 258

Query: 265 I----------PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVS 313
                      P+S  +   ++ T + S  +  +     TP     + P+ Y +++  +S
Sbjct: 259 AYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKT-----TPLIKNPSQPSFYYLSLQGIS 313

Query: 314 VGGNAVN-----FEF------SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           VGG  ++     FE         I DSGT+ TY+ + A+T +   F +      + S + 
Sbjct: 314 VGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTG 373

Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
              + C+ L       E P +    KG       +  +I  S+     L CL +  S  +
Sbjct: 374 -GLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG---LLCLAIGSSRGM 429

Query: 423 NIIG 426
           +I G
Sbjct: 430 SIFG 433


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 153/379 (40%), Gaps = 62/379 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  + VG PA  F++  DTGSDL W+ C   S      ++S       ++ P  S + S
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQ---RVFRPAGSKSWS 160

Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLATDEKQS 217
            +PC+S  C+         C S    C Y  RY  D + + G +  D   + L+ ++   
Sbjct: 161 PLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRY-KDNSSARGVVGLDSATVSLSGNDGTR 219

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
           K+    +  GC     G     +  +G+  LG    S  S  A++      FS C     
Sbjct: 220 KAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFASRAASR--FGGRFSYCLVDHL 275

Query: 274 -GSDGTGRISFGDKGSPGQG-----ETPFSL---RQTHPTYNITITQVSVGGNAVN---- 320
              + T  ++FG+  S          TP  L    +T P Y +++  V+V G  +     
Sbjct: 276 APRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPD 335

Query: 321 -FEFS----AIFDSGTSFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCYVL 371
            ++F     AI DSGTS T L  PAY      IS+ F  + +   +      PFEYCY  
Sbjct: 336 VWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMD------PFEYCYNW 389

Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIGR-- 427
           +    + E P + L   G           ++ + P    + C+GVV+     V++IG   
Sbjct: 390 T--GVSAEIPRMELRFAGAATLAPPGKSYVIDTAPG---VKCIGVVEGAWPGVSVIGNIL 444

Query: 428 ------EYPIANNISLFHN 440
                 E+ +AN    F  
Sbjct: 445 QQEHLWEFDLANRWLRFKQ 463


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSKGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 154/346 (44%), Gaps = 45/346 (13%)

Query: 99  NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
            S+G  +Y T + +G P+ S+ + +DTGS L WL   C  CV   +   G + D     P
Sbjct: 127 TSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGPLFD-----P 179

Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLA 211
             SST + V C+++ C ELQ     PSA S    C YQ  Y  D + S G L  D +   
Sbjct: 180 RASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASY-GDSSFSVGSLSTDTVSFG 238

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           +    S        +GCG+   G F   A   GL GL  +K S+   LA    +  SFS 
Sbjct: 239 STRYPS------FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287

Query: 272 CFGSDG-TGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFS 324
           C  +   TG +S G   + G     TP +      + Y IT++ +SVGG+ +     E+S
Sbjct: 288 CLPTAASTGYLSIGPYNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYS 346

Query: 325 A---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
           +   I DSGT  T L    +T +S+    ++A  +R  + S L  + C+    +Q     
Sbjct: 347 SLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSIL--DTCFEGQASQ--LRV 402

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P V +   GG    +    V++  +       CL    +D+  IIG
Sbjct: 403 PTVAMAFAGGASMKLTTRNVLIDVDDS---TTCLAFAPTDSTAIIG 445


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 135/316 (42%), Gaps = 43/316 (13%)

Query: 92  GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVI 150
           G+D+ R N     ++  +S+G P +  +V +DTGS L W+ C +C    +   + +GQ  
Sbjct: 16  GDDSMRKNK----YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-- 69

Query: 151 DFNIYSPNTSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
              I++P  SST SKV C++  C        ++  C      C Y +RY S G  S G+L
Sbjct: 70  ---IFNPYNSSTYSKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYL 125

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
            +D L LA++    +S+D+ I FGCG       L      G+ G G    S  + +  Q 
Sbjct: 126 GKDRLTLASN----RSIDNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQT 176

Query: 264 LIPNSFSMCFGSD--GTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
               +FS CF  D    G ++ G          T        P Y   I Q+ +  N + 
Sbjct: 177 DY-TAFSYCFPRDHENEGSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIR 233

Query: 321 FEFS--------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
            E           I DSGT+ TY+  P +  + +      + K  T   D     C++ +
Sbjct: 234 LEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWD-ERRICFISN 292

Query: 373 PNQTNF-EYPVVNLTM 387
               N+ ++P V + +
Sbjct: 293 SGSANWNDFPTVEMKL 308


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 93/317 (29%), Positives = 134/317 (42%), Gaps = 66/317 (20%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            +S+G P   F   +DTGSDL W+ C  C  C    +          ++ P  SS+ S  
Sbjct: 11  QISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDP---------LFIPLASSSYSNA 61

Query: 167 PCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
            C  +LC+ L +   S  + C Y   Y  DG+ + G    + + L      + S  +RI 
Sbjct: 62  SCTDSLCDALPRPTCSMRNTCTYSYSY-GDGSNTRGDFAFETVTL------NGSTLARIG 114

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----GRI 281
           FGCG  Q G+F   A  +GL GLG    S+PS L +     + FS C     T      I
Sbjct: 115 FGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPI 169

Query: 282 SFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFD 328
           +FG+     +   TP    + +P+ Y + +  +SVG   V    SA           I D
Sbjct: 170 TFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILD 229

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEY----CYVLS---------PN 374
           SGT+ TY    A+  I      LA+ +R+ S  +  P  Y    CY +S         P+
Sbjct: 230 SGTTITYWRLAAFIPI------LAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPS 283

Query: 375 QT------NFEYPVVNL 385
            T      +FE PV NL
Sbjct: 284 MTVHLTNVDFEIPVSNL 300


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGRRGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 90/328 (27%), Positives = 140/328 (42%), Gaps = 48/328 (14%)

Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQ 176
           + LDTGSD+ W+ C  C  C    +          ++ P+ S++ + V C+S  C     
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASYAAVSCDSQRCRDLDT 51

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
             C +A   C Y+V Y  DG+ + G    + L L             ++ GCG    G F
Sbjct: 52  AACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVGN-----VAIGCGHDNEGLF 105

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGE 293
           +  A    L G  +   S PS ++      ++FS C     S     + FGD  +     
Sbjct: 106 VGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTV 157

Query: 294 TPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA------------IFDSGTSFTYLNDP 339
           T   +R  +T   Y + ++ +SVGG  ++   SA            I DSGT+ T L   
Sbjct: 158 TAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSA 217

Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI 399
           AY  + + F   A     TS   L F+ CY LS ++T+ E P V+L  +GGG   +    
Sbjct: 218 AYAALRDAFVQGAPSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVSLRFEGGGALRLPAKN 275

Query: 400 VIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
            ++  +  G   YCL    ++  V+IIG
Sbjct: 276 YLIPVDGAG--TYCLAFAPTNAAVSIIG 301


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                SFGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + ++    + + +T +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGRGGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 145/350 (41%), Gaps = 57/350 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                SFGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + ++    + + +T +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLSPNQT 376
              +FDSG+  +Y+ D A + +S+    L      A+E+ E +        CY +     
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRSVDE 272

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             + P ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 273 G-DMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 49/299 (16%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G PA + ++ALDT +D  W+PC  C+ C               ++S + SS+   +PC 
Sbjct: 32  IGTPAQTLLLALDTSNDAAWIPCSGCIGCPST-----------TVFSSDKSSSFRPLPCQ 80

Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
           S  C        +GS C + + Y S    +   LV+D L LATD   S       +FGC 
Sbjct: 81  SPQCNQVPNPSCSGSACGFNLTYGSSTVAAD--LVQDNLTLATDSVPS------YTFGCI 132

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGD 285
           R  TGS +          LG+ +  +  +  +Q L  ++FS C  S    + +G +  G 
Sbjct: 133 RKATGSSVPPQG-----LLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGP 187

Query: 286 KGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
              P + +    LR    +  Y + +  + VG   V+   SA           + DSGT+
Sbjct: 188 VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTT 247

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCY---VLSPNQTNFEYPVVNLTM 387
           FT L  PAYT + + F    +  R  + S L  F+ CY   ++SP  T F +  +N+T+
Sbjct: 248 FTRLVAPAYTAVRDEFRR--RVGRNVTVSSLGGFDTCYTVPIISPTIT-FMFAGMNVTL 303


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 108/401 (26%), Positives = 161/401 (40%), Gaps = 62/401 (15%)

Query: 62  ALAHRDRYFRLRGRGLAAQGND----KTPLTFSAGNDT----YRLNSLGFLHYT-NVSVG 112
           +LA R R  R R   +  +        T L+ +AG  T    +  +S+  L Y   + +G
Sbjct: 119 SLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIG 178

Query: 113 QPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
            PA+   V +DTGSDL W+   PC    C    +          ++ P++SS+ + VPC+
Sbjct: 179 TPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDP---------LFDPSSSSSYASVPCD 229

Query: 170 STLCELQKQCP----------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S  C                  A + C Y + Y +  T +TG    + L L     +   
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT-TTGVYSTETLTL-----KPGV 283

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           V +   FGCG  Q G +      +GL GLG    S+ S  ++Q   P S+ +   S G G
Sbjct: 284 VVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 340

Query: 280 RISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----I 326
            ++ G          + G   TP     + PT Y +T+T +SVGG  +    SA     +
Sbjct: 341 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 400

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNL 385
            DSGT  T L   AY  +   F S   E R    S+    + CY  +    N   P ++L
Sbjct: 401 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFT-GHANVTVPTISL 459

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           T  GG    +  P  +       L   CL    +   N IG
Sbjct: 460 TFSGGATIDLAAPAGV-------LVDGCLAFAGAGTDNAIG 493


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 87/310 (28%), Positives = 132/310 (42%), Gaps = 54/310 (17%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           N S+G+P +  +  +DTGS L W+ C      H  +S S Q +   I+ P+ SST S + 
Sbjct: 96  NFSIGEPPIPQLAVMDTGSSLTWVMC------HPCSSCSQQSVP--IFDPSKSSTYSNLS 147

Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           C+        +C      CPY V Y+  G+ S G    + L L T ++    V S I FG
Sbjct: 148 CSEC-----NKCDVVNGECPYSVEYVGSGS-SQGIYAREQLTLETIDESIIKVPSLI-FG 200

Query: 228 CGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPN---SFSMCFGSDGT-- 278
           CGR    S      P    NG+FGLG  + S         L+P+    FS C G+     
Sbjct: 201 CGR--KFSISSNGYPYQGINGVFGLGSGRFS---------LLPSFGKKFSYCIGNLRNTN 249

Query: 279 ---GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------ 324
               R+  GDK +  QG++  +L   +  Y + +  +S+GG  ++     FE S      
Sbjct: 250 YKFNRLVLGDKANM-QGDST-TLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNS 307

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYCYVLSPNQTNFEYP 381
             I DSG   T+L    +  +S    +L +     +  D   P+  CY    +Q    +P
Sbjct: 308 GVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFP 367

Query: 382 VVNLTMKGGG 391
           +V      G 
Sbjct: 368 LVTFHFAEGA 377


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 136/323 (42%), Gaps = 35/323 (10%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
            R  Y   R  G A Q  D      +A         +G L+Y    S+G P ++  + +D
Sbjct: 99  RRAEYILRRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
           TGSDL W+ C   S      S    + D     P  SS+ + VPC   +C    +     
Sbjct: 159 TGSDLSWVQCKPCSAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
            + + C Y V Y  DG+ +TG    D L L+     + S      FGCG  Q+G F +G 
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
             +GL GLG ++ S+  +    G     FS C  +  +  G ++ G  G P      FS 
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-LGGPSGAAPGFST 321

Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISET 347
            Q  P+      Y + +T +SVGG  ++   SA     + D+GT  T L   AY  +   
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRLPPTAYAALRSA 381

Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
           F S +A     T+ S+   + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 132/305 (43%), Gaps = 40/305 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS-CVHGLNSSSGQVIDFNIYSPNTSST 162
           +   V +G P        DTGSDL W  C+ C   C H             I++P+ S++
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEP---------IFNPSKSTS 188

Query: 163 SSKVPCNSTLCELQK----QCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            + + C+S  C+  K      PS + S C Y ++Y  D + S GF  +D L L + +   
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQY-GDQSYSVGFFAQDKLALTSTD--- 244

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
             V +   FGCG+   G F+  A   GL GLG +  S+ S  A +      FS C    S
Sbjct: 245 --VFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQK--YGKLFSYCLPSTS 297

Query: 276 DGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------I 326
             TG ++FG  G   +    TP  +    P+ Y + +  +SVGG  ++   S       I
Sbjct: 298 SSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTI 357

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
            DSGT  + L   AY+ +  +F     +  + + + +  + CY  S   T  + P +NL 
Sbjct: 358 IDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASI-LDTCYDFSQYDT-VDVPKINLY 415

Query: 387 MKGGG 391
              G 
Sbjct: 416 FSDGA 420


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 140/357 (39%), Gaps = 48/357 (13%)

Query: 68  RYFRLRGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDT 124
           R    R   LAA+ + +    +++G  T      +  G  +    S+G+P L     +DT
Sbjct: 47  RTAESRNLSLAAERSRRRLSVYTSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDT 106

Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQK 177
           GSDL W+ C   S  +G N          +Y P  S +S K+PC+S LC+       +  
Sbjct: 107 GSDLMWVKC---SPCNGCNPPPSP-----LYDPARSRSSGKLPCSSQLCQALGRGRIISD 158

Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
           QC      C Y   Y   G  ST    + VL   T       V + +SFG      GS  
Sbjct: 159 QCSDDPPLCGYHYAYGHSGDHST----QGVLGTETFTFGDGYVANNVSFGRSDTIDGSQF 214

Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLI------PNSFS-MCFGSDGTGRISFGDKGSPG 290
            G A  GL GLG    S+ S L            PN +S + FGS      S GD  S  
Sbjct: 215 GGTA--GLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTP 272

Query: 291 QGETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFSA--IFDSGTSFTYLNDP 339
               P   R TH  Y + +  +SVGG+         A+N + S    FDSG   T L D 
Sbjct: 273 LVTNPKPDRDTH--YYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDA 330

Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
           AY  + +   S  +     +  D     C+V +  Q   + P + L    G    +N
Sbjct: 331 AYQVVRQAITSEIQRLGYDAGDDT----CFVAANQQAVAQMPPLVLHFDDGADMSLN 383


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 103/424 (24%), Positives = 160/424 (37%), Gaps = 64/424 (15%)

Query: 29  TFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---NDKT 85
           + GF    ++ D VK +   + L +            ++R  RL    LAA      D+ 
Sbjct: 303 SHGFRVRLKHVDHVKNLTRFERLRR-------GVARGKNRLHRLNAMVLAAANATVGDQV 355

Query: 86  PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
                AGN  + +          +++G P  SF   +DTGSDL W    C  C    + S
Sbjct: 356 KAPVVAGNGEFLMK---------LAIGSPPRSFSAIMDTGSDLIW--TQCKPCQQCFDQS 404

Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
           +       I+ P  SS+  K+ C+S LC        +   C Y   Y  D + + G L  
Sbjct: 405 T------PIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTY-GDSSSTQGVLAF 457

Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGL 264
           +        +   S+   + FGCG    G  F  GA   GL GLG    S+ S L  Q  
Sbjct: 458 ETFTFGDSTEDQISIPG-LGFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKEQ-- 511

Query: 265 IPNSFSMCFGSDGTGRISFGDKGSPG----------QGETPFSLRQTHPT-YNITITQVS 313
               F+ C  +    + S    GS               TP     + P+ Y +++  +S
Sbjct: 512 ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGIS 568

Query: 314 VGGNAVN-----FEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           VGG  ++     FE         I DSGT+ TY+ + A+T +   F +      + S + 
Sbjct: 569 VGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTG 628

Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
              + C+ L       E P +    KG       +  +I  S+     L CL +  S  +
Sbjct: 629 -GLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG---LLCLAIGSSRGM 684

Query: 423 NIIG 426
           +I G
Sbjct: 685 SIFG 688


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/351 (26%), Positives = 148/351 (42%), Gaps = 58/351 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  N+S+G P +  I  +DTGSDL W  C  C  C         QV+ F  + P  SST 
Sbjct: 92  YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVPF--FDPKNSSTY 142

Query: 164 SKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
               C ++ C      + C + G  C +   Y +DG+ + G L  + L +A+   +  S 
Sbjct: 143 RDSSCGTSFCLALGNDRSCRN-GKKCTFMYSY-ADGSFTGGNLAVETLTVASTAGKPVSF 200

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
               +FGC     G F + ++  G+ GLG+ + S+ S L  +  I   FS C       S
Sbjct: 201 PG-FAFGCVHRSGGIFDEHSS--GIVGLGVAELSMISQL--KSTINGRFSYCLLPVFTDS 255

Query: 276 DGTGRISFGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAVNF---------- 321
             + RI+FG  G     G   TP  ++     Y  IT+   SVG   +++          
Sbjct: 256 SMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVE 315

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
           E + I DSGT++TYL    Y ++ E+     K KR    + +    CY  + +Q   + P
Sbjct: 316 EGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGIS-SLCYNTTVDQ--IDAP 372

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLY------LYCLGVVKSDNVNIIG 426
           ++    K             V  +P   +      L C  V+ + ++ I+G
Sbjct: 373 IITAHFKDAN----------VELQPWNTFLRMQEDLVCFTVLPTSDIGILG 413


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/356 (24%), Positives = 142/356 (39%), Gaps = 45/356 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIY-SPNTSS 161
           F +   + VG P +  +   DTGSDL W+ C       G ++ +      ++Y  P+ SS
Sbjct: 108 FEYLMAIEVGTPPVRVLAIADTGSDLVWVKC------KGKDNDNNSTAPPSVYFVPSASS 161

Query: 162 TSSKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           T  +V C++  C        C   GS C Y   Y  DG+ ++G L  +    +T    SK
Sbjct: 162 TYGRVGCDTKACRALSSAASCSPDGS-CEYLYSY-GDGSRASGQLSTETFTFSTIADSSK 219

Query: 219 SVD----------------SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           +                  +++ FGC    TG+F      +GL GLG    S+ S L   
Sbjct: 220 TNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLVGLGGGPVSLASQLGAT 275

Query: 263 GLIPNSFSMCFG----SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVG 315
             +   FS C      ++ +  ++FG +     PG   TP    +    Y I +  ++V 
Sbjct: 276 TSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVA 335

Query: 316 GN---AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
           G        +   I DSGT+ TYL+    T + +      K  R  S   +  + CY +S
Sbjct: 336 GTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKI-LDLCYDIS 394

Query: 373 --PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               +     P V L + GGG   +      V  +   L L  +   +  +V+I+G
Sbjct: 395 GVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILG 450


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/359 (26%), Positives = 145/359 (40%), Gaps = 59/359 (16%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
            N+S+G P ++ ++ +DT SDL WL C  C++C               I+ P+ S T   
Sbjct: 87  VNISIGSPPVTQLLHMDTASDLLWLQCRPCINCY---------AQSLPIFDPSRSYTHRN 137

Query: 166 VPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSK 218
             C ++    Q   PS   N     C Y +RY+ DGT S G L +++L   T  DE  S 
Sbjct: 138 ESCRTS----QYSMPSLRFNAKTRSCEYSMRYM-DGTGSKGILAKEMLMFNTIYDESSSA 192

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           ++   + FGCG    G  L G    G+ GLG  + S+      +      FS CFGS   
Sbjct: 193 ALHD-VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGTK------FSYCFGSLDD 242

Query: 279 -----GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------- 322
                  +  GD G+   G+T   L   +  Y +TI  +SV G  +  +           
Sbjct: 243 PSYPHNVLVLGDDGANILGDTT-PLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTG 301

Query: 323 -FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCY--VLSPNQT 376
               I D+G S T L + AY  +        + +    + +  D+    CY   L  +  
Sbjct: 302 LGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLV 361

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNI 435
              +P+V      G    ++   V +   P    ++CL V    N+N IG     + NI
Sbjct: 362 ESGFPIVTFHFSDGAELSLDVKSVFMKLSPN---VFCLAVTPG-NMNSIGATAQQSYNI 416


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 129/340 (37%), Gaps = 51/340 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G PA + +VA+D  +D  W+PC  C  C     S          +SP  SST 
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 151

Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             VPC S  C       CP+  GS+C + + Y +    +   L +D L L  +      V
Sbjct: 152 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 203

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
               +FGC RV +G   +   P GL G G    S   +   +    + FS C      S+
Sbjct: 204 VVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSN 258

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
            +G +  G  G P + +T   L   H P+ Y + +  + VG   V    SA         
Sbjct: 259 FSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGS 318

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I D+GT FT L  P Y  + + F    +           F+ CY           P V
Sbjct: 319 GTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY-----NVTVSVPTV 371

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
                G     + +  V++ S   G+    +    SD VN
Sbjct: 372 TFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVN 411


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 138/329 (41%), Gaps = 59/329 (17%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS-- 164
           N+S+GQP +  +V +DTGSD+ W+ C  C +C + L           ++ P+ SST S  
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGL---------LFDPSKSSTFSPL 154

Query: 165 -KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
            K PC+   C             P+ V Y  + T S  F  + V+   TDE  S+  D  
Sbjct: 155 CKTPCDFEGCRCDP--------IPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISD-- 204

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
           + FGCG    G   D    NG+ GL     S+ + L  +      FS C G+        
Sbjct: 205 VLFGCGH-NIGHDTD-PGHNGILGLNNGPDSLVTKLGQK------FSYCIGNLADPYYNY 256

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AIF 327
            ++  G+        TPF +      Y +T+  +SVG   ++     FE         I 
Sbjct: 257 HQLILGEGADLEGYSTPFEVYNGF--YYVTMEGISVGEKRLDIAPETFEMKENRAGGVII 314

Query: 328 DSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           D+G++ T+L D  +  +S E  N L    R+ +    P+  C+  S ++    +PVV   
Sbjct: 315 DTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFH 374

Query: 387 MKGG-------GPFF--VNDPIVIVSSEP 406
              G       G FF  +ND +  ++  P
Sbjct: 375 FSDGADLALDSGSFFNQLNDNVFCMTVGP 403


>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
          Length = 507

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 154/344 (44%), Gaps = 55/344 (15%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + VG    +F+V +DTGS L  +P + C +CV              +Y P  SSTS+K
Sbjct: 124 TQIIVGNT--TFLVQVDTGSLLMAIPLEGCNTCVESR----------PVYHP--SSTSTK 169

Query: 166 VPCNSTLCELQKQCP------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           V C+S  C+     P      S+G +C +Q+RY  DG+  +G++ EDV++LA        
Sbjct: 170 VACSSDQCKGSGSTPPSCSRTSSGESCDFQIRY-GDGSHVSGYIYEDVVNLA-------G 221

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VP----SILANQGLIPNSFSMCFG 274
           +  + +FG    +TG F +    +G+ G G   +S VP    S++++ GL  N F M   
Sbjct: 222 LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSDLGL-KNQFGMLLN 279

Query: 275 SDGTGRISFGDKGSP-----------GQGETPF-SLRQTHPTYNITITQVSVGGNAVNFE 322
            +G G +S G+  +             Q  TPF S++ T     I I   ++ G+ +  E
Sbjct: 280 YEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKST----GIRINDYTIPGSKLGQE 335

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSG++   L   AY Q+   F +     +    +   F+     S +    ++P 
Sbjct: 336 --VIVDSGSTALSLASGAYDQLRNYFQTHYCSIQGVCENPNIFQGSICYSSDDVLSKFPT 393

Query: 383 VNLTMKGGGPFFVNDPIVIVSSE-PKGLYLYCLGVVKSDNVNII 425
           +  T  GG    +     +V +    G Y YC  + ++D+   I
Sbjct: 394 LYFTFDGGVQVAIPPKNYLVKAPLTNGKYGYCFMIERADSTMTI 437


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 104/397 (26%), Positives = 160/397 (40%), Gaps = 62/397 (15%)

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           R+R  RL+   L A  + +       GN  + +          +++G P  ++   LDTG
Sbjct: 67  RNRLQRLQAMALVASSSSEIEAPVLPGNGEFLMK---------LAIGTPPETYSAILDTG 117

Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
           SDL W  C  C  C H             I+ P  SS+ SK+ C+S LCE   Q  S  +
Sbjct: 118 SDLIWTQCKPCTQCFHQSTP---------IFDPKKSSSFSKLSCSSQLCEALPQS-SCNN 167

Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPN 243
            C Y   Y  D + + G L  + L         K+    ++FGCG    GS F  GA   
Sbjct: 168 GCEYLYSY-GDYSSTQGILASETLTFG------KASVPNVAFGCGADNEGSGFSQGA--- 217

Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT-------GRISFGDKGSPGQGETP 295
           GL GLG    S+ S L         FS C  + D T       G ++  +  S     TP
Sbjct: 218 GLVGLGRGPLSLVSQLKEP-----KFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTP 272

Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
                 HP+ Y +++  +SVG   +  + S            I DSGT+ TYL + A+  
Sbjct: 273 LIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNL 332

Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
           +++ F +      ++S S    + C+ L    TN E P +     G       +  +I  
Sbjct: 333 VAKEFTAKINLPVDSSGST-GLDVCFTLPSGSTNIEVPKLVFHFDGADLELPAENYMIGD 391

Query: 404 SEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
           S    + + CL +  S  ++I G       N+ + H+
Sbjct: 392 SS---MGVACLAMGSSSGMSIFGNVQ--QQNMLVLHD 423


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 93/343 (27%), Positives = 143/343 (41%), Gaps = 38/343 (11%)

Query: 99  NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
            S+G  +Y T + +G PA  +I+ +DTGS L WL   C  C    +  SG V D     P
Sbjct: 110 TSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWL--QCSPCRVSCHRQSGPVFD-----P 162

Query: 158 NTSSTSSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
            TSS+ + V C+S  C+      L     S  + C YQ  Y  D + S G+L +D +   
Sbjct: 163 KTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASY-GDSSFSVGYLSKDTVSFG 221

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
            +            +GCG+   G F   A   GL GL  +K S+   LA    +  SFS 
Sbjct: 222 ANSVP------NFYYGCGQDNEGLFGRSA---GLMGLARNKLSLLYQLAPT--LGYSFSY 270

Query: 272 CFGS-DGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
           C  S   +G +S G     G   TP  S       Y I+++ ++V G  +    S     
Sbjct: 271 CLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSL 330

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT  T L    YT +S+   +  K   + + +    + C+    ++     P V
Sbjct: 331 PTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLR-AVPAV 389

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++   GG    ++   ++V  +       CL    + +  IIG
Sbjct: 390 SMAFSGGATLKLSAGNLLVDVDGA---TTCLAFAPARSAAIIG 429


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 91/352 (25%), Positives = 139/352 (39%), Gaps = 53/352 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P+   ++ LDTGSD+ WL C  C  C       SG V D     P  SS+ 
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCY----DQSGPVFD-----PRRSSSY 190

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             V C + LC       C      C YQV Y  DG+++ G    + L  A   +      
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLRRRACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 244

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
           +R++ GCG    G F+  A   GL        S P+ ++ +     SFS C         
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRR--YGKSFSYCLVDRTSSSS 299

Query: 274 ----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAV--------- 319
                   +  ++FG   +     TP        T Y + +  +SVGG  V         
Sbjct: 300 SGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLR 359

Query: 320 ----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
                     I DSGTS T L  P+Y+ + + F + A   R +      F+ CY L   +
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRK 419

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
              + P V++   GG    +     ++  + +G   +C     +D  V+IIG
Sbjct: 420 V-VKVPTVSMHFAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 468


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 149/366 (40%), Gaps = 48/366 (13%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
            +LG  +Y   V +G P   + V  DTGSD  W+ C  CV   +             ++ 
Sbjct: 173 RALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C +  C        +G +C Y V+Y  DG+ S GF   D L L++ +  
Sbjct: 225 PARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F + A   GL GLG  KTS+P    ++      F+ C    
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333

Query: 275 SDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
           S GTG + FG            TP  L    PT Y + +T + VGG  ++   S      
Sbjct: 334 STGTGYLDFGAGSLAAASARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYP 381
            I DSGT  T L   AY+ +   F +       K+  + S L  + CY  +   +    P
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLL--DTCYDFT-GMSQVAIP 449

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIAN 433
            V+L  +GG    V+   ++ ++    + L          +G+V +  +   G  Y I  
Sbjct: 450 TVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGK 509

Query: 434 NISLFH 439
            +  F+
Sbjct: 510 KVVGFY 515


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 136/332 (40%), Gaps = 41/332 (12%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G P +  +  +DTGS L WL C  C +C            +  ++ P  SST     C+
Sbjct: 95  IGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQ---------ETPLFEPLKSSTYKYATCD 145

Query: 170 STLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRI 224
           S  C L    Q+ C   G  C Y + Y  D + S G L  + L   +T   Q+ S  + I
Sbjct: 146 SQPCTLLQPSQRDCGKLG-QCIYGIMY-GDKSFSVGILGTETLSFGSTGGAQTVSFPNTI 203

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
            FGCG     +        G+ GLG    S+ S L  Q  I + FS C   + S  T ++
Sbjct: 204 -FGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKL 260

Query: 282 SFGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV---NFEFSAIFDSGTSFT 334
            FG +    + G   TP  ++ + PTY  + +  V++G   V     + + + DSGT  T
Sbjct: 261 KFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLT 320

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
           YL +  Y     +       K      DL  P + C+   PN+ N   P +     G   
Sbjct: 321 YLENTFYNNFVASLQETLGVKL---LQDLPSPLKTCF---PNRANLAIPDIAFQFTGASV 374

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
                 ++I  ++     + CL VV S  + I
Sbjct: 375 ALRPKNVLIPLTDSN---ILCLAVVPSSGIGI 403


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 49/299 (16%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G PA + ++ALDT +D  W+PC  C+ C               ++S + SS+   +PC 
Sbjct: 109 IGTPAQTLLLALDTSNDAAWIPCSGCIGCPST-----------TVFSSDKSSSFRPLPCQ 157

Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
           S  C        +GS C + + Y S    +   LV+D L LATD   S       +FGC 
Sbjct: 158 SPQCNQVPNPSCSGSACGFNLTYGSSTVAAD--LVQDNLTLATDSVPS------YTFGCI 209

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGD 285
           R  TGS +          LG+ +  +  +  +Q L  ++FS C  S    + +G +  G 
Sbjct: 210 RKATGSSVPPQG-----LLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGP 264

Query: 286 KGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
              P + +    LR    +  Y + +  + VG   V+   SA           + DSGT+
Sbjct: 265 VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTT 324

Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCY---VLSPNQTNFEYPVVNLTM 387
           FT L  PAYT + + F    +  R  + S L  F+ CY   ++SP  T F +  +N+T+
Sbjct: 325 FTRLVAPAYTAVRDEFRR--RVGRNVTVSSLGGFDTCYTVPIISPTIT-FMFAGMNVTL 380


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/295 (30%), Positives = 131/295 (44%), Gaps = 46/295 (15%)

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G P  SF   LDTGS++ W+PC+ C  C      SS Q      + P+ SST + + C S
Sbjct: 131 GTPPQSFYTVLDTGSNIAWIPCNPCSGC------SSKQ----QPFEPSKSSTYNYLTCAS 180

Query: 171 TLCELQKQCPSAGS--NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
             C+L + C  + +  NC    RY   G  S    V+++L   T    S+ V++ + FGC
Sbjct: 181 QQCQLLRVCTKSDNSVNCSLTQRY---GDQSE---VDEILSSETLSVGSQQVENFV-FGC 233

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFG 284
                G  L    P+ L G G +  S  S  A   L  ++FS C    F S  TG +  G
Sbjct: 234 SNAARG--LIQRTPS-LVGFGRNPLSFVSQTAT--LYDSTFSYCLPSLFSSAFTGSLLLG 288

Query: 285 DKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEF-----------SAIFDSG 330
            +    QG   TP      +P+ Y + +  +SVG   V+                I DSG
Sbjct: 289 KEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSG 348

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           T  T L +PAY  + ++F S        S +DL F+ CY  +    + E+P++ L
Sbjct: 349 TVITRLVEPAYNAMRDSFRSQLSNLTMASPTDL-FDTCY--NRPSGDVEFPLITL 400


>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 407

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 148/355 (41%), Gaps = 52/355 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +  N+++G P  ++ + +DTGSDL W+ CD  C  C    +           Y P+
Sbjct: 45  LGY-YSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDRQ---------YKPH 94

Query: 159 TSSTSSKVPCNSTLCELQKQCPSA-----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
                + V C   LC   +  P+         C Y+V Y   G+ S G LV D++ L   
Sbjct: 95  ----GNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGS-SLGVLVRDIIPLKL- 148

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNSF 269
                   S ++FGCG  QT     G  P     G+ GLG  + S+ S L ++GLI N  
Sbjct: 149 -TNGTLTHSMLAFGCGYDQTHV---GHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVV 204

Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFE-FS 324
             C    G G + FGD+  P  G     + Q+  +    Y      +   G A + +   
Sbjct: 205 GHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGLE 264

Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYV-------LSPNQT 376
             FDSG+S+TY N  A+  + +   N +  +    +T D     C+        L    +
Sbjct: 265 LTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHDVTS 324

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-----NVNIIG 426
           NF+  V++ T      F V     ++ ++   +   CLG++        N NIIG
Sbjct: 325 NFKPLVLSFTKSKNSLFQVPPEAYLIVTKHGNV---CLGILDGTEIGLGNTNIIG 376


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 129/340 (37%), Gaps = 51/340 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G PA + +VA+D  +D  W+PC  C  C     S          +SP  SST 
Sbjct: 83  YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 132

Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             VPC S  C       CP+  GS+C + + Y +    +   L +D L L  +      V
Sbjct: 133 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 184

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
               +FGC RV +G   +   P GL G G    S   +   +    + FS C      S+
Sbjct: 185 VVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSN 239

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
            +G +  G  G P + +T   L   H P+ Y + +  + VG   V    SA         
Sbjct: 240 FSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGS 299

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I D+GT FT L  P Y  + + F    +           F+ CY           P V
Sbjct: 300 GTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY-----NVTVSVPTV 352

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
                G     + +  V++ S   G+    +    SD VN
Sbjct: 353 TFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVN 392


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 83/336 (24%), Positives = 138/336 (41%), Gaps = 52/336 (15%)

Query: 79  AQGNDKTPLTFSAGND-TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV 136
           ++ N  TP + SA     Y +   G  ++  +S+G P +  +V  DTGSDL W+ C  C 
Sbjct: 67  SRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQ 126

Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QKQCPSAG--SNCPYQV 190
            C    +          I++P  SST  +V C +  C       + C + G    C Y  
Sbjct: 127 ECYKQKSP---------IFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSY 177

Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
            Y  D + + G+L  +   + +     +     ++FGCG    G+F +  +     G+  
Sbjct: 178 SY-GDHSFTMGYLATERFIIGSTNNSIQ----ELAFGCGNSNGGNFDEVGS-----GIVG 227

Query: 251 DKTSVPSILANQGL-IPNSFSMCF------GSDGTGRISFGDK----GSPGQGETPFSLR 299
                 S+++  G  I N FS C        +   G+I FGD     GS     TP   +
Sbjct: 228 LGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSK 287

Query: 300 QTHPTYNITITQVSVGGNAVNFEFS----------AIFDSGTSFTYLNDPAYTQISETFN 349
           +    Y +T+  +SVG   + +E S           I DSGT+ T+L+   Y ++ E   
Sbjct: 288 EPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKL-ELVL 346

Query: 350 SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
             A E    S  +  F  C+    ++   E P++ +
Sbjct: 347 EKAVEGERVSDPNGIFSICF---RDKIGIELPIITV 379


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 151/359 (42%), Gaps = 65/359 (18%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           N+S+G P L F V +DTGS+L W  C  C  C         +     +  P  SST S++
Sbjct: 94  NISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFP-------RPTPAPVLQPARSSTFSRL 146

Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           PCN + C+      + +  +A + C Y   Y S  T   G+L  + L +           
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTVG------DGTF 198

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----G 277
            +++FGC    T + +D ++  G+ GLG    S+ S LA        FS C  SD    G
Sbjct: 199 PKVAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAV-----GRFSYCLRSDMADGG 248

Query: 278 TGRISFGDKGSPGQG---------ETPFSLRQTHPTYNIT-----ITQVSVGGNAVNFEF 323
              I FG      +          + P+  R TH   N+T      T++ V G+   F  
Sbjct: 249 ASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQ 308

Query: 324 SA-----IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF--EYCYVLSPNQ 375
           +      I DSGT+ TYL    Y  + + F S +A   + T  S  P+  + CY  S   
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGG 368

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPI----VIVSSEPKG-LYLYCLGVVKSDN---VNIIG 426
                 V  L ++  G    N P+      V ++ +G + + CL V+ + +   ++IIG
Sbjct: 369 GGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIG 427


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + I+ +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                SFGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + ++    + + +T +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSASWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 98/346 (28%), Positives = 153/346 (44%), Gaps = 53/346 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ VG P  +  +  DTGSD+ WL C  C SC        GQ     +++P+ SST 
Sbjct: 81  YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY-------GQTDP--LFNPSFSSTF 131

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             + C S+LC+  L + C    + C YQV Y  DG+ + G    + L   ++   S    
Sbjct: 132 QSITCGSSLCQQLLIRGCRR--NQCLYQVSY-GDGSFTVGEFSTETLSFGSNAVNS---- 184

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F   A    L GLG    S PS +    L  + FS C     S G+
Sbjct: 185 --VAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--------- 325
             + FG++      +  F+   T+P     Y + +  + VGG +VN    +         
Sbjct: 238 VPLIFGNQAVASNAQ--FTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGN 295

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
              I DSGT+ T L   AY  + + F + +  + + TS   L F+ CY LS  +++   P
Sbjct: 296 GGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSL-FDTCYDLS-GRSSIMLP 353

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
            V+    GG    +    ++V  +  G   YCL     S+N +IIG
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIG 397


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 138/318 (43%), Gaps = 39/318 (12%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           +SVG P    I   DTGSD+ W  C  C +C            D  +++P+ S+T  KV 
Sbjct: 89  LSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQ---------DLPMFNPSKSTTYRKVS 139

Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           C+S +C    +    S   +C Y + Y  D + S G    D L + +   +  +   R +
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISY-GDNSHSQGDFAVDTLTMGSTSGRVVAF-PRTA 197

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD--GTGR 280
            GCG    GSF   A  +G+ GLG+   S+   + +   +   FS C    G+D  G+ +
Sbjct: 198 IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNK 253

Query: 281 ISFGDKGS---PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIF 327
           ++FG   +    G   TP  +     + Y++ +  VSVG N   +         + + I 
Sbjct: 254 LNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIII 313

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L    Y   ++  ++    +R    +    EYC+  + +  +++ P + +  
Sbjct: 314 DSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF-LEYCFETTTD--DYKVPFIAMHF 370

Query: 388 KGGGPFFVNDPIVIVSSE 405
           +G       + ++I  S+
Sbjct: 371 EGANLRLQRENVLIRVSD 388


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 140/337 (41%), Gaps = 43/337 (12%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
            N SVG+P +  +V +DTGSDL W+ C  C  C               I+ P+ SST   
Sbjct: 61  VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 111

Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           +  +S +C    Q      N C Y   Y +DG+ S+G L  + +   T ++ + +V S +
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 169

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
            FGCG    G F DG   +G+ GL     S+ S L ++      FS C G          
Sbjct: 170 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 221

Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
           ++  GD        TPF     +  Y +T+  +SVG   ++            +   + D
Sbjct: 222 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SGT+ T+L    +  +S     L +   ++     +P   CY    N+    +P +    
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 339

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
             G    ++   + V    K   ++CL V++S+  NI
Sbjct: 340 AEGADLVLDANSLFVQ---KNQDVFCLAVLESNLKNI 373


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/356 (24%), Positives = 143/356 (40%), Gaps = 50/356 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ + VG PA    + LDTGSD+ W+ C+ C  C    +          +++P +SST 
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDP---------VFNPTSSSTY 212

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C L +      + C YQV Y  DG+ + G L  D +      K +      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKIND----- 266

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F   A         +        + NQ +   SFS C     +G+ S 
Sbjct: 267 VALGCGHDNEGLFTGAAG-------LLGLGGGALSITNQ-MKATSFSYCLVDRDSGKSSS 318

Query: 284 GDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
            D  S     G    P    Q   T Y + ++  SVGG  V      F+  A      I 
Sbjct: 319 LDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVIL 378

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D GT+ T L   AY  + + F  L    ++ ++S   F+ CY  S + ++ + P V    
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFS-SLSSVKVPTVAFHF 437

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNI 435
            GG    +     ++  +  G + +      S +++IIG          Y +AN I
Sbjct: 438 TGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSS-SLSIIGNVQQQGTRITYDLANKI 492


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 103/410 (25%), Positives = 168/410 (40%), Gaps = 61/410 (14%)

Query: 59  YYSALAHRDRYFRLRG--RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
           +Y+ +  RD + R+R   R L   G+    +  S G   + L      +   + +G PA 
Sbjct: 84  HYTGILRRD-HNRVRSIHRRLTGAGDTAATIPASLGLAFHSLE-----YVVTIGIGTPAR 137

Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
           +F V  DTGSDL W+ C  C    +             ++ P+ SST   VPC +  C++
Sbjct: 138 NFTVLFDTGSDLTWVQCKPCTDSCYQQQEP--------LFDPSKSSTYVDVPCGTPQCKI 189

Query: 176 --QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
              +     G+ C Y V+Y  D +++ G L ++   L+     +  V     FGC   + 
Sbjct: 190 GGGQDLTCGGTTCEYSVKY-GDQSVTRGNLAQEAFTLSPSAPPAAGV----VFGCSH-EY 243

Query: 234 GSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKG 287
            S + GA       GL GLG   +S+ S    +G   + FS C    G+  G ++ G   
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGNSGDVFSYCLPPRGSSAGYLTIG-AA 301

Query: 288 SPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLN 337
           +P Q    F+       Q    Y + +  +SV G A+  + SA     + DSGT  T++ 
Sbjct: 302 APPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVITHMP 361

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLP------FEYCYVLSPNQTNFEYPVVNLTMKGGG 391
             AY  + + F      +     + LP       + CY ++ +      P V L   GG 
Sbjct: 362 AAAYYVLRDEF-----RRHMGGYTMLPEGHVESLDTCYDVTGHDV-VTAPPVALEFGGGA 415

Query: 392 PFFVNDPIVI----VSSEPKGLYLYCLGVVKSD--NVNIIGREYPIANNI 435
              V+   ++    V +  + L L CL  V ++     IIG     A N+
Sbjct: 416 RIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNV 465


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 140/337 (41%), Gaps = 43/337 (12%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
            N SVG+P +  +V +DTGSDL W+ C  C  C               I+ P+ SST   
Sbjct: 61  VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 111

Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           +  +S +C    Q      N C Y   Y +DG+ S+G L  + +   T ++ + +V S +
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 169

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
            FGCG    G F DG   +G+ GL     S+ S L ++      FS C G          
Sbjct: 170 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 221

Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
           ++  GD        TPF     +  Y +T+  +SVG   ++            +   + D
Sbjct: 222 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SGT+ T+L    +  +S     L +   ++     +P   CY    N+    +P +    
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 339

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
             G    ++   + V    K   ++CL V++S+  NI
Sbjct: 340 AEGADLVLDANSLFVQ---KNQDVFCLAVLESNLKNI 373


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 148/349 (42%), Gaps = 51/349 (14%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +++G P L +    DTGSDL W  C  C S C               +Y+P++S+T + +
Sbjct: 96  LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 146

Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           PCNS+L             P  G  C Y V Y S  T  + F   +     +       V
Sbjct: 147 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWT--SVFQGSETFTFGSTPAGHARV 204

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
              I+FGC    +G   + ++ +GL GLG  + S    L +Q  +P  FS C      ++
Sbjct: 205 PG-IAFGCSTASSG--FNASSASGLVGLGRGRLS----LVSQLGVPK-FSYCLTPYQDTN 256

Query: 277 GTGRISFGDK----GSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVN-----FEF 323
            T  +  G      G+ G   TPF +   T P    Y + +T +S+G  A++     F  
Sbjct: 257 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 316

Query: 324 SA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
           +A      I DSGT+ T L + AY Q+     SL        ++D   + C++L P+ T+
Sbjct: 317 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFML-PSSTS 375

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               + ++T+   G   V      + S+  GL+   +       VNI+G
Sbjct: 376 APPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILG 424


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 138/331 (41%), Gaps = 47/331 (14%)

Query: 59  YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND--TYRLNSLGFLHYTNVSVGQPAL 116
           Y  AL H D         L  +   ++ L   +G D  + RL+S+   +   +++G P +
Sbjct: 18  YRLALTHVDSKIGFTKTELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPPV 77

Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-- 173
            F+   DTGSDL W  C  C  C            D  +Y P+ SST S VPC+S  C  
Sbjct: 78  PFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASSTFSPVPCSSATCLP 128

Query: 174 -ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD-EKQSKSVDSRISFGCGRV 231
               + C +  S C Y   Y SDG  S G L  + L + +    Q+ SV S ++FGCG  
Sbjct: 129 TWRSRNCSNPSSPCRYIYSY-SDGAYSVGILGTETLTIGSSVPGQTVSVGS-VAFGCGTD 186

Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFGDKG 287
             G  L+     G  GLG       S+LA  G+    FS C    F S        G   
Sbjct: 187 NGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSTMDSPFFLGTLA 238

Query: 288 --SPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSA------IFDSG 330
             +PG G    TP      +P+ Y + +  +S+G   +      F+  A      + DSG
Sbjct: 239 ELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSG 298

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
           T+FT L    + ++ +    L  +    ++S
Sbjct: 299 TTFTILAKSGFREVVDRVAQLLGQPPVNASS 329


>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 873

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 115/264 (43%), Gaps = 33/264 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           HY  + +G P     V LDTGS L   PCD CV C        G   D     P   +T 
Sbjct: 46  HYAELYIGIPPQRASVILDTGSGLTAFPCDKCVDC--------GTHTD-----PKFDATK 92

Query: 164 SKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQSKSVD 221
           S    N   C+ ++ C +   N C    RY S+G+M    +++D++ +   D  +++ + 
Sbjct: 93  S-TSINFVQCKYEEGCDTCRDNLCVIHQRY-SEGSMWEAVVMQDLIWVGNVDSDRAEMIM 150

Query: 222 S----RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSD 276
                R  FGC   +TG F+     NG+ GLG+ + ++ + +     +  + F++CFG  
Sbjct: 151 RRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQK 209

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYN--ITITQVSVGGNAVNFEFS-------AIF 327
           G   +  G   S    +  ++    H T N  I +  V +GG ++  +         AI 
Sbjct: 210 GGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAIV 269

Query: 328 DSGTSFTYLNDPAYTQISETFNSL 351
           DSGT+ TY    A T   E F  +
Sbjct: 270 DSGTTDTYFPSAAATPFQEAFKRI 293


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 140/337 (41%), Gaps = 43/337 (12%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
            N SVG+P +  +V +DTGSDL W+ C  C  C               I+ P+ SST   
Sbjct: 93  VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 143

Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           +  +S +C    Q      N C Y   Y +DG+ S+G L  + +   T ++ + +V S +
Sbjct: 144 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 201

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
            FGCG    G F DG   +G+ GL     S+ S L ++      FS C G          
Sbjct: 202 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 253

Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
           ++  GD        TPF     +  Y +T+  +SVG   ++            +   + D
Sbjct: 254 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 311

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SGT+ T+L    +  +S     L +   ++     +P   CY    N+    +P +    
Sbjct: 312 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 371

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
             G    ++   + V    K   ++CL V++S+  NI
Sbjct: 372 AEGADLVLDANSLFVQ---KNQDVFCLAVLESNLKNI 405


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 146/355 (41%), Gaps = 52/355 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
           L  L+Y   +VG  A    V +DT S+L W+ C  C SC    +          ++ P++
Sbjct: 115 LRTLNYV-ATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDP---------LFDPSS 164

Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSN-----------CPYQVRYLSDGTMSTGFLVEDVL 208
           S + + VPCNS+ C+  +   +AG++           C Y + Y  DG+ S G L  D L
Sbjct: 165 SPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSY-RDGSYSRGVLARDKL 223

Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
            LA  + +         FGCG    G+   G   +GL GLG    S+ S   +Q      
Sbjct: 224 RLAGQDIEG------FVFGCGTSNQGAPFGGT--SGLMGLGRSHVSLVSQTMDQ--FGGV 273

Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPF--------SLRQTHPTYNITITQVSVGGN 317
           FS C     S  +G +  GD  S  +  TP         S     P Y + +T ++VGG 
Sbjct: 274 FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQ 333

Query: 318 AVNFE-FSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
            V    FSA   I DSGT  T L    Y  +   F S   E  +     +  + C+ L+ 
Sbjct: 334 EVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSI-LDTCFNLT- 391

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVI--VSSEPKGLYLYCLGVVKSDNVNIIG 426
                + P +    +G     V+   V+  VSS+   + L    +    + +IIG
Sbjct: 392 GLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIG 446


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 91/341 (26%), Positives = 144/341 (42%), Gaps = 44/341 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ + VG P    ++ LDTGSD+ W+ C+ C  C    +          IY+P  SS+ 
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDP---------IYNPALSSSY 195

Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             V C + LC +L     S   +C YQV Y  DG+ + G    + L L     Q+     
Sbjct: 196 KLVGCQANLCQQLDVSGCSRNGSCLYQVSY-GDGSYTQGNFATETLTLGGAPLQN----- 249

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNSFSMCF---GSDGT 278
            ++ GCG    G F+  A   GL        S PS L ++ G I   FS C     S+ +
Sbjct: 250 -VAIGCGHDNEGLFVGAAGLLGLG---GGSLSFPSQLTDENGKI---FSYCLVDRDSESS 302

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFS-----------A 325
             + FG    P        L+ +     Y ++++ +SVGG  ++   S            
Sbjct: 303 STLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGV 362

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT+ T L   AY  + + F +  K    T    L F+ CY LS  ++  + P V  
Sbjct: 363 IVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL-FDTCYDLSSKES-VDVPTVVF 420

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              GGG   +     +V  +  G + +      S +++I+G
Sbjct: 421 HFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSS-SLSIVG 460


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 159/410 (38%), Gaps = 72/410 (17%)

Query: 40  DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN 99
           + VKG +  D L ++     +  +++ D     R +G        TP        + R +
Sbjct: 56  EAVKGFVKRDKLRRQRMNQRWGVVSNYDS----RRKGFEMT---TTPAEVEMPMHSGRDD 108

Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           +LG  ++  V VG P   F + +DTGS+  WL C          S S + +         
Sbjct: 109 ALG-EYFAEVKVGSPGQRFWLVVDTGSEFTWLNC----------SKSFEAV--------- 148

Query: 160 SSTSSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQ 216
             T +   C   L EL     CP     C Y + Y +DG+ + GF   D + +  T+ KQ
Sbjct: 149 --TCASRKCKVDLSELFSLSVCPKPSDPCLYDISY-ADGSSAKGFFGTDSITVGLTNGKQ 205

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMC 272
            K   + ++ GC    T S L+G   N    G+ GLG  K S     AN+      FS C
Sbjct: 206 GKL--NNLTIGC----TKSMLNGVNFNEETGGILGLGFAKDSFIDKAANK--YGAKFSYC 257

Query: 273 FGSDGTGRISFGDKGSPGQGETPF--SLRQTH-----PTYNITITQVSVGGNAV------ 319
                + R    +    G         +R+T      P Y + +  +S+GG  +      
Sbjct: 258 LVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQV 317

Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQ 375
              N E   + DSGT+ T L  PAY  + E    SL K KR T       E+C+    + 
Sbjct: 318 WDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCF----DA 373

Query: 376 TNFE---YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
             F+    P +     GG  F       I+   P    + C+G+V  D +
Sbjct: 374 EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAP---LVKCIGIVPIDGI 420


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 148/350 (42%), Gaps = 51/350 (14%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +++G P L +    DTGSDL W  C  C S C               +Y+P++S+T + +
Sbjct: 36  LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 86

Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           PCNS+L             P  G  C Y V Y S  T  + F   +     +       V
Sbjct: 87  PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWT--SVFQGSETFTFGSTPAGHARV 144

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
              I+FGC    +G   + ++ +GL GLG  + S+ S L     +P  FS C      ++
Sbjct: 145 PG-IAFGCSTASSG--FNASSASGLVGLGRGRLSLVSQLG----VPK-FSYCLTPYQDTN 196

Query: 277 GTGRISFGDK----GSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVN-----FEF 323
            T  +  G      G+ G   TPF +   T P    Y + +T +S+G  A++     F  
Sbjct: 197 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 256

Query: 324 SA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
           +A      I DSGT+ T L + AY Q+     SL        ++D   + C++L P+ T+
Sbjct: 257 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFML-PSSTS 315

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
               + ++T+   G   V      + S+  GL+   +       VNI+G 
Sbjct: 316 APPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGN 365


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G P+ + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + R+    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              +FDSG+  +Y+ D A + +S+    L    R  +  +     CY +       + P 
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++L    G  F +    V V    +   ++CL    +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 123/305 (40%), Gaps = 44/305 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG P     V +D+GSD+ W+ C  C  C H  +          ++ P  S++ 
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDP---------VFDPADSASF 192

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             VPC+S++CE  +        C Y+V Y  DG+ + G L  + L         ++V   
Sbjct: 193 MGVPCSSSVCERIENAGCHAGGCRYEVMY-GDGSYTKGTLALETLTFG------RTVVRN 245

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF---GSDGT 278
           ++ GCG    G F+  A   GL G  M        L  Q  G    +FS C    G+D  
Sbjct: 246 VAIGCGHRNRGMFVGAAGLLGLGGGSMS-------LVGQLGGQTGGAFSYCLVSRGTDSA 298

Query: 279 GRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------A 325
           G + FG    P G    P       P+ Y I ++ V VGG  V      F+ +       
Sbjct: 299 GSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGV 358

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           + D+GT+ T +   AY    + F          S   + F+ CY L+    +   P V+ 
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSI-FDTCYNLN-GFVSVRVPTVSF 416

Query: 386 TMKGG 390
              GG
Sbjct: 417 YFAGG 421


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 82/308 (26%), Positives = 127/308 (41%), Gaps = 40/308 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   +SVG P  S +   DTGSD+ W  C   S  +  N+         ++ P+ S+T  
Sbjct: 83  YLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAP--------MFDPSKSTTYK 134

Query: 165 KVPCNSTLCELQKQCPSAG--SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            V C+S +C       S    S C Y + Y  D + S G L  D + + +   +  +   
Sbjct: 135 NVACSSPVCSYSGDGSSCSDDSECLYSIAY-GDDSHSQGNLAVDTVTMQSTSGRPVAF-P 192

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG--- 279
           R   GCG    G+F   A  +G+ GLG    S+ + L         FS C    GTG   
Sbjct: 193 RTVIGCGHDNAGTF--NANVSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTGSTN 248

Query: 280 ---RISFGDKGS---PGQGETP-FSLRQTHPTYNITITQVSVGGNAVNF---------EF 323
              +++FG   +    G   TP +S  Q    Y++ +  VSVG    NF         E 
Sbjct: 249 DSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGES 308

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC-YVLSPNQTNFEYPV 382
           + I DSGT+ TYL     + +  +F S   +      +  P E+  Y  +    ++E P 
Sbjct: 309 NIIIDSGTTLTYLP----SALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPP 364

Query: 383 VNLTMKGG 390
           V +  +G 
Sbjct: 365 VTMHFEGA 372


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 91/341 (26%), Positives = 140/341 (41%), Gaps = 43/341 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ W+ C+ C  C    +          I++P+ S++ 
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP---------IFNPSYSASF 207

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S V C+S +C            C Y+  Y  DG+ STG    + L   T         + 
Sbjct: 208 STVGCDSAVCSQLDAYDCHSGGCLYEASY-GDGSYSTGSFATETLTFGTTSV------AN 260

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A      GLG    S P+ +  Q    ++FS C     SD +G 
Sbjct: 261 VAIGCGHKNVGLFIGAAGLL---GLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGP 315

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
           + FG K  P G   TP       PT Y +++T +SVGG  ++      F           
Sbjct: 316 LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGF 375

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGT  T L   AY  + + F +   +   T    + F+ CY LS  Q     P V  
Sbjct: 376 IIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSI-FDTCYDLSGLQF-VSVPTVGF 433

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               G    +     ++  +  G + +      S +V+I+G
Sbjct: 434 HFSNGASLILPAKNYLIPMDTVGTFCFAFAPAAS-SVSIMG 473


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/348 (27%), Positives = 143/348 (41%), Gaps = 47/348 (13%)

Query: 99  NSLGFLHYTNVSVGQPALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIY 155
           N   FL   N+S+G P +  ++ +DTGSDL W   LPC C            Q I F  +
Sbjct: 74  NPAAFL--ANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYP----------QTIPF--F 119

Query: 156 SPNTSSTSSKVPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
            P+ SST     C S    + Q        NC Y +RY  D + + G L E+ L   T +
Sbjct: 120 HPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYHLRY-RDFSNTRGILAEEKLTFETSD 178

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
               S    I FGCG+  +G        +G+ GLG    S+  +  N G   + FS CFG
Sbjct: 179 DGLIS-KQNIVFGCGQDNSGF----TKYSGVLGLGPGTFSI--VTRNFG---SKFSYCFG 228

Query: 275 SDGT----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
           S         I     G+  +G+ TP  + Q    Y + +  +S G   ++ E       
Sbjct: 229 SLTNPTYPHNILILGNGAKIEGDPTPLQIFQDR--YYLDLQAISFGEKLLDIEPGTFQRY 286

Query: 323 ---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
                 + D+G S T L   AY  +SE  + L  E  R     D     CY  +     +
Sbjct: 287 RSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLY 346

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +PVV     GG    ++   + VSSE    +   + +   D++++IG
Sbjct: 347 GFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIG 394


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 122/307 (39%), Gaps = 44/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ WL C  C  C    +          ++ P  S T 
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADP---------VFDPTKSRTY 179

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + +PC + LC       C +    C YQV Y  DG+ + G    + L         ++  
Sbjct: 180 AGIPCGAPLCRRLDSPGCNNKNKVCQYQVSY-GDGSFTFGDFSTETLTF------RRTRV 232

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
           +R++ GCG    G F+  A      GLG  + S P     +      FS C      S  
Sbjct: 233 TRVALGCGHDNEGLFIGAAGLL---GLGRGRLSFPVQTGRR--FNQKFSYCLVDRSASAK 287

Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
              + FGD         TP        T Y + +  +SVGG+ V       F   A    
Sbjct: 288 PSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNG 347

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGTS T L  PAY  + + F   A   +  +   L F+ C+ LS   T  + P V
Sbjct: 348 GVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSL-FDTCFDLS-GLTEVKVPTV 405

Query: 384 NLTMKGG 390
            L  +G 
Sbjct: 406 VLHFRGA 412


>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Brachypodium distachyon]
          Length = 436

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/271 (27%), Positives = 117/271 (43%), Gaps = 33/271 (12%)

Query: 96  YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
           + L ++  L+   V +G P+  + +A  TGSD+ W+PC  C  C     +        ++
Sbjct: 67  FVLEAMPGLYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDC----PTPDDIGFSLDL 122

Query: 155 YSPNTSSTSSKVP-----CNSTLCELQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVED 206
           Y P  SSTSS++      C   L      C    S+G  C Y   Y      +TG+ V D
Sbjct: 123 YDPKNSSTSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSD 182

Query: 207 VLH--LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
            +H  +    +   S  + + FGC + ++G        +G+ G G D  S+ S L +QG 
Sbjct: 183 DIHFDIFMGNESFASSSASVIFGCSKSRSGHL----QADGVIGFGKDAPSLISQLNSQG- 237

Query: 265 IPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
           + ++FS C     DG G +   + G PG   T  SL  + P YN+ +  ++V    V  +
Sbjct: 238 VSHAFSRCLDDSDDGGGVLILDEVGEPGLEFT--SLVASRPCYNLNMKSIAVNNQNVPID 295

Query: 323 FS---------AIFDSGTSFTYLNDPAYTQI 344
            S            DSGTS  Y  D  Y  +
Sbjct: 296 SSLFTTSSTQGTFLDSGTSLAYFPDGVYDPV 326


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 128/302 (42%), Gaps = 41/302 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V +G+P+ +F + +DTGSD+ WL C  C  C   ++          I+ P +SS+ 
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDP---------IFDPASSSSF 210

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S++ C +  C           +C YQV Y  DG+ + G    + +        S SVD +
Sbjct: 211 SRLGCQTPQCRNLDVFACRNDSCLYQVSY-GDGSYTVGDFATETVSFG----NSGSVD-K 264

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A         +     P  L +Q +  +SFS C     S  +  
Sbjct: 265 VAIGCGHDNEGLFVGAAG-------LIGLGGGPLSLTSQ-IKASSFSYCLVNRDSVDSST 316

Query: 281 ISFGDKGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVN-----FEFSA------IFD 328
           + F           P F   +    Y + IT +SVGG  +      FE         I D
Sbjct: 317 LEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVD 376

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
            GT+ T L   AY  + +TF  L K+   TS   L F+ CY LS ++T+   P V     
Sbjct: 377 CGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFAL-FDTCYNLS-SRTSVRVPTVAFLFD 434

Query: 389 GG 390
           GG
Sbjct: 435 GG 436


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 100/347 (28%), Positives = 141/347 (40%), Gaps = 56/347 (16%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +VS+G PAL++   +DTGSDL W  C  CV C               ++ P++SST + V
Sbjct: 77  DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 127

Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           PC+S  C      +C SA S C Y   Y  D + + G L  +   LA      KS    +
Sbjct: 128 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 179

Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
            FGCG    G  F  GA   GL GLG    S+ S L   GL  + FS C  S D T    
Sbjct: 180 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 231

Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
                   IS     +     TP     + P+ Y +++  ++VG   ++   SA      
Sbjct: 232 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 291

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FE 379
                I DSGTS TYL    Y  + + F +          S +  + C+       +  E
Sbjct: 292 GTGGVIVDSGTSITYLEVQGYRALKKAFAA-QMALPAADGSGVGLDLCFRAPAKGVDQVE 350

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            P +     GG    +     +V     G    CL V+ S  ++IIG
Sbjct: 351 VPRLVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 395


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 100/347 (28%), Positives = 141/347 (40%), Gaps = 56/347 (16%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +VS+G PAL++   +DTGSDL W  C  CV C               ++ P++SST + V
Sbjct: 98  DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 148

Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           PC+S  C      +C SA S C Y   Y  D + + G L  +   LA      KS    +
Sbjct: 149 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 200

Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
            FGCG    G  F  GA   GL GLG    S+ S L   GL  + FS C  S D T    
Sbjct: 201 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 252

Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
                   IS     +     TP     + P+ Y +++  ++VG   ++   SA      
Sbjct: 253 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 312

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FE 379
                I DSGTS TYL    Y  + + F +          S +  + C+       +  E
Sbjct: 313 GTGGVIVDSGTSITYLEVQGYRALKKAFAA-QMALPAADGSGVGLDLCFRAPAKGVDQVE 371

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            P +     GG    +     +V     G    CL V+ S  ++IIG
Sbjct: 372 VPRLVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 416


>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
 gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
          Length = 681

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 136/340 (40%), Gaps = 45/340 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           HYT V  G P     V  DTGS L   PC  C  C H  +           +    SST 
Sbjct: 67  HYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCGHHTDQP---------FQAANSSTL 117

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDEKQSK 218
             + C        K+C      C     Y+ +G+     +VED+++L       D++   
Sbjct: 118 VHITCAQKSLFQCKECHVQSDTCGISQSYM-EGSSWKASVVEDIVYLGGESSFDDKEMRN 176

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDG 277
              +   FGC   + G F+   A +G+ GL   +  + + L  +  I  N FS+CF  +G
Sbjct: 177 RYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLCFTENG 235

Query: 278 TGRISFGD-KGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFSA------I 326
            G +S G    +  +GE  +    + R     YN+ +  + +GG ++N +  A      I
Sbjct: 236 -GTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYTRGHYI 294

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
            DSGT+ +YL     T+  + F  +A    +   S   F        N+     P + L 
Sbjct: 295 VDSGTTDSYLPRALKTEFLQMFKEIAGRDYQVGNSCKGF-------TNKDLASLPTIQLV 347

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYL-----YCLGVVKSDN 421
           M+  G     +  VI+   P+   L     YC G+  S+N
Sbjct: 348 MEAYGD---ENAEVILDVPPEQYLLESNGAYCGGIYLSEN 384


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 146/348 (41%), Gaps = 40/348 (11%)

Query: 60  YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSL---GFLHY-TNVSVGQPA 115
           +S+L+H DR      R L+      T L  +A N    L +    G   Y  +VS+G P 
Sbjct: 46  FSSLSHYDRLTNAFRRSLS---RSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPP 102

Query: 116 LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
           + +I   DTGSDL W    C+ C+     S        I+ P  S++ S VPCNS  C+ 
Sbjct: 103 VDYIGMADTGSDLMW--AQCLPCLKCYKQSR------PIFDPLKSTSFSHVPCNSQNCKA 154

Query: 176 --QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
                C + G  C Y   Y  D T + G L  + + +      S SV S I  GCG    
Sbjct: 155 IDDSHCGAQGV-CDYSYTY-GDQTYTKGDLGFEKITIG-----SSSVKSVI--GCGHESG 205

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGDKG--- 287
           G F   +    + GLG  + S+ S ++    I   FS C     S   G+I+FG      
Sbjct: 206 GGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVS 262

Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGN---AVNFEFSAIFDSGTSFTYLNDPAYTQI 344
            PG   TP   +     Y +T+  +S+G     A   + + I DSGT+ ++L    Y  +
Sbjct: 263 GPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKELYDGV 322

Query: 345 SETFNSLAKEKRETSTSDLPFEYCYVLSPN-QTNFEYPVVNLTMKGGG 391
             +   + K KR     +  ++ C+    N  T+   P++     GG 
Sbjct: 323 VSSLLKVVKAKRVKDPGNF-WDLCFDDGINVATSSGIPIITAQFSGGA 369


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 100/347 (28%), Positives = 141/347 (40%), Gaps = 56/347 (16%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +VS+G PAL++   +DTGSDL W  C  CV C               ++ P++SST + V
Sbjct: 108 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 158

Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           PC+S  C      +C SA S C Y   Y  D + + G L  +   LA      KS    +
Sbjct: 159 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 210

Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
            FGCG    G  F  GA   GL GLG    S+ S L   GL  + FS C  S D T    
Sbjct: 211 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 262

Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
                   IS     +     TP     + P+ Y +++  ++VG   ++   SA      
Sbjct: 263 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 322

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FE 379
                I DSGTS TYL    Y  + + F +          S +  + C+       +  E
Sbjct: 323 GTGGVIVDSGTSITYLEVQGYRALKKAFAA-QMALPAADGSGVGLDLCFRAPAKGVDQVE 381

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            P +     GG    +     +V     G    CL V+ S  ++IIG
Sbjct: 382 VPRLVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 426


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 130/315 (41%), Gaps = 54/315 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG PA +  + LDTGSD+ WL C  C +C +  ++         I+ P  S T 
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA---------IFDPKKSKTF 185

Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           + VPC S LC       +C +  S  C YQV Y  DG+ + G    + L           
Sbjct: 186 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSY-GDGSFTEGDFSTETLTF-----HGAR 239

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
           VD  +  GCG    G F+  A      GLG    S PS   N+      FS C       
Sbjct: 240 VD-HVPLGCGHDNEGLFVGAAGLL---GLGRGGLSFPSQTKNR--YNGKFSYCLVDRTSS 293

Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
              S     I FG+   P    + F+   T+P     Y + +  +SVGG+ V       F
Sbjct: 294 GSSSKPPSTIVFGNAAVP--KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 351

Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           +  A      I DSGTS T L  PAY  + + F  L   K + + S   F+ C+ LS   
Sbjct: 352 KLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFR-LGATKLKRAPSYSLFDTCFDLS-GM 409

Query: 376 TNFEYPVVNLTMKGG 390
           T  + P V     GG
Sbjct: 410 TTVKVPTVVFHFGGG 424


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 130/311 (41%), Gaps = 53/311 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ WL C  C +C    +          +++P  S + 
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDP---------VFNPVKSGSF 179

Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           +KV C + LC   ++  S G N    C YQV Y  DG+ +TG  V + L     + +   
Sbjct: 180 AKVLCRTPLC---RRLESPGCNQRQTCLYQVSY-GDGSYTTGEFVTETLTFRRTKVE--- 232

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
              +++ GCG    G F+  A   GL   G+   S      NQ      FS C      S
Sbjct: 233 ---QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ-----KFSYCLVDRSAS 284

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NFEFS- 324
                + FG+          F+   T+P     Y + +  +SVGG  V      +F+   
Sbjct: 285 SKPSSVVFGNSAVSRTAR--FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 342

Query: 325 -----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
                 I D GTS T LN PAY  + + F + A   +      L F+ CY LS  +T  +
Sbjct: 343 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLS-GKTTVK 400

Query: 380 YPVVNLTMKGG 390
            P V L  +G 
Sbjct: 401 VPTVVLHFRGA 411


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 89/324 (27%), Positives = 137/324 (42%), Gaps = 56/324 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ +G P    ++  DTGSDL W+ C  C +C      S+        +SPN     
Sbjct: 89  YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNH---- 144

Query: 164 SKVPCNSTLCEL-----QKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
               C  + C+L       +C  A   S C Y+  Y  DG+ ++GF  ++   L T   +
Sbjct: 145 ----CYDSACQLVPLPKHHRCNHARLHSPCRYEYSY-GDGSKTSGFFSKETTTLNTSSGR 199

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
              +   I+FGC    +G  + GA+ N   G+ GLG    S+ S L ++    N FS C 
Sbjct: 200 EAKLKG-IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSYCL 256

Query: 274 -----GSDGTGRISFG---DKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGNAVNF 321
                    T  +  G   +  +PG+     TP  +    PT Y I I  VSV G  +  
Sbjct: 257 MDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPI 316

Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYC 368
             S            I DSGT+ T+L +PAY QI      + +  R  S ++    F+ C
Sbjct: 317 NPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI---LTVIKRRVRLPSPAEPTPGFDLC 373

Query: 369 YVLSPNQTNFEYPVV-NLTMKGGG 391
                N +  E+P +  L+ K GG
Sbjct: 374 V----NVSEIEHPRLPKLSFKLGG 393


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           LN+L +L    V +G PA S  + +DTGSD+ W+ C   S  H             ++ P
Sbjct: 123 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 172

Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++SST S   C S  C    Q    C S+ S C Y V Y  DG+ +TG    D L L + 
Sbjct: 173 SSSSTYSPFSCGSAACAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 230

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             +S        FGC  V++G F D    +GL GLG    S+ S  A  G +  +FS C 
Sbjct: 231 AVKS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 279

Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
                 +G ++ G  G  G     +TP       PT Y + +  + VGG  ++     F 
Sbjct: 280 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 339

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              + DSGT  T L   AY+ +S  F +  K+      S +  + C+  S  Q++   P 
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 397

Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
           V L   GG    ++   +I+S
Sbjct: 398 VALVFSGGAVVSLDASGIILS 418


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 139/339 (41%), Gaps = 42/339 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ + VG PA    + LDTGSD+ W+ C+ C  C    +          +++P +SST 
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDP---------VFNPTSSSTY 212

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C L +      + C YQV Y  DG+ + G L  D +      K +      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKINN----- 266

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F   A   GL G  +        + NQ +   SFS C     +G+ S 
Sbjct: 267 VALGCGHDNEGLFTGAAGLLGLGGGVLS-------ITNQ-MKATSFSYCLVDRDSGKSSS 318

Query: 284 GDKGSP----GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
            D  S     G    P    +   T Y + ++  SVGG  V      F+  A      I 
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D GT+ T L   AY  + + F  L    ++ S+S   F+ CY  S   T  + P V    
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPTVAFHF 437

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            GG    +     ++  +  G + +      S +++IIG
Sbjct: 438 TGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS-SLSIIG 475


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 135/315 (42%), Gaps = 42/315 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V +G P   F +  DTGSDL W  C+ CV   +    +        I++P+ S++ 
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEA--------IFNPSQSTSY 204

Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
           + + C STLC+            A S C Y ++Y  D + S GF  ++ L L ATD    
Sbjct: 205 ANISCGSTLCDSLASATGNIFNCASSTCVYGIQY-GDSSFSIGFFGKEKLSLTATD---- 259

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
             V +   FGCG+   G F   A      GLG DK S+ S  A +     S+ +   S  
Sbjct: 260 --VFNDFYFGCGQNNKGLFGGAAGLL---GLGRDKLSLVSQTAQRYNKIFSYCLPSSSSS 314

Query: 278 TGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSG 330
           TG ++FG   S     TP  ++      Y + +T +SVGG  +    S       I DSG
Sbjct: 315 TGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSG 374

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           T  T L   AY+ +S TF  L  +        +  + C+  S N      P + L   GG
Sbjct: 375 TVITRLPPAAYSALSSTFRKLMSQYPAAPALSI-LDTCFDFS-NHDTISVPKIGLFFSGG 432

Query: 391 --------GPFFVND 397
                   G F+VND
Sbjct: 433 VVVDIDKTGIFYVND 447


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 142/356 (39%), Gaps = 71/356 (19%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
            N+S+G P ++ ++ +DT SDL W+ C  C++C               I+ P+ S T   
Sbjct: 87  VNISIGSPPITQLLHMDTASDLLWIQCLPCINC---------YAQSLPIFDPSRSYTHRN 137

Query: 166 VPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSK 218
             C ++    Q   PS   N     C Y +RY+ D T S G L  ++L   T  DE  S 
Sbjct: 138 ETCRTS----QYSMPSLKFNANTRSCEYSMRYVDD-TGSKGILAREMLLFNTIYDESSSA 192

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           ++   + FGCG    G  L G    G+ GLG  + S+      +      FS CFGS   
Sbjct: 193 ALHD-VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGKK------FSYCFGSLDD 242

Query: 279 -----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
                  +  GD G+   G+ TP  +      Y +TI  +SV G  +  +          
Sbjct: 243 PSYPHNVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQT 300

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCYVLSPNQTN 377
                I D+G S T L + AY  +      + + +    + S  D+    CY       N
Sbjct: 301 GLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECY-----NGN 355

Query: 378 FE-------YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           FE       +P+V      G    ++   + +   P    ++CL V    N+N IG
Sbjct: 356 FERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPN---VFCLAVTPG-NLNSIG 407


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 69/248 (27%), Positives = 111/248 (44%), Gaps = 42/248 (16%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++   + +G P    +  +DTGSDL W  C  C +C               I+ P+ SST
Sbjct: 60  IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAP---------IFDPSKSST 110

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             +  C+             G++CPY++ Y +D + STG L  + + + +   +   V +
Sbjct: 111 FKEKRCH-------------GNSCPYEIIY-ADESYSTGILATETVTIQSTSGE-PFVMA 155

Query: 223 RISFGCGRVQTGSFLDG--AAPNGLFGLGMDKTSVPSILANQGL-IPNSFSMCFGSDGTG 279
             S GCG   +     G  A+ +G+ GL M  +   S+++   L IP   S CF S GT 
Sbjct: 156 ETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPS---SLISQMDLPIPGLISYCFSSQGTS 212

Query: 280 RISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVG-------GNAVNFEFSAIF-D 328
           +I+FG        G       +++  P Y + +  VSVG       G   + +   IF D
Sbjct: 213 KINFGTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFID 272

Query: 329 SGTSFTYL 336
           SGT++TYL
Sbjct: 273 SGTTYTYL 280


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 130/310 (41%), Gaps = 53/310 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ WL C  C +C    +          +++P  S + 
Sbjct: 42  YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDP---------VFNPVKSGSF 92

Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           +KV C + LC   ++  S G N    C YQV Y  DG+ +TG  V + L     + +   
Sbjct: 93  AKVLCRTPLC---RRLESPGCNQRQTCLYQVSY-GDGSYTTGEFVTETLTFRRTKVE--- 145

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
              +++ GCG    G F+  A   GL   G+   S      NQ      FS C      S
Sbjct: 146 ---QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ-----KFSYCLVDRSAS 197

Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NFEFS- 324
                + FG+          F+   T+P     Y + +  +SVGG  V      +F+   
Sbjct: 198 SKPSSVVFGNSAVSRTAR--FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 255

Query: 325 -----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
                 I D GTS T LN PAY  + + F + A   +      L F+ CY LS  +T  +
Sbjct: 256 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLS-GKTTVK 313

Query: 380 YPVVNLTMKG 389
            P V L  +G
Sbjct: 314 VPTVVLHFRG 323


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 83/303 (27%), Positives = 129/303 (42%), Gaps = 41/303 (13%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           S+G P       +DTGSD  W  C  C  C   LN +S       I++P+ SST   + C
Sbjct: 95  SIGTPPFQLYGVVDTGSDGIWFQCKPCKPC---LNQTSP------IFNPSKSSTYKNIRC 145

Query: 169 NSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           +S +C+   + +C S     C Y++ YL D + S G + +D L L +++    S   +I 
Sbjct: 146 SSPICKRGEKTRCSSNRKRKCEYEITYL-DRSGSQGDISKDTLTLNSNDGSPISF-PKIV 203

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTGR 280
            GCG     S       +G+ G G    S+ S L +   I   FS C  S     + + +
Sbjct: 204 IGCG--HKNSLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISSK 259

Query: 281 ISFGDKGS-PGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF---------EFSAIFD 328
           + FGD     G G     L Q+     Y   +   SVG + +           E +A+ D
Sbjct: 260 LYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVID 319

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           SG++ T L +  Y+Q+     S+ K KR +  T  L   Y   L      +E P++    
Sbjct: 320 SGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLK----KYEVPIITAHF 375

Query: 388 KGG 390
           +G 
Sbjct: 376 RGA 378


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 124/314 (39%), Gaps = 56/314 (17%)

Query: 71  RLRGRGLAAQGNDKTPLTFSAGNDTYR--------LNSLGFLHYT-NVSVGQPALSFIVA 121
           + R   L+A  N      FS  ND  R        +   G L Y  ++++G P       
Sbjct: 59  KARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSAL 118

Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQKQ 178
           LDTGSDL W  C  C SC+   +          +++P  S++   + C   LC   L   
Sbjct: 119 LDTGSDLIWTQCAPCASCLAQPDP---------LFAPGESASYEPMRCAGQLCSDILHHG 169

Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
           C      C Y+  Y  DGTM+ G    +     T     + +   + FGCG +  GS  +
Sbjct: 170 C-EMPDTCTYRYNY-GDGTMTMGVYATERFTF-TSSGGDRLMTVPLGFGCGSMNVGSLNN 226

Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS-----------FGDKG 287
           G   +G+ G G +  S+ S L+ +      FS C  S G+GR S           +GD  
Sbjct: 227 G---SGIVGFGRNPLSLVSQLSIR-----RFSYCLTSYGSGRKSTLLFGSLSGGVYGDAT 278

Query: 288 SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTY 335
            P Q  TP      +PT Y + +  ++VG   +    SA           I DSGT+ T 
Sbjct: 279 GPVQ-TTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTL 337

Query: 336 LNDPAYTQISETFN 349
           L      ++   F 
Sbjct: 338 LPGAVLAEVVRAFR 351


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 78/281 (27%), Positives = 115/281 (40%), Gaps = 50/281 (17%)

Query: 98  LNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           + + G L Y  +++VG P       LDTGSDL W  CD C +C+   +          ++
Sbjct: 90  VRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP---------LF 140

Query: 156 SPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           SP  SS+   + C   LC   L   C      C Y+  Y  DGT + G+   +    A+ 
Sbjct: 141 SPRMSSSYEPMRCAGQLCGDILHHSCVRP-DTCTYRYSY-GDGTTTLGYYATERFTFASS 198

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC- 272
             +++SV   + FGCG +  GS  +    +G+ G G D  S+ S L+ +      FS C 
Sbjct: 199 SGETQSVP--LGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLSIR-----RFSYCL 248

Query: 273 --FGSDGTGRISFG---------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN 320
             + S     + FG         D   P Q  TP      +PT Y +  T V+VG   + 
Sbjct: 249 TPYASSRKSTLQFGSLADVGLYDDATGPVQ-TTPILQSAQNPTFYYVAFTGVTVGARRLR 307

Query: 321 FEFSA-----------IFDSGTSFTYLNDPAYTQISETFNS 350
              SA           I DSGT+ T        ++   F S
Sbjct: 308 IPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRS 348


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 135/339 (39%), Gaps = 44/339 (12%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P       +DTGSD+ WL C+ C  C +             +++P+ SS+   +PC
Sbjct: 92  SVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTP---------MFNPSKSSSYKNIPC 142

Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
            S LC+  +       N C Y   Y  D + S G L  D L L +    + S    I  G
Sbjct: 143 PSKLCQSMEDTSCNDKNYCEYST-YYGDNSHSGGDLSVDTLTLESTNGLTVSF-PNIVIG 200

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---------GSDGT 278
           CG     S+ +GA+ +G+ G G    S  + L +       FS C           S+ T
Sbjct: 201 CGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFSYCLTPLFSVTNIQSNAT 256

Query: 279 GRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAIF 327
            +++FGD  +    G   TP   +     Y +T+   SVG   V          E + I 
Sbjct: 257 SKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIII 316

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L    Y+ +      L K +R    +      CY  S     +++P++ +  
Sbjct: 317 DSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQT-LNLCY--SVKAEGYDFPIITMHF 373

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           KG        PI    S   G  ++CL    S +  I G
Sbjct: 374 KGADVDL--HPISTFVSVADG--VFCLAFESSQDHAIFG 408


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 95/370 (25%), Positives = 150/370 (40%), Gaps = 68/370 (18%)

Query: 97  RLNSLGFLHYTNV--SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFN 153
           RL +L ++   ++  S G PA +  V +DTGSDL W+ C  C +C    +          
Sbjct: 138 RLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDP--------- 188

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ--------CPSAGS---NCPYQVRYLSDGTMSTGF 202
           ++ P  S+T + V CN++ C    +        C S G+    C Y + Y  DG+ S G 
Sbjct: 189 LFDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAY-GDGSFSRGV 247

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           L  D + L        S+   + FGCG    G F       GL GLG  + S+ S  A++
Sbjct: 248 LATDTVALG-----GASLGGFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTASR 298

Query: 263 GLIPNSFSMCF----GSDGTGRISFG---DKGSPGQGETPFSLRQT------HPTYNITI 309
                 FS C       D +G +S G   D  S  +  TP +  +        P Y + +
Sbjct: 299 --YGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNV 356

Query: 310 TQVSVGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP- 364
           T  +VGG A+  +     + + DSGT  T L    Y  +   F       R+   +  P 
Sbjct: 357 TGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEF------MRQFGAAGYPA 410

Query: 365 ------FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGV 416
                  + CY L+      + P++ L ++GG    V+    + +V  +   + L    +
Sbjct: 411 APGFSILDTCYDLT-GHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASL 469

Query: 417 VKSDNVNIIG 426
              D   IIG
Sbjct: 470 SYEDETPIIG 479


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 123/307 (40%), Gaps = 44/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ WL C  C  C     S + Q+ D     P+ S + 
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCY----SQTDQIFD-----PSKSKSF 180

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + +PC S LC       C    + C YQV Y  DG+ + G    + L         ++  
Sbjct: 181 AGIPCYSPLCRRLDSPGCSLKNNLCQYQVSY-GDGSFTFGDFSTETLTF------RRAAV 233

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
            R++ GCG    G F+  A    L GLG    S P+    +    N FS C      S  
Sbjct: 234 PRVAIGCGHDNEGLFVGAAG---LLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAK 288

Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
              I FGD         TP        T Y + +  +SVGG  V       F   +    
Sbjct: 289 PSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNG 348

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGTS T L  PAY  + + F   A   +      L F+ CY LS   +  + P V
Sbjct: 349 GVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSL-FDTCYDLS-GLSEVKVPTV 406

Query: 384 NLTMKGG 390
            L  +G 
Sbjct: 407 VLHFRGA 413


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 151/377 (40%), Gaps = 58/377 (15%)

Query: 71  RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
           RLRG   A +   K+  T  +GN           +  +V +G P     +  DTGSDL W
Sbjct: 109 RLRGSK-ATKIPAKSGATIGSGN-----------YIVSVGLGTPKKYLSLIFDTGSDLTW 156

Query: 131 LPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPSA 182
             C  C    +             ++ P+ S+T S + C+S  C         Q  C SA
Sbjct: 157 TQCQPCARYCYNQKDP--------VFVPSQSTTYSNISCSSPDCSQLESGTGNQPGC-SA 207

Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
              C Y ++Y  D + S G+  ++ L L      S  V     FGCG+   G F   A  
Sbjct: 208 ARACIYGIQY-GDQSFSVGYFAKETLTLT-----STDVIENFLFGCGQNNRGLFGSAA-- 259

Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE-TPFSLR 299
            GL GLG DK S+    A +      FS C    S  TG ++FG  G  G  + TP  + 
Sbjct: 260 -GLIGLGQDKISIVKQTAQK--YGQVFSYCLPKTSSSTGYLTFGGGGGGGALKYTP--IT 314

Query: 300 QTHPT---YNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNS 350
           + H     Y + I  + VGG  +    S      AI DSGT  T L   AY+ +   F  
Sbjct: 315 KAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRLPPDAYSALKSAFEK 374

Query: 351 -LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGL 409
            +AK  +    S L  + CY LS   T  + P V    KGG    ++   ++  +    +
Sbjct: 375 GMAKYPKAPELSIL--DTCYDLSKYST-IQIPKVGFVFKGGEELDLDGIGIMYGASTSQV 431

Query: 410 YLYCLGVVKSDNVNIIG 426
            L   G      V IIG
Sbjct: 432 CLAFAGNQDPSTVAIIG 448


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 144/347 (41%), Gaps = 60/347 (17%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            +++G P+LSF   LDTGSDL W  C  C  C               IY P+ SST SKV
Sbjct: 118 KMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTP---------IYDPSQSSTYSKV 168

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           PC+S++C+       +G+NC Y   Y  D + + G L  +   L      S+S+   I+F
Sbjct: 169 PCSSSMCQALPMYSCSGANCEYLYSY-GDQSSTQGILSYESFTLT-----SQSL-PHIAF 221

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTGRI 281
           GCG  Q       +   GL G G    S+ S L     + N FS C  S       T  +
Sbjct: 222 GCG--QENEGGGFSQGGGLVGFGRGPLSLISQLGQS--LGNKFSYCLVSITDSPSKTSPL 277

Query: 282 SFGDKGSPGQ---GETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AI 326
             G   S        TP    ++ PT Y +++  +SVGG  ++     F+         I
Sbjct: 278 FIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVI 337

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
            DSGT+ TYL    Y  + +   S +    +   S++  + C+      +   +P +   
Sbjct: 338 IDSGTTVTYLEQSGYDVVKKAVIS-SINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFH 396

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLY-------CLGVVKSDNVNIIG 426
            +G              + PK  Y+Y       CL ++ S+ ++I G
Sbjct: 397 FEGAD-----------FNLPKENYIYTDSSGIACLAMLPSNGMSIFG 432


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 139/339 (41%), Gaps = 42/339 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ + VG PA    + LDTGSD+ W+ C+ C  C    +          +++P +SST 
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDP---------VFNPTSSSTY 212

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C L +      + C YQV Y  DG+ + G L  D +      K +      
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKINN----- 266

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F   A   GL G  +        + NQ +   SFS C     +G+ S 
Sbjct: 267 VALGCGHDNEGLFTGAAGLLGLGGGVLS-------ITNQ-MKATSFSYCLVDRDSGKSSS 318

Query: 284 GDKGSP----GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
            D  S     G    P    +   T Y + ++  SVGG  V      F+  A      I 
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D GT+ T L   AY  + + F  L    ++ S+S   F+ CY  S   T  + P V    
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPTVAFHF 437

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            GG    +     ++  +  G + +      S +++IIG
Sbjct: 438 TGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS-SLSIIG 475


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 129/303 (42%), Gaps = 39/303 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  +S+G P +  +V +DTGS L W+ C +C    +   + +GQ     I++P  SST 
Sbjct: 6   YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-----IFNPYNSSTY 60

Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           SKV C++  C        ++  C      C Y +RY S G  S G+L +D L LA++   
Sbjct: 61  SKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYLGKDRLTLASN--- 116

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
            +S+D+ I FGCG       L      G+ G G    S  + +  Q     +FS CF  D
Sbjct: 117 -RSIDNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQTDY-TAFSYCFPRD 169

Query: 277 --GTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------A 325
               G ++ G          T        P Y   I Q+ +  N +  E           
Sbjct: 170 HENEGSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIRLEIDPYIYISKMT 227

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF-EYPVVN 384
           I DSGT+ TY+  P +  + +      + K  T   D     C++ +    N+ ++P V 
Sbjct: 228 IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWD-ERRICFISNSGSANWNDFPTVE 286

Query: 385 LTM 387
           + +
Sbjct: 287 MKL 289


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 90/348 (25%), Positives = 143/348 (41%), Gaps = 42/348 (12%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           L++L F+    V  G PA ++ +++DTGSD+ W+   C+ C          V D     P
Sbjct: 156 LDTLEFV--VTVGFGSPAQNYTLSIDTGSDVSWI--QCLPCSGHCYKQHDPVFD-----P 206

Query: 158 NTSSTSSKVPCNSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             S+T S VPC    C     +C ++G+ C Y+V Y  DG+ + G L  + L L++    
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGT-CLYKVTY-GDGSSTAGVLSHETLSLSSTRDL 264

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
                   +FGCG+   G F       GL    +   S+PS  A       +FS C  S 
Sbjct: 265 PG-----FAFGCGQTNLGEFGGVDGLVGLGRGAL---SLPSQAA--ATFGATFSYCLPSY 314

Query: 277 GT--GRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGG------NAVNF 321
            T  G ++ G        +      T    ++ +P+ Y + +  + +GG        V  
Sbjct: 315 DTTHGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT 374

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               +FDSGT  TYL   AY  + + F     + +     D PF+ CY  + +   F  P
Sbjct: 375 RDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYD-PFDTCYDFTGHNAIF-MP 432

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV---NIIG 426
            V      G  F ++   +++  +       CL  V   +    NIIG
Sbjct: 433 AVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIG 480


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 123/286 (43%), Gaps = 51/286 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC---VHGLNSSSGQVIDFNIYSPNTSS 161
           ++ ++ +G P  + ++  DTGSDL W+ C        +H   S+         +    S+
Sbjct: 83  YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGST---------FLARHST 133

Query: 162 TSSKVPCNSTLCELQKQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           T S   C S+LC+L  Q P+          S C Y+  Y SDG+ ++GF  ++   L T 
Sbjct: 134 TFSPTHCFSSLCQLVPQ-PNPNPCNHTRLHSTCRYEYVY-SDGSKTSGFFSKETTTLNTS 191

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFS 270
             +   + S I+FGCG   +G  L G++ N   G+ GLG    S  S L  +     SFS
Sbjct: 192 SGREMKLKS-IAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRR--FGRSFS 248

Query: 271 MC-----FGSDGTGRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNA 318
            C          T  +  GD  S  +        TP  +    PT Y I+I  V V G  
Sbjct: 249 YCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVK 308

Query: 319 VNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAK 353
           ++ + S            + DSGT+ T+L +PAY +I   F    K
Sbjct: 309 LHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVK 354


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/306 (27%), Positives = 122/306 (39%), Gaps = 53/306 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     VG P  + ++ALD   D  W+PC  CV C               +++   S+T 
Sbjct: 35  YIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC------------SSTVFNTVKSTTF 82

Query: 164 SKVPCNSTLCELQKQCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             + C +  C   KQ P+    GS C +   Y S   +S   L  D + L+ D       
Sbjct: 83  KTLGCGAPQC---KQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALSMDPV----- 132

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT-- 278
               +FGC +  TGS      P GL G G    S  S    Q L  ++FS C  S  T  
Sbjct: 133 -PYYAFGCIQKATGS---SVPPQGLLGFGRGPLSFLS--QTQNLYKSTFSYCLPSFRTLN 186

Query: 279 --GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA--------- 325
             G +  G  G P + +T   L+    +  Y + +  + VG   V+   SA         
Sbjct: 187 FSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGA 246

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV--LSPNQTNFEYP 381
             IFDSGT FT L  PAY  +   F    +    T +S   F+ CY   + P    F + 
Sbjct: 247 GTIFDSGTVFTRLVAPAYIAVRNEFRK--RVGNATVSSLGGFDTCYSVPIVPPTITFMFS 304

Query: 382 VVNLTM 387
            +N+TM
Sbjct: 305 GMNVTM 310


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 80/302 (26%), Positives = 122/302 (40%), Gaps = 39/302 (12%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P    +  +DTGSD+ WL C+ C  C               I+ P+ S T   +PC
Sbjct: 96  SVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTP---------IFDPSKSKTYKTLPC 146

Query: 169 NSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           +S  CE L+    S+ + C Y + Y  DG+ S G L  + L L + +  S      +  G
Sbjct: 147 SSNTCESLRNTACSSDNVCEYSIDY-GDGSHSDGDLSVETLTLGSTDGSSVHFPKTV-IG 204

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDGTGRIS 282
           CG    G+F +  +      +G+    V  I      I   FS C       S+ + +++
Sbjct: 205 CGHNNGGTFQEEGSGI----VGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLN 260

Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
           FGD       G   TP         Y +T+   SVG N + F           + + I D
Sbjct: 261 FGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIID 320

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           SGT+ T L    Y  +    + + K +R    S L    CY  + ++   + PV+    K
Sbjct: 321 SGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKL-LSLCYKTTSDE--LDLPVITAHFK 377

Query: 389 GG 390
           G 
Sbjct: 378 GA 379


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 127/324 (39%), Gaps = 53/324 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           + VG P   F +  D  +D  WL C  C+ C    +S         I+ P+ SS+ + + 
Sbjct: 191 IGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDS---------IFDPSQSSSYTLLS 241

Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           C +  C L     C   G  C Y + Y  DGT + G L+ + +      + S  VD R+S
Sbjct: 242 CETKHCNLLPNSSCSDDGY-CRYNITY-KDGTNTEGVLINETVSF----ESSGWVD-RVS 294

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
            GC     G F+     +G FGLG    S PS +    +   S+ +    DG    +   
Sbjct: 295 LGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINASSM---SYCLVESKDGYSSSTLEF 348

Query: 286 KGSPGQGETPFSLRQ---THPTYNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
              P  G     L Q       Y + +  + VGG  ++   S            I  S +
Sbjct: 349 NSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSS 408

Query: 332 SFTYLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
             T L +  Y  + + F  +AK +  E   + L F+ CY LS N T  E P++   +  G
Sbjct: 409 LITMLENDTYNVVRDAF--VAKTQHLERLKAFLQFDTCYNLSSNNT-VELPILEFEVNDG 465

Query: 391 GPFFVNDPIVIVSSEPKGLYLYCL 414
             + +          PK  YLY +
Sbjct: 466 KSWLL----------PKESYLYAV 479


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 83/306 (27%), Positives = 131/306 (42%), Gaps = 49/306 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G PA + ++A+DT +D  W+PC  CV C                ++P  S+T 
Sbjct: 98  YIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTT-----------TPFAPAKSTTF 146

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDS 222
            KV C ++ C+  +     GS C +   Y   GT S    LV+D + LATD   +     
Sbjct: 147 KKVGCGASQCKQVRNPTCDGSACAFNFTY---GTSSVAASLVQDTVTLATDPVPA----- 198

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---- 278
             +FGC +  TGS +      GL    +   +       Q L  ++FS C  S  T    
Sbjct: 199 -YAFGCIQKVTGSSVPPQGLLGLGRGPLSLLA-----QTQKLYQSTFSYCLPSFKTLNFS 252

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
           G +  G    P + +    L+    +  Y + +  + VG   V+    A           
Sbjct: 253 GSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGT 312

Query: 326 IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYP 381
           +FDSGT FT L +PAY  +   F   +A  K+ T TS   F+ CY   +++P  T F + 
Sbjct: 313 VFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYTAPIVAPTIT-FMFS 371

Query: 382 VVNLTM 387
            +N+T+
Sbjct: 372 GMNVTL 377


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           LN+L +L    V +G PA S  + +DTGSD+ W+ C   S  H             ++ P
Sbjct: 193 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 242

Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++SST S   C S  C    Q    C S+ S C Y V Y  DG+ +TG    D L L + 
Sbjct: 243 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 300

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             +S        FGC  V++G F D    +GL GLG    S+ S  A  G +  +FS C 
Sbjct: 301 AVRS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 349

Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
                 +G ++ G  G  G     +TP       PT Y + +  + VGG  ++     F 
Sbjct: 350 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 409

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              + DSGT  T L   AY+ +S  F +  K+      S +  + C+  S  Q++   P 
Sbjct: 410 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 467

Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
           V L   GG    ++   +I+S
Sbjct: 468 VALVFSGGAVVSLDASGIILS 488


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 130/305 (42%), Gaps = 48/305 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T V +G+PA    + LDTGSD+ WL C  C  C H             I+ P++SS+ 
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEP---------IFEPSSSSSY 198

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C   +      + C Y+V Y  DG+ + G    + L + +   Q+      
Sbjct: 199 EPLSCDTPQCNALEVSECRNATCLYEVSY-GDGSYTVGDFATETLTIGSTLVQN------ 251

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A      GLG    ++PS L        SFS C     SD    
Sbjct: 252 VAVGCGHSNEGLFVGAAGLL---GLGGGLLALPSQLNT-----TSFSYCLVDRDSDSAST 303

Query: 281 ISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
           + FG   SP     P  LR  Q    Y + +T +SVGG  +     +FE         I 
Sbjct: 304 VDFGTSLSPDAVVAPL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 362

Query: 328 DSGTSFTYLNDPAYTQISETF--NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           DSGT+ T L    Y  + ++F   +L  EK   +     F+ CY LS  +T  E P V  
Sbjct: 363 DSGTAVTRLQTEIYNSLRDSFVKGTLDLEK---AAGVAMFDTCYNLSA-KTTVEVPTVAF 418

Query: 386 TMKGG 390
              GG
Sbjct: 419 HFPGG 423


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 113/277 (40%), Gaps = 50/277 (18%)

Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
           G L Y  +++VG P       LDTGSDL W  CD C +C+   +          ++SP  
Sbjct: 94  GDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP---------LFSPRM 144

Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           SS+   + C   LC   L   C      C Y+  Y  DGT + G+   +    A+   ++
Sbjct: 145 SSSYEPMRCAGQLCGDILHHSCVRP-DTCTYRYSY-GDGTTTLGYYATERFTFASSSGET 202

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
           +SV   + FGCG +  GS  +    +G+ G G D  S+ S L+ +      FS C   + 
Sbjct: 203 QSVP--LGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLSIR-----RFSYCLTPYA 252

Query: 275 SDGTGRISFG---------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
           S     + FG         D   P Q  TP      +PT Y +  T V+VG   +    S
Sbjct: 253 SSRKSTLQFGSLADVGLYDDATGPVQ-TTPILQSAQNPTFYYVAFTGVTVGARRLRIPAS 311

Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNS 350
           A           I DSGT+ T        ++   F S
Sbjct: 312 AFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRS 348


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 139/355 (39%), Gaps = 62/355 (17%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           ++SVG PAL +   +DTGSDL W    C  CV   N ++       ++ P  SST + +P
Sbjct: 119 DLSVGTPALPYAAIVDTGSDLVW--TQCKPCVECFNQTT------PVFDPAASSTYAALP 170

Query: 168 CNSTLCE--------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S LC               SA S C Y   Y  D + + G L  +   LA  +     
Sbjct: 171 CSSALCADLPTSTCASSSSSSSASSPCGYTYTY-GDASSTQGVLATETFTLARQKVPG-- 227

Query: 220 VDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--D 276
               ++FGCG    G  F  GA   GL GLG    S+ S L       + FS C  S  D
Sbjct: 228 ----VAFGCGDTNEGDGFTQGA---GLVGLGRGPLSLVSQLGI-----DRFSYCLTSLDD 275

Query: 277 GTGRISF----------GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA 325
             GR                 +P Q  TP     + P+ Y +++T ++VG   +    SA
Sbjct: 276 AAGRSPLLLGSAAGISASAATAPAQ-TTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSA 334

Query: 326 -----------IFDSGTSFTYLNDPAYTQISETF---NSLAKEKRETSTSDLPFEYCYVL 371
                      I DSGTS TYL   AY  + + F    SL          DL F+     
Sbjct: 335 FAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGA 394

Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
                  + P + L   GG    +     +V     G    CL V+ S  ++IIG
Sbjct: 395 VDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASG--ALCLTVMASRGLSIIG 447


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 97/346 (28%), Positives = 153/346 (44%), Gaps = 53/346 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ VG P  +  +  DTGSD+ WL C  C SC        GQ     +++P+ SST 
Sbjct: 81  YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY-------GQTDP--LFNPSFSSTF 131

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             + C S+LC+  L + C    + C YQV Y  DG+ + G    + L   ++   S    
Sbjct: 132 QSITCGSSLCQQLLIRGCRR--NQCLYQVSY-GDGSFTVGEFSTETLSFGSNAVNS---- 184

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
             ++ GCG    G F   A    L GLG    S PS +    L  + FS C     S G+
Sbjct: 185 --VAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--------- 325
             + FG++      +  F+   T+P     Y + +  + VGG +V+    +         
Sbjct: 238 VPLIFGNQAVASNAQ--FTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGN 295

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
              I DSGT+ T L   AY  + + F + +  + + TS   L F+ CY LS  +++   P
Sbjct: 296 GGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSL-FDTCYDLS-GRSSIMLP 353

Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
            V+    GG    +    ++V  +  G   YCL     S+N +IIG
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIG 397


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 90/333 (27%), Positives = 143/333 (42%), Gaps = 46/333 (13%)

Query: 111 VGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G P +     +DTGSDL W+ C  C+ C + +N          ++ P  SST + + C+
Sbjct: 70  IGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINP---------MFDPLKSSTYTNISCD 120

Query: 170 STLC--ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           S LC      +C S    C Y   Y +D +++ G L ++ + L ++  +  S+   I FG
Sbjct: 121 SPLCYKPYIGEC-SPEKRCDYTYGY-ADSSLTKGVLAQETVTLTSNTGKPISLQG-ILFG 177

Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA--------NQGLIPNSFSMCFGSDGTG 279
           CG   TG+F D     GL GLG   TS+ S +         +Q L+P    +   S    
Sbjct: 178 CGHNNTGNFNDHEM--GLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISS---- 231

Query: 280 RISFGDKGSPGQGE----TPFSLR-QTHPTYNITITQVSVGG-----NAVNFEFSAIFDS 329
           ++SFG KGS   GE    TP   R Q   +Y +T+  +SV       N+   + + + DS
Sbjct: 232 QMSFG-KGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDS 290

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GT    L    Y ++     +    +  T    L  + CY     QTN + P +    +G
Sbjct: 291 GTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYR---TQTNLKGPTLTYHFEG 347

Query: 390 GGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDN 421
                   PI   +   P+   ++CL +    N
Sbjct: 348 ANLLLT--PIQTFIPPTPETKGVFCLAITNCAN 378


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 94/349 (26%), Positives = 138/349 (39%), Gaps = 48/349 (13%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC--DCVSCVHGLNSSSGQVIDFNI 154
           R++  G  +    S+G P        DTGSDL W  C   C +      S S        
Sbjct: 83  RMDDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPS-------- 134

Query: 155 YSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRY---LSDGTMSTGFLVED 206
           Y PN SST +K+PC+  LC L +      C +AG+ C Y+  Y     D   + GFL  +
Sbjct: 135 YLPNASSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARE 194

Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
              L  D   S      + FGC     G +  G+         +     P  L +Q L  
Sbjct: 195 TFTLGADAVPS------VRFGCTTASEGGYGSGSG-------LVGLGRGPLSLVSQ-LNA 240

Query: 267 NSFSMCFGSDGTGR--ISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNA---VN 320
           ++F  C  SD +    + FG   S  G       L  +   Y + +  +S+G      V 
Sbjct: 241 STFMYCLTSDASKASPLLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVG 300

Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN--QTNF 378
                +FDSGT+ TYL +PAY++    F S     +   T    FE C+    N   +N 
Sbjct: 301 EPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG--FEACFQKPANGRLSNA 358

Query: 379 EYPVVNLTMKGGGPFF-VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             P + L   G      V + +V V        + C  V +S +++IIG
Sbjct: 359 AVPTMVLHFDGADMALPVANYVVEVEDG-----VVCWIVQRSPSLSIIG 402


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 127/309 (41%), Gaps = 48/309 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ W+ C  C+ C    +          ++ P  S + 
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDP---------VFDPTKSRSF 195

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + +PC S LC       C +    C YQV Y  DG+ + G    + L             
Sbjct: 196 ANIPCGSPLCRRLDYPGCSTKKQICLYQVSY-GDGSFTVGEFSTETLTFRGTRV------ 248

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG----SDG 277
            R+  GCG    G F+  A      GLG  + S PS +  +    + FS C G    S  
Sbjct: 249 GRVVLGCGHDNEGLFVGAAGLL---GLGRGRLSFPSQIGRR--FNSKFSYCLGDRSASSR 303

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN------FEFSA-- 325
              I FGD  S     T F+   ++P     Y + +  +SVGG  V+      F+  +  
Sbjct: 304 PSSIVFGD--SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG 361

Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               I DSGTS T L   AY  + + F   A   +      L F+ C+ LS  +T  + P
Sbjct: 362 NGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSL-FDTCFDLS-GKTEVKVP 419

Query: 382 VVNLTMKGG 390
            V L  +G 
Sbjct: 420 TVVLHFRGA 428


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 117/259 (45%), Gaps = 27/259 (10%)

Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
           +G +C Y V+Y  DG+ + GF   D L L++ +           FGCG    G F + A 
Sbjct: 17  SGGHCLYGVQY-GDGSYTIGFFAMDTLTLSSHDAIKG-----FRFGCGERNEGLFGEAA- 69

Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE----TP 295
             GL GLG  KTS+P    ++      F+ CF   S GTG + FG   SP        TP
Sbjct: 70  --GLLGLGRGKTSLPVQTYDK--YGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLSTTP 125

Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IFDSGTSFTYLNDPAYTQISETF 348
             L  T PT Y + +T + VGG  +      F+A   I DSGT  T L   AY+ +   F
Sbjct: 126 M-LIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAF 184

Query: 349 -NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
             S+A    + + +    + CY L+   +    P V+L  +GG    V+   +I ++   
Sbjct: 185 AASMAARGYKRAPALSLLDTCYDLT-GASEVAIPTVSLLFQGGVSLDVDASGIIYAASVS 243

Query: 408 GLYLYCLGVVKSDNVNIIG 426
              L   G   +D+V I+G
Sbjct: 244 QACLGFAGNEAADDVAIVG 262


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 110/282 (39%), Gaps = 46/282 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P   F + LDTGSDL WL C  C  C H   +          Y P TS++ 
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEA---------FYDPKTSASF 212

Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
             + CN   C L        QC S   +CPY   Y      +  F VE   ++L T E +
Sbjct: 213 KNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGR 272

Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S       + FGCG    G F   +   GL    +  +S       Q L  +SFS C   
Sbjct: 273 SSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVD 327

Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFS------LRQTHPTYNITITQVSVGGNAVNFEF 323
               ++ + ++ FG DK         F+             Y I I  + VGG A++   
Sbjct: 328 RNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPE 387

Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
                        I DSGT+ +Y  +PAY  I   F    KE
Sbjct: 388 ETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE 429


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 157/414 (37%), Gaps = 61/414 (14%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLRGRGLAAQGNDKTP 86
           G F  D  HR            D PK   +      A R DR+FR       A  +  TP
Sbjct: 33  GRFSIDLIHR------------DSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTP 80

Query: 87  LT-FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNS 144
               S+ N  Y +          +S+G P        DTGSDL W  C  C+SC    N 
Sbjct: 81  EPPVSSNNGEYLMK---------ISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNP 131

Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGF 202
                    ++ P+ S++  +V C S  C L     C      C +   Y  DG+++ G 
Sbjct: 132 ---------MFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY-GDGSLAQGV 181

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           +  + L L ++  Q  S+   I FGCG   +G+F +     GLFG G    S+ S + + 
Sbjct: 182 IATETLTLNSNSGQPTSI-LNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238

Query: 263 GLIPNSFSMC---FGSDG--TGRISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSV 314
                 FS C   F +D   T +I FG +      +   TP   +     Y +T+  +SV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 315 GGNAVNFEFSA--------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
           G     F  S+          D+GT  T L    Y ++ +     A         DL  +
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKE-AIPMEPVQDPDLQPQ 357

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
            CY    + T  + P+  LT    G      P+    S  +G+Y + +  +  D
Sbjct: 358 LCYR---SATLIDGPI--LTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGD 406


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 49/304 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +    +VG PA +F++ALDT +D  W+PC+ CV C               +++  TS+T 
Sbjct: 90  YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSS------------TVFNSVTSTTF 137

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C+        GS C +   Y     +S   L  D + L+TD      +   
Sbjct: 138 KTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALSTD------IVPG 189

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----G 279
            +FGC +  TGS      P GL GLG    S  S    Q L  ++FS C  S  T    G
Sbjct: 190 YTFGCIQKTTGS---SVPPQGLLGLGRGPLSFLS--QTQDLYKSTFSYCLPSFRTLNFSG 244

Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
            +  G  G P + +T   L+    +  Y + +  + VG   V+   SA           I
Sbjct: 245 TLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTI 304

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVV 383
           FDSGT FT L  P YT + + F         +S     F+ CY   +++P  T F +  +
Sbjct: 305 FDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG--FDTCYTGPIVAPTMT-FMFSGM 361

Query: 384 NLTM 387
           N+T+
Sbjct: 362 NVTL 365


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           LN+L +L    V +G PA S  + +DTGSD+ W+ C   S  H             ++ P
Sbjct: 123 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 172

Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++SST S   C S  C    Q    C S+ S C Y V Y  DG+ +TG    D L L + 
Sbjct: 173 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 230

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             +S        FGC  V++G F D    +GL GLG    S+ S  A  G +  +FS C 
Sbjct: 231 AVRS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 279

Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
                 +G ++ G  G  G     +TP       PT Y + +  + VGG  ++     F 
Sbjct: 280 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 339

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              + DSGT  T L   AY+ +S  F +  K+      S +  + C+  S  Q++   P 
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 397

Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
           V L   GG    ++   +I+S
Sbjct: 398 VALVFSGGAVVSLDASGIILS 418


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 156/414 (37%), Gaps = 61/414 (14%)

Query: 28  GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLRGRGLAAQGNDKTP 86
           G F  D  HR            D PK   +      A R DR+FR       A  +  TP
Sbjct: 33  GRFSIDLIHR------------DSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTP 80

Query: 87  LT-FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNS 144
               S+ N  Y +          +S+G P        DTGSDL W  C  C+SC    N 
Sbjct: 81  EPPVSSNNGEYLMK---------ISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNP 131

Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGF 202
                    ++ P+ S++  +V C S  C L     C      C +   Y  DG+++ G 
Sbjct: 132 ---------MFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY-GDGSLAQGV 181

Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
           +  + L L ++  Q  S+   I FGCG   +G+F +     GLFG G    S+ S + + 
Sbjct: 182 IATETLTLNSNSGQPXSI-XNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238

Query: 263 GLIPNSFSMC---FGSDG--TGRISFGDKGSPGQG---ETPFSLRQTHPTYNITITQVSV 314
                 FS C   F +D   T +I FG +          TP   +     Y +T+  +SV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 315 GGNAVNFEFSA--------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
           G     F  S+          D+GT  T L    Y ++ +     A         DL  +
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKE-AIPMEPVQDPDLQPQ 357

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
            CY    + T  + P+  LT    G      P+    S  +G+Y + +  +  D
Sbjct: 358 LCYR---SATLIDGPI--LTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGD 406


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 143/350 (40%), Gaps = 57/350 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                +FGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + ++    + + +T +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLSPNQT 376
              +FDSG+  +Y+ D A + + +    L      A+E+ E +        CY +     
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLRQRIRELLLKRGAAEEESERN--------CYDMRSVDE 272

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             + P ++L    G  F +    V V    +   ++CL    + +V+IIG
Sbjct: 273 G-DMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTKSVSIIG 321


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 127/311 (40%), Gaps = 43/311 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
           +   VS G PA+  +V +DTGSDL WL C           SSGQ       ++ P+ SST
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCK--------PCSSGQCSPQKDPLFDPSHSST 163

Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            S VPC S  C+          C S G  C + + Y+ DGT + G   +D L LA     
Sbjct: 164 YSAVPCASGECKKLAADAYGSGC-SNGQPCGFAISYV-DGTSTVGVYGKDKLTLAPG--- 218

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
             ++     FGCG  ++                +    +   L  Q      FS C  + 
Sbjct: 219 --AIVKDFYFGCGHSKSSLPGLFDG-------LLGLGRLSESLGAQYGGGGGFSYCLPAV 269

Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
            +  G ++FG   +P G   TP       PT++ +T+  ++VGG  ++   SA     I 
Sbjct: 270 NSKPGFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIV 329

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT  T L    Y  +   F    K  R     DL  + CY L+    N   P + LT 
Sbjct: 330 DSGTVVTVLQSTVYRALRAAFREAMKAYRLVH-GDL--DTCYDLT-GYKNVVVPKIALTF 385

Query: 388 KGGGPFFVNDP 398
            GG    ++ P
Sbjct: 386 SGGATINLDVP 396


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 108/271 (39%), Gaps = 45/271 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   + +G P   +   LDTGSDL W  C  C+ CV        Q   +  + P  S+T 
Sbjct: 90  YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPARSATY 140

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C S  C            C YQ  Y  D   + G L  +     T+E +       
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFY-GDSASTAGVLANETFTFGTNETRVSL--PG 197

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           ISFGCG +  GS  +G   +G+ G G    S+ S L +       FS C   F S    R
Sbjct: 198 ISFGCGNLNAGSLANG---SGMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSR 249

Query: 281 ISFG--------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAV 319
           + FG        +  S     TPF +    PT Y + +T +SVGG            N  
Sbjct: 250 LYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDT 309

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
           +     I DSGT+ TYL +PAY  +   F S
Sbjct: 310 DGTGGTIIDSGTTITYLAEPAYDAVRAAFAS 340


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 49/304 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +    +VG PA +F++ALDT +D  W+PC+ CV C               +++  TS+T 
Sbjct: 90  YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSS------------TVFNSVTSTTF 137

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C+        GS C +   Y     +S   L  D + L+TD      +   
Sbjct: 138 KTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALSTD------IVPG 189

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----G 279
            +FGC +  TGS      P GL GLG    S  S    Q L  ++FS C  S  T    G
Sbjct: 190 YTFGCIQKTTGS---SVPPQGLLGLGRGPLSFLS--QTQDLYKSTFSYCLPSFRTLNFSG 244

Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
            +  G  G P + +T   L+    +  Y + +  + VG   V+   SA           I
Sbjct: 245 TLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTI 304

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVV 383
           FDSGT FT L  P YT + + F         +S     F+ CY   +++P  T F +  +
Sbjct: 305 FDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG--FDTCYTGPIVAPTMT-FMFSGM 361

Query: 384 NLTM 387
           N+T+
Sbjct: 362 NVTL 365


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 126/307 (41%), Gaps = 41/307 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  N+S+G P    +   DTGSDL W  C  C  C   ++          ++ P  SST 
Sbjct: 94  YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDP---------LFDPKASSTY 144

Query: 164 SKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             V C+S+ C   E Q  C +  + C Y   Y  D + + G +  D L L + + +   +
Sbjct: 145 KDVSCSSSQCTALENQASCSTEDNTCSYSTSY-GDRSYTKGNIAVDTLTLGSTDTRPVQL 203

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
            + I  GCG    G+F       G   +G+   +V  I      I   FS C       +
Sbjct: 204 KNII-IGCGHNNAGTF----NKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSEN 258

Query: 276 DGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFS 324
           D T +I+FG        G   TP   +     Y +T+  +SVG   V +        E +
Sbjct: 259 DRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGN 318

Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
            I DSGT+ T L    Y+++ +   +S+  EK++   + L    CY  +    + + P +
Sbjct: 319 IIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSL--CYSAT---GDLKVPAI 373

Query: 384 NLTMKGG 390
            +   G 
Sbjct: 374 TMHFDGA 380


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 94/349 (26%), Positives = 147/349 (42%), Gaps = 51/349 (14%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           +++G P L +    DTGSDL W  C  C S C               +Y+P++S+T + +
Sbjct: 94  LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 144

Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           PCNS+L             P  G  C Y V Y S G  S     E     +T   QS+  
Sbjct: 145 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGS-GWTSVFQGSETFTFGSTPAGQSRV- 202

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
              I+FGC    +G   + ++ +GL GLG  + S    L +Q  +P  FS C      ++
Sbjct: 203 -PGIAFGCSTASSG--FNASSASGLVGLGRGRLS----LVSQLGVPK-FSYCLTPYQDTN 254

Query: 277 GTGRISFGDK----GSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVNFEFSA--- 325
            T  +  G      G+ G   TPF +   T P    Y + +T +S+G  A++    A   
Sbjct: 255 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLL 314

Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                   I DSGT+ T L + AY Q+     SL        ++    + C++L P+ T+
Sbjct: 315 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAATGLDLCFML-PSSTS 373

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
               + ++T+   G   V      + S+  GL+   +       VNI+G
Sbjct: 374 APPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILG 422


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 34/283 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++S+G P    +   DTGSDL W  C  C  C   ++          ++ P +S T 
Sbjct: 95  YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDP---------LFDPKSSKTY 145

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
               C++  C L  Q   +G+ C YQ  Y  D + + G +  D + L +      S    
Sbjct: 146 RDFSCDARQCSLLDQSTCSGNICQYQYSY-GDRSYTMGNVASDTITLDSTTGSPVSFPKT 204

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGT 278
           +  GCG    G+F D  +  G+ GLG    S+ S + +   +   FS C       +  +
Sbjct: 205 V-IGCGHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNS 259

Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF--------EFSAI 326
            +++FG       PG   TP    +T  + Y +T+  +SVG   + F        E + I
Sbjct: 260 SKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNII 319

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
            DSGT+ T + D  ++ +S    +  + +R    S      CY
Sbjct: 320 IDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGF-LSVCY 361


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 115/429 (26%), Positives = 170/429 (39%), Gaps = 81/429 (18%)

Query: 40  DPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLR------GRGLAAQGNDKTPLTFSAG 92
           + V G+L+ D        A  S+L  R DRY RL            A    + P+T  A 
Sbjct: 95  EEVDGLLSTD-------AARVSSLQRRIDRYRRLMITSSAEVAVAVAASKAQVPVTSGA- 146

Query: 93  NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
               +L +L ++    +  G+      V +DT S+L W+ C  C SC    +        
Sbjct: 147 ----KLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQDP------- 191

Query: 152 FNIYSPNTSSTSSKVPCNSTLCEL---------------QKQCPSAGSNCPYQVRYLSDG 196
             ++ P++S + + VPCNS+ C+                Q Q  SA + C Y + Y  DG
Sbjct: 192 --LFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAA-CSYTLSY-RDG 247

Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
           + S G L  D L LA      + +D  + FGCG    G    G +  GL GLG  + S+ 
Sbjct: 248 SYSRGVLAHDRLSLA-----GEVIDGFV-FGCGTSNQGPPFGGTS--GLMGLGRSQLSLV 299

Query: 257 SILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPTYNI 307
           S   +Q      FS C     SD +G +  GD  S  +  TP       S     P Y +
Sbjct: 300 SQTMDQ--FGGVFSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFV 357

Query: 308 TITQVSVGGNAVN--------FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
            +T ++VGG  V             AI DSGT  T L    Y  +   F S   E  +  
Sbjct: 358 NLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAP 417

Query: 360 TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI--VSSEPKGLYLYCLGVV 417
              +  + C+ ++      + P + L   GG    V+   V+  VSS+   + L    + 
Sbjct: 418 GFSI-LDTCFNMT-GLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLK 475

Query: 418 KSDNVNIIG 426
                NIIG
Sbjct: 476 SEYETNIIG 484


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/299 (26%), Positives = 127/299 (42%), Gaps = 39/299 (13%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           +S+G P +  +V +DTGS L W+ C +C    +   + +GQ     I++P  SST SKV 
Sbjct: 3   ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-----IFNPYNSSTYSKVG 57

Query: 168 CNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           C++  C        ++  C      C Y +RY S G  S G+L +D L LA++    +S+
Sbjct: 58  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYLGKDRLTLASN----RSI 112

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GT 278
           D+ I FGCG       L      G+ G G    S  + +  Q     +FS CF  D    
Sbjct: 113 DNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQTDY-TAFSYCFPRDHENE 166

Query: 279 GRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------AIFDS 329
           G ++ G          T        P Y   I Q+ +  N +  E           I DS
Sbjct: 167 GSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIRLEIDPYIYISKMTIVDS 224

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF-EYPVVNLTM 387
           GT+ TY+  P +  + +      + K  T   D     C++ +    N+ ++P V + +
Sbjct: 225 GTADTYILSPVFDALDKAMTKEMQAKGYTRGWD-ERRICFISNSGSANWNDFPTVEMKL 282


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 142/354 (40%), Gaps = 56/354 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P     + LDTGSDL W  C  CVSC         Q + +  +  + SST+
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFD-------QPLPY--FDTSRSSTN 85

Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           + +PC ST C+L        +       C Y   Y  D +++ G L  D           
Sbjct: 86  ALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSY-GDNSVTIGLLAADKFTFVAGTSLP 144

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
                 ++FGCG   TG F   +   G+ G G    S+PS L        +FS CF +  
Sbjct: 145 G-----VTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQLKV-----GNFSHCFTTI- 191

Query: 278 TGRISF-------GDKGSPGQGE---TP---FSLRQTHPT-YNITITQVSVGGNAVNFEF 323
           TG I          D  S GQG    TP   ++  + +PT Y +++  ++VG   +    
Sbjct: 192 TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 251

Query: 324 SA----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
           SA          I DSGTS T L    Y  + + F   A+ K      +    Y    +P
Sbjct: 252 SAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEF--AAQIKLPVVPGNATGHYTCFSAP 309

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
           +Q   + P + L  +G       +  V    +  G  + CL + K D   IIG 
Sbjct: 310 SQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGN 363


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/287 (27%), Positives = 125/287 (43%), Gaps = 39/287 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  N+ +G P +  I  +DTGSDL W  C  C  C         QV+   ++ P  SST 
Sbjct: 92  YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVP--LFDPKNSSTY 142

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
               C ++ C    + +  S    C ++  Y +DG+ + G L  + L +  D    K V 
Sbjct: 143 RDSSCGTSFCLALGKDRSCSKEKKCTFRYSY-ADGSFTGGNLASETLTV--DSTAGKPVS 199

Query: 222 -SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
               +FGCG    G F    + +G+ GLG  + S+ S L  +  I   FS C       S
Sbjct: 200 FPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQL--KSTINGLFSYCLLPVSTDS 255

Query: 276 DGTGRISFGDKGSP---GQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------E 322
             + RI+FG  G     G   TP   +     Y +T+  +SVG   + +          E
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEE 315

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
            + I DSGT++T+L    Y+++ ++  +  K KR    + + F  CY
Sbjct: 316 GNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGI-FSLCY 361


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/273 (30%), Positives = 120/273 (43%), Gaps = 46/273 (16%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           RL +L ++    V +G   ++ IV  DTGSDL W+ C  C  C +  +          ++
Sbjct: 61  RLQTLNYI--VTVEIGGRNMTVIV--DTGSDLTWVQCQPCRLCYNQQDP---------LF 107

Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
           +P+ S +   + CNS+ C+ LQ        C S    C Y V Y  DG+ + G L  + L
Sbjct: 108 NPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNY-GDGSYTRGDLGMEQL 166

Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
           +L T         S   FGCGR   G F      +GL GLG  K+ +  +     +    
Sbjct: 167 NLGTTHV------SNFIFGCGRNNKGLF---GGASGLMGLG--KSDLSLVSQTSAIFEGV 215

Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFSLRQT-----HPT-YNITITQVSVGGNAV 319
           FS C     +D +G +  G   S  +  TP S  +       PT Y + +T +S+GG A+
Sbjct: 216 FSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVAL 275

Query: 320 ---NFEFSAIF-DSGTSFTYLNDPAYTQISETF 348
              N+  S I  DSGT  T L  P Y  +   F
Sbjct: 276 QAPNYRQSGILIDSGTVITRLPPPVYRDLKAEF 308


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 121/307 (39%), Gaps = 44/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ WL C  C  C    +         +++ P  S T 
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTD---------HVFDPTKSRTY 168

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + +PC + LC       C +    C YQV Y  DG+ + G    + L    +        
Sbjct: 169 AGIPCGAPLCRRLDSPGCSNKNKVCQYQVSY-GDGSFTFGDFSTETLTFRRNRV------ 221

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
           +R++ GCG    G F       GL GLG  + S P     +    + FS C      S  
Sbjct: 222 TRVALGCGHDNEGLF---TGAAGLLGLGRGRLSFPVQTGRR--FNHKFSYCLVDRSASAK 276

Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
              + FGD         TP        T Y + +  +SVGG  V       F   A    
Sbjct: 277 PSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNG 336

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGTS T L  PAY  + + F   A   +      L F+ C+ LS   T  + P V
Sbjct: 337 GVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSL-FDTCFDLS-GLTEVKVPTV 394

Query: 384 NLTMKGG 390
            L  +G 
Sbjct: 395 VLHFRGA 401


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 123/307 (40%), Gaps = 44/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ W+ C  C  C    +          +++P  S + 
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDP---------VFNPTKSRSF 197

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + +PC S LC       C +    C YQV Y  DG+ + G    + L             
Sbjct: 198 ANIPCGSPLCRRLDSPGCSTKKHICLYQVSY-GDGSFTYGEFSTETLTFRGTRV------ 250

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
            R++ GCG    G F+  A    L GLG  + S PS +  +      FS C      S  
Sbjct: 251 GRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSASSK 305

Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
              + FGD         TP        T Y + +  VSVGG  V       F+  +    
Sbjct: 306 PSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNG 365

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGTS T L  PAY  + + F   A   +      L F+ C+ LS  +T  + P V
Sbjct: 366 GVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL-FDTCFDLS-GKTEVKVPTV 423

Query: 384 NLTMKGG 390
            L  +G 
Sbjct: 424 VLHFRGA 430


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 131/330 (39%), Gaps = 42/330 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P    ++ LDTGSD+ WL C  C  C     + SG+V D        +   
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCY----AQSGRVFDPRRSRSYAAVRC 197

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
              PC          C      C YQV Y  DG+++ G L  + L  A   +       R
Sbjct: 198 GAPPCRGLDAGGGGGCDRRRGTCLYQVAY-GDGSVTAGDLATETLWFARGARVP-----R 251

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRIS 282
           ++ GCG    G F+  A   GL      + S+P+  A +      FS CF GSD   R  
Sbjct: 252 VAVGCGHDNEGLFVAAAGLLGLG---RGRLSLPTQTARR--YGRRFSYCFQGSDLDHRTI 306

Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----AIFDSGTSFTYLN 337
                          +R  H        +  VG  ++  + S      I DSGTS T L 
Sbjct: 307 ---------------IRTVHQHVGGARVR-GVGERSLRLDPSTGRGGVILDSGTSVTRLA 350

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVND 397
            P Y  + E F + A   R        F+ CY L   +   + P V++ + GG    +  
Sbjct: 351 RPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRV-VKVPTVSVHLAGGAEVALPP 409

Query: 398 PIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
              ++  + +G   +CL +  +D  V+I+G
Sbjct: 410 ENYLIPVDTRG--TFCLALAGTDGGVSIVG 437


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 90/310 (29%), Positives = 130/310 (41%), Gaps = 53/310 (17%)

Query: 113 QPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
           Q  LS I+  DTGS+   + C          S S  V D     P  S +  +VPC S L
Sbjct: 110 QKNLSAII--DTGSEAVLVQC---------GSRSRPVFD-----PAASQSYRQVPCISQL 153

Query: 173 C-ELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           C  +Q+Q        C ++ + C Y + Y  D   STG   +DV+ L +     ++V  R
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSY-GDSRNSTGDFSQDVIFLNSTNSSGQAVQFR 212

Query: 224 -ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-----G 277
            ++FGC     G FL      G+ G      S+PS L ++ L  + FS CF S       
Sbjct: 213 DVAFGCAHSPQG-FLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRA 270

Query: 278 TGRISFGDKG--SPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
           TG I  GD G      G TP       P     Y + +T +SV G  +    SA      
Sbjct: 271 TGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 330

Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
                 + DSGT+FT + D AYT     F +  +   R+   +   F+ CY +S   +  
Sbjct: 331 TGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLP 390

Query: 379 EYPVVNLTMK 388
             P V L+++
Sbjct: 391 GVPEVRLSLQ 400


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 113/259 (43%), Gaps = 36/259 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  V  G PA  + + +DTGS L WL C  CV   H        V    ++ P+ S T 
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCH--------VQADPLFDPSASKTY 169

Query: 164 SKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             + C S+ C            C ++ + C Y   Y  D + S G+L +D+L LA  +  
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASY-GDSSYSMGYLSQDLLTLAPSQTL 228

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
              V     +GCG+   G F   A   G+ GLG +K S+   ++++     +FS C  + 
Sbjct: 229 PGFV-----YGCGQDSDGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR 278

Query: 277 GTGR-ISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSAIF 327
           G G  +S G     G     TP +    +P+ Y + +T ++VGG A+      +    I 
Sbjct: 279 GGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTII 338

Query: 328 DSGTSFTYLNDPAYTQISE 346
           DSGT  T L    YT   +
Sbjct: 339 DSGTVITRLPMSVYTPFQQ 357


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/310 (26%), Positives = 122/310 (39%), Gaps = 56/310 (18%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVH------GLNSSSGQVIDFNIYSP 157
           ++    VG PA  F++  DTGSDL W+ C    S  H         + S  V    ++ P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169

Query: 158 NTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHL 210
             S T S +PC+S  C+         C S+ + C Y  RY +D + + G +  D   + L
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRY-NDNSAARGVVGTDSATVAL 228

Query: 211 ATDEKQSKSVDSR-----ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
           +         D +     +  GC     G   +  A +G+  LG    S  S  A++   
Sbjct: 229 SGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFE--ASDGVLSLGYSNISFASRAASR--F 284

Query: 266 PNSFSMCF-----GSDGTGRISFG------DKGSPGQG-ETPFSL-RQTHPTYNITITQV 312
              FS C        + T  ++FG         +P  G  TP  L  +  P Y + +  V
Sbjct: 285 GGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSV 344

Query: 313 SVGGNAVNFEFSA---------IFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETS 359
           SV G A++              I DSGTS T L  PAY  +    SE    L +   +  
Sbjct: 345 SVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMD-- 402

Query: 360 TSDLPFEYCY 369
               PF+YCY
Sbjct: 403 ----PFDYCY 408


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 134/334 (40%), Gaps = 53/334 (15%)

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G PA + ++A+DT +D  W+PC  CV C                ++P  S+T  KV C +
Sbjct: 113 GTPAQTLLLAMDTSNDAAWVPCTACVGCSTT-----------TPFAPPKSTTFKKVGCGA 161

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDSRISFGCG 229
           + C+  +     GS C +   Y   GT S    LV+D + LATD   +       +FGC 
Sbjct: 162 SQCKQVRNPTCDGSACAFNFTY---GTSSVAASLVQDTVTLATDPVPA------YTFGCI 212

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----GRISFGD 285
           +  TGS L      GL    +   +       Q L  ++FS C  S  T    G      
Sbjct: 213 QKATGSSLPPQGLLGLGRGPLSLLA-----QTQKLYQSTFSYCLPSFKTLNFSGHXDLXP 267

Query: 286 KGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
              P     P F   +    Y + +  + VG   V+    A           +FDSGT F
Sbjct: 268 VAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVF 327

Query: 334 TYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
           T L +PAYT +   F   ++  K+ T TS   F+ CY +         P +     G   
Sbjct: 328 TRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTVP-----IVAPTITFMFSGMNV 382

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNII 425
               D I+I S+      + CL +  + DNVN +
Sbjct: 383 TLPPDNILIHST---AGSVTCLAMAPAPDNVNSV 413


>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 143/350 (40%), Gaps = 57/350 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  +V +G PA + IV +DTGS   W+ C+C  C H          +   +  + S+T +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50

Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           KV C +++C L    P         +CP++V Y  DG+ S G L +D L  +  +K    
Sbjct: 51  KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
                SFGC     G+   G   +GL G+G    SV   L       + FS C       
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSE 160

Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
             F S  TG  S G   +          + ++    + + +  +SV G  +    S    
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSR 220

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLSPNQT 376
              +FDSG+  +Y+ D A + +S+    L      A+E+ E +        CY +     
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRSVDE 272

Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             + P ++L       F +    V V    +   ++CL    +++V+IIG
Sbjct: 273 G-DMPAISLHFDDAARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 126/299 (42%), Gaps = 37/299 (12%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G P       +DT +D  W  C+ C  C    N++S       ++ P+ SST   +PC+
Sbjct: 95  IGTPPFQLYGVMDTANDNIWFQCNPCKPC---FNTTSP------MFDPSKSSTYKTIPCS 145

Query: 170 STLCE--LQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           S  C+      C S     C Y   Y  +   S G L  D L L ++     S  + I  
Sbjct: 146 SPKCKNVENTHCSSDDKKVCEYSFTYGGEA-YSQGDLSIDTLTLNSNNDTPISFKN-IVI 203

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDG-TGRI 281
           GCG    G  L+G   +G  GLG    S  S L +   I   FS C    F ++G +G++
Sbjct: 204 GCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISGKL 259

Query: 282 SFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGT 331
            FGDK    G G     +      Y+ T+  +SVG + + FE S          I DSGT
Sbjct: 260 HFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGT 319

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           + T L +  Y+++     S+ K +R  S +   F+ CY       N + P++     G 
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAKSPNQ-QFKLCY--KATLKNLDVPIITAHFNGA 375


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/329 (25%), Positives = 131/329 (39%), Gaps = 43/329 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +  ++++G P L     LDTGSDL W  CD  C  C               +Y+P  S+T
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142

Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + V C S +C+ LQ    +C    + C Y   Y  DGT + G L  +   L +D     
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
                ++FGCG    GS  +    +GL G+G      P  L +Q  +      C      
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRG----PLSLVSQLGVTRPRRSCRARAAA 249

Query: 279 GRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLN 337
                    SP +G T   +L    P     +T +  GG         I DSGT+FT L 
Sbjct: 250 RGGGAPTTTSPLEGITVGDTLLPIDPAV-FRLTPMGDGG--------VIIDSGTTFTALE 300

Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVND 397
           + A+  ++    S  +     S + L    C+  +  +   E P + L   G       +
Sbjct: 301 ERAFVALARALASRVRLPL-ASGAHLGLSLCFAAASPEA-VEVPRLVLHFDGADMELRRE 358

Query: 398 PIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             V+   E +   + CLG+V +  ++++G
Sbjct: 359 SYVV---EDRSAGVACLGMVSARGMSVLG 384


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 85/351 (24%), Positives = 142/351 (40%), Gaps = 41/351 (11%)

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           R  Y + R     A   D   +T       + ++SL ++    +  G P++  ++ +DTG
Sbjct: 89  RTNYIKSRASTGMASTPDDAAVTVPTRLGGF-VDSLEYM--VTLGFGTPSVPQVLLMDTG 145

Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-----ELQKQCP 180
           SD+ W+   C  C    NS+        ++ P+ SST + + C +  C       +  C 
Sbjct: 146 SDVSWV--QCAPC----NSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCT 199

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
           S G+ C Y+V Y  DG+ + G    + +  A              FGCG  Q G      
Sbjct: 200 SGGTQCGYRVEY-GDGSSTRGVYSNETITFAPGITVKD-----FHFGCGHDQRGP---SD 250

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPF-- 296
             +GL GLG    S+  ++    +   +FS C  +     G ++ G + S     + F  
Sbjct: 251 KFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVF 308

Query: 297 ----SLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISET 347
                L     +Y + +T +SVGG  ++   SA     + DSGT  T L + AY  ++  
Sbjct: 309 TPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAA 368

Query: 348 FNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
                      ++ D  F+ CY  +   +N   P V LT  GG    ++ P
Sbjct: 369 LRKAFAAYPMVASED--FDTCYNFT-GYSNVTVPRVALTFSGGATIDLDVP 416


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 58/345 (16%)

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
            +R  RL    LA     +TP+  ++GN  Y ++         +S G P       +DTG
Sbjct: 62  HERRARLAKHVLAGDQLFETPV--ASGNGEYLID---------ISYGNPPQKSTAIVDTG 110

Query: 126 SDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAG 183
           SDL W+ C  C SC   L++          + P+ S++   + C S  C+ L  Q  S  
Sbjct: 111 SDLNWVQCLPCKSCYETLSAK---------FDPSKSASYKTLGCGSNFCQDLPFQ--SCA 159

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
           ++C Y   Y  DG+ ++G L  D + + T +  +      ++FGCG    G+F       
Sbjct: 160 ASCQYDYMY-GDGSSTSGALSTDDVTIGTGKIPN------VAFGCGNSNLGTFAGAGG-- 210

Query: 244 GLFGLGMDKTSVPSILANQ--GLIPNSFSMC---FGSDGTGRISFGDKG-SPGQGETPFS 297
                 +     P  L +Q  G     FS C    GS  T  +  GD   + G   TP  
Sbjct: 211 -----LVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAYTPML 265

Query: 298 LRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IFDSGTSFTYLNDPAYTQIS 345
               +PT Y   +  +SV G AVN     F+ +A      I DSGT+ TYL+  A+  + 
Sbjct: 266 TNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMV 325

Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
               + A    E   S    EYC+  +    N  YP V     G 
Sbjct: 326 AALKA-ALPYPEADGSFYGLEYCFS-TAGVANPTYPTVVFHFNGA 368


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 110/447 (24%), Positives = 163/447 (36%), Gaps = 70/447 (15%)

Query: 1   MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDP-VKGILAVDDLPKKGSFAY 59
           M+SS        +L+ L  CA    G  +        +SDP +     V D  ++     
Sbjct: 1   MSSSTSQMASLAVLVFLVVCATLASGAASVRVGLTRIHSDPDITAPEFVRDALRRD---- 56

Query: 60  YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
                HR +   L GR LA   +D T ++     D       G  +   +S+G P LS+ 
Sbjct: 57  ----MHRQQSRSLFGRELAE--SDGTTVSARTRKDLPN----GGEYLMTLSIGTPPLSYP 106

Query: 120 VALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-- 174
              DTGSDL W    PC    C               +Y+P +S+T   +PCNS+L    
Sbjct: 107 AIADTGSDLIWTQCAPCSGDQCF---------AQPAPLYNPASSTTFGVLPCNSSLSMCA 157

Query: 175 --LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ 232
             L  + P  G  C Y   Y +  T   G    +     +       V   I+FGC    
Sbjct: 158 GVLAGKAPPPGCACMYNQTYGTGWT--AGVQGSETFTFGSAAADQARVPG-IAFGCSNAS 214

Query: 233 TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGS 288
           +  + +G+A  GL GLG    S+ S L         FS C      ++ T  +  G   +
Sbjct: 215 SSDW-NGSA--GLVGLGRGSLSLVSQLGA-----GRFSYCLTPFQDTNSTSTLLLGPSAA 266

Query: 289 ---PGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSG 330
               G   TPF            Y + +T +S+G  A++    A           I DSG
Sbjct: 267 LNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSG 326

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL-SPNQTNFEYPVVNLTMKG 389
           T+ T L + AY Q+     SL        +     + CY L +P       P + L   G
Sbjct: 327 TTITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFDG 386

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGV 416
                  D  +I      G  ++CL +
Sbjct: 387 ADMVLPADSYMI-----SGSGVWCLAM 408


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/312 (25%), Positives = 131/312 (41%), Gaps = 32/312 (10%)

Query: 94  DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDF 152
           +T  +++LG  +  + SVG P+L     LDTGSD+ WL C  C  C              
Sbjct: 79  ETTVISALG-EYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTP-------- 129

Query: 153 NIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
            I+  + S T   +PC S  C+ +Q    S+  +C Y + Y+ DG+ S G L  + L L 
Sbjct: 130 -IFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYV-DGSQSLGDLSVETLTLG 187

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
           +           +  GCGR       +  +  G+ GLG    S+ + L+        FS 
Sbjct: 188 STNGSPVQFPGTV-IGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKFSY 242

Query: 272 CFG---SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---- 321
           C     S  + +++FG+       G   TP   +     Y +T+   SVG N + F    
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302

Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
              + + I DSGT+ T L +  Y+++          +R    + +    CY ++P++ + 
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQV-LGLCYKVTPDKLDA 361

Query: 379 EYPVVNLTMKGG 390
             PV+     G 
Sbjct: 362 SVPVITAHFSGA 373


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 154/381 (40%), Gaps = 50/381 (13%)

Query: 63  LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHY-TNVSVGQPALSFIVA 121
           L H  R  +  G G +   +   PLT  A        S+   +Y T + +G PA S+++ 
Sbjct: 96  LLHGHRKKKAGGVGGSQASSSSVPLTPGA--------SVAVGNYVTRLGLGTPATSYVMV 147

Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQC- 179
           +DTGS L WL   C  C    +  +G V D     P  S T + V C+S+ C ELQ    
Sbjct: 148 VDTGSSLTWL--QCSPCSVSCHRQAGPVFD-----PRASGTYAAVQCSSSECGELQAATL 200

Query: 180 -PSAGS---NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
            PSA S    C YQ  Y  D + S G+L +D +   +             +GCG+   G 
Sbjct: 201 NPSACSVSNVCIYQASY-GDSSYSVGYLSKDTVSFGSGSFPG------FYYGCGQDNEGL 253

Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQ-G 292
           F   A   GL GL  +K S+   LA    +  +FS C    S   G +S G   +PGQ  
Sbjct: 254 FGRSA---GLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIGSY-NPGQYS 307

Query: 293 ETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGTSFTYLNDPAYTQIS 345
            TP +      + Y +T++ +SV G  +            I DSGT  T L    YT +S
Sbjct: 308 YTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLPPNVYTALS 367

Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
               +        + +    + C+  S        P V++   GG    ++   V++  +
Sbjct: 368 RAVAAAMASAAPRAPTYSILDTCFRGS--AAGLRVPRVDMAFAGGATLALSPGNVLIDVD 425

Query: 406 PKGLYLYCLGVVKSDNVNIIG 426
                  CL    +    IIG
Sbjct: 426 DS---TTCLAFAPTGGTAIIG 443


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           LN+L +L    V +G PA S  + +DTGSD+ W+ C   S  H             ++ P
Sbjct: 47  LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 96

Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           ++SST S   C S  C    Q    C S+ S C Y V Y  DG+ +TG    D L L + 
Sbjct: 97  SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 154

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
             +S        FGC  V++G F D    +GL GLG    S+ S  A  G +  +FS C 
Sbjct: 155 AVRS------FQFGCSNVESG-FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 203

Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
                 +G ++ G  G  G     +TP       PT Y + +  + VGG  ++     F 
Sbjct: 204 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 263

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              + DSGT  T L   AY+ +S  F +  K+      S +  + C+  S  Q++   P 
Sbjct: 264 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 321

Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
           V L   GG    ++   +I+S
Sbjct: 322 VALVFSGGAVVSLDASGIILS 342


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/313 (28%), Positives = 133/313 (42%), Gaps = 61/313 (19%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
           HR R    RGR L       + L+  +G            ++  + +G P  S+ + LDT
Sbjct: 20  HRHR----RGRSLLQTAQVSSGLSLGSGE-----------YFARMGIGSPQRSYYLELDT 64

Query: 125 GSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG 183
           GSD+ W+ C  C SC   ++          IY P+ SS+  +V C S LC+        G
Sbjct: 65  GSDVTWIQCAPCSSCYSQVDP---------IYDPSNSSSYRRVYCGSALCQALDYSACQG 115

Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
             C Y+V Y  D + S+G L  +  +L  +   S +    I+FGCG   +G F   A   
Sbjct: 116 MGCSYRVVY-GDSSASSGDLGIESFYLGPN---SSTAMRNIAFGCGHSNSGLFRGEAGLL 171

Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS-FGDKGSP---GQGETPFSLR 299
           G+ G  +   S   I A+ G    +FS C       R S    + SP   G+   PF+ R
Sbjct: 172 GMGGGTLSFFS--QIAASIG---PAFSYCL----VDRYSQLQSRSSPLIFGRTAIPFAAR 222

Query: 300 QT----HPT----YNITITQVSVGGNAV-----------NFEFSAIFDSGTSFTYLNDPA 340
            T    +P     Y   +T +SVGG A+           N    AI DSGTS T +   A
Sbjct: 223 FTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAA 282

Query: 341 YTQISETFNSLAK 353
           Y  + + + + ++
Sbjct: 283 YAVLRDAYRAASR 295


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/310 (29%), Positives = 126/310 (40%), Gaps = 53/310 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++ +V VG P   F + LDTGSDL W+   CV C      +         Y P  SS+  
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWI--QCVPCYECFEQNGPH------YDPGQSSSYR 232

Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV---LHLATDEK 215
            + C+ + C L       + C +    CPY   Y      +  F +E     L +++ + 
Sbjct: 233 NIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKP 292

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           + + V++ + FGCG    G F   A    L GLG    S  S L  Q L  +SFS C   
Sbjct: 293 ELRRVEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 346

Query: 274 -GSDG--TGRISFGDK----GSPGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
             SD   + ++ FG+       P    T     + +P    Y + I  + VGG  VN   
Sbjct: 347 RNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPE 406

Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
                        I DSGT+ +Y  +PAY  I E F  +AK K      D P  E CY  
Sbjct: 407 EKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF--MAKVKGYPVVKDFPVLEPCY-- 462

Query: 372 SPNQTNFEYP 381
             N T  E P
Sbjct: 463 --NVTGVEQP 470


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 121/299 (40%), Gaps = 44/299 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  ++SVG P     + LDTGSDL W    C  C++  +  +  V+D     P  SST +
Sbjct: 94  YLVHLSVGTPPRPVALTLDTGSDLVW--TQCAPCLNCFDQGAIPVLD-----PAASSTHA 146

Query: 165 KVPCNSTLCELQ--KQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQ 216
            V C++ +C       C   GS     +C Y   Y  D +++ G L  D       D   
Sbjct: 147 AVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHY-GDKSITVGKLASDRFTFGPGDNAD 205

Query: 217 SKSV-DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
              V + R++FGCG    G F   A   G+ G G  + S+PS L        SFS CF S
Sbjct: 206 GGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFSYCFTS 258

Query: 276 DGTGRISFGDKG-SPGQ-------GETPFSLRQTHPT-YNITITQVSVGGNAVNF----- 321
                 S    G +P +         TP     + P+ Y +++  ++VG   +       
Sbjct: 259 MFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQ 318

Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETFNS---LAKEKRETSTSDLPFEYCYVLSPN 374
              E SAI DSG S T L +  Y  +   F +   L     E S  DL F      +P 
Sbjct: 319 RLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPK 377


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 142/337 (42%), Gaps = 41/337 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           + T + +G PA  +I+ +DTGS L WL   C  C    +  SG V D     P TSS+ +
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWL--QCSPCRVSCHRQSGPVFD-----PKTSSSYA 189

Query: 165 KVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            V C++  C       L     S+   C YQ  Y  D + S G+L +D +   ++   + 
Sbjct: 190 AVSCSTPQCNDLSTATLNPAACSSSDVCIYQASY-GDSSFSVGYLSKDTVSFGSNSVPN- 247

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
                  +GCG+   G F   A   GL GL  +K S+   LA    +  SFS C  S  +
Sbjct: 248 -----FYYGCGQDNEGLFGRSA---GLMGLARNKLSLLYQLAPT--LGYSFSYCLPSSSS 297

Query: 279 GRISFGDKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAVNF---EFSA---IFDSG 330
                    +PGQ   TP  S       Y I ++ ++V G  +     E+S+   I DSG
Sbjct: 298 SGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSG 357

Query: 331 TSFTYLNDPAYTQISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           T  T L    Y  +S+      K  KR  + S L  + C+V     ++   P V++   G
Sbjct: 358 TVITRLPTTVYDALSKAVAGAMKGTKRADAYSIL--DTCFV--GQASSLRVPAVSMAFSG 413

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           G    ++   ++V  +       CL    + +  IIG
Sbjct: 414 GAALKLSAQNLLVDVDSSTT---CLAFAPARSAAIIG 447


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 137/346 (39%), Gaps = 49/346 (14%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            +S+G PA+ +   +DTGSDL W  C  C  C               I+ P  SS+ SKV
Sbjct: 111 ELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 161

Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            C+S LC    +  C     +C Y   Y  D + + G L  +         + ++  S I
Sbjct: 162 GCSSGLCNALPRSNCNEDKDSCEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 215

Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
            FGCG    G   DG +  +GL GLG    S+ S L                 S S+  G
Sbjct: 216 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 272

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
           S  +G ++       G+     SL +    P+ Y + +  ++VG   ++ E S       
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                I DSGT+ TYL + A+  + E F S      + S S    + C+ L     N   
Sbjct: 333 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS-TGLDLCFKLPNAAKNIAV 391

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           P +    KG       +  ++  S    L   CL +  S+ ++I G
Sbjct: 392 PKLIFHFKGADLELPGENYMVADSSTGVL---CLAMGSSNGMSIFG 434


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 121/272 (44%), Gaps = 34/272 (12%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           +SL  L Y  +V +G PA++  V +DTGSD+ W+ C+        ++ +G + D     P
Sbjct: 128 SSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFD-----P 182

Query: 158 NTSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
             SST +   C++  C       +     A S C Y V+Y  DG+ +TG    DVL L+ 
Sbjct: 183 AASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTLSG 241

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
            +     V     FGC   + G+ +D    +GL GLG D  S+ S  A +     SFS C
Sbjct: 242 SD-----VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSLVSQTAAR--YGKSFSYC 293

Query: 273 FGSD--GTGRISFGDKGSPGQ------GETPFSLRQTHPTYNI-TITQVSVGGNAVN--- 320
             +    +G ++ G   S G         TP    +  PTY    +  ++VGG  +    
Sbjct: 294 LPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSP 353

Query: 321 --FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
             F   ++ DSGT  T L   AY  +S  F +
Sbjct: 354 SVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 385


>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
          Length = 802

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 146/362 (40%), Gaps = 75/362 (20%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSC-VHGLNSSSGQVIDFNIYSPNTSSTS 163
           Y  V +G P   F V +DTGS   ++ C  C SC  HG N+          Y    SS+ 
Sbjct: 139 YATVLIGTPGHQFEVIVDTGSTYTFVTCYPCASCGQHGSNAP---------YDAAKSSSY 189

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
            +VPC S    +   C ++G  C Y  ++  D  +  G +V DV+ +            R
Sbjct: 190 ERVPCGSGC--IFGACRASGL-CEYDEKFSEDSQVG-GHVVSDVIDVG-----GSLGTPR 240

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS----FSMCFGS-DGT 278
           I FGC  ++T + L     NG+  LG  +  +   L  +   P S    F +C GS +G 
Sbjct: 241 IHFGCNSLET-NMLKTQKANGMIALGRAEAGLHRQLKKKAYPPGSYDGTFGLCLGSFEGG 299

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT------------YNITITQVSVGG---------- 316
           G +S G    P Q    F  R+TH +            YN+ + ++ V            
Sbjct: 300 GVLSLGK--LPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEVHRMFVRNTELKKPSGAE 357

Query: 317 --NAVNFEFSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKE------KRETSTSDLPFEY 367
              A    +  + DSGT++TYL++  +   ISE  + +  +      +      + P + 
Sbjct: 358 LMEAFRAGYGTVLDSGTTYTYLHEDVFIPFISEIEDKVVNDHGANFFRVRGGDPNYPNDV 417

Query: 368 CY-------VLSPNQTNFEYPVVNLTMKGGG------PFFVNDPIVIVSSEPKGLYLYCL 414
           C+        LS +  N+ +P  NLT  G         F   + + +  +EP     +C+
Sbjct: 418 CWRSLNENKQLSESNVNYLFPTFNLTFIGVNEEELPIEFLPENYLFVHPNEPNA---FCV 474

Query: 415 GV 416
           GV
Sbjct: 475 GV 476


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 150/364 (41%), Gaps = 72/364 (19%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           GFL   N+S+G P ++ +V +DTGS L W+ C  C++C     S          + P  S
Sbjct: 103 GFL--VNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTS---------WFDPLKS 151

Query: 161 STSSKVPC--------NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
            +   + C        N   C    Q         Y++RYL  G  S G L ++ L   T
Sbjct: 152 VSFKTLGCGFPGYNYINGYKCNRFNQ-------AEYKLRYLG-GDSSQGILAKESLLFET 203

Query: 213 -DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI-LANQGLIPNSFS 270
            DE + K   S I+FGCG +   +  D A  NG+FGLG    + P I +A Q  + N FS
Sbjct: 204 LDEGKIKK--SNITFGCGHMNIKTNNDDAY-NGVFGLG----AYPHITMATQ--LGNKFS 254

Query: 271 MCFGSDGT-----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
            C G           +  G +GS  +G+ TP  +   H  Y +T+  +SVG   +  + +
Sbjct: 255 YCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGH--YYVTLQSISVGSKTLKIDPN 311

Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE-YCYVLS 372
           A           + DSG ++T L +  +  + +    L K   E   +   FE  C+   
Sbjct: 312 AFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGV 371

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDP------------IVIVSSEPKGLYLYCLGVVKSD 420
            ++    +P V     GG    +               + I+ S  + L L  +G++   
Sbjct: 372 VSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQ 431

Query: 421 NVNI 424
           N N+
Sbjct: 432 NYNV 435


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 140/367 (38%), Gaps = 70/367 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++    VG PA  F++  DTGSDL W+ C       G    +G      ++    S + +
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCS------GAGDGTGDA-PRRVFRAAASRSWA 164

Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            + C+S  C          C S  S C Y  RY +DG+ + G +  D   +A    +S+ 
Sbjct: 165 PIACSSDTCTSYVPFSLANCSSPASPCAYDYRY-NDGSAARGVVGTDSATIALSGSESRD 223

Query: 220 VDSR------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
              R      +  GC     G     +  +G+  LG    S  S  A +      FS C 
Sbjct: 224 GGGRRAKLQGVVLGCTASYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFSYCL 279

Query: 274 -----GSDGTGRISFGDKGSPG-----------QGETPFSL-RQTHPTYNITITQVSVGG 316
                  + T  ++FG  G  G              TP  L R+  P Y + +  V V G
Sbjct: 280 VDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAG 339

Query: 317 NAVNFEFS---------AIFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETSTSDL 363
            A++             AI DSGTS T L  PAY  +    SE    L +   +      
Sbjct: 340 EALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMD------ 393

Query: 364 PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI--VIVSSEPKGLYLYCLGVVKS-- 419
           PFEYCY    N T     +  L ++  G   +  P    +V + P    + C+GV +   
Sbjct: 394 PFEYCY----NWTAAALEIPGLEVRFAGSARLQPPAKSYVVDAAPG---VKCIGVQEGAW 446

Query: 420 DNVNIIG 426
             V++IG
Sbjct: 447 PGVSVIG 453


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 143/345 (41%), Gaps = 48/345 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ + +G PA    + LDTGSD+ WL C  C  C    +          ++ P  SS+ 
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDP---------LFDPALSSSY 246

Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           + VPC+S  C             +  S+C Y+V Y  DG+ + G    + L L  D    
Sbjct: 247 ATVPCDSPHCRALDASACHNNAANGNSSCVYEVAY-GDGSYTVGDFATETLTLGGD---G 302

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---G 274
            +    ++ GCG    G F+  A    L G  +   S PS ++        FS C     
Sbjct: 303 SAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQIS-----ATEFSYCLVDRD 354

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN----------AVNFEFS 324
           S     + FG   S           +++  Y + +  +SVGG           A++ + S
Sbjct: 355 SPSASTLQFGASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGS 414

Query: 325 A--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT+ T L   AY+ + + F    +     S   L F+ CY L+  +++ + P 
Sbjct: 415 GGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL-FDTCYDLA-GRSSVQVPA 472

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
           V+L  +GGG   +     ++  +  G   YCL    +   V+I+G
Sbjct: 473 VSLRFEGGGELKLPAKNYLIPVDGAG--TYCLAFAATGGAVSIVG 515


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 96/354 (27%), Positives = 150/354 (42%), Gaps = 52/354 (14%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
           LG+ +  ++++G P   + + +DTGSDL W+ CD  C  C    N          +Y P+
Sbjct: 61  LGY-YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRN---------RLYKPH 110

Query: 159 TSSTSSKVPCNSTLCELQKQCPS---AGSN--CPYQVRYLSDGTMSTGFLVEDVLHLA-T 212
                  V C   LC   +  P+   AG N  C Y+V Y   G+ S G L+ D + L  T
Sbjct: 111 ----GDLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGS-SLGVLLRDNIPLKFT 165

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNS 268
           +   ++ +   ++FGCG  QT     G  P     G+ GLG  +TS+ S L + GLI N 
Sbjct: 166 NGSLARPM---LAFGCGYDQTHH---GQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNV 219

Query: 269 FSMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSA 325
              C    G G + FGD+  P  G   TP     +   Y      +       + +    
Sbjct: 220 VGHCLSGRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLEL 279

Query: 326 IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYV-------LSPNQTN 377
           IFDSG+S+TY N  A+  +     N L  +    +T D     C+        L    +N
Sbjct: 280 IFDSGSSYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSN 339

Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-----NVNIIG 426
           F+  +++ T     P  +     ++ ++   +   CLG++        N NIIG
Sbjct: 340 FKPLLLSFTKSKNSPLQLPPEAYLIVTKHGNV---CLGILDGTEIGLGNTNIIG 390


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 141/360 (39%), Gaps = 51/360 (14%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            +S+G PA+ +   +DTGSDL W  C  C  C               I+ P  SS+ SKV
Sbjct: 110 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 160

Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            C+S LC    +  C      C Y   Y  D + + G L  +         + ++  S I
Sbjct: 161 GCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 214

Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
            FGCG    G   DG +  +GL GLG    S+ S L                 S S+  G
Sbjct: 215 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 271

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
           S  +G ++       G+     SL +    P+ Y + +  ++VG   ++ E S       
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                I DSGT+ TYL + A+  + E F S      + S S    + C+ L     N   
Sbjct: 332 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS-TGLDLCFKLPDAAKNIAV 390

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
           P +    KG       +  ++  S    L   CL +  S+ ++I G       N ++ H+
Sbjct: 391 PKMIFHFKGADLELPGENYMVADSSTGVL---CLAMGSSNGMSIFGNVQ--QQNFNVLHD 445


>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 81/195 (41%), Gaps = 30/195 (15%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           + +G P  +F   +DTGSDL W+ CD  C  C          +     Y P  ++    V
Sbjct: 58  LQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCT---------LPPIRQYKPKGNT----V 104

Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           PC   +C       + QCP+    C Y+V Y   G+ S G LV D   L        ++ 
Sbjct: 105 PCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGS-SMGALVIDQFPLKL--LNGSAMQ 161

Query: 222 SRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
            R++FGCG  Q    L  A P     G+ GLG  K  V   L   GL  N    C  S G
Sbjct: 162 PRLAFGCGYDQ---ILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSKG 218

Query: 278 TGRISFGDKGSPGQG 292
            G + FGD   P  G
Sbjct: 219 GGYLFFGDTLIPTLG 233


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 140/377 (37%), Gaps = 79/377 (20%)

Query: 63  LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVAL 122
           L  R R      +G ++ G+   P T +    +Y     G   +T  S+G P     V L
Sbjct: 67  LKRRGRASHHSQKGSSSGGHKSIPATAALYPHSY-----GGYAFT-ASLGTPPQPLPVLL 120

Query: 123 DTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC----- 173
           DTGS L W+PC    DC +C      SS       ++ P  SS+S  V C +  C     
Sbjct: 121 DTGSQLTWVPCTSNYDCRNC------SSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHS 174

Query: 174 -ELQKQCP---SAGSNC--------PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            E   +C    S G+NC        PY V Y S  T   G L+ D L      +      
Sbjct: 175 AEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGST--AGLLIADTL------RAPGRAV 226

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA----NQGLIPNSF-------- 269
           S    GC  V          P+GL G G    SVP+ L     +  L+   F        
Sbjct: 227 SGFVLGCSLVSVHQ-----PPSGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSG 281

Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
           S+  G D  G        S    + P+++      Y + ++ V+VGG AV          
Sbjct: 282 SLVLGGDNDGMQYVPLVKSAAGDKQPYAV-----YYYLALSGVTVGGKAVRLPARAFAAN 336

Query: 323 ----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPN 374
                 AI DSGT+FTYL DP   Q        A   R   + D    L    C+ L   
Sbjct: 337 AAGSGGAIVDSGTTFTYL-DPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQG 395

Query: 375 QTNFEYPVVNLTMKGGG 391
             +   P ++L  KGG 
Sbjct: 396 AKSMALPELSLHFKGGA 412


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 85/353 (24%), Positives = 137/353 (38%), Gaps = 56/353 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   VSVG P     + +D+GSD+ W+ C  C+ C          V    ++ P TS+T 
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECY---------VQADPLFDPATSATF 221

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           S V C S +C +             C Y+V Y +DG+ + G L  + L L     +    
Sbjct: 222 SGVSCGSAICRILPTSACGDGELGGCEYEVSY-ADGSYTKGALALETLTLGGTAVEG--- 277

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
              +  GCG    G F+  A   GL GLG    S+   L  +  +  +FS C  S G   
Sbjct: 278 ---VVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQLGGE--VGGAFSYCLASRGGYG 329

Query: 278 -------TGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS--- 324
                   G +  G   +  +G    P       P+ Y + ++ + VG   +  +     
Sbjct: 330 SGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389

Query: 325 --------AIFDSGTSFTYLNDPAYTQISETF-NSLAKE-KRETSTSDLPFEYCYVLSPN 374
                    + D+GT+ T L   AY  + + F  +LA    R    S    + CY LS  
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLS-G 448

Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
             +   P V+    G     +    V++  +   + +YCL     S  ++I+G
Sbjct: 449 YASVRVPTVSFCFDGDARLILAARNVLLEVD---MGIYCLAFAPSSSGLSIMG 498


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 117/285 (41%), Gaps = 46/285 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +  +G P   F + +D+GSDL W+ C  C  C            D  +Y P+ SST 
Sbjct: 64  YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCY---------AQDSPLYVPSNSSTF 114

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV-LHLATDEKQSKSVD- 221
           S VPC S+ C L      A    P   RY   G  +  +L  D          +S +VD 
Sbjct: 115 SPVPCLSSDCLLIP----ATEGFPCDFRY--PGACAYEYLYADTSSSKGVFAYESATVDG 168

Query: 222 ---SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----- 273
               +++FGCG    GSF   AA  G+ GLG    S  S +       N F+ C      
Sbjct: 169 VRIDKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLD 223

Query: 274 GSDGTGRISFGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
            +  +  + FGD+          TP       PT Y + I +V+VGG ++    SA    
Sbjct: 224 PTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEID 283

Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
                  IFDSGT+ TY    AY+ I   F+S     R  S   L
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGL 328


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 102/354 (28%), Positives = 150/354 (42%), Gaps = 57/354 (16%)

Query: 65  HRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
            R +Y + R     GR    +  D T L   +G+     N     +   V +G P     
Sbjct: 6   ERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSAN-----YVVVVGLGTPKRDLS 60

Query: 120 VALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
           +  DTGSDL W  C+ C  SC    ++         I+ P+ SS+ + + C S+LC    
Sbjct: 61  LVFDTGSDLTWTQCEPCAGSCYKQQDA---------IFDPSKSSSYTNITCTSSLCTQLT 111

Query: 175 ---LQKQCPSA-GSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCG 229
              ++ +C S+  ++C Y  +Y  D + S GFL ++ L + ATD      VD  + FGCG
Sbjct: 112 SDGIKSECSSSTDASCIYDAKY-GDNSTSVGFLSQERLTITATD-----IVDDFL-FGCG 164

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSMCF--GSDGTGRISFGDK 286
           +   G F +G+A  GL GLG    S V    +N   I   FS C    S   G ++FG  
Sbjct: 165 QDNEGLF-NGSA--GLMGLGRHPISIVQQTSSNYNKI---FSYCLPATSSSLGHLTFGAS 218

Query: 287 GSPGQG--ETPFS-LRQTHPTYNITITQVSVGGNAV----NFEFSA---IFDSGTSFTYL 336
            +       TP S +   +  Y + I  +SVGG  +    +  FSA   I DSGT  T L
Sbjct: 219 AATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRL 278

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
               Y  +   F     EK   +      + CY LS  +     P ++    GG
Sbjct: 279 APTVYAALRSAFRR-XMEKYPVANEAGLLDTCYDLSGYK-EISVPRIDFEFSGG 330


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 81/282 (28%), Positives = 111/282 (39%), Gaps = 46/282 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P   F + LDTGSDL WL C  C  C H     +G       Y P TS++ 
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFH----QNGM-----FYDPKTSASF 210

Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
             + CN   C L        QC S   +CPY   Y      +  F VE   ++L T E  
Sbjct: 211 KNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGG 270

Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S       + FGCG    G F   +   GL    +  +S       Q L  +SFS C   
Sbjct: 271 SSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVD 325

Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFS------LRQTHPTYNITITQVSVGGNAVNF-- 321
               ++ + ++ FG DK         F+             Y I I  + VGG A++   
Sbjct: 326 RNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPE 385

Query: 322 ---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
                    +   I DSGT+ +Y  +PAY  I   F    KE
Sbjct: 386 ETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE 427


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 95/386 (24%), Positives = 150/386 (38%), Gaps = 66/386 (17%)

Query: 46  LAVDDLPKKGSFAYYSALAHRDRY---------FRLRGRGLAAQGNDKTPLTF---SAGN 93
           L V +  ++G   +   + HRD+           RL GR L         L     S G 
Sbjct: 59  LEVSEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGR-LKRDAKRVASLIRRLSSGGG 117

Query: 94  DTYRLNSLGF-----------LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHG 141
            +YR++  G             ++  + VG P  S  + +D+GSD+ W+ C  C  C H 
Sbjct: 118 GSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 177

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
            +          ++ P  S++ + V C+S++C+  +        C Y+V Y  DG+ + G
Sbjct: 178 SDP---------VFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSY-GDGSYTKG 227

Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
            L  + L         +++   ++ GCG    G F+  A   GL G  M   S    L  
Sbjct: 228 TLALETLTFG------RTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSM---SFVGQLGG 278

Query: 262 QGLIPNSFSMCF---GSDGTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGG 316
           Q     +FS C    G+D +G + FG +  P G    P       P+ Y I +  + VGG
Sbjct: 279 Q--TGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGG 336

Query: 317 NAVNF-----------EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLP 364
             V             +   + D+GT+ T L   AY    + F    A   R T  +   
Sbjct: 337 IRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA--I 394

Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGG 390
           F+ CY L     +   P V+    GG
Sbjct: 395 FDTCYDLL-GFVSVRVPTVSFYFSGG 419


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 87/303 (28%), Positives = 126/303 (41%), Gaps = 44/303 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T V +G PA    + LDTGSD+ WL C  C  C H             I+ P++SS+ 
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEP---------IFEPSSSSSY 201

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C++  C   +      + C Y+V Y  DG+ + G    + L + +   Q+      
Sbjct: 202 EPLSCDTPQCNALEVSECRNATCLYEVSY-GDGSYTVGDFATETLTIGSTLVQN------ 254

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A    L GLG    ++PS L        SFS C     SD    
Sbjct: 255 VAVGCGHSNEGLFVGAAG---LLGLGGGLLALPSQLNT-----TSFSYCLVDRDSDSAST 306

Query: 281 ISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
           + FG    P     P  LR  Q    Y + +T +SVGG  +     +FE         I 
Sbjct: 307 VEFGTSLPPDAVVAPL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 365

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L    Y  + ++F        E +     F+ CY LS  +T  E P V    
Sbjct: 366 DSGTAVTRLQTGIYNSLRDSFLK-GTSDLEKAAGVAMFDTCYNLSA-KTTIEVPTVAFHF 423

Query: 388 KGG 390
            GG
Sbjct: 424 PGG 426


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 122/298 (40%), Gaps = 37/298 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           +   V  G P  +  V  DTGSD+ WL C    V C               ++ P+ SST
Sbjct: 16  YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEP---------LFDPSLSST 66

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
              V C    C        + S C Y V Y  DG+ + GFL  D   L   +K    +  
Sbjct: 67  YRNVSCTEPACVGLSTRGCSSSTCLYGVFY-GDGSSTIGFLAMDTFMLTPAQKFKNFI-- 123

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKT-SVPSILANQGLIPNSFSMCF--GSDGTG 279
              FGCG+  TG F   A   GL GLG   T S+ S +A    + N FS C    S  TG
Sbjct: 124 ---FGCGQNNTGLFQGTA---GLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATG 175

Query: 280 RISFGD-KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGT 331
            ++ G+ + +PG        R   PT Y I +  +SVGG  ++           I DSGT
Sbjct: 176 YLNIGNPQNTPGYTAMLTDTRV--PTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGT 233

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
             T L   AY+ +     + A  +   + +    + CY  S   T+  YPV+ L   G
Sbjct: 234 VITRLPPTAYSALKTAVRA-AMTQYTLAPAVTILDTCYDFS-RTTSVVYPVIVLHFAG 289


>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 431

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 83/289 (28%), Positives = 125/289 (43%), Gaps = 49/289 (16%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           +GF + T +++GQPA  + + +DTGSDL WL CD   C H   +            P   
Sbjct: 68  VGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCD-APCTHCSETPH----------PLHR 115

Query: 161 STSSKVPCNSTLC-ELQKQCPSAGSNCP------YQVRYLSDGTMSTGFLVEDVLHLATD 213
            ++  VPC   LC  LQ   P+   NC       Y++ Y +D   + G L+ DV  L + 
Sbjct: 116 PSNDFVPCRDPLCASLQ---PTEDYNCEHPDQCDYEINY-ADQYSTYGVLLNDVYLLNSS 171

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
                 V  R++ GCG  Q  S       +GL GLG  K S+ S L +QGL+ N    C 
Sbjct: 172 NGVQLKV--RMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCL 229

Query: 274 GSDGTG-----------RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF- 321
            S G G           R+++          TP S   +   Y+    ++  GG      
Sbjct: 230 SSQGGGYIFFGNAYDSARVTW----------TPISSVDSK-HYSAGPAELVFGGRKTGVG 278

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
             +A+FD+G+S+TY N  AY  +    N  L+ +  + +  D     C+
Sbjct: 279 SLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCW 327


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/310 (29%), Positives = 130/310 (41%), Gaps = 53/310 (17%)

Query: 113 QPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
           Q  LS I+  DTGS+   + C          S S  V D     P  S +  +VPC S L
Sbjct: 9   QKNLSAII--DTGSEAVLVQC---------GSRSRPVFD-----PAASQSYRQVPCISQL 52

Query: 173 C-ELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           C  +Q+Q        C ++ + C Y + Y  D   STG   +DV+ L +    S++V  R
Sbjct: 53  CLAVQQQTSNGSSQPCVNSSAACTYSLSY-GDSRNSTGDFSQDVIFLNSTNSSSQAVQFR 111

Query: 224 -ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-----G 277
            ++FGC     G FL      G+ G      S+PS L ++ L  + FS CF S       
Sbjct: 112 DVAFGCAHSPQG-FLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRA 169

Query: 278 TGRISFGDKG--SPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
           TG I  GD G        TP       P     Y + +T +SV G  +    SA      
Sbjct: 170 TGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 229

Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
                 + DSGT+FT + D AYT     F +  +   R+   +   F+ CY +S   +  
Sbjct: 230 TGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLP 289

Query: 379 EYPVVNLTMK 388
             P V L+++
Sbjct: 290 GVPEVRLSLQ 299


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 156/384 (40%), Gaps = 59/384 (15%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A RD    L    LAA+G  +     ++G    +  +    +     +G P    ++A+D
Sbjct: 73  ASRDASRLLYLDSLAARGKARAYAPIASGRQLLQTPT----YVVRARLGTPPQQLLLAVD 128

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
           T +D  W+PC  C  C     +SS    D     P  S++   VPC S LC       CP
Sbjct: 129 TSNDAAWIPCAGCAGC----PTSSAPPFD-----PAASTSYRSVPCGSPLCAQAPNAACP 179

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
             G  C + + Y +D ++    L +D L +A D  ++       +FGC +  TG+    A
Sbjct: 180 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGDAVKT------YTFGCLQKATGT---AA 228

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
            P GL GLG    S   +   + +   +FS C  S    + +G +  G  G P + +T  
Sbjct: 229 PPQGLLGLGRGPLSF--LSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTP 286

Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
            L   H +  Y + +T + VG   V     A           + DSGT FT L  PAY  
Sbjct: 287 LLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVA 346

Query: 344 ISETFNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV 402
           + +      + +     S L  F+ C+    N T   +P V L   G       + +VI 
Sbjct: 347 VRDEV----RRRVGAPVSSLGGFDTCF----NTTAVAWPPVTLLFDGMQVTLPEENVVIH 398

Query: 403 SSEPKGLYLYCLGVVKS-DNVNII 425
           S+      + CL +  + D VN +
Sbjct: 399 STYGT---ISCLAMAAAPDGVNTV 419


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 100/348 (28%), Positives = 144/348 (41%), Gaps = 44/348 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +V +G P   F + +DTGSDL WL C  C+ C       SG + D     P  S + 
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFE----QSGPIFD-----PAASISY 199

Query: 164 SKVPCNSTLCEL--------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
             V C    C L         ++C    S+ CPY   Y  D + +TG L  +   +   +
Sbjct: 200 RNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWY-GDQSNTTGDLALEAFTVNLTQ 258

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSMCF 273
             ++ VD  ++FGCG    G F   A    L GLG    S  S L  +G+   ++FS C 
Sbjct: 259 SGTRRVDG-VAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RGVYGGHAFSYCL 312

Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE--- 322
              GS    +I FG   +    P    T F+      T Y + +  + VGG AVN     
Sbjct: 313 VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDT 372

Query: 323 FSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
            SA   I DSGT+ +Y  +PAY  I + F                   CY +S      E
Sbjct: 373 LSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVS-GAEKVE 431

Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY-LYCLGVVKSDNVNIIG 426
            P ++L    G  +        +  EP+G+  L  LG  +S  ++IIG
Sbjct: 432 VPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRS-GMSIIG 478


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 78/271 (28%), Positives = 107/271 (39%), Gaps = 45/271 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   + +G P   +   LDTGSDL W  C  C+ CV        Q   +  + P  S+T 
Sbjct: 90  YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPARSATY 140

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C S  C            C YQ  Y  D   + G L  +     T+E +       
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFY-GDSASTAGVLANETFTFGTNETRVSL--PG 197

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           ISFGCG +  G   +G   +G+ G G    S+ S L +       FS C   F S    R
Sbjct: 198 ISFGCGNLNAGLLANG---SGMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSR 249

Query: 281 ISFG--------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAV 319
           + FG        +  S     TPF +    PT Y + +T +SVGG            N  
Sbjct: 250 LYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDT 309

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
           +     I DSGT+ TYL +PAY  +   F S
Sbjct: 310 DGTGGTIIDSGTTITYLAEPAYDAVRAAFAS 340


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 141/360 (39%), Gaps = 51/360 (14%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
            +S+G PA+ +   +DTGSDL W  C  C  C               I+ P  SS+ SKV
Sbjct: 2   ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 52

Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            C+S LC    +  C      C Y   Y  D + + G L  +         + ++  S I
Sbjct: 53  GCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 106

Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
            FGCG    G   DG +  +GL GLG    S+ S L                 S S+  G
Sbjct: 107 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 163

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
           S  +G ++       G+     SL +    P+ Y + +  ++VG   ++ E S       
Sbjct: 164 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 223

Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                I DSGT+ TYL + A+  + E F S      + S S    + C+ L     N   
Sbjct: 224 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS-TGLDLCFKLPDAAKNIAV 282

Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
           P +    KG       +  ++  S    L   CL +  S+ ++I G       N ++ H+
Sbjct: 283 PKMIFHFKGADLELPGENYMVADSSTGVL---CLAMGSSNGMSIFGNVQ--QQNFNVLHD 337


>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
          Length = 656

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 142/333 (42%), Gaps = 38/333 (11%)

Query: 74  GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC 133
            R L      +  L  S  N+   LN     HY  + VG P     + +DTGS +   PC
Sbjct: 64  ARTLQIAKTYRRSLFTSDQNEVVPLNLGMGTHYAWIYVGTPPQRVSIIIDTGSGMTAFPC 123

Query: 134 D-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY 192
             C  C +  +      I FN    N SS+   + CN         C +    C    R 
Sbjct: 124 SGCDQCGNHTD------IPFNT---NLSSSIQPISCNHRTYFSCAYCTNPTEPC----RT 170

Query: 193 LSDGTMSTGFLVEDVLHL-----ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
             +G+  +  ++ED+++L     A D     S  +R  FGC   +TG F+   A +G+ G
Sbjct: 171 YMEGSSWSAKVMEDIVYLGDVASAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMG 229

Query: 248 LGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKG-SPGQGETPFSLRQT---H 302
           +  +   + + L  +  IP N+F++CF   G G  + G    S   GE  ++        
Sbjct: 230 IHNNGNDIVTKLFREKKIPSNTFTLCFSPRG-GYFALGAMDTSRHAGEVTYARINDAYGE 288

Query: 303 PTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
             Y + +T + VGG++++ +  A      I DSGT+ + ++  A   + + + +L   K 
Sbjct: 289 NYYAVFMTDIRVGGHSIDIDMKATNSYRYIVDSGTTNSIISGRAGQALMDLYRNLTHLKN 348

Query: 357 ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
             + +D     C +LSP+Q   + P +   M+G
Sbjct: 349 PLNDND-----CILLSPSQIE-QLPTLQFVMEG 375


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 98/344 (28%), Positives = 138/344 (40%), Gaps = 56/344 (16%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G PAL++   +DTGSDL W  C  CV C               ++ P++SST + VPC+
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATVPCS 223

Query: 170 STLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
           S  C      +C SA S C Y   Y  D + + G L  +   LA      KS    + FG
Sbjct: 224 SASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGVVFG 275

Query: 228 CGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR----- 280
           CG    G  F  GA   GL GLG    S+ S L   GL  + FS C  S D T       
Sbjct: 276 CGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSPLLL 327

Query: 281 -----ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
                IS     +     TP     + P+ Y +++  ++VG   ++   SA         
Sbjct: 328 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 387

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FEYPV 382
             I DSGTS TYL    Y  + + F +          S +  + C+       +  E P 
Sbjct: 388 GVIVDSGTSITYLEVQGYRALKKAFAAQMALP-AADGSGVGLDLCFRAPAKGVDQVEVPR 446

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           +     GG    +     +V     G    CL V+ S  ++IIG
Sbjct: 447 LVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 488


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 84/306 (27%), Positives = 119/306 (38%), Gaps = 51/306 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     + LDT +D  W+PC  C  C                + PN S+T 
Sbjct: 45  YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC------------SSTTFLPNASTTL 92

Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             + C+   C   +   CP+ GS+ C +   Y  D +++   LV+D + LA D      V
Sbjct: 93  GSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLA-ATLVQDAITLAND------V 145

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
               +FGC    +G  +    P GL GLG    S+  I     +    FS C  S     
Sbjct: 146 IPGFTFGCINAVSGGSIP---PQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 200

Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
            +G +  G  G P    T   LR  H P+ Y + +T VSVG   V           N   
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 260

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT  T    P Y  I + F    K+     +S   F+ C+  +      E P V
Sbjct: 261 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAAT---NEAEAPAV 314

Query: 384 NLTMKG 389
            L  +G
Sbjct: 315 TLHFEG 320


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 81/174 (46%), Gaps = 20/174 (11%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
           T + +G P   F + +D+GS + ++PC DC  C        G+  D   + P  SST   
Sbjct: 95  TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPEMSSTYQP 145

Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           V CN     +   C      C Y+  Y ++ + S G L ED++       +S+    R  
Sbjct: 146 VKCN-----MDCNCDDDREQCVYEREY-AEHSSSKGVLGEDLISFG---NESQLTPQRAV 196

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
           FGC  V+TG      A +G+ GLG    S+   L ++GLI NSF +C+G    G
Sbjct: 197 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 84/306 (27%), Positives = 119/306 (38%), Gaps = 51/306 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     + LDT +D  W+PC  C  C                + PN S+T 
Sbjct: 98  YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS------------TTFLPNASTTL 145

Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             + C+   C   +   CP+ GS+ C +   Y  D ++ T  LV+D + LA D      V
Sbjct: 146 GSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSL-TATLVQDAITLAND------V 198

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
               +FGC    +G  +    P GL GLG    S+  I     +    FS C  S     
Sbjct: 199 IPGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 253

Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
            +G +  G  G P    T   LR  H P+ Y + +T VSVG   V           N   
Sbjct: 254 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 313

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT  T    P Y  I + F    K+     +S   F+ C+  +      E P +
Sbjct: 314 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAAT---NEAEAPAI 367

Query: 384 NLTMKG 389
            L  +G
Sbjct: 368 TLHFEG 373


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 138/328 (42%), Gaps = 49/328 (14%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           + +   + +G P       LDTGS+  W    C+ CVH  N ++       I+ P+ SST
Sbjct: 57  YEYLMKLQIGTPPFEIEAVLDTGSEHIW--TQCLPCVHCYNQTA------PIFDPSKSST 108

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             ++           +C +   +CPY++ Y    + + G LV + + + +   Q   +  
Sbjct: 109 FKEI-----------RCDTHDHSCPYELVY-GGKSYTKGTLVTETVTIHSTSGQPFVMPE 156

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
            I  GCGR  +G F  G A  G+  +G+D+     I    G  P   S CF   GT +I+
Sbjct: 157 TI-IGCGRNNSG-FKPGFA--GV--VGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKIN 210

Query: 283 FGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--------FEFSAIFDSG 330
           FG        G   T   ++   P  Y + +  VSVG   +          + + + DSG
Sbjct: 211 FGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSG 270

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE-YPVVNLTMKG 389
           ++ TY          E++ +L ++  E   + + F    +L       + +PV+ +   G
Sbjct: 271 STLTYF--------PESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDIFPVITMHFSG 322

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           G    ++   + V+S   G  ++CL ++
Sbjct: 323 GADLVLDKYNMYVASNTGG--VFCLAII 348


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 121/307 (39%), Gaps = 53/307 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P     + LDT +D  W+PC  C  C                + PN S+T 
Sbjct: 45  YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC------------SSTTFLPNASTTL 92

Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             + C+   C   +   CP+ GS+ C +   Y  D +++   LV+D + LA D      V
Sbjct: 93  GSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLA-ATLVQDAITLAND------V 145

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
               +FGC    +G  +    P GL GLG    S+  I     +    FS C  S     
Sbjct: 146 IPGFTFGCINAVSGGSIP---PQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 200

Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
            +G +  G  G P    T   LR  H P+ Y + +T VSVG   V           N   
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 260

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FEYPV 382
             I DSGT  T    P Y  I + F    K+     +S   F+ C+     +TN  E P 
Sbjct: 261 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFA----ETNEAEAPA 313

Query: 383 VNLTMKG 389
           V L  +G
Sbjct: 314 VTLHFEG 320


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 139/338 (41%), Gaps = 43/338 (12%)

Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
           LDTGS L WL C  C    H             +Y P+ S T  K+ C S  C   K   
Sbjct: 3   LDTGSSLSWLQCQPCAVYCHAQADP--------LYDPSVSKTYKKLSCASVECSRLKAAT 54

Query: 179 -----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
                C +  + C Y   Y  D + S G+L +D+L L + +   +      ++GCG+   
Sbjct: 55  LNDPLCETDSNACLYTASY-GDTSFSIGYLSQDLLTLTSSQTLPQ-----FTYGCGQDNQ 108

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG----SP 289
           G F   A   G+ GL  DK S+ + L+ +    ++FS C  +  +G    G       SP
Sbjct: 109 GLFGRAA---GIIGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGFLSIGSISP 163

Query: 290 GQGE-TPFSLRQTHPT-YNITITQVSVGGN-----AVNFEFSAIFDSGTSFTYLNDPAYT 342
              + TP      +P+ Y + +T ++V G      A  +    + DSGT  T L    Y 
Sbjct: 164 TSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYA 223

Query: 343 QISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV 402
            + + F  +   K   + +    + C+  S    +   P + +  +GG    +  P +++
Sbjct: 224 ALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSIS-AVPEIKMIFQGGADLTLRAPSILI 282

Query: 403 SSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
            ++     L   G   ++ + IIG    + Y IA ++S
Sbjct: 283 EADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 320


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 124/300 (41%), Gaps = 36/300 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  + VGQP  S+    DTGSD+ WL C      +G     G + D     P +SS+ S
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFD-----PKSSSSYS 238

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            + C+S  C L  +     ++C Y+V Y  DG+ + G L  +        + S S+   +
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEY-GDGSFTVGELATETFSF----RHSNSI-PNL 292

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
             GCG    G F+  A         +        L++Q L   SFS C     S+ +  +
Sbjct: 293 PIGCGHDNEGLFVGAAG-------LIGLGGGAISLSSQ-LEATSFSYCLVDLDSESSSTL 344

Query: 282 SFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDS 329
            F          +P       PT+  + +  +SVGG  +     +FE         I DS
Sbjct: 345 DFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 404

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GT+ T +    Y  + + F  L K     +    PF+ CY LS +Q+N E P +   + G
Sbjct: 405 GTTITEIPSDVYDVLRDAFVGLTK-NLPPAPGVSPFDTCYDLS-SQSNVEVPTIAFILPG 462


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 78/300 (26%), Positives = 122/300 (40%), Gaps = 36/300 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++  + VGQP  S+    DTGSD+ WL C      +G     G + D     P +SS+ S
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFD-----PKSSSSYS 238

Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
            + C+S  C L  +     ++C Y+V Y  DG+ + G L  +        + S S+   +
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEY-GDGSFTVGELATETFSF----RHSNSI-PNL 292

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
             GCG    G F+            +        L++Q L   SFS C     S+ +  +
Sbjct: 293 PIGCGHDNEGLFVGADG-------LIGLGGGAISLSSQ-LEATSFSYCLVDLDSESSSTL 344

Query: 282 SFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDS 329
            F          +P       PT+  + +  +SVGG  +     +FE         I DS
Sbjct: 345 DFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 404

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GT+ T +    Y  + + F  L K          PF+ CY LS +Q+N E P +   + G
Sbjct: 405 GTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVS-PFDTCYDLS-SQSNVEVPTIAFILPG 462


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 144/352 (40%), Gaps = 66/352 (18%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           N+SVG P L+F V  DTGSDL W  C  C  C                + P +SST SK+
Sbjct: 89  NISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139

Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
           PC S+ C+      + C + G  C Y  +Y S  T   G+L  + L +      S     
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN----SFSMCFGSD-- 276
            ++FGC   + G    G + +G+ GLG    S         LIP      FS C  S   
Sbjct: 191 -VAFGC-STENGV---GNSTSGIAGLGRGALS---------LIPQLGVGRFSYCLRSGSA 236

Query: 277 -GTGRISFGDKGSPGQG---ETPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS- 324
            G   I FG   +   G    TPF +    HP+ Y + +T ++VG   +      F F+ 
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296

Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
                  I DSGT+ TYL    Y  + + F S       T       + C+  +      
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS-QTANVTTVNGTRGLDLCFKSTGGGGGI 355

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKG-LYLYCLGVV--KSDN-VNIIG 426
             P + L   GG  + V      V ++ +G + + CL ++  K D  +++IG
Sbjct: 356 AVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIG 407


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 80/274 (29%), Positives = 121/274 (44%), Gaps = 46/274 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + +G P  S+ + LDTGSD+ W+ C  C SC   ++          IY P+ SS+ 
Sbjct: 12  YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDP---------IYDPSNSSSY 62

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
            +V C S LC+        G  C Y+V Y  D + S+G L  +  +L  +   S +    
Sbjct: 63  RRVYCGSALCQALDYSACQGMGCSYRVVY-GDSSASSGDLGIESFYLGPN---SSTAMRN 118

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS- 282
           I+FGCG   +G F   A   G+ G  +   S   I A+ G    +FS C       R S 
Sbjct: 119 IAFGCGHSNSGLFRGEAGLLGMGGGTLSFFS--QIAASIG---PAFSYCL----VDRYSQ 169

Query: 283 FGDKGSP---GQGETPFSLRQT----HPTYNI----TITQVSVGGNAV-----------N 320
              + SP   G+   PF+ R T    +P  N      +T +SVGG  +           N
Sbjct: 170 LQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGN 229

Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
               AI DSGTS T +  PAY  + + + + ++ 
Sbjct: 230 GTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRN 263


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 89/323 (27%), Positives = 130/323 (40%), Gaps = 48/323 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++  V +G P     +  DTGSDL W  C+    SC    +          I+ P+ S++
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDV---------IFDPSKSTS 196

Query: 163 SSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDE 214
            S + C S LC            C ++   C Y ++Y  D + S G+   + L + ATD 
Sbjct: 197 YSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQY-GDSSFSVGYFSRERLTVTATD- 254

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
                V     FGCG+   G F   A   GL GLG    S     A +     S+ +   
Sbjct: 255 -----VVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAKYRKIFSYCLPST 306

Query: 275 SDGTGRISFGDKGSPGQGE-TPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------AI 326
           S  TG +SFG   +    + TPFS + +    Y + IT ++VGG  +    S      AI
Sbjct: 307 SSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAI 366

Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
            DSGT  T L   AY  +   F      K  ++      + CY LS  +  F  P +  +
Sbjct: 367 IDSGTVITRLPPTAYGALRSAFRQ-GMSKYPSAGELSILDTCYDLSGYKV-FSIPTIEFS 424

Query: 387 MKGGGPFFVNDPIVIVSSEPKGL 409
             GG         V V   P+G+
Sbjct: 425 FAGG---------VTVKLPPQGI 438


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 115/274 (41%), Gaps = 42/274 (15%)

Query: 102 GFLHY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
           G L Y  ++++G P       LDTGSDL W  C  C SC+   +          +++P  
Sbjct: 99  GDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDP---------LFAPAA 149

Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           SS+   + C+  LC   L   C      C Y+  Y  DGT + G    +    A+   + 
Sbjct: 150 SSSYVPMRCSGQLCNDILHHSCQRP-DTCTYRYNY-GDGTTTLGVYATERFTFASSSGEK 207

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG----LIP----NSF 269
            SV   + FGCG +  GS  +G   +G+ G G D  S+ S L+ +     L P       
Sbjct: 208 LSVP--LGFGCGTMNVGSLNNG---SGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKS 262

Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-- 325
           ++ FGS   G +  GD  + GQ +T   L  RQ    Y +  T V+VG   +    SA  
Sbjct: 263 TLMFGSLSDG-VFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFA 321

Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNS 350
                    I DSGT+ T       T++   F +
Sbjct: 322 LRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRA 355


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 138/328 (42%), Gaps = 49/328 (14%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           + +   + +G P       LDTGS+  W    C+ CVH  N ++       I+ P+ SST
Sbjct: 63  YEYLMKLQIGTPPFEIEAVLDTGSEHIW--TQCLPCVHCYNQTA------PIFDPSKSST 114

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             ++           +C +   +CPY++ Y    + + G LV + + + +   Q   +  
Sbjct: 115 FKEI-----------RCDTHDHSCPYELVY-GGKSYTKGTLVTETVTIHSTSGQPFVMPE 162

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
            I  GCGR  +G F  G A  G+  +G+D+     I    G  P   S CF   GT +I+
Sbjct: 163 TI-IGCGRNNSG-FKPGFA--GV--VGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKIN 216

Query: 283 FGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--------FEFSAIFDSG 330
           FG        G   T   ++   P  Y + +  VSVG   +          + + + DSG
Sbjct: 217 FGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSG 276

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE-YPVVNLTMKG 389
           ++ TY          E++ +L ++  E   + + F    +L       + +PV+ +   G
Sbjct: 277 STLTYF--------PESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDIFPVITMHFSG 328

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           G    ++   + V+S   G  ++CL ++
Sbjct: 329 GADLVLDKYNMYVASNTGG--VFCLAII 354


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 123/305 (40%), Gaps = 49/305 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +   V +G P     + LDT +D  W+PC   S   G +S++        + PN S+T  
Sbjct: 98  YVVRVKLGTPGQQMFMVLDTSNDAAWVPC---SGCTGFSSTT--------FLPNASTTLG 146

Query: 165 KVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
            + C+   C   +   CP+ GS+ C +   Y  D ++ T  LV+D + LA D      V 
Sbjct: 147 SLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSL-TATLVQDAITLAND------VI 199

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG---- 277
              +FGC    +G  +    P GL GLG    S+  I     +    FS C  S      
Sbjct: 200 PGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYYF 254

Query: 278 TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEFS 324
           +G +  G  G P    T   LR  H P+ Y + +T VSVG   V           N    
Sbjct: 255 SGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG 314

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT  T    P Y  I + F    K+     +S   F+ C+  +      E P + 
Sbjct: 315 TIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAAT---NEAEAPAIT 368

Query: 385 LTMKG 389
           L  +G
Sbjct: 369 LHFEG 373


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 145/364 (39%), Gaps = 58/364 (15%)

Query: 52  PKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT---PLTFSAGNDTYRLNSLGFLHYTN 108
           P   +  +  A  HRD + R   R LAA  +D T   P++ +     + +          
Sbjct: 39  PSVTASQFVRAALHRDMH-RHNARKLAASSSDGTVSAPVSPTTVPGEFLMT--------- 88

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID--FNIYSPNTSSTSSKV 166
           +++G P L F+   DTGSDL W    C  C       S Q       +Y+P++S+T S +
Sbjct: 89  LAIGTPPLPFLAIADTGSDLIW--TQCAPC-------SRQCFQQPTPLYNPSSSTTFSAL 139

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           PCNS+L      C      C Y + Y S  T    F   +     +     +     I+F
Sbjct: 140 PCNSSLGLCAPAC-----ACMYNMTYGSGWTYV--FQGTETFTFGSSTPADQVRVPGIAF 192

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRIS 282
           GC    +G   + ++ +GL GLG    S+ S L         FS C      ++ T  + 
Sbjct: 193 GCSNASSG--FNASSASGLVGLGRGSLSLVSQLGAP-----KFSYCLTPYQDTNSTSTLL 245

Query: 283 FGDKGSPGQ----GETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
            G   S         TPF    +   Y + +T +S+G  A+      F   A      I 
Sbjct: 246 LGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLII 305

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L + AY Q+     SL        ++    + C+ L P+ T+    + ++T+
Sbjct: 306 DSGTTITMLGNTAYQQVRAAVLSLVTLPTTDGSAATGLDLCFEL-PSSTSAPPSMPSMTL 364

Query: 388 KGGG 391
              G
Sbjct: 365 HFDG 368


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 85/340 (25%), Positives = 130/340 (38%), Gaps = 50/340 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +     +G PA + +VA+D  +D  W+PC   +      S          + P  SST  
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPS----------FDPTRSSTYR 156

Query: 165 KVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
            V C +  C  Q   PS     GS+C + + Y +  +     L +D L L  D     + 
Sbjct: 157 PVRCGAPQCS-QAPAPSCPGGLGSSCAFNLSYAA--STFQALLGQDALALHDDVDAVAA- 212

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
               +FGC  V TG  +    P GL G G    S PS    + +  + FS C      S+
Sbjct: 213 ---YTFGCLHVVTGGSVP---PQGLVGFGRGPLSFPS--QTKDVYGSVFSYCLPSYKSSN 264

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
            +G +  G  G P + +T   L   H P+ Y + +  + VGG  V    SA         
Sbjct: 265 FSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGR 324

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I D+GT FT L+ P Y  + + F S  +           F+ CY           P V
Sbjct: 325 GTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG--FDTCY-----NVTISVPTV 377

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
             +  G     + +  V++ S   G+    +     D V+
Sbjct: 378 TFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVD 417


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 76/266 (28%), Positives = 122/266 (45%), Gaps = 39/266 (14%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L   N S+GQP +  +  +DTGS L W+ C  C SC       S Q+I   ++ P+ SST
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSC-------SQQIIG-PMFDPSISST 152

Query: 163 SSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
              + C + +C      +C S+ S C Y   Y+ +G  S G +  + L   + ++   +V
Sbjct: 153 YDSLSCKNIICRYAPSGECDSS-SQCVYNQTYV-EGLPSVGVIATEQLIFGSSDEGRNAV 210

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
           ++ + FGC   + G++ D     G+FGLG   TSV     NQ  + + FS C G+     
Sbjct: 211 NN-VLFGCSH-RNGNYKDRRF-TGVFGLGSGITSV----VNQ--MGSKFSYCIGNIADPD 261

Query: 281 ISFGD----KGSPGQG-ETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------- 325
            S+      +G   +G  TP  +   H  Y + +  +SVG   +  + SA          
Sbjct: 262 YSYNQLVLSEGVNMEGYSTPLDVVDGH--YQVILEGISVGETRLVIDPSAFKRTEKQRRV 319

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSL 351
           I DSGT+ T+L +  Y  +     +L
Sbjct: 320 IIDSGTAPTWLAENEYRALEREVRNL 345


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 83/304 (27%), Positives = 122/304 (40%), Gaps = 40/304 (13%)

Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           LG   Y  +++ G P    ++  DTGSDL WL C   +                 +  + 
Sbjct: 49  LGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA-CSRRPAFVASK 107

Query: 160 SSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
           S+T S VPC++  C L            P+A   C Y   Y +DG+ +TGFL  D   ++
Sbjct: 108 SATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDY-ADGSSTTGFLARDTATIS 166

Query: 212 TDEKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
                  +V   ++FGCG R Q GSF   +   G+ GLG  + S P+   +  L   +FS
Sbjct: 167 NGTSGGAAVRG-VAFGCGTRNQGGSF---SGTGGVIGLGQGQLSFPA--QSGSLFAQTFS 220

Query: 271 MCFGSDGTGRI----SFGDKGSPGQ----GETPFSLRQTHPT-YNITITQVSVGGNAVNF 321
            C      GR     SF   G P +      TP       PT Y + +  + VG   +  
Sbjct: 221 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV 280

Query: 322 EFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYC 368
             S            + DSG++ TYL   AY  +   F +     R  S++      E C
Sbjct: 281 PGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELC 340

Query: 369 YVLS 372
           Y +S
Sbjct: 341 YNVS 344


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 135/315 (42%), Gaps = 53/315 (16%)

Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIY 155
           SLG   Y   V++G PA++ ++++DTGSD+ W+   PC   SC    +          ++
Sbjct: 123 SLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD---------KLF 173

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
            P  S+T S   C S  C    Q    G     S C Y V+Y  DG+ + G    D L L
Sbjct: 174 DPAMSATYSAFSCGSAQC---AQLGDEGNGCLKSQCQYIVKY-GDGSNTAGTYGSDTLSL 229

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            +    S +V S   FGC     G F+     +GL GLG D  S+ S  A       +FS
Sbjct: 230 TS----SDAVKS-FQFGCSHRAAG-FV--GELDGLMGLGGDTESLVSQTA--ATYGKAFS 279

Query: 271 MCF---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--- 320
            C     S G G ++ G  G   S     TP  +R + PT Y + +  ++V G  +N   
Sbjct: 280 YCLPPPSSSGGGFLTLGAAGGASSSRYSHTPM-VRFSVPTFYGVFLQGITVAGTMLNVPA 338

Query: 321 --FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCYVLSPNQ 375
             F  +++ DSGT  T L   AY  +   F    K++ +   S  P    + C+  S   
Sbjct: 339 SVFSGASVVDSGTVITQLPPTAYQALRTAF----KKEMKAYPSAAPVGSLDTCFDFSGFN 394

Query: 376 TNFEYPVVNLTMKGG 390
           T    P V LT   G
Sbjct: 395 T-ITVPTVTLTFSRG 408


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 88/378 (23%), Positives = 143/378 (37%), Gaps = 54/378 (14%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A+R R+ +   R      N   P+   +G            +   V  G P  S    +D
Sbjct: 85  ANRLRFLKRTSRSSKQDANANVPVRSGSGE-----------YIIQVDFGTPKQSMYTLID 133

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSA 182
           TGSD+ W+PC      H             I+ P  SS+     C+S  C E+   C   
Sbjct: 134 TGSDVAWIPCKQCQGCHSTAP---------IFDPAKSSSYKPFACDSQPCQEISGNC-GG 183

Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
            S C ++V Y  DGT   G L  D + L +    +       SFGC      S  +  +P
Sbjct: 184 NSKCQFEVSY-GDGTQVDGTLASDAITLGSQYLPN------FSFGCAE----SLSEDTSP 232

Query: 243 NGLFGLGMDKTSVPSILA-NQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLR 299
           +         +      A    L   +FS C    S  +G +  G + +       F+  
Sbjct: 233 SPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTL 292

Query: 300 QTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTSFTYLNDPAYTQISETF 348
              P+    Y +T+  +SVG   ++   +        I DSGT+ T+L   AYT + + F
Sbjct: 293 IKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAYTALRDAF 352

Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
                  + T   D+  + CY LS   ++ + P + L +       +    ++++ E   
Sbjct: 353 RQQLSSLQPTPVEDM--DTCYDLS--SSSVDVPTITLHLDRNVDLVLPKENILITQESG- 407

Query: 409 LYLYCLGVVKSDNVNIIG 426
             L CL    +D+ +IIG
Sbjct: 408 --LACLAFSSTDSRSIIG 423


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 145/368 (39%), Gaps = 72/368 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           +  N+S+G P    +   DTGSDL WL   PCD      G            I+ P+ S+
Sbjct: 80  YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKG-----------PIFDPSNST 128

Query: 162 TSSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           T  K+PC +  C    E  + C +  + C Y   Y  D + +TG+L  D + +     Q 
Sbjct: 129 TFHKLPCTTAPCNALDESARSC-TDPTTCGYTYSY-GDHSYTTGYLASDTVTVGNASVQI 186

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
           ++V    +FGCG    G+F +  +  G+ GLG    S  S L +   I   FS C     
Sbjct: 187 RNV----AFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLE 238

Query: 274 --------GSDGTGRISFGDK----GSPGQG----ETPFSLRQTHPTYNITITQVSVGGN 317
                    S  T RI FGD      S   G     TP   ++    Y +TI  ++VG  
Sbjct: 239 NEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRK 298

Query: 318 AVNF-------------------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
            + +                   E + I DSGT+ T+L +  Y  +        K +R  
Sbjct: 299 KLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVN 358

Query: 359 STSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK 418
              +  F  C+     +   E P++ +  +GG    +      V +E     L C  ++ 
Sbjct: 359 DVKNSMFSLCF--KSGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEG---LVCFTMLP 413

Query: 419 SDNVNIIG 426
           +++V I G
Sbjct: 414 TNDVGIYG 421


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 88/342 (25%), Positives = 141/342 (41%), Gaps = 40/342 (11%)

Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
           S+G  +Y T + +G PA  +++ +DTGS L WL   C  C+   +  SG V     ++P 
Sbjct: 116 SVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWL--QCSPCLVSCHRQSGPV-----FNPK 168

Query: 159 TSSTSSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
           +SST + V C++  C       L     S+ + C YQ  Y  D + S G+L +D +   +
Sbjct: 169 SSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASY-GDSSFSVGYLSKDTVSFGS 227

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                        +GCG+   G F   A   GL GL  +K S+   LA    +  SF+ C
Sbjct: 228 TSLP------NFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFTYC 276

Query: 273 FGSDGTGRISFGDKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAV------NFEFS 324
             S  +         +PGQ   TP  S       Y I ++ ++V GN +           
Sbjct: 277 LPSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP 336

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT  T L    Y+ +S+   +  K     S   +  + C+      +    P V 
Sbjct: 337 TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI-LDTCF--KGQASRVSAPAVT 393

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           ++  GG    ++   ++V  +       CL    + +  IIG
Sbjct: 394 MSFAGGAALKLSAQNLLVDVDDS---TTCLAFAPARSAAIIG 432


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 113/263 (42%), Gaps = 33/263 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +YT++ +G P    I+ +DTGS+L WL C  C  C   +++         IY    S++ 
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDT---------IYDAARSASY 150

Query: 164 SKVPC-NSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
             V C NS LC    Q   A    GS C +   Y  DG+ S G L  D L + T      
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY-GDGSFSYGSLSTDTLIMETVVGGKP 209

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
                 +FGC +        GA+  G+ GL   K ++P  L  +      FS CF     
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSS 265

Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPT-----YNITITQVSVGGNAVNF---EFSA 325
             + TG + FG+   P +     S+  T+       Y++ +  VS+  + + F       
Sbjct: 266 HLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVV 325

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG+SF+    P ++Q+ E F
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAF 348


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 125/305 (40%), Gaps = 46/305 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G PA   ++A+DT SD+ W+PC  CV C                +SP  S++ 
Sbjct: 99  YIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSF 147

Query: 164 SKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             V C++  C+ Q   P+ G+  C + + Y S    +   L +D + LA D  ++     
Sbjct: 148 KNVSCSAPQCK-QVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA----- 199

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
             +FGC     G    G  P     LG+ +  +  +   Q +  ++FS C  S      +
Sbjct: 200 -FTFGCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFS 255

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
           G +  G    P + +    LR    +  Y + +  + VG   V+   +A           
Sbjct: 256 GSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGT 315

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPV 382
           IFDSGT +T L  P Y  +   F    K      TS   F+ CY   V  P  T F +  
Sbjct: 316 IFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQVKVPTIT-FMFKG 374

Query: 383 VNLTM 387
           VN+TM
Sbjct: 375 VNMTM 379


>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 410

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 77/274 (28%), Positives = 114/274 (41%), Gaps = 22/274 (8%)

Query: 105 HYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           H+T +V++G P   F + +DTGSDL W+ CD  C  C          +    +Y P+ + 
Sbjct: 54  HFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCT---------LPHDRLYKPHNNV 104

Query: 162 TSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
                P C++     +  C +    C Y+V Y   G+ S G LV+D + L         +
Sbjct: 105 VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGS-SIGVLVKDPVPLRL--TNGTIL 161

Query: 221 DSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
              + FGCG  Q   GS L      G+ GLG  K ++ + L+    + N    CF   G 
Sbjct: 162 APNLGFGCGYDQHNGGSQLPPLT-AGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGG 220

Query: 279 GRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
           G + FG    P  G +    LR     Y+    +V  GGN V        FDSG+S+TY 
Sbjct: 221 GFLFFGGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYF 280

Query: 337 NDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
           N   Y  +     N L  +    +  D     C+
Sbjct: 281 NSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICW 314


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 82/301 (27%), Positives = 120/301 (39%), Gaps = 40/301 (13%)

Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           LG   Y  +++ G P    ++  DTGSDL WL C   +                 +  + 
Sbjct: 48  LGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA-CSRRPAFVASK 106

Query: 160 SSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
           S+T S VPC++  C L            P+A   C Y   Y +DG+ +TGFL  D   ++
Sbjct: 107 SATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDY-ADGSSTTGFLARDTATIS 165

Query: 212 TDEKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
                  +V   ++FGCG R Q GSF   +   G+ GLG  + S P+   +  L   +FS
Sbjct: 166 NGTSGGAAVRG-VAFGCGTRNQGGSF---SGTGGVIGLGQGQLSFPA--QSGSLFAQTFS 219

Query: 271 MCFGSDGTGRI----SFGDKGSPGQ----GETPFSLRQTHPT-YNITITQVSVGGNAVNF 321
            C      GR     SF   G P +      TP       PT Y + +  + VG   +  
Sbjct: 220 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV 279

Query: 322 EFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYC 368
             S            + DSG++ TYL   AY  +   F +     R  S++      E C
Sbjct: 280 PGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELC 339

Query: 369 Y 369
           Y
Sbjct: 340 Y 340


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 61/393 (15%)

Query: 75  RGLAAQGNDKTPLTFSAGNDTYRLNSLGFL-------HYTNVSVGQPALSFIVALDTGSD 127
           R +AA+   ++    S    + R++   +        +  ++++G P     + LDTGSD
Sbjct: 74  RRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSD 133

Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN- 185
           L W  C  CVSC                ++P+ S T S +PC+  +C       S G   
Sbjct: 134 LTWTQCAPCVSCFRQ---------SLPRFNPSRSMTFSVLPCDLRICR-DLTWSSCGEQS 183

Query: 186 -----CPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQSKSVDSRISFGCGRVQTGSFLDG 239
                C Y   Y +D +++TG L  D    A+ D     +    ++FGCG    G F+  
Sbjct: 184 WGNGICVYAYAY-ADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSN 242

Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD------GTGRISFGDKGSP 289
               G+ G      S+P+ L       ++FS CF    GS+      G     + D    
Sbjct: 243 E--TGIAGFSRGALSMPAQLK-----VDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGG 295

Query: 290 GQGETP-FSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
           G G     +L + H +    Y I++  V+VG   +    S            I DSGT  
Sbjct: 296 GHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGM 355

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           T L +  Y  + + F +  K     STS L  + C+ + P     + P + L  +G    
Sbjct: 356 TMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS-QLCFSVPPGAKP-DVPALVLHFEGATLD 413

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              +  +    E  G+ L CL +   +++++IG
Sbjct: 414 LPRENYMFEIEEAGGIRLTCLAINAGEDLSVIG 446


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 92/339 (27%), Positives = 142/339 (41%), Gaps = 45/339 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ V +G+P     + LDTGSD+ W+ C  C  C    +          I+ P +S++ 
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADP---------IFEPASSASF 199

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S + CN+  C            C Y+V Y  DG+ + G  V + + L      S  VD+ 
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSY-GDGSYTVGDFVTETITLG-----SAPVDN- 252

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A    L GLG    S PS +        SFS C     S+    
Sbjct: 253 VAIGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQIN-----ATSFSYCLVDRDSESAST 304

Query: 281 ISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IF 327
           + F     P     P  LR  H    Y + +T +SVGG  V+   SA           I 
Sbjct: 305 LEFNSTLPPNAVSAPL-LRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIV 363

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           DSGT+ T L    Y  + + F    ++   T+   L F+ CY LS ++ N E P V+   
Sbjct: 364 DSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL-FDTCYDLS-SKGNVEVPTVSFHF 421

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             G    +     +V  + +G + +      S +++IIG
Sbjct: 422 PDGKELPLPAKNYLVPLDSEGTFCFAFAPTAS-SLSIIG 459


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 75/288 (26%), Positives = 114/288 (39%), Gaps = 49/288 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P     + +D+GSD+ W+ C  C  C    +          ++ P  SS+ 
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180

Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S V C S +C                C Y V Y  DG+ + G L  + L L     Q   
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
               ++ GCG   +G F+  A   GL GLG    S+   L   G     FS C    G+ 
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------A 325
           G G +  G   +  +G      R+    Y + +T + VGG  +  + S            
Sbjct: 289 GAGSLVLGRTEAVPRG------RRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 342

Query: 326 IFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
           + D+GT+ T L   AY  +   F+ ++    R  + S L  + CY LS
Sbjct: 343 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 388


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 116/304 (38%), Gaps = 49/304 (16%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL--------------NSSSGQ 148
           F +   V+VG P + F+   DTGSDL WL C+     +G+                    
Sbjct: 80  FEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEA 139

Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
           V+ FN   P  SS+ S+V C+   C        C      C ++  Y  DG  +TG L  
Sbjct: 140 VVYFN---PFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSY-RDGASATGLLAA 195

Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
           D      +     +  + I FGC     G        +G+ GLG    S+ S L  +   
Sbjct: 196 DTFTFGGNINNDTTSTASIDFGCATGTAGREFQA---DGMVGLGAGPLSLASQLGRK--- 249

Query: 266 PNSFSMCFGS----DGTGRISFGDKG---SPGQGETPFSLRQTHPT--YNITITQVSVGG 316
              FS C  +    D +  ++FG +     PG   TP     ++    Y I+I  + V G
Sbjct: 250 ---FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAG 306

Query: 317 NAVNFEFS---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-----FEYC 368
             V    S    I D+GT  T+L+  A   ++    SLA+          P      E C
Sbjct: 307 QPVPGTTSVSKVIVDTGTVLTFLDRAAL--LAPLTESLARVMDGAGLPRAPPPDETLELC 364

Query: 369 YVLS 372
           Y +S
Sbjct: 365 YDVS 368


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 88/378 (23%), Positives = 142/378 (37%), Gaps = 54/378 (14%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A+R R+ +   R      N   P+   +G            +   V  G P  S    +D
Sbjct: 85  ANRLRFLKRTSRSSKEDANANVPVRSGSGE-----------YIIQVDFGTPKQSMYTLID 133

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSA 182
           TGSD+ W+PC      H             I+ P  SS+     C+S  C E+   C   
Sbjct: 134 TGSDVAWIPCKQCQGCHSTAP---------IFDPAKSSSYKPFACDSQPCQEISGNC-GG 183

Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR-VQTGSFLDGAA 241
            S C ++V Y  DGT   G L  D + L +    +       SFGC   +   ++     
Sbjct: 184 NSKCQFEVLY-GDGTQVDGTLASDAITLGSQYLPN------FSFGCAESLSEDTYSSPGL 236

Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLR 299
                G     T  P+      L   +FS C    S  +G +  G + +       F+  
Sbjct: 237 MGLGGGSLSLLTQAPT----AELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTL 292

Query: 300 QTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTSFTYLNDPAYTQISETF 348
              P+    Y +T+  +SVG   ++   +        I DSGT+ TYL   AY  + + F
Sbjct: 293 IKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAF 352

Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
                  + T   D+  + CY LS   ++ + P + L +       +    ++++ E   
Sbjct: 353 RQQLSSLQPTPVEDM--DTCYDLS--SSSVDVPTITLHLDRNVDLVLPKENILITQESG- 407

Query: 409 LYLYCLGVVKSDNVNIIG 426
             L CL    +D+ +IIG
Sbjct: 408 --LSCLAFSSTDSRSIIG 423


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 89/356 (25%), Positives = 143/356 (40%), Gaps = 54/356 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P     + LDTGSDL W  C  CVSC                ++P+ S T 
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQ---------SLPRFNPSRSMTF 161

Query: 164 SKVPCNSTLCELQKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQ 216
           S +PC+  +C       S G        C Y   Y +D +++TG L  D    A+ D   
Sbjct: 162 SVLPCDLRICR-DLTWSSCGEQSWGNGICVYAYAY-ADHSITTGHLDSDTFSFASADHAI 219

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
             +    ++FGCG    G F+      G+ G      S+P+ L       ++FS CF   
Sbjct: 220 GGASVPDLTFGCGLFNNGIFVSNE--TGIAGFSRGALSMPAQLK-----VDNFSYCFTAI 272

Query: 274 -GSD------GTGRISFGDKGSPGQGETP-FSLRQTHPT----YNITITQVSVGGNAVNF 321
            GS+      G     + D    G G     +L + H +    Y I++  V+VG   +  
Sbjct: 273 TGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPI 332

Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
             S            I DSGT  T L +  Y  + + F +  K     STS L  + C+ 
Sbjct: 333 PESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS-QLCFS 391

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           + P     + P + L  +G       +  +    E  G+ L CL +   +++++IG
Sbjct: 392 VPPGAKP-DVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIG 446


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 73/149 (48%), Gaps = 24/149 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P+   ++ +DTGSDL WL C  C  C     +  GQV D     P  SST 
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136

Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            +VPC+S  C   +   C S   AG  C Y V Y  DG+ STG L  D L  A D     
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGDLATDKLAFAND----- 190

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
           +  + ++ GCGR   G F D AA  GL G
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLG 216


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 126/315 (40%), Gaps = 54/315 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG PA +  + LDTGSD+ WL C  C  C +  +          +++P  S T 
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDP---------VFNPAKSKTF 186

Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           + VPC S LC       +C S  S  C YQV Y  DG+ + G    + L           
Sbjct: 187 ATVPCGSRLCRRLDDSSECVSRRSKACLYQVSY-GDGSFTVGDFSTETLTF-----HGAR 240

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
           VD  ++ GCG    G F+  A   GL        S PS   N+      FS C       
Sbjct: 241 VD-HVALGCGHDNEGLFVGAAGLLGLG---RGGLSFPSQTKNR--YNGKFSYCLVDRTSS 294

Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
              S     I FG+   P      F+   T+P     Y + +  +SVGG+ V       F
Sbjct: 295 GSSSKPPSTIVFGNGAVPKTAV--FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 352

Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           +  A      I DSGTS T L   AY  + + F   A   +   +  L F+ C+ LS   
Sbjct: 353 KLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSL-FDTCFDLS-GM 410

Query: 376 TNFEYPVVNLTMKGG 390
           T  + P V     GG
Sbjct: 411 TTVKVPTVVFHFTGG 425


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 61/393 (15%)

Query: 75  RGLAAQGNDKTPLTFSAGNDTYRLNSLGFL-------HYTNVSVGQPALSFIVALDTGSD 127
           R +AA+   ++    S    + R++   +        +  ++++G P     + LDTGSD
Sbjct: 48  RRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSD 107

Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN- 185
           L W  C  CVSC                ++P+ S T S +PC+  +C       S G   
Sbjct: 108 LTWTQCAPCVSCFRQ---------SLPRFNPSRSMTFSVLPCDLRICR-DLTWSSCGEQS 157

Query: 186 -----CPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQSKSVDSRISFGCGRVQTGSFLDG 239
                C Y   Y +D +++TG L  D    A+ D     +    ++FGCG    G F+  
Sbjct: 158 WGNGICVYAYAY-ADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSN 216

Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD------GTGRISFGDKGSP 289
               G+ G      S+P+ L       ++FS CF    GS+      G     + D    
Sbjct: 217 E--TGIAGFSRGALSMPAQLKV-----DNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGG 269

Query: 290 GQGETP-FSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
           G G     +L + H +    Y I++  V+VG   +    S            I DSGT  
Sbjct: 270 GHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGM 329

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           T L +  Y  + + F +  K     STS L  + C+ + P     + P + L  +G    
Sbjct: 330 TMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS-QLCFSVPPGAKP-DVPALVLHFEGATLD 387

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              +  +    E  G+ L CL +   +++++IG
Sbjct: 388 LPRENYMFEIEEAGGIRLTCLAINAGEDLSVIG 420


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 126/324 (38%), Gaps = 59/324 (18%)

Query: 105 HYTNVSVGQPALSFIVA-LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++ +G P    +V  LDTGSDL W  C C  C               ++  + S T 
Sbjct: 94  YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTVCFDQ---------PVPVFRASVSHTF 144

Query: 164 SKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
           S+VPC+  LC      P +G      +C Y   Y+ D +++TG + ED     A D   +
Sbjct: 145 SRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYM-DHSITTGKMAEDTFTFKAPDRADT 203

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPN--GLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
            +    I FGCG +  G F     PN  G+ G G    S+PS L  +      FS CF +
Sbjct: 204 AAAVPNIRFGCGMMNYGLF----TPNQSGIAGFGTGPLSLPSQLKVR-----RFSYCFTA 254

Query: 276 DGTGRIS---FGDKGSPGQGE---------TPFSLRQ------THPTYNITITQVSVGGN 317
               R+S    G  G P   E         TPF+         + P Y +++  V+VG  
Sbjct: 255 MEESRVSPVILG--GEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGET 312

Query: 318 AVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
            + F  S              DSGT+ T+     +  + E F +          +D    
Sbjct: 313 RLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGYTDPDNL 372

Query: 367 YCYVLSPNQTNFEYPVVNLTMKGG 390
            C+ +   +     P + L ++G 
Sbjct: 373 LCFSVPAKKKAPAVPKLILHLEGA 396


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 85/296 (28%), Positives = 122/296 (41%), Gaps = 36/296 (12%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           + VGQP       LDTGSD+ WL   C+ C  G N    Q+    I+ P  SS+ + V C
Sbjct: 1   MRVGQPQQPSFFVLDTGSDVTWL--QCLPCA-GKNGCYEQITP--IFDPELSSSYNPVSC 55

Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
           +S  C+L  +     ++C Y+V Y  DG+ + G L  + L        S S+   IS GC
Sbjct: 56  DSEQCQLLDEAGCNVNSCIYKVEY-GDGSFTIGELATETLTFV----HSNSI-PNISIGC 109

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGD 285
           G    G F+      GL G  +  +S         L  +SFS C     S     + F  
Sbjct: 110 GHDNEGLFVGADGLIGLGGGAISISS--------QLKASSFSYCLVDIDSPSFSTLDFNT 161

Query: 286 KGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDSGTSF 333
                   +P       P++  + +  +SVGG  +      FE         I DSGT+ 
Sbjct: 162 DPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTI 221

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           T L    Y  + E F  L       +    PF+ CY LS +Q+N E P +   + G
Sbjct: 222 TQLPSDVYEVLREAFLGLTT-NLPPAPEISPFDTCYDLS-SQSNVEVPTIAFILPG 275


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 145/360 (40%), Gaps = 65/360 (18%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN---IYSPNTSS 161
           H   V +G P     + +DTGSDL W  C        L+SS+          +Y P  SS
Sbjct: 91  HSLTVGIGTPPQPRKLIVDTGSDLIWTQCK-------LSSSTAVAARHGSPPVYDPGESS 143

Query: 162 TSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           T + +PC+  LC+      K C S  + C Y+  Y S    + G L  +           
Sbjct: 144 TFAFLPCSDRLCQEGQFSFKNCTSK-NRCVYEDVYGS--AAAVGVLASETFTFGA----R 196

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
           ++V  R+ FGCG +  GS +      G+ GL  +  S+ + L  Q      FS C   F 
Sbjct: 197 RAVSLRLGFGCGALSAGSLIGA---TGILGLSPESLSLITQLKIQ-----RFSYCLTPFA 248

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT----HPT----YNITITQVSVGGNAVNFEFSA- 325
              T  + FG      + +T   ++ T    +P     Y + +  +S+G   +    ++ 
Sbjct: 249 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASL 308

Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
                     I DSG++  YL + A+  + E    + +      T +  +E C+VL P +
Sbjct: 309 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVE-DYELCFVL-PRR 366

Query: 376 T------NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN---VNIIG 426
           T        + P + L   GG    +  P      EP+   L CL V K+ +   V+IIG
Sbjct: 367 TAAAAMEAVQVPPLVLHFDGGAAMVL--PRDNYFQEPRA-GLMCLAVGKTTDGSGVSIIG 423


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 122/300 (40%), Gaps = 40/300 (13%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           S+G P       +DT SD+ W+ C  C +C +  +          ++ P+ S T   +PC
Sbjct: 93  SLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSP---------MFDPSYSKTYKNLPC 143

Query: 169 NSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
           +ST C+   Q  S  S+    C + V Y  DG+ S G L+ + + L +          R 
Sbjct: 144 SSTTCK-SVQGTSCSSDERKICEHTVNY-KDGSHSQGDLIVETVTLGSYNDPFVHF-PRT 200

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRIS 282
             GC R    SF       G+ GLG    S+   L++   I   FS C    SD + ++ 
Sbjct: 201 VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLK 254

Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSG 330
           FGD       G   T    +     Y +T+   SVG N + F           + I DSG
Sbjct: 255 FGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSG 314

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
           T+FT L D  Y+++      + K +R        F  CY  + ++   + PV+     G 
Sbjct: 315 TTFTVLPDDVYSKLESAVADVVKLERAEDPLK-QFSLCYKSTYDKV--DVPVITAHFSGA 371


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 98/380 (25%), Positives = 156/380 (41%), Gaps = 55/380 (14%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A RD    L    LA +G  +     ++G    +  +    +    S+G P    ++A+D
Sbjct: 75  ASRDASRLLYLDSLAVRGRARAYAPIASGRQLLQTPT----YVVRASLGTPPQQLLLAVD 130

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
           T +D  W+PC  C  C     +SS    D     P +S++   VPC S LC       CP
Sbjct: 131 TSNDASWIPCAGCAGC----PTSSAAPFD-----PASSASYRTVPCGSPLCAQAPNAACP 181

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
             G  C + + Y +D ++    L +D L +A +  ++       +FGC +  TG+    A
Sbjct: 182 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGNAVKA------YTFGCLQRATGT---AA 230

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
            P GL GLG    S   +   + +   +FS C  S    + +G +  G  G P + +T  
Sbjct: 231 PPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTP 288

Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISET 347
            L   H +  Y + +T + VG   V             + DSGT FT L  PAY  + + 
Sbjct: 289 LLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDE 348

Query: 348 FNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEP 406
                + +     S L  F+ C+    N T   +P V L   G       + +VI S+  
Sbjct: 349 V----RRRVGAPVSSLGGFDTCF----NTTAVAWPPVTLLFDGMQVTLPEENVVIHSTYG 400

Query: 407 KGLYLYCLGVVKS-DNVNII 425
               + CL +  + D VN +
Sbjct: 401 T---ISCLAMAAAPDGVNTV 417


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 33/263 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +YT++ +G P    I+ +DTGS+L WL C  C  C   +++         IY    S + 
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDT---------IYDAARSVSY 150

Query: 164 SKVPC-NSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
             V C NS LC    Q   A    GS C +   Y  DG+ S G L  D L + T      
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY-GDGSFSYGSLSTDTLIMETVVGGKP 209

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
                 +FGC +        GA+  G+ GL   K ++P  L  +      FS CF     
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSS 265

Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE--------FSA 325
             + TG + FG+   P +     S+  T+         V++ G ++N             
Sbjct: 266 HLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVV 325

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG+SF+    P ++Q+ E F
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAF 348


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/299 (26%), Positives = 123/299 (41%), Gaps = 46/299 (15%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G PA   ++A+DT SD+ W+PC  CV C                +SP  S++   V C+
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSFKNVSCS 153

Query: 170 STLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
           +  C+ Q   P+ G+  C + + Y S    +   L +D + LA D  ++       +FGC
Sbjct: 154 APQCK-QVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA------FTFGC 204

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFG 284
                G    G  P     LG+ +  +  +   Q +  ++FS C  S      +G +  G
Sbjct: 205 VNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 261

Query: 285 DKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
               P + +    LR    +  Y + +  + VG   V+   +A           IFDSGT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVVNLTM 387
            +T L  P Y  +   F    K      TS   F+ CY   V  P  T F +  VN+TM
Sbjct: 322 VYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVKVPTIT-FMFKGVNMTM 379


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 147/378 (38%), Gaps = 57/378 (15%)

Query: 67  DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
           +R  +L    LA      TP+  ++GN  Y ++         +S G P     V +DTGS
Sbjct: 53  ERRAQLSKHILAEGRLFSTPV--ASGNGEYLID---------ISFGSPPQKASVIVDTGS 101

Query: 127 DLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGS 184
           DL W  C  C +C    N+++  + D     P  SST   V C S  C  L  Q  S  +
Sbjct: 102 DLIWTQCLPCETC----NAAASVIFD-----PVKSSTYDTVSCASNFCSSLPFQ--SCTT 150

Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
           +C Y   Y  DG+ ++G L                    ++FGCG    GSF   A   G
Sbjct: 151 SCKYDYMY-GDGSSTSGALS------TETVTVGTGTIPNVAFGCGHTNLGSFAGAA---G 200

Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISFGDKGSP-GQGETPFSLRQ 300
           + GLG    S+  I     +    FS C    GS  T  +  GD  +  G   T      
Sbjct: 201 IVGLGQGPLSL--ISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVAYTALLTNT 258

Query: 301 THPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF 348
            +PT Y   +T +SV G AV +               I DSGT+ TYL   A+  +    
Sbjct: 259 ANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNALVAAL 318

Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
            +      E   S    +YC+  +    N  YP +    KG   + +    V V+ +  G
Sbjct: 319 KAEVPFP-EADGSLYGLDYCFS-TAGVANPTYPTMTFHFKGAD-YELPPENVFVALDTGG 375

Query: 409 LYLYCLGVVKSDNVNIIG 426
               CL +  S   +I+G
Sbjct: 376 --SICLAMAASTGFSIMG 391


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 83/272 (30%), Positives = 114/272 (41%), Gaps = 61/272 (22%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           N+SVG P L+F V  DTGSDL W  C  C  C                + P +SST SK+
Sbjct: 89  NISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139

Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
           PC S+ C+      + C + G  C Y  +Y S  T   G+L  + L +      S     
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN----SFSMCFGSD-- 276
            ++FGC   + G    G + +G+ GLG    S         LIP      FS C  S   
Sbjct: 191 -VAFGC-STENGV---GNSTSGIAGLGRGALS---------LIPQLGVGRFSYCLRSGSA 236

Query: 277 -GTGRISFGDKGSPGQG---ETPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS- 324
            G   I FG   +   G    TPF +    HP+ Y + +T ++VG   +      F F+ 
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296

Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNS 350
                  I DSGT+ TYL    Y  + + F S
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 328


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 100/380 (26%), Positives = 157/380 (41%), Gaps = 55/380 (14%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A RD    L    LA +G  +     ++G     L +L ++     S+G P    ++A+D
Sbjct: 75  ASRDASRLLYLDSLAVRGRARAYAPIASGRQL--LQTLTYV--VRASLGTPPQQLLLAVD 130

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
           T +D  W+PC  C  C     +SS    D     P  S++   VPC S LC       CP
Sbjct: 131 TSNDASWIPCAGCAGC----PTSSAAPFD-----PAASASYRTVPCGSPLCAQAPNAACP 181

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
             G  C + + Y +D ++    L +D L +A +  ++       +FGC +  TG+    A
Sbjct: 182 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGNAVKA------YTFGCLQRATGT---AA 230

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
            P GL GLG    S   +   + +   +FS C  S    + +G +  G  G P + +T  
Sbjct: 231 PPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTP 288

Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISET 347
            L   H +  Y + +T V VG   V             + DSGT FT L  PAY  + + 
Sbjct: 289 LLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDE 348

Query: 348 FNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEP 406
                + +     S L  F+ C+    N T   +P + L   G       + +VI S+  
Sbjct: 349 V----RRRVGAPVSSLGGFDTCF----NTTAVAWPPMTLLFDGMQVTLPEENVVIHSTYG 400

Query: 407 KGLYLYCLGVVKS-DNVNII 425
               + CL +  + D VN +
Sbjct: 401 T---ISCLAMAAAPDGVNTV 417


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 77/277 (27%), Positives = 111/277 (40%), Gaps = 47/277 (16%)

Query: 102 GFLHY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
           G L Y  +++VG P       LDTGSDL W  C  C SC+   +          I+SP  
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDP---------IFSPGA 150

Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLATDEK 215
           SS+   + C   LC   L   C      C Y+  Y  DGT + G    +      ++   
Sbjct: 151 SSSYEPMRCAGELCNDILHHSCQRP-DTCTYRYSY-GDGTTTRGVYATERFTFSSSSSGG 208

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
           ++  + + + FGCG +  GS  +G   +G+ G G    S+ S LA +      FS C   
Sbjct: 209 ETTKLSAPLGFGCGTMNKGSLNNG---SGIVGFGRAPLSLVSQLAIR-----RFSYCLTP 260

Query: 276 DGTGRIS---FG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
             +GR S   FG       D  +     T     + +PT Y +  T V+VG   +    S
Sbjct: 261 YASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPIS 320

Query: 325 -----------AIFDSGTSFTYLNDPAYTQISETFNS 350
                      AI DSGT+ T    P   ++   F S
Sbjct: 321 AFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRS 357


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/341 (26%), Positives = 134/341 (39%), Gaps = 58/341 (17%)

Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
           + LDTGSD+ W+ C  C  C       SG V D     P  SS+   V C + LC     
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYE----QSGPVFD-----PRRSSSYGAVGCGAALCRRLDS 51

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
             C      C YQV Y  DG+++ G  V + L  A   +      +R++ GCG    G F
Sbjct: 52  GGCDLRRGACMYQVAY-GDGSVTAGDFVTETLTFAGGARV-----ARVALGCGHDNEGLF 105

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------------GSDGTGRISFG 284
           +  A   GL        S P+ ++ +     SFS C             GS  +  +SFG
Sbjct: 106 VAAAGLLGLG---RGGLSFPTQISRR--YGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFG 160

Query: 285 DKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV-------------NFEFSAIF 327
             GS G     F+    +P     Y + +  +SVGG  V                   I 
Sbjct: 161 -AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIV 219

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLT 386
           DSGTS T L   +Y+ + + F + A      S      F+ CY L   +   + P V++ 
Sbjct: 220 DSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRV-VKVPTVSMH 278

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
             GG    +     ++  + +G   +C     +D  V+IIG
Sbjct: 279 FAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 317


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/299 (26%), Positives = 123/299 (41%), Gaps = 46/299 (15%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G PA   ++A+DT SD+ W+PC  CV C                +SP  S++   V C+
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSFKNVSCS 169

Query: 170 STLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
           +  C+ Q   P+ G+  C + + Y S    +   L +D + LA D  ++       +FGC
Sbjct: 170 APQCK-QVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA------FTFGC 220

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFG 284
                G    G  P     LG+ +  +  +   Q +  ++FS C  S      +G +  G
Sbjct: 221 VNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 277

Query: 285 DKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
               P + +    LR    +  Y + +  + VG   V+   +A           IFDSGT
Sbjct: 278 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 337

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVVNLTM 387
            +T L  P Y  +   F    K      TS   F+ CY   V  P  T F +  VN+TM
Sbjct: 338 VYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVKVPTIT-FMFKGVNMTM 395


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/321 (27%), Positives = 130/321 (40%), Gaps = 61/321 (19%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID---FNIYSPNTSSTSSKV 166
           S+G P     + LDTGS L W PC   +  +   + +   +D     IY+ N SST   +
Sbjct: 79  SLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSL 138

Query: 167 PCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
           PC S  C         C S    CPY       G+ +TG LV DVL L+   K ++  D 
Sbjct: 139 PCRSPKCNWVFGSDLNC-STTKRCPYYGLEYGLGS-TTGQLVSDVLGLS---KLNRIPD- 192

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---DGT- 278
              FGC      S +    P G+ G G    S+P+ L   GL    FS C  S   D T 
Sbjct: 193 -FLFGC------SLVSNRQPEGIAGFGRGLASIPAQL---GL--TKFSYCLVSHRFDDTP 240

Query: 279 ---------GRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSVGGNAVNF---- 321
                    GR    D  + G    PF+    L      Y I+++++ VGG  V      
Sbjct: 241 QSGDLVLHRGR-RHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRY 299

Query: 322 -------EFSAIFDSGTSFTYLN----DPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
                  +   I DSG++FT++     DP   ++ +      + K    +S L    CY 
Sbjct: 300 LVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGL--GPCYN 357

Query: 371 LSPNQTNFEYPVVNLTMKGGG 391
           ++  Q+  + P +  + KGG 
Sbjct: 358 IT-GQSEVDVPKLTFSFKGGA 377


>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
 gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
          Length = 408

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/284 (26%), Positives = 116/284 (40%), Gaps = 36/284 (12%)

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN-IYSPNTSSTSS 164
           Y  +++G+PA  + + +DTGS   WL C         ++  G     N +  P    T  
Sbjct: 40  YVTMNIGEPAEPYFLDIDTGSSFTWLEC---------HAKDGPCKTCNKVPHPLYRLTRK 90

Query: 165 K-VPCNSTLCEL-------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           K VPC   LC+         K+C     N C Y+V+Y  DG  S G L+ D   L T   
Sbjct: 91  KLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKY-QDGLSSLGVLLLDKFSLPTGGA 149

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLI-PNSFS 270
           ++      I+FGCG  Q       A      +G+ GLG     + S L + G +  N   
Sbjct: 150 RN------IAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVIG 203

Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHP----TYNITITQVSVGGNAVNFE-FSA 325
            C  S G G +  G++  P    T   +  T P     Y+     + +  N +  +   A
Sbjct: 204 HCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPLKA 263

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           IFDSG+++TYL +  + Q+     +   +      SD     C+
Sbjct: 264 IFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDPALPLCW 307


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 97/345 (28%), Positives = 149/345 (43%), Gaps = 43/345 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +Y  + VG PA  F + +DTGS L WL C  CV   H        V    I++P+ S T 
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCH--------VQVDPIFTPSVSKTY 158

Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             + C+S+ C   K        C +A   C Y+  Y  D + S G+L +DVL L      
Sbjct: 159 KALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLT----P 213

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ------GLIPNSFS 270
           S +  S   +GCG+   G F   A   G+ GL  DK S+   L+N+        +P+SFS
Sbjct: 214 SAAPSSGFVYGCGQDNQGLFGRSA---GIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFS 270

Query: 271 MCFGSDGTGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGG-----NAVNFE 322
               S  +G +S G           TP       P+ Y + +T ++V G     +A ++ 
Sbjct: 271 AQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYN 330

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
              I DSGT  T L    Y  + ++F  +  +K   +      + C+  S  + +   P 
Sbjct: 331 VPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMS-TVPE 389

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
           + +  +GG    +     +V  E KG    CL +  S N ++IIG
Sbjct: 390 IRIIFRGGAGLELKVHNSLVEIE-KG--TTCLAIAASSNPISIIG 431


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/335 (25%), Positives = 126/335 (37%), Gaps = 38/335 (11%)

Query: 117 SFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ 176
           ++ +ALD G  L W+   C+ C H L   S       ++ P  S T S +P ++T+    
Sbjct: 110 NYQLALDMGGGLSWM--QCLPCRHCLLQMS------PVFDPTKSPTFSNIPAHNTVWCRP 161

Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
              P A   C + + Y  D T ++G+L  D             + S I FGC   QT  F
Sbjct: 162 PYQPLANGACGFDIAY-RDNTHASGYLARDTFSFPAGNDDFVPL-SAIVFGCAH-QTEHF 218

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIP---NSFSMCFGSDGTGRISFGDKGSPGQGE 293
            +  A  G+ GLGM     P     + ++P     FS C    G    S+   GS     
Sbjct: 219 KNQRAVAGILGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSH 278

Query: 294 TPFSL-RQTHPT---------YNITITQVSVGGNAVNFEFSAIF------------DSGT 331
            P ++ RQ+ P          Y + +  VSVG N ++    A+F            D GT
Sbjct: 279 PPPNVHRQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGT 338

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
             T     AY  I         ++R      +    C V  P   +   P + L  + G 
Sbjct: 339 RMTAFIHSAYVHIDHAVRQ-HLQRRGAHIVVVRGNTC-VQQPAPHHDVLPSMTLHFENGA 396

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              V    V +     G +  C G V S ++ +IG
Sbjct: 397 WLRVMPEHVFMPFVVGGHHYQCFGFVSSTDLTVIG 431


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/316 (28%), Positives = 133/316 (42%), Gaps = 54/316 (17%)

Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIY 155
           SLG   Y   VS+G PA++ ++++DTGSD+ W+   PC   SC    +          ++
Sbjct: 124 SLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD---------KLF 174

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
            P  S+T S   C+S  C    Q    G     S+C Y V+Y+ D + +TG    D L L
Sbjct: 175 DPAKSATYSAFSCSSAQC---AQLGGEGNGCLNSHCQYIVKYV-DHSNTTGTYGSDTLGL 230

Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
            T +           FGC     G F+     +GL GLG D  S+ S  A       +FS
Sbjct: 231 TTSDAVKN-----FQFGCSHRANG-FV--GQLDGLMGLGGDTESLVSQTA--ATYGKAFS 280

Query: 271 MCFGSDGTGRISFGDKGSPGQG-------ETPFSLRQTHPT-YNITITQVSVGGNAVN-- 320
            C     +    F   G+   G        TP  +R   PT Y + +  ++V G  +N  
Sbjct: 281 YCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPL-VRFNVPTFYGVFLQAITVAGTKLNVP 339

Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCYVLSPN 374
              F  +++ DSGT  T L   AY  +   F    K++ +   S  P    + C+  S  
Sbjct: 340 ASVFSGASVVDSGTVITQLPPTAYQALRTAF----KKEMKAYPSAAPVGILDTCFDFSGI 395

Query: 375 QTNFEYPVVNLTMKGG 390
           +T    PVV LT   G
Sbjct: 396 KT-VRVPVVTLTFSRG 410


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 129/313 (41%), Gaps = 39/313 (12%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           R+ S    +   +++G P +     +DTGSDL W  C  C  C    +          ++
Sbjct: 74  RVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSP---------MF 124

Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
            P  S T S +PC S  C       S    C Y   Y +D +++ G L  + +  ++ + 
Sbjct: 125 EPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSY-ADSSVTKGVLAREAITFSSTDG 183

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNS--FSMC 272
               V   I FGCG   +G+F +           +     P  L +Q G +  S  FS C
Sbjct: 184 DPVVV-GDIIFGCGHSNSGTFNENDM------GIIGMGGGPLSLVSQIGTLYGSKRFSQC 236

Query: 273 ---FGSDG--TGRISFGDKGS-PGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
              F +D   +G I+FG++    G+G   TP +  +   +Y +T+  +SVG   V F  S
Sbjct: 237 LVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSS 296

Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                   + DSGT  TY+    Y ++ E     +         DL  + CY    ++TN
Sbjct: 297 ETLSKGNIMIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYR---SETN 353

Query: 378 FEYPVVNLTMKGG 390
            E P++    +G 
Sbjct: 354 LEGPILTAHFEGA 366


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 81/333 (24%), Positives = 131/333 (39%), Gaps = 60/333 (18%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++   + VG P       +DTGS++ W    C+ CVH    ++       I+ P+ SST 
Sbjct: 379 VYLMKLQVGTPPFEIEAVIDTGSEITW--TQCLPCVHCYKQNAP------IFDPSKSSTF 430

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
            +  C+               +CPY+V Y  D T + G L  D + + +   +   +   
Sbjct: 431 KEKRCHD-------------HSCPYEVDYF-DKTYTKGTLATDTVTIHSTSGEPFVMAET 476

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           I  GCGR    S+   +   G  GL     S+  I    G  P   S CF  +GT +I+F
Sbjct: 477 I-IGCGR--NNSWFRPSF-EGFVGLNWGPLSL--ITQMGGEYPGLMSYCFAGNGTSKINF 530

Query: 284 GDKGSPGQG----ETPFSLRQTHPTYNITITQVSVGGNAVN--------FEFSAIFDSGT 331
           G     G G     T F        Y + +  VSVG   +          E + + DSGT
Sbjct: 531 GTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGT 590

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE-------YCYVLSPNQTNFEYPVVN 384
           + TY          E++ +L ++  E     +P          CY    + T   +PV+ 
Sbjct: 591 TLTYF--------PESYCNLVRQAVEHVVPAVPAADPTGNDLLCYY---SNTTEIFPVIT 639

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           +   GG    ++   + + S   G  L+CL ++
Sbjct: 640 MHFSGGADLVLDKYNMFMESYSGG--LFCLAII 670



 Score = 45.1 bits (105), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 71/325 (21%), Positives = 123/325 (37%), Gaps = 62/325 (19%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           + +   + +G P       LDTGS+L W    C+ C+H  +  +       I+ P+ SST
Sbjct: 63  YEYLMKLQIGTPPFEVEAVLDTGSELIW--TQCLPCLHCYDQKAP------IFDPSKSST 114

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             +  CN           +   +CPY++ Y  D + + G L  + + + +       +  
Sbjct: 115 FKETRCN-----------TPDHSCPYKLVY-DDKSYTQGTLATETVTIHSTSGVPFVMPE 162

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
            I  GC R  +GS   G  P+    +G+ + S+  I    G  P          G G +S
Sbjct: 163 TI-IGCSRNNSGS---GFRPSSSGIVGLSRGSLSLISQMGGAYP----------GDGVVS 208

Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG---NAVNFEFSA-----IFDSGTSFT 334
                      T F+       Y + +  VSVG      V   F A     + DSGT  T
Sbjct: 209 ----------TTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLT 258

Query: 335 YLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
           Y        + +    +    R  + S +D+    CY    + T   +PV+ +   GG  
Sbjct: 259 YFPVSYCNLVRKAVERVVTADRVVDPSRNDM---LCYY---SNTIEIFPVITVHFSGGAD 312

Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVV 417
             ++   + +     G  ++CL ++
Sbjct: 313 LVLDKYNMYMELNRGG--VFCLAII 335


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 127/310 (40%), Gaps = 50/310 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ WL C  C  C    +          I+ P  S T 
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           + +PC+S  C   ++  SAG N     C YQV Y  DG+ + G    + L    +  +  
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
                ++ GCG    G F+  A    L GLG  K S P    ++      FS C      
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297

Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
           S     + FG+         TP  S  +    Y + +  +SVGG  V    +++F     
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357

Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                  DSGTS T L  PAY  + + F   AK  +      L F+ C+ LS N    + 
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSL-FDTCFDLS-NMNEVKV 415

Query: 381 PVVNLTMKGG 390
           P V L  +G 
Sbjct: 416 PTVVLHFRGA 425


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 96/375 (25%), Positives = 151/375 (40%), Gaps = 81/375 (21%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           GFL   N+S+G P ++ +V +DTGS L W+ C  C++C     S          + P  S
Sbjct: 103 GFL--VNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTS---------WFDPLKS 151

Query: 161 STSSKVPC--------NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
            +   + C        N   C    Q         Y++RYL  G  S G L ++ L   T
Sbjct: 152 VSFKTLGCGFPGYNYINGYKCNRFNQ-------AEYKLRYLG-GDSSQGILAKESLLFET 203

Query: 213 -DEKQ-----------SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI-L 259
            DE +           SK   S I+FGCG +   +  D A  NG+FGLG    + P I +
Sbjct: 204 LDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAY-NGVFGLG----AYPHITM 258

Query: 260 ANQGLIPNSFSMCFGSDGT-----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVS 313
           A Q  + N FS C G           +  G +GS  +G+ TP  +   H  Y +T+  +S
Sbjct: 259 ATQ--LGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGH--YYVTLQSIS 313

Query: 314 VGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           VG   +  + +A           + DSG ++T L +  +  + +    L K   E   + 
Sbjct: 314 VGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQ 373

Query: 363 LPFE-YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP------------IVIVSSEPKGL 409
             FE  C+    ++    +P V     GG    +               + I+ S  + L
Sbjct: 374 RKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELL 433

Query: 410 YLYCLGVVKSDNVNI 424
            L  +G++   N N+
Sbjct: 434 NLSVIGILAQQNYNV 448


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 127/310 (40%), Gaps = 50/310 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ WL C  C  C    +          I+ P  S T 
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           + +PC+S  C   ++  SAG N     C YQV Y  DG+ + G    + L    +  +  
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
                ++ GCG    G F+  A    L GLG  K S P    ++      FS C      
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297

Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
           S     + FG+         TP  S  +    Y + +  +SVGG  V    +++F     
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQI 357

Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                  DSGTS T L  PAY  + + F   AK  +      L F+ C+ LS N    + 
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSL-FDTCFDLS-NMNEVKV 415

Query: 381 PVVNLTMKGG 390
           P V L  +G 
Sbjct: 416 PTVVLHFRGA 425


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 119/303 (39%), Gaps = 40/303 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG P  +  V +D+GSD+ W+ C+ C  C H  +          +++P  SS+ 
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDP---------VFNPADSSSY 184

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + V C ST+C            C Y+V Y  DG+ + G L  + L         +++   
Sbjct: 185 AGVSCASTVCSHVDNAGCHEGRCRYEVSY-GDGSYTKGTLALETLTFG------RTLIRN 237

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG---TGR 280
           ++ GCG    G F+  A   GL GLG    S    L  Q     +FS C  S G   +G 
Sbjct: 238 VAIGCGHHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQA--GGTFSYCLVSRGIQSSGL 292

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTY--------NITITQVSVGGNAVNF----EFSAIF 327
           + FG +  P G    P        ++         +   +V +  +        +   + 
Sbjct: 293 LQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVM 352

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D+GT+ T L   AY    + F +        S   + F+ CY L     +   P V+   
Sbjct: 353 DTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSI-FDTCYDLF-GFVSVRVPTVSFYF 410

Query: 388 KGG 390
            GG
Sbjct: 411 SGG 413


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 117/295 (39%), Gaps = 47/295 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P L +   +DTGSDL W  C  C+ C       + Q   +  +    S+T 
Sbjct: 89  YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------AAQPTPY--FDVKRSATY 139

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             +PC S+ C            C YQ  Y  D   + G L  +          +K   + 
Sbjct: 140 RALPCRSSRCAALSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTFGA-ASSTKVRAAN 197

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           ISFGCG +  G   +    +G+ G G    S+ S L      P+ FS C   + S    R
Sbjct: 198 ISFGCGSLNAGELANS---SGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSPTPSR 249

Query: 281 ISFG----------DKGSPGQGETPFSLRQTHP-TYNITITQVSVGGN---------AVN 320
           + FG            GSP Q  TPF +    P  Y +++  +S+G           A+N
Sbjct: 250 LYFGVFANLNSTNTSSGSPVQ-STPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAIN 308

Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
            + +   I DSGTS T+L   AY  +     S         T D+  + C+   P
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDT-DIGLDTCFQWPP 362


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 77/311 (24%), Positives = 120/311 (38%), Gaps = 55/311 (17%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           F +   + V  P +  +   DTGS L WL C   +                 ++P  SS+
Sbjct: 74  FEYLMALDVSTPPVRMLALADTGSSLVWLKCKLPAA----------------HTP-ASSS 116

Query: 163 SSKVPCNSTLCEL---QKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            +++PC++  C+       C + GS    C Y+  + +DG+ + G +  D    +T    
Sbjct: 117 YARLPCDAFACKALGDAASCRATGSGNNICVYRYAF-ADGSCTAGPVTVDAFTFST---- 171

Query: 217 SKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
                 R+ FGC  R +  S  D    +GL GL     S+ S L+ +    + FS C   
Sbjct: 172 ------RLDFGCATRTEGLSVPD----DGLVGLANGPISLVSQLSAKTPFAHKFSYCLVP 221

Query: 274 ---GSDGTGRISFGDKG----SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
                  +  ++FG       SPG   TP    +    Y I +  + V G  V  + +  
Sbjct: 222 YSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT 281

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL---SPNQTNFEY 380
             I DSGT  TYL       +     +  K  R  S   L +  CY +   +P       
Sbjct: 282 KLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETL-YAVCYDVRRRAPEDVGKSI 340

Query: 381 PVVNLTMKGGG 391
           P V L + GGG
Sbjct: 341 PDVTLVLGGGG 351


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 86/322 (26%), Positives = 132/322 (40%), Gaps = 70/322 (21%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSSGQVIDFNIYSPNT 159
           +   +++G P  +  V LDTGSDL W+PC     DC+ C    N+    +   +++SP  
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNN---DLKSPSVFSPLH 139

Query: 160 SSTSSKVPCNSTLC-ELQKQ------CPSAGSN------------CPYQVRYLSDGTMST 200
           SSTS +  C S+ C E+         C  AG +            CP       +G + +
Sbjct: 140 SSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLIS 199

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L  D+L   T +        R SFGC    T ++ +   P G+ G G    S+PS L 
Sbjct: 200 GILTRDILKARTRDV------PRFSFGC---VTSTYRE---PIGIAGFGRGLLSLPSQL- 246

Query: 261 NQGLIPNSFSMCF-------GSDGTGRISFGDKG-----SPGQGETPFSLRQTHP-TYNI 307
             G +   FS CF         + +  +  G        +     TP      +P +Y I
Sbjct: 247 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYI 304

Query: 308 TITQVSVGGNAVNFEFS-------------AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
            +  +++G N    +                + DSGT++T+L +P Y+Q+  T  S    
Sbjct: 305 GLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITY 364

Query: 355 KRETST-SDLPFEYCY-VLSPN 374
            R T T S   F+ CY V  PN
Sbjct: 365 PRATETESRTGFDLCYKVPCPN 386


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/291 (26%), Positives = 113/291 (38%), Gaps = 46/291 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P     + +D+GSD+ W+ C  C  C    +          ++ P  SS+ 
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180

Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S V C S +C                C Y V Y  DG+ + G L  + L L     Q   
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
               ++ GCG   +G F+  A   GL GLG    S+   L   G     FS C    G+ 
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288

Query: 277 GTGRISFGDKGSPGQGETPFSL---RQTHPTYNITITQVSVGGNAVNFEFS--------- 324
           G G +  G   +   G     L    Q    Y + +T + VGG  +  + S         
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA 348

Query: 325 --AIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
              + D+GT+ T L   AY  +   F+ ++    R  + S L  + CY LS
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 397


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/356 (23%), Positives = 137/356 (38%), Gaps = 61/356 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V +G P     + +D+GSD+ W+ C  C+ C    +          ++ P +S+T 
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADP---------LFDPASSATF 175

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           S V C S +C   +   C  +G  C Y+V Y  DG+ + G L  + L L     +     
Sbjct: 176 SAVSCGSAICRTLRTSGCGDSG-GCEYEVSY-GDGSYTKGTLALETLTLGGTAVEG---- 229

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS------ 275
             ++ GCG    G F+  A   GL GLG    S+   L        +FS C  S      
Sbjct: 230 --VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGSGS 282

Query: 276 ---DGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFE------- 322
              D  G +  G   +  +G    P       P+ Y + ++ + VG   +  +       
Sbjct: 283 GAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLT 342

Query: 323 ----FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
                  + D+GT+ T L   AY  + + F  ++    R    S L  + CY LS   T+
Sbjct: 343 EDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLL--DTCYDLS-GYTS 399

Query: 378 FEYPVVNLTMKGGGPFF---------VNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
              P V+    G              V+  I  ++  P    L  LG ++ + + I
Sbjct: 400 VRVPTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQI 455


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 147/360 (40%), Gaps = 57/360 (15%)

Query: 66  RDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
           R +Y   R  +G+     D +  T   G+    ++SL ++    V +G P++S ++ +DT
Sbjct: 90  RSKYIMSRVSKGMMGDDADVSIPTHLGGS----VDSLEYV--VTVGLGTPSVSQVLLIDT 143

Query: 125 GSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--- 178
           GSDL W+   PC+  +C    +          ++ P+ SST + +PCN+  C        
Sbjct: 144 GSDLSWVQCQPCNSTTCYPQKDP---------LFDPSKSSTYAPIPCNTDACRDLTDDGY 194

Query: 179 ---CPS--AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
              C S    + C + + Y  DG+ + G    + L LA              FGCG  Q 
Sbjct: 195 GGGCASGDGAAQCGFAITY-GDGSQTRGVYSNETLALAPGVAVKD-----FRFGCGHDQD 248

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----------DGTGRISF 283
           G+       +GL GLG    S+  ++    +   +FS C  +           G G  S 
Sbjct: 249 GA---NDKYDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSG 303

Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLND 338
           G   + G   TP  +R+    Y + +T ++VGG  ++   SA     I DSGT  T L  
Sbjct: 304 GVVNTSGFVFTPM-IREEETFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQH 362

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
            AY  +   F             +L  + CY  S   +N   P V LT  GG    ++ P
Sbjct: 363 TAYNALQAAFRKAMAAYPLVRNGEL--DTCYDFS-GYSNVTLPKVALTFSGGATIDLDVP 419


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 84/330 (25%), Positives = 134/330 (40%), Gaps = 39/330 (11%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           +G PA  +++ +DTGS L WL C    C+   +  SG V     ++P +SST + V C++
Sbjct: 3   LGTPATQYVMVVDTGSSLTWLQCS--PCLVSCHRQSGPV-----FNPKSSSTYASVGCSA 55

Query: 171 TLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
             C       L     S+ + C YQ  Y  D + S G+L +D +   +            
Sbjct: 56  QQCSDLPSATLNPSACSSSNVCIYQASY-GDSSFSVGYLSKDTVSFGSTSLP------NF 108

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
            +GCG+   G F   A   GL GL  +K S+   LA    +  SF+ C  S  +      
Sbjct: 109 YYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSL 163

Query: 285 DKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAV------NFEFSAIFDSGTSFTYL 336
              +PGQ   TP  S       Y I ++ ++V GN +            I DSGT  T L
Sbjct: 164 GSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRL 223

Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
               Y+ +S+   +  K     S   +  + C+      +    P V ++  GG    ++
Sbjct: 224 PTSVYSALSKAVAAAMKGTSRASAYSI-LDTCF--KGQASRVSAPAVTMSFAGGAALKLS 280

Query: 397 DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              ++V  +       CL    + +  IIG
Sbjct: 281 AQNLLVDVDDS---TTCLAFAPARSAAIIG 307


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 95/383 (24%), Positives = 142/383 (37%), Gaps = 49/383 (12%)

Query: 44  GILAVDDLPKKGSFAYYSALAHRD--RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSL 101
           G+  +   P+  +  +      RD  R+ R     LA        LT  A       N  
Sbjct: 26  GLTRIHADPEVTASEFVRGALRRDMHRHARFAREQLAPSSAAAAGLTVGAPTQKDLRN-- 83

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNT 159
           G  +   +S+G P LS+    DTGSDL W    C  C   +  +  Q    +  +Y+P++
Sbjct: 84  GGEYIMTLSIGTPPLSYRAIADTGSDLIW--TQCAPCGDTVTDTDNQCFKQSGCLYNPSS 141

Query: 160 SSTSSKVPCNSTL---CELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           S+T   +PCNS L     +    P  G  C Y   Y +  T   G    +     +    
Sbjct: 142 STTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTGWT--AGVQSVETFTFGSSSTP 199

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
                  I+FGC    +  + +G+A  GL GLG    S+ S L        +FS C    
Sbjct: 200 PAVRVPNIAFGCSNASSNDW-NGSA--GLVGLGRGSMSLVSQLGA-----GAFSYCLTPF 251

Query: 274 -GSDGTGRISFGD------KGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFE 322
             ++ T  +  G       KG+     TPF    S       Y + +T +SVG  A+   
Sbjct: 252 QDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIP 311

Query: 323 FSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS---TSDLPFEYC 368
             A           I DSGT+ T L D AY Q+     SL   +   +         + C
Sbjct: 312 PDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLC 371

Query: 369 YVLSPNQTNFEYPVVNLTMKGGG 391
           + L  +      P + L  +GG 
Sbjct: 372 FALKASTPPPAMPSMTLHFEGGA 394


>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 260

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 17/167 (10%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           T + +G P   F + +DTGS++ ++PC    C  G     G+  D       T S+S+  
Sbjct: 52  TKLYIGTPPQEFTLVVDTGSNMTFVPC----C--GSEEYCGKHEDPAF---QTESSSTYQ 102

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
           P N   C     C    S C Y++ Y  DG+ S G L ED++       +S+    R+ F
Sbjct: 103 PVN---CHPSCDCDYLRSQCSYKMHY-GDGSYSRGVLAEDIISFG---NESEFAPQRLVF 155

Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           GC     GS     A +G+ GLG  ++++   L ++G+I +SFS+C+
Sbjct: 156 GCELDAIGSLYSLRA-DGIIGLGRGRSTIVDQLVDKGVISDSFSLCY 201


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/275 (28%), Positives = 120/275 (43%), Gaps = 36/275 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  N+ +G P +  I  +DTGSDL W  C  C  C         QV+   ++ P  SST 
Sbjct: 92  YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVP--LFDPKNSSTY 142

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
               C ++ C    + +  S    C ++  Y +DG+ + G L  +   L  D    K V 
Sbjct: 143 RDSSCGTSFCLALGKDRSCSKEKKCTFRYSY-ADGSFTGGNLASET--LTVDSTAGKPVS 199

Query: 222 -SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
               +FGCG    G F    + +G+ GLG  + S+ S L  +  I   FS C       S
Sbjct: 200 FPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQL--KSTINGLFSYCLLPVSTDS 255

Query: 276 DGTGRISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
             + RI+FG  G   G G     LR  +  Y+   T+V  G        + I DSGT++T
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLRLPYKGYS-KKTEVEEG--------NIIVDSGTTYT 306

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           +L    Y+++ ++  +  K KR    + + F  CY
Sbjct: 307 FLPQEFYSKLEKSVANSIKGKRVRDPNGI-FSLCY 340


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 120/295 (40%), Gaps = 47/295 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P L +   +DTGSDL W  C  C+ C       + Q   +  +    S+T 
Sbjct: 89  YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------ADQPTPY--FDVKKSATY 139

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             +PC S+ C            C YQ  Y  D   + G L  +          +K   + 
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQ-YYYGDTASTAGVLANETFTFGA-ANSTKVRATN 197

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           I+FGCG +  G   D A  +G+ G G    S+ S L      P+ FS C   + S    R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249

Query: 281 ISFG----------DKGSPGQGETPFSLRQTHP-TYNITITQVSVGGN---------AVN 320
           + FG            GSP Q  TPF +    P  Y +++  +S+G           A+N
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQ-STPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIN 308

Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
            + +   I DSGTS T+L   AY  +     S A      + +D+  + C+   P
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPLPAMNDTDIGLDTCFQWPP 362


>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
          Length = 178

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 65/127 (51%), Gaps = 9/127 (7%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           L+YT++ +G PA+ + V LDTGS  FW+    C  C H     S  +     Y P +S +
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
           S +V C+ T+C  +  C +    CPY   Y +DG ++ G L  D+LH        Q++  
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171

Query: 221 DSRISFG 227
            + ++FG
Sbjct: 172 STSVTFG 178


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 122/307 (39%), Gaps = 44/307 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ W+ C  C  C     + S  V D     P  S + 
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCY----AQSDPVFD-----PRKSRSF 176

Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + + C S LC       C +    C YQV Y  DG+ + G    + L         ++  
Sbjct: 177 ASIACRSPLCHRLDSPGCNTQKQTCMYQVSY-GDGSFTFGDFSTETLTF------RRTRV 229

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
           +R++ GCG    G F+  A      GLG  + S PS    +    + FS C      S  
Sbjct: 230 ARVALGCGHDNEGLFVGAAGLL---GLGRGRLSFPSQTGRR--FNHKFSYCLVDRSASSK 284

Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFS----- 324
              + FGD         TP        T Y + +  +SVGG  V       F+       
Sbjct: 285 PSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNG 344

Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGTS T L  PAY    + F + A   +      L F+ C+ LS  +T  + P V
Sbjct: 345 GVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSL-FDTCFDLS-GKTEVKVPTV 402

Query: 384 NLTMKGG 390
            L  +G 
Sbjct: 403 VLHFRGA 409


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/304 (30%), Positives = 131/304 (43%), Gaps = 53/304 (17%)

Query: 60  YSALAHR--DRYFRLRGR-GLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
           ++  AHR  +R   L  R G A+ G+ ++PL   +G   Y +           S+G P  
Sbjct: 42  FTRAAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMT---------FSMGTPPQ 92

Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE- 174
           +     DTGSDL W  C  C  C    ++S         Y P  SS+ SK+PC+S LC  
Sbjct: 93  TLSALADTGSDLIWAKCGACKRCAPRGSAS---------YYPTKSSSFSKLPCSSALCRT 143

Query: 175 LQKQ-------CPSAGSNCPYQVRY-LSDGTM--STGFLVEDVLHLATDEKQSKSVDSRI 224
           L+ Q         + G+ C Y+  Y LS      + G++  +   L +D  Q       I
Sbjct: 144 LESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQG------I 197

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRIS 282
            FGC    T S     + +GL GLG  K S    L  Q L   +FS C  SD   +  + 
Sbjct: 198 GFGC---TTMSEGGYGSGSGLVGLGRGKLS----LVRQ-LKVGAFSYCLTSDPSTSSPLL 249

Query: 283 FGDKG--SPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSFTYLND 338
           FG      PG   TP    +T   Y + +  +S+G            IFDSGT+ T+L +
Sbjct: 250 FGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAE 309

Query: 339 PAYT 342
           PAYT
Sbjct: 310 PAYT 313


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 122/315 (38%), Gaps = 57/315 (18%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH----YTNVSVGQPALSFI 119
           A + R F LR R + A    + P            + L F H      +++VG P  +  
Sbjct: 30  AAKPRAFPLRARQVPAGALPRPP------------SKLRFHHNVSLTVSLAVGTPPQNVT 77

Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ--- 176
           + LDTGS+L WL   C +   G  ++         + P  S+T + VPC ST C  +   
Sbjct: 78  MVLDTGSELSWL--LCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDLP 135

Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
               C  A   C   + Y +DG+ S G L  DV  +       ++   R +FGC      
Sbjct: 136 APPSCDGASRQCHVSLSY-ADGSASDGALATDVFAVG------EAPPLRSAFGCMSTAYD 188

Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-SDGTGRISFGDKGSP--GQ 291
           S  DG A  GL G+     S  +  + +      FS C    D  G +  G    P    
Sbjct: 189 SSPDGVATAGLLGMNRGTLSFVTQASTR-----RFSYCISDRDDAGVLLLGHSDLPFLPL 243

Query: 292 GETPFSLRQTHP-------TYNITITQVSVGGNAVNFEFSAI-----------FDSGTSF 333
             TP   + T P        Y++ +  + VGG A+    S +            DSGT F
Sbjct: 244 NYTPL-YQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQF 302

Query: 334 TYLNDPAYTQISETF 348
           T+L   AY+ +   F
Sbjct: 303 TFLLGDAYSALKAEF 317


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 95/357 (26%), Positives = 144/357 (40%), Gaps = 64/357 (17%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           N+S+G P ++F V  DTGS L W  C  C  C                + P +SST SK+
Sbjct: 93  NLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPP---------FQPASSSTFSKL 143

Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGT-MSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           PC S+LC+     P    N    V Y   G   + G+L  + LH+             ++
Sbjct: 144 PCASSLCQFLTS-PYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPG------VA 196

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD---GTGRIS 282
           FGC   + G    G + +G+ GLG       S+++  G+    FS C  SD   G   I 
Sbjct: 197 FGC-STENGV---GNSSSGIVGLGRSPL---SLVSQVGV--GRFSYCLRSDADAGDSPIL 247

Query: 283 FGDKGSPGQG---ETPFSLRQTHPT---YNITITQVSVGG-----NAVNFEFS------- 324
           FG       G    TP       P+   Y + +T ++VG       +  F F+       
Sbjct: 248 FGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGL 307

Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCYVLSPNQTNF 378
               I DSGT+ TYL    Y  +   F S       T+T   +   F+ C+  +      
Sbjct: 308 VGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGS 367

Query: 379 EYPVVNLTMK--GGGPFFVNDP----IVIVSSEPKGLYLYCLGVVKSD---NVNIIG 426
             PV  L ++  GG  + V       +V V S+ +   + CL V+ +    +++IIG
Sbjct: 368 GVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAA-VECLLVLPASEKLSISIIG 423


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 36/273 (13%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           +SL  L Y  +V +G PA++  V +DTGSD+ W+ C+        ++ +G + D     P
Sbjct: 101 SSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFD-----P 155

Query: 158 NTSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
             SST +   C++  C       +     A S C Y V+Y  DG+ +TG    DVL L+ 
Sbjct: 156 AASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTLSG 214

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSM 271
            +     V     FGC   + G+ +D    +GL GLG D  S V    A  G    SF  
Sbjct: 215 SD-----VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSPVSQTAARYG---KSFFY 265

Query: 272 CFGSD--GTGRISFGDKGSPGQ------GETPFSLRQTHPTYNI-TITQVSVGGNAVN-- 320
           C  +    +G ++ G   S G         TP    +  PTY    +  ++VGG  +   
Sbjct: 266 CLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 325

Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
              F   ++ DSGT  T L   AY  +S  F +
Sbjct: 326 PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 358


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 120/295 (40%), Gaps = 47/295 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P L +   +DTGSDL W  C  C+ C       + Q   +  +    S+T 
Sbjct: 89  YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------ADQPTPY--FDVKKSATY 139

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             +PC S+ C            C YQ  Y  D   + G L  +          +K   + 
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTFGA-ANSTKVRATN 197

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
           I+FGCG +  G   D A  +G+ G G    S+ S L      P+ FS C   + S    R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249

Query: 281 ISFG----------DKGSPGQGETPFSLRQTHP-TYNITITQVSVGGN---------AVN 320
           + FG            GSP Q  TPF +    P  Y +++  +S+G           A+N
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQ-STPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIN 308

Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
            + +   I DSGTS T+L   AY  +     S A      + +D+  + C+   P
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPLTAMNDTDIGLDTCFQWPP 362


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 93/386 (24%), Positives = 152/386 (39%), Gaps = 89/386 (23%)

Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPN 158
           S G  +   + +G P   F  A+DT SDL W  C  CV C   L+          +++P 
Sbjct: 83  SAGGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDP---------VFNPV 133

Query: 159 TSSTSSKVPCNSTLC-ELQ-KQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLA 211
            S++ + VPCNS  C EL   +C   G +     C Y   Y  + T + G L  D L + 
Sbjct: 134 ASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNAT-TRGILAVDRLAIG 192

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN--GLFGLGMDKTSVPSILANQGLI---- 265
            D      V   + FGC    + S + G  P   G+ GLG    S+ S L+ +  +    
Sbjct: 193 DD------VFRGVVFGC----SSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRRFMYCLP 242

Query: 266 -PNSFS---MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN 320
            P S S   +  G+D    +    + +  +   P S    +P+ Y + +  +S+G  A++
Sbjct: 243 PPVSRSAGRLVLGADAAATV----RNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMS 298

Query: 321 FE------------------------------------FSAIFDSGTSFTYLNDPAYTQI 344
           F                                     +  I D  ++ T+L +  Y   
Sbjct: 299 FRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLY--- 355

Query: 345 SETFNSLAKEKR--ETSTSDLPFEYCYVLSPN--QTNFEYPVVNLTMKGGGPFFVNDPIV 400
            E  + L +E R    S SDL  + C++L      +    P V+L  +G       + + 
Sbjct: 356 EEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMF 415

Query: 401 IVSSEPKGLYLYCLGVVKSDNVNIIG 426
           +   E +   + CL V K+D V+I+G
Sbjct: 416 V---EDRASGMMCLMVGKTDGVSILG 438


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 138/353 (39%), Gaps = 52/353 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +  ++++G P     + LDTGSDL W    C  C    + + G +       P+ SST  
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVW--TQCRPCPVCFSRALGPL------DPSNSSTFD 466

Query: 165 KVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            +PC+S +C+          N     C Y   Y +DG+++TG L  +    A  +   ++
Sbjct: 467 VLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY-ADGSITTGHLDAETFTFAAADGTGQA 525

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
               ++FGCG    G F       G+ G G    S+PS L       ++FS CF +    
Sbjct: 526 TVPDLAFGCGLFNNGIFTSNE--TGIAGFGRGALSLPSQLKV-----DNFSHCFTAITGS 578

Query: 280 RISFGDKGSPGQ---------GETPF-----SLRQTHPTYNITITQVSVGGNAVNFEFS- 324
             S    G P             TP      SLR     Y +++  ++VG   +    S 
Sbjct: 579 EPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLR----AYYLSLKGITVGSTRLPIPEST 634

Query: 325 ----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS-P 373
                      I DSGT  T L   AY  + + F +  +   + +TS      C+  S P
Sbjct: 635 FALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCFSFSVP 694

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +   + P + L  +G       +  +    E  G  + CL +   D++ IIG
Sbjct: 695 RRAKPDVPKLVLHFEGATLDLPRENYMF-EFEDAGGSVTCLAINAGDDLTIIG 746


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 107/431 (24%), Positives = 174/431 (40%), Gaps = 68/431 (15%)

Query: 36  HRYSDPVKGILAVDDLPKKGSFAYYSALAH---RDRYFRLRGRGLAAQGNDKTPLTFSAG 92
           H  S P K +       K  S A  +AL     R  Y R R +  A Q  D  P      
Sbjct: 51  HSPSSPYKNV-------KAESLAKDTALESTLSRHAYLRARQQK-ALQPADFVPPPLIRD 102

Query: 93  NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
              +           N+S+G P  +  V LDTGSDLFW+ C+ C  C    +        
Sbjct: 103 KSAF---------LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDP------- 146

Query: 152 FNIYSPNTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
             IY+   S + +++ CN   C     + QC  +GS C YQ  Y +DG+ ++G L  + +
Sbjct: 147 --IYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGS-CLYQTSY-ADGSRTSGLLSYEKV 202

Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
              T     +   +++ FGCG +Q  +F+  +   G+ GLG    S+ S L+  G +  S
Sbjct: 203 AF-TSHYSDEDKTAQVGFGCG-LQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKS 260

Query: 269 FSMCFGS----DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN 320
           F+ CFG+    +  G + FGD        TP  + + +        + + +  +  N+ +
Sbjct: 261 FAYCFGNLSNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSS 320

Query: 321 FEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE----TSTSDLPFEYCYV 370
           FE         I DSG++ +      Y  +        K+       TS+ D     C+ 
Sbjct: 321 FERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-----CFE 375

Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG---- 426
               +    +P + L ++  G   +ND   I         L+CLG    + ++IIG    
Sbjct: 376 GKIGRDLPLFPTLVLYLESTG--ILNDRWSIFLQRYDE--LFCLGFTSGEGLSIIGTLAQ 431

Query: 427 REYPIANNISL 437
           + Y    N+ L
Sbjct: 432 QSYKFGYNLEL 442


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 131/312 (41%), Gaps = 51/312 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   +S+G P +      DTGSDL W  C  C  C    N          ++ P +SS+ 
Sbjct: 60  YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNP---------MFDPRSSSSY 110

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           + + C +  C       C +    C Y   Y +D +++ G L ++ L L +   +  +  
Sbjct: 111 TNITCGTESCNKLDSSLCSTDQKTCNYTYSY-ADNSITQGVLAQETLTLTSTTGEPVAFQ 169

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS-ILANQGLIPNSFSMC---FGSDG 277
             I FGCG   +G F D     GL GLG    S+ S I ++ G   N FS C   F +D 
Sbjct: 170 GII-FGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDP 225

Query: 278 --TGRISFGDKGSP----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
             T +++FG KGS     G   TP  + +    Y  T+  +SV    +N  FS       
Sbjct: 226 SITSQMNFG-KGSEVLGNGTVSTPL-ISKDGTGYFATLLGISV--EDINLPFSNGSSLGT 281

Query: 325 -----AIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
                 + DSGT+ TYL +  Y + I +  N +A E          +E CY      TN 
Sbjct: 282 ITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG----YELCY---QTPTNL 334

Query: 379 EYPVVNLTMKGG 390
             P + +  +GG
Sbjct: 335 NGPTLTIHFEGG 346


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/353 (24%), Positives = 138/353 (39%), Gaps = 49/353 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   ++VG PA+  ++A+DTGSD+ WL C  C  C       SG V D     P  S++ 
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 184

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
            ++  ++  C+   +     +    C Y V Y  DG+ + G  +E+ L  A   +     
Sbjct: 185 REMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGGVQV---- 240

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------- 273
              +S GCG    G F   AA  G+ GLG  + S PS +A  G    SFS C        
Sbjct: 241 -PHMSIGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSS 297

Query: 274 -GSDGTGRISFGD---KGSPGQGETPFSLRQTHPTY--------------NITITQVSVG 315
            G   +  ++ GD    GSP    TP        T+                 +T+  + 
Sbjct: 298 PGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLK 357

Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCYVLSP 373
            +        I DSGT+ T L   AY    + F + A +  + S       F+ CY +  
Sbjct: 358 LDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTM-- 415

Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
                + P V++   GG    +     ++  +  G   +        +V+IIG
Sbjct: 416 GGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIG 468


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 91/356 (25%), Positives = 138/356 (38%), Gaps = 50/356 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +++ + VG PA    V LDTGSD+ W+ C  C  C    +          I+ P +SST 
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDP---------IFDPTSSSTF 214

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             + C+   C          + C YQV Y  DG+ + G    D +      K +      
Sbjct: 215 KSLTCSDPKCASLDVSACRSNKCLYQVSY-GDGSFTVGNYATDTVTFGESGKVND----- 268

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
           ++ GCG    G F   A   GL G  +  T       NQ +   SFS C     + + S 
Sbjct: 269 VALGCGHDNEGLFTGAAGLLGLGGGALSMT-------NQ-IKAKSFSYCLVDRDSAKSSS 320

Query: 284 GDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
            D  S     G    P        T Y + ++  SVGG  V+     FE  A      I 
Sbjct: 321 LDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVIL 380

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D GT+ T L   AY  + + F  L  + ++ ++    F+ CY  S   T  + P V    
Sbjct: 381 DCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLST-VKVPTVTFHF 439

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNI 435
            GG    +     ++  +  G + +      S +++IIG          Y +ANN+
Sbjct: 440 TGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSS-SLSIIGNVQQQGTRITYDLANNL 494


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 97/349 (27%), Positives = 141/349 (40%), Gaps = 55/349 (15%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS---CVHGLNSSSGQVIDFNIYSP 157
           G  +   + VGQP   F +  DTGSD+ WL C  C S   C    +          I+ P
Sbjct: 145 GAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDP---------IFDP 195

Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            +SS+ S + CNS  C+L  +       C YQV Y  DG+ +TG L  + L        S
Sbjct: 196 KSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY-GDGSFTTGELATETLSFG----NS 250

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
            S+ + +  GCG    G F  GA   GL G  +  +S         L  +SFS C     
Sbjct: 251 NSIPN-LPIGCGHDNEGLFAGGAGLIGLGGGAISLSS--------QLKASSFSYCLVNLD 301

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA--- 325
           SD +  + F          +P        +Y  + +  +SVGG  +      FE      
Sbjct: 302 SDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL 361

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQTNF 378
              I DSGT  + L    Y  + E F  L      +S S  P    F+ CY  S  Q+N 
Sbjct: 362 GGIIVDSGTIISRLPSDVYESLREAFVKLT-----SSLSPAPGISVFDTCYNFS-GQSNV 415

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
           E P +   +  G    +     ++  +  G   YCL  +K+  +++IIG
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAG--TYCLAFIKTKSSLSIIG 462


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 124/303 (40%), Gaps = 39/303 (12%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           V  G PA +     DTGSDL W+   C  C          V D     P  SS+ + VPC
Sbjct: 116 VGFGSPAQTSATMFDTGSDLSWI--QCQPCSGHCYKQHDPVFD-----PAKSSSYAVVPC 168

Query: 169 NSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
            +T C     +C   G+ C Y V Y  DG+ +TG L  + L  ++  + +  +     FG
Sbjct: 169 GTTECAAAGGEC--NGTTCVYGVEY-GDGSSTTGVLARETLTFSSSSEFTGFI-----FG 220

Query: 228 CGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISF 283
           CG    G F  +DG    G   L +   + P+     G I   FS C  S  T  G +S 
Sbjct: 221 CGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAF----GGI---FSYCLPSYNTTPGYLSI 273

Query: 284 GDKGSPGQGETPFSLRQTHPTYN----ITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
           G     GQ    ++     P Y     I +  +++GG  +     EF+    + DSGT  
Sbjct: 274 GATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGTLLDSGTIL 333

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           TYL  PAYT + + F    +  +     D   + CY  +  Q+    P V+     G  F
Sbjct: 334 TYLPPPAYTALRDRFKFTMQGSKPAPPYDE-LDTCYDFT-GQSGILIPGVSFNFSDGAVF 391

Query: 394 FVN 396
            +N
Sbjct: 392 NLN 394


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 127/315 (40%), Gaps = 54/315 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG PA +  + LDTGSD+ WL C  C +C +  +          I+ P  S T 
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDV---------IFDPKKSKTF 188

Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           + VPC S LC       +C +  S  C YQV Y  DG+ + G    + L           
Sbjct: 189 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSY-GDGSFTEGDFSTETLTF-----HGAR 242

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
           VD  +  GCG    G F+  A      GLG    S PS    +      FS C       
Sbjct: 243 VD-HVPLGCGHDNEGLFVGAAGLL---GLGRGGLSFPS--QTKSRYNGKFSYCLVDRTSS 296

Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
              S     I FG+   P    + F+   T+P     Y + +  +SVGG+ V       F
Sbjct: 297 GSSSKPPSTIVFGNDAVP--KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 354

Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
           +  A      I DSGTS T L   AY  + + F  L   K + + S   F+ C+ LS   
Sbjct: 355 KLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFR-LGATKLKRAPSYSLFDTCFDLS-GM 412

Query: 376 TNFEYPVVNLTMKGG 390
           T  + P V     GG
Sbjct: 413 TTVKVPTVVFHFGGG 427


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 79/286 (27%), Positives = 116/286 (40%), Gaps = 42/286 (14%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           +S+G P +     +DTGSDL WL C  C +C   LN          ++ P +SST S + 
Sbjct: 63  LSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNP---------MFDPQSSSTYSNIA 113

Query: 168 CNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
             S  C       C    +NC Y   Y  D +++ G L ++ L L +   +  ++   I 
Sbjct: 114 YGSESCSKLYSTSCSPDQNNCNYTYSY-EDDSITEGVLAQETLTLTSTTGKPVALKGVI- 171

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGR 280
           FGCG    G F D     G+ GLG    S+ S + +       FS C          T  
Sbjct: 172 FGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGS-SFGGKMFSQCLVPFHTNPSITSP 228

Query: 281 ISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVNFEF------------ 323
           +SFG KGS   G     TP   + TH   Y +T+  +SV    +N  F            
Sbjct: 229 MSFG-KGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLPFNDGSSLEPITKG 285

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           + + DSGT  T L +  Y ++ E   +            L ++ CY
Sbjct: 286 NMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCY 331


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 122/316 (38%), Gaps = 64/316 (20%)

Query: 122 LDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
           +DTGSDL W+PC     C++C    ++S+G      ++ P  SS+   V C  + C+   
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPED-SASNG------VFLPRMSSSLHLVTCADSNCKTLY 53

Query: 175 ------LQKQCPSAGSNC-----PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
                 L + C  +  NC     PY ++Y    T   G L+ + L+L  +  +     + 
Sbjct: 54  GNNTELLCQSCAGSLKNCSETCPPYGIQYGRGST--AGLLLTETLNLPLENGEGARAITH 111

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS------DG 277
            + GC      S +    P+G+ G G    S+PS L    +  + F+ C  S      + 
Sbjct: 112 FAVGC------SIVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDEENK 164

Query: 278 TGRISFGDKGSPGQ---GETPFSLRQTHPT-------YNITITQVSVGGNAVN------F 321
              +  GDK  P       TPF      P        Y I +  VS+GG  +        
Sbjct: 165 KSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLL 224

Query: 322 EFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPN 374
            F        I DSGT+FT  +D  +  I+  F S    +R     D      CY ++  
Sbjct: 225 RFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVT-G 283

Query: 375 QTNFEYPVVNLTMKGG 390
             N   P      KGG
Sbjct: 284 LENIVLPEFAFHFKGG 299


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 100/425 (23%), Positives = 153/425 (36%), Gaps = 62/425 (14%)

Query: 30  FGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
           F  +  HRYS         +     G+   Y  +       ++R   LA      T   F
Sbjct: 28  FSLEIVHRYSR--------ESPFYPGNITDYERITRLVELSKIRAHNLAI----TTSSGF 75

Query: 90  SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQ 148
           S      R++     +   V +G P +   +  DTGS LFW  C+ C      L      
Sbjct: 76  SPEAFRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPP---- 131

Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQK---QCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
                I++   S T   +PC    C   +   QC      C Y++ Y + G+ + G   +
Sbjct: 132 -----IFNSTASRTYRDLPCQHQFCTNNQNVFQC--RDDKCVYRIAY-AGGSATAGVAAQ 183

Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQG 263
           D+L  A +++          FGC R      +F       G+ GL M   S+  +     
Sbjct: 184 DILQSAENDRIP------FYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSL--LQQMNH 235

Query: 264 LIPNSFSMCFG-------SDGTGRISFGD---KGSPGQGETPFSLRQTHPTYNITITQVS 313
           +  N FS C         S  T  + FG+   K       TPF   +  P Y + +  VS
Sbjct: 236 ITKNRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVS 295

Query: 314 VGGNAVNF---EFS--------AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTS 361
           V GN +      F+         I DSGT+ TY++  AY  +   F N   +   +    
Sbjct: 296 VAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNI 355

Query: 362 DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
            L    CY      T   YP +    +G   FFV    V ++ + +G +   L  +    
Sbjct: 356 QLSGYICYK-QQGHTFHNYPSMAFHFQGAD-FFVEPEYVYLTVQDRGAFCVALQPISPQQ 413

Query: 422 VNIIG 426
             IIG
Sbjct: 414 RTIIG 418


>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 429

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 85/290 (29%), Positives = 129/290 (44%), Gaps = 51/290 (17%)

Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
           +GF + T +++GQPA  + + +DTGSDL WL CD   C H   +         +Y P   
Sbjct: 66  VGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCD-APCTHCSETP------HPLYRP--- 114

Query: 161 STSSKVPCNSTLC-ELQKQCPSAGSNCP------YQVRYLSDGTMSTGFLVEDVLHLA-T 212
            ++  VPC   LC  LQ   P+   NC       Y++ Y +D   + G L+ DV  L  T
Sbjct: 115 -SNDFVPCRDPLCASLQ---PTEDYNCEHPDQCDYEINY-ADQYSTFGVLLNDVYLLNFT 169

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
           +  Q K    R++ GCG  Q  S       +GL GLG  K S+ S L +QGL+ N    C
Sbjct: 170 NGVQLKV---RMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHC 226

Query: 273 FGSDGTG-----------RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
             + G G           R+++          TP S   +   Y+    ++  GG     
Sbjct: 227 LSAQGGGYIFFGNAYDSARVTW----------TPISSVDSK-HYSAGPAELVFGGRKTGV 275

Query: 322 -EFSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
              +A+FD+G+S+TY N  AY   +S     L+ +  + +  D     C+
Sbjct: 276 GSLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCW 325


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 99/395 (25%), Positives = 163/395 (41%), Gaps = 59/395 (14%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A R    R   R LAA  ++ T  T SA     +++     +   +++G P +S+    D
Sbjct: 50  ALRRDMHRHNARQLAASSSNGT--TVSAPT---QISPTAGEYLMTLAIGTPPVSYQAIAD 104

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL----CELQKQC 179
           TGSDL W    C  C     SS        +Y+P++S+T + +PCNS+L      L    
Sbjct: 105 TGSDLIW--TQCAPC-----SSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTT 157

Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
           P  G  C Y + Y S  T  + +   +     +    +++    I+FGC     G   + 
Sbjct: 158 PPPGCTCMYNMTYGSGWT--SVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGG--FNT 213

Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGS----PGQ 291
           ++ +GL GLG    S+ S L     +P  FS C      ++ T  +  G   S     G 
Sbjct: 214 SSASGLVGLGRGSLSLVSQLG----VPK-FSYCLTPYQDTNSTSTLLLGPSASLNDTGGV 268

Query: 292 GETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYL 336
             TPF    +       Y + +T +S+G  A++   +A           I DSGT+ T L
Sbjct: 269 SSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLL 328

Query: 337 NDPAYTQISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNF--EYPVVNLTMKGGGPF 393
            + AY Q+     SL      +  ++    + C+ L P+ T+     P + L   G    
Sbjct: 329 GNTAYQQVRAAVVSLVTLPTTDGGSAATGLDLCFEL-PSSTSAPPTMPSMTLHFDGADMV 387

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
              D  +++ S      L+CL +    +  V+I+G
Sbjct: 388 LPADSYMMLDSN-----LWCLAMQNQTDGGVSILG 417


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 132/311 (42%), Gaps = 56/311 (18%)

Query: 66  RDRYFRLRGRGLAA----QGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
           R +  +LR + + +    Q   +T +  ++G    +L +L ++    V +G   +S IV 
Sbjct: 100 RVQSLQLRIKAMTSSTTEQSVSETQIPLTSG---IKLETLNYI--VTVELGGKNMSLIV- 153

Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----L 175
            DTGSDL W+ C  C SC +             +Y P+ SS+   V CNS+ C+      
Sbjct: 154 -DTGSDLTWVQCQPCRSCYNQQGP---------LYDPSVSSSYKTVFCNSSTCQDLVAAT 203

Query: 176 QKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
               P  G N      C Y V Y  DG+ + G L  + + L   + ++      + FGCG
Sbjct: 204 GNSGPCGGFNGVVKTTCEYVVSY-GDGSYTRGDLASESIVLGDTKLEN------LVFGCG 256

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DG-TGRISFGD- 285
           R   G F      +GL GLG  ++SV  +          FS C  S  DG +G +SFG+ 
Sbjct: 257 RNNKGLF---GGASGLMGLG--RSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGND 311

Query: 286 ----KGSPGQGETPFSLR-QTHPTYNITITQVSVGG---NAVNFEFSAIFDSGTSFTYLN 337
               K S     TP     Q    Y + +T  S+GG     ++F    + DSGT  T L 
Sbjct: 312 FSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTVITRLP 371

Query: 338 DPAYTQISETF 348
              Y  +   F
Sbjct: 372 PSIYKAVKTEF 382


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 112/291 (38%), Gaps = 46/291 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P     + +D+GSD+ W+ C  C  C    +          ++ P  SS+ 
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180

Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S V C S +C                C Y V Y  DG+ + G L  + L L     Q   
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
               ++ GCG   +G F+  A   GL GLG    S+   L   G     FS C    G+ 
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAG 288

Query: 277 GTGRISFGDKGSPGQGETPFSL---RQTHPTYNITITQVSVGGNAVNFE----------- 322
           G G +  G   +   G     L    Q    Y + +T + VGG  +  +           
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGA 348

Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
              + D+GT+ T L   AY  +   F+ ++    R  + S L  + CY LS
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 397


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 117/315 (37%), Gaps = 64/315 (20%)

Query: 70  FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH----YTNVSVGQPALSFIVALDTG 125
           F LR R + A+   + P            + L F H      +++VG P  +  + LDTG
Sbjct: 58  FALRARQMPARALPRQP------------SKLRFHHNVSLTVSLAVGTPPQNVTMVLDTG 105

Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCP 180
           S+L WL C      +  ++ S        + P  SST + VPC S  C  +       C 
Sbjct: 106 SELSWLLCAPAGARNKFSAMS--------FRPRASSTFAAVPCASAQCRSRDLPSPPACD 157

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
            A S C   + Y +DG+ S G L  DV  + +          R +FGC      S  DG 
Sbjct: 158 GASSRCSVSLSY-ADGSSSDGALATDVFAVGSGPPL------RAAFGCMSSAFDSSPDGV 210

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-SDGTGRISFGDKGSPG--------- 290
           A  GL G+     S  S  + +      FS C    D  G +  G    P          
Sbjct: 211 ASAGLLGMNRGALSFVSQASTR-----RFSYCISDRDDAGVLLLGHSDLPTFLPLNYTPM 265

Query: 291 -QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI-----------FDSGTSFTYLND 338
            Q   P         Y++ +  + VGG  +    S +            DSGT FT+L  
Sbjct: 266 YQPALPLPYFD-RVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 324

Query: 339 PAYTQISETFNSLAK 353
            AY+ +   F   A+
Sbjct: 325 DAYSALKAEFTRQAR 339


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 80/276 (28%), Positives = 113/276 (40%), Gaps = 46/276 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P   F + LDTGSDL W+ C  C +C            +   Y P  SS+ 
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQ---------NGPYYDPKDSSSF 245

Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDE-K 215
             + C+   C+L       + C     +CPY   Y      +  F +E   ++L T E K
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
               +   + FGCG    G F   A    L GLG    S  + L  Q L  +SFS C   
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL--QSLYGHSFSYCLVD 360

Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVN--- 320
               S  + ++ FG+       P    T F   + +P    Y + I  + VGG  +    
Sbjct: 361 RNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPE 420

Query: 321 --FEFSA------IFDSGTSFTYLNDPAYTQISETF 348
             +  SA      I DSGT+ TY  +PAY  I E F
Sbjct: 421 ETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAF 456


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 71/234 (30%), Positives = 102/234 (43%), Gaps = 30/234 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   +S+G P +      DTGSDL WL C  C +C   LN          ++   +SST 
Sbjct: 59  YLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNP---------MFDSQSSSTF 109

Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           S + C S  C       C     NC Y   Y+ DG+ + G L ++ L L +   +  +  
Sbjct: 110 SNIACGSESCSKLYSTSCSPDQINCKYNYSYV-DGSETQGVLAQETLTLTSTTGEPVAFK 168

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG-- 279
             I FGCG    G+F D     G+ GLG    S+ S + +  L  N FS C     T   
Sbjct: 169 GVI-FGCGHNNNGAFNDKEM--GIIGLGRGPLSLVSQIGSS-LGGNMFSQCLVPFNTNPS 224

Query: 280 ---RISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA 325
               +SFG KGS   G     TP   + T+ + Y +T+  +SV    +N  F+A
Sbjct: 225 ISSPMSFG-KGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISV--EDINLPFNA 275


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 123/310 (39%), Gaps = 49/310 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ W+ C  C  C    +          +++P  SST 
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDP---------LFNPAASSTY 203

Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
            KVPC + LC   K+   +G      C YQV Y  DG+ + G    + L           
Sbjct: 204 RKVPCATPLC---KKLDISGCRNKRYCEYQVSY-GDGSFTVGDFSTETLTF------RGQ 253

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
           V  R++ GCG    G F+  A      GLG    S PS    Q      FS C      S
Sbjct: 254 VIRRVALGCGHDNEGLFIGAAGLL---GLGRGSLSFPSQTGAQ--FSKRFSYCLVDRSAS 308

Query: 276 DGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVN------FEFSA-- 325
                + FG    P     TP  S  +    Y + +  +SVGG  +       F   A  
Sbjct: 309 GTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATG 368

Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               I DSGTS T L D AY+ + + F       +      L F+ CY LS  +T  + P
Sbjct: 369 NGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSL-FDTCYDLSGLKT-VKVP 426

Query: 382 VVNLTMKGGG 391
            +    +GG 
Sbjct: 427 TLVFHFQGGA 436


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 88/308 (28%), Positives = 126/308 (40%), Gaps = 50/308 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG PA    + LDTGSD+ WL C  C  C    +          I+ P  S T 
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192

Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           + +PC+S  C   ++  SAG N     C YQV Y  DG+ + G    + L    +  +  
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
                ++ GCG    G F+  A    L GLG  K S P    ++      FS C      
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297

Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
           S     + FG+         TP  S  +    Y + +  +SVGG  V    +++F     
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357

Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
                  DSGTS T L  PAY  + + F   AK  +      L F+ C+ LS N    + 
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSL-FDTCFDLS-NMNEVKV 415

Query: 381 PVVNLTMK 388
           P V L  +
Sbjct: 416 PTVVLHFR 423


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 88/293 (30%), Positives = 124/293 (42%), Gaps = 54/293 (18%)

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
           + P+T  A     RL +L ++    +  G+      V +DT S+L W+ C  C SC    
Sbjct: 112 RVPVTSGA-----RLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASC---- 158

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGS--------NCPYQVRYL 193
           +   G + D     P +S + + +PCNS+ C+ LQ    SA          +C Y + Y 
Sbjct: 159 HDQQGPLFD-----PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSY- 212

Query: 194 SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
            DG+ S G L  D L LA +      V     FGCG    G F      +GL GLG  + 
Sbjct: 213 RDGSYSQGVLAHDKLSLAGE------VIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQL 263

Query: 254 SVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPT 304
           S+ S   +Q      FS C     S+ +G +  GD  S  +  TP       S     P 
Sbjct: 264 SLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF 321

Query: 305 YNITITQVSVGGNAVNFEFSA---IFDSGTSFTYLNDPAYTQISETFNSLAKE 354
           Y + +T +++GG  V  E SA   I DSGT  T L    Y  +   F S   E
Sbjct: 322 YFVNLTGITIGGQEV--ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAE 372


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 109/263 (41%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C    ++C Y V Y +    S G +V D L +   
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ +     +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 97/349 (27%), Positives = 141/349 (40%), Gaps = 55/349 (15%)

Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS---CVHGLNSSSGQVIDFNIYSP 157
           G  +   + VGQP   F +  DTGSD+ WL C  C S   C    +          I+ P
Sbjct: 145 GAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDP---------IFDP 195

Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
            +SS+ S + CNS  C+L  +       C YQV Y  DG+ +TG L  + L        S
Sbjct: 196 KSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY-GDGSFTTGELATETLSFG----NS 250

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
            S+ + +  GCG    G F  GA   GL G  +  +S         L  +SFS C     
Sbjct: 251 NSIPN-LPIGCGHDNEGLFAGGAGLIGLGGGAISLSS--------QLKASSFSYCLVNLD 301

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA--- 325
           SD +  + F          +P        +Y  + +  +SVGG  +      FE      
Sbjct: 302 SDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL 361

Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQTNF 378
              I DSGT  + L    Y  + E F  L      +S S  P    F+ CY  S  Q+N 
Sbjct: 362 GGIIVDSGTIISRLPSDVYESLREAFVKLT-----SSLSPAPGISVFDTCYNFS-GQSNV 415

Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
           E P +   +  G    +     ++  +  G   YCL  +K+  +++IIG
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAG--TYCLAFIKTKSSLSIIG 462


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 90/331 (27%), Positives = 142/331 (42%), Gaps = 57/331 (17%)

Query: 38  YSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYR 97
           ++  +K  L +DD   +       +L  R +   + GR +    +   PLT        R
Sbjct: 83  WNKKLKKHLIMDDFQLR-------SLQSRMKSI-ISGRNIDDSVDAPIPLT-----SGIR 129

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
           L +L ++    V +G   ++ IV  DTGSDL W+ C  C  C +  +          +++
Sbjct: 130 LQTLNYI--VTVELGGRKMTVIV--DTGSDLSWVQCQPCKRCYNQQDP---------VFN 176

Query: 157 PNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
           P+TS +   V C+S  C+ LQ        C S   +C Y V Y  DG+ + G L  + L 
Sbjct: 177 PSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNY-GDGSYTRGELGTEHLD 235

Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
           L      S +V++ I FGCGR   G F      +GL GLG  ++S+  I     +    F
Sbjct: 236 LGN----STAVNNFI-FGCGRNNQGLF---GGASGLVGLG--RSSLSLISQTSAMFGGVF 285

Query: 270 SMCF---GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-----YNITITQVSVGGNAVNF 321
           S C     ++ +G +  G   S  +  TP S  +  P      Y + +T ++VG  AV  
Sbjct: 286 SYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQA 345

Query: 322 ----EFSAIFDSGTSFTYLNDPAYTQISETF 348
               +   + DSGT  T L    Y  + + F
Sbjct: 346 PSFGKDGMMIDSGTVITRLPPSIYQALKDEF 376


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 145/360 (40%), Gaps = 59/360 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ ++ VG P     + LDTGSDL W+ CD C  C                Y+PN SS+ 
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPH---------YNPNESSSY 220

Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTG-FLVEDVLHLAT---- 212
             + C    C+L       + C +    CPY   Y +DG+ +TG F +E      T    
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDY-ADGSNTTGDFALETFTVNLTWPNG 279

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
            EK    VD  + FGCG    G F       GL GLG    S PS L  Q +  +SFS C
Sbjct: 280 KEKFKHVVD--VMFGCGHWNKGFF---HGAGGLLGLGRGPLSFPSQL--QSIYGHSFSYC 332

Query: 273 F-----GSDGTGRISFG-DKGSPGQGETPFS-LRQTHPT-----YNITITQVSVGGNAVN 320
                  +  + ++ FG DK         F+ L     T     Y + I  + VGG  ++
Sbjct: 333 LTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLD 392

Query: 321 -----FEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
                + +S+      I DSG++ T+  D AY  I E F    K  ++ +  D     CY
Sbjct: 393 IPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIK-LQQIAADDFIMSPCY 451

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN---VNIIG 426
            +S      E P   +    G  +           EP    + CL ++K+ N   + IIG
Sbjct: 452 NVS-GAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDE--VICLAILKTPNHSHLTIIG 508


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 141/344 (40%), Gaps = 57/344 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G P  + ++A+DT +D  W+PC  C  C   L            ++P  S+T 
Sbjct: 98  YIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTL------------FAPEKSTTF 145

Query: 164 SKVPCNSTLCELQKQCPSAG-SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             V C S  C  Q   PS G S C + + Y S    +   +V+D + LATD         
Sbjct: 146 KNVSCGSPQCN-QVPNPSCGTSACTFNLTYGSSSIAAN--VVQDTVTLATDPIPD----- 197

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
             +FGC    TG+    A P GL GLG    S+ S    Q L  ++FS C  S    + +
Sbjct: 198 -YTFGCVAKTTGA---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 251

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
           G +  G    P + +    L+    +  Y + +  + VG   V+    A           
Sbjct: 252 GSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGT 311

Query: 326 IFDSGTSFTYLNDPAYTQISETFN---SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
           +FDSGT FT L  PAYT + + F    ++A +   T TS   F+ CY +         P 
Sbjct: 312 VFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVP-----IVAPT 366

Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNII 425
           +     G       D I+I S+        CL +  + DNVN +
Sbjct: 367 ITFMFSGMNVTLPEDNILIHSTAGSTT---CLAMASAPDNVNSV 407


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 88/293 (30%), Positives = 124/293 (42%), Gaps = 54/293 (18%)

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
           + P+T  A     RL +L ++    +  G+      V +DT S+L W+ C  C SC    
Sbjct: 113 RVPVTSGA-----RLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASC---- 159

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGS--------NCPYQVRYL 193
           +   G + D     P +S + + +PCNS+ C+ LQ    SA          +C Y + Y 
Sbjct: 160 HDQQGPLFD-----PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSY- 213

Query: 194 SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
            DG+ S G L  D L LA +      V     FGCG    G F      +GL GLG  + 
Sbjct: 214 RDGSYSQGVLAHDKLSLAGE------VIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQL 264

Query: 254 SVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPT 304
           S+ S   +Q      FS C     S+ +G +  GD  S  +  TP       S     P 
Sbjct: 265 SLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF 322

Query: 305 YNITITQVSVGGNAVNFEFSA---IFDSGTSFTYLNDPAYTQISETFNSLAKE 354
           Y + +T +++GG  V  E SA   I DSGT  T L    Y  +   F S   E
Sbjct: 323 YFVNLTGITIGGQEV--ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAE 373


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 84/322 (26%), Positives = 131/322 (40%), Gaps = 70/322 (21%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSSGQVIDFNIYSPNT 159
           +   +++G P  +  V +DTGSDL W+PC     DC+ C    +  S  +   +I+SP  
Sbjct: 11  YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCN---DLKSNNLKSSSIFSPLH 67

Query: 160 SSTSSKVPCNSTLC-ELQKQ------CPSAGSN------------CPYQVRYLSDGTMST 200
           SS+S +  C S+ C E+         C  AG +            CP       +G + +
Sbjct: 68  SSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVS 127

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L  D+L   T +        R SFGC    T ++ +   P G+ G G    S+PS L 
Sbjct: 128 GILTRDILKARTRDV------PRFSFGC---VTSTYHE---PIGIAGFGRGLLSLPSQL- 174

Query: 261 NQGLIPNSFSMCF-------GSDGTGRISFGDKG-----SPGQGETPFSLRQTHP-TYNI 307
             G +   FS CF         + +  +  G        +     TP      +P +Y I
Sbjct: 175 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYI 232

Query: 308 TITQVSVGGNAVNFEF-------------SAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
            +  +++G N    +                + DSGT++T+L +P Y+Q+     S    
Sbjct: 233 GLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITY 292

Query: 355 KRETST-SDLPFEYCY-VLSPN 374
            R T T S   F+ CY V  PN
Sbjct: 293 PRATETESRTGFDLCYKVPCPN 314


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 124/322 (38%), Gaps = 55/322 (17%)

Query: 105 HYTNVSVGQP-----ALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPN 158
           +   ++VG P     +   +++ D GSD+ WL C  C  C H             +Y+  
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGP---------VYNRL 175

Query: 159 TSSTSSKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
            SS++S V C +  C        C    + C Y+V Y  DG+ S G    + L      +
Sbjct: 176 KSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEY-GDGSSSAGDFGVETLTFPPGVR 234

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
                   ++ GCG    G F   AA  G+ GLG    S PS +A  G    SFS C   
Sbjct: 235 VPG-----VAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQIA--GRYGRSFSYCLAG 285

Query: 276 DGTG----RISFGDKGSPGQGETP-------FSLRQTHPTYNITITQVSVGGNAVNF--- 321
            GTG     ++FG   S     T         +  + +  Y + +  +SVGG  V     
Sbjct: 286 QGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTE 345

Query: 322 ----------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY---C 368
                         I DSGT+ T L+ PAY    + F   A ++    +   PF +   C
Sbjct: 346 SDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTC 405

Query: 369 YVLSPNQTNFEYPVVNLTMKGG 390
           Y     +   + P V++   GG
Sbjct: 406 YSSVRGRVMKKVPAVSMHFAGG 427


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 57/309 (18%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDC--VSCVHGLNSSSGQVIDF--------- 152
           ++  +V +G PAL + + LDT +DL W+ C        H    S+GQ +           
Sbjct: 124 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEAS 183

Query: 153 -NIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
            N Y P  SS+  ++ C+   C +      Q PS   +C Y  +   DGT++ G   ++ 
Sbjct: 184 KNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIYGKEK 242

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
             +   + +   +   I  GC  ++ G  +D  A +G+  LG    S     A +     
Sbjct: 243 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR--FGQ 297

Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
            FS C  S     D +  ++FG   +   PG  ET         P Y   +T V VGG  
Sbjct: 298 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGER 357

Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--- 364
           ++                I D+ TS T L   AY  ++   +           S LP   
Sbjct: 358 LDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDR--------HLSHLPRVY 409

Query: 365 ----FEYCY 369
               FEYCY
Sbjct: 410 ELEGFEYCY 418


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 87/335 (25%), Positives = 140/335 (41%), Gaps = 60/335 (17%)

Query: 36  HRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDT 95
            R  D V+  L +D L       +  ++ +  R  R     +A     + PLT       
Sbjct: 68  ERKGDWVEKQLVLDGL-------HVRSIQNHIRK-RTSSSQIADSSETQVPLT-----SG 114

Query: 96  YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNI 154
            +  +L ++    V++G  + +  V +DTGSDL W+ C+ C SC +          +  +
Sbjct: 115 IKFQTLNYI----VTMGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQ---------NGPL 161

Query: 155 YSPNTSSTSSKVPCNSTLCELQK--QC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
           + P+TS +   + CNST C+  +   C   PS  + C Y V Y  DG+ ++G L  + L 
Sbjct: 162 FKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNY-GDGSYTSGELGIEKLG 220

Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
                    SV S   FGCGR   G F      +GL GLG  + S+  I          F
Sbjct: 221 FG-----GISV-SNFVFGCGRNNKGLF---GGASGLMGLGRSELSM--ISQTNATFGGVF 269

Query: 270 SMCFGSD----GTGRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAV 319
           S C  S      +G +  G++    +  TP +  +  P       Y + +T + VGG ++
Sbjct: 270 SYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSL 329

Query: 320 NFEFSA------IFDSGTSFTYLNDPAYTQISETF 348
           + + S+      I DSGT  + L    Y  +   F
Sbjct: 330 HVQASSFGNGGVILDSGTVISRLAPSVYKALKAKF 364


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 137/342 (40%), Gaps = 57/342 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G P  + ++A+DT +D  W+PC  C  C   L            ++P  S+T 
Sbjct: 78  YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTL------------FAPEKSTTF 125

Query: 164 SKVPCNSTLCELQKQCPSAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             V C +  C   KQ P+ G   S+C + + Y S    +   LV+D + LATD   S   
Sbjct: 126 KNVSCAAPEC---KQVPNPGCGVSSCNFNLTYGSSSIAAN--LVQDTITLATDPVPS--- 177

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----D 276
               +FGC    TG+    A P GL GLG    S+ S    Q L  ++FS C  S    +
Sbjct: 178 ---YTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLN 229

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA--------- 325
            +G +  G    P + +    L+    +  Y + +  + VG   V+   +A         
Sbjct: 230 FSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGA 289

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             IFDSGT FT L  P Y  + + F      K  T TS   F+ CY           P +
Sbjct: 290 GTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKL-TVTSLGGFDTCY-----NVPIVVPTI 343

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
                G       D I+I S+      L   G    DNVN +
Sbjct: 344 TFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGA--PDNVNSV 383


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 108/263 (41%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C     +C Y V Y +    S G +V D L +   
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ +     +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 108/263 (41%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C     +C Y V Y +    S G +V D L +   
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ +     +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357


>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
 gi|219887685|gb|ACL54217.1| unknown [Zea mays]
          Length = 292

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 47/177 (26%), Positives = 81/177 (45%), Gaps = 11/177 (6%)

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSIL 259
           G  V D +    ++ + ++ D  I FGCG  Q G  L+     +G+ GL     S+P+ L
Sbjct: 2   GVYVRDSMQFVGEDGERENAD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQL 59

Query: 260 ANQGLIPNSFSMCFGSDGTGR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSV 314
           A++G+I N+F  C  +D +G    +  GD   P  G T   +R           + Q++ 
Sbjct: 60  ASRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINH 119

Query: 315 GGNAVNFE---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
           G   +N +      +FD+G+++TY  D A T++  +    A  +     SD    +C
Sbjct: 120 GDQQLNAQGKLTQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFC 176


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 109/285 (38%), Gaps = 56/285 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P     + +D+GSD+ W+ C  C  C    +          ++ P  SS+ 
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180

Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           S V C S +C                C Y V Y  DG+ + G L  + L L     Q   
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
               ++ GCG   +G F+  A   GL GLG    S+   L   G     FS C  S G G
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288

Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFD 328
                     G G    S       Y + +T + VGG  +  + S            + D
Sbjct: 289 ----------GAGSLASSF------YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMD 332

Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
           +GT+ T L   AY  +   F+ ++    R  + S L  + CY LS
Sbjct: 333 TGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 375


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 128/307 (41%), Gaps = 47/307 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
           +Y  V +G P     +  DTGS L W  C+ C  SC    +          I+ P+ SS+
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDP---------IFDPSKSSS 190

Query: 163 SSKVPCNSTLCELQKQCPSAG------SNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEK 215
            + + C S+LC    Q  SAG      ++C Y V+Y  D ++S GFL ++ L + ATD  
Sbjct: 191 YTNIKCTSSLC---TQFRSAGCSSSTDASCIYDVKY-GDNSISRGFLSQERLTITATD-- 244

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
               +     FGCG+   G F   A   GL  +G+ +  +  +     +    FS C  S
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRGTA---GL--MGLSRHPISFVQQTSSIYNKIFSYCLPS 295

Query: 276 --DGTGRISFGDKGSPGQG--ETPFS-LRQTHPTYNITITQVSVGGNAV----NFEFSA- 325
                G ++FG   +       TPFS +   +  Y + I  +SVGG  +    +  FSA 
Sbjct: 296 TPSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAG 355

Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             I DSGT  T L   AY  +   F      K   +      + CY  S  +     P +
Sbjct: 356 GSIIDSGTVITRLPPTAYAALRSAFRQFMM-KYPVAYGTRLLDTCYDFSGYK-EISVPRI 413

Query: 384 NLTMKGG 390
           +    GG
Sbjct: 414 DFEFAGG 420


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 109/434 (25%), Positives = 173/434 (39%), Gaps = 74/434 (17%)

Query: 36  HRYSDPVKGILAVDDLPKKGSFAYYSALAH---RDRYFRLRGRGLAAQGNDKTPLTFSAG 92
           H  S P K +       K  S A  +AL     R  Y R R +  A Q  D  P      
Sbjct: 38  HSPSSPYKNV-------KAESLAKDTALESTLSRHAYLRARQQK-ALQPADFVPPPLIRD 89

Query: 93  NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
              +           N+S+G P  +  V LDTGSDLFW+ C+ C  C    +        
Sbjct: 90  KSAF---------LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDP------- 133

Query: 152 FNIYSPNTSSTSSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
             IY+   S + +++ CN   C     + QC  +GS C YQ  Y +DG  ++G L  + +
Sbjct: 134 --IYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGS-CLYQTAY-ADGARTSGLLSYEKV 189

Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
              T     +   +++ FGCG +Q  +F+      G+ GLG    S+ S L+  G +  S
Sbjct: 190 AF-TSHYSDEDKTAQVGFGCG-LQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKS 247

Query: 269 FSMCFGS----DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------N 317
           F+ CFG+    +  G + FGD        TP  + +    Y + +  + +G        N
Sbjct: 248 FAYCFGNISNPNAGGFLVFGDATYLNGDMTPMVIAE---FYYVNLLGIGLGVGEPRLDIN 304

Query: 318 AVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE----TSTSDLPFEY 367
           + +FE         I DSG++ +      Y  +        K+       TS+ D     
Sbjct: 305 SSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD----- 359

Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG- 426
           C+     +    +P + L ++  G   +ND   I         L+CLG    + ++IIG 
Sbjct: 360 CFEGKIERDLPLFPTLVLYLESTG--ILNDRWSIFLQRYDE--LFCLGFTSGEGLSIIGT 415

Query: 427 ---REYPIANNISL 437
              + Y    N+ L
Sbjct: 416 LAQQSYKFGYNLEL 429


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 103/364 (28%), Positives = 151/364 (41%), Gaps = 55/364 (15%)

Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYS 156
           SLG  +Y   + +G P   F V  DTGSD  W+ C    VSC    +          ++ 
Sbjct: 157 SLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKD---------RLFD 207

Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           P  SST + V C    C           +C Y ++Y  DG+ + GF  +D L +A D  +
Sbjct: 208 PAKSSTYANVSCADPACADLDASGCNAGHCLYGIQY-GDGSYTVGFFAKDTLAVAQDAIK 266

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
                    FGCG    G F   A   GL GLG   TS+ ++ A +     SFS C    
Sbjct: 267 G------FKFGCGEKNRGLFGQTA---GLLGLGRGPTSI-TVQAYE-KYGGSFSYCLPAS 315

Query: 275 SDGTGRISF---GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIF--- 327
           S  TG + F       S    +T   L    PT Y + +T + VGG  +     ++F   
Sbjct: 316 SAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNS 375

Query: 328 ----DSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEY 380
               DSGT  T L D AY  +S  F +       K+  + S L  + CY  +   +    
Sbjct: 376 GTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSIL--DTCYDFT-GLSQVSL 432

Query: 381 PVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG----REYPIA 432
           P V+L  +GG    ++   IV   S+ +     CLG   +   ++V I+G    R Y + 
Sbjct: 433 PTVSLVFQGGACLDLDASGIVYAISQSQ----VCLGFASNGDDESVGIVGNTQQRTYGVL 488

Query: 433 NNIS 436
            ++S
Sbjct: 489 YDVS 492


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 108/263 (41%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 114 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 168

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C     +C Y V Y +    S G +V D L +   
Sbjct: 169 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 228

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ +     +FS
Sbjct: 229 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 276

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      
Sbjct: 277 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 336

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTI 359


>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
          Length = 312

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/251 (26%), Positives = 108/251 (43%), Gaps = 39/251 (15%)

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
           +Q+ +  + I FGC   Q+G       A +G+FG G  + SV S L + G+ P  FS C 
Sbjct: 10  EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 69

Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
            GSD G G +  G+   PG   TP  L  + P YN+ +  ++V G  +  + S       
Sbjct: 70  KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 127

Query: 325 --AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
              I DSGT+  YL D AY          +S +  SL  +  +          C++ S +
Sbjct: 128 QGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS-S 176

Query: 375 QTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIAN 433
             +  +P V L   GG    V  +  ++  +      L+C+G  ++      G+E  I  
Sbjct: 177 SVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITILG 231

Query: 434 NISLFHNCYSY 444
           ++ L    + Y
Sbjct: 232 DLVLKDKIFVY 242


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 123/308 (39%), Gaps = 49/308 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++ +V +G P   F + LDTGSDL W+   CV C      +         Y P  S +  
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWI--QCVPCFDCFEQNGP------YYDPKDSISFR 247

Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + CN   C+L       + C     +CPY   Y      +  F +E      T     K
Sbjct: 248 NITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGK 307

Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S   R+    FGCG    G F       GL GLG    S  S L  Q L  +SFS C   
Sbjct: 308 SEFRRVENVMFGCGHWNRGLF---HGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 362

Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAV---- 319
               +  + ++ FG+       P    T     + +P    Y + I  + VGG  +    
Sbjct: 363 RDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPE 422

Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY-CYVL 371
            N+  SA      I DSGT+ +Y +DPAY  I E F  L K K      D P  + CY +
Sbjct: 423 ENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAF--LRKVKGYKLVEDFPILHPCYNV 480

Query: 372 S-PNQTNF 378
           S  ++ NF
Sbjct: 481 SGTDELNF 488


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 123/308 (39%), Gaps = 49/308 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++ +V +G P   F + LDTGSDL W+   CV C      +         Y P  S +  
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWI--QCVPCFDCFEQNGP------YYDPKDSISFR 247

Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + CN   C+L       + C     +CPY   Y      +  F +E      T     K
Sbjct: 248 NITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGK 307

Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S   R+    FGCG    G F       GL GLG    S  S L  Q L  +SFS C   
Sbjct: 308 SEFRRVENVMFGCGHWNRGLF---HGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 362

Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAV---- 319
               +  + ++ FG+       P    T     + +P    Y + I  + VGG  +    
Sbjct: 363 RDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPE 422

Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY-CYVL 371
            N+  SA      I DSGT+ +Y +DPAY  I E F  L K K      D P  + CY +
Sbjct: 423 ENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAF--LRKVKGYKLVEDFPILHPCYNV 480

Query: 372 S-PNQTNF 378
           S  ++ NF
Sbjct: 481 SGTDELNF 488


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 75/276 (27%), Positives = 111/276 (40%), Gaps = 48/276 (17%)

Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
           G L Y  ++++G P       LDTGSDL W  C  C SC+   +          +++P  
Sbjct: 92  GDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDP---------LFAPGQ 142

Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           S++   + C  TLC   L   C      C Y+  Y  DGTM+ G    +    A+     
Sbjct: 143 SASYEPMRCAGTLCSDILHHSCERP-DTCTYRYNY-GDGTMTVGVYATERFTFASSGGGG 200

Query: 218 KSVDS-RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
            +  +  + FGCG V  GS  +G   +G+ G G +  S+ S L+ +      FS C  S 
Sbjct: 201 LTTTTVPLGFGCGSVNVGSLNNG---SGIVGFGRNPLSLVSQLSIR-----RFSYCLTSY 252

Query: 277 GTGRIS-----------FGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
            + R S           +GD     Q  TP      +PT Y +  T ++VG   +    S
Sbjct: 253 ASRRQSTLLFGSLSDGVYGDATGRVQ-TTPLLQSPQNPTFYYVHFTGLTVGARRLRIPES 311

Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFN 349
           A           I DSGT+ T L      ++   F 
Sbjct: 312 AFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFR 347


>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 362

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 84/190 (44%), Gaps = 32/190 (16%)

Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHG---LNSSSGQVI--------DFNI 154
           T + +G P   F + +D+GS + ++PC DC  C      L+S   Q++         F I
Sbjct: 94  TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQVMLSSPKDQILCLVSCKVQIFKI 153

Query: 155 ----------YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
                     + P  SST   V CN     +   C      C Y+  Y ++ + S G L 
Sbjct: 154 SYGLFDEDPKFQPELSSTYQPVKCN-----MDCNCDDDKEQCVYEREY-AEHSSSKGVLG 207

Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
           ED++       +S     R  FGC  V+TG      A +G+ GLG    S+   L ++GL
Sbjct: 208 EDLISFGN---ESHLTPQRAVFGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDKGL 263

Query: 265 IPNSFSMCFG 274
           I NSF +C+G
Sbjct: 264 ISNSFGLCYG 273


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 116/290 (40%), Gaps = 43/290 (14%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  +S+G P +  IV  DTGSDL W+ C  C  C    +          ++ P+ SS+ 
Sbjct: 94  YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSP---------LFDPSRSSSY 144

Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
             + C S  C      ++ C    + C Y   Y  D + + G L  +   + +   +   
Sbjct: 145 RHMLCGSRFCNALDVSEQACTMDTNICEYHYSY-GDKSYTNGNLATEKFTIGSTSSRPVH 203

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF---- 273
           + S I FGCG    G+F      + L    +        L +Q   +I   FS C     
Sbjct: 204 L-SPIVFGCGTGNGGTF------DELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLS 256

Query: 274 -GSDGTGRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------- 321
             S+ T +I FG       P    TP   +Q    Y +T+  +SVG   + +        
Sbjct: 257 EQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGN 316

Query: 322 --EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
             + + I DSGT+ T+L+   +T++        K +R +    L F  C+
Sbjct: 317 VEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGL-FSVCF 365


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 144/360 (40%), Gaps = 68/360 (18%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN---IYSPNTSS 161
           H   V + QP    +   DTGSDL W  C        L+SS+          +Y P  SS
Sbjct: 16  HSLTVGIVQPRKLIV---DTGSDLIWTQCK-------LSSSTAAAARHGSPPVYDPGESS 65

Query: 162 TSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           T + +PC+  LC+      K C S  + C Y+  Y S    + G L  +           
Sbjct: 66  TFAFLPCSDRLCQEGQFSFKNCTSK-NRCVYEDVYGS--AAAVGVLASETFTFGA----R 118

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
           ++V  R+ FGCG +  GS +      G+ GL  +  S+ + L  Q      FS C   F 
Sbjct: 119 RAVSLRLGFGCGALSAGSLIGA---TGILGLSPESLSLITQLKIQ-----RFSYCLTPFA 170

Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT----HPT----YNITITQVSVGGNAVNFEFSA- 325
              T  + FG      + +T   ++ T    +P     Y + +  +S+G   +    ++ 
Sbjct: 171 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASL 230

Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
                     I DSG++  YL + A+  + E    + +      T +  +E C+VL P +
Sbjct: 231 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVE-DYELCFVL-PRR 288

Query: 376 T------NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN---VNIIG 426
           T        + P + L   GG    +  P      EP+   L CL V K+ +   V+IIG
Sbjct: 289 TAAAAMEAVQVPPLVLHFDGGAAMVL--PRDNYFQEPRA-GLMCLAVGKTTDGSGVSIIG 345


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 142/357 (39%), Gaps = 51/357 (14%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
           H  R  +  GR      +   P +  A  D+         +   + +G PA+   V +DT
Sbjct: 95  HITRKAKASGR-TTTLSDVSIPTSLGAAVDSLE-------YVVTLGIGTPAVQQTVLIDT 146

Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE------LQKQ 178
           GSDL W+   C  C    NSSS       +Y P  SST + VPC+S  C+          
Sbjct: 147 GSDLSWV--QCKPC----NSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHG 200

Query: 179 C--PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
           C   S  S C Y + Y +  T + G    + L L+    Q    D    FGCG VQ G+F
Sbjct: 201 CTNSSGTSLCQYGIEYGNRDT-TVGVYSTETLTLS---PQVSVKD--FGFGCGLVQQGTF 254

Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIP--NSFSMCF--GSDGTGRISFG----DKGS 288
                        +     P  L +Q       +FS C   G+  TG ++ G    +  +
Sbjct: 255 DLFDG-------LLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTTGFLALGAPTNNNDT 307

Query: 289 PGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYT 342
            G   TP  SL +    Y + +T VSVGG  ++   +      I DSGT  T L D AY+
Sbjct: 308 AGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMIIDSGTIITGLPDTAYS 367

Query: 343 QISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
            +   F  +++        +D   + CY  +    N   P V LT  GG    ++ P
Sbjct: 368 ALRTAFRTAMSAYPLLPPNNDDVLDTCYNFT-GIANVTVPTVALTFDGGATIDLDVP 423


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 89/330 (26%), Positives = 127/330 (38%), Gaps = 50/330 (15%)

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G P  + ++ALDT SD  W+PC  CV C     S+S        ++P  S++   V C S
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGC-----STSKP------FAPIKSTSFRNVSCGS 152

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
             C+        GS C +   Y S    ++  +V+D L LATD           +FGC  
Sbjct: 153 PHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLATDPIPG------YTFGCVN 204

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDK 286
             TGS    +AP               +  +Q L  ++FS C  S    + +G +  G  
Sbjct: 205 KTTGS----SAPQQGLLGLGRGPLS-LLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259

Query: 287 GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
             P + +    LR    +  Y + +  + VG   V+   +A           IFDSGT F
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           T L +P YT +   F      K   +T    F+ CY           P +     G    
Sbjct: 320 TRLAEPVYTAVRNEFRRRVGPKLPVTTLG-GFDTCY-----NVPIVVPTITFLFSGMNVT 373

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
              D IVI S+      L   G    DNVN
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGA--PDNVN 401


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/329 (26%), Positives = 132/329 (40%), Gaps = 71/329 (21%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD----CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           +S G P  +  + +DTGSDL W PC     C +C    ++ S      NI+ P +SS+S 
Sbjct: 94  LSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-----NIFIPKSSSSSK 148

Query: 165 KVPCNSTLC------ELQKQC-------PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
            + C +  C      ++Q +C       P+    CP  + +   G ++ G ++ + L L 
Sbjct: 149 VLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDLP 207

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                 K V + I  GC      S L  + P G+ G G    S+PS L   GL    FS 
Sbjct: 208 -----GKGVPNFI-VGC------SVLSTSQPAGISGFGRGPPSLPSQL---GL--KKFSY 250

Query: 272 CFGS----DGTGRISFGDKGSPGQGE-------TPF----SLRQTHP---TYNITITQVS 313
           C  S    D T   S    G    GE       TPF     +   H     Y + +  ++
Sbjct: 251 CLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHIT 310

Query: 314 VGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           VGG  V   +             I DSGT+FTY+    +  ++  F    + KR T    
Sbjct: 311 VGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEG 370

Query: 363 LP-FEYCYVLSPNQTNFEYPVVNLTMKGG 390
           +     C+ +S   T   +P + L  +GG
Sbjct: 371 ITGLRPCFNISGLNTP-SFPELTLKFRGG 398


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 146/357 (40%), Gaps = 57/357 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P   F++ +DTGSDL WL C  C +C       SG V D     P+ S++ 
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACF----DQSGPVFD-----PSQSTSF 221

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNC-PYQVRYL---SDGTMSTGFLVEDVLHLATDEKQS 217
             +PCN+  C+L    +C    S   P   +Y     D + ++G L  + L ++  +  S
Sbjct: 222 KIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPS 281

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
                 +  GCG    G F        L GLG    S PS L +   I  SFS C   D 
Sbjct: 282 SLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQLRSSP-IGQSFSYCL-VDR 336

Query: 278 TGRISFGDKGSPGQG-----------ETPFSLRQTHPTYN--------ITITQVSVGGNA 318
           T  +S     S G G            TPF +R  +            I I Q  +   A
Sbjct: 337 TNNLSVSSAISFGAGFALSRHFDQMRFTPF-VRTNNSVETFYYLGIQGIKIDQELLPIPA 395

Query: 319 VNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE---YCY 369
             F  +       I DSGT+ TYLN  AY  +   F +     R       PF+    CY
Sbjct: 396 ERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-----PFDILGICY 450

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             +  +T   +P +++  + G    +      +  +P+    +CL ++ +D ++IIG
Sbjct: 451 NAT-GRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAK-HCLAILPTDGMSIIG 505


>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
 gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
          Length = 475

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 94/205 (45%), Gaps = 21/205 (10%)

Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
           Y  R  ++ + S G++VED      D+        R+ FGC   +TG      A +G+ G
Sbjct: 8   YYSRTYAERSSSEGWMVEDAFGFPDDQPPV-----RMVFGCENGETGEIYRQLA-DGIMG 61

Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS--LRQTH-PT 304
           +G +  +  S L  +G+I + FS+CFG    G +  GD   P    T ++  L   H   
Sbjct: 62  MGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNLHLHY 121

Query: 305 YNITITQVSVGG-----NAVNFE--FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
           YN+ +  ++V G     NA  F   +  + DSGT+FTYL   A+  ++    S A     
Sbjct: 122 YNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYALSHGL 181

Query: 358 TSTSDLPFEY---CYVLSPNQTNFE 379
            ST     +Y   C+  +P+  NF+
Sbjct: 182 QSTPGADPQYNDICWKGAPD--NFQ 204


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 86/313 (27%), Positives = 127/313 (40%), Gaps = 45/313 (14%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS--CVHGLNSSSGQVIDFNIY 155
           L++L F+    V  G PA +  + LDTGSDL W+ C   S  C    +       DF+  
Sbjct: 132 LDTLEFV--VVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDP------DFD-- 181

Query: 156 SPNTSSTSSKVPCNSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
            P  SS+ + VPC + +C      C   G+ C Y V+Y  DG+ +TG L  D L   +  
Sbjct: 182 -PAKSSSYAAVPCGTPVCAAAGGMC--NGTTCLYGVQY-GDGSSTTGVLSRDTLTFNSSS 237

Query: 215 KQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
           K +       +FGCG    G F  +DG    G   L +   + PS           FS C
Sbjct: 238 KFTG-----FTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFG-------GVFSYC 285

Query: 273 FGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYN----ITITQVSVGG------NAVN 320
             S  T  G ++ G           ++     P Y     I +  +++GG       +V 
Sbjct: 286 LPSYNTTPGYLNIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF 345

Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
            +   + DSGT  TYL  PAYT + + F    +  +     + P + CY  +  Q     
Sbjct: 346 TKTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYE-PLDTCYDFT-GQGAIVI 403

Query: 381 PVVNLTMKGGGPF 393
           P V+     G  F
Sbjct: 404 PAVSFNFSDGAVF 416


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 107/269 (39%), Gaps = 42/269 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +++VG P     + LDTGSDL W  C  C  C               +  P  SST 
Sbjct: 86  YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQ---------GIPLLDPAASSTY 136

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ----SKS 219
           + +PC +  C         G +C Y   Y  D +++ G +  D      + ++    S  
Sbjct: 137 AALPCGAPRCRALPFTSCGGRSCVYVYHY-GDKSVTVGKIATDRFTFGDNGRRNGDGSLP 195

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---D 276
              R++FGCG    G F       G+ G G  + S+PS L        SFS CF S    
Sbjct: 196 ATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFDS 248

Query: 277 GTGRISFGDKGSPG-------QGE---TPFSLRQTHPT-YNITITQVSVGGNAVNFE--- 322
            +  ++ G  G+P         GE   TP     + P+ Y +++  +SVG   +      
Sbjct: 249 KSSIVTLG--GAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETK 306

Query: 323 -FSAIFDSGTSFTYLNDPAYTQISETFNS 350
             S I DSG S T L +  Y  +   F +
Sbjct: 307 FRSTIIDSGASITTLPEEVYEAVKAEFAA 335


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 122/308 (39%), Gaps = 53/308 (17%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   ++VG P     + LDTGSDL W  C  C  C            D  +  P  SST 
Sbjct: 84  YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQ---------DLPVLDPAASSTY 134

Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
           + +PC +  C          +      +C Y   Y  D +++ G +  D           
Sbjct: 135 AALPCGAARCRALPFTSCGVRTLGNHRSCIYAYHY-GDKSLTVGEIATDRFTFGDSGGSG 193

Query: 218 KSVDS-RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
           +S+ + R++FGCG +  G F       G+ G G  + S+PS L        SFS CF S 
Sbjct: 194 ESLHTRRLTFGCGHLNKGVFQSNE--TGIAGFGRGRWSLPSQLNV-----TSFSYCFTSM 246

Query: 276 --DGTGRISFGDKGSPG-------QGE---TPFSLRQTHPT-YNITITQVSVGGNAV--- 319
               +  ++ G  GSP         GE   TP     + P+ Y +++  +SVG   +   
Sbjct: 247 FESKSSLVTLG--GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVP 304

Query: 320 NFEF-SAIFDSGTSFTYLNDPAYTQISETFNS---LAKEKRETSTSDLPFEYCYVLSPNQ 375
             +F S I DSG S T L +  Y  +   F +   L     E S  DL    C+ L P  
Sbjct: 305 ETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDL----CFAL-PVT 359

Query: 376 TNFEYPVV 383
             +  P V
Sbjct: 360 ALWRRPAV 367


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 119/318 (37%), Gaps = 54/318 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +++VG P     + LDTGSDL W  C  C  C H             +  P  SST 
Sbjct: 92  YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQ---------GLPLLDPAASSTY 142

Query: 164 SKVPCNSTLCELQ--KQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           + +PC +  C       C   G         +C Y   Y  D +++ G +  D      D
Sbjct: 143 AALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHY-GDKSVTVGEIATDRFTFGGD 201

Query: 214 --EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
             +  S+    R++FGCG    G F       G+ G G  + S+PS L        +FS 
Sbjct: 202 NGDGDSRLPTRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNV-----TTFSY 254

Query: 272 CFGS---DGTGRISFGDKGSPG-----------QGE---TPFSLRQTHPT-YNITITQVS 313
           CF S     +  ++ G  G+P             GE   TP     + P+ Y +++  +S
Sbjct: 255 CFTSMFESKSSLVTLG--GAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGIS 312

Query: 314 VGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
           VG   +        S I DSG S T L +  Y  +   F +               + C+
Sbjct: 313 VGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCF 372

Query: 370 VLSPNQTNFEYPVVNLTM 387
            L         PV +LT+
Sbjct: 373 ALPVTALWRRPPVPSLTL 390


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 149/381 (39%), Gaps = 76/381 (19%)

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD----CVSCV 139
           KTP + S        +S G  + T +S G P  +  +  DTGS L W PC     C  C 
Sbjct: 61  KTPKSNSVFKSPLSPHSYG-AYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS 119

Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG-------SNC 186
                 +G       + P  SS+S  V C +  C      +++ QC S           C
Sbjct: 120 FPKIDPTG----IPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTC 175

Query: 187 P-YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           P Y V+Y S  T   G L+ + L    D+K    V      GC      SFL    P+G+
Sbjct: 176 PAYVVQYGSGST--AGLLLSETLDFP-DKKIPNFV-----VGC------SFLSIHQPSGI 221

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--------------DGTGRISFGDKGSPGQ 291
            G G    S+PS +   GL    F+ C  S              D TG  S G   +P +
Sbjct: 222 AGFGRGSESLPSQM---GL--KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFR 276

Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPA 340
                S       Y + I ++ VG  AV   +            +I DSG++FT+++ P 
Sbjct: 277 QNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPV 336

Query: 341 YTQISETF-NSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTMKGGGPFF--VN 396
              ++  F   LA   R T    L     C+ +S  + + ++P +    KGG  +   +N
Sbjct: 337 LEVVAREFEKQLANWTRATDVETLTGLRPCFDIS-KEKSVKFPELIFQFKGGAKWALPLN 395

Query: 397 DPIVIVSSEPKGLYLYCLGVV 417
           +   +VSS      + CL VV
Sbjct: 396 NYFALVSSS----GVACLTVV 412


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 143/357 (40%), Gaps = 54/357 (15%)

Query: 44  GILAVDDLPKKGSFAYYSALAHRDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLG 102
           G+  +   P   +  +      RD +   R  R LA+ G D+T         T +    G
Sbjct: 32  GLTRIHSNPDVSATEFVRDALRRDMHRHARFTRELASSG-DRT-----VAAPTRKDLPNG 85

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
             +   +++G P LS+    DTGSDL W    C  C       +GQ      Y+P++S+T
Sbjct: 86  GEYIMTLAIGTPPLSYPAIADTGSDLIW--TQCAPCGSQCFKQAGQP-----YNPSSSTT 138

Query: 163 SSKVPCNSTL---CELQKQCPSAGSNCPYQVRYLSDGTMSTGFL--VEDVLHLATDEKQS 217
              +PCNS++     L    P  G +C Y   Y   GT  T  +  VE     +T   Q+
Sbjct: 139 FGVLPCNSSVSMCAALAGPSPPPGCSCMYNQTY---GTGWTAGIQSVETFTFGSTPADQT 195

Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
           +     I+FGC    +  + +G+A  GL GLG    S+ S L         FS C     
Sbjct: 196 RV--PGIAFGCSNASSDDW-NGSA--GLVGLGRGSMSLVSQLGA-----GMFSYCLTPFQ 245

Query: 274 GSDGTGRISFGDKGS---PGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
            ++ T  +  G   +    G   TPF    S       Y + +T +S+G  A++   +A 
Sbjct: 246 DANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAF 305

Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
                     I DSGT+ T L D AY Q+     SL        +     + C+ L+
Sbjct: 306 ALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESLVTLPVADGSDSTGLDLCFALT 362


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 107/263 (40%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C     +C Y V Y +    S G +V D L +   
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ + L     S
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----S 274

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 107/263 (40%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 114 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 168

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C     +C Y V Y +    S G +V D L +   
Sbjct: 169 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 228

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ + L     S
Sbjct: 229 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----S 276

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      
Sbjct: 277 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 336

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTI 359


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 134/332 (40%), Gaps = 52/332 (15%)

Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
           +G P +  +   DTGSDL W+ C  C +C            D  ++ P  SST     C+
Sbjct: 98  IGTPPVERLAIADTGSDLIWVQCSPCQNCFPQ---------DTPLFEPLKSSTFKAATCD 148

Query: 170 STLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRI 224
           S  C      Q+QC   G  C Y   Y  D + + G +  + L   +T + Q+ S  S I
Sbjct: 149 SQPCTSVPPSQRQCGKVG-QCIYSYSY-GDKSFTVGVVGTETLSFGSTGDAQTVSFPSSI 206

Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
            FGCG     +F       GL GLG    S+ S L  Q  I   FS C   F S+ T ++
Sbjct: 207 -FGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQ--IGYKFSYCLLPFSSNSTSKL 263

Query: 282 SFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV---NFEFSAIFDSGTSFT 334
            FG +    + G   TP  ++   P+ Y + +  V++G   V     + + I DSGT  T
Sbjct: 264 KFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVLT 323

Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
           YL    Y        SL +     S  DLPF + +       +   PV+     G     
Sbjct: 324 YLEQTFYNNF---VASLQEVLSVESAQDLPFPFKFCFP--YRDMTIPVIAFQFTGAS--- 375

Query: 395 VNDPIVIVSSEPKGLY-------LYCLGVVKS 419
                  V+ +PK L        + CL VV S
Sbjct: 376 -------VALQPKNLLIKLQDRNMLCLAVVPS 400


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 129/313 (41%), Gaps = 38/313 (12%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           R+ S    +   +++G P +     +DTGSDL W  C  C  C    +          ++
Sbjct: 42  RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSP---------MF 92

Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
            P  S+T + +PC+S  C  L     S    C Y   Y +D +++ G L  + +  ++ +
Sbjct: 93  EPLRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAY-ADSSVTKGVLARETVTFSSTD 151

Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS--FSMC 272
            +   V   I FGCG   +G+F +        G+        S+++  G +  S  FS C
Sbjct: 152 GEPVVV-GDIVFGCGHSNSGTFNEND-----MGIIGLGGGPLSLVSQFGNLYGSKRFSQC 205

Query: 273 ---FGSD--GTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
              F +D    G ISFGD       G   TP    +    Y +T+  +SVG   V+F  S
Sbjct: 206 LVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS 265

Query: 325 AIF-------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
            +        DSGT  TYL    Y ++ +     +         DL  + CY    ++TN
Sbjct: 266 EMLSKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYR---SETN 322

Query: 378 FEYPVVNLTMKGG 390
            E P++    +G 
Sbjct: 323 LEGPILIAHFEGA 335


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 129/315 (40%), Gaps = 50/315 (15%)

Query: 98  LNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
           +   G LH+T  VS+G P     + LDTGSDL W  C            + Q  +  +Y 
Sbjct: 81  IRPFGRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLF--------DTRQHREKPLYD 132

Query: 157 PNTSSTSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
           P  SS+ +  PC+  LCE      K C  + + C Y   Y S  T   G L  +      
Sbjct: 133 PAKSSSFAAAPCDGRLCETGSFNTKNC--SRNKCIYTYNYGSATT--KGELASETFTFGE 188

Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
             + S S+D    FGCG++ +GS L GA+  G+ G+  D+ S    L +Q  IP  FS C
Sbjct: 189 HRRVSVSLD----FGCGKLTSGS-LPGAS--GILGISPDRLS----LVSQLQIPR-FSYC 236

Query: 273 ----FGSDGTGRISFGDKGSPGQGETPFSLRQTHPT---------YNITITQVSVGGNAV 319
                  + T  I FG      +  T   ++ T            Y + +  +SVG   +
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296

Query: 320 NFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
           N   S  AI   G+  T+++    T +  +    A ++       LP     V++     
Sbjct: 297 NVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLP-----VVNATDHG 351

Query: 378 FEYPV-VNLTMKGGG 391
           +EY +   L   GGG
Sbjct: 352 YEYELCFQLPRNGGG 366


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 142/361 (39%), Gaps = 70/361 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ + S+G P   F + +DTGSDL ++ C  C  C            D  +Y P+ SST 
Sbjct: 34  YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQ---------DGPLYQPSNSSTF 84

Query: 164 SKVPCNSTLCEL------------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
           + VPC+S  C L              + P  G+ C Y+ RY  D + + G    +   + 
Sbjct: 85  TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGA-CSYEYRY-GDNSSTVGVFAYETATVG 142

Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
                     + ++FGCG    GSF+      G+ GLG    S  S         N F+ 
Sbjct: 143 GIRV------NHVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFAY 191

Query: 272 CFGSDGT-----GRISFGDKGSPGQGETPFSLRQTHP----TYNITITQVSVGGNAVNFE 322
           C  S  +       + FGD       +  F+   ++P     Y + I ++  GG  +   
Sbjct: 192 CLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIP 251

Query: 323 FSA-----------IFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYV 370
            SA           IFDSGT+ TY +  AY +I   F  S+   +   S   LP      
Sbjct: 252 DSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLP------ 305

Query: 371 LSPNQTNFEYPV---VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK--SDNVNII 425
           L  N +  ++P+     +    G  +  N     +   P    + CL +++  SD  N+I
Sbjct: 306 LCVNVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPN---IDCLAMLESSSDGFNVI 362

Query: 426 G 426
           G
Sbjct: 363 G 363


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 100/355 (28%), Positives = 145/355 (40%), Gaps = 57/355 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++++G P     + LDTGSDL W  C  C +C         Q + +  + P+TSST 
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFD-------QALPY--FDPSTSSTL 85

Query: 164 SKVPCNSTLCELQKQCPSAGS-------NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           S   C+STLC+      S GS        C Y   Y  D +++TGFL  D          
Sbjct: 86  SLTSCDSTLCQ-GLPVASCGSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFV---GA 140

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
             SV   ++FGCG    G F       G+ G G    S+PS L        +FS CF + 
Sbjct: 141 GASVPG-VAFGCGLFNNGVFKSNE--TGIAGFGRGPLSLPSQLKV-----GNFSHCFTTI 192

Query: 277 GTGRISF-------GDKGSPGQGE---TP---FSLRQTHPT-YNITITQVSVGGNAVNFE 322
            TG I          D  S GQG    TP   ++  + +PT Y +++  ++VG   +   
Sbjct: 193 -TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVP 251

Query: 323 FSA----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
            SA          I DSGTS T L    Y  + + F   A+ K      +    Y    +
Sbjct: 252 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEF--AAQIKLPVVPGNATGHYTCFSA 309

Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
           P+Q   + P + L  +G       +  V    +  G  + CL + K D   IIG 
Sbjct: 310 PSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGN 364


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 84/304 (27%), Positives = 122/304 (40%), Gaps = 40/304 (13%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           ++ +G P ++ I   DTGSDL W    C+ C    N S        I++P  SS+  KV 
Sbjct: 93  SIFIGTPPVNVIAIADTGSDLTW--TQCLPCRECFNQSQ------PIFNPRRSSSYRKVS 144

Query: 168 CNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           C S  C   +   C     +C Y   Y  D + + G L  D + + +  K  K+V     
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSY-GDRSFTYGDLASDQITIGS-FKLPKTV----- 197

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGR 280
            GCG    G+F  G     +   G   + V  +    G+ P  FS C       ++ TG 
Sbjct: 198 IGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAGVKPR-FSYCLPTFFSNANITGT 255

Query: 281 ISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSVGG---------NAVNFEFSAIFD 328
           ISFG K      +   TP   R     Y +T+  +SVG          +A+    + I D
Sbjct: 256 ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIID 315

Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT-NFEYPVVNLTM 387
           SGT+ T L    Y  +  T   + K KR    S +  E CY  S  Q  +   P++    
Sbjct: 316 SGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGI-LELCY--SAGQVDDLNIPIITAHF 372

Query: 388 KGGG 391
            GG 
Sbjct: 373 AGGA 376


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 92/346 (26%), Positives = 138/346 (39%), Gaps = 46/346 (13%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
           ++ S  F +   V++G P  S +   DTGSDL W     V C  G N +S        + 
Sbjct: 93  KVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVW-----VKCKKGNNDTSSAAAPTTQFD 147

Query: 157 PNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           P+ SST  +V C +  CE L +     GSNC Y   Y  DG+ +TG L  +         
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAY-GDGSNTTGVLSTETFTFDDGGA 206

Query: 216 QSKSVDSRI---SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
                  RI    FGC     GSF      +GL GLG    S+ + L     +   FS C
Sbjct: 207 GRSPRQVRIGGVKFGCSTATAGSF----PADGLVGLGGGAVSLVTQLGGATSLGRRFSYC 262

Query: 273 F---GSDGTGRISFG---DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
                 + +  ++FG   D   PG   TP                  VG   V    S+ 
Sbjct: 263 LVPHSVNASSALNFGALADVTEPGAASTPL-----------------VGNKTVASAASSR 305

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPV 382
            I DSGT+ T+L DP+   +    + L++        + D   + CY ++  +      +
Sbjct: 306 IIVDSGTTLTFL-DPSL--LGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESI 362

Query: 383 VNLTMK-GGGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDNVNIIG 426
            +LT++ GGG      P    V+ +   L L  +   +   V+I+G
Sbjct: 363 PDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILG 408


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 143/357 (40%), Gaps = 57/357 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P   F++ +DTGSDL WL C  C +C       SG V D     P+ S++ 
Sbjct: 87  YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACF----DQSGPVFD-----PSQSTSF 137

Query: 164 SKVPCNSTLCEL--QKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             +PCN+  C+L    +C    S      C Y   Y  D + ++G L  + L ++  +  
Sbjct: 138 KIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWY-GDSSRTSGDLALESLSVSLSDHP 196

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           S      +  GCG    G         GL GLG    S PS L +   I  SFS C   D
Sbjct: 197 SSLEIRDMVIGCGHSNKGL---FQGAGGLLGLGQGALSFPSQLRSSP-IGQSFSYCL-VD 251

Query: 277 GTGRISFGDKGSPGQG-----------ETPF--SLRQTHPTYNITITQVSVGGN------ 317
            T  +S     S G G            TPF  +       Y + I  + +         
Sbjct: 252 RTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPA 311

Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE---YCY 369
                A N     I DSGT+ TYLN  AY  +   F +     R       PF+    CY
Sbjct: 312 ERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-----PFDILGICY 366

Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
             +  +    +P +++  + G    +      +  +P+    +CL ++ +D ++IIG
Sbjct: 367 NAT-GRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAK-HCLAILPTDGMSIIG 421


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 104/397 (26%), Positives = 163/397 (41%), Gaps = 37/397 (9%)

Query: 41  PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNS 100
           P   +L   D  +  S A   A     R  +LR RG ++  + ++  +   G  T    S
Sbjct: 62  PFSAVL-THDHARIASLAARLAKTPSSRPTKLR-RGSSSSPDAESLASVPLGPGT----S 115

Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           +G  +Y T + +G PA S+++ +DTGS L WL   C  C+   +  SG V +    S   
Sbjct: 116 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWL--QCSPCLVSCHRQSGPVFNPRSSSSYA 173

Query: 160 SSTSSKVPCNS-TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           S + S   C++ T   L     S  + C YQ  Y  D + S G+L +D +        S 
Sbjct: 174 SVSCSAPQCDALTTATLNPSTCSTSNVCIYQASY-GDSSFSVGYLSKDTVSFG-----ST 227

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
           SV +   +GCG+   G F   A   GL GL  +K S+   LA    +  SFS C  +  +
Sbjct: 228 SVPN-FYYGCGQDNEGLFGQSA---GLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSS 281

Query: 279 GRISFGDKG-SPGQ-GETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFDS 329
                     +PGQ   TP +      + Y I +T ++V G  ++   SA      I DS
Sbjct: 282 SSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDS 341

Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
           GT  T L    Y+ +S+      K     S   +  + C+      +    P V++   G
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMKGTPRASAFSI-LDTCF--QGQASRLRVPQVSMAFAG 398

Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           G    +    ++V  +       CL    + +  IIG
Sbjct: 399 GAALKLKATNLLVDVDSA---TTCLAFAPARSAAIIG 432


>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 242

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 105/236 (44%), Gaps = 22/236 (9%)

Query: 199 STGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI 258
           S+G L ED++      ++S+    R  FGC   +TG      A +G+ GLG  + S+   
Sbjct: 4   SSGVLGEDIVSFG---RESELKAQRAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQ 59

Query: 259 LANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSV 314
           L  +G+I +SFS+C+G    G  +    G P   +  FS    LR   P YNI + ++ V
Sbjct: 60  LVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRS--PYYNIELKEIHV 117

Query: 315 GGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF-E 366
            G A+  +          + DSGT++ YL + A+    +   S     ++    D  + +
Sbjct: 118 AGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKD 177

Query: 367 YCYV---LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
            C+     + ++ +  +P V++   G G      P   +    K    YCLGV ++
Sbjct: 178 ICFAGARRNVSKLHEVFPDVDMVF-GNGQKLSLTPENYLFRHSKVDGAYCLGVFQN 232


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 146/383 (38%), Gaps = 60/383 (15%)

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
           R R  R +   L A  N +       GN  + +          +++G P  ++   +DTG
Sbjct: 67  RHRLQRFKAMALVASSNSEIDAPVLPGNGEFLMK---------LAIGTPPETYSAIMDTG 117

Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
           SDL W  C  C  C               I+ P  SS+ SK+ C+S LCE   Q  +   
Sbjct: 118 SDLIWTQCKPCTQCFDQPTP---------IFDPKKSSSFSKLSCSSKLCEALPQS-TCSD 167

Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPN 243
            C Y   Y  D + + G L  + L         K     ++FGCG    GS F  G+   
Sbjct: 168 GCEYLYGY-GDYSSTQGMLASETLTFG------KVSVPEVAFGCGEDNEGSGFSQGS--- 217

Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE--------TP 295
           GL GLG    S+ S L         FS C  S    + S    GS    +        TP
Sbjct: 218 GLVGLGRGPLSLVSQLKEP-----KFSYCLTSVDDTKASTLLMGSLASVKASDSEIKTTP 272

Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
                  P+ Y +++  +SVG  ++  + S            I DSGT+ TYL   A+  
Sbjct: 273 LIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDL 332

Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
           +++ F S      + S S    E C+ L    T+ E P +     G       +  +I  
Sbjct: 333 VAKEFTSQINLPVDNSGST-GLEVCFTLPSGSTDIEVPKLVFHFDGADLELPAENYMIAD 391

Query: 404 SEPKGLYLYCLGVVKSDNVNIIG 426
           +    + + CL +  S  ++I G
Sbjct: 392 AS---MGVACLAMGSSSGMSIFG 411


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 70/274 (25%), Positives = 108/274 (39%), Gaps = 40/274 (14%)

Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
           + +DT SD+ W+ C      H    +        +Y P+ SS+S+  PC+S  C      
Sbjct: 158 MVIDTASDVPWVQCAPCPAPHCHAQTD------VLYDPSKSSSSAAFPCSSPACRNLGPY 211

Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR--VQT 233
              C  AG  C Y+V+Y  DG+ S G  + DVL L  +  +  S  S   FGC    +Q 
Sbjct: 212 ANGCTPAGDQCQYRVQY-PDGSASAGTYISDVLTL--NPAKPASAISEFRFGCSHALLQP 268

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD---------GTGRISFG 284
           GSF +    +G+  LG    S+P+    +    + FS C             G  R++  
Sbjct: 269 GSFSNKT--SGIMALGRGAQSLPT--QTKATYGDVFSYCLPPTPVHSGFFILGVPRVAAS 324

Query: 285 DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
                    TP    +  P  Y + +  + V G  +      F   A+ DS T  T L  
Sbjct: 325 RYAV-----TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPP 379

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
            AY  +   F +  +  R  +  +   + CY  S
Sbjct: 380 TAYMALRAAFVAEMRAYRAAAPKEH-LDTCYDFS 412


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 128/330 (38%), Gaps = 49/330 (14%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-----------DCVSCVHGLN-SSSGQVID 151
           ++  +V  G PAL + + LDT +DL W+ C             +S   G + +++ +   
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185

Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
            N Y P  SS+  ++ C+   C L      Q PS   +C Y  + + DGT++ G   ++ 
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSY-YQQMQDGTLTMGIYGKEK 244

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
             +   + +   +   I  GC  ++ G  +D  A +G+  LG  + S     A +     
Sbjct: 245 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQ 299

Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
            FS C  S     D +  ++FG   +   PG  ET         P Y   +T + VGG  
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGER 359

Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
           ++                I D+ TS T L   AY  ++   +            D  FEY
Sbjct: 360 LDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELD-GFEY 418

Query: 368 CYVLS------PNQTNFEYPVVNLTMKGGG 391
           CY  +          N   P + + M GG 
Sbjct: 419 CYRWTFAGDGVDLAHNVTVPRLTVEMAGGA 448


>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
          Length = 642

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 81/308 (26%), Positives = 132/308 (42%), Gaps = 52/308 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           HY  + +G PA    V +DTGS L  LPC  C  C        GQ  D  ++  + S+T+
Sbjct: 95  HYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGC--------GQHTD-PLFDVSKSTTA 145

Query: 164 SKVPCNS----TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDE 214
             + C+       CE Q +C        Y  +   +G+M    +V++++ +       DE
Sbjct: 146 KYLACHDFDSCRSCE-QDRC--------YISQSYMEGSMWEAVMVDELVWVGGFSSPADE 196

Query: 215 KQS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSM 271
            +   K+   R   GC   +TG F+     NG+ GLG  +++V S + N G +  N F++
Sbjct: 197 MEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQNLFTL 255

Query: 272 CFGSDGTGRISFG----DKGSPGQGETPFSLRQT--HPTY--NITITQVSVGGN--AVNF 321
           CF  DG G + FG       +   G TP    ++  +P +  +I +  VS+G +   +N 
Sbjct: 256 CFAGDG-GELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGIDTGTINS 314

Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
               I DSGT+ T+ +          F+  A      S   L  E    L         P
Sbjct: 315 GRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKLTSEELAAL---------P 365

Query: 382 VVNLTMKG 389
           V+++ + G
Sbjct: 366 VISIILSG 373


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 122/304 (40%), Gaps = 42/304 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG P  S  V +D+GSD+ W+ C  C  C    +          ++ P  S+T 
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP---------VFDPAGSATY 187

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           + + C+S++C+           C Y+V Y  DG+ + G L  + L         + +   
Sbjct: 188 AGISCDSSVCDRLDNAGCNDGRCRYEVSY-GDGSYTRGTLALETLTFG------RVLIRN 240

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           I+ GCG +  G F+  A   GL G  M   S    L  Q     +FS C    G++ TG 
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAM---SFVGQLGGQ--TGGAFSYCLVSRGTESTGT 295

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTY------NITITQVSVGGNAVNFEFS------AIF 327
           + FG    P G    P       P++       + +  + V      FE +       + 
Sbjct: 296 LEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVM 355

Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           D+GT+ T L  PAY    +TF    A   R    S   F+ CY L+    +   P V+  
Sbjct: 356 DTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSI--FDTCYNLN-GFVSVRVPTVSFY 412

Query: 387 MKGG 390
             GG
Sbjct: 413 FSGG 416


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 125/327 (38%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C EL       Q  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ +     +FS C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312


>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
          Length = 431

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 121/323 (37%), Gaps = 68/323 (21%)

Query: 70  FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
           F  + R LAA + +D +  L   AG D     T R  ++G L+Y  + +G PA  + V +
Sbjct: 57  FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQM 115

Query: 123 DTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS- 181
           +                              +Y    S T   V C+   C      P  
Sbjct: 116 E----------------------------LTLYDIKESLTGKLVSCDQDFCYAINGGPPS 147

Query: 182 ---AGSNCPYQVRYLSDGTMSTGFLVE---------DVLHLATDEKQSKSVDSRISFGCG 229
              A  +C Y   Y +DG+ S G+ V+          + HL  +          +   C 
Sbjct: 148 YCIANMSCSYTEIY-ADGSSSFGYFVKGYCTASKYNSIPHLNNNPLL------EVPLRCS 200

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGS 288
             Q+G      A +G+ G G   TS+ S LA+ G +   F+ C  G +G G  + G    
Sbjct: 201 ATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQ 260

Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDP 339
           P    TP    QTH  YN+ +  V VGG  +N          +   I DSGT+  YL + 
Sbjct: 261 PKVNTTPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEV 318

Query: 340 AYTQISETFNSLAKEKRETSTSD 362
            Y Q+     S   + +  +  D
Sbjct: 319 VYDQLLSKIFSWQSDLKVHTIHD 341


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 137/334 (41%), Gaps = 58/334 (17%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++   + VG P       +DTGSDL W  C  C  C    +          I+ P+ SST
Sbjct: 81  IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDP---------IFDPSKSST 131

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            ++  C+             G +C Y++ Y  D T S G L  + + + +   +   V +
Sbjct: 132 FNEQRCH-------------GKSCHYEIIY-EDNTYSKGILATETVTIHSTSGE-PFVMA 176

Query: 223 RISFGCGRVQTGSFLD----GAAPNGLFGLGMDKTSVPSI--LANQGLIPNSFSMCFGSD 276
             + GCG   T   LD     ++ +G+ GL M   S+ S   L   GLI    S CF   
Sbjct: 177 ETTIGCGLHNTD--LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLI----SYCFSGQ 230

Query: 277 GTGRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSA 325
           GT +I+FG        G       +++ +P Y + +  VSV  N +          + + 
Sbjct: 231 GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNI 290

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVV 383
           + DSG++ TY        + +    +    R  + S +D+    CY    ++T   +PV+
Sbjct: 291 VIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDM---LCYF---SETIDIFPVI 344

Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
            +   GG    ++   + + S   G  L+CL ++
Sbjct: 345 TMHFSGGADLVLDKYNMYMESNSGG--LFCLAII 376



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/249 (27%), Positives = 104/249 (41%), Gaps = 44/249 (17%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++   + VG P    +  +DTGSD+ W  C  C +C               I+ P+ SST
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAP---------IFDPSKSST 470

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             +  CN             G++C Y++ Y +D T S G L  + + + +   +   V +
Sbjct: 471 FREQRCN-------------GNSCHYEIIY-ADKTYSKGILATETVTIPSTSGE-PFVMA 515

Query: 223 RISFGCGRVQTGSFLDGAA--PNGLFGLGMDKTSVPSI--LANQGLIPNSFSMCFGSDGT 278
               GCG   T     G A   +G+ GL M   S+ S   L   GLI    S CF   GT
Sbjct: 516 ETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI----SYCFSGQGT 571

Query: 279 GRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIF- 327
            +I+FG        G       +++ +P Y + +  VSV  N +       + E   IF 
Sbjct: 572 SKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFI 631

Query: 328 DSGTSFTYL 336
           DSGT+ TY 
Sbjct: 632 DSGTTLTYF 640


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 80/283 (28%), Positives = 111/283 (39%), Gaps = 46/283 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++ +V VG P   F + LDTGSDL W+ C  C  C     +          Y P  S++ 
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGA---------FYDPKASASY 205

Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
             + CN   C L       K C S   +CPY   Y      +  F VE   ++L T    
Sbjct: 206 KNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGS 265

Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S+  +   + FGCG    G F   A    L GLG    S  S L  Q L  +SFS C   
Sbjct: 266 SELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 320

Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
               ++ + ++ FG+       P    T F  R+ +     Y + I  + V G  +N   
Sbjct: 321 RNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPE 380

Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
                        I DSGT+ +Y  +PAY  I       AK K
Sbjct: 381 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 423


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 142/342 (41%), Gaps = 52/342 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCV-SCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG PA + ++ LDTGSD+ W P   +   +  +   S           +T +  
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGS-----------STGAAP 170

Query: 164 SKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
           +  P  + +  + ++  SAG     ++C YQV Y  DG+++ G    + L  A   +   
Sbjct: 171 APTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-- 227

Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
               R++ GCG    G F+   A +GL GLG  + S PS +A       SFS C     +
Sbjct: 228 ---QRVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTS 279

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------------NAVNFEFSA 325
              S   + S   G TP    +    Y + +   SVGG             N        
Sbjct: 280 ---SRRARPSRRWGGTP----RMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 332

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           I DSGTS T L  P Y  + + F + A   R +      F+ CY LS  +   + P V++
Sbjct: 333 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV-VKVPTVSM 391

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
            + GG    +     ++  +  G   +C  +  +D  V+IIG
Sbjct: 392 HLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 431


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 152/355 (42%), Gaps = 55/355 (15%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
           RL SL ++    V +G   ++ IV  DTGSDL W+ C  C  C +  +          ++
Sbjct: 60  RLQSLNYI--VTVELGGRKMTVIV--DTGSDLSWVQCQPCNRCYNQQDP---------VF 106

Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
           +P+ S +   V CNS  C  LQ        C S    C Y V Y  DG+ ++G +  + L
Sbjct: 107 NPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNY-GDGSYTSGEVGMEHL 165

Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
           +L      + +V++ I FGCGR   G F      +GL GLG    S+ S ++   +    
Sbjct: 166 NLG-----NTTVNNFI-FGCGRKNQGLF---GGASGLVGLGRTDLSLISQISP--MFGGV 214

Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFS-LRQTH----PTYNITITQVSVGGNAVN 320
           FS C     ++ +G +  G   S  +  TP S  R  H    P Y + +T ++VGG  V 
Sbjct: 215 FSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQ 274

Query: 321 F----EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYVLSPN 374
                +   I DSGT  + L    Y  +   F    K+     ++ S +  + C+ LS  
Sbjct: 275 APSFGKDRMIIDSGTVISRLPPSIYQALKAEF---VKQFSGYPSAPSFMILDSCFNLSGY 331

Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK---SDNVNIIG 426
           Q   + P + +  +G     V+   V  S +     + CL +      D V IIG
Sbjct: 332 Q-EVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQV-CLAIASLPYEDEVGIIG 384


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/336 (23%), Positives = 130/336 (38%), Gaps = 63/336 (18%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-----------DCVSCVHGLN-SSSGQVID 151
           ++  +V  G PAL + + LDT +DL W+ C             +S   G + +++ +   
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185

Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
            N Y P  SS+  ++ C+   C L      Q PS   +C Y  + + DGT++ G   ++ 
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSY-YQQMQDGTLTMGIYGKEK 244

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
             +   + +   +   I  GC  ++ G  +D  A +G+  LG  + S     A +     
Sbjct: 245 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQ 299

Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
            FS C  S     D +  ++FG   +   PG  ET         P Y   +T + VGG  
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGER 359

Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--- 364
           ++                I D+ TS T L   AY  ++   +           S LP   
Sbjct: 360 LDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDR--------HLSHLPRVY 411

Query: 365 ----FEYCYVLS------PNQTNFEYPVVNLTMKGG 390
               FEYCY  +          N   P + + M GG
Sbjct: 412 ELDGFEYCYRWTFAGDGVDLTHNVTVPRLTVEMAGG 447


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/330 (26%), Positives = 126/330 (38%), Gaps = 50/330 (15%)

Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
           G P  + ++ALDT SD  W+PC  CV C     S+S        ++P  S++   V C S
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGC-----STSKP------FAPIKSTSFRNVSCGS 152

Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
             C+        GS C +   Y S    ++  +V+D L LA D           +FGC  
Sbjct: 153 PHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLAADPIPG------YTFGCVN 204

Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDK 286
             TGS    +AP               +  +Q L  ++FS C  S    + +G +  G  
Sbjct: 205 KTTGS----SAPQQGLLGLGRGPLS-LLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259

Query: 287 GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
             P + +    LR    +  Y + +  + VG   V+   +A           IFDSGT F
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319

Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
           T L +P YT +   F      K   +T    F+ CY           P +     G    
Sbjct: 320 TRLAEPVYTAVRNEFRRRVGPKLPVTTLG-GFDTCY-----NVPIVVPTITFLFSGMNVA 373

Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
              D IVI S+      L   G    DNVN
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGA--PDNVN 401


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 135/323 (41%), Gaps = 35/323 (10%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
            R  Y   R  G A Q  D      +A         +G L+Y    S+G P ++  + +D
Sbjct: 99  RRAEYILRRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
           TGSDL W+ C   S      S    + D     P  SS+ + VPC   +C    +     
Sbjct: 159 TGSDLSWVQCKPCSAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
            + + C Y V Y  DG+ +TG    D L L+     + S      FGCG  Q+G F +G 
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
             +GL GLG ++ S+  +    G     FS C  +  +  G ++ G  G P      FS 
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-VGGPSGAAPGFST 321

Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSAI-----FDSGTSFTYLNDPAYTQISET 347
            Q  P+      Y + +T +SVGG  ++   SA       D+GT  T L   AY  +   
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSA 381

Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
           F S +A     T+ S+   + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 134/323 (41%), Gaps = 35/323 (10%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
            R  Y   R  G A Q  D       A         +G L+Y    S+G P ++  + +D
Sbjct: 99  RRAEYILRRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158

Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
           TGSDL W+ C   +      S    + D     P  SS+ + VPC   +C    +     
Sbjct: 159 TGSDLSWVQCKPCAAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
            + + C Y V Y  DG+ +TG    D L L+     + S      FGCG  Q+G F +G 
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
             +GL GLG ++ S+  +    G     FS C  +  +  G ++ G  G P      FS 
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-VGGPSGAAPGFST 321

Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSAI-----FDSGTSFTYLNDPAYTQISET 347
            Q  P+      Y + +T +SVGG  ++   SA       D+GT  T L   AY  +   
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSA 381

Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
           F S +A     T+ S+   + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 141/358 (39%), Gaps = 55/358 (15%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           A RD    L    LA +G    P+  ++G    +  +    +     +G PA   ++A+D
Sbjct: 72  AARDASRLLYLDSLAVKGRAYAPI--ASGRQLLQTPT----YVVRARLGTPAQQLLLAVD 125

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
           T +D  W+PC  C  C              + ++P  S++   VPC S  C L     C 
Sbjct: 126 TSNDAAWIPCSGCAGCPTS-----------SPFNPAASASYRPVPCGSPQCVLAPNPSCS 174

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
               +C + + Y +D ++    L +D L +A D      V    +FGC +  TG+    A
Sbjct: 175 PNAKSCGFSLSY-ADSSLQAA-LSQDTLAVAGD------VVKAYTFGCLQRATGT---AA 223

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
            P GL GLG    S   +   + +   +FS C  S    + +G +  G  G P + +T  
Sbjct: 224 PPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTP 281

Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
            L   H +  Y + +T + VG   V+   SA           + DSGT FT L  P Y  
Sbjct: 282 LLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLA 341

Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI 401
           + +             +S   F+ CY      T   +P V L   G       + +VI
Sbjct: 342 LRDEVRRRVGAGAAAVSSLGGFDTCY-----NTTVAWPPVTLLFDGMQVTLPEENVVI 394


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 148/381 (38%), Gaps = 76/381 (19%)

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD----CVSCV 139
           KTP + S        +S G  + T +S G P  +  +  DTGS L W PC     C  C 
Sbjct: 61  KTPKSNSVFKSPLSPHSYG-AYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS 119

Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG-------SNC 186
                 +G       + P  SS+S  V C +  C      +++ QC S           C
Sbjct: 120 FPKIDPTG----IPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTC 175

Query: 187 P-YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
           P Y V+Y S  T   G L+ + L         K + + +  GC      SFL    P+G+
Sbjct: 176 PAYVVQYGSGST--AGLLLSETLDFP-----DKXIPNFV-VGC------SFLSIHQPSGI 221

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--------------DGTGRISFGDKGSPGQ 291
            G G    S+PS     GL    F+ C  S              D TG  S G   +P +
Sbjct: 222 AGFGRGSESLPS---QMGL--KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFR 276

Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPA 340
                S       Y + I ++ VG  AV   +            +I DSG++FT+++ P 
Sbjct: 277 QNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPV 336

Query: 341 YTQISETF-NSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTMKGGGPFF--VN 396
              ++  F   LA   R T    L     C+ +S  + + ++P +    KGG  +   +N
Sbjct: 337 LEVVAREFEKQLANWTRATDVETLTGLRPCFDIS-KEKSVKFPELIFQFKGGAKWALPLN 395

Query: 397 DPIVIVSSEPKGLYLYCLGVV 417
           +   +VSS      + CL VV
Sbjct: 396 NYFALVSSS----GVACLTVV 412


>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
 gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
          Length = 817

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 154/366 (42%), Gaps = 71/366 (19%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNI---YSPN 158
           F ++  + VG P   F V +DTGS    +P  +C         +S    D N+   YS  
Sbjct: 203 FEYFIPILVGTPPQMFTVQVDTGSTSLAVPGSNCYLYKSQSIKTSCSCSDGNLDGLYSLE 262

Query: 159 TSSTSSKVPCNSTL-CELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
            S +S+++ C+ T  C     C +  SN  CP+ ++Y  DG+   G LV D + +     
Sbjct: 263 ESISSNQLNCSDTSNC---NTCKNNKSNKPCPFVLKY-GDGSFIAGSLVIDHVTIGDFTV 318

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP---------NGLFGLGMDKTS------VPSILA 260
            +K       FG  + ++ SF     P         +G+ GL   +        + S + 
Sbjct: 319 PAK-------FGNIQKESLSFSQLTCPSTQRSQAVRDGILGLSFQQLDPDNGDDIFSKIV 371

Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP--FSLRQTHPTYNITITQVSVGGNA 318
               IPN FSMC G DG G ++ G        ETP    +  +H  Y+IT+T + VG ++
Sbjct: 372 AHYNIPNVFSMCLGKDG-GLLTIGGTNDHITQETPKYTPIFDSH-YYSITVTNIYVGNDS 429

Query: 319 VNFE----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS-----DLPFEY-- 367
           +N       ++I DSGT+  Y +D       E F S+ +   E         + PF    
Sbjct: 430 LNLAPPDLSTSIVDSGTTLLYFSD-------EIFYSIVRNLEEKHCELPGICNDPFWEGN 482

Query: 368 CYVLSPNQTNFEYPVVNLTMKG--GGPFFVNDPIVIVSSEPKGLY------LYCLGVVKS 419
           C+ L     + EYP + L MKG  G P F  +        P  LY      LYC G+   
Sbjct: 483 CHHLEEKLIS-EYPTIYLEMKGMNGEPSFKLEV-------PPDLYFLNINGLYCFGISHM 534

Query: 420 DNVNII 425
             ++++
Sbjct: 535 KEISVL 540


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 62/131 (47%), Gaps = 21/131 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG PA S  + +DTGSDL WL C  C SC    +          I+ P  SS+ 
Sbjct: 129 YFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADP---------IFDPRNSSSF 179

Query: 164 SKVPCNSTLC---ELQKQCPSAG--SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            ++PC S LC   E+     S G  S C YQV Y  DG+ S G    D+  L T  K   
Sbjct: 180 QRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTGSKAMS 238

Query: 219 SVDSRISFGCG 229
                ++FGCG
Sbjct: 239 -----VAFGCG 244


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 112/272 (41%), Gaps = 44/272 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V VG P   F + +DTGSDL WL C  C+ C        G V D     P  S++ 
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCF----DQRGPVFD-----PMASTSY 200

Query: 164 SKVPCNSTLCEL------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             V C  T C L       + C S+ S+ CPY   Y  D + +TG L  +   +      
Sbjct: 201 RNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWY-GDQSNTTGDLALEAFTVNLTASS 259

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           S+ VD  +  GCG    G F   A    L GLG    S  S L  + +  ++FS C    
Sbjct: 260 SRRVDG-VVLGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHAFSYCLVDH 313

Query: 277 GTG---RISFGDK----GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
           G+    +I FGD       P    T F+      T Y + +  + VGG  ++   +    
Sbjct: 314 GSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGV 373

Query: 325 --------AIFDSGTSFTYLNDPAYTQISETF 348
                    I DSGT+ +Y  +PAY  I + F
Sbjct: 374 SKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAF 405


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 125/301 (41%), Gaps = 49/301 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           ++ +V +G P   + + LDTGSDL W+   CV C H     +G       Y P  SS+  
Sbjct: 90  YFMDVFIGTPPKHYSLILDTGSDLNWI--QCVPC-HDCFEQNGPY-----YDPKESSSFR 141

Query: 165 KVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
            + C+   C L         C +    CPY   Y  D + +TG    +   +       K
Sbjct: 142 NIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWY-GDSSNTTGDFATETFTVNLTSPTGK 200

Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
           S   R+    FGCG    G F  GA+  GL GLG    S  S L  Q L  +SFS C   
Sbjct: 201 SEFKRVENVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 255

Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFSLR---QTHPT---YNITITQVSVGGNAVNFEF 323
               ++ + ++ FG DK      E  F+     + +P    Y + I  + VGG  +N   
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPE 315

Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
           S            I DSGT+ +Y  +PAY  I + F  + K K      D P  + CY +
Sbjct: 316 STWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAF--VKKVKGYPIVQDFPILDPCYNV 373

Query: 372 S 372
           S
Sbjct: 374 S 374


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 136/336 (40%), Gaps = 72/336 (21%)

Query: 99  NSLGFLH----YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
           + L F H       ++VG P  +  + LDTGS+L WL C     +             ++
Sbjct: 51  DKLSFRHNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLG------------SV 98

Query: 155 YSPNTSSTSSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
           ++P +SST S VPC+S +C  + +       C      C   + Y +D T   G L  D 
Sbjct: 99  FNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFCHVAISY-ADATSIEGNLAHDT 157

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDG---AAPNGLFGLGMDKTSVPSILANQGL 264
             + +  +          FGC  + +G   D    A   GL  +GM++ S+ S +   G 
Sbjct: 158 FVIGSVTRPGT------LFGC--MDSGLSSDSEEDAKSTGL--MGMNRGSL-SFVNQLGF 206

Query: 265 IPNSFSMCF-GSDGTGRISFGDKGSPGQGE---TPFSLRQT------HPTYNITITQVSV 314
             + FS C  GSD +G +  GD      G    TP  L+ T         Y + +  + V
Sbjct: 207 --SKFSYCISGSDSSGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRV 264

Query: 315 GGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
           G   ++   S            + DSGT FT+L  P YT +   F  +A+ K      D 
Sbjct: 265 GSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEF--IAQTKSVLRIVDD 322

Query: 364 P-------FEYCY-VLSPNQTNFE-YPVVNLTMKGG 390
           P        + CY V S  + NF   PV++L  +G 
Sbjct: 323 PNFVFQGTMDLCYRVGSSTRPNFTGLPVISLMFRGA 358


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 131/342 (38%), Gaps = 46/342 (13%)

Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           SVG P    +  +DTGS + W+ C  C  C               I+ P+ S T   +PC
Sbjct: 102 SVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTP---------IFDPSKSKTYKTLPC 152

Query: 169 NSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
           +S +C+     PS  S+   C Y ++Y  DG+ S G L  + L L +    S    + + 
Sbjct: 153 SSNMCQSVISTPSCSSDKIGCKYTIKY-GDGSHSQGDLSVETLTLGSTNGSSVQFPNTV- 210

Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDGTGR 280
            GCG    G+F    +     G G          +  G     FS C       S+ + +
Sbjct: 211 IGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGG----KFSYCLAPMFSQSNSSSK 266

Query: 281 ISFGDKG---SPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF------------EFS 324
           ++FGD       G   TP  S   +   Y +T+   SVG   + F            E +
Sbjct: 267 LNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGN 326

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGT+ T L    Y+ +        +  R +  S+     CY  +P+    + PV+ 
Sbjct: 327 IIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNF-LSLCYQTTPS-GQLDVPVIT 384

Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              KG       +PI       +G  + C     S+ V+I G
Sbjct: 385 AHFKGADVEL--NPISTFVQVAEG--VVCFAFHSSEVVSIFG 422


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/171 (30%), Positives = 76/171 (44%), Gaps = 22/171 (12%)

Query: 65  HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
            R R+   + + LA +  D+   T   G  T  L      ++  + +G PA S  + +DT
Sbjct: 15  RRVRWIESKAK-LAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPARSLFMVVDT 73

Query: 125 GSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG 183
           GSDL WL C  C SC    +          I+ P  SS+  ++PC S LC+  +    +G
Sbjct: 74  GSDLPWLQCQPCKSCYKQADP---------IFDPRNSSSFQRIPCLSPLCKALEVHSCSG 124

Query: 184 -----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
                S C YQV Y  DG+ S G    D+  L T  K        ++FGCG
Sbjct: 125 SRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTGSKAMS-----VAFGCG 169


>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C EL       Q  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ + L     S C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312


>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
 gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C EL       Q  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ + L     S C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 123/307 (40%), Gaps = 52/307 (16%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +   V +G P  +  + LDT +D  W PC  C+ C     SS+        +S   SST 
Sbjct: 95  YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGC-----SST------TTFSAQNSSTF 143

Query: 164 SKVPCNSTLCELQK--QCPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
           + + C+   C   +   CP+ G+ +C +   Y  D T S   LV+D LHL  +      V
Sbjct: 144 ATLDCSKPECTQARGLSCPTTGNVDCLFNQTYGGDSTFS-ATLVQDSLHLGPN------V 196

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
               SFGC    +GS +    P GL GLG    S+  I  +  L    FS C  S     
Sbjct: 197 IPNFSFGCISSASGSSI---PPQGLMGLGRGPLSL--ISQSGSLYSGLFSYCLPSFKSYY 251

Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
            +G +  G  G P    T   L   H P+ Y + +T +SVG   V           N   
Sbjct: 252 FSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGA 311

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPV 382
             I DSGT  T      YT + + F    +++   S S L  F+ C+           P 
Sbjct: 312 GTIIDSGTVITRFVPAIYTAVRDEF----RKQVGGSFSPLGAFDTCFA---TNNEVSAPA 364

Query: 383 VNLTMKG 389
           + L + G
Sbjct: 365 ITLHLSG 371


>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
          Length = 357

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C EL       Q  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ +     +FS C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T    ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 122/287 (42%), Gaps = 45/287 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + VG P  S  + +D+GSD+ W+ C  C  C H  +          ++ P  S++ 
Sbjct: 43  YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDP---------LFDPADSASF 93

Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             V C+S +C+   Q  +AG N   C Y+V Y  DG+ + G L  + L L       ++V
Sbjct: 94  MGVSCSSAVCD---QVDNAGCNSGRCRYEVSY-GDGSSTKGTLALETLTLG------RTV 143

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT-- 278
              ++ GCG +  G F+  A   GL G  M  + V  +   +G   N+FS C  S  T  
Sbjct: 144 VQNVAIGCGHMNQGMFVGAAGLLGLGGGSM--SFVGQLSRERG---NAFSYCLVSRVTNS 198

Query: 279 -GRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------ 324
            G + FG +  P G    P       P+ Y I ++ + VG   V      FE +      
Sbjct: 199 NGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGG 258

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
            + D+GT+ T     AY    + F          S   + F+ CY L
Sbjct: 259 VVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSI-FDTCYNL 304


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 118/313 (37%), Gaps = 61/313 (19%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDC--VSCVHGLNSSSGQVIDF--------- 152
           ++  +V +G PAL + + LDT +DL W+ C        H    S GQ +           
Sbjct: 123 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAK 182

Query: 153 -----NIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFL 203
                N Y P  SS+  ++ C+   C +      Q PS   +C Y  +   DGT++ G  
Sbjct: 183 KEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIY 241

Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
            ++   +   + +   +   I  GC  ++ G  +D  A +G+  LG    S     A + 
Sbjct: 242 GKEKATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR- 297

Query: 264 LIPNSFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSV 314
                FS C  S     D +  ++FG   +   PG  ET         P Y   +T V V
Sbjct: 298 -FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLV 356

Query: 315 GGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
           GG  ++                I D+ TS T L   AY  ++   +           S L
Sbjct: 357 GGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDR--------HLSHL 408

Query: 364 P-------FEYCY 369
           P       FEYCY
Sbjct: 409 PRVYELEGFEYCY 421


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 91/377 (24%), Positives = 143/377 (37%), Gaps = 57/377 (15%)

Query: 73  RGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
           RGR LA  G D TP   +AG        L+S G L+  N ++G P       +D   +L 
Sbjct: 27  RGRLLA--GVDATPP--AAGGAVAVPIYLSSQG-LYVANFTIGTPPQPVSAVVDLTGELV 81

Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPY 188
           W  C  C  C            D  ++ P  SST   +PC S LCE     P +  NC  
Sbjct: 82  WTQCTPCQPCFEQ---------DLPLFDPTKSSTFRGLPCGSHLCE---SIPESSRNCTS 129

Query: 189 QVRYLSDGTMS--TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLF 246
            V      T +  TG          TD     +    + FGC  +          P+G+ 
Sbjct: 130 DVCIYEAPTKAGDTGG------KAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIV 183

Query: 247 GLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQG----ETPFSLRQ-- 300
           GLG      P  L  Q  +  +FS C     +G +  G       G     TPF ++   
Sbjct: 184 GLGR----TPWSLVTQMNV-TAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSA 238

Query: 301 ------THPTYNITITQVSVGGNAVNFEFSA----IFDSGTSFTYLNDPAYTQISETFNS 350
                 ++P Y + +  +  GG  +    S+    + D+ +  +YL D AY  + +   +
Sbjct: 239 GSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTA 298

Query: 351 LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY 410
            A   +  ++   P++ C+   P     + P +  T  GG    V     +++S   G  
Sbjct: 299 -AVGVQPVASPPKPYDLCF---PKAVAGDAPELVFTFDGGAALTVPPANYLLAS---GNG 351

Query: 411 LYCLGVVKSDNVNIIGR 427
             CL +  S ++N+ G 
Sbjct: 352 TVCLTIGSSASLNLTGE 368


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 128/343 (37%), Gaps = 56/343 (16%)

Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK 177
             V +DTGSDL W+ C   S  +             ++ P+ S++ + VPCN++ CE   
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDP--------LFDPSGSASYAAVPCNASACEASL 227

Query: 178 QCPSA----------------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           +  +                    C Y + Y  DG+ S G L  D + L        SVD
Sbjct: 228 KAATGVPGSCATVGGGGGGGKSERCYYSLAY-GDGSFSRGVLATDTVALG-----GASVD 281

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
             + FGCG    G F       GL GLG  + S+ S  A +      FS C       D 
Sbjct: 282 GFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDA 335

Query: 278 TGRISFGDKGSPGQGETPFSLRQT------HPTYNITIT----QVSVGGNAVNFEFSAIF 327
            G +S G   S  +  TP S  +        P Y + +T      +    A     + + 
Sbjct: 336 AGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLL 395

Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           DSGT  T L    Y  +   F      E+   +      + CY L+      + P++ L 
Sbjct: 396 DSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLT-GHDEVKVPLLTLR 454

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK---SDNVNIIG 426
           ++GG    V+   ++  +   G  + CL +      D   IIG
Sbjct: 455 LEGGADMTVDAAGMLFMARKDGSQV-CLAMASLSFEDQTPIIG 496


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 135/340 (39%), Gaps = 53/340 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G P  + ++A+DT +D  W+PC  C  C   L            ++P  S+T 
Sbjct: 93  YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTL------------FAPEKSTTF 140

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDS 222
             V C +  C   KQ P+ G     +   L+ G+ S    LV+D + LATD   S     
Sbjct: 141 KNVSCAAPEC---KQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLATDPVPS----- 192

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
             +FGC    TG+    A P GL GLG    S+ S    Q L  ++FS C  S    + +
Sbjct: 193 -YTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 246

Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
           G +  G    P + +    L+    +  Y + +  + VG   V+   +A           
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           IFDSGT FT L  P Y  + + F      K  T TS   F+ CY           P +  
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKL-TVTSLGGFDTCY-----NVPIVVPTITF 360

Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
              G       D I+I S+      L   G    DNVN +
Sbjct: 361 IFTGMNVTLPQDNILIHSTAGSTTCLAMAGA--PDNVNSV 398


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 128/343 (37%), Gaps = 56/343 (16%)

Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK 177
             V +DTGSDL W+ C   S  +             ++ P+ S++ + VPCN++ CE   
Sbjct: 177 LTVIVDTGSDLTWVQCKPCSVCYAQRDP--------LFDPSGSASYAAVPCNASACEASL 228

Query: 178 QCPSA----------------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           +  +                    C Y + Y  DG+ S G L  D + L        SVD
Sbjct: 229 KAATGVPGSCATVGGGGGGGKSERCYYSLAY-GDGSFSRGVLATDTVALG-----GASVD 282

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
             + FGCG    G F       GL GLG  + S+ S  A +      FS C       D 
Sbjct: 283 GFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDA 336

Query: 278 TGRISFGDKGSPGQGETPFSLRQT------HPTYNITIT----QVSVGGNAVNFEFSAIF 327
            G +S G   S  +  TP S  +        P Y + +T      +    A     + + 
Sbjct: 337 AGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLL 396

Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
           DSGT  T L    Y  +   F      E+   +      + CY L+      + P++ L 
Sbjct: 397 DSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLT-GHDEVKVPLLTLR 455

Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK---SDNVNIIG 426
           ++GG    V+   ++  +   G  + CL +      D   IIG
Sbjct: 456 LEGGADMTVDAAGMLFMARKDGSQV-CLAMASLSFEDQTPIIG 497


>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
 gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
 gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
 gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
 gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
 gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
 gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
 gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
 gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
          Length = 357

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 125/327 (38%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C        LQ+  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ +     +FS C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 135/351 (38%), Gaps = 46/351 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  +V VG P   F + +DTGSDL WL C  C+ C        G V D     P  SS+ 
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFE----QRGPVFD-----PAASSSY 199

Query: 164 SKVPCNSTLC------ELQKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
             V C    C      E  + C   A  +CPY   Y      +    +E      T    
Sbjct: 200 RNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 259

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
           S+ VD  + FGCG    G F   A    L GLG    S  S L  + +  ++FS C    
Sbjct: 260 SRRVDG-VVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHTFSYCLVEH 313

Query: 274 GSDGTGRISFGDK----GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFS--- 324
           GSD   ++ FG+       P    T F+   +     Y + +  V VGG+ +N       
Sbjct: 314 GSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWD 373

Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQ 375
                    I DSGT+ +Y  +PAY  I + F  L   +      D P    CY +S  +
Sbjct: 374 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMS-RLYPLIPDFPVLNPCYNVSGVE 432

Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
              E P ++L    G  +        V  +P G+    +       ++IIG
Sbjct: 433 RP-EVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIG 482


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 85/303 (28%), Positives = 127/303 (41%), Gaps = 55/303 (18%)

Query: 84  KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
           +TP+T   G+  Y +          +++G PALS    +DTGSDL W  C+ C  C    
Sbjct: 30  ETPVTPDIGSGEYLIQ---------MAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSS 80

Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMST 200
                          ++SST SKV C S+LC+      C + G +C Y   Y  D + ++
Sbjct: 81  IYDP-----------SSSSTYSKVLCQSSLCQPPSIFSCNNDG-DCEYVYPY-GDRSSTS 127

Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
           G L ++   ++     S+S+   I+FGCG    G   D     GL G G    S+ S L 
Sbjct: 128 GILSDETFSIS-----SQSL-PNITFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLG 177

Query: 261 NQGLIPNSFSMCF----GSDGTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVS 313
               + N FS C      S  T  +  G+  S      G TP     +   Y +++  +S
Sbjct: 178 PS--MGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGIS 235

Query: 314 VGGNAV-----NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
           VGG ++      F+  +      I DSGT+ T+L   AY  + E   S     +     D
Sbjct: 236 VGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLPQADGQLD 295

Query: 363 LPF 365
           L F
Sbjct: 296 LCF 298


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 125/327 (38%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C        LQ+  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ +     +FS C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 113/284 (39%), Gaps = 39/284 (13%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  + +G P  S  + +D+GSD+ W+ C  C  C H  +          ++ P  S++ 
Sbjct: 43  YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDP---------LFDPADSASF 93

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
             V C+S +C+  +        C Y+V Y  DG+ + G L  + L         ++V   
Sbjct: 94  MGVSCSSAVCDRVENAGCNSGRCRYEVSY-GDGSYTKGTLALETLTFG------RTVVRN 146

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---GR 280
           ++ GCG    G F+  A   GL G  M      S     G   N+FS C  S GT   G 
Sbjct: 147 VAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLS-----GQTGNAFSYCLVSRGTNTNGF 201

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIF 327
           + FG +  P G    P       P+ Y I +  + VG   V      F+ +       + 
Sbjct: 202 LEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVM 261

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
           D+GT+ T     AY      F    +     S   + F+ CY L
Sbjct: 262 DTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSI-FDTCYNL 304


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 118/287 (41%), Gaps = 36/287 (12%)

Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
           P +   V LD+ SD+ W+   CV C   +     QV  F  Y P+ S TS+   C+S  C
Sbjct: 25  PGVIQTVVLDSASDVPWV--QCVPCP--IPPCHPQVDSF--YDPSRSPTSAAFSCSSPTC 78

Query: 174 EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
                    C  A + C Y VRY  DG+ ++G  + D+L L      + +  S   FGC 
Sbjct: 79  TALGPYANGC--ANNQCQYLVRY-PDGSSTSGAYIADLLTL-----DAGNAVSGFKFGCS 130

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSP 289
             + GSF   AA  G+  LG    S+ S  A++    N+FS C  +  +    F   G P
Sbjct: 131 HAEQGSFDARAA--GIMALGGGPESLLSQTASR--YGNAFSYCIPATASDS-GFFTLGVP 185

Query: 290 GQGETPF------SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
            +  + +        RQ    Y + +  ++VGG  +      F   ++ DS T+ T L  
Sbjct: 186 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 245

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
            AY  +   F S     R         + CY  +    N   P ++L
Sbjct: 246 TAYQALRAAFRSSMTMYRSAPPKGY-LDTCYDFT-GVVNIRLPKISL 290


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 150/381 (39%), Gaps = 60/381 (15%)

Query: 68  RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSD 127
           R  RL    LAA  N +      +GN  + +N         +++G P  ++   +DTGSD
Sbjct: 72  RLERLNAMVLAASSNAEINSPVLSGNGEFLMN---------LAIGTPPETYSAIMDTGSD 122

Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNC 186
           L W  C  C  C    +          I+ P  SS+ SK+ C+S LC+   Q  S   +C
Sbjct: 123 LIWTQCKPCTQCFDQPSP---------IFDPKKSSSFSKLSCSSQLCKALPQS-SCSDSC 172

Query: 187 PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGL 245
            Y   Y  D + + G +  +           K     + FGCG    G  F  G+   GL
Sbjct: 173 EYLYTY-GDYSSTQGTMATETFTFG------KVSIPNVGFGCGEDNEGDGFTQGS---GL 222

Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT-------GRISFGDKGSPGQGETPFS 297
            GLG    S+ S L         FS C  S D T       G ++  +  S     TP  
Sbjct: 223 VGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLI 277

Query: 298 LRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQIS 345
                P+ Y +++  +SVGG  +  + S            I DSGT+ TYL + A+  + 
Sbjct: 278 QNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVK 337

Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
           + F S      + S +    E CY L  + +  E P + L   G       +  +I  S 
Sbjct: 338 KEFTSQMGLPVDNSGAT-GLELCYNLPSDTSELEVPKLVLHFTGADLELPGENYMIADSS 396

Query: 406 PKGLYLYCLGVVKSDNVNIIG 426
              + + CL +  S  ++I G
Sbjct: 397 ---MGVICLAMGSSGGMSIFG 414


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 95/213 (44%), Gaps = 38/213 (17%)

Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQ- 176
           V +DTGSDL W+ C+ C+SC +             ++ P+TSS+   +PCNS+ C+ LQ 
Sbjct: 158 VIIDTGSDLTWVQCEPCMSCYNQQGP---------VFKPSTSSSYQSIPCNSSTCQSLQL 208

Query: 177 -----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRV 231
                  C S  SNC Y V Y  DG+ + G L  + L          SV S   FGCG+ 
Sbjct: 209 TTGNAGACESNPSNCSYAVNY-GDGSYTNGELGAEHLSFG-----GISV-SNFVFGCGKN 261

Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGS 288
             G F      +GL GLG    S+  I          FS C     +  +G ++ G++ S
Sbjct: 262 NKGLF---GGVSGLMGLGRSNLSL--ISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESS 316

Query: 289 PGQGETPFSLRQTHPT------YNITITQVSVG 315
             +  TP +  +  P       Y + +T + VG
Sbjct: 317 VFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 128/329 (38%), Gaps = 52/329 (15%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++   + VG P       +DTGSDL W  C  C +C               I+ P+ SST
Sbjct: 60  IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP---------IFDPSNSST 110

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             +  CN             G++C Y++ Y +D T S G L  + + + +   +   V  
Sbjct: 111 FKEKRCN-------------GNSCHYKIIY-ADTTYSKGTLATETVTIHSTSGE-PFVMP 155

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
             + GCG     S       +G+ GL    +S+  I    G  P   S CF S GT +I+
Sbjct: 156 ETTIGCGH---NSSWFKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKIN 210

Query: 283 FGDK---GSPGQGETPFSLRQTHP-TYNITITQVSVGGNAVN--------FEFSAIFDSG 330
           FG        G   T   L    P  Y + +  VSVG   V          E + I DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           T+ TY        + E  +      R  + + +D+    CY      T   +PV+ +   
Sbjct: 271 TTLTYFPVSYCNLVREAVDHYVTAVRTADPTGNDM---LCYY---TDTIDIFPVITMHFS 324

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           GG    ++   + + +  +G   +CL ++
Sbjct: 325 GGADLVLDKYNMYIETITRG--TFCLAII 351


>gi|7548466|gb|AAA34371.2| secreted aspartyl proteinase 1 [Candida albicans]
          Length = 391

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
           LN+    +  ++++G     F V +DTGS   W+P   V+C        GQ  DF     
Sbjct: 57  LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY+P +S+TS  +                    P+ + Y  DG+ S G L +D       
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFNIGY-GDGSSSQGTLYKDT------ 148

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
                     + FG   +    F D    + P G+ G+G        D  +VP  L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198

Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
           +I  N++S+   S    TG+I FG         +  ++  T      IT+  +   G  +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           N     + DSGT+ TYL       I + F +  K   +  T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 128/329 (38%), Gaps = 52/329 (15%)

Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
           ++   + VG P       +DTGSDL W  C  C +C               I+ P+ SST
Sbjct: 60  IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP---------IFDPSNSST 110

Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
             +  CN             G++C Y++ Y +D T S G L  + + + +   +   V  
Sbjct: 111 FKEKRCN-------------GNSCHYKIIY-ADTTYSKGTLATETVTIHSTSGE-PFVMP 155

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
             + GCG     S       +G+ GL    +S+  I    G  P   S CF S GT +I+
Sbjct: 156 ETTIGCGH---NSSWFKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKIN 210

Query: 283 FGDK---GSPGQGETPFSLRQTHP-TYNITITQVSVGGNAVN--------FEFSAIFDSG 330
           FG        G   T   L    P  Y + +  VSVG   V          E + I DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270

Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
           T+ TY        + E  +      R  + + +D+    CY      T   +PV+ +   
Sbjct: 271 TTLTYFPVSYCNLVREAVDHYVTAVRTADPTGNDM---LCYY---TDTIDIFPVITMHFS 324

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
           GG    ++   + + +  +G   +CL ++
Sbjct: 325 GGADLVLDKYNMYIETITRG--TFCLAII 351


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 126/317 (39%), Gaps = 49/317 (15%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G PA   ++A+DT +D  W+PC  C  C              + ++P  S++ 
Sbjct: 54  YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTS-----------SPFNPAASASY 102

Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             VPC S  C L     C     +C + + Y +D ++    L +D L +A D      V 
Sbjct: 103 RPVPCGSPQCVLAPNPSCSPNAKSCGFSLSY-ADSSLQAA-LSQDTLAVAGD------VV 154

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DG 277
              +FGC +  TG+    A P GL GLG    S   +   + +   +FS C  S    + 
Sbjct: 155 KAYTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNF 209

Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA---------- 325
           +G +  G  G P + +T   L   H +  Y + +T + VG   V+   SA          
Sbjct: 210 SGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAG 269

Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            + DSGT FT L  P Y  + +             +S   F+ CY      T   +P V 
Sbjct: 270 TVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCY-----NTTVAWPPVT 324

Query: 385 LTMKGGGPFFVNDPIVI 401
           L   G       + +VI
Sbjct: 325 LLFDGMQVTLPEENVVI 341


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 94/381 (24%), Positives = 136/381 (35%), Gaps = 103/381 (27%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
            VS+G P     V LDTGS L W+PC     C +C     SS       +++ P  SS+S
Sbjct: 92  TVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNC-----SSLSAASPLHVFHPKNSSSS 146

Query: 164 SKVPCNS------------TLCELQKQCPSAGSNC------------PYQVRYLSDGTMS 199
             + C +            + C     CP  G+NC            PY V Y S  T  
Sbjct: 147 RLIGCRNPSCLWIHSPDHLSDCRAASSCP--GANCTPRNANANNVCPPYLVVYGSGST-- 202

Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
            G L+ D L         ++V + +  GC             P+GL G G    SVPS L
Sbjct: 203 AGLLISDTL-----RTPGRAVRNFV-IGCSLASVHQ-----PPSGLAGFGRGAPSVPSQL 251

Query: 260 ANQGLIPNSFSMCFGS---DGTGRIS------------------FGDKGSPGQGETPFSL 298
              GL    FS C  S   D    +S                  +           P+S+
Sbjct: 252 ---GL--TKFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSV 306

Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSA----------IFDSGTSFTYLNDPAYTQISETF 348
                 Y + +T ++VGG +V     A          I DSGT+F+Y +   +  ++   
Sbjct: 307 -----YYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAV 361

Query: 349 NSLAK---EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI----VI 401
            +       + +     L    C+ + P     E P ++L  KGG    +N P+    V+
Sbjct: 362 VAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGS--VMNLPVENYFVV 419

Query: 402 VSSEPKG-----LYLYCLGVV 417
               P G         CL VV
Sbjct: 420 AGPAPSGGAPAMAEAICLAVV 440


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 142/375 (37%), Gaps = 53/375 (14%)

Query: 73  RGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
           RGR LA  G D TP   +AG        L+S G L+  N ++G P       +D   +L 
Sbjct: 27  RGRLLA--GVDATPP--AAGGAVAVPIYLSSQG-LYVANFTIGTPPQPVSAVVDLTGELV 81

Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPY 188
           W  C  C  C            D  ++ P  SST   +PC S LCE     P +  NC  
Sbjct: 82  WTQCTPCQPCFEQ---------DLPLFDPTKSSTFRGLPCGSHLCE---SIPESSRNCTS 129

Query: 189 QVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGL 248
            V      T +     +      TD     +    + FGC  +          P+G+ GL
Sbjct: 130 DVCIYEAPTKAG----DTGGKAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIVGL 185

Query: 249 GMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQG----ETPFSLRQ---- 300
           G      P  L  Q  +  +FS C     +G +  G       G     TPF ++     
Sbjct: 186 GR----TPWSLVTQMNV-TAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGS 240

Query: 301 ----THPTYNITITQVSVGGNAVNFEFSA----IFDSGTSFTYLNDPAYTQISETFNSLA 352
               ++P Y + +  +  GG  +    S+    + D+ +  +YL D AY  + +   + A
Sbjct: 241 SDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTA-A 299

Query: 353 KEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY 412
              +  ++   P++ C+   P     + P +  T  GG    V     +++S   G    
Sbjct: 300 VGVQPVASPPKPYDLCF---PKAVAGDAPELVFTFDGGAALTVPPANYLLAS---GNGTV 353

Query: 413 CLGVVKSDNVNIIGR 427
           CL +  S ++N+ G 
Sbjct: 354 CLTIGSSASLNLTGE 368


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 35/287 (12%)

Query: 101 LGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
           +G L+Y    S+G P ++  + +DTGSDL W+ C   +      S    + D     P  
Sbjct: 43  IGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFD-----PAQ 97

Query: 160 SSTSSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
           SS+ + VPC   +C    +      + + C Y V Y  DG+ +TG    D L L+     
Sbjct: 98  SSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS----- 151

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
           + S      FGCG  Q+G F +G   +GL GLG ++ S+  +    G     FS C  + 
Sbjct: 152 ASSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTK 206

Query: 277 GT--GRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSAI-- 326
            +  G ++ G  G P      FS  Q  P+      Y + +T +SVGG  ++   SA   
Sbjct: 207 PSTAGYLTLG-VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG 265

Query: 327 ---FDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
               D+GT  T L   AY  +   F S +A     T+ S+   + CY
Sbjct: 266 GTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCY 312


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 118/306 (38%), Gaps = 65/306 (21%)

Query: 84  KTPLTFSAGND-TYRLN-SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
           KTP   SA +   YR       +   ++ +G P  S  + LDTGS L W+ C        
Sbjct: 54  KTPALKSAASPYNYRSRFKYSMILLVSLPIGTPPQSQQMILDTGSQLSWIQCH------- 106

Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLS 194
                 +     ++ P+ SS+ S +PCN  LC+       L   C      C Y   Y +
Sbjct: 107 -KKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFY-A 163

Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
           DGT++ G LV + +  +T +     +      GC         D +   G+ G+ + + S
Sbjct: 164 DGTLAEGNLVREKITFSTSQSTPPLI-----LGCAE-------DASDDKGILGMNLGRLS 211

Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP------------FSLRQTH 302
                A+Q  I   FS C  +    R  F   GS   GE P            FS  Q  
Sbjct: 212 ----FASQAKI-TKFSYCVPTRQV-RPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRM 265

Query: 303 P-----TYNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISE 346
           P      + + +  + +G   +N   SA           + DSG+ FTYL D AY ++ E
Sbjct: 266 PNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVRE 325

Query: 347 TFNSLA 352
               LA
Sbjct: 326 EVVRLA 331


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 143/363 (39%), Gaps = 65/363 (17%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIY 155
           RL +L ++    V+VG    +  + +DTGSDL W+ C  C  C +             ++
Sbjct: 139 RLQTLNYI----VTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP---------LF 185

Query: 156 SPNTSSTSSKVPCNSTLC-ELQKQCPSAG-------SNCPYQVRYLSDGTMSTGFLVEDV 207
           +P+ SS+   +PCNS  C  LQ    S+G       ++C YQ+ Y  DG+ S G L  + 
Sbjct: 186 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEK 244

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           L L   E     +D+ I FGCGR   G F      +GL GL   + S+ S      L  +
Sbjct: 245 LTLGKTE-----IDNFI-FGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGS 293

Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFS-LRQTHPT--------------YNITITQV 312
            FS C  + G      G  GS   G   FS  +   P               Y + +T +
Sbjct: 294 VFSYCLPTTGV-----GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGI 348

Query: 313 SVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
           S+GG  +N           ++ DSGT  T L+   Y      F       R T    +  
Sbjct: 349 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSI-L 407

Query: 366 EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIV--IVSSEPKGLYLYCLGVVKSDNVN 423
             C+ L+  +     P V    +G     V+   V   V S+   + L    +   D   
Sbjct: 408 NTCFNLTGYE-EVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTM 466

Query: 424 IIG 426
           IIG
Sbjct: 467 IIG 469


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 86/366 (23%), Positives = 139/366 (37%), Gaps = 41/366 (11%)

Query: 46  LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN-DKTPLTFSAGNDTYRLNSLGFL 104
           L   D PK  S  Y S   H  R+ +   R ++   +  +T  T S       + + G  
Sbjct: 35  LVHRDSPK--SPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGE 92

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +  ++S+G P    +   DTGSDL W  C  C  C   +           ++ P +S T 
Sbjct: 93  YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAP---------LFDPKSSKTY 143

Query: 164 SKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
             + C++  C+   +  S  S   C Y   Y  D + + G L  D + L +         
Sbjct: 144 RDLSCDTRQCQNLGESSSCSSEQLCQYSY-YYGDRSFTNGNLAVDTVTLPSTNGGPVYFP 202

Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT 278
             +  GCGR   G+F      +G+ GLG    S+ S + +   +   FS C   F S+  
Sbjct: 203 KTV-IGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESA 257

Query: 279 G---RISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV--------NFEFS 324
           G   ++ FG        G   TP   +     Y +T+  +SVG   +          E +
Sbjct: 258 GNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGN 317

Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
            I DSGTS T      +T+ +    +       T  +     +CY  +P   + + PV+ 
Sbjct: 318 IIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTP---DLKVPVIT 374

Query: 385 LTMKGG 390
               G 
Sbjct: 375 AHFNGA 380


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 107/263 (40%), Gaps = 34/263 (12%)

Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
           FL    VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S 
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166

Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           TS +V C+S  C EL       Q  C     +C Y V Y +    S G +V D L +   
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226

Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
                     + FGC   V+   F  G    G       +     P IL+ +     +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274

Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
            C  +D T  G +  G  D+ +   G T        PTY++T+   ++ G   V      
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334

Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
           I DSG   T L    +  + +T 
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 130/349 (37%), Gaps = 42/349 (12%)

Query: 74  GRGLAAQGNDKTPL-TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW-- 130
            R       D TP  T S G ++    ++     T  +  Q A+S  V +DT SD+ W  
Sbjct: 124 ARSTTVSNRDYTPSSTASVGTNSGTSKTIEKSDQTATNEHQDAVSQTVVVDTSSDIPWVQ 183

Query: 131 -LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQ----CPSAGS 184
            LPC    C          +    +Y P  SST + +PC S  C EL       C     
Sbjct: 184 CLPCPIPQC---------HLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTD 234

Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
            C Y V Y  DG  +TG  V D L ++        V     FGC     GSF +  A  G
Sbjct: 235 ECKYIVNY-GDGKATTGTYVTDTLTMS-----PTIVVKDFRFGCSHAVRGSFSNQNA--G 286

Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS----LRQ 300
           +  LG  + S+    A+     N+FS C     +    F   G P +    FS    ++ 
Sbjct: 287 ILALGGGRGSLLEQTADA--YGNAFSYCIPKPSSA--GFLSLGGPVEASLKFSYTPLIKN 342

Query: 301 TH-PT-YNITITQVSVGGNAV-----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
            H PT Y + +  + V G  +      F   A+ DSG   T L    Y  +   F S   
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMA 402

Query: 354 EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV 402
                +      + CY  +    + + P V+L   GG    +    +I+
Sbjct: 403 AYGPLAAPVRNLDTCYDFT-RFPDVKVPKVSLVFAGGATLDLEPASIIL 450


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 65/225 (28%), Positives = 99/225 (44%), Gaps = 27/225 (12%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++T + VG P     + LDTGSD+ W+ C+ C  C    +          I++P+ S++ 
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP---------IFNPSYSASF 207

Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
           S V C+S +C            C Y+  Y  DG+ STG    + L   T         + 
Sbjct: 208 STVGCDSAVCSQLDAYDCHSGGCLYEASY-GDGSYSTGSFATETLTFGTTSV------AN 260

Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
           ++ GCG    G F+  A    L GLG    S P+ +  Q    ++FS C     SD +G 
Sbjct: 261 VAIGCGHKNVGLFIGAAG---LLGLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGP 315

Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
           + FG K  P G   TP       PT Y +++T +S+   A  + F
Sbjct: 316 LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISISAIACVWSF 360


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 143/363 (39%), Gaps = 65/363 (17%)

Query: 97  RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIY 155
           RL +L ++    V+VG    +  + +DTGSDL W+ C  C  C +             ++
Sbjct: 60  RLQTLNYI----VTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP---------LF 106

Query: 156 SPNTSSTSSKVPCNSTLC-ELQKQCPSAG-------SNCPYQVRYLSDGTMSTGFLVEDV 207
           +P+ SS+   +PCNS  C  LQ    S+G       ++C YQ+ Y  DG+ S G L  + 
Sbjct: 107 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEK 165

Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
           L L   E     +D+ I FGCGR   G F      +GL GL   + S+ S      L  +
Sbjct: 166 LTLGKTE-----IDNFI-FGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGS 214

Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFS-LRQTHPT--------------YNITITQV 312
            FS C  + G      G  GS   G   FS  +   P               Y + +T +
Sbjct: 215 VFSYCLPTTGV-----GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGI 269

Query: 313 SVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
           S+GG  +N           ++ DSGT  T L+   Y      F       R T    +  
Sbjct: 270 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSI-L 328

Query: 366 EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIV--IVSSEPKGLYLYCLGVVKSDNVN 423
             C+ L+  +     P V    +G     V+   V   V S+   + L    +   D   
Sbjct: 329 NTCFNLTGYE-EVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTM 387

Query: 424 IIG 426
           IIG
Sbjct: 388 IIG 390


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 131/343 (38%), Gaps = 51/343 (14%)

Query: 112 GQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
           G  A +  V +DTGSDL W+   PC   SC    +          ++ P  S T + VPC
Sbjct: 188 GGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDP---------LFDPAASPTFAAVPC 238

Query: 169 NSTLC--ELQKQCPSAGS----------NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
            S  C   L+    + GS           C Y + Y  DG+ S G L +D L L T  K 
Sbjct: 239 GSPACAASLKDATGAPGSCARSAGNSEQRCYYALSY-GDGSFSRGVLAQDTLGLGTTTKL 297

Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
              V     FGCG    G F   A   GL GLG    S+ S  A +      FS C    
Sbjct: 298 DGFV-----FGCGLSNRGLFGGTA---GLMGLGRTDLSLVSQTAAR--FGGVFSYCLPAT 347

Query: 275 SDGTGRISFGDKGS---PGQGETPFSLRQTHPTY---NITITQVSVGGNAVNFEFSA--- 325
           +  TG +S G   S   P    T      T P +   NIT   V  G       F A   
Sbjct: 348 TTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNV 407

Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
           + DSGT  T L    Y  +   F    +       S L  + CY L+  +     P++ L
Sbjct: 408 LVDSGTVITRLAPSVYKAVRAEFARRFEYPAAPGFSIL--DACYDLT-GRDEVNVPLLTL 464

Query: 386 TMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
           T++GG    V+    + +V  +   + L    +   D   IIG
Sbjct: 465 TLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIG 507


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/313 (25%), Positives = 122/313 (38%), Gaps = 35/313 (11%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
           L++L F+    V +G PA    +  DTGSDL W+ C  C S  H             ++ 
Sbjct: 139 LDTLEFV--VAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQD------PLFD 190

Query: 157 PNTSSTSSKVPCNSTLCELQKQ-CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
           P+ SST + V C    C      C    + C Y VRY  DG+ +TG L  D L L +   
Sbjct: 191 PSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRY-GDGSSTTGVLSRDTLALTSSRA 249

Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
            +        FGCG    G F  G     L     + +      A+ G +   FS C  S
Sbjct: 250 LTG-----FPFGCGTRNLGDF--GRVDGLLGLGRGELSLPSQAAASFGAV---FSYCLPS 299

Query: 276 DG--TGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------NAVNFEF 323
               TG ++ G   +   G   ++     P     Y + +  + +GG       AV    
Sbjct: 300 SNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRG 359

Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
             + DSGT  TYL   AY  + + F  L  E+   +  +   + CY  +  ++    P V
Sbjct: 360 GTLLDSGTVLTYLPAQAYALLRDRFR-LTMERYTPAPPNDVLDACYDFA-GESEVVVPAV 417

Query: 384 NLTMKGGGPFFVN 396
           +     G  F ++
Sbjct: 418 SFRFGDGAVFELD 430


>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
 gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
          Length = 472

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 102/408 (25%), Positives = 161/408 (39%), Gaps = 54/408 (13%)

Query: 6   RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
           R S +C LL L+      C     + F   H  + PV G+  V+ + +K      +AL  
Sbjct: 4   RRSVLCFLLALV------CI----WEFSRPHVEAAPVSGL--VNAIARK---VLPAALKE 48

Query: 66  RDRYFRLRGRGLAAQGNDKTPLTFSAGNDT--YRLNSLGFLHYTNVSVGQPALSFIVALD 123
                  + R LA    D +      G +T  Y  N L F    N+++G P +     + 
Sbjct: 49  GGAIVWKQRRTLANITTDFSVRGGDKGLETSFYVDNGLNFAM--NLNLGTPPVQHNFTMA 106

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----K 177
             S+ FW  C  CV C    N          ++S  +S++ +++PC S  C         
Sbjct: 107 LNSEFFWAACSPCVDCNVSTNDP--------LFSSASSTSYTRIPCTSPFCSTSPGFSTN 158

Query: 178 QCPSAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
            C S+    + C Y   Y +D + S G +  DV+ + T  K   +   R+S GCGR  T 
Sbjct: 159 ACGSSAVGSTTCLYNFSYSTDYS-SAGEMASDVVAMKTPRKTRGNKSLRMSLGCGREST- 216

Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG-TGRISFGDKGSPGQGE 293
           + L     +GL G      S    LA      + F  C  SD  +G+I  G+        
Sbjct: 217 TLLGILNTSGLVGFAKTDKSFIGQLAEMDYT-SKFIYCVPSDTFSGKIVLGNYKISSHSS 275

Query: 294 ---TPFSLRQTHPTY----NITITQV---SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
              TP  +  T   Y    +I+IT      V G   +     I DS  +F+Y    +YT 
Sbjct: 276 LSYTPMIVNSTALYYIGLRSISITDTLTFPVQGILADGTGGTIIDSTFAFSYFTPDSYTP 335

Query: 344 ISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQTNFEYPVVNLTM 387
           + +   +L     + S+++    L  + CY +S N  + E   V L +
Sbjct: 336 LVQAIQNLNSNLTKVSSNETAALLGNDICYNVSVNDDDAENATVCLAV 383


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 96/395 (24%), Positives = 142/395 (35%), Gaps = 61/395 (15%)

Query: 60  YSALAHRDRYFRLRG---RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
           Y  L    R   LRG   R + A  ND      S G            +  N+S+G P +
Sbjct: 56  YQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGG----------AYLMNISLGTPPV 105

Query: 117 SFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
             +   DTGSDL W  C  C +C   +           ++ P  S T   + C++  C+ 
Sbjct: 106 PMLGIADTGSDLIWRQCLPCPNCYEQVEP---------LFDPKESETYKTLDCDNEFCQD 156

Query: 176 QKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
             Q  S   +  C Y   Y  D + + G L  D L + + E    S    I+FGCG    
Sbjct: 157 LGQQGSCDDDNTCTYSYSY-GDRSYTRGDLSSDTLTIGSTEGDPASFPG-IAFGCGHDNG 214

Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT--GRISFGDKG- 287
           G+F +         +G+    +  ++     +   FS C     SD T   +I+FG  G 
Sbjct: 215 GTFNE----KDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGV 270

Query: 288 --SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------------EFSAIFDSGT 331
               G   TP         Y +T+  +SVG   V F              E + I DSGT
Sbjct: 271 VSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGT 330

Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
           + T L    YT +     +    +  T  + + F  CY    +  N E P +     G  
Sbjct: 331 TLTLLPQDFYTDVESALTNAIGGQTTTDPNGI-FSLCY---SSVNNLEIPTITAHFTGAD 386

Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
                    +   E     L C  ++ S N+ I G
Sbjct: 387 VQLPPLNTFVQVQED----LVCFSMIPSSNLAIFG 417


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/454 (21%), Positives = 165/454 (36%), Gaps = 87/454 (19%)

Query: 35  HHRYS---------DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
           H R+S         + VKG +  D L ++     +  +++ DR    R +GL      + 
Sbjct: 42  HERFSGGGGDVDQVEAVKGFVNRDGLRRQRMNQRW-GVSNYDR----RRKGLETTTTTEV 96

Query: 86  PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
            +   AG D    ++LG  ++T V VG P   F +A DTGS+  W  C   +      + 
Sbjct: 97  EMPMRAGRD----DALG-EYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTK 151

Query: 146 SGQVIDF------------------------------NIYSPNTSSTSSKVPCNSTLCEL 175
             +                                   ++ P+ S +   V C S  C++
Sbjct: 152 KTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKI 211

Query: 176 Q-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
                     CP     C Y + Y +DG+ + GF   D + +     +   +++ ++ GC
Sbjct: 212 DLSQLFSLSLCPKPSDPCLYDISY-ADGSSAKGFFGTDTITVDLKNGKEGKLNN-LTIGC 269

Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDK 286
            +             G+ GLG  K S     A +      FS C     + R   S+   
Sbjct: 270 TKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYE--YGAKFSYCLVDHLSHRNVSSYLTI 327

Query: 287 GSPGQGETPFSLRQTH-----PTYNITITQVSVGGNAV---------NFEFSAIFDSGTS 332
           G     +    +++T      P Y + +  +S+GG  +         N +   + DSGT+
Sbjct: 328 GGHHNAKLLGEIKRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTT 387

Query: 333 FTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFE---YPVVNLTMK 388
            T L  PAY  + E    SL K KR T       ++C+    +   F+    P +     
Sbjct: 388 LTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCF----DAEGFDDSVVPRLVFHFA 443

Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
           GG  F       I+   P    + C+G+V  D +
Sbjct: 444 GGARFEPPVKSYIIDVAP---LVKCIGIVPIDGI 474


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/256 (31%), Positives = 109/256 (42%), Gaps = 38/256 (14%)

Query: 99  NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
           +S+G L Y   VS+G P ++  V +DTGSD+ W+ C   +                ++ P
Sbjct: 493 HSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKD------QLFDP 546

Query: 158 NTSSTSSKVPCNSTLC-ELQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
             SS+ S VPC +  C EL      C +AGS C Y V Y  DG+ +TG    D L L   
Sbjct: 547 AKSSSYSAVPCAADACSELSTYGHGC-AAGSQCGYVVSY-GDGSNTTGVYGSDTLTLTDA 604

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGL---GMDKTSVPSILANQGLIPNSFS 270
           +  +  +     FGCG  Q G F   A  +GL  L   GM  TS  S     G+    FS
Sbjct: 605 DAVTGFL-----FGCGHAQAGLF---AGIDGLLALGRKGMSLTSQTSGAYGGGV----FS 652

Query: 271 MCF--GSDGTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGN------AVN 320
            C       TG ++ G   S  G   T        PT Y + +T + VGG       A  
Sbjct: 653 YCLPPSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASA 712

Query: 321 FEFSAIFDSGTSFTYL 336
           F    + D+GT  T L
Sbjct: 713 FAGGTVVDTGTVITRL 728


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 117/317 (36%), Gaps = 71/317 (22%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           ++  V VG P     + +D+GSD+ W+ C  C  C    +          ++ P  S++ 
Sbjct: 133 YFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADP---------LFDPAASASF 183

Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           + VPC+S +C         C  +G+ C YQV Y  DG+ + G L  + L        S  
Sbjct: 184 TAVPCDSGVCRTLPGGSSGCADSGA-CRYQVSY-GDGSYTQGVLAMETLTFG----DSTP 237

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--- 276
           V   ++ GCG    G F+  A   GL GLG    S+   L        +FS C  S    
Sbjct: 238 VQG-VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAD 291

Query: 277 -GTGRISFG-DKGSP-GQGETPFSLRQTHPTYNITITQVSV------------------G 315
            G G + FG D   P G    P       P++                           G
Sbjct: 292 AGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDG 351

Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-------FEYC 368
           G  V      + D+GT+ T L   AY  + + F S       T   DLP        + C
Sbjct: 352 GGGV------VMDTGTAVTRLPPDAYAALRDAFAS-------TIGGDLPRAPGVSLLDTC 398

Query: 369 YVLSPNQTNFEYPVVNL 385
           Y LS    +   P V L
Sbjct: 399 YDLS-GYASVRVPTVAL 414


>gi|68475693|ref|XP_718053.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
 gi|68475828|ref|XP_717987.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
 gi|7548425|gb|AAA34368.2| secreted aspartyl proteinase 1 [Candida albicans]
 gi|7548465|gb|AAA34370.2| secreted aspartyl proteinase 1 [Candida albicans]
 gi|46439729|gb|EAK99043.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
 gi|46439804|gb|EAK99117.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
          Length = 391

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
           LN+    +  ++++G     F V +DTGS   W+P   V+C        GQ  DF     
Sbjct: 57  LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY+P +S+TS  +                    P+ + Y  DG+ S G L +D       
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
                     + FG   +    F D    + P G+ G+G        D  +VP  L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198

Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
           +I  N++S+   S    TG+I FG         +  ++  T      IT+  +   G  +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           N     + DSGT+ TYL       I + F +  K   +  T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/270 (27%), Positives = 110/270 (40%), Gaps = 54/270 (20%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
           H   V VG P     V LD GSDL W  C  V        ++ Q+    ++    SS+ S
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLV------GPTAKQLEP--VFDAARSSSFS 158

Query: 165 KVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTM-STGFLVEDVLHLATDEKQSKS 219
            +PC+S LCE      K C      C Y+  Y   G M +TG L  +             
Sbjct: 159 VLPCDSKLCEAGTFTNKTC--TDRKCAYENDY---GIMTATGVLATETFTFGAHH----G 209

Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD 276
           V + ++FGCG++  G+    A  +G+ GL     S+   LA        FS C   F   
Sbjct: 210 VSANLTFGCGKLANGTI---AEASGILGLSPGPLSMLKQLAI-----TKFSYCLTPFADR 261

Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT---------YNITITQVSVGGNAVNF--EFSA 325
            T  + FG     G+ +T   + QT P          Y + +  +SVG   ++   E  A
Sbjct: 262 KTSPVMFGAMADLGKYKTTGKV-QTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLA 320

Query: 326 I---------FDSGTSFTYLNDPAYTQISE 346
           I          DS T+  YL +PA+T++ +
Sbjct: 321 IKPDGTGGTVLDSATTLAYLVEPAFTELKK 350


>gi|193885194|pdb|2QZW|A Chain A, Secreted Aspartic Proteinase (Sap) 1 From Candida Albicans
 gi|193885195|pdb|2QZW|B Chain B, Secreted Aspartic Proteinase (Sap) 1 From Candida Albicans
          Length = 341

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
           LN+    +  ++++G     F V +DTGS   W+P   V+C        GQ  DF     
Sbjct: 7   LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 63

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY+P +S+TS  +                    P+ + Y  DG+ S G L +D       
Sbjct: 64  IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 98

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
                     + FG   +    F D    + P G+ G+G        D  +VP  L NQG
Sbjct: 99  ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 148

Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
           +I  N++S+   S    TG+I FG         +  ++  T      IT+  +   G  +
Sbjct: 149 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 208

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           N     + DSGT+ TYL       I + F +  K   +  T
Sbjct: 209 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 249


>gi|353678009|sp|C4YSF6.1|CARP1_CANAW RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
           Full=Aspartate protease 1; AltName: Full=Secreted
           aspartic protease 1; Flags: Precursor
 gi|238883021|gb|EEQ46659.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 391

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
           LN+    +  ++++G     F V +DTGS   W+P   V+C        GQ  DF     
Sbjct: 57  LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY+P +S+TS  +                    P+ + Y  DG+ S G L +D       
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
                     + FG   +    F D    + P G+ G+G        D  +VP  L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198

Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
           +I  N++S+   S    TG+I FG         +  ++  T      IT+  +   G  +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           N     + DSGT+ TYL       I + F +  K   +  T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299


>gi|353678008|sp|P0CY27.1|CARP1_CANAL RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
           Full=Aspartate protease 1; AltName: Full=Secreted
           aspartic protease 1; Flags: Precursor
 gi|7548436|gb|AAA34369.2| secreted aspartyl proteinase 1 [Candida albicans]
          Length = 391

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)

Query: 98  LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
           LN+    +  ++++G     F V +DTGS   W+P   V+C        GQ  DF     
Sbjct: 57  LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113

Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
           IY+P +S+TS  +                    P+ + Y  DG+ S G L +D       
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148

Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
                     + FG   +    F D    + P G+ G+G        D  +VP  L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198

Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
           +I  N++S+   S    TG+I FG         +  ++  T      IT+  +   G  +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258

Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
           N     + DSGT+ TYL       I + F +  K   +  T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299


>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C        LQ+  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ + L     S C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T+   ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312


>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
 gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
 gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
 gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
 gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)

Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           VS+G+P +  +VA+DTGS L W+ C  C    H  ++ +G + D     P  S TS +V 
Sbjct: 3   VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57

Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
           C+S  C        LQ+  C     +C Y V Y +    S G +V D L +         
Sbjct: 58  CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114

Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
               + FGC   V+   F  G    G       +     P IL+ +     +FS C  +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165

Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
            T  G +  G  D+ +   G TP       PTY++T    ++ G   V      I DSG 
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGA 225

Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
             T L    +  + +T            TS +      CY+           ++P     
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285

Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
             P++ +   GG          F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 108/270 (40%), Gaps = 73/270 (27%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
           N+SVG P L+F V  DTGSDL W  C  C  C                + P +SST SK+
Sbjct: 89  NISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139

Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
           PC S+ C+      + C + G  C Y  +Y S  T   G+L  + L +      S     
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190

Query: 223 RISFGCGRVQTGSFLDGAAPNGL--FGLGMDKTSVPSILANQGLIPNSFSMCFGSD---G 277
            ++FGC           +  NGL    LG+ +                FS C  S    G
Sbjct: 191 -VAFGC-----------STENGLGQLDLGVGR----------------FSYCLRSGSAAG 222

Query: 278 TGRISFGDKGSPGQG---ETPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS--- 324
              I FG   +   G    TPF +    HP+ Y + +T ++VG   +      F F+   
Sbjct: 223 ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNG 282

Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNS 350
                I DSGT+ TYL    Y  + + F S
Sbjct: 283 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 312


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 95/330 (28%), Positives = 129/330 (39%), Gaps = 58/330 (17%)

Query: 64  AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
           +HR  Y       L A     T +  ++GN  +  N     +     +G P     + LD
Sbjct: 72  SHRLTYLS----SLVAGKPKPTSVPVASGNQLHIGN-----YVVRAKLGTPPQLMFMVLD 122

Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCP 180
           T +D  WLPC  C  C +   S +            +SST S V C++  C   +   CP
Sbjct: 123 TSNDAVWLPCSGCSGCSNASTSFNTN----------SSSTYSTVSCSTAQCTQARGLTCP 172

Query: 181 SAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
           S+    S C +   Y  D + S   LV+D L LA D      V    SFGC    +G+ L
Sbjct: 173 SSSPQPSVCSFNQSYGGDSSFSAS-LVQDTLTLAPD------VIPNFSFGCINSASGNSL 225

Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE 293
               P GL GLG    S+  +     L    FS C  S      +G +  G  G P    
Sbjct: 226 ---PPQGLMGLGRGPMSL--VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIR 280

Query: 294 -TPFSLRQTHPT-YNITITQVSVGG-----NAVNFEFSA------IFDSGTSFTYLNDPA 340
            TP       P+ Y + +T VSVG      + V   F A      I DSGT  T    P 
Sbjct: 281 YTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPV 340

Query: 341 YTQISETFNSLAKEKRETSTSDL-PFEYCY 369
           Y  I + F    K+   +S S L  F+ C+
Sbjct: 341 YEAIRDEFR---KQVNVSSFSTLGAFDTCF 367


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 118/287 (41%), Gaps = 36/287 (12%)

Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
           P +   V LD+ SD+ W+   CV C   +     QV  F  Y P+ S +S+   C+S  C
Sbjct: 155 PGVIQTVVLDSASDVPWV--QCVPCP--IPPCHPQVDSF--YDPSRSPSSAPFSCSSPTC 208

Query: 174 EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
                    C  A + C Y VRY  DG+ ++G  + D+L L      + +  S   FGC 
Sbjct: 209 TALGPYANGC--ANNQCQYLVRY-PDGSSTSGAYIADLLTL-----DAGNAVSGFKFGCS 260

Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSP 289
             + GSF   AA  G+  LG    S+ S  A++    N+FS C  +  +    F   G P
Sbjct: 261 HAEQGSFDARAA--GIMALGGGPESLLSQTASR--YGNAFSYCIPATASDS-GFFTLGVP 315

Query: 290 GQGETPF------SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
            +  + +        RQ    Y + +  ++VGG  +      F   ++ DS T+ T L  
Sbjct: 316 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 375

Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
            AY  +   F S     R         + CY  +    N   P ++L
Sbjct: 376 TAYQALRSAFRSSMTMYRSAPPKGY-LDTCYDFT-GVVNIRLPKISL 420


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 126/323 (39%), Gaps = 58/323 (17%)

Query: 46  LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH 105
           L +  +  KG +     +       RLR    A  G D T         + RL+S+   +
Sbjct: 25  LVLTHVDSKGGYTKTELMRRAVHRSRLR----ALSGYDAT---------SPRLHSVQVEY 71

Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
              +++G+P + F+   DTGSDL W  C  C  C            D  +Y P+ SST S
Sbjct: 72  LMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASSTFS 122

Query: 165 KVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
            +PC+S  C     + C +  S C Y+  Y  DG  S G L  + L L            
Sbjct: 123 PLPCSSATCLPIWSRNC-TPSSLCRYRYAY-GDGAYSAGILGTETLTLGPSSAPVSV--G 178

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGT 278
            ++FGCG    G  L+     G  GLG       S+LA  G+    FS C    F S   
Sbjct: 179 GVAFGCGTDNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSALD 230

Query: 279 GRISFGDKGSPGQG-----ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSA-- 325
                G       G      TP      +P+ Y +++  +S+G   +      F+     
Sbjct: 231 SPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDG 290

Query: 326 ----IFDSGTSFTYLNDPAYTQI 344
               I DSGT+FT L +  + ++
Sbjct: 291 TGGMIVDSGTTFTILAESGFREV 313


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/336 (24%), Positives = 121/336 (36%), Gaps = 65/336 (19%)

Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
           +     +G PA + +VA+D  +D  W+PC  C  C     S          +SP  SST 
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 151

Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
             VPC S  C       CP+  GS+C + + Y +    +   L +D L L  +      V
Sbjct: 152 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 203

Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
               +FGC RV  G+    A  + L        +   ++A+QG                 
Sbjct: 204 VVSYTFGCLRVVNGNSRAAAGAHRL-----RPRAALLLVADQG----------------- 241

Query: 281 ISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA-----------IF 327
              G  G P + +T   L   H P+ Y + +  + VG   V    SA           I 
Sbjct: 242 -HLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTII 300

Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
           D+GT FT L  P Y  + + F    +           F+ CY           P V    
Sbjct: 301 DAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY-----NVTVSVPTVTFMF 353

Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
            G     + +  V++ S   G+    +    SD VN
Sbjct: 354 AGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVN 389


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 119/313 (38%), Gaps = 49/313 (15%)

Query: 68  RYFRLRGRGLAAQ-----GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP-ALSFIVA 121
           R   +R R  AA      G    P T   G     +NS   +H   +S+G P +   ++ 
Sbjct: 53  RRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIH---LSIGAPRSQPVVLT 109

Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
           LDTGSD+ W  C+ C  C       +  +  F+  + NT  +   V C+  LC    +  
Sbjct: 110 LDTGSDVVWTQCEPCAECF------TQPLPRFDTAASNTVRS---VACSDPLCNAHSEHG 160

Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
                C Y   Y  DG++S G  + D       +   K     I FGCG    G FL   
Sbjct: 161 CFLHGCTYVSGY-GDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQ-- 217

Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS------FGDKGSPGQG-- 292
              G+ G G    S+PS L  +      FS CF +    + S       GD  +   G  
Sbjct: 218 TETGIAGFGRGPLSLPSQLKVR-----QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPI 272

Query: 293 -ETPFSLRQTHP-----TYNITITQVSVGGNAVNF-EFSA------IFDSGTSFTYLNDP 339
             TPF +R   P      Y ++   V+VG   +   E  A        DSGT  T   D 
Sbjct: 273 LSTPF-VRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSGTDITTFPDA 331

Query: 340 AYTQISETFNSLA 352
            + Q+   F + A
Sbjct: 332 VFRQLKSAFIAQA 344


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 87/336 (25%), Positives = 139/336 (41%), Gaps = 56/336 (16%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
           +++VG P  +  + +DTGS+L WL C         N+S       + ++P  SS+ S +P
Sbjct: 76  SLTVGTPPQNVTMVIDTGSELSWLHC---------NTSQNSSSSSSTFNPVWSSSYSPIP 126

Query: 168 CNSTLCELQKQ----CPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
           C+S+ C  Q +     PS  SN  C   + Y +D + S G L  D  ++ +      S  
Sbjct: 127 CSSSTCTDQTRDFPIRPSCDSNQFCHATLSY-ADASSSEGNLATDTFYIGS------SGI 179

Query: 222 SRISFGC-GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTG 279
             + FGC   + + +  + +   GL  +GM++ S+ S ++  G     FS C    D +G
Sbjct: 180 PNVVFGCMDSIFSSNSEEDSKNTGL--MGMNRGSL-SFVSQMGF--PKFSYCISEYDFSG 234

Query: 280 RISFGDKG----SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF------------ 323
            +  GD      +P        +    P ++     V + G  V  +             
Sbjct: 235 LLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDH 294

Query: 324 ----SAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE----YCYVLSPN 374
                 + DSGT FT+L  PAYT + + F N  A   R    S+  F+     CY +  N
Sbjct: 295 TGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTN 354

Query: 375 QTNF-EYPVVNLTMKGGGPFFVNDPIVI-VSSEPKG 408
           QT     P V L  +G       D I+  V  E +G
Sbjct: 355 QTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRG 390


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/272 (27%), Positives = 108/272 (39%), Gaps = 42/272 (15%)

Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
            +S+G P       +DTGSDL WL CD  +C H      G+ I F+    + SS+  K+P
Sbjct: 8   ELSIGTPPQLIPAMIDTGSDLVWLKCD--NCDHCDLDHHGETIFFS----DASSSYKKLP 61

Query: 168 CNSTLCELQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD--EKQSKSVDS 222
           CNST C         P     C Y+  Y  DG+ ++G +  D +   +    +  +S   
Sbjct: 62  CNSTHCSGMSSAGIGPRCEETCKYKYEY-GDGSRTSGDVGSDRISFRSHGAGEDHRSFFD 120

Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GR 280
              FGCGR   G   D     GL GLG    S+   L ++  +   FS C  S  +    
Sbjct: 121 GFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSPPSA 175

Query: 281 ISFGDKGSPG--QGETPFSLRQTH------PTYNITITQVSVGGNAVN------------ 320
            SF   GS    +G    S    H        Y + +  ++VGG  V             
Sbjct: 176 KSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSV 235

Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFN 349
                   + DSGT++T L  P Y  + ++  
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIE 267


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.421 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,403,271,145
Number of Sequences: 23463169
Number of extensions: 335123496
Number of successful extensions: 674075
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 316
Number of HSP's successfully gapped in prelim test: 2285
Number of HSP's that attempted gapping in prelim test: 669863
Number of HSP's gapped (non-prelim): 2957
length of query: 444
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 298
effective length of database: 8,933,572,693
effective search space: 2662204662514
effective search space used: 2662204662514
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)