BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 016916
         (380 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  523 bits (1346), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 247/353 (69%), Positives = 291/353 (82%), Gaps = 3/353 (0%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           S  F    SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 36  SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 95

Query: 87  VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
           +EAPHPLY+PS+DL+PC DP+C +LH   +  CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 96  LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 155

Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           + NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ  ++NV+G
Sbjct: 156 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 215

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
           HCLS  GGG LFFGDDLYDSSRV WT MS +Y+K+YSP +  EL FGG TTGLKNL  VF
Sbjct: 216 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 275

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSGSSYTY N   YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 276 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 335

Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            LALSF  G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 336 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 388


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  523 bits (1346), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 247/353 (69%), Positives = 291/353 (82%), Gaps = 3/353 (0%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           S  F    SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 36  SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 95

Query: 87  VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
           +EAPHPLY+PS+DL+PC DP+C +LH   +  CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 96  LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 155

Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           + NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ  ++NV+G
Sbjct: 156 SMNYTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 215

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
           HCLS  GGG LFFGDDLYDSSRV WT MS +Y+K+YSP +  EL FGG TTGLKNL  VF
Sbjct: 216 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 275

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSGSSYTY N   YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 276 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 335

Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            LALSF  G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 336 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 388


>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 401

 Score =  522 bits (1345), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 247/352 (70%), Positives = 291/352 (82%), Gaps = 3/352 (0%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           S  F    SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 33  SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 92

Query: 87  VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
           +EAPHPLY+PS+DL+PC DP+C +LH   +  CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 93  LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 152

Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           + NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ  ++NV+G
Sbjct: 153 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 212

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
           HCLS  GGG LFFGDDLYDSSRV WT MS +Y+K+YSP +  EL FGG TTGLKNL  VF
Sbjct: 213 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 272

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSGSSYTY N   YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 273 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 332

Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
            LALSF  G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IGG
Sbjct: 333 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGG 384


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 247/353 (69%), Positives = 291/353 (82%), Gaps = 3/353 (0%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           S  F    SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 24  SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 83

Query: 87  VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
           +EAPHPLY+PS+DL+PC DP+C +LH   +  CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 84  LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 143

Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           + NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ  ++NV+G
Sbjct: 144 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 203

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
           HCLS  GGG LFFGDDLYDSSRV WT MS +Y+K+YSP +  EL FGG TTGLKNL  VF
Sbjct: 204 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 263

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSGSSYTY N   YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 264 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 323

Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            LALSF  G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 324 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 376


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score =  522 bits (1344), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 246/354 (69%), Positives = 292/354 (82%), Gaps = 3/354 (0%)

Query: 26  SSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR 85
           ++  F    SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCV 
Sbjct: 32  AADRFTRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVH 91

Query: 86  CVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDA 145
           C+EAPHPLY+PSNDL+PC DP+C +LH  G+H CE P QCDYE+EYADGGSSLGVLV+D 
Sbjct: 92  CLEAPHPLYQPSNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDV 151

Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKLIRNVV 204
           F+ NYT G RL PRLALGCGY+Q+PGAS +HPLDG+LGLG+GK SI+SQLHSQ  ++NVV
Sbjct: 152 FSLNYTKGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVV 211

Query: 205 GHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVV 263
           GHCLS  GGG LFFG+DLYDSSRV WT M+ + +K+YSP +  EL FGG TTGLKNL  V
Sbjct: 212 GHCLSSLGGGILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLLTV 271

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTY N   YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F
Sbjct: 272 FDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYF 331

Query: 324 RTLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + LALSF  G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 332 KPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 385


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score =  516 bits (1328), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 248/354 (70%), Positives = 293/354 (82%), Gaps = 2/354 (0%)

Query: 24  SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           ++SSSL N + SS++F ++GNVYP GYY V++ IGQP +PYFLD DTGSDL+WLQCDAPC
Sbjct: 40  AASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPC 99

Query: 84  VRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
           VRC +APHPLYRP+N+LV C+DP+CASLH PG+  CE P QCDYE+EYADGGSSLGVLVK
Sbjct: 100 VRCTKAPHPLYRPNNNLVICKDPMCASLHPPGY-KCEHPEQCDYEVEYADGGSSLGVLVK 158

Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
           D F  N+TNG RL PRLALGCGY+Q+PG SYHPLDG+LGLGKGKSSIVSQLHSQ +IRNV
Sbjct: 159 DVFPLNFTNGLRLAPRLALGCGYDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNV 218

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVV 263
           VGHC+S  GGGFLFFGDDLYDSSRVVWT M  D   +YS G AEL  GG+TT  KNL V 
Sbjct: 219 VGHCVSSRGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVT 278

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYLN + YQ L  +++KELS K ++EA +D+TLPLCW+G+RPFK+V DVKK F
Sbjct: 279 FDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFF 338

Query: 324 RTLALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + LALSF   G+T+T +++  E+YLIIS KGNVCLGILNG E GLQD N+IG I
Sbjct: 339 KPLALSFPGGGRTKTQYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDI 392


>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
 gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
          Length = 420

 Score =  510 bits (1314), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 245/362 (67%), Positives = 289/362 (79%), Gaps = 20/362 (5%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC+EAPHPLY
Sbjct: 22  SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 81

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
           +PS+DL+PC DP+C +LH   +  CE P QCDYE+EYADGGSSLGVLV+D F+ NYT G 
Sbjct: 82  QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGL 141

Query: 155 RLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
           RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ  ++NV+GHCLS  GG
Sbjct: 142 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 201

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVFDSGSSYTY 272
           G LFFGDDLYDSSRV WT MS +Y+K+YSP +  EL FGG TTGLKNL  VFDSGSSYTY
Sbjct: 202 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTY 261

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
            N   YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+ LALSF  
Sbjct: 262 FNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKT 321

Query: 333 G-KTRTLFELTPEAYLIIS-----------------NKGNVCLGILNGAEVGLQDLNVIG 374
           G +++TLFE+ PEAYLIIS                  KGNVCLGILNG E+GLQ+LN+IG
Sbjct: 322 GWRSKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNVCLGILNGTEIGLQNLNLIG 381

Query: 375 GI 376
            I
Sbjct: 382 DI 383


>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
 gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
          Length = 433

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 253/364 (69%), Positives = 295/364 (81%), Gaps = 3/364 (0%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R +  S   +SS + N  GSSL+F +HGNVYP GYYNVT+ IGQPA+PYFLD+DTGSDLT
Sbjct: 36  RKAVLSGEITSSMMINRAGSSLVFPLHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLT 95

Query: 76  WLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG 135
           WLQCDAPC +C+EAPHPLYRPSN+LV CEDP+CASL  PG HNC+DP QCDYE+EYADGG
Sbjct: 96  WLQCDAPCRQCIEAPHPLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGG 155

Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
           SSLGVLVKD F  N+TNG+RLNP LALGCGY+Q+PG S HPLDGILGLG+G SSI SQL 
Sbjct: 156 SSLGVLVKDVFVLNFTNGKRLNPLLALGCGYDQLPGRSNHPLDGILGLGRGISSIPSQLS 215

Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
           SQ L+ NV+GHCLSG GGGFLFFG+D+YDSS V WT MS D+ K+YSPG AEL F G++T
Sbjct: 216 SQGLVSNVIGHCLSGRGGGFLFFGEDIYDSSGVTWTPMSRDHLKHYSPGFAELIFDGKST 275

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
           G++NL VVFDSGSSYTYLN   YQ L   +K+ELS K + EA +D+TLPLCWKG+RPFK+
Sbjct: 276 GIRNLLVVFDSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCWKGKRPFKS 335

Query: 316 VHDVKKCFRTLALSFTDGKTR---TLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNV 372
           + DVKK F+  AL F     R   T FE +PEAYLIIS+KGN CLGILNG EVGL+DLNV
Sbjct: 336 IRDVKKYFKPFALVFKTSSGRSSKTQFEFSPEAYLIISSKGNACLGILNGTEVGLRDLNV 395

Query: 373 IGGI 376
           IG +
Sbjct: 396 IGDV 399


>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 242/354 (68%), Positives = 287/354 (81%), Gaps = 4/354 (1%)

Query: 24  SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           ++SSSL N + SS++F ++GNVYP GYY V++ IGQP  PYFLD  TGSDL+WLQCDAPC
Sbjct: 40  AASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPC 99

Query: 84  VRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
           VRC +A H LYRP+N+LV C+DP+CA LH PG+  CE P QCDYE+EYADGGSSLGVLVK
Sbjct: 100 VRCTKAXHXLYRPNNNLVICKDPMCAXLHPPGY-KCEHPEQCDYEVEYADGGSSLGVLVK 158

Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
           D F  N+TNG RL PRLALGCGY+Q+PG SYHPLDG+LGLGKGKSSIVSQLHSQ +IRNV
Sbjct: 159 DVFPLNFTNGLRLAPRLALGCGYDQIPGXSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNV 218

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVV 263
           VGHC+S  GGGFLFFGDDLYDSSRVVWT M  D   +YS G AEL  GG+TT  KNL V 
Sbjct: 219 VGHCVSSHGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVT 278

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYLN + YQ L  +++KELS K ++EA +D+TLPLCW+G+RPFK+V DV+K F
Sbjct: 279 FDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVRKFF 338

Query: 324 RTLALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + LALSF   G+T+T +++  E+YLIIS  GNVCLGILNG E GLQD N+IG I
Sbjct: 339 KPLALSFAGGGRTKTQYDIPLESYLIIS--GNVCLGILNGTEAGLQDFNLIGDI 390


>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
          Length = 393

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 251/358 (70%), Positives = 292/358 (81%), Gaps = 2/358 (0%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
           S  + +SS L N V SS++  +HGNVYP GYYNVT+ IGQP++PYFLD+DTGSDLTWLQC
Sbjct: 3   SGETMASSMLINRVPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQC 62

Query: 80  DAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLG 139
           DAPCV+C EAPHP YRP N+LVPC DPIC SLH+ G H CE+P QCDYE+EYADGGSS G
Sbjct: 63  DAPCVQCTEAPHPYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFG 122

Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           VLV D F  N+T+ +R +P LALGCGY+Q PG S+HP+DG+LGLGKGKSSIVSQL S  L
Sbjct: 123 VLVTDTFNLNFTSEKRHSPLLALGCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGL 182

Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
           +RNV+GHCLSG GGGFLFFGDDLYDSSRV WT MS D  K+YSPG+AEL F G+TTG KN
Sbjct: 183 VRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGFKN 241

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
           L   FDSG+SYTYLN   YQ L S++KKELS K L+EA +D+TLPLCWKGR+PFK++ DV
Sbjct: 242 LLTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDV 301

Query: 320 KKCFRTLALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           KK F+T ALSFT + K++T  E  PEAYLIIS+KGN CLGILNG EVGL DLNVIG I
Sbjct: 302 KKYFKTFALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDI 359


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score =  489 bits (1259), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 228/337 (67%), Positives = 278/337 (82%), Gaps = 2/337 (0%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           ++GNVYP+GYY+V   IGQP +PYFLD DTGSDLTWLQCDAPC++C  APHPLY+P+NDL
Sbjct: 57  LYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDL 116

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
           V C+DPICASLH P ++ C+DP QCDYE+EYADGGSS+GVLV D F  N T+G R  PRL
Sbjct: 117 VVCKDPICASLH-PDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRL 175

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
            +GCGY+Q+PG +YHPLDG+LGLG+G SSIV+QL SQ L+RNVVGHC S  GGG+LFFGD
Sbjct: 176 TIGCGYDQLPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGD 235

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
           D+YDSS+V+WT MS DY K+Y+PG AEL   G ++GLKNL VVFDSGSSYTY N  TYQT
Sbjct: 236 DIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQT 295

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG-KTRTLF 339
           L S +KK+L  K LKEA ED+TLP+CW+G++PFK++ D KK F+ LALSF  G KT++ F
Sbjct: 296 LLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQF 355

Query: 340 ELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           E+  E+YLIIS+KG+VCLGILNG EVGLQ+ N+IG I
Sbjct: 356 EIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDI 392


>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
 gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
          Length = 376

 Score =  483 bits (1242), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 246/347 (70%), Positives = 286/347 (82%), Gaps = 3/347 (0%)

Query: 32  HVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
            V SS++  +HGNVYP GYYNVT+ IGQP++PYFLD+DTGSDLTWLQCDAPCV+C EAPH
Sbjct: 1   RVPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH 60

Query: 92  PLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           P YRP N+LVPC DPIC SLH+ G H CE+P QCDYE+EYADGGSS GVLV+D F  N+T
Sbjct: 61  PYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNLNFT 120

Query: 152 NGQRLNPRLALG-CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           + +R +P LALG CGY+Q PG S+HP+DG+LGLGKGKSSIVSQL S  L+RNV+GHCLSG
Sbjct: 121 SEKRHSPLLALGLCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSG 180

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
            GGGFLFFGDDLYDSSRV WT MS D  K+YSPG+AEL F G+TTG KNL   FDSG+SY
Sbjct: 181 HGGGFLFFGDDLYDSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGFKNLLTTFDSGASY 239

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TYLN   YQ L S++KKELS K L+EA +D+TLPLCWKGR+PFK++ DVKK F+T ALSF
Sbjct: 240 TYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSF 299

Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           T + K++T  E  PEAYLIIS+KGN CLGILNG EVGL DLNVIG I
Sbjct: 300 TNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDI 346


>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 221/342 (64%), Positives = 276/342 (80%), Gaps = 2/342 (0%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP 96
           ++  + GNVYP G+YNVT+Y+GQP +PYFLD DTGSDLTWLQCDAPC +C E  HPLY+P
Sbjct: 43  IVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQP 102

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           SNDLVPC+DP+C SLH+   H CE+P QCDYE+EYADGGSSLGVLV+D F  N TNG  +
Sbjct: 103 SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPI 162

Query: 157 NPRLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
            PRLALGCGY+Q PG+S YHP+DGILGLG+G  SIVSQLH+Q ++RNVVGHC +  GGG+
Sbjct: 163 RPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGY 222

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
           LFFGD +YD  R+VWT MS DY K+YSPG  EL F G +TGL+NL VVFDSGSSYTY N 
Sbjct: 223 LFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNA 282

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD-GK 334
             YQ LTS++ +EL+ K L+EA +D+TLPLCW+GR+P K++ DV+K F+ LALSF+  G+
Sbjct: 283 QAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGR 342

Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           ++ +FE+  E Y+IIS+ GNVCLGILNG +VGL++ N+IG I
Sbjct: 343 SKAVFEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDI 384


>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
 gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
          Length = 379

 Score =  473 bits (1217), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 239/347 (68%), Positives = 284/347 (81%), Gaps = 3/347 (0%)

Query: 32  HVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
            V SS++  +HGNVYPTG+YNVT+ IGQP++PYFLD+DTGSDLTWLQCD P  +C EAPH
Sbjct: 1   RVPSSIVLPLHGNVYPTGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPH 60

Query: 92  PLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           P Y+PSN+LV C+DPIC SLH  G   CE+P QCDYE+EYADGGSSLGVLVKDAF  N+T
Sbjct: 61  PYYKPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNLNFT 120

Query: 152 NGQRLNPRLALG-CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           + +R +P LALG CGY+Q+PG +YHP+DG+LGLG+GK SIVSQL    L+RNV+GHCLSG
Sbjct: 121 SEKRQSPLLALGLCGYDQLPGGTYHPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSG 180

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
            GGGFLFFGDDLYDSSRV WT MS +  K+YSPG AEL F G+TTG KNL V FDSG+SY
Sbjct: 181 RGGGFLFFGDDLYDSSRVAWTPMSPN-AKHYSPGFAELTFDGKTTGFKNLIVAFDSGASY 239

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TYLN   YQ L S++K+ELS K L+EA +D+TLP+CWKGR+PFK+V DVKK F+T ALSF
Sbjct: 240 TYLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVRDVKKYFKTFALSF 299

Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             DGK++T  E  PEAYLI+S+KGN CLG+LNG EVGL DLNVIG I
Sbjct: 300 ANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDI 346


>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 440

 Score =  470 bits (1209), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 221/347 (63%), Positives = 277/347 (79%), Gaps = 4/347 (1%)

Query: 33  VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
            GSS++F VHGNVYP G+YNVT+ IGQP RPYFLD+DTGSDLTWLQCDAPC RC + PHP
Sbjct: 61  AGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP 120

Query: 93  LYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           LYRPSNDLVPC   +CASLH   +++CE P QCDYE++YAD  SSLGVL+ D +  N+TN
Sbjct: 121 LYRPSNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTN 180

Query: 153 GQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
           G +L  R+ALGCGY+Q+ P  S+HPLDG+LGLG+GK+S+ SQL+SQ L+RNV+GHCLS  
Sbjct: 181 GVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQ 240

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGG++FFG D+YDS R+ WT MSS DY  Y   G AEL FGG+ +G+ NL  VFD+GSSY
Sbjct: 241 GGGYIFFG-DVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSY 299

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY N   YQ L S +KKE   K LKEA +D+TLPLCW+GRRPF+++++V+K F+ + LSF
Sbjct: 300 TYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSF 359

Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           T +G+++  FE+ PEAYLI+SN GNVCLGILNG+EVG+ DLN+IG I
Sbjct: 360 TSNGRSKAQFEMLPEAYLIVSNMGNVCLGILNGSEVGMGDLNLIGDI 406


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score =  467 bits (1201), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 220/347 (63%), Positives = 277/347 (79%), Gaps = 4/347 (1%)

Query: 33  VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
            GSS++F VHGNVYP G+YNVT+ IGQP RPYFLD+DTGSDLTWLQCDAPC RC + PHP
Sbjct: 59  AGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP 118

Query: 93  LYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           LYRPSND VPC   +CASLH   +++CE P QCDYE++YAD  SSLGVL+ D +  N+TN
Sbjct: 119 LYRPSNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTN 178

Query: 153 GQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
           G +L  R+ALGCGY+Q+ P  S+HPLDG+LGLG+GK+S+ SQL+SQ L+RNV+GHCLS  
Sbjct: 179 GVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQ 238

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGG++FFG D+YDSSR+ WT MSS DY  Y + G AEL FGG+ +G+ +L  VFD+GSSY
Sbjct: 239 GGGYIFFG-DVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSY 297

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY N   YQ L S + KE   K LKEA +D+TLPLCW+GRRPF+++++V+K F+ + LSF
Sbjct: 298 TYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSF 357

Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           T +G+++  FE+ PEAYLIISN GNVCLGILNG+EVG+ DLN+IG I
Sbjct: 358 TSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGMGDLNLIGDI 404


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score =  466 bits (1200), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 220/357 (61%), Positives = 285/357 (79%), Gaps = 5/357 (1%)

Query: 24  SSSSSLFNHV-GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           SS  SL NH  GSS++F ++GNVYP G+YNVT+ IGQP RPYFLD+DTGS+LTWLQCDAP
Sbjct: 46  SSRPSLMNHAAGSSIVFPIYGNVYPVGFYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAP 105

Query: 83  CVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
           C +C E PHPLY+PSND +PC+DP+CASL     + CEDP QCDYE++YAD  S+LGVL+
Sbjct: 106 CSQCSETPHPLYKPSNDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGVLL 165

Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
            D +  N+TNG +L  R+ALGCGY+Q+   ++YHPLDGILGLG+GK+S++SQL+SQ L+R
Sbjct: 166 NDVYLLNFTNGVQLKVRMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVR 225

Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNL 260
           NV+GHCLS  GGG++FFG ++YDSSR+ WT +SS D  K+YS G AEL FGG  TG+ +L
Sbjct: 226 NVMGHCLSSRGGGYIFFG-NVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSL 284

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            ++FD+GSSYTY N   YQ + S++ KEL  K +K AP+D+TLP+CW G+RPF+++++VK
Sbjct: 285 NIIFDTGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVK 344

Query: 321 KCFRTLALSFTD-GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           K F+ L LSFT+ G+ +  FE+ PEAYLIISN GNVCLGILNG EVGL +LN+IG I
Sbjct: 345 KYFKPLTLSFTNGGRVKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDI 401


>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 440

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 222/345 (64%), Positives = 274/345 (79%), Gaps = 8/345 (2%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           GSS++F VHGNVYP G+YNVT+ IG P RPYFLD+DTGSDLTWLQCDAPC RC + PHPL
Sbjct: 68  GSSVVFPVHGNVYPVGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPL 127

Query: 94  YRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           YRPSNDLVPC  P+CAS+H   ++ CE   QCDYE+EYAD  SSLGVLV D +  N+TNG
Sbjct: 128 YRPSNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDVYVLNFTNG 187

Query: 154 QRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
            +L  R+ALGCGY+Q+ P +SYHP+DG+LGLG+GKSS++SQL+ Q L+RNVVGHCLS  G
Sbjct: 188 VQLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQG 247

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
           GG++FFG D+YDSSR+ WT MSS   K+YS G AEL  GG+ TG  NL  VFD+GSSYTY
Sbjct: 248 GGYIFFG-DVYDSSRLAWTPMSSRDYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTY 306

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
            N   YQ     + KEL+ K +KEAPED+TLPLCW G+RPF++V++VKK F+ +ALSF  
Sbjct: 307 FNSNAYQ-----LTKELAGKPIKEAPEDQTLPLCWYGKRPFRSVYEVKKYFKPIALSFPG 361

Query: 333 G-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             +++  FE+ PEAYLIISN GNVCLGIL+G+EVG++DLN+IG I
Sbjct: 362 SRRSKAQFEIPPEAYLIISNMGNVCLGILDGSEVGVEDLNLIGDI 406


>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Cucumis sativus]
          Length = 418

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 220/342 (64%), Positives = 275/342 (80%), Gaps = 2/342 (0%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP 96
           ++  + GNVYP G+YNVT+Y+GQP +PYFLD DTGSDLTWLQCDAPC +C E  HPLY+P
Sbjct: 43  IVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQP 102

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           SNDLVPC+DP+C SLH+   H CE+P QCDYE+EYADGGSSLGVLV+D F  N TNG  +
Sbjct: 103 SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPI 162

Query: 157 NPRLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
            PRLALGCGY+Q PG+S YHP+DGILGLG+G  SIVSQLH+Q ++RNVVGHC +  GGG+
Sbjct: 163 RPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGY 222

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
            FFGD +YD  R+VWT MS DY K+YSPG  EL F G +TGL+NL VVFDSGSSYTY N 
Sbjct: 223 XFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNA 282

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD-GK 334
             YQ LTS++ +EL+ K L+EA +D+TLPLCW+GR+P K++ DV+K F+ LALSF+  G+
Sbjct: 283 QAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGR 342

Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           ++ +FE+  E Y+IIS+ GNVCLGILNG +VGL++ N+IG I
Sbjct: 343 SKAVFEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDI 384


>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 429

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 227/359 (63%), Positives = 283/359 (78%), Gaps = 3/359 (0%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
           S ++SS S L N  GSS++  ++GNVYP G+YNVT+ IGQPARPYFLD+DTGSDLTWLQC
Sbjct: 38  SEATSSRSRLLNPAGSSIVLPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQC 97

Query: 80  DAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLG 139
           DAPC  C E PHPLYRPSND VPC DP+CASL     +NCE P QCDYE+ YAD  S+ G
Sbjct: 98  DAPCTHCSETPHPLYRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTFG 157

Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQK 198
           VL+ D +  N+TNG +L  R+ALGCGY+QV   +SYHPLDG+LGLG+GK+S++SQL+SQ 
Sbjct: 158 VLLNDVYLLNFTNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQG 217

Query: 199 LIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
           L+RNV+GHCLS  GGG++FFG + YDS+RV WT +SS  +K+YS G AEL FGG  TG+ 
Sbjct: 218 LVRNVIGHCLSAQGGGYIFFG-NAYDSARVTWTPISSVDSKHYSAGPAELVFGGRKTGVG 276

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
           +L  VFD+GSSYTY N   YQ L S +KKELS K LK AP+D+TLPLCW G+RPF ++ +
Sbjct: 277 SLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCWHGKRPFTSLRE 336

Query: 319 VKKCFRTLALSFTD-GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           V+K F+ +AL FT+ G+T+  FE+ PEAYLIISN GNVCLGILNG+EVGL++LN+IG I
Sbjct: 337 VRKYFKPVALGFTNGGRTKAQFEILPEAYLIISNLGNVCLGILNGSEVGLEELNLIGDI 395


>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 431

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 222/355 (62%), Positives = 277/355 (78%), Gaps = 3/355 (0%)

Query: 24  SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           SS  SL N  GSS++F ++GNVYP G+YNVT+ IGQPARPYFLD+DTGSDLTWLQCDAPC
Sbjct: 44  SSWPSLLNPAGSSIVFPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPC 103

Query: 84  VRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
             C E PHPL+RPSND VPC DP+CASL     +NCE P QCDYE+ YAD  S+ GVL+ 
Sbjct: 104 THCSETPHPLHRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTYGVLLN 163

Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           D +  N +NG +L  R+ALGCGY+QV   +SYHPLDG+LGLG+GK+S++SQL+SQ L+RN
Sbjct: 164 DVYLLNSSNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRN 223

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV 262
           V+GHCLS  GGG++FFG + YDS+RV WT +SS  +K+YS G AEL FGG  TG+ +L  
Sbjct: 224 VIGHCLSSQGGGYIFFG-NAYDSARVTWTPISSVDSKHYSAGPAELVFGGRKTGVGSLTA 282

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           VFD+GSSYTY N   YQ L S + KELS K LK AP+D+TL LCW G+RPF ++ +V+K 
Sbjct: 283 VFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCWHGKRPFTSLREVRKY 342

Query: 323 FRTLALSFTD-GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           F+ +ALSFT+ G+ +  FE+ PEAYLIISN GNVCLGILNG EVGL++LN++G I
Sbjct: 343 FKPVALSFTNGGRVKAQFEIPPEAYLIISNLGNVCLGILNGFEVGLEELNLVGDI 397


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 202/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42  SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101

Query: 95  RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
           RP+ N LVPC D +CA+LH    G H C+ P  QCDYE++YAD GSSLGVLV D+FA   
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161

Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
            N   + P LA GCGY+Q  G+S      DG+LGLG G  S++SQL    + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDD+   SR  W  M+   ++ YYSPG A L+FGG   G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SS+TY +   YQ L   +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK FRT+ 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVV 339

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LSF++GK + L E+ PE YLI++  GN CLGILNG+EVGL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDI 387


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42  SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101

Query: 95  RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
           RP+ N LVPC D +CA+LH    G H C+ P  QCDYE++YAD GSSLGVLV D+FA   
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161

Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
            N   + P LA GCGY+Q  G+S      DG+LGLG G  S++SQL    + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDD+   SR  W  M+   ++ YYSPG A L+FGG   G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SS+TY +   YQ L   +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK FRT+ 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVV 339

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LSF++GK + L E+ PE YLI++  GN CLGILNG+EVGL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDI 387


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 201/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42  SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101

Query: 95  RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
           RP+ N LVPC D +CA+LH    G H C+ P  QCDYE++YAD GSSLGVLV D+FA   
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161

Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
            N   + P LA GCGY+Q  G+S      DG+LGLG G  S++SQL    + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDD+   SR  W  M+   ++ YYSPG A L+FGG   G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SS+TY +   YQ L   +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK F+T+ 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFKTVV 339

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LSF++GK + L E+ PE YLI++  GN CLGILNG+EVGL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDI 387


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 199/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +FQ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42  SSAVFQLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLY 101

Query: 95  RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
           RP+ N +VPC D +C+SLH    G H C+ P  QCDYE++YAD GSSLGVL+ D+FA   
Sbjct: 102 RPTKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRL 161

Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
            N   + P LA GCGY+Q  G+S    P DG+LGLG G  S++SQL    + +NVVGHCL
Sbjct: 162 ANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL 221

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGD+L   SR  W  M  S +  YYSPG A L+FGG + G++ + VV DSG
Sbjct: 222 SIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SS+TY     YQ L + +K +LS K+LKE   D +LPLCWKG++PFK+V DVKK F++L 
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLS-KTLKEV-FDPSLPLCWKGKKPFKSVLDVKKEFKSLV 339

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LSF++GK + L E+ PE YLI++  GN CLGILNG+E+GL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKFGNACLGILNGSEIGLKDLNIVGDI 387


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 194/338 (57%), Positives = 241/338 (71%), Gaps = 10/338 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42  SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101

Query: 95  RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
           RP+ N LVPC D +CA+LH    G H C+ P  QCDYE++YAD GSSLGVLV D+FA   
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161

Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
            N   + P LA GCGY+Q  G+S      DG+LGLG G  S++SQL    + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDD+   SR  W  M+   ++ YYSPG A L+FGG   G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SS+TY +   YQ L   +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK FRT+ 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVV 339

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV 365
           LSF++GK + L E+ PE YLI++  GN CLGILNG+E+
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEL 376


>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 452

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 195/356 (54%), Positives = 249/356 (69%), Gaps = 7/356 (1%)

Query: 26  SSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR 85
           SS   + + SS +F+V GNVYP G+Y V++ IG P + Y LD+D+GSDLTW+QCDAPC  
Sbjct: 39  SSDNHHRLSSSAVFKVQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKG 98

Query: 86  CVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD 144
           C +    LY+P+++LV C D +C+ +     + C  P  QCDYE+EYAD GSSLGVLV+D
Sbjct: 99  CTKPRDQLYKPNHNLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYADHGSSLGVLVRD 158

Query: 145 AFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRN 202
              F +TNG  + PR+A GCGY+Q    S  P    G+LGLG G++SI+SQLHS  LI N
Sbjct: 159 YIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHN 218

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLP 261
           VVGHCLS  GGGFLFFGDD   SS +VWTSM  S   K+YS G AEL F G+ T +K L 
Sbjct: 219 VVGHCLSARGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLE 278

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           ++FDSGSSYTY N   YQ +  ++ ++L  K LK A +D +LP+CWKG + FK++ DVKK
Sbjct: 279 LIFDSGSSYTYFNSQAYQAVVDLVTQDLKGKQLKRATDDPSLPICWKGAKSFKSLSDVKK 338

Query: 322 CFRTLALSFTDGKTRTL-FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F+ LALSFT  KT+ L   L PEAYLII+  GNVCLGIL+G EVGL++LN+IG I
Sbjct: 339 YFKPLALSFT--KTKILQMHLPPEAYLIITKHGNVCLGILDGTEVGLENLNIIGDI 392


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 195/350 (55%), Positives = 244/350 (69%), Gaps = 11/350 (3%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDAPC  C E PHPLY
Sbjct: 48  SSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY 107

Query: 95  RPS-NDLVPCEDPICASLHAP---GHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFN 149
           RP+ + LVPC   +CASLH     G H CE P  QCDY ++YAD GSS GVLV D+FA  
Sbjct: 108 RPTKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALR 167

Query: 150 YTNGQRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
            TNG    P +A GCGY+Q    G    P DG+LGLG G  S++SQL  + + +NVVGHC
Sbjct: 168 LTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHC 227

Query: 208 LSGGGGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPVVFDS 266
           LS  GGGFLFFGDDL    R  WT M+ S +  YYSPG A L+FG  + G++   VVFDS
Sbjct: 228 LSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDS 287

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           GSS+TY     YQ L + +K  LS ++L+E P D +LPLCWKG+ PFK+V DV+K F++L
Sbjct: 288 GSSFTYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSL 345

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            L+F  GK +TL E+ PE YLI++  GN CLGILNG+E+GL+DL++IG I
Sbjct: 346 VLNFASGK-KTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDI 394


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 193/349 (55%), Positives = 244/349 (69%), Gaps = 10/349 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDAPC  C E PHPLY
Sbjct: 50  SSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY 109

Query: 95  RPS-NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
           RP+ + LVPC   +CASLH    G H C+ P  QCDY ++YAD GSS GVL+ D+FA   
Sbjct: 110 RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRL 169

Query: 151 TNGQRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
           TNG    P +A GCGY+Q    G    P DG+LGLG G  S++SQL  + + +NVVGHCL
Sbjct: 170 TNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL 229

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDDL    R  WT M+ S +  YYSPG A L+FG  + G++   VVFDSG
Sbjct: 230 SLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 289

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SS+TY     YQ L + +K  LS ++L+E P D +LPLCWKG+ PFK+V DV+K F++L 
Sbjct: 290 SSFTYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLV 347

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           L+F  GK +TL E+ PE YLI++  GN CLGILNG+E+GL+DL++IG I
Sbjct: 348 LNFASGK-KTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDI 395


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 242/346 (69%), Gaps = 10/346 (2%)

Query: 38  LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
           +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDAPC  C E PHPLYRP+
Sbjct: 44  VFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPT 103

Query: 98  -NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
            + LVPC   +CASLH    G H C+ P  QCDY ++YAD GSS GVL+ D+FA   TNG
Sbjct: 104 KSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNG 163

Query: 154 QRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
               P +A GCGY+Q    G    P DG+LGLG G  S++SQL  + + +NVVGHCLS  
Sbjct: 164 SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLR 223

Query: 212 GGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGGFLFFGDDL    R  WT M+ S +  YYSPG A L+FG  + G++   VVFDSGSS+
Sbjct: 224 GGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSF 283

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY     YQ L + +K  LS ++L+E P D +LPLCWKG+ PFK+V DV+K F++L L+F
Sbjct: 284 TYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNF 341

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             GK +TL E+ PE YLI++  GN CLGILNG+E+GL+DL++IG I
Sbjct: 342 ASGK-KTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDI 386


>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
 gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 405

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 182/359 (50%), Positives = 244/359 (67%), Gaps = 4/359 (1%)

Query: 21  SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           S SS   +      SS++F + GNV+P GYY+V M IG P + +  D+DTGSDLTW+QCD
Sbjct: 19  SKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCD 78

Query: 81  APCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLG 139
           APC  C   P+  Y+P  +++PC +PIC +LH P   +C +P  QCDYE++YAD GSS+G
Sbjct: 79  APCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMG 138

Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQ 197
            LV D F     NG  + P +A GCGY+Q   +++ P    G+LGLG+GK  +++QL S 
Sbjct: 139 ALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSA 198

Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
            L RNVVGHCLS  GGGFLFFGD+L  S  V WT + S    +Y+ G A+L F G+ TGL
Sbjct: 199 GLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQ-DNHYTTGPADLLFNGKPTGL 257

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
           K L ++FD+GSSYTY N   YQT+ +++  +L    LK A ED+TLP+CWKG +PFK+V 
Sbjct: 258 KGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVL 317

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           +VK  F+T+ ++FT+G+  T   L PE YLI+S  GNVCLG+LNG+EVGLQ+ NVIG I
Sbjct: 318 EVKNFFKTITINFTNGRRNTQLYLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDI 376


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 195/360 (54%), Positives = 245/360 (68%), Gaps = 9/360 (2%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           +SSS +      SS +F ++G+VYP G Y V M IG P +PYFLD+DTGSDLTWLQCDAP
Sbjct: 38  ASSSVAGVETEASSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAP 97

Query: 83  CVRCVEAPHPLYRPS-NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSSL 138
           C  C + PHPLYRP+ N LVPC D +CASLH      H C+ P  QCDY ++YAD GSS 
Sbjct: 98  CRSCNKVPHPLYRPTKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSST 157

Query: 139 GVLVKDAFAFNYTNGQRLNPRLALGCGYN-QVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
           GVLV D+FA    NG  + P LA GCGY+ QV      P DG+LGLG G  S++SQ    
Sbjct: 158 GVLVNDSFALRLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQH 217

Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTG 256
            + +NVVGHCLS  GGGFLFFGDDL    RV WT M  S    YYSPG A L+FG ++  
Sbjct: 218 GVTKNVVGHCLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLR 277

Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
           +K   VVFDSGSS+TY     YQ L + +K +LS ++LKE   D +LPLCWKG++PFK+V
Sbjct: 278 VKLTEVVFDSGSSFTYFAAQPYQALVTALKGDLS-RTLKEV-SDPSLPLCWKGKKPFKSV 335

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            DVKK F++L L+F +G  +   E+ P+ YLI++  GN CLGILNG+EVGL+DL+++G I
Sbjct: 336 LDVKKEFKSLVLNFGNG-NKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDI 394


>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
 gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
          Length = 424

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 184/345 (53%), Positives = 242/345 (70%), Gaps = 5/345 (1%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           GSSL+  V GNVYP GYY+V++YIG P + + LD+DTGSDLTW+QCDAPC  C +  H L
Sbjct: 50  GSSLVLPVFGNVYPLGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHL 109

Query: 94  YRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           Y+P N+L+ C DP+C+++   G + C+    QCDYE++YAD GSSLGVLV D F     N
Sbjct: 110 YKPRNNLLSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGSSLGVLVTDYFPLRLMN 169

Query: 153 GQRLNPRLALGCGYNQ-VPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G  L P++  GCGY+Q  PG  +  P  G+LGLG GK+SI+SQL +  ++ NV+GHCLS 
Sbjct: 170 GSFLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSR 229

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
            GGGFLFFG D   S  + W  MS     KYY+ G AEL +GG+ TG K    +FDSGSS
Sbjct: 230 KGGGFLFFGQDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSS 289

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YTY N   YQ+  ++++KELS K L++APE++ L +CWKG + FK+V++VK  F+  ALS
Sbjct: 290 YTYFNAQVYQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALS 349

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           FT  K+  L ++ PE YLI++N GNVCLGILNG+EVGL + NVIG
Sbjct: 350 FTKAKSVQL-QIPPEDYLIVTNDGNVCLGILNGSEVGLGNFNVIG 393


>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 410

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 179/345 (51%), Positives = 237/345 (68%), Gaps = 4/345 (1%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS++  + GNV+P GYY+V + IG P + +  D+DTGSD+TW+QCDAPC  C   P   Y
Sbjct: 38  SSVVLLLSGNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPKLQY 97

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P  + VPC DPIC +LH P +  C +P  QCDYE+ YAD GSS+G LV D F F   NG
Sbjct: 98  KPKGNTVPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLNG 157

Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
             + PRLA GCGY+Q   +++ P    G+LGLG+GK  +++QL S  L RNVVGHCLS  
Sbjct: 158 SAMQPRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSK 217

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
           GGG+LFFGD L  S  V WT +      +Y+ G AEL F G+ TGLK L ++FD+GSSYT
Sbjct: 218 GGGYLFFGDTLIPSLGVAWTPLLPP-DNHYTTGPAELLFNGKPTGLKGLKLIFDTGSSYT 276

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
           Y N  TYQT+ +++  +L    LK A ED+TLP+CWKG +PFK+V +VK  F+T+ ++FT
Sbjct: 277 YFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFT 336

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + +  T  ++ PE+YLIIS  GN CLG+LNG+EVGLQ+ NVIG I
Sbjct: 337 NARRNTQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDI 381


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 188/347 (54%), Positives = 242/347 (69%), Gaps = 8/347 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS++F + GNVYP GYY+V++ IG+    +  D+D+GSDLTW+QCDAPC  C +    LY
Sbjct: 39  SSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLY 98

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P+N+ + C +P+C SLH   +H+C+    QC YE+EYAD GSSLGVLV D      TNG
Sbjct: 99  KPNNNALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG 158

Query: 154 QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
               PR+A GCGY+    VP +S  P  G+LGLG G+ S +SQL S  ++RNVVGHCLS 
Sbjct: 159 SLAAPRIAFGCGYDHKYSVPDSS-PPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSD 217

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
            GG FLFFGD+   SS V WTSMS +    YYS G AE++FGG+ TG+K+L +VFDSGSS
Sbjct: 218 EGG-FLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSS 276

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YTY N   Y ++ +++K  L  K L++APED++LP+CWKG RPFK++ DVKK F  LAL 
Sbjct: 277 YTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALR 336

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           FT  K   + +L PE YLII+  GNVC GILNG EVGL DLN+IG I
Sbjct: 337 FTKTKNAQI-QLPPENYLIITKYGNVCFGILNGTEVGLGDLNIIGDI 382


>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 185/345 (53%), Positives = 238/345 (68%), Gaps = 11/345 (3%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
           FQ+ GNVYP GYY V++ IG P + Y LD+DTGSDLTW+QCDAPC  C    + LY+P+ 
Sbjct: 52  FQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPNG 111

Query: 99  DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           +LV C DP+C ++ +  +H+C  P  QCDYE+EYAD GSSLGVL++D     +TNG    
Sbjct: 112 NLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLAR 171

Query: 158 PRLALGCGYNQV-----PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           P LA GCGY+Q      P AS     G+LGLG GK+SI+SQLHS  LIRNVVGHCLS  G
Sbjct: 172 PILAFGCGYDQKHVGHNPSASTA---GVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERG 228

Query: 213 GGFLFFGDDLYDSSRVVWTS-MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
           GGFLFFGD L   S VVWT  + S  T++Y  G A+LFF  + T +K L ++FDSGSSYT
Sbjct: 229 GGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYT 288

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
           Y N   ++ L +++  +L  K L  A ED +LP+CW+G +PFK++HDV   F+ L LSFT
Sbjct: 289 YFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLLLSFT 348

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             K  +L +L PEAYLI++  GNVCLGIL+G E+GL + N+IG I
Sbjct: 349 KSKN-SLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDI 392


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 187/370 (50%), Positives = 251/370 (67%), Gaps = 7/370 (1%)

Query: 11  CFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDT 70
           CF     +     SS+ +  + VGSS+ F+V GNVYPTGYY+V + IG P + +  D+DT
Sbjct: 15  CFSAASQTPIKGESSTPA-NDRVGSSVFFRVTGNVYPTGYYSVILNIGNPPKAFDFDIDT 73

Query: 71  GSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYEL 129
           GSDLTW+QCDAPC  C +    LY+P N+LVPC + +C ++    +++C+ P  QCDYE+
Sbjct: 74  GSDLTWVQCDAPCKGCTKPRDKLYKPKNNLVPCSNSLCQAVSTGENYHCDAPDDQCDYEI 133

Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD--GILGLGKGK 187
           EYAD GSS+GVL+ D+F    +NG  L P++A GCGY+Q     + P D  GILGLG+GK
Sbjct: 134 EYADLGSSIGVLLSDSFPLRLSNGTLLQPKMAFGCGYDQKHLGPHPPPDTAGILGLGRGK 193

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVA 246
            SI+SQL +  + +NVVGHC S   GGFLFFGD L+ SSR+ WT M  S     YS G A
Sbjct: 194 VSILSQLRTLGITQNVVGHCFSRARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPA 253

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
           EL FGG+ TG+K L ++FDSGSSYTY N   YQ++ ++++K+L+ K LK+APE E L +C
Sbjct: 254 ELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKE-LAVC 312

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
           WK  +P K++ D+K  F+ L +SF + K   L +L PE YLII+  GNVCLGILNG+E  
Sbjct: 313 WKTAKPIKSILDIKSYFKPLTISFMNAKNVQL-QLAPEDYLIITKDGNVCLGILNGSEQQ 371

Query: 367 LQDLNVIGGI 376
           L + NVIG I
Sbjct: 372 LGNFNVIGDI 381


>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Glycine max]
          Length = 454

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 192/365 (52%), Positives = 251/365 (68%), Gaps = 5/365 (1%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R +    +  S +  + + SS +F++ GNVYP G+Y V++ IG P + Y LD+D+GSDLT
Sbjct: 29  RNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLT 88

Query: 76  WLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADG 134
           W+QCDAPC  C +    LY+P+++LV C D +C+ +H    +NC  P   CDYE+EYAD 
Sbjct: 89  WVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYADH 148

Query: 135 GSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVS 192
           GSSLGVLV+D   F +TNG  + PR+A GCGY+Q    S  P    G+LGLG G++SI+S
Sbjct: 149 GSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILS 208

Query: 193 QLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFG 251
           QLHS  LIRNVVGHCLS  GGGFLFFGDD   SS +VWTSM SS   K+YS G AEL F 
Sbjct: 209 QLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFN 268

Query: 252 GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
           G+ T +K L ++FDSGSSYTY N   YQ +  ++ K+L  K LK A +D +LP+CWKG +
Sbjct: 269 GKATAVKGLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAK 328

Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLN 371
            F+++ DVKK F+ LALSF       +  L PE+YLII+  GNVCLGIL+G EVGL++LN
Sbjct: 329 SFESLSDVKKYFKPLALSFKKSXNLQM-HLPPESYLIITKHGNVCLGILDGTEVGLENLN 387

Query: 372 VIGGI 376
           +IG I
Sbjct: 388 IIGDI 392


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score =  368 bits (944), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 187/347 (53%), Positives = 241/347 (69%), Gaps = 8/347 (2%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS++F + GNVYP GYY+V++ IG+    +  D+D+GSDLTW+QCDAPC  C +    LY
Sbjct: 39  SSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLY 98

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P+N+ + C +P+C SLH   +H+C+    QC YE+EYAD GSSLGVLV D      TNG
Sbjct: 99  KPNNNALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG 158

Query: 154 QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
               PR+A GCGY+    VP +S  P  G+LGLG G+ S +SQL S  ++RNVVGHCLS 
Sbjct: 159 SLAAPRIAFGCGYDHKYSVPDSS-PPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSD 217

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
            GG FLFFGD+   SS V WTSMS +    YYS G AE++F G+ TG+K+L +VFDSGSS
Sbjct: 218 EGG-FLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSS 276

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YTY N   Y ++ +++K  L  K L++APED++LP+CWKG RPFK++ DVKK F  LAL 
Sbjct: 277 YTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALR 336

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           FT  K   + +L PE YLII+  GNVC GILNG EVGL DLN+IG I
Sbjct: 337 FTKTKNAQI-QLPPENYLIITKYGNVCFGILNGTEVGLGDLNIIGDI 382


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 186/370 (50%), Positives = 246/370 (66%), Gaps = 9/370 (2%)

Query: 11  CFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDT 70
           CF     +     S++ +  + VGSS+ F+V GNVYPTG+Y+V + IG P + + LD+DT
Sbjct: 29  CFSAASQTPIKGKSTTPA-NDRVGSSVFFRVTGNVYPTGHYSVILNIGNPPKAFDLDIDT 87

Query: 71  GSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYEL 129
           GSDLTW+QCDAPC  C +    LY+P N+ VPC   +C ++    ++NC+ P  QCDYE+
Sbjct: 88  GSDLTWVQCDAPCKGCTKPLDKLYKPKNNRVPCASSLCQAIQ---NNNCDIPTEQCDYEV 144

Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD--GILGLGKGK 187
           EYAD GSSLGVL+ D F     NG  L PR+A GCGY+Q     + P D  GILGLG+GK
Sbjct: 145 EYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAFGCGYDQKYLGPHSPPDTAGILGLGRGK 204

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVA 246
           +SI+SQL +  + +NVVGHC S   GGFLFFGD L   S + WT M  S     YS G A
Sbjct: 205 ASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPA 264

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
           EL FGG+ TG+K L ++FDSGSSYTY N   YQ++ ++++K+LS   LK+APE++ L +C
Sbjct: 265 ELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVC 324

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
           WK  +P K++ D+K  F+ L ++F   K   L +L PE YLII+  GNVCLGILNG E G
Sbjct: 325 WKTAKPIKSILDIKSFFKPLTINFIKAKNVQL-QLAPEDYLIITKDGNVCLGILNGGEQG 383

Query: 367 LQDLNVIGGI 376
           L +LNVIG I
Sbjct: 384 LGNLNVIGDI 393


>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 453

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 197/359 (54%), Positives = 256/359 (71%), Gaps = 7/359 (1%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           S+S+  + N +G +++F + GNVYP G+Y+V++ IG P +PY LD+D+GSDLTWLQCDAP
Sbjct: 40  SASNQPISNRMGHTVVFPLQGNVYPQGFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAP 99

Query: 83  CVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVL 141
           CV C +APHP Y+P+   + C DP+C++LH P    C+    QCDYE+ YAD GSSLGVL
Sbjct: 100 CVSCTKAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVL 159

Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ-VPGASYHP-LDGILGLGKGKSSIVSQLHSQKL 199
           V D F+   TNG    PRLA GCGY+Q  PG +  P +DG+LGLG GKSSIV+QL S  L
Sbjct: 160 VHDIFSLQLTNGTLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGL 219

Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLK 258
           IR++VGHCLSG GGGFLF GD L  +  ++WT MS    +  Y+ G A+L F G+ +G+K
Sbjct: 220 IRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVK 279

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
            L +VFDSGSSYTY N   Y+T  S+++K L+ K LKE   DE+LP+CW+G +PFK++ +
Sbjct: 280 GLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK-LKET-ADESLPVCWRGAKPFKSIFE 337

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
           VK  F+  ALSFT  K+  L +L PE+YLIIS  GN CLGILNG+EVGL D NVIG I 
Sbjct: 338 VKNYFKPFALSFTKAKSAQL-QLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIA 395


>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score =  361 bits (926), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 197/359 (54%), Positives = 256/359 (71%), Gaps = 7/359 (1%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           S+S+  + N +G +++F + GNVYP G+Y+V++ IG P +PY LD+D+GSDLTWLQCDAP
Sbjct: 7   SASNQPISNRMGHTVVFPLQGNVYPQGFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAP 66

Query: 83  CVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVL 141
           CV C +APHP Y+P+   + C DP+C++LH P    C+    QCDYE+ YAD GSSLGVL
Sbjct: 67  CVSCTKAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVL 126

Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ-VPGASYHP-LDGILGLGKGKSSIVSQLHSQKL 199
           V D F+   TNG    PRLA GCGY+Q  PG +  P +DG+LGLG GKSSIV+QL S  L
Sbjct: 127 VHDIFSLQLTNGTLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGL 186

Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLK 258
           IR++VGHCLSG GGGFLF GD L  +  ++WT MS    +  Y+ G A+L F G+ +G+K
Sbjct: 187 IRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVK 246

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
            L +VFDSGSSYTY N   Y+T  S+++K L+ K LKE   DE+LP+CW+G +PFK++ +
Sbjct: 247 GLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK-LKET-ADESLPVCWRGAKPFKSIFE 304

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
           VK  F+  ALSFT  K+  L +L PE+YLIIS  GN CLGILNG+EVGL D NVIG I 
Sbjct: 305 VKNYFKPFALSFTKAKSAQL-QLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIA 362


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score =  360 bits (923), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 186/349 (53%), Positives = 238/349 (68%), Gaps = 12/349 (3%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+ +FQ+ G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLY
Sbjct: 37  STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96

Query: 95  RPS-NDLVPCEDPICASLHA-PGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           RP+ N LVPC + +C +LH+  G +N C  P QCDY+++Y D  SS GVL+ D+F+    
Sbjct: 97  RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156

Query: 152 NGQRLNPRLALGCGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
           +   + P L  GCGY+Q     GA    +DG+LGLG+G  S+VSQL  Q + +NVVGHCL
Sbjct: 157 S-SNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDD+  SSRV W  M+   +  YYSPG   L+F   + G+K + VVFDSG
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           S+YTY     YQ + S +K  LS KSLK+   D TLPLCWKG++ FK+V DVK  F+++ 
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLS-KSLKQV-SDPTLPLCWKGQKAFKSVFDVKNEFKSMF 333

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LSF+  K   + E+ PE YLI++  GNVCLGIL+G    L   NVIG I
Sbjct: 334 LSFSSAKNAAM-EIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDI 380


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 186/349 (53%), Positives = 237/349 (67%), Gaps = 12/349 (3%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+ +FQ+ G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLY
Sbjct: 37  STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96

Query: 95  RPS-NDLVPCEDPICASLHA-PGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           RP+ N LVPC + +C +LH+  G +N C  P QCDY+++Y D  SS GVL+ D+F+    
Sbjct: 97  RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156

Query: 152 NGQRLNPRLALGCGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
           +   + P L  GCGY+Q     GA    +DG+LGLG+G  S+VSQL  Q + +NVVGHCL
Sbjct: 157 S-SNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSG 267
           S  GGGFLFFGDD+  SSRV W  M+   +  YYSPG   L+F   + G+K + VVFDSG
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           S+YTY     YQ + S +K  LS KSLK+   D TLPLCWKG++ FK+V DVK  F+++ 
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLS-KSLKQV-SDPTLPLCWKGQKAFKSVFDVKNEFKSMF 333

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LSF   K   + E+ PE YLI++  GNVCLGIL+G    L   NVIG I
Sbjct: 334 LSFASAKNAAM-EIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDI 380


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 183/346 (52%), Positives = 233/346 (67%), Gaps = 13/346 (3%)

Query: 38  LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
           +F + G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLYRP+
Sbjct: 44  VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103

Query: 98  -NDLVPCEDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
            N LVPC + IC +LH+    N  C    QCDY+++Y D  SSLGVLV D+F+    N  
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKS 163

Query: 155 RLNPRLALGCGYNQVP---GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
            + P L+ GCGY+Q     GA+    DG+LGLG+G  S++SQL  Q + +NV+GHCLS  
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223

Query: 212 GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGGFLFFGDD+  +SRV W SM  S    YYSPG A L+F   +   K + VVFDSGS+Y
Sbjct: 224 GGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGSTY 283

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY +   YQ   S +K  LS KSLK+   D +LPLCWKG++ FK+V DVKK F++L   F
Sbjct: 284 TYFSAQPYQATISAIKGSLS-KSLKQV-SDPSLPLCWKGQKAFKSVSDVKKDFKSLQFIF 341

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             GK   + ++ PE YLII+  GNVCLGIL+G+   L   ++IG I
Sbjct: 342 --GK-NAVMDIPPENYLIITKNGNVCLGILDGSAAKL-SFSIIGDI 383


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 185/342 (54%), Positives = 235/342 (68%), Gaps = 5/342 (1%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
           FQ+ GNVYP GYY V++ IG P + Y LD+DTGSDLTW+QCDAPC  C    + LY+P  
Sbjct: 52  FQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHG 111

Query: 99  DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           DLV C DP+CA++ +  +H+C  P  QCDYE+EYAD GSSLGVL++D     +TNG    
Sbjct: 112 DLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLAR 171

Query: 158 PRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
           P LA GCGY+Q       P    G+LGLG G++SI+SQLHS  LIRNVVGHCLSG GGGF
Sbjct: 172 PMLAFGCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGF 231

Query: 216 LFFGDDLYDSSRVVWTS-MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
           LFFGD L   S VVWT  + S   ++Y  G A+LFF  +TT +K L ++FDSGSSYTY N
Sbjct: 232 LFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFN 291

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
              ++ L +++  +L  K L  A  D +LP+CWKG +PFK++HDV   F+ L LSFT  K
Sbjct: 292 SQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLLSFTKSK 351

Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
              L +L PEAYLI++  GNVCLGIL+G E+GL + N+IG I
Sbjct: 352 NSPL-QLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDI 392


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score =  348 bits (893), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 182/346 (52%), Positives = 232/346 (67%), Gaps = 13/346 (3%)

Query: 38  LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
           +F + G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLYRP+
Sbjct: 44  VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103

Query: 98  -NDLVPCEDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
            N LVPC + IC +LH+    N  C    QCDY+++Y D  SSLGVLV D+F+    N  
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKS 163

Query: 155 RLNPRLALGCGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
            + P L+ GCGY+Q     GA+    DG+LGLG+G  S++SQL  Q + +NV+GHCLS  
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223

Query: 212 GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGGFLFFGDD+  +SRV W  M  S    YYSPG A L+F   +   K + VVFDSGS+Y
Sbjct: 224 GGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGSTY 283

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY +   YQ   S +K  LS KSLK+   D +LPLCWKG++ FK+V DVKK F++L   F
Sbjct: 284 TYFSAQPYQATISAIKGSLS-KSLKQV-SDPSLPLCWKGQKAFKSVSDVKKDFKSLQFIF 341

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             GK   + E+ PE YLI++  GNVCLGIL+G+   L   ++IG I
Sbjct: 342 --GK-NAVMEIPPENYLIVTKNGNVCLGILDGSAAKL-SFSIIGDI 383


>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 407

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 187/358 (52%), Positives = 244/358 (68%), Gaps = 8/358 (2%)

Query: 25  SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV 84
           S+ S+ +H  SS+ FQ+ GNVYP GYY+V + IG P + Y LD+DTGSDLTW+QCDAPC 
Sbjct: 23  SAISVLSH-ASSIAFQIKGNVYPLGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCK 81

Query: 85  RCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVK 143
            C       Y+P  +LV C DP+CA++ +  +  C +P  QCDYE+EYAD GSSLGVLV+
Sbjct: 82  GCTLPRDRQYKPHGNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGSSLGVLVR 141

Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIR 201
           D      TNG   +  LA GCGY+Q       P    G+LGLG G++SI+SQL+S+ LIR
Sbjct: 142 DIIPLKLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIR 201

Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLK 258
           NVVGHCLSG GGGFLFFGD L   S VVWT +   SS   K+Y  G A++FF G+ T +K
Sbjct: 202 NVVGHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVK 261

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
            L + FDSGSSYTY N + ++ L  ++  ++  K L  A ED +LP+CWKG +PFK++HD
Sbjct: 262 GLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHD 321

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           V   F+ L LSFT  K  +LF++ PEAYLI++  GNVCLGIL+G E+GL + N+IG I
Sbjct: 322 VTSNFKPLVLSFTKSK-NSLFQVPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDI 378


>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 467

 Score =  344 bits (883), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 176/370 (47%), Positives = 240/370 (64%), Gaps = 5/370 (1%)

Query: 12  FPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDT 70
           F     ++  SS+    L N  +GSS++F V GNVYP GYY V + IG P + + LD+DT
Sbjct: 28  FQPSDATTKDSSAQQVKLQNRRLGSSVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDT 87

Query: 71  GSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYEL 129
           GSDLTW+QCDAPC  C +     Y+P+++ +PC   +C+ L    +  C+DP  QCDYE+
Sbjct: 88  GSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHLLCSGLDLTQNRPCDDPEDQCDYEI 147

Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGK 187
            Y+D  SS+G LV D F     NG  +NP L  GCGY+Q         P  GILGLG+GK
Sbjct: 148 GYSDHASSIGALVTDEFPLKLANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGK 207

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVA 246
             I +QL S  + +NV+ HCLS  G GFL  GD+L  SS V WTS++++  +K Y  G A
Sbjct: 208 VGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPA 267

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
           EL F  +TTG+K + VVFDSGSSYTY N   YQ +  +++K+L+ K L +  +D++LP+C
Sbjct: 268 ELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVC 327

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
           WKG++P K++ +VKK F+T+ L F   K   LF++ PE+YLII+ KGNVCLGILNG EVG
Sbjct: 328 WKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQVPPESYLIITEKGNVCLGILNGTEVG 387

Query: 367 LQDLNVIGGI 376
           L   N++G I
Sbjct: 388 LDSYNIVGDI 397


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score =  343 bits (880), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 226/346 (65%), Gaps = 13/346 (3%)

Query: 38  LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
           +FQ++G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLY+P+
Sbjct: 39  VFQLNGDVYPTGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPT 98

Query: 98  -NDLVPCEDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
            N LVPC   IC +LH+    N  C  P QCDY+++Y D  SSLGVLV D F     N  
Sbjct: 99  KNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSS 158

Query: 155 RLNPRLALGCGYNQVPGAS---YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
            + P    GCGY+Q  G +       DG+LGLGKG  S+VSQL    + +NV+GHCLS  
Sbjct: 159 SVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTN 218

Query: 212 GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGGFLFFGD++  +SR  W  M  S    YYSPG   L+F   + G+K + VVFDSGS+Y
Sbjct: 219 GGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTY 278

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY     YQ   S +K  LS KSL++   D +LPLCWKG++ FK+V DVK  F++L LSF
Sbjct: 279 TYFAAQPYQATVSALKAGLS-KSLQQV-SDPSLPLCWKGQKVFKSVSDVKNDFKSLFLSF 336

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
                 ++ E+ PE YLI++  GN CLGIL+G+   L   N+IG I
Sbjct: 337 VK---NSVLEIPPENYLIVTKNGNACLGILDGSAAKLT-FNIIGDI 378


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score =  340 bits (872), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 182/372 (48%), Positives = 237/372 (63%), Gaps = 15/372 (4%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
           L  P    S + +++   SL +   S+ +FQ+ G VYP G+Y VTM IG PA+PYFLD+D
Sbjct: 34  LLLPPFAPSPARAATPGKSLSS--ASTAVFQLQGAVYPIGHYYVTMNIGDPAKPYFLDVD 91

Query: 70  TGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVPCEDPICASLHAPGHHNCEDPAQCDYE 128
           TGSDLTWLQCDAPC  C + PHP Y+P+ N +VPC   +C SL    +  C  P QCDY+
Sbjct: 92  TGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKIVPCAASLCTSLTP--NKKCAVPQQCDYQ 149

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVP---GASYHPLDGILGLGK 185
           ++Y D  SSLGVL+ D F  +  N   +   L  GCGY+Q     GA     DG+LGLGK
Sbjct: 150 IKYTDKASSLGVLIADNFTLSLRNSSTVRANLTFGCGYDQQVGKNGAVQAATDGLLGLGK 209

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPG 244
           G  S++SQL  Q + +NV+GHC S  GGGFLFFGDD+  +SRV W  M+   +  YYSPG
Sbjct: 210 GAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFGDDIVPTSRVTWVPMARTTSGNYYSPG 269

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
              L+F   + G+K + VVFDSGS+Y Y     YQ   S +K  LS KSLKE   D +LP
Sbjct: 270 SGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEPYQATVSALKAGLS-KSLKEV-SDVSLP 327

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           LCWKG++ FK+V +VK  F++L LSF  GK  ++ E+ PE YLI++  GNVCLGIL+G  
Sbjct: 328 LCWKGQKVFKSVSEVKNDFKSLFLSF--GK-NSVMEIPPENYLIVTKYGNVCLGILDGTT 384

Query: 365 VGLQDLNVIGGI 376
             L+  N+IG I
Sbjct: 385 AKLK-FNIIGDI 395


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 179/336 (53%), Positives = 228/336 (67%), Gaps = 10/336 (2%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
           +SSS ++      SS +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDA
Sbjct: 37  ASSSIAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA 96

Query: 82  PCVRCVEAPHPLYRPS-NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSS 137
           PC  C E PHPLYRP+ + LVPC   +CASLH    G H C+ P  QCDY ++YAD GSS
Sbjct: 97  PCRSCNEVPHPLYRPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSS 156

Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLH 195
            GVL+ D+FA   TNG    P +A GCGY+Q    G    P DG+LGLG G  S++SQL 
Sbjct: 157 TGVLINDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 216

Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGET 254
            + + +NVVGHCLS  GGGFLFFGDDL    R  WT M+ S +  YYSPG A L+FG  +
Sbjct: 217 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRS 276

Query: 255 TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
            G++   VVFDSGSS+TY     YQ L + +K  LS ++L+E P D +LPLCWKG+ PFK
Sbjct: 277 LGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFK 334

Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS 350
           +V DV+K F++L L+F  GK +TL E+ PE YLI++
Sbjct: 335 SVLDVRKEFKSLVLNFASGK-KTLMEIPPENYLIVT 369


>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
 gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 466

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 173/372 (46%), Positives = 239/372 (64%), Gaps = 5/372 (1%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDL 68
           LC       ++  SS+   L N  + S+++F V GNVYP GYY V + IG P + + LD+
Sbjct: 25  LCARFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDI 84

Query: 69  DTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDY 127
           DTGSDLTW+QCDAPC  C +     Y+P+++ +PC   +C+ L  P    C DP  QCDY
Sbjct: 85  DTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDY 144

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGK 185
           E+ Y+D  SS+G LV D       NG  +N RL  GCGY+Q         P  GILGLG+
Sbjct: 145 EIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGR 204

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY-TKYYSPG 244
           GK  + +QL S  + +NV+ HCLS  G GFL  GD+L  SS V WTS++++  +K Y  G
Sbjct: 205 GKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAG 264

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
            AEL F  +TTG+K + VVFDSGSSYTY N   YQ +  +++K+L+ K L +  +D++LP
Sbjct: 265 PAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLP 324

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           +CWKG++P K++ +VKK F+T+ L F + K   LF++ PE+YLII+ KG VCLGILNG E
Sbjct: 325 VCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTE 384

Query: 365 VGLQDLNVIGGI 376
           +GL+  N+IG I
Sbjct: 385 IGLEGYNIIGDI 396


>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 432

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 173/372 (46%), Positives = 239/372 (64%), Gaps = 5/372 (1%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDL 68
           LC       ++  SS+   L N  + S+++F V GNVYP GYY V + IG P + + LD+
Sbjct: 25  LCARFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDI 84

Query: 69  DTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDY 127
           DTGSDLTW+QCDAPC  C +     Y+P+++ +PC   +C+ L  P    C DP  QCDY
Sbjct: 85  DTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDY 144

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGK 185
           E+ Y+D  SS+G LV D       NG  +N RL  GCGY+Q         P  GILGLG+
Sbjct: 145 EIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGR 204

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY-TKYYSPG 244
           GK  + +QL S  + +NV+ HCLS  G GFL  GD+L  SS V WTS++++  +K Y  G
Sbjct: 205 GKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAG 264

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
            AEL F  +TTG+K + VVFDSGSSYTY N   YQ +  +++K+L+ K L +  +D++LP
Sbjct: 265 PAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLP 324

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           +CWKG++P K++ +VKK F+T+ L F + K   LF++ PE+YLII+ KG VCLGILNG E
Sbjct: 325 VCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTE 384

Query: 365 VGLQDLNVIGGI 376
           +GL+  N+IG I
Sbjct: 385 IGLEGYNIIGDI 396


>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
          Length = 427

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 173/372 (46%), Positives = 239/372 (64%), Gaps = 10/372 (2%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDL 68
           LC       ++  SS+   L N  + S+++F V GNVYP GYY V + IG P + + LD+
Sbjct: 25  LCARFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDI 84

Query: 69  DTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDY 127
           DTGSDLTW+QCDAPC  C +     Y+P+++ +PC   +C+ L  P    C DP  QCDY
Sbjct: 85  DTGSDLTWVQCDAPCNGCTK-----YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDY 139

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGK 185
           E+ Y+D  SS+G LV D       NG  +N RL  GCGY+Q         P  GILGLG+
Sbjct: 140 EIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGR 199

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY-TKYYSPG 244
           GK  + +QL S  + +NV+ HCLS  G GFL  GD+L  SS V WTS++++  +K Y  G
Sbjct: 200 GKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAG 259

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
            AEL F  +TTG+K + VVFDSGSSYTY N   YQ +  +++K+L+ K L +  +D++LP
Sbjct: 260 PAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLP 319

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           +CWKG++P K++ +VKK F+T+ L F + K   LF++ PE+YLII+ KG VCLGILNG E
Sbjct: 320 VCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTE 379

Query: 365 VGLQDLNVIGGI 376
           +GL+  N+IG I
Sbjct: 380 IGLEGYNIIGDI 391


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score =  335 bits (858), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 170/320 (53%), Positives = 215/320 (67%), Gaps = 12/320 (3%)

Query: 38  LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
           +FQ+ GNVYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLYRP+
Sbjct: 41  IFQLQGNVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPT 100

Query: 98  -NDLVPCEDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
            N LVPC + +C +LH+ GH   + C  P QCDY+++Y D  SS GVL+ D F+      
Sbjct: 101 ANSLVPCANALCTALHS-GHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLP-MRS 158

Query: 154 QRLNPRLALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
             + P L  GCGY+Q  G   A     DG+LGLG+G  S+VSQL  Q + +NV+GHCLS 
Sbjct: 159 SNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLST 218

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
            GGGFLFFGDD+  +SRV W  M+     YYSPG   L+F   + G+K + VVFDSGS+Y
Sbjct: 219 NGGGFLFFGDDIVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTY 278

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY     YQ + S +K  LS KSLK+   D +LPLCWKG + FK+V DVKK F++L LSF
Sbjct: 279 TYFTAQPYQAVVSALKSGLS-KSLKQV-SDPSLPLCWKGPKAFKSVFDVKKEFKSLFLSF 336

Query: 331 TDGKTRTLFELTPEAYLIIS 350
              K   + E+ PE YLI++
Sbjct: 337 ASAK-NAVMEIPPENYLIVT 355


>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 410

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 179/353 (50%), Positives = 239/353 (67%), Gaps = 5/353 (1%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           SS  N   SS+L  V GNVYP G++ V++ IG P + + LD+DTGSDLTW+QCDAPC  C
Sbjct: 31  SSAVNPFDSSILLPVKGNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGC 90

Query: 87  VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDA 145
                 LY+P N++V C +P+C++L +     C++P  QCDYE+EYAD GSS+GVLVKD 
Sbjct: 91  TLPHDRLYKPHNNVVRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKDP 150

Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNV 203
                TNG  L P L  GCGY+Q  G S  P    G+LGLG  K+++ +QL +   +RNV
Sbjct: 151 VPLRLTNGTILAPNLGFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNV 210

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVV 263
           +GHC SG GGGFLFFG DL  SS + W  +       YS G AE++FGG   G++ L + 
Sbjct: 211 LGHCFSGQGGGFLFFGGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILT 270

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTY N   Y  + ++++  L  + L++APED+TLP+CWKG + FK+V DV+  F
Sbjct: 271 FDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICWKGSKAFKSVADVRNFF 330

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + LALSF  G ++  F++ PEAYLIISN GNVCLGILNG++VGL ++N+IG I
Sbjct: 331 KPLALSF--GNSKVQFQIPPEAYLIISNLGNVCLGILNGSQVGLGNVNLIGDI 381


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 173/327 (52%), Positives = 219/327 (66%), Gaps = 12/327 (3%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVPCEDPICASLHA-P 114
           IG PA+PYFLD+DTGSDLTWLQCDAPC  C + PHPLYRP+ N LVPC + +C +LH+  
Sbjct: 1   IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60

Query: 115 GHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV---P 170
           G +N C  P QCDY+++Y D  SS GVL+ D+F+    +   + P L  GCGY+Q     
Sbjct: 61  GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRS-SNIRPGLTFGCGYDQQVGKN 119

Query: 171 GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVW 230
           GA    +DG+LGLG+G  S+VSQL  Q + +NVVGHCLS  GGGFLFFGDD+  SSRV W
Sbjct: 120 GAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPSSRVTW 179

Query: 231 TSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKEL 289
             M+   +  YYSPG   L+F   + G+K + VVFDSGS+YTY     YQ + S +K  L
Sbjct: 180 VPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKGGL 239

Query: 290 SAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
           S KSLK+   D TLPLCWKG++ FK+V DVK  F+++ LSF   K   + E+ PE YLI+
Sbjct: 240 S-KSLKQV-SDPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAAM-EIPPENYLIV 296

Query: 350 SNKGNVCLGILNGAEVGLQDLNVIGGI 376
           +  GNVCLGIL+G    L   NVIG I
Sbjct: 297 TKNGNVCLGILDGTAAKL-SFNVIGDI 322


>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 413

 Score =  320 bits (820), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 173/346 (50%), Positives = 232/346 (67%), Gaps = 4/346 (1%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           GSS+LF V GNVYP G++ V + IG P++ + LD+DTGSDLTW+QCD  C+ C      L
Sbjct: 36  GSSVLFPVRGNVYPLGHFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDML 95

Query: 94  YRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           YRP N+ V  EDP+CA+L + G    ++P  QC YE+EYAD GSS+GVLVKD      TN
Sbjct: 96  YRPHNNAVSREDPLCAALSSLGKFIFKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTN 155

Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G+R++P L  GCGY+Q  G    P  + G+LGL   K++IVSQL     + NVVGHCL+G
Sbjct: 156 GKRISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTG 215

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
            GGGFLFFG D+  SS + WT +  +    YS G AE++F G   G+  L + FDSGSSY
Sbjct: 216 RGGGFLFFGGDVVPSSGMSWTPILRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSY 275

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TY N   Y+ +  ++K +L    LK A +D+TL LCWKG +PF++V DV+  F+ LA+SF
Sbjct: 276 TYFNSQVYRAIEKLLKNDLKGNPLKLASDDKTLELCWKGPKPFESVVDVRNFFKPLAMSF 335

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            + K    F++ PEAYLIIS  GNVCLGIL+G++ G+ ++N+IG I
Sbjct: 336 KNSKN-VQFQIPPEAYLIISEFGNVCLGILDGSKEGMGNVNIIGDI 380


>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 425

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 169/376 (44%), Positives = 228/376 (60%), Gaps = 13/376 (3%)

Query: 12  FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
           FP    +++ ++S   +  + + SSL++ + GNVYP G Y V++ IG P +PY LD+DTG
Sbjct: 23  FPHHFSAANKNNSIPPTSIHSLISSLVYTIKGNVYPDGLYTVSINIGNPPKPYELDIDTG 82

Query: 72  SDLTWLQC---DAPCVRCVEAPHPLYRPS-NDLVPCEDPICA---SLHAPGHHNCEDPAQ 124
           SDLTW+QC   DAPC  C      LY+P+   +V C DPIC    S H  G    +    
Sbjct: 83  SDLTWVQCDGPDAPCKGCTMPKDKLYKPNGKQVVKCSDPICVATQSTHVLGQICSKQSPP 142

Query: 125 CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV---PGASYHPLDGIL 181
           C Y ++YAD  S+LGVLV+D       +    +P +A GCGY Q    P   +    GIL
Sbjct: 143 CVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGIL 202

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTS-MSSDYTKY 240
           GLG GK+SI+SQL S   I NV+GHCLS  GGG+LF GD    SS +VWT  + S   K+
Sbjct: 203 GLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKH 262

Query: 241 YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
           Y+ G  +LFF G+ T  K L ++FDSGSSYTY +   Y  + +++  +L  K L    +D
Sbjct: 263 YNTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRV-KD 321

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
            +LP+CWKG +PFK++++V   F+ L LSFT  K    F+L P AYLII+  GNVCLGIL
Sbjct: 322 PSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKNLQ-FQLPPVAYLIITKYGNVCLGIL 380

Query: 361 NGAEVGLQDLNVIGGI 376
           NG E GL + NV+G I
Sbjct: 381 NGNEAGLGNRNVVGDI 396


>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
 gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
 gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
          Length = 410

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 169/360 (46%), Positives = 229/360 (63%), Gaps = 26/360 (7%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+++ ++HGNVYP G++ +TM IG PA+ YFLD+DTGS LTWLQCDAPC  C   PH LY
Sbjct: 22  SAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLY 81

Query: 95  RPS-NDLVPCEDPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           +P+   LV C D +C  L+        C    QCDY ++Y D  SS+GVLV D F+ + +
Sbjct: 82  KPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSAS 140

Query: 152 NGQRLNP-RLALGCGYNQ------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNV 203
           NG   NP  +A GCGY+Q      VP     P+D ILGL +GK +++SQL SQ +I ++V
Sbjct: 141 NGT--NPTTIAFGCGYDQGKKNRNVP----IPVDSILGLSRGKVTLLSQLKSQGVITKHV 194

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-- 261
           +GHC+S  GGGFLFFGD    +S V WT M+ ++ KYYSPG   L F   +  +   P  
Sbjct: 195 LGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMA 253

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHD 318
           V+FDSG++YTY     YQ   S++K  L++  K L E  E D  L +CWKG+     + +
Sbjct: 254 VIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDE 313

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE--VGLQDLNVIGGI 376
           VKKCFR+L+L F DG  +   E+ PE YLIIS +G+VCLGIL+G++  + L   N+IGGI
Sbjct: 314 VKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGI 373


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 166/376 (44%), Positives = 227/376 (60%), Gaps = 21/376 (5%)

Query: 21  SSSSSSSSLFNHVGSSL------LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           S +S  S   N +G  L      +F + GNV P G Y VTM +G P++PYFLD+D+GS+L
Sbjct: 43  SKASFVSRDTNRIGRRLQAHQTAIFSLKGNVVPYGLYYVTMLVGNPSKPYFLDVDSGSEL 102

Query: 75  TWLQCDAPCVRCVEAPHPLYR-PSNDLVPCEDPICASLHA-PGH-HNCEDPAQ-CDYELE 130
           TW+QCDAPC+ C + PHPLY+     LVP +DP+CA++ A  GH HN ++ +Q CDY++ 
Sbjct: 103 TWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVA 162

Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKS 188
           YAD G S G LV+D+     TN   L      GCGYNQ      S    DGILGLG G +
Sbjct: 163 YADHGYSEGFLVRDSVRALLTNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMA 222

Query: 189 SIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGV 245
           S+ SQ   Q LI+NV+GHC+ G G  GG++FFGDDL  +S + W  M      K+Y  G 
Sbjct: 223 SLPSQWAKQGLIKNVIGHCIFGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGA 282

Query: 246 AELFFGG-----ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
           A++ FG      +  G K   ++FDSGS+YTY     Y    S++K+ LS K L++   D
Sbjct: 283 AQMNFGNKPLDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSD 342

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
             L LCW+ +  F++V +    F+ L L F   KT+ + E+ PE YL+++ KGNVCLGIL
Sbjct: 343 SFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKTKQM-EIFPEGYLVVNKKGNVCLGIL 401

Query: 361 NGAEVGLQDLNVIGGI 376
           NG  +G+ D NV+G I
Sbjct: 402 NGTAIGIVDTNVLGDI 417


>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
          Length = 410

 Score =  300 bits (768), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 167/358 (46%), Positives = 229/358 (63%), Gaps = 22/358 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+++ ++HGNVYP G++ VTM IG PA+PYFLD+DTGS LTWLQCD PC+ C + PH LY
Sbjct: 22  SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81

Query: 95  RPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           +P     V C +  CA L+A       C    QC Y ++Y  GGSS+GVL+ D+F+   +
Sbjct: 82  KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPAS 140

Query: 152 NGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQLHSQKLI-RNVVG 205
           NG   NP  +A GCGYNQ  G + H    P++GILGLG+GK +++SQL SQ +I ++V+G
Sbjct: 141 NGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 196

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--VV 263
           HC+S  G GFLFFGD    +S V W+ M+ ++ K+YSP    L F   +  +   P  V+
Sbjct: 197 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVI 255

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHDVK 320
           FDSG++YTY     Y    S++K  LS   K L E  E D  L +CWKG+   + + +VK
Sbjct: 256 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVK 315

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV--GLQDLNVIGGI 376
           KCFR+L+L F DG  +   E+ PE YLIIS +G+VCLGIL+G++    L   N+IGGI
Sbjct: 316 KCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGI 373


>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
          Length = 410

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 166/358 (46%), Positives = 228/358 (63%), Gaps = 22/358 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+++ ++HGNVYP G++ VTM I  PA+PYFLD+DTGS LTWLQCD PC+ C + PH LY
Sbjct: 22  SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81

Query: 95  RPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           +P     V C +  CA L+A       C    QC Y ++Y  GGSS+GVL+ D+F+   +
Sbjct: 82  KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPAS 140

Query: 152 NGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQLHSQKLI-RNVVG 205
           NG   NP  +A GCGYNQ  G + H    P++GILGLG+GK +++SQL SQ +I ++V+G
Sbjct: 141 NGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 196

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--VV 263
           HC+S  G GFLFFGD    +S V W+ M+ ++ K+YSP    L F   +  +   P  V+
Sbjct: 197 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLHFNSNSKPISAAPMEVI 255

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHDVK 320
           FDSG++YTY     Y    S++K  LS   K L E  E D  L +CWKG+   + + +VK
Sbjct: 256 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVK 315

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV--GLQDLNVIGGI 376
           KCFR+L+L F DG  +   E+ PE YLIIS +G+VCLGIL+G++    L   N+IGGI
Sbjct: 316 KCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGI 373


>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
          Length = 411

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 165/359 (45%), Positives = 226/359 (62%), Gaps = 23/359 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+++ ++HGNVYP G++ VTM I  PA+PYFLD+DTGS LTWLQCD PC+ C + PH LY
Sbjct: 22  SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81

Query: 95  RPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           +P     V C +  CA L+A       C    QC Y ++Y  GGSS+GVL+ D+F+   +
Sbjct: 82  KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPAS 140

Query: 152 NGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQLHSQKLI-RNVVG 205
           NG   NP  +A GCGYNQ  G + H    P++GILGLG+GK +++SQL SQ +I ++V+G
Sbjct: 141 NGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 196

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET---TGLKNLPV 262
           HC+S  G GFLFFGD    +S V W+ M+ ++ K+YSP    L F            + V
Sbjct: 197 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLHFNSNKQSPISAAPMEV 255

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHDV 319
           +FDSG++YTY     Y    S++K  LS   K L E  E D  L +CWKG+   + + +V
Sbjct: 256 IFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEV 315

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV--GLQDLNVIGGI 376
           KKCFR+L+L F DG  +   E+ PE YLIIS +G+VCLGIL+G++    L   N+IGGI
Sbjct: 316 KKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGI 374


>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 418

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 164/376 (43%), Positives = 220/376 (58%), Gaps = 19/376 (5%)

Query: 12  FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
           FP    +++ ++S   +  + + SSL++ + GNVYP G Y V++ IG P  PY LD+DTG
Sbjct: 23  FPHHFSAANKNNSIPPTSIHSLISSLVYTIKGNVYPDGIYTVSINIGNPPNPYELDIDTG 82

Query: 72  SDLTWLQCD---APCVRCVEAPHPLYRPS-NDLVPCEDPICASLHAPGH---HNCEDP-A 123
           SDLTW+QCD   APC  C      LY+P+ N LV C DPICA++  P       C  P  
Sbjct: 83  SDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVKCSDPICAAVQPPFSTFGQKCAKPIP 142

Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGIL 181
            C Y++EYAD   S G L +D       +G  + P +  GCGY Q            G+L
Sbjct: 143 PCVYKVEYADNAESTGALARDYMHIGSPSGSNV-PLVVFGCGYEQKFSGPTPPPSTPGVL 201

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKY 240
           GLG GK SI+SQLHS   I NV+GHCLS  GGG+LF GD    SS + WT +  S   K+
Sbjct: 202 GLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLGDKFIPSSGIFWTPIIQSSLEKH 261

Query: 241 YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
           YS G  +LFF G+ T  K L ++FDSGSSYTY +   Y  + +++  +L  K L+   +D
Sbjct: 262 YSTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKD 321

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
            +LP+CWKG +PFK++++V   F+ L LSFT  K    F+L P  +      GNVCLGIL
Sbjct: 322 PSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKNLQ-FQLPPVKF------GNVCLGIL 374

Query: 361 NGAEVGLQDLNVIGGI 376
           NG E GL + NV+G I
Sbjct: 375 NGNEAGLGNRNVVGDI 390


>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
          Length = 423

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 168/371 (45%), Positives = 230/371 (61%), Gaps = 35/371 (9%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA----- 89
           S+++ ++HGNVYP G++ VTM IG PA+PYFLD+DTGS LTWLQCD PC+ C +A     
Sbjct: 22  SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFY 81

Query: 90  --------PHPLYRPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSL 138
                   PH LY+P     V C +  CA L+A       C    QC Y ++Y  GGSS+
Sbjct: 82  PRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSI 140

Query: 139 GVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQ 193
           GVL+ D+F+   +NG   NP  +A GCGYNQ  G + H    P++GILGLG+GK +++SQ
Sbjct: 141 GVLIVDSFSLPASNGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQ 196

Query: 194 LHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
           L SQ +I ++V+GHC+S  G GFLFFGD    +S V W+ M+ ++ K+YSP    L F  
Sbjct: 197 LKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNS 255

Query: 253 ETTGLKNLP--VVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCW 307
            +  +   P  V+FDSG++YTY     Y    S++K  LS   K L E  E D  L +CW
Sbjct: 256 NSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 315

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-- 365
           KG+   + + +VKKCFR+L+L F DG  +   E+ PE YLIIS +G+VCLGIL+G++   
Sbjct: 316 KGKDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHP 375

Query: 366 GLQDLNVIGGI 376
            L   N+IGGI
Sbjct: 376 SLAGTNLIGGI 386


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 167/356 (46%), Positives = 222/356 (62%), Gaps = 27/356 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA---PCVRCVEAPHPL 93
           ++F++ G+V+PTG++ VTM IG+PA+PYFLD+DTGS+LTW++C A   PC  C + PHPL
Sbjct: 26  MVFKLGGDVHPTGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPL 85

Query: 94  YRPSNDLVPCEDPICASLHAP--GHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
           YRP   LVPC DP+C +LH       +C E+P QC Y++ YADG +SLGVL+ D F+   
Sbjct: 86  YRPKK-LVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPT 144

Query: 151 TNGQRLNPRLALGCGYNQVPGASYH-----PLDGILGLGKGKSSIVSQL-HSQKLIRNVV 204
            + +     +A GCGY+Q+ G         P+DGILGLG+G   +VSQL HS  + +NV+
Sbjct: 145 GSAR----NIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVI 200

Query: 205 GHCLSGGGGGFLFFGDDLYDSS--RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV 262
           GHCLS  GGG+LF G++   SS   +++    S    +YSPG A L  G    G K    
Sbjct: 201 GHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKA 260

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET-LPLCWKGRRPFKNVHDVKK 321
           +FDSGS+YTYL    +  L S +K  L   SLK   + +T L LCWKG +PFK VHD+ K
Sbjct: 261 IFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPK 320

Query: 322 CFRTLA-LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F++L  L F  G T T   + PE YLII+  GN C GIL   E+   DL VIGGI
Sbjct: 321 EFKSLVTLKFDHGVTMT---IPPENYLIITGHGNACFGIL---ELPGYDLFVIGGI 370


>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
          Length = 775

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 171/390 (43%), Positives = 231/390 (59%), Gaps = 38/390 (9%)

Query: 5   HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
           H   N    T R  SS + + ++ L   VG+           P  ++ +TM IG PA+ Y
Sbjct: 369 HETPNRKVGTARQPSSPAPTGAAILCRGVGA-----------PRHFF-ITMNIGDPAKSY 416

Query: 65  FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVPCEDPICASLHAP--GHHNCED 121
           FLD+DTGS LTWLQCDAPC  C   PH LY+P+   LV C D +C  L+        C  
Sbjct: 417 FLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLCTDLYTDLGKPKRCGS 476

Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQ------VPGASY 174
             QCDY ++Y D  SS+GVLV D F+ + +NG   NP  +A GCGY+Q      VP    
Sbjct: 477 QKQCDYVIQYVDS-SSMGVLVIDRFSLSASNGT--NPTTIAFGCGYDQGKKNRNVP---- 529

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM 233
            P+D ILGL +GK +++SQL SQ +I ++V+GHC+S  GGGFLFFGD    +S V WT M
Sbjct: 530 IPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPM 589

Query: 234 SSDYTKYYSPGVAELFFGGETTGLKNLP--VVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
           + ++ KYYSPG   L F   +  +   P  V+FDSG++YTY     YQ   S++K  L++
Sbjct: 590 NREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNS 648

Query: 292 --KSLKEAPE-DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
             K L E  E D  L +CWKG+     + +VKKCFR+L+L F DG  +   E+ PE YLI
Sbjct: 649 ECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLI 708

Query: 349 ISNKGNVCLGILNGAE--VGLQDLNVIGGI 376
           IS +G+VCLGIL+G++  + L   N+IGGI
Sbjct: 709 ISQEGHVCLGILDGSKEHLSLAGTNLIGGI 738



 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 125/284 (44%), Positives = 173/284 (60%), Gaps = 25/284 (8%)

Query: 100 LVPCEDPICASLHAPGHH---NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           +V  +DP+  +LH  G     N   P QCDYE++YADG S++G L+ D F+      +  
Sbjct: 1   MVRADDPLYVALHEDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSLPRIATR-- 58

Query: 157 NPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGG 212
            P L  GCGYNQ  G ++    P++GILGL +GK S VSQL    +I ++VVGHCLS GG
Sbjct: 59  -PNLPFGCGYNQGIGENFQQTSPVNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGG 117

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
           GG LF GD   D + V+       +  YYSPG A L+F   + G+  + VVFDSGS+YTY
Sbjct: 118 GGLLFVGDG--DGNLVLL------HANYYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTY 169

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
                YQ     +K  LS+ SL++   D +LPLCWKG++ F++V DVKK F++L L+F +
Sbjct: 170 FTAQPYQATVYAIKGGLSSTSLEQV-SDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN 228

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
                + E+ PE YLI++  GNVCLGIL+G  +   + N+IG I
Sbjct: 229 ---NAVMEIPPENYLIVTEYGNVCLGILHGCRL---NFNIIGDI 266


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score =  287 bits (735), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 151/367 (41%), Positives = 211/367 (57%), Gaps = 15/367 (4%)

Query: 25  SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV 84
           S +S+      +  + + GN+YP G Y + M IG PA+ Y+LD+DTGSDLTWLQCDAPC 
Sbjct: 5   SKASVPETAQRTAAYPIGGNIYPDGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCR 64

Query: 85  RCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLV 142
            C   PH LY P    +V C  P CA +   G   C  D  QCDYE++Y DG S++G+LV
Sbjct: 65  SCAVGPHGLYDPKRARVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILV 124

Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLI 200
           +D      TNG R   R  +GCGY+Q    +  P   DG++GL   K S+ SQL ++ + 
Sbjct: 125 EDTITLVLTNGTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIA 184

Query: 201 RNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGL 257
            NV+GHCL+GG  GGG+LFFGD L  +  + WT M      + Y   +  + +GGE   L
Sbjct: 185 NNVIGHCLAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLEL 244

Query: 258 KNLP-----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
           +         +FDSG+S+TYL    Y  + S + ++     L+    D TLP CW+G  P
Sbjct: 245 EGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSP 304

Query: 313 FKNVHDVKKCFRTLALSF---TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD 369
           F++V DV   F+T+ L F   T   +  L EL+PE YLI+S +GNVCLG+L+ +   L+ 
Sbjct: 305 FESVADVSAYFKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEV 364

Query: 370 LNVIGGI 376
            N++G I
Sbjct: 365 TNILGDI 371


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score =  284 bits (727), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 209/353 (59%), Gaps = 16/353 (4%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           +++  Q+ GN+YP G Y + M IG PA+ Y+LD+DTGSDLTWLQCDAPC  C   PH LY
Sbjct: 7   ATVFSQLRGNIYPDGLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLY 66

Query: 95  RPSN-DLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
            P    LV C  P+CA +   G + C  P  QCDY++EYADG S++GVL++D      TN
Sbjct: 67  DPKKARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTN 126

Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G R      +GCGY+Q    +  P   DG++GL   K S+ SQL  + ++RNV+GHCL+G
Sbjct: 127 GTRSKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAG 186

Query: 211 G--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
           G  GGG+LFFGD L  +  + WT +     K  +  +       +        V+FDSG+
Sbjct: 187 GSNGGGYLFFGDSLVPALGMTWTPIMG---KSITGNIGGKSGDADDKTGDIGGVMFDSGT 243

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           S+TYL    Y  + S M+ ++    L     D TLP CW+G  PF++V DV++ F+T+ L
Sbjct: 244 SFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFESVADVQRYFKTVTL 303

Query: 329 SFTDGK-----TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F  GK        + EL+PE YLI+S +GNVCLGIL+ +   L+  N+IG +
Sbjct: 304 DF--GKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGDV 354


>gi|356507650|ref|XP_003522577.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Glycine max]
          Length = 326

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 160/330 (48%), Positives = 203/330 (61%), Gaps = 28/330 (8%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
           +++ I   +  Y LD+DTGSDLTW Q DAPC  C      L +P   LV C D +CA++H
Sbjct: 1   MSITITSSSELYELDIDTGSDLTWFQWDAPCQGCTLPRDKLNKPHCKLVKCGDRLCAAIH 60

Query: 113 APGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG 171
           +     C DP  QCDYE+EYAD GSSLGVLV D  A  +T+G    P LA        P 
Sbjct: 61  S---EPCADPDEQCDYEVEYADQGSSLGVLVLDNIALKFTSGSLARPILA-------APD 110

Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWT 231
                    +GL  GK+SI+SQLHS  LIRNVVGHCLS  GGGFLFFGD L   S VVWT
Sbjct: 111 ---------MGLATGKTSILSQLHSLGLIRNVVGHCLSRRGGGFLFFGDQLIPQSGVVWT 161

Query: 232 SM----SSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMK 286
            +    S  YT+ +Y  G A++FF G+ T +K L + FDSGSSYT  N   ++ L  ++ 
Sbjct: 162 PLLQNSSVTYTRPHYKTGPADMFFNGKATSVKGLELTFDSGSSYTXFNSHAHKALVGLIT 221

Query: 287 KELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
            ++  KS   A ED +LP+CWK  + FK++HDV   F+ +ALSFT  K  +L +L PEAY
Sbjct: 222 NDIKGKSFSRATEDPSLPICWKNPKTFKSLHDVTNYFKPIALSFTKSK-NSLLQLPPEAY 280

Query: 347 LIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           LI    GNVCLGIL+G E+GL + N+IG I
Sbjct: 281 LI--KYGNVCLGILDGTEIGLGNTNIIGDI 308


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 153/368 (41%), Positives = 216/368 (58%), Gaps = 24/368 (6%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           SS+ NH   S+ F V GN+YP G Y + + +G P + YFLD+DTGSDLTW QCDAPC  C
Sbjct: 19  SSVGNH---SVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNC 75

Query: 87  VEAPHPLYRPSN-DLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKD 144
              PH LY P    +V C  P+CA +   G + C  D  QCDYE+EYADG S++GVLV+D
Sbjct: 76  AIGPHGLYNPKKAKVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVED 135

Query: 145 AFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRN 202
                 TNG  +  +  +GCGY+Q    +  P   DG++GL   K ++ +QL  + +I+N
Sbjct: 136 TLTVRLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKN 195

Query: 203 VVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY-YSPGVAELFFGGETTGLKN 259
           V+GHCL+ G  GGG+LFFGD+L  S  + WT M        Y   +  + +GG++  L N
Sbjct: 196 VLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLNN 255

Query: 260 --------LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
                     V+FDSG+S+TYL    Y ++ S + K+     L     D TLP CW+G  
Sbjct: 256 DEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQ---SGLLRVKSDTTLPYCWRGPS 312

Query: 312 PFKNVHDVKKCFRTLALSFTDGK---TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
           PF+++ DV + F+TL L F       T +  +L+P+ YLI+S +GNVCLGIL+ +   L+
Sbjct: 313 PFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQGNVCLGILDASGASLE 372

Query: 369 DLNVIGGI 376
             N+IG +
Sbjct: 373 VTNIIGDV 380


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 152/357 (42%), Positives = 206/357 (57%), Gaps = 21/357 (5%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+ L  + GNV+P G Y  ++++G P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 171 STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 230

Query: 95  RPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           +P+ + +VP  D +C  L   G+ N CE   QCDYE+EYAD  SS+GVL +D      TN
Sbjct: 231 KPTKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHLIATN 288

Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS- 209
           G R       GC Y+Q       P   DGILGL     S+ SQL S  +I N+ GHC++ 
Sbjct: 289 GGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITR 348

Query: 210 -GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVV 263
             GGGG++F GDD      + WTS+ S     Y      + +G +   ++      + V+
Sbjct: 349 EQGGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVI 408

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYL    Y+ L + +K   ++    +   D TLPLCWK   P + + DVK+ F
Sbjct: 409 FDSGSSYTYLPDEIYENLVAAIK--YASPGFVQDSSDRTLPLCWKADFPVRYLEDVKQFF 466

Query: 324 RTLALSFTDGKT----RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + L L F  GK        F ++PE YLIIS+KGNVCLG+LNG E+      ++G +
Sbjct: 467 KPLNLHF--GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 159/380 (41%), Positives = 218/380 (57%), Gaps = 23/380 (6%)

Query: 1   MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQP 60
           + +S N +++  P      +SS++++      V SS +F V GNVYP G Y   + +G P
Sbjct: 164 LVASVNDDDVIVPNRNYKLASSNAAA------VDSSSVFPVRGNVYPDGLYFTYILVGNP 217

Query: 61  ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN- 118
            RPY+LD+DT SDLTW+QCDAPC  C +  + LY+P  D +V  +D +C  LH       
Sbjct: 218 PRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGY 277

Query: 119 CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPL- 177
           CE   QCDYE+EYAD  SS+GVL +D       NG   N +   GC Y+Q  G   + L 
Sbjct: 278 CETCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQ-QGLLLNTLV 336

Query: 178 --DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM 233
             DGILGL K K S+ SQL ++ +I NVVGHCL+    GGG++F GDD      + W  M
Sbjct: 337 KTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPM 396

Query: 234 -SSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKK 287
             S     Y   + +L +G     L     +   +VFDSGSSYTY  +  Y  L + + K
Sbjct: 397 LDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASL-K 455

Query: 288 ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG--KTRTLFELTPEA 345
           ++S ++L +   D TLP CW+ + P ++V DVK+ F+TL L F        T F + PE 
Sbjct: 456 QVSGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEG 515

Query: 346 YLIISNKGNVCLGILNGAEV 365
           YLIISNKGNVCLGIL+G++V
Sbjct: 516 YLIISNKGNVCLGILDGSDV 535


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 211/376 (56%), Gaps = 27/376 (7%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM  + ++++ ++      S+ L  + GNV+P G Y  +++IG P RPYFLD+DTGSDLT
Sbjct: 158 RMEVAKAATARTN------STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLT 211

Query: 76  WLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
           W+QCDAPC  C + PHPLY+P+ + +VP  D +C  L   G+ N CE   QCDYE+EYAD
Sbjct: 212 WIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYAD 269

Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIV 191
             SS+GVL +D      TNG R       GC Y+Q       P   DGILGL     S  
Sbjct: 270 QSSSMGVLARDDMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFP 329

Query: 192 SQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
           SQL S  +I NV GHC++   GGGG++F GDD      V WTS+ S     Y      + 
Sbjct: 330 SQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVK 389

Query: 250 FGGET-----TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           +G +           + V+FDSGSSYTYL    Y+ L + +K   ++    +   D TLP
Sbjct: 390 YGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIK--YASPGFVQDTSDRTLP 447

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKT----RTLFELTPEAYLIISNKGNVCLGIL 360
           LCWK   P + + DVK+ F  L L F  GK        F ++PE YLIIS+KGNVCLG+L
Sbjct: 448 LCWKADFPVRYLEDVKQFFEPLNLHF--GKKWLFMSKTFTISPEDYLIISDKGNVCLGLL 505

Query: 361 NGAEVGLQDLNVIGGI 376
           NG E+      ++G +
Sbjct: 506 NGTEINHGSTIIVGDV 521


>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
          Length = 245

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 134/206 (65%), Positives = 169/206 (82%), Gaps = 2/206 (0%)

Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWT 231
           +SYHPLDG+LGLG+GKSS+VSQL+SQ L+RNVVGHCLS  GGG++FFGD +YDSSR+ WT
Sbjct: 7   SSYHPLDGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGD-VYDSSRLTWT 65

Query: 232 SMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
            MSS   K+Y  G AEL FGG+ TG+  L  VFD+GSSYTY N   YQ + S +KKEL+ 
Sbjct: 66  PMSSRDLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAG 125

Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT-DGKTRTLFELTPEAYLIIS 350
           K LKEAP+D+TLPLCW G+RPF++V++V+K F+++ALSFT  G+T T FE+ PEAYLI+S
Sbjct: 126 KPLKEAPDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVS 185

Query: 351 NKGNVCLGILNGAEVGLQDLNVIGGI 376
           N GNVCLGIL+G+EVG+ DLN+IG I
Sbjct: 186 NMGNVCLGILDGSEVGMGDLNLIGDI 211


>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
           partial [Brachypodium distachyon]
          Length = 354

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 154/347 (44%), Positives = 204/347 (58%), Gaps = 43/347 (12%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS++F++HG+VYPTG+  VTM IG+  +PYFLD+DTGS LTWL+     VR         
Sbjct: 20  SSMVFELHGDVYPTGHIYVTMSIGEQEKPYFLDIDTGSTLTWLED----VRF-------- 67

Query: 95  RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
                                 H+C E+P QCDY++ YA G SSLGVL+ D F+     G
Sbjct: 68  ---------------------KHDCKENPNQCDYDVRYAGGESSLGVLIADKFSLP---G 103

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGG 212
           +   P L  GCGY+Q  G +  P+DG+LG+G+G   + SQL  Q  I  NV+GHCL   G
Sbjct: 104 RDARPTLTFGCGYDQEGGKAEMPVDGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQG 163

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET---TGLKNLPVVFDSGSS 269
           GG+LFFG +   SS V W  M  +   YYSPG+A L F G       +  + VV DSGS+
Sbjct: 164 GGYLFFGHEKVPSSVVTWVPMVPN-NHYYSPGLAALHFNGNLGNPISVAPMEVVIDSGST 222

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YTY+   TY+ L  ++   LS  SL     D  LP+CW G+ PFK + DVK  F+ L L+
Sbjct: 223 YTYMPTETYRRLVFVVIASLSKSSLTLV-RDPALPVCWAGKEPFKXIGDVKDKFKPLELA 281

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           F  G ++ + E+ PE YLIIS +GNVC+GIL+G + GL+ LNVIG I
Sbjct: 282 FIQGTSQAIMEIPPENYLIISGEGNVCMGILDGTQAGLRKLNVIGDI 328


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 155/366 (42%), Positives = 212/366 (57%), Gaps = 17/366 (4%)

Query: 24  SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           ++S S F+   SS +F V G+VYP G Y   +++G P R YFLD+DTGSDLTW+QCDAPC
Sbjct: 290 ATSVSAFD---SSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC 346

Query: 84  VRCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVL 141
             C + P+PLY+P   +LVP +D +C  +        CE   QCDYE+EYAD  SS+GVL
Sbjct: 347 TSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVL 406

Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             D       NG      +  GC Y+Q  +   S    DGILGL K K S+ SQL SQ++
Sbjct: 407 ASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRI 466

Query: 200 IRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
           I NV+GHCL+    GGG++F GDD      + W  M + ++  Y   + ++  G     L
Sbjct: 467 INNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSL 526

Query: 258 -----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
                +   VVFD+GSSYTY  +  Y  L + + K++S + L +   D TLP+CW+ + P
Sbjct: 527 GRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDEGLIQDGSDPTLPVCWRAKFP 585

Query: 313 FKNVHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDL 370
            ++V DVK+ F+ L L F        T F + PE YLIISNKGNVCLGIL+G+ V     
Sbjct: 586 IRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGST 645

Query: 371 NVIGGI 376
            ++G I
Sbjct: 646 IILGDI 651


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 155/366 (42%), Positives = 212/366 (57%), Gaps = 17/366 (4%)

Query: 24  SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           ++S S F+   SS +F V G+VYP G Y   +++G P R YFLD+DTGSDLTW+QCDAPC
Sbjct: 77  ATSVSAFD---SSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC 133

Query: 84  VRCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVL 141
             C + P+PLY+P   +LVP +D +C  +        CE   QCDYE+EYAD  SS+GVL
Sbjct: 134 TSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVL 193

Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             D       NG      +  GC Y+Q  +   S    DGILGL K K S+ SQL SQ++
Sbjct: 194 ASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRI 253

Query: 200 IRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
           I NV+GHCL+    GGG++F GDD      + W  M + ++  Y   + ++  G     L
Sbjct: 254 INNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSL 313

Query: 258 -----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
                +   VVFD+GSSYTY  +  Y  L + + K++S + L +   D TLP+CW+ + P
Sbjct: 314 GRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDEGLIQDGSDPTLPVCWRAKFP 372

Query: 313 FKNVHDVKKCFRTLALSFTDGK--TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDL 370
            ++V DVK+ F+ L L F        T F + PE YLIISNKGNVCLGIL+G+ V     
Sbjct: 373 IRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGST 432

Query: 371 NVIGGI 376
            ++G I
Sbjct: 433 IILGDI 438


>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1388

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 162/363 (44%), Positives = 216/363 (59%), Gaps = 25/363 (6%)

Query: 33  VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
           V SS +F V GNVYP G Y   + +G P + YFLD+DTGSDLTW+QCDAPC+ C +  H 
Sbjct: 174 VDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHV 233

Query: 93  LYRPS-NDLVPCEDPICASL---HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
           LY+P+ +++V   D +C  +      GHH+ E   QCDYE++YAD  SSLGVLV+D    
Sbjct: 234 LYKPTRSNVVSSVDALCLDVQKNQKNGHHD-ESLLQCDYEIQYADHSSSLGVLVRDELHL 292

Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVG 205
             TNG +    +  GCGY+Q  G   + L   DGI+GL + K S+  QL S+ LI+NVVG
Sbjct: 293 VTTNGSKTKLNVVFGCGYDQA-GLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVG 351

Query: 206 HCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSP------GVAELFFGGETT 255
           HCLS  G GGG++F GDD      + W  M+   T   Y +       G  +L F G++ 
Sbjct: 352 HCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSK 411

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
             K   +VFDSGSSYTY  +  Y  L + +  E+S   L +   D TLP+CW+   P K+
Sbjct: 412 VGK---MVFDSGSSYTYFPKEAYLDLVASL-NEVSGLGLVQDDSDTTLPICWQANFPIKS 467

Query: 316 VHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
           V DVK  F+TL L F        TLF+++PE YLIISNKG+VCLGIL+G+ V      ++
Sbjct: 468 VKDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSSIIL 527

Query: 374 GGI 376
           G I
Sbjct: 528 GDI 530


>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
 gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
          Length = 408

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 163/343 (47%), Positives = 208/343 (60%), Gaps = 26/343 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC---DAPCVRCVEAPHPL 93
           ++F++ G+VYP G++ VTM IG+PA PYFLD+DTGS  TWL+C   D PC  C + PHPL
Sbjct: 25  MVFKLDGSVYPVGHFYVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPL 84

Query: 94  YRPS-NDLVPCEDPICASLHAP--GHHNCED--PAQCDYELEYADGGSSLGVLVKDAFAF 148
           YR +   LVPC DP+C +LH        C D    QCDY+++Y DG SSLGVL+ D F+ 
Sbjct: 85  YRLTRKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFSL 144

Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYH-----PLDGILGLGKGKSSIVSQL-HSQKLIRN 202
             T G R    +A GCGY+Q+ G+        P+DGILGLG+G   + SQL HS  + +N
Sbjct: 145 P-TGGAR---NIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKN 200

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY---TKYYSPGVAELFFGGETTGLKN 259
           V+GHCLS  GGG+LF G++   SS V W  M+        +YSPG A L       G K 
Sbjct: 201 VIGHCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKP 260

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
           L  +FDSGS+YTYL    +  L S +K  LS  SLK+   D  LPLCWKG +PFK VHD 
Sbjct: 261 LKAIFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQV-SDPALPLCWKGPKPFKTVHDT 319

Query: 320 KKCFRTLA-LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
            K F++L  L F  G T     + PE YLII+  GN C GIL+
Sbjct: 320 PKEFKSLVTLKFDLGVTMI---IPPENYLIITGHGNACFGILD 359


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score =  274 bits (701), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 156/376 (41%), Positives = 210/376 (55%), Gaps = 27/376 (7%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM  + ++++ ++      S+ L  + GNV+P G Y  +++IG P RPYFLD+DTGSDLT
Sbjct: 158 RMEVAKAATARTN------STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLT 211

Query: 76  WLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
           W+QCDAPC    + PHPLY+P+ + +VP  D +C  L   G+ N CE   QCDYE+EYAD
Sbjct: 212 WIQCDAPCTNFAKGPHPLYKPAKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYAD 269

Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIV 191
             SS+GVL +D      TNG R       GC Y+Q       P   DGILGL     S  
Sbjct: 270 QSSSMGVLARDDMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFP 329

Query: 192 SQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
           SQL S  +I NV GHC++   GGGG++F GDD      V WTS+ S     Y      + 
Sbjct: 330 SQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVK 389

Query: 250 FGGET-----TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           +G +           + V+FDSGSSYTYL    Y+ L + +K   ++    +   D TLP
Sbjct: 390 YGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIK--YASPGFVQDTSDRTLP 447

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKT----RTLFELTPEAYLIISNKGNVCLGIL 360
           LCWK   P + + DVK+ F  L L F  GK        F ++PE YLIIS+KGNVCLG+L
Sbjct: 448 LCWKADFPVRYLEDVKQFFEPLNLHF--GKKWLFMSKTFTISPEDYLIISDKGNVCLGLL 505

Query: 361 NGAEVGLQDLNVIGGI 376
           NG E+      ++G +
Sbjct: 506 NGTEINHGSTIIVGDV 521


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 148/355 (41%), Positives = 203/355 (57%), Gaps = 17/355 (4%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S++L  + GNV+P G Y  ++++G P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 178 STVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 237

Query: 95  RPSND-LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P+ + +VP  D +C  L    ++ C    QCDYE+EYAD  SS+GVL KD      TNG
Sbjct: 238 KPAKEKIVPPRDLLCQELQGDQNY-CATCKQCDYEIEYADRSSSMGVLAKDDMHMIATNG 296

Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
            R       GC Y+Q       P   DGILGL     S+ SQL SQ +I NV GHC++  
Sbjct: 297 GREKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKE 356

Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVF 264
             GGG++F GDD      + W  +       Y     ++ +G +   +      ++ V+F
Sbjct: 357 PNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIF 416

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSGSSYTYL    Y+ L + +K +    S  +   D TLPLCWK     + + DVK+ F+
Sbjct: 417 DSGSSYTYLPDEIYKKLVTAIKYDYP--SFVQDTSDTTLPLCWKADFDVRYLEDVKQFFK 474

Query: 325 TLALSFTDG---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            L L F +      RT F + P+ YLIIS+KGNVCLG+LNGAE+      ++G +
Sbjct: 475 PLNLHFGNRWFVIPRT-FTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDV 528


>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
          Length = 538

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 201/358 (56%), Gaps = 23/358 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS L  + GNV+P G Y  +MYIG P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 143 SSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 202

Query: 95  RPSN-DLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           +P   ++VP  D  C  L   G+ N  D + QCDYE+ YAD  SS+G+L +D       +
Sbjct: 203 KPEKPNVVPPRDSYCQELQ--GNQNYGDTSKQCDYEITYADRSSSMGILARDNMQLITAD 260

Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G+R N     GCGY+Q       P   DGILGL     S+ +QL SQ +I NV GHC++ 
Sbjct: 261 GERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAA 320

Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVV 263
               GG++F GDD      + W  + +     YS  V ++ +G +   ++        V+
Sbjct: 321 DPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVI 380

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYL    Y  L + +K    +    E+  D TLP C K   P +++ DVK  F
Sbjct: 381 FDSGSSYTYLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLF 438

Query: 324 RTLALSFTDGKTRTL-----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + L+L F   K R       F + PE YLIIS+K N+CLG+L+G E+G     VIG +
Sbjct: 439 KPLSLVF---KKRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDV 493


>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
 gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 538

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 201/358 (56%), Gaps = 23/358 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS L  + GNV+P G Y  +MYIG P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 143 SSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 202

Query: 95  RPSN-DLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           +P   ++VP  D  C  L   G+ N  D + QCDYE+ YAD  SS+G+L +D       +
Sbjct: 203 KPEKPNVVPPRDSYCQELQ--GNQNYGDTSKQCDYEITYADRSSSMGILARDNMQLITAD 260

Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G+R N     GCGY+Q       P   DGILGL     S+ +QL SQ +I NV GHC++ 
Sbjct: 261 GERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAA 320

Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVV 263
               GG++F GDD      + W  + +     YS  V ++ +G +   ++        V+
Sbjct: 321 DPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVI 380

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYL    Y  L + +K    +    E+  D TLP C K   P +++ DVK  F
Sbjct: 381 FDSGSSYTYLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLF 438

Query: 324 RTLALSFTDGKTRTL-----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + L+L F   K R       F + PE YLIIS+K N+CLG+L+G E+G     VIG +
Sbjct: 439 KPLSLVF---KKRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDV 493


>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1336

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 158/363 (43%), Positives = 212/363 (58%), Gaps = 25/363 (6%)

Query: 33  VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
           V SS +F V GNVYP G Y   + +G P + YFLD+DTGSDLTW+QCDAPC  C +  H 
Sbjct: 176 VDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHV 235

Query: 93  LYRPS-NDLVPCEDPICASL---HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
            Y+P+ +++V   D +C  +      GHH+ E   QCDYE++YAD  SSLGVLV+D    
Sbjct: 236 QYKPTRSNVVSSVDSLCLDVQKNQKNGHHD-ESLLQCDYEIQYADHSSSLGVLVRDELHL 294

Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVG 205
             TNG +    +  GCGY+Q  G   + L   DGI+GL + K S+  QL S+ LI+NVVG
Sbjct: 295 VTTNGSKTKLNVVFGCGYDQ-EGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVG 353

Query: 206 HCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSP------GVAELFFGGETT 255
           HCLS  G GGG++F GDD      + W  M+   T   Y +       G  +L F G++ 
Sbjct: 354 HCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSK 413

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
             K   V FDSGSSYTY  +  Y  L + +  E+S   L +   D TLP+CW+     ++
Sbjct: 414 VGK---VFFDSGSSYTYFPKEAYLDLVASL-NEVSGLGLVQDDSDTTLPICWQANFQIRS 469

Query: 316 VHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
           + DVK  F+TL L F        TLF++ PE YLIISNKG+VCLGIL+G++V      ++
Sbjct: 470 IKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDGSSIIL 529

Query: 374 GGI 376
           G I
Sbjct: 530 GDI 532


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 145/352 (41%), Positives = 198/352 (56%), Gaps = 21/352 (5%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S++L  + GNV+P G Y  ++++G P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 175 STVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 234

Query: 95  RPSND-LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P+ + +VP  D +C  L    ++ CE   QCDYE+EYAD  SS+GVL KD      TNG
Sbjct: 235 KPAKEKIVPPRDSLCQELQGDQNY-CETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNG 293

Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-- 209
            R       GC Y+Q       P   DGILGL     S+ SQL S+ +I NV GHC++  
Sbjct: 294 GREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRE 353

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGS 268
             GGG++F GDD      + W  +       Y     ++ +G +     N + V+FDSGS
Sbjct: 354 TNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGS 413

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           SYTYL    Y+ L   +K++  + S  +   D TLPLCWK          V+  F+ L L
Sbjct: 414 SYTYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKAD------FSVRSFFKPLNL 465

Query: 329 SFTDGK----TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F  G+        F + P+ YLIIS+KGNVCLG+LNG E+      ++G +
Sbjct: 466 HF--GRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 515


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 148/356 (41%), Positives = 206/356 (57%), Gaps = 23/356 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+ L  + GNV+P G Y  ++++G P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 187 STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 246

Query: 95  RPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           +P+ + +VP +D +C  L   G+ N CE   QCDYE+EYAD  SS+GVL +D      TN
Sbjct: 247 KPAKEKIVPPKDLLCQELQ--GNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTN 304

Query: 153 GQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G R       GC Y+Q     AS    DGILGL     S+ SQL +Q +I NV GHC++ 
Sbjct: 305 GGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITR 364

Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK-----NLPVV 263
              GGG++F GDD      +  T + S     +     ++++G +   ++     ++ V+
Sbjct: 365 DPNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVI 424

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYL    Y+ L + +K   +  +  +   D TLPLC     P + + DVK+ F
Sbjct: 425 FDSGSSYTYLPDEIYKNLIAAIK--YAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLF 482

Query: 324 RTLALSFTDGKT-----RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           + L L F  GK      RT F + P+ YLIIS+KGNVCLG LNG ++      ++G
Sbjct: 483 KPLNLHF--GKRWFVMPRT-FTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVG 535


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 148/356 (41%), Positives = 206/356 (57%), Gaps = 23/356 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+ L  + GNV+P G Y  ++++G P RPYFLD+DTGSDLTW+QCDAPC  C + PHPLY
Sbjct: 188 STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 247

Query: 95  RPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           +P+ + +VP +D +C  L   G+ N CE   QCDYE+EYAD  SS+GVL +D      TN
Sbjct: 248 KPAKEKIVPPKDLLCQELQ--GNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTN 305

Query: 153 GQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G R       GC Y+Q     AS    DGILGL     S+ SQL +Q +I NV GHC++ 
Sbjct: 306 GGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITR 365

Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK-----NLPVV 263
              GGG++F GDD      +  T + S     +     ++++G +   ++     ++ V+
Sbjct: 366 DPNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVI 425

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSGSSYTYL    Y+ L + +K   +  +  +   D TLPLC     P + + DVK+ F
Sbjct: 426 FDSGSSYTYLPDEIYKNLIAAIK--YAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLF 483

Query: 324 RTLALSFTDGKT-----RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           + L L F  GK      RT F + P+ YLIIS+KGNVCLG LNG ++      ++G
Sbjct: 484 KPLNLHF--GKRWFVMPRT-FTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVG 536


>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 141/346 (40%), Positives = 188/346 (54%), Gaps = 62/346 (17%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS++  + GNV+P GYY+V + IG P + +  D+DTGSDLTW+QCDAPC  C   P   Y
Sbjct: 38  SSVVLPLSGNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQY 97

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P  + VPC DPIC +LH P    C +P  QCDYE+ YAD GSS+G LV D F     NG
Sbjct: 98  KPKGNTVPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNG 157

Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
             + PRLA GCGY+Q+   ++ P    G+LGLG+GK  ++ QL +  L RNVVGHCLS  
Sbjct: 158 SAMQPRLAFGCGYDQILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSK 217

Query: 212 GGGFLFFGDDLYDSSRVVWTS-MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           GGG+LFFGD L  +  V WT  +S +YT ++                             
Sbjct: 218 GGGYLFFGDTLIPTLGVAWTPLLSPEYTFFF----------------------------- 248

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
            ++ R   Q   +  K  L  K+                   FK +           ++F
Sbjct: 249 -HICRDRLQRDYTFFKSVLEFKNF------------------FKTI----------TINF 279

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           T+ +  T  ++ PE+YLIIS  GN CLG+LNG+EVGLQ+ NVIG I
Sbjct: 280 TNARRITQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDI 325


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  253 bits (647), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 199/345 (57%), Gaps = 17/345 (4%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           SS +F V G++YP G Y   + +G+P RPYFLD+DTGSDLTW+QCDAPC  C +   PLY
Sbjct: 183 SSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLY 242

Query: 95  RPSND-LVPCEDPICASLHAP-GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
           +P  + +V  +D +C  +        C    QC+YE++YAD  SSLGVLVKD F   ++N
Sbjct: 243 KPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSN 302

Query: 153 GQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G         GC Y+Q  +   +    DGILGL + K S+ SQL S+ +I NVVGHCL+G
Sbjct: 303 GSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTG 362

Query: 211 --GGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGG-----ETTGLKNLPV 262
              GGG+LF GDD      + W +M  S    +Y   V  + +G      +T G     V
Sbjct: 363 DPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQV 422

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           VFDSGSSYTY  +  Y  L + + +E+SA  L    +D +  +CWK  +  ++V DVK  
Sbjct: 423 VFDSGSSYTYFTKEAYYQLVANL-EEVSAFGL--ILQDSSDTICWKTEQSIRSVKDVKHF 479

Query: 323 FRTLALSFTD--GKTRTLFELTPEAYLIISNKGNVCLGILNGAEV 365
           F+ L L F        T   + PE YL+I+ +GNVCLGIL+G++V
Sbjct: 480 FKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQV 524


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 146/370 (39%), Positives = 204/370 (55%), Gaps = 38/370 (10%)

Query: 25  SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA-PC 83
            +S+LF H        + GN++P G Y   + +G P RPYFLD+DTGS  TW+QCDA PC
Sbjct: 141 QNSTLFPH-------SLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPC 193

Query: 84  VRCVEAPHPLYRPSN--DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVL 141
             C +  HPLYRP+   D +P  DP+C           E+P QCDYE+ YADG SS+GV 
Sbjct: 194 ASCAKGAHPLYRPARTADALPASDPLCEGAQH------ENPNQCDYEISYADGSSSMGVY 247

Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           V+D+  F   +G+R N  +  GCGY+Q  V   +    DG+LGL     S+ +QL S+ +
Sbjct: 248 VRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGI 307

Query: 200 IRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTSMSS--------DYTKYYSPGVAEL 248
           I N  GHC+S    G GG+LF GDD      + W  +             K  + G  +L
Sbjct: 308 ISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQL 367

Query: 249 FFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
              G+ T      VVFD+GS+YTY        L S +K+  S + +++   D+TLP C K
Sbjct: 368 NAQGKLTQ-----VVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDD-SDKTLPFCMK 421

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
              P ++V DVK  F+ L+L F      +RT F + PE YL+IS+KGNVCLG+LNG  +G
Sbjct: 422 SDFPVRSVEDVKHFFKPLSLQFEKRFFFSRT-FNIRPEHYLVISDKGNVCLGVLNGTTIG 480

Query: 367 LQDLNVIGGI 376
              + ++G +
Sbjct: 481 YDSVVIVGDV 490


>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 144/353 (40%), Positives = 201/353 (56%), Gaps = 22/353 (6%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPH 91
           ++ F + GNVYP G++  T+ IG+PA+PYFLD+DTGS+LTWL+C  P   C       PH
Sbjct: 23  AIKFPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPH 82

Query: 92  PLYRPS--NDLVPCEDPICASLH--APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDA 145
           P Y P+  N  V C  P+C ++    PG   C   DP +C YE++Y  G S  G L  D 
Sbjct: 83  PYYTPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKSE-GDLATDI 141

Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-N 202
            + N     R   R+A GCGY Q   A     P+DGILGLG GK+ + +QL   K+I+ N
Sbjct: 142 ISVN----GRDKKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKEN 197

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLP 261
           V+GHCLS  G G L+ GD    +  V W  M      YYSPG+AE+F   +   G     
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFE 256

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            VFDSGS+YT++    Y  + S ++  LS  SL+E  +   LPLCWKG++PF +V+DVK 
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKN 315

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA-EVGLQDLNVI 373
            F+ L+L  T  +  +  ++ P+ YL +   G  CL IL+ + +  L++LN I
Sbjct: 316 QFKALSLKITHARGTSNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFI 368


>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 144/353 (40%), Positives = 199/353 (56%), Gaps = 22/353 (6%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPH 91
           ++ F + GNVYP G++  T+ IG+PA+PYFLD+DTGS+LTWL+C  P   C       PH
Sbjct: 23  AIKFPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPH 82

Query: 92  PLYRPS--NDLVPCEDPICASLH--APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDA 145
           P Y P+  N  V C  P+C ++    PG   C   DP +C YE++Y  G S  G L  D 
Sbjct: 83  PYYTPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKSE-GDLATDI 141

Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-N 202
            + N     R   R+A GCGY Q   A     P+DGILGLG GK+   +QL   K+I+ N
Sbjct: 142 ISVN----GRDKKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKEN 197

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLP 261
           V+GHCLS  G G L+ GD    +  V W  M      YYSPG+AE+F   +   G     
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFE 256

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            VFDSGS+YT++    Y  + S ++  LS  SL+E  +   LPLCWKG++PF +V+DVK 
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKN 315

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA-EVGLQDLNVI 373
            F+ L+L  T  +     ++ P+ YL +   G  CL IL+ + +  L++LN I
Sbjct: 316 QFKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFI 368


>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 508

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 194/351 (55%), Gaps = 29/351 (8%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPC 103
           V P   Y  ++ IG PARPYFLD+DTGS LTW+QCDAPC  C + PHPLY+P+ + +VP 
Sbjct: 123 VLPERQYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENIVPP 182

Query: 104 EDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
            D  C  L   G+ N C+   QCDYE+ YAD  SS GVL +D       +G+R N  L  
Sbjct: 183 RDSHCQELQ--GNQNYCDTCKQCDYEIAYADRSSSAGVLARDNMELITADGERENMDLVF 240

Query: 163 GCGYNQ------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGG 214
           GC ++Q       P +S    DGILGL  G  S+ +QL  Q +I NV GHC++    G  
Sbjct: 241 GCAHDQQGKLLGSPASS----DGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSA 296

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVVFDSGSS 269
           ++F GDD      + W  + +     YS  V ++ +G +   ++        V+FDSGSS
Sbjct: 297 YMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSS 356

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YTY     Y +L  I   E  +        D+TLP C K   P ++V DVK+  + L L 
Sbjct: 357 YTYFPHEIYTSL--ITSLEAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLH 414

Query: 330 FTDGKTRTL----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           F+  KT  +    FE++PE YLIIS KGNVCLG+L+G E+G     VIG +
Sbjct: 415 FS--KTWLVIPRTFEISPENYLIISGKGNVCLGVLDGTEIGHSSTIVIGDV 463


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 149/362 (41%), Positives = 199/362 (54%), Gaps = 23/362 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQP--ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
           S+ +F V GNVYP G Y   + +G+P   + Y LD+DTGSDLTW+QCDAPC  C +  + 
Sbjct: 182 STTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQ 241

Query: 93  LYRPSND-LVPCEDPICASLHAPG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
           LY+P  D LV   +P C  +       +CE   QCDYE+EYAD   S+GVL KD F    
Sbjct: 242 LYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKL 301

Query: 151 TNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
            NG      +  GCGY+Q  G   + L   DGILGL + K S+ SQL S+ +I NVVGHC
Sbjct: 302 HNGSLAESDIVFGCGYDQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHC 360

Query: 208 LSG--GGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNL---- 260
           L+    G G++F G DL  S  + W  M    + + Y   V ++ +G     L       
Sbjct: 361 LASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRV 420

Query: 261 -PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR--RPFKNVH 317
             V+FD+GSSYTY     Y  L + + +E+S   L     DE LP+CW+ +   P  ++ 
Sbjct: 421 GKVLFDTGSSYTYFPNQAYSQLVTSL-QEVSDLELTRDDSDEALPICWRAKTNSPISSLS 479

Query: 318 DVKKCFRTLALSFTDG---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           DVKK FR + L         ++ L  + PE YLIISNKGNVCLGIL+G+ V      +IG
Sbjct: 480 DVKKFFRPITLQIGSKWLIISKKLL-IQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIG 538

Query: 375 GI 376
            I
Sbjct: 539 DI 540


>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
 gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
 gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
 gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
 gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
          Length = 583

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 150/362 (41%), Positives = 200/362 (55%), Gaps = 23/362 (6%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQP--ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
           S+ +F V GNVYP G Y   + +G+P   + Y LD+DTGS+LTW+QCDAPC  C +  + 
Sbjct: 187 STTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQ 246

Query: 93  LYRPSND-LVPCEDPICASLHAPG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
           LY+P  D LV   +  C  +       +CE+  QCDYE+EYAD   S+GVL KD F    
Sbjct: 247 LYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKL 306

Query: 151 TNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
            NG      +  GCGY+Q  G   + L   DGILGL + K S+ SQL S+ +I NVVGHC
Sbjct: 307 HNGSLAESDIVFGCGYDQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHC 365

Query: 208 LSG--GGGGFLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGLKNL---- 260
           L+    G G++F G DL  S  + W  M  D     Y   V ++ +G     L       
Sbjct: 366 LASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRV 425

Query: 261 -PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR--PFKNVH 317
             V+FD+GSSYTY     Y  L + + +E+S   L     DETLP+CW+ +   PF ++ 
Sbjct: 426 GKVLFDTGSSYTYFPNQAYSQLVTSL-QEVSGLELTRDDSDETLPICWRAKTNFPFSSLS 484

Query: 318 DVKKCFRTLALSFTDG---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           DVKK FR + L         +R L  + PE YLIISNKGNVCLGIL+G+ V      ++G
Sbjct: 485 DVKKFFRPITLQIGSKWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSVHDGSTIILG 543

Query: 375 GI 376
            I
Sbjct: 544 DI 545


>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 395

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 136/343 (39%), Positives = 186/343 (54%), Gaps = 17/343 (4%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-DLVPC 103
           V P   Y  ++ IG P RPYFLD+DTGSD TW+ CDAPC  C + PHP+Y+P+   +V  
Sbjct: 10  VVPERQYYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHP 69

Query: 104 EDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
            DP+C  L   G+ N CE   QCDYE+ YAD  SS GVL +D       +G+  N     
Sbjct: 70  RDPLCEELQ--GNQNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTADGEMKNVDFVF 127

Query: 163 GCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFF 218
           GC +NQ       P   DGILGL  G  S+ +QL +  +I NV GHC++     GG++F 
Sbjct: 128 GCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFL 187

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVVFDSGSSYTYL 273
           GDD      + W  + +     YS  V ++ +G +   L+        V+FDSGSSYTY 
Sbjct: 188 GDDYVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDSGSSYTYF 247

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L +++  E ++        D+TLP C K   P ++V DV++ F  L L     
Sbjct: 248 PHEIYTNLIALL--EDASPGFVRDESDQTLPFCMKPNVPVRSVGDVEQLFNPLILQLRKR 305

Query: 334 --KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
                T F ++PE YLIIS+KGNVCLG+L+G E+G     +IG
Sbjct: 306 WFVIPTTFAISPENYLIISDKGNVCLGVLDGTEIGHSSTIIIG 348


>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 141/353 (39%), Positives = 199/353 (56%), Gaps = 22/353 (6%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPH 91
           ++ F + GNVYP G++  T+ IG+PA+PYFLD+DTGS+LTWL+C  P   C       PH
Sbjct: 23  AINFPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPH 82

Query: 92  PLYRPSND--LVPCEDPICASLH--APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDA 145
           P Y P++    V C  P+C ++    PG   C   DP +C YE++Y  G S  G L  D 
Sbjct: 83  PYYTPADGKLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKSE-GDLATDI 141

Query: 146 FAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR-N 202
            + N     R   R+A GCGY Q   P +   P++GILGLG GK+   +QL   K+I+ N
Sbjct: 142 ISVN----GRDKKRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKEN 197

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLP 261
           V+GHCLS  G G L+ GD    +  V W  M      YYSPG+AE+F   +   G     
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFE 256

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            VFDSGS+YT++    Y  + S ++   S  SL+E  +   LPLCWKG++PF +V+DVK 
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEV-KGRALPLCWKGKKPFGSVNDVKN 315

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA-EVGLQDLNVI 373
            F+ L+L  T  +     ++ P+ YL +   G  CL IL+ + +  L++LN I
Sbjct: 316 QFKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFI 368


>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
 gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
          Length = 603

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 147/407 (36%), Positives = 206/407 (50%), Gaps = 61/407 (14%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           ++S  +SS++++++     SS +F V GN+YP G          P +PY+LD DTGSDLT
Sbjct: 169 KISKLASSNAAAAM----DSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLT 214

Query: 76  WLQCDAPCVRCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
           W+QCDAPC  C +  +  Y+P   ++VP +D +C  +        CE   QCDYE+EYAD
Sbjct: 215 WIQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYAD 274

Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIV 191
             SS+GVL  D       NG         GC Y+Q  +   +    DGILGL + K S+ 
Sbjct: 275 HSSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLP 334

Query: 192 SQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAEL 248
           SQL SQ +I NV+GHCL+   GGGG++F GDD      + W  M  S   ++Y   V +L
Sbjct: 335 SQLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKL 394

Query: 249 FFGGETTGLKNLP-----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
            +G     L  +      ++FDSGSSYTY  +  Y  L + +  E+S   L ++  D TL
Sbjct: 395 NYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASL-NEVSGAGLVQSTSDTTL 453

Query: 304 PLCWKGRRPFKNV--------------------------------HDVKKCFRTLALSFT 331
           PLCW+   P +                                   DVKK F+TL   F 
Sbjct: 454 PLCWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFG 513

Query: 332 DG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
                  T F + PE YL++S+KGNVCLGIL G++V      ++G I
Sbjct: 514 TKWLVISTKFRIPPEGYLMMSDKGNVCLGILEGSKVHDGSTIILGDI 560


>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
 gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
          Length = 410

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 143/347 (41%), Positives = 193/347 (55%), Gaps = 25/347 (7%)

Query: 51  YNVTMYIGQP--ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPI 107
           Y   + +G+P   + Y LD+DTGS+LTW+QCDAPC  C +  + LY+P  D LV   +  
Sbjct: 30  YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 89

Query: 108 CASLHAPG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C  +       +CE+  QCDYE+EYAD   S+GVL KD F     NG      +  GCGY
Sbjct: 90  CVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 149

Query: 167 NQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDD 221
           +Q  G   + L   DGILGL + K S+ SQL S+ +I NVVGHCL+    G G++F G D
Sbjct: 150 DQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSD 208

Query: 222 LYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYL-N 274
           L  S  + W  M  D     Y   V ++ +G     L     +   V+FD+GSSYTY  N
Sbjct: 209 LVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR--PFKNVHDVKKCFRTLALSFTD 332
           +   Q +TS+  +E+S   L     DETLP+CW+ +   PF ++ DVKK FR + L    
Sbjct: 269 QAYSQLVTSL--QEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGS 326

Query: 333 G---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
                +R L  + PE YLIISNKGNVCLGIL+G+ V      ++G I
Sbjct: 327 KWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDI 372


>gi|62954897|gb|AAY23266.1| Similar to nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|77548966|gb|ABA91763.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
           Group]
          Length = 307

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 116/296 (39%), Positives = 162/296 (54%), Gaps = 44/296 (14%)

Query: 100 LVPCEDPICASLHAPGHH---NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           +V  +DP+  +LH  G     N   P QCDYE++YADG S++G L+ D F+      +  
Sbjct: 1   MVRADDPLYVALHEDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSLPRIATR-- 58

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
            P L  GCGYNQ  G ++     +  LG              + ++VVGHCLS GGGG L
Sbjct: 59  -PNLPFGCGYNQGIGENFQQTSPLKMLGI-------------ITKHVVGHCLSSGGGGLL 104

Query: 217 FFGDDLYDSSRV-----------VWTSMSSDYTK-----YYSPGVAELFFGGETTGLKNL 260
           F GD   D + V           +  S  S Y +     YYSPG A L+F   + G+  +
Sbjct: 105 FVGDG--DGNLVLLHASLGSLCPIAISTPSSYNEPMLMNYYSPGSATLYFDRHSLGMNPM 162

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            VVFDSGS+YTY     YQ     +K  LS+ SL++   D +LPLCWKG++ F++V DVK
Sbjct: 163 DVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLEQV-SDPSLPLCWKGQKAFESVFDVK 221

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           K F++L L+F +     + E+ PE YLI++  GNVCLGIL+G  +   + N+IG I
Sbjct: 222 KEFKSLQLNFGN---NAVMEIPPENYLIVTEYGNVCLGILHGCRL---NFNIIGDI 271


>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 406

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 107/278 (38%), Positives = 145/278 (52%), Gaps = 34/278 (12%)

Query: 12  FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
           +P            +S+LF H        + GN++P G Y   + +G P RPYFLD+DTG
Sbjct: 128 YPKPPRRGGDDWPQNSTLFPH-------SLAGNLFPEGLYYTAISLGSPPRPYFLDVDTG 180

Query: 72  SDLTWLQCDA-PCVRCVEAPHPLYRPSN--DLVPCEDPICASLHAPGHHNCEDPAQCDYE 128
           S  TW+QCDA PC  C +  HPLYRP+   D +P  DP+C           E+P QCDYE
Sbjct: 181 SHTTWVQCDAPPCASCAKGAHPLYRPARTADALPASDPLCEGAQH------ENPNQCDYE 234

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKG 186
           + YADG SS+GV V+D+  F   +G+R N  +  GCGY+Q  V   +    DG+LGL   
Sbjct: 235 ISYADGSSSMGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNK 294

Query: 187 KSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTSMSSD------- 236
             S+ +QL S+ +I N  GHC+S    G GG+LF GDD      + W  +          
Sbjct: 295 ALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRR 354

Query: 237 -YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
              K  + G  +L   G+ T      VVFD+GS+YTY 
Sbjct: 355 AQVKQINHGDQQLNAQGKLTQ-----VVFDTGSTYTYF 387


>gi|224097210|ref|XP_002334633.1| predicted protein [Populus trichocarpa]
 gi|222873871|gb|EEF11002.1| predicted protein [Populus trichocarpa]
          Length = 143

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 74/109 (67%), Positives = 89/109 (81%), Gaps = 1/109 (0%)

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           SYTYLN   YQ L S++K+ELS K L+EA +D+TLP+CWKGR+PFK+VHDVKK F+T AL
Sbjct: 1   SYTYLNSQAYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVHDVKKYFKTFAL 60

Query: 329 SFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           SF  DGK++T  E  PEAYLI+S+KGN CLG+LNG EVGL DLNVIG I
Sbjct: 61  SFANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDI 109


>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
 gi|238008766|gb|ACR35418.1| unknown [Zea mays]
          Length = 205

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 70/152 (46%), Positives = 95/152 (62%), Gaps = 10/152 (6%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM  + ++++ ++      S+ L  + GNV+P G Y  +++IG P RPYFLD+DTGSDLT
Sbjct: 61  RMEVAKAATARTN------STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLT 114

Query: 76  WLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
           W+QCDAPC  C + PHPLY+P+ + +VP  D +C  L   G+ N CE   QCDYE+EYAD
Sbjct: 115 WIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYAD 172

Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
             SS+GVL +D      TNG R       GC 
Sbjct: 173 QSSSMGVLARDDMHMIATNGGREKLDFVFGCA 204


>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
 gi|219887685|gb|ACL54217.1| unknown [Zea mays]
          Length = 292

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 92/254 (36%), Positives = 135/254 (53%), Gaps = 22/254 (8%)

Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLH 195
           +GV V+D+  F   +G+R N  +  GCGY+Q  V   +    DG+LGL     S+ +QL 
Sbjct: 1   MGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLA 60

Query: 196 SQKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWT--------SMSSDYTKYYSPG 244
           S+ +I N  GHC+S    G GG+LF GDD      + W          +     K  + G
Sbjct: 61  SRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHG 120

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
             +L   G+ T      VVFD+GS+YTY        L S +K+  S + +++   D+TLP
Sbjct: 121 DQQLNAQGKLT-----QVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQD-DSDKTLP 174

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGK--TRTLFELTPEAYLIISNKGNVCLGILNG 362
            C K   P ++V DVK  F+ L+L F      +RT F + PE YL+IS+KGNVCLG+LNG
Sbjct: 175 FCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRT-FNIRPEHYLVISDKGNVCLGVLNG 233

Query: 363 AEVGLQDLNVIGGI 376
             +G   + ++G +
Sbjct: 234 TTIGYDSVVIVGDV 247


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 102/367 (27%), Positives = 160/367 (43%), Gaps = 43/367 (11%)

Query: 9   NLCFPTVRMSSSSS--SSSSSSLFNHVGSSLLFQVHGNVYPT--GYYNVTMYIGQPARPY 64
           NL FP  R  +S +   +  SS    + S++ F + GN  PT  G Y   + +G P++ Y
Sbjct: 23  NLVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPTVTGLYFTKIGLGSPSKDY 82

Query: 65  FLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPICASLHAPG 115
           ++ +DTGSD+ W+ C   C RC           LY P    +++ V CE   C+S +   
Sbjct: 83  YVQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGR 141

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ----RLNPRLALGCGYNQ--- 168
              C+    C Y + Y DG ++ G  V+D   FN  NG       N  +  GCG  Q   
Sbjct: 142 ILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGT 201

Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRV 228
              +S   LDGI+G G+  SS++SQL +   ++ +  HCL    GG +F   ++ +    
Sbjct: 202 FASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK-- 259

Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQT 280
           V T+       +Y+  +  +   G+   L             V DSG++  YL R+ Y  
Sbjct: 260 VKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQ 319

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
           L S        K L + P  +   L  +    F+   +V   F  + L F D  + T++ 
Sbjct: 320 LMS--------KVLAKQPRLKVY-LVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVY- 369

Query: 341 LTPEAYL 347
             P  YL
Sbjct: 370 --PHDYL 374


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 161/353 (45%), Gaps = 36/353 (10%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            ++H ++   GYY   ++IG P + + L +DTGS +T++ C   C +C     P ++P  
Sbjct: 69  MRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQP-- 125

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           DL     P+  +L      NC+ D  QC YE +YA+  +S GVL +D  +F   N   L 
Sbjct: 126 DLSSTYQPVKCTLDC----NCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFG--NQSELA 179

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC   +         DGI+GLG+G  SI+ QL  + ++ +    C  G   GGG
Sbjct: 180 PQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGG 239

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------D 265
             +  G  +   S +V+       + YY+  + E+   G+   L   P VF        D
Sbjct: 240 AMVLGG--ISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLN--PSVFDGKHGSVLD 295

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++Y YL    +      + KEL + S    P+     LC+ G     +V  + K F  
Sbjct: 296 SGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAG--IDVSQLSKTFPV 353

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           + + F +G     + L+PE Y+   +K  G  CLGI      G     ++GGI
Sbjct: 354 VDMIFGNGHK---YSLSPENYMFRHSKVRGAYCLGIFQN---GKDPTTLLGGI 400


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 166/361 (45%), Gaps = 36/361 (9%)

Query: 31  NHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           +H  ++    ++ ++ P GYY   ++IG P + + L +DTGS LT++ C   C +C +  
Sbjct: 72  SHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST-CEQCGKHQ 130

Query: 91  HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
            P ++P  D      P+  S+       C+ +   C Y+ +YA+  SS GVL +D  +F 
Sbjct: 131 DPNFQP--DWSSTYQPLKCSMEC----TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFG 184

Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                 L P R   GC   +         DGI+GLG+G  SIV QL  + +I N    C 
Sbjct: 185 --KQSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY 242

Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF- 264
            G   GGG  +  G  +   + +V+T      + YY+  + E+   G+   +   P+VF 
Sbjct: 243 GGMDVGGGAMVLGG--ISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPIN--PMVFD 298

Query: 265 -------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                  DSG++Y YL    ++     + KEL++  L + P+     +C+ G     +V 
Sbjct: 299 GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG--SDVS 356

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGG 375
            + K F  + L F++G       L+PE YL   +K  G  CLGI            ++GG
Sbjct: 357 QLSKTFPAVDLVFSNGNR---LSLSPENYLFQHSKAHGAYCLGIFQNEN---DQTTLLGG 410

Query: 376 I 376
           I
Sbjct: 411 I 411


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 166/361 (45%), Gaps = 36/361 (9%)

Query: 31  NHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           +H  ++    ++ ++ P GYY   ++IG P + + L +DTGS LT++ C   C +C +  
Sbjct: 72  SHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST-CEQCGKHQ 130

Query: 91  HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
            P ++P  D      P+  S+       C+ +   C Y+ +YA+  SS GVL +D  +F 
Sbjct: 131 DPNFQP--DWSSTYQPLKCSMEC----TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFG 184

Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                 L P R   GC   +         DGI+GLG+G  SIV QL  + +I N    C 
Sbjct: 185 --KQSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY 242

Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF- 264
            G   GGG  +  G  +   + +V+T      + YY+  + E+   G+   +   P+VF 
Sbjct: 243 GGMDVGGGAMVLGG--ISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPIN--PMVFD 298

Query: 265 -------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                  DSG++Y YL    ++     + KEL++  L + P+     +C+ G     +V 
Sbjct: 299 GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG--SDVS 356

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGG 375
            + K F  + L F++G       L+PE YL   +K  G  CLGI            ++GG
Sbjct: 357 QLSKTFPAVDLVFSNGNR---LSLSPENYLFQHSKAHGAYCLGIFQNEN---DQTTLLGG 410

Query: 376 I 376
           I
Sbjct: 411 I 411


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/391 (27%), Positives = 169/391 (43%), Gaps = 54/391 (13%)

Query: 1   MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNH--VGSSLLFQVHGNVYPT--GYYNVTMY 56
           + S  NG NL FP  R   S S+  +  +     + S++   + GN  PT  G Y   + 
Sbjct: 17  IGSVANG-NLVFPVERRKRSLSAVRAHDVRRRGRILSAVDLNLGGNGLPTETGLYFTKLG 75

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPI 107
           +G P R Y++ +DTGSD+ W+ C   C RC           LY P    ++D+V C+   
Sbjct: 76  LGSPPRDYYVQVDTGSDILWVNC-VECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDF 134

Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNPR---LALG 163
           C++        C+    C Y + Y DG ++ G  V+D   +N  NG  R +P+   +  G
Sbjct: 135 CSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFG 194

Query: 164 CGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
           CG  Q   +  +S   LDGI+G G+  SS++SQL +   ++ +  HCL    GG +F   
Sbjct: 195 CGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGGIFAIG 254

Query: 221 DLYDSS-------------RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
           ++ +                VV  S+  D      P  +++F      G      V DSG
Sbjct: 255 EVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLP--SDIFDSVNGKG-----TVIDSG 307

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++  YL  + Y         EL  K L   P  + L L  +  R F    +V + F  + 
Sbjct: 308 TTLAYLPDIVYD--------ELIQKVLARQPGLK-LYLVEQQFRCFLYTGNVDRGFPVVK 358

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLG 358
           L F D  + T++   P  YL     G  C+G
Sbjct: 359 LHFKDSLSLTVY---PHDYLFQFKDGIWCIG 386


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 101/353 (28%), Positives = 161/353 (45%), Gaps = 36/353 (10%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C  C     P ++P  
Sbjct: 77  MRLYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CEHCGRHQDPKFQP-- 133

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           DL     P+  +       NC+ D  QC Y+ +YA+  SS GVL +D  +F   N   L 
Sbjct: 134 DLSETYQPVKCTPDC----NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFG--NLSELA 187

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC  ++         DGI+GLG+G  SI+ QL  +K+I +    C  G   GGG
Sbjct: 188 PQRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGG 247

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------D 265
             +  G  +     +V+T    D + YY+  + E+   G+   L   P VF        D
Sbjct: 248 AMILGG--ISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLN--PKVFDGKHGTVLD 303

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++Y YL    +      + KE ++      P+     +C+ G     +V  + K F  
Sbjct: 304 SGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAG--IDVSQLAKSFPV 361

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           + + F +G       L+PE YL   +K  G  CLG+ +    G     ++GGI
Sbjct: 362 VDMVFENGHK---LSLSPENYLFRHSKVRGAYCLGVFSN---GRDPTTLLGGI 408


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 160/356 (44%), Gaps = 42/356 (11%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            ++H ++   GYY   ++IG P + + L +DTGS +T++ C   C +C     P ++P +
Sbjct: 100 MRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPES 158

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                  P+  ++      NC+ D  QC YE +YA+  +S GVL +D  +F   N   L 
Sbjct: 159 S--STYQPVKCTIDC----NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFG--NQSELA 210

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC   +         DGI+GLG+G  SI+ QL  +K+I +    C  G   GGG
Sbjct: 211 PQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGG 270

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV----------- 262
             +  G  +   S + +     D + YY+  + E+   G     K LP+           
Sbjct: 271 AMVLGG--ISPPSDMTFAYSDPDRSPYYNIDLKEMHVAG-----KRLPLNANVFDGKHGT 323

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           V DSG++Y YL    +      + KEL +      P+     +C+ G     +V  + K 
Sbjct: 324 VLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAG--NDVSQLSKS 381

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           F  + + F +G     + L+PE Y+   +K  G  CLGI      G     ++GGI
Sbjct: 382 FPVVDMVFGNGHK---YSLSPENYMFRHSKVRGAYCLGIFQN---GNDQTTLLGGI 431


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 160/346 (46%), Gaps = 42/346 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVPCE-D 105
           GYY   ++IG P + + L +DTGS +T++ C + C +C +   P ++P  S+   P + +
Sbjct: 75  GYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS-CEQCGKHQDPRFQPDLSSTYRPVKCN 133

Query: 106 PICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALG 163
           P C         NC+D   QC YE  YA+  SS GV+ +D  +F   N   L P R   G
Sbjct: 134 PSC---------NCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFG--NESELKPQRAVFG 182

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDD 221
           C   +         DGI+GLG+G+ S+V QL  + +I +    C  G   GGG +  G  
Sbjct: 183 CENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLG-Q 241

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYL 273
           +     +V++  +   + YY+  + EL   G+   LK  P VF        DSG++Y Y 
Sbjct: 242 ISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLK--PKVFDEKHGTVLDSGTTYAYF 299

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               +  L   + KE+        P+     +C+ G    + V  + K F  + + F  G
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAG--REVSHLSKVFPEVNMVFGSG 357

Query: 334 KTRTLFELTPEAYLIISNK--GNVCLGIL-NGAEVGLQDLNVIGGI 376
           +      L+PE YL    K  G  CLGI  NG ++      ++GGI
Sbjct: 358 QK---LSLSPENYLFRHTKVSGAYCLGIFQNGNDL----TTLLGGI 396


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 91/338 (26%), Positives = 157/338 (46%), Gaps = 27/338 (7%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            ++H ++   GYY   +YIG P + + L +D+GS +T++ C A C +C     P ++P  
Sbjct: 77  MRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP-- 133

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           DL     P+  ++       C+ D  QC YE +YA+  SS GVL +D  +F   +  +  
Sbjct: 134 DLSSSYSPVKCNVDC----TCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKAQ 189

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGG 214
            R   GC  ++         DGI+GLG+G+ SI+ QL  + +I +    C  G   GGG 
Sbjct: 190 -RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGA 248

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL------PVVFDSGS 268
            +  G  +   S +V++      + YY+  + E+   G+   + +         V DSG+
Sbjct: 249 MVLGG--VPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGT 306

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           +Y YL    +      +  ++ +      P+     +C+ G R  +NV  + + F  + +
Sbjct: 307 TYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGAR--RNVSKLHEVFPDVDM 364

Query: 329 SFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAE 364
            F +G+      LTPE YL   +K  G  CLG+    +
Sbjct: 365 VFGNGQK---LSLTPENYLFRHSKVDGAYCLGVFQNGK 399


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 159/354 (44%), Gaps = 30/354 (8%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+   ++H ++   GYY   ++IG P + + L +DTGS +T++ C + CV+C     P +
Sbjct: 73  SNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRF 131

Query: 95  RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P  +L     P+  +       NC E+  QC YE  YA+  +S GVL +D  +F     
Sbjct: 132 QP--ELSSTYQPVKCNADC----NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFG-KES 184

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--- 210
           + +  R   GC   +         DGI+GLG+G  S++ QL  + ++ N    C  G   
Sbjct: 185 ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLPVVF 264
           GGG  +  G  +     +V++      + YY+  + E+   G+   L           + 
Sbjct: 245 GGGAMVLGG--ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAIL 302

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG++Y Y     Y      + K++S       P+     +C+ G    ++V ++ K F 
Sbjct: 303 DSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAG--RDVTELPKVFP 360

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            + + F +G+      L+PE YL    K  G  CLGI      G     ++GGI
Sbjct: 361 EVDMVFANGQK---ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGI 408


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 97/340 (28%), Positives = 162/340 (47%), Gaps = 39/340 (11%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C +C +   P ++P  
Sbjct: 64  MKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPEL 122

Query: 97  --SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
             S   + C +P C         NC+D  + C YE  YA+  SS GVL +D  +F   N 
Sbjct: 123 STSYQALKC-NPDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFG--NE 170

Query: 154 QRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG- 211
            +L+P R   GC   +         DGI+GLG+GK S+V QL  + +I +V   C  G  
Sbjct: 171 SQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230

Query: 212 -GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
            GGG +  G  +     +V++      + YY+  + ++   G++  LK  P VF      
Sbjct: 231 VGGGAMVLG-KISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKS--LKLNPKVFNGKHGT 287

Query: 265 --DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
             DSG++Y Y  +  +  +   + KE+ +      P+     +C+ G    ++V ++   
Sbjct: 288 VLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAG--RDVAEIHNF 345

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGIL 360
           F  +A+ F +G+      L+PE YL    K  G  CLGI 
Sbjct: 346 FPEIAMEFGNGQK---LILSPENYLFRHTKVRGAYCLGIF 382


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 166/351 (47%), Gaps = 31/351 (8%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +D+GS +T++ C + C +C +   P ++P  
Sbjct: 81  MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQP-- 137

Query: 99  DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           ++     P+  ++      NC+D   QC YE EYA+  SS GVL +D  +F   N  +L 
Sbjct: 138 EMSSTYQPVKCNMDC----NCDDDREQCVYEREYAEHSSSKGVLGEDLISFG--NESQLT 191

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC   +         DGI+GLG+G  S+V QL  + LI N  G C  G   GGG
Sbjct: 192 PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGG 251

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------VVFDSG 267
             +  G D    S +V+T    D + YY+  +  +   G+   L +         V DSG
Sbjct: 252 SMILGGFDY--PSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSG 309

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++Y YL    +      + +E+S     + P+      C++       V ++ K F ++ 
Sbjct: 310 TTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAAS-NYVSELSKIFPSVE 368

Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           + F  G++   + L+PE Y+   +K  G  CLG+      G     ++GGI
Sbjct: 369 MVFKSGQS---WLLSPENYMFRHSKVHGAYCLGVFPN---GKDHTTLLGGI 413


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 97/340 (28%), Positives = 162/340 (47%), Gaps = 39/340 (11%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C +C +   P ++P  
Sbjct: 64  MKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPEL 122

Query: 97  --SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
             S   + C +P C         NC+D  + C YE  YA+  SS GVL +D  +F   N 
Sbjct: 123 STSYQALKC-NPDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFG--NE 170

Query: 154 QRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG- 211
            +L+P R   GC   +         DGI+GLG+GK S+V QL  + +I +V   C  G  
Sbjct: 171 SQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230

Query: 212 -GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
            GGG +  G  +     +V++      + YY+  + ++   G++  LK  P VF      
Sbjct: 231 VGGGAMVLG-KISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKS--LKLNPKVFNGKHGT 287

Query: 265 --DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
             DSG++Y Y  +  +  +   + KE+ +      P+     +C+ G    ++V ++   
Sbjct: 288 VLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAG--RDVAEIHNF 345

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGIL 360
           F  +A+ F +G+      L+PE YL    K  G  CLGI 
Sbjct: 346 FPEIAMEFGNGQK---LILSPENYLFRHTKVRGAYCLGIF 382


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 159/354 (44%), Gaps = 30/354 (8%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+   ++H ++   GYY   ++IG P + + L +DTGS +T++ C + CV+C     P +
Sbjct: 73  SNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRF 131

Query: 95  RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +P  +L     P+  +       NC E+  QC YE  YA+  +S GVL +D  +F     
Sbjct: 132 QP--ELSSTYQPVKCNADC----NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFG-KES 184

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--- 210
           + +  R   GC   +         DGI+GLG+G  S++ QL  + ++ N    C  G   
Sbjct: 185 ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLPVVF 264
           GGG  +  G  +     +V++      + YY+  + E+   G+   L           + 
Sbjct: 245 GGGAMVLGG--ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAIL 302

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG++Y Y     Y      + K++S       P+     +C+ G    ++V ++ K F 
Sbjct: 303 DSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAG--RDVTELPKVFP 360

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            + + F +G+      L+PE YL    K  G  CLGI      G     ++GGI
Sbjct: 361 EVDMVFANGQK---ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGI 408


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 166/351 (47%), Gaps = 31/351 (8%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +D+GS +T++ C + C +C +   P ++P  
Sbjct: 82  MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQP-- 138

Query: 99  DLVPCEDPICASLHAPGHHNCED-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           +L     P+  ++      NC+D   QC YE EYA+  SS GVL +D  +F   N  +L 
Sbjct: 139 ELSSTYQPVKCNMDC----NCDDDKEQCVYEREYAEHSSSKGVLGEDLISFG--NESQLT 192

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC   +         DGI+GLG+G  S+V QL  + LI N  G C  G   GGG
Sbjct: 193 PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGG 252

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------VVFDSG 267
             +  G D    S +++T    D + YY+  +  +   G+   L +         V DSG
Sbjct: 253 SMILGGFDY--PSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSG 310

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++Y YL    +      + +E+S     + P+      C+       +V ++ K F ++ 
Sbjct: 311 TTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAAS-NDVSELSKIFPSVE 369

Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           + F  G++   + L+PE Y+   +K  G  CLG+      G     ++GGI
Sbjct: 370 MIFKSGQS---WLLSPENYMFRHSKVHGAYCLGVFPN---GKDHTTLLGGI 414


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/386 (26%), Positives = 175/386 (45%), Gaps = 35/386 (9%)

Query: 7   GENLCFPTVRM---SSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARP 63
           G  L  P  R    +S  ++SS   L +    +   ++H ++   GYY   +YIG P + 
Sbjct: 42  GPPLFLPLTRSYPNASRLAASSRRGLGDGAHPNARMRLHDDLLTNGYYTTRLYIGTPPQE 101

Query: 64  YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE-DP 122
           + L +D+GS +T++ C A C +C     P ++P  DL     P+  ++       C+ D 
Sbjct: 102 FALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSSYSPVKCNVDC----TCDSDK 154

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGIL 181
            QC YE +YA+  SS GVL +D  +F   +   L P R   GC  ++         DGI+
Sbjct: 155 KQCTYERQYAEMSSSSGVLGEDIVSFGRES--ELKPQRAVFGCENSETGDLFSQHADGIM 212

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSDYT 238
           GLG+G+ SI+ QL  + +I +    C  G   GGG  +  G  +   S +V++      +
Sbjct: 213 GLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGG--VPAPSDMVFSHSDPLRS 270

Query: 239 KYYSPGVAELFFGGETTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
            YY+  + E+   G+   + +         V DSG++Y YL    +      +  ++ + 
Sbjct: 271 PYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSL 330

Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
                P+     +C+ G    +NV  + + F  + + F +G+      LTPE YL   +K
Sbjct: 331 KKIRGPDPNYKDICFAGAG--RNVSKLHEVFPDVDMVFGNGQK---LSLTPENYLFRHSK 385

Query: 353 --GNVCLGILNGAEVGLQDLNVIGGI 376
             G  CLG+      G     ++GGI
Sbjct: 386 VDGAYCLGVFQN---GKDPTTLLGGI 408


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 169/372 (45%), Gaps = 32/372 (8%)

Query: 18  SSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWL 77
           ++S  +SS   L +    S   ++H ++   GYY   +YIG P + + L +D+GS +T++
Sbjct: 52  NASRLASSRRVLGDGGRPSARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYV 111

Query: 78  QCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGS 136
            C A C +C     P ++P  DL     P+  S        C+ D +QC YE +YA+  S
Sbjct: 112 PC-ASCEQCGNHQDPRFQP--DLSSTYSPVKCSADC----TCDSDKSQCTYERQYAEMSS 164

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           S GVL +D  +F  T  +    R   GC  ++         DGI+GLG+G+ SI+ QL  
Sbjct: 165 SSGVLGEDIVSFG-TESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVD 223

Query: 197 QKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET 254
           + +I +    C  G   GGG +  G  +     +V++      + YY+  + E+   G+ 
Sbjct: 224 KGVIGDSFSMCYGGMDIGGGAMVLG-AMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKA 282

Query: 255 TGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
             L   P +F        DSG++Y YL    +      +  ++        P+     +C
Sbjct: 283 LRLD--PRIFDSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDIC 340

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAE 364
           + G    +NV  + + F  + + F DG+      L+PE YL   +K  G  CLG+     
Sbjct: 341 FAGAG--RNVSQLSQAFPDVDMVFGDGQK---LSLSPENYLFRHSKVEGAYCLGVFQN-- 393

Query: 365 VGLQDLNVIGGI 376
            G     ++GGI
Sbjct: 394 -GKDPTTLLGGI 404


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 99/340 (29%), Positives = 150/340 (44%), Gaps = 50/340 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y ++M IG P R Y   LDTGSDL W QC APC+ CV+ P P + P+       +PC 
Sbjct: 87  GEYLMSMGIGTPPRYYSAILDTGSDLIWTQC-APCMLCVDQPTPFFDPAQSPSYAKLPCN 145

Query: 105 DPICASLHAP-GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            P+C +L+ P  + N      C Y+  Y D  ++ GVL  + F F   + +   PR+A G
Sbjct: 146 SPMCNALYYPLCYRNV-----CVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFG 200

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGD 220
           CG   +   S     G++G G+G  S+VSQL S +       +CL+         L+FG 
Sbjct: 201 CG--NLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRF-----SYCLTSFMSPVPSRLYFGA 253

Query: 221 DLYDSSRVVWTSMSSDYTKYY-SPGVAELFF---GGETTGLKNLP--------------- 261
               +S    T      T +  +PG+  +++    G + G + LP               
Sbjct: 254 YATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTG 313

Query: 262 -VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            V+ DSGS+ TYL R  Y  +      ++           + L  C+    P + +  + 
Sbjct: 314 GVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMP 373

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGI 359
           +    LA  F         EL  E Y++I  + GN+CL I
Sbjct: 374 E----LAFHFEGAN----MELPLENYMLIDGDTGNLCLAI 405


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/386 (26%), Positives = 179/386 (46%), Gaps = 35/386 (9%)

Query: 7   GENLCFPTVRMSSSSSSSSSS---SLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARP 63
           G  L  P  R   ++S  ++S    L + V  +   ++H ++   GYY   +YIG P + 
Sbjct: 41  GPPLFLPLTRSYPNASRLAASLRRGLGDGVHPNARMRLHDDLLTNGYYTTRLYIGTPPQE 100

Query: 64  YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE-DP 122
           + L +D+GS +T++ C + C +C     P ++P  DL     P+  ++       C+ D 
Sbjct: 101 FALIVDSGSTVTYVPCSS-CEQCGNHQDPRFQP--DLSSSYSPVKCNVDC----TCDSDK 153

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL-GCGYNQVPGASYHPLDGIL 181
            QC YE +YA+  SS GVL +D  +F   +   L P+ A+ GC  ++         DGI+
Sbjct: 154 KQCTYERQYAEMSSSSGVLGEDIVSFGRES--ELKPQHAIFGCENSETGDLFSQHADGIM 211

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSDYT 238
           GLG+G+ SI+ QL  + +I +    C  G   GGG  +  G  +     +++++     +
Sbjct: 212 GLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGG--MLAPPDMIFSNSDPLRS 269

Query: 239 KYYSPGVAELFFGGETTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
            YY+  + E+   G+   +++         V DSG++Y YL    +      +  ++ + 
Sbjct: 270 PYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSL 329

Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
                P+     +C+ G    +NV  + + F  + + F +G+      LTPE YL   +K
Sbjct: 330 KKIRGPDPSYKDICFAGAG--RNVSKLHEVFPDVDMVFGNGQK---LSLTPENYLFRHSK 384

Query: 353 --GNVCLGILNGAEVGLQDLNVIGGI 376
             G  CLG+      G     ++GGI
Sbjct: 385 VDGAYCLGVFQN---GKDPTTLLGGI 407


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 177/398 (44%), Gaps = 48/398 (12%)

Query: 4   SHNGENLCFP----TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVY----PTGYYNVTM 55
           +HN   +  P    T  +SS     +S+     + +S L   H  +Y      GYY   +
Sbjct: 33  NHNHRPMIIPLHLSTSNISSHRKPFTSNYHRRQLHNSDLPNAHMRLYDDLLSNGYYTTRL 92

Query: 56  YIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVPCE-DPICASLH 112
           +IG P + + L +DTGS +T++ C   C +C +   P ++P  S+   P + +P C    
Sbjct: 93  FIGTPPQEFALIVDTGSTVTYVPCST-CEQCGKHQDPRFQPESSSTYKPMQCNPSC---- 147

Query: 113 APGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL-GCGYNQVP 170
                NC+D   QC YE  YA+  SS G+L +D  +F   N   L P+ A+ GC   +  
Sbjct: 148 -----NCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFG--NESELTPQRAIFGCETVETG 200

Query: 171 GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRV 228
                  DGI+GLG+G  S+V QL  ++++ N    C  G    GG +  G ++     +
Sbjct: 201 ELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLG-NIPPPPDM 259

Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQT 280
           V+       + YY+  + EL   G+   LK  P VF        DSG++Y YL    +  
Sbjct: 260 VFAHSDPYRSAYYNIELKELHVAGKR--LKLNPRVFDGKHGTVLDSGTTYAYLPEEAFVA 317

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
               + KE+        P+     +C+ G    ++V  + K F  + + F +G+      
Sbjct: 318 FKDAIIKEIKFLKQIHGPDPSYNDICFSGAG--RDVSQLSKIFPEVNMVFGNGQK---LS 372

Query: 341 LTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           L+PE YL    K  G  CLGI      G     ++GGI
Sbjct: 373 LSPENYLFRHTKVSGAYCLGIFQN---GKDPTTLLGGI 407


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 164/383 (42%), Gaps = 53/383 (13%)

Query: 9   NLCFPTVRMSSS--SSSSSSSSLFNHVGSSLLFQVHGNVYPT--GYYNVTMYIGQPARPY 64
           N  FP  R   S  +  +  +     + S++   + GN  PT  G Y   + +G P + Y
Sbjct: 24  NFVFPVERRKRSLNAVKAHDARRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPKDY 83

Query: 65  FLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPICASLHAPG 115
           ++ +DTGSD+ W+ C   C RC           LY P    +++L+ C+   C++ +   
Sbjct: 84  YVQVDTGSDILWVNC-VKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGP 142

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNPR---LALGCGYNQ--- 168
              C+    C Y + Y DG ++ G  V+D   +N+ N   R  P+   +  GCG  Q   
Sbjct: 143 IPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGT 202

Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSS-- 226
           +  +S   LDGI+G G+  SS++SQL +   ++ +  HCL    GG +F   ++ +    
Sbjct: 203 LSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGGIFAIGEVVEPKVS 262

Query: 227 -----------RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
                       VV  S+  D      P  +++F  G   G      + DSG++  YL  
Sbjct: 263 TTPLVPRMAHYNVVLKSIEVDTDILQLP--SDIFDSGNGKG-----TIIDSGTTLAYLPA 315

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
           + Y         EL  K +   P  + L L  +    F+   +V + F  + L F D  +
Sbjct: 316 IVYD--------ELIPKVMARQPRLK-LYLVEQQFSCFQYTGNVDRGFPVVKLHFEDSLS 366

Query: 336 RTLFELTPEAYLIISNKGNVCLG 358
            T++   P  YL     G  C+G
Sbjct: 367 LTVY---PHDYLFQFKDGIWCIG 386


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/355 (27%), Positives = 156/355 (43%), Gaps = 54/355 (15%)

Query: 33  VGSSLLFQVHGNVYPT----GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
           VG  + F+V G+  P+    G Y   + +G P R + + +DTGSD+ W+ C+  C  C +
Sbjct: 62  VGGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNT-CSNCPK 120

Query: 89  AP---------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSL 138
           +            +   +  LVPC DP+CAS        C     QC Y  +Y DG  + 
Sbjct: 121 SSGLGIELNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTS 180

Query: 139 GVLVKDAFAFNYTNGQRLNPRLA------LGCGYNQVPGASY--HPLDGILGLGKGKSSI 190
           GV V DA  F+   GQ     +A       GC   Q    +     +DGILG G G+ S+
Sbjct: 181 GVYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSV 240

Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           VSQL S+ +   V  HCL   G GGG L  G+ L  S  +V++ +      +Y+  +  +
Sbjct: 241 VSQLSSRGITPKVFSHCLKGDGNGGGILVLGEILEPS--IVYSPLVPS-QPHYNLNLQSI 297

Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
              G+   +   P VF          DSG++ +YL +  Y  L + +   +S  +     
Sbjct: 298 AVNGQVLSIN--PAVFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATS--- 352

Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKG 353
                    KG + +  +  +   F T++ +F  G +    +L P  YL+  N+G
Sbjct: 353 ------FISKGSQCYLVLTSIDDSFPTVSFNFEGGAS---MDLKPSQYLL--NRG 396


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 96/338 (28%), Positives = 140/338 (41%), Gaps = 48/338 (14%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEA 89
            Q   + Y  G Y   + +G P RP+++ +DTGSD+ W+ C  PC  C         +  
Sbjct: 29  LQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDILWVNC-KPCNACPLTSGLGVALNF 87

Query: 90  PHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
             P    +   + C D  C S +      C     C Y  EY DG  +LG  V D F +N
Sbjct: 88  FDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYN 147

Query: 150 -YTNGQRLN---PRLALGCGYNQVPGASYHP---LDGILGLGKGKSSIVSQLHSQKLIRN 202
            Y N    N    ++  GC YNQ  G    P   +DGI G G+   S+VSQL+SQ L   
Sbjct: 148 QYVNQYVTNNASAKITFGCSYNQ-SGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPK 206

Query: 203 VVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
           +  HCL G   GGG L  G+       +V+T +      +Y+  +  +   G+   +   
Sbjct: 207 IFSHCLEGADPGGGILVLGE--ITEPGMVYTPIVPS-QPHYNLNLQGIAVNGQQLSID-- 261

Query: 261 PVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           P VF          D G++  YL    Y+   + +   +S           T P   KG 
Sbjct: 262 PQVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVS---------QSTQPFMLKGN 312

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
             F  VH + + F ++ L F         +L P+ YLI
Sbjct: 313 PCFLTVHSIDEIFPSVTLYFEGAP----MDLKPKDYLI 346


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 162/357 (45%), Gaps = 39/357 (10%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
           S    +H ++   GYY   + IG P   + L +DTGS +T++ C + C  C     P + 
Sbjct: 20  SARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSS-CTHCGNHQDPRFS 78

Query: 96  P--SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN- 152
           P  S+   P E   C S  + G   C+   +  Y+ +YA+  +S GVL KD   F+ ++ 
Sbjct: 79  PALSSSYKPLE---CGSECSTGF--CDGSRK--YQRQYAEKSTSSGVLGKDVIGFSNSSD 131

Query: 153 --GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
             GQRL      GC   +         DGI+GLG+G  SI+ QL  +  + +V   C  G
Sbjct: 132 LGGQRL----VFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGG 187

Query: 211 ---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP 261
              GGG  +  G        +V+T+     + YY+  +  +  GG    LK         
Sbjct: 188 MDEGGGAMILGG--FQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYG 245

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            V DSG++Y Y     +Q   S +K+++ +      P+++   +C+ G     NV ++ +
Sbjct: 246 TVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAG--TNVSNLSQ 303

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            F ++   F DG++ T   L+PE YL    K  G  CLG+    +       ++GGI
Sbjct: 304 FFPSVDFVFGDGQSVT---LSPENYLFRHTKISGAYCLGVFENGD----PTTLLGGI 353


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 96/340 (28%), Positives = 161/340 (47%), Gaps = 39/340 (11%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C +C +   P ++P  
Sbjct: 68  MKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPEL 126

Query: 97  --SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
             S   + C +P C         NC+D  + C YE  YA+  SS GVL +D  +F   N 
Sbjct: 127 SSSYKALKC-NPDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFG--NE 174

Query: 154 QRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG- 211
            +L P R   GC   +         DGI+GLG+GK S+V QL  + +I +V   C  G  
Sbjct: 175 SQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 234

Query: 212 -GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
            GGG +  G  +   + +V++      + YY+  + ++   G++  LK  P VF      
Sbjct: 235 VGGGAMVLG-KISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKS--LKLNPKVFNGKHGT 291

Query: 265 --DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
             DSG++Y Y  +  +  +   + KE+ +      P+     +C+ G    ++V ++   
Sbjct: 292 VLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAG--RDVAEIHNF 349

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGIL 360
           F  + + F +G+      L+PE YL    K  G  CLGI 
Sbjct: 350 FPEIDMEFGNGQK---LILSPENYLFRHTKVRGAYCLGIF 386


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 95/354 (26%), Positives = 164/354 (46%), Gaps = 32/354 (9%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
           S   ++H ++   GYY   ++IG P + + L +D+GS +T++ C A C +C     P ++
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQ 131

Query: 96  PSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
           P  DL     P+  ++       C+ D  QC YE +YA+  SS GVL +D  +F  T  +
Sbjct: 132 P--DLSSTYSPVKCNVDC----TCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFG-TESE 184

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
               R   GC  ++         DGI+GLG+G+ SI+ QL  + +I +    C  G   G
Sbjct: 185 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 244

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------- 264
           GG +  G  +     +++T  ++  + YY+  + E+   G+   L+  P +F        
Sbjct: 245 GGAMVLG-AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKA--LRVDPRIFDGKHGTVL 301

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG++Y YL    +      +  ++        P+     +C+ G    +NV  + + F 
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAG--RNVSQLSEVFP 359

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            + + F +G+      L+PE YL   +K  G  CLG+      G     ++GGI
Sbjct: 360 KVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GKDPTTLLGGI 407


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 111/347 (31%), Positives = 145/347 (41%), Gaps = 49/347 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RP 96
           Y  G Y   + +G P R Y L +DTGSDL W+ C  PC+ C     ++ P   Y      
Sbjct: 31  YIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASA 89

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           S+  VPC DP C  +       C D  QC Y  +Y DG  +LG LV+D   +        
Sbjct: 90  SSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNA---- 145

Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
              +  GCG+ Q      S   LDGI+G G    S  SQL  Q    NV  HCL GG  G
Sbjct: 146 TATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205

Query: 213 GGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-VVFDSG 267
           GG L  G+    D+  +  V + S  +   +  S   A L    +      +   +FDSG
Sbjct: 206 GGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSG 265

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++  YL    YQ  T        A SL  AP      LC      F     + K F  + 
Sbjct: 266 TTLAYLPDEAYQAFT-------QAVSLVVAP----FLLCDTRLSRF-----IYKLFPNVV 309

Query: 328 LSFTDGKTRTLFELTPEAYLI----ISNKGNVCLGI--LNGAEVGLQ 368
           L F +G + T   LTP  YLI     +N    C+G   +  AE  LQ
Sbjct: 310 LYF-EGASMT---LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQ 352


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 95/330 (28%), Positives = 142/330 (43%), Gaps = 33/330 (10%)

Query: 43  GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYR 95
           GN +P   G Y   + +G P + Y++ +DTGSD+ W+ C A C +C     +     LY 
Sbjct: 72  GNGHPAEAGLYFAKIGLGNPPKDYYVQVDTGSDILWVNC-ANCDKCPTKSDLGVKLTLYD 130

Query: 96  P----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           P    S   + C+D  CA+ +      C     C Y + Y DG S+ G  VKD   F+  
Sbjct: 131 PQSSTSATRIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRV 190

Query: 152 NGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
            G       N  +  GCG  Q    G S   LDGILG G+  SS++SQL +   ++ V  
Sbjct: 191 TGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFA 250

Query: 206 HCLSG-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL------- 257
           HCL    GGG    G+ +  S +V  T M  +   +Y+  + E+  GG    L       
Sbjct: 251 HCLDNVKGGGIFAIGEVV--SPKVNTTPMVPN-QPHYNVVMKEIEVGGNVLELPTDIFDT 307

Query: 258 -KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
                 + DSG++  YL  V Y+++ + +  E     L    E  T   C++        
Sbjct: 308 GDRRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQFT---CFQYTGNVNEG 364

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
             V K     +LS T      LF++  E +
Sbjct: 365 FPVVKFHFNGSLSLTVNPHDYLFQIHEEVW 394


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 153/360 (42%), Gaps = 48/360 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + +G P + Y++ +DTGSD+ W+ C    + C + PH         LY P   
Sbjct: 83  TGLYYTEIKLGTPPKHYYVQVDTGSDILWVNC----ITCEQCPHKSGLGLDLTLYDPKAS 138

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT---- 151
            +  +V C+   CA+        C     C+Y + Y DG S++G  V DA  F+      
Sbjct: 139 STGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDG 198

Query: 152 NGQRLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
             Q  N  +  GCG  Q    G+S   LDGILG G+  +S++SQL +   ++ +  HCL 
Sbjct: 199 QTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLD 258

Query: 210 G-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
              GGG    GD +    +V  T + +D   +Y+  +  +  GG T  L        +  
Sbjct: 259 TIKGGGIFSIGDVV--QPKVKTTPLVAD-KPHYNVNLKTIDVGGTTLQLPAHIFEPGEKK 315

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
             + DSG++ TYL  + ++ +   +  +    +  +           +G   F+    V 
Sbjct: 316 GTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDV----------QGFLCFQYPGSVD 365

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
             F T+   F D     ++   P  Y   +     C+G  NGA    +D   I  +GD V
Sbjct: 366 DGFPTITFHFEDDLALHVY---PHEYFFANGNDVYCVGFQNGASQS-KDGKDIVLMGDLV 421


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/354 (26%), Positives = 164/354 (46%), Gaps = 32/354 (9%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
           S   ++H ++   GYY   ++IG P + + L +D+GS +T++ C A C +C     P ++
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQ 131

Query: 96  PSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
           P  DL     P+  ++       C+ D  QC YE +YA+  SS GVL +D  +F  T  +
Sbjct: 132 P--DLSSTYSPVKCNVDC----TCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFG-TESE 184

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
               R   GC  ++         DGI+GLG+G+ SI+ QL  + +I +    C  G   G
Sbjct: 185 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 244

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------- 264
           GG +  G  +     +++T  ++  + YY+  + E+   G+   L+  P +F        
Sbjct: 245 GGAMVLG-AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKA--LRVDPRIFDGKHGTVL 301

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG++Y YL    +      +  ++        P+     +C+ G    +NV  + + F 
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAG--RNVSQLSEVFP 359

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            + + F +G+      L+PE YL   +K  G  CLG+      G     ++GGI
Sbjct: 360 KVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GKDPTTLLGGI 407


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 172/369 (46%), Gaps = 34/369 (9%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
           +SS+   L +    +   ++H ++   GYY   +YIG P++ + L +D+GS +T++ C A
Sbjct: 62  ASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-A 120

Query: 82  PCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGV 140
            C +C     P ++P  DL     P+  ++       C++  +QC YE +YA+  SS GV
Sbjct: 121 TCEQCGNHQDPRFQP--DLSSTYSPVKCNVDC----TCDNERSQCTYERQYAEMSSSSGV 174

Query: 141 LVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           L +D  +F   +   L P R   GC   +         DGI+GLG+G+ SI+ QL  + +
Sbjct: 175 LGEDIMSFGKES--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGV 232

Query: 200 IRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
           I +    C  G   GGG +  G  +     +V++  +   + YY+  + E+   G+   L
Sbjct: 233 ISDSFSLCYGGMDVGGGTMVLG-GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRL 291

Query: 258 KNLPVVF--------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
              P +F        DSG++Y YL    +      +  ++++      P+     +C+ G
Sbjct: 292 D--PKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAG 349

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGL 367
               +NV  + + F  + + F +G+      L+PE YL   +K  G  CLG+      G 
Sbjct: 350 AG--RNVSQLSEVFPDVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GK 401

Query: 368 QDLNVIGGI 376
               ++GGI
Sbjct: 402 DPTTLLGGI 410


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 110/347 (31%), Positives = 144/347 (41%), Gaps = 49/347 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RP 96
           Y  G Y   + +G P R Y L +DTGSDL W+ C  PC+ C     ++ P   Y      
Sbjct: 31  YIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASA 89

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           S+  VPC DP C  +       C D  QC Y  +Y DG  +LG LV+D   +        
Sbjct: 90  SSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNA---- 145

Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
              +  GCG+ Q      S   LDGI+G G    S  SQL  Q    NV  HCL GG  G
Sbjct: 146 TATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205

Query: 213 GGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-VVFDSG 267
           GG L  G+    D+  +  V +    +   +  S   A L    +      +   +FDSG
Sbjct: 206 GGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSG 265

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++  YL    YQ  T        A SL  AP      LC      F     + K F  + 
Sbjct: 266 TTLAYLPDEAYQAFT-------QAVSLVVAP----FLLCDTRLSRF-----IYKLFPNVV 309

Query: 328 LSFTDGKTRTLFELTPEAYLI----ISNKGNVCLGI--LNGAEVGLQ 368
           L F +G + T   LTP  YLI     +N    C+G   +  AE  LQ
Sbjct: 310 LYF-EGASMT---LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQ 352


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 175/379 (46%), Gaps = 44/379 (11%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
           +SS+   L +    +   ++H ++   GYY   +YIG P++ + L +D+GS +T++ C A
Sbjct: 62  ASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-A 120

Query: 82  PCVRC----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELE 130
            C +C          +EA  P ++P  DL     P+  ++       C++  +QC YE +
Sbjct: 121 TCEQCGNHQSESPNIIEAHDPRFQP--DLSSTYSPVKCNVDC----TCDNERSQCTYERQ 174

Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSS 189
           YA+  SS GVL +D  +F   +   L P R   GC   +         DGI+GLG+G+ S
Sbjct: 175 YAEMSSSSGVLGEDIMSFGKES--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLS 232

Query: 190 IVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAE 247
           I+ QL  + +I +    C  G   GGG +  G  +     +V++  +   + YY+  + E
Sbjct: 233 IMDQLVEKGVISDSFSLCYGGMDVGGGTMVLG-GMPAPPDMVFSHSNPVRSPYYNIELKE 291

Query: 248 LFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
           +   G+   L   P +F        DSG++Y YL    +      +  ++++      P+
Sbjct: 292 IHVAGKALRLD--PKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD 349

Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCL 357
                +C+ G    +NV  + + F  + + F +G+      L+PE YL   +K  G  CL
Sbjct: 350 PNYKDICFAGAG--RNVSQLSEVFPDVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCL 404

Query: 358 GILNGAEVGLQDLNVIGGI 376
           G+      G     ++GGI
Sbjct: 405 GVFQN---GKDPTTLLGGI 420


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 146/343 (42%), Gaps = 47/343 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + +G P + +++ +DTGSD+ W+ C    + C + PH         LY P   
Sbjct: 85  TGLYYTEVRLGTPPKRFYVQVDTGSDILWVNC----ITCDQCPHKSGLGLDLTLYDPKAS 140

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-- 153
            +   V C+   CA         C     C+Y + Y DG S++G  V DA  F+   G  
Sbjct: 141 STGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDG 200

Query: 154 --QRLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
             Q  N  +  GCG  Q    G+S   LDGILG G+  +S++SQL +   ++ +  HCL 
Sbjct: 201 QTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLD 260

Query: 210 G-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
              GGG    GD +    +V  T + +D   +Y+  +  +  GG T  L        +  
Sbjct: 261 TIKGGGIFAIGDVV--QPKVKTTPLVAD-KPHYNVNLKTIDVGGTTLELPADIFKPGEKR 317

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
             + DSG++ TYL  + ++ +   +  +    +  +  +     LC      F+    V 
Sbjct: 318 GTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQD----FLC------FEYSGSVD 367

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
             F TL   F D     ++   P  Y   +     C+G  NGA
Sbjct: 368 DGFPTLTFHFEDDLALHVY---PHEYFFPNGNDVYCVGFQNGA 407


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 168/362 (46%), Gaps = 44/362 (12%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VE 88
            ++H ++   GYY   +YIG P++ + L +D+GS +T++ C A C +C          +E
Sbjct: 80  MRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCGNHQSESPNIIE 138

Query: 89  APHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFA 147
           A  P ++P  DL     P+  ++       C++  +QC YE +YA+  SS GVL +D  +
Sbjct: 139 AHDPRFQP--DLSSTYSPVKCNVDC----TCDNERSQCTYERQYAEMSSSSGVLGEDIMS 192

Query: 148 FNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH 206
           F   +   L P R   GC   +         DGI+GLG+G+ SI+ QL  + +I +    
Sbjct: 193 FGKES--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSL 250

Query: 207 CLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF 264
           C  G   GGG +  G  +     +V++  +   + YY+  + E+   G+   L   P +F
Sbjct: 251 CYGGMDVGGGTMVLG-GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLD--PKIF 307

Query: 265 --------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
                   DSG++Y YL    +      +  ++++      P+     +C+ G    +NV
Sbjct: 308 NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAG--RNV 365

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIG 374
             + + F  + + F +G+      L+PE YL   +K  G  CLG+      G     ++G
Sbjct: 366 SQLSEVFPDVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GKDPTTLLG 419

Query: 375 GI 376
           GI
Sbjct: 420 GI 421


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 159/356 (44%), Gaps = 42/356 (11%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            ++H ++   GYY   ++IG P + + L +DTGS +T++ C   C +C     P ++P +
Sbjct: 72  MRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPES 130

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                  P+  ++      NC+ D  QC YE +YA+  +S GVL +D  +F   N   L 
Sbjct: 131 S--STYQPVKCTIDC----NCDSDRMQCVYERQYAEMSTSSGVLGEDLISFG--NQSELA 182

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC   +         DGI+GLG+G  SI+ QL  + +I +    C  G   GGG
Sbjct: 183 PQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGG 242

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV----------- 262
             +  G  +   S + +       + YY+  + E+   G     K LP+           
Sbjct: 243 AMVLGG--ISPPSDMAFAYSDPVRSPYYNIDLKEIHVAG-----KRLPLNANVFDGKHGT 295

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           V DSG++Y YL    +      + KEL +      P+     +C+ G     +V  + K 
Sbjct: 296 VLDSGTTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAG--IDVSQLSKS 353

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           F  + + F +G+  T   L+PE Y+   +K  G  CLG+      G     ++GGI
Sbjct: 354 FPVVDMVFENGQKYT---LSPENYMFRHSKVRGAYCLGVFQN---GNDQTTLLGGI 403


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 160/367 (43%), Gaps = 49/367 (13%)

Query: 39  FQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------- 90
           F V G+  P  G Y   + +G PAR + + +DTGSD+ W+ C +PC  C ++        
Sbjct: 71  FSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELN 129

Query: 91  --HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
                   S  ++PC DPICA++             C Y   Y D   + G  V D+  F
Sbjct: 130 LFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHF 189

Query: 149 NYTNGQRL----NPRLALGCG---YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
           +   G+      +  +  GC    Y  +  A+   LDGI G G+G+ S++SQL S+ +  
Sbjct: 190 DILLGESTIANSSATIVFGCSIYQYGDLTRAT-KALDGIFGFGQGEFSVISQLSSRGITP 248

Query: 202 NVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM---SSDYT-KYYSPGVA-ELFFGGET 254
            V  HCL GG  GGG L  G+ L  S  +V++ +      YT K  S  ++ +LF     
Sbjct: 249 KVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTM 306

Query: 255 TGLKNL-PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
             + N    + DSG++  YL    Y  + S++   +S  +          P   +G + F
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSA---------TPTISRGSQCF 357

Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL----IISNKGNVCLGILNGAEVGLQD 369
           +    V   F  L  +F    +     +TPE YL    I+      C+G    AE G   
Sbjct: 358 RVSMSVADIFPVLRFNFEGIASMV---VTPEEYLQFDSIVREPALWCIG-FQKAEDG--- 410

Query: 370 LNVIGGI 376
           LN++G +
Sbjct: 411 LNILGDL 417


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 150/323 (46%), Gaps = 31/323 (9%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C + C +C     P + P  
Sbjct: 78  MRLYDDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPC-STCEQCGRHQDPKFEP-- 134

Query: 99  DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           +L     P+  ++       C++   QC YE +YA+  SS GVL +D  +F   N   L 
Sbjct: 135 ELSSTYQPVSCNIDC----TCDNERKQCVYERQYAEMSSSSGVLGEDIISFG--NQSELV 188

Query: 158 PRLALGCGYNQVPGASY-HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P+ A+    NQ  G  Y    DGI+GLG+G  SIV QL  + +I +    C  G   GGG
Sbjct: 189 PQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGG 248

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------D 265
             +  G  +   S +V+       ++YY+  +  +   G+   L   P +F        D
Sbjct: 249 AMILGG--ISPPSGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLD--PSIFDGKHGTVLD 304

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++Y YL    +      M KEL++      P+     +C+ G     +V  +   F  
Sbjct: 305 SGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAE--SDVSQLSNTFPA 362

Query: 326 LALSFTDGKTRTLFELTPEAYLI 348
           + + F++G+      L+PE YL 
Sbjct: 363 VEMVFSNGQK---LSLSPENYLF 382


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 158/368 (42%), Gaps = 58/368 (15%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           V G    +G Y V   +G P + + L +DTGSDL ++QC APC  C E   PLY+PSN  
Sbjct: 24  VSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQC-APCDLCYEQDGPLYQPSNSS 82

Query: 100 ---LVPCEDPICASLHAPGHHNC-----EDPAQ--CDYELEYADGGSSLGVLVKDAFAFN 149
               VPC+   C  + AP    C     E P Q  C YE  Y D  S++GV    A+   
Sbjct: 83  TFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVF---AYETA 139

Query: 150 YTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
              G R+N  +A GCG  NQ    S+    G+LGLG+G  S  SQ  +     N   +CL
Sbjct: 140 TVGGIRVN-HVAFGCGNRNQ---GSFVSAGGVLGLGQGALSFTSQ--AGYAFENKFAYCL 193

Query: 209 SG-----GGGGFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGET----- 254
           +           L FGDD+    +D       S   + + YY   +  + FGGET     
Sbjct: 194 TSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYV-QIVRICFGGETLLIPD 252

Query: 255 -----TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
                  + N   +FDSG++ TY +   Y  + +  +K  S    +  P  + LPLC   
Sbjct: 253 SAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEK--SVPYPRAPPSPQGLPLC--- 307

Query: 310 RRPFKNVHDVK-KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
                NV  +    + +  + F  G T   +      Y I  +    CL +L  +  G  
Sbjct: 308 ----VNVSGIDHPIYPSFTIEFDQGAT---YRPNQGNYFIEVSPNIDCLAMLESSSDG-- 358

Query: 369 DLNVIGGI 376
             NVIG I
Sbjct: 359 -FNVIGNI 365


>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
          Length = 310

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 80/239 (33%), Positives = 117/239 (48%), Gaps = 22/239 (9%)

Query: 148 FNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           FN  NG R      LG  ++Q       P    GILGL     S+ SQL S+ +I NV G
Sbjct: 3   FNRYNGGR-KASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFG 61

Query: 206 HCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET--TGLKNLP 261
           HC++    GGG++F GDD      + W  +       Y     ++ +G +    G+  + 
Sbjct: 62  HCITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGIP-VQ 120

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           V+   G+SYTYL    Y+ L   +K++  + S  +   D TLPLCWK          V+ 
Sbjct: 121 VISRCGTSYTYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKAD------FSVRS 172

Query: 322 CFRTLALSFTDGKTRTL----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F+ L L F  G+   +    F + P+ YLIIS+KGNVCLG+LNG E+      ++G +
Sbjct: 173 FFKPLNLHF--GRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 229


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 101/345 (29%), Positives = 150/345 (43%), Gaps = 49/345 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------VE-APHPLYRPSN-DL 100
           G Y   + +G P+R + + +DTGSD+ W+ C A C+RC      VE  P+ +   S    
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRKSDLVELTPYDVDASSTAKS 141

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR----L 156
           V C D  C+ ++      C   + C Y + Y DG S+ G LVKD    +   G R     
Sbjct: 142 VSCSDNFCSYVNQ--RSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGST 199

Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           N  +  GCG  Q    G S   +DGI+G G+  SS +SQL SQ  ++    HCL    GG
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFDS 266
            +F   ++  S +V  T M S  + +YS  +  +  G     L         +  V+ DS
Sbjct: 260 GIFAIGEVV-SPKVKTTPMLSK-SAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDS 317

Query: 267 GSSYTYLNRVTYQ-TLTSIMKK--ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           G++  YL    Y   L  I+    EL+  +++E+               F   H   K  
Sbjct: 318 GTTLVYLPDAVYNPLLNEILASHPELTLHTVQES---------------FTCFHYTDKLD 362

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
           R   ++F   K+ +L  + P  YL    +   C G  NG   GLQ
Sbjct: 363 RFPTVTFQFDKSVSL-AVYPREYLFQVREDTWCFGWQNG---GLQ 403


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 145/335 (43%), Gaps = 38/335 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    T  Y  ++ +G PA    ++LDTGSD +W+QC  PC  C E    L+ PS     
Sbjct: 126 GKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCK-PCPDCYEQHEALFDPSKSSTY 184

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             + C    C  L +   HNC    +C YE+ YAD   ++G L +D    + T+     P
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAV---P 241

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GCG+N     S+  +DG+LGLG+GK+S+ SQ+ ++        +CL  S    G+L
Sbjct: 242 GFVFGCGHNNA--GSFGEIDGLLGLGRGKASLSSQVAAR--YGAGFSYCLPSSPSATGYL 297

Query: 217 -FFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DS 266
            F G      +   +T M +  +  +Y   +  +   G    +K  P VF        DS
Sbjct: 298 SFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGR--AIKVPPSVFATAAGTIIDS 355

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G++++ L    Y  L S ++  +     K AP       C+         H+  +   ++
Sbjct: 356 GTAFSCLPPSAYAALRSSVRSAMG--RYKRAPSSTIFDTCYD-----LTGHETVR-IPSV 407

Query: 327 ALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGIL 360
           AL F DG T     L P   L   SN    CL  L
Sbjct: 408 ALVFADGAT---VHLHPSGVLYTWSNVSQTCLAFL 439


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 161/351 (45%), Gaps = 33/351 (9%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C + C  C +   P ++P  
Sbjct: 76  MRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPC-SDCEHCGKHQDPRFQP-- 132

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           D      P+  ++      NC+ D   C YE  YA+  SS GVL +D  +F     + + 
Sbjct: 133 DESSTYHPVKCNMDC----NCDHDGVNCVYERRYAEMSSSSGVLGEDIISFG-NQSEVVP 187

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGG 214
            R   GC   +         DGI+GLG+G+ SIV QL  + +I +    C  G   GGG 
Sbjct: 188 QRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGA 247

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL------KNLPVVFDSGS 268
            +  G  +     +V++      + YY+  + E+   G+   L      +    V DSG+
Sbjct: 248 MVLGG--IPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGT 305

Query: 269 SYTYLNRVTYQTLT-SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +Y YL    +     +I+KK  + K +   P+     +C+ G    ++V  + K F  + 
Sbjct: 306 TYAYLPEEAFVAFRDAIIKKSHNLKQI-HGPDPNYNDICFSGAG--RDVSQLSKAFPEVD 362

Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           + F++G+      LTPE YL    K  G  CLGI    +       ++GGI
Sbjct: 363 MVFSNGQK---LSLTPENYLFQHTKVHGAYCLGIFRNGD----STTLLGGI 406


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 94/334 (28%), Positives = 146/334 (43%), Gaps = 41/334 (12%)

Query: 39  FQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------- 90
           F V G+  P  G Y   + +G PAR + + +DTGSD+ W+ C +PC  C ++        
Sbjct: 71  FSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELN 129

Query: 91  --HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
                   S  ++PC DPICA++             C Y   Y D   + G  V D+  F
Sbjct: 130 LFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHF 189

Query: 149 NYTNGQRL----NPRLALGCG---YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
           +   G+      +  +  GC    Y  +  A+   LDGI G G+G+ S++SQL S+ +  
Sbjct: 190 DILLGESTIANSSATIVFGCSIYQYGDLTRAT-KALDGIFGFGQGEFSVISQLSSRGITP 248

Query: 202 NVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM---SSDYT-KYYSPGVA-ELFFGGET 254
            V  HCL GG  GGG L  G+ L  S  +V++ +      YT K  S  ++ +LF     
Sbjct: 249 KVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTM 306

Query: 255 TGLKNL-PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
             + N    + DSG++  YL    Y  + S++   +S  +          P   +G + F
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSA---------TPTISRGSQCF 357

Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
           +    V   F  L  +F    +     +TPE YL
Sbjct: 358 RVSMSVADIFPVLRFNFEGIASMV---VTPEEYL 388


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 157/359 (43%), Gaps = 40/359 (11%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+   +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C +C     P +
Sbjct: 67  SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST-CEQCGRHQDPKF 125

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCE-----DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
                     DP  +S + P   N +     D  QC YE +YA+  +S GVL +D  +F 
Sbjct: 126 ----------DPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFG 175

Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
             N   L P R   GC   +         DGI+GLG G  S+V QL  +  I +    C 
Sbjct: 176 --NQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY 233

Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN------ 259
            G   GGG  +  G  +   S +++T      + YY+  + E+   G+   L +      
Sbjct: 234 GGMDIGGGAMVLGG--ISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGR 291

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              V DSG++Y YL    +      +  E+ +    + P+     +C+ G     +  ++
Sbjct: 292 YGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG--SDAAEL 349

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
              F T+ + F +G+      LTPE Y    +K  G  CLGI    E G     ++GGI
Sbjct: 350 SNKFPTVDMVFENGQK---LSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGI 402


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 157/359 (43%), Gaps = 40/359 (11%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
           S+   +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C +C     P +
Sbjct: 67  SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST-CEQCGRHQDPKF 125

Query: 95  RPSNDLVPCEDPICASLHAPGHHNCE-----DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
                     DP  +S + P   N +     D  QC YE +YA+  +S GVL +D  +F 
Sbjct: 126 ----------DPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFG 175

Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
             N   L P R   GC   +         DGI+GLG G  S+V QL  +  I +    C 
Sbjct: 176 --NQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY 233

Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN------ 259
            G   GGG  +  G  +   S +++T      + YY+  + E+   G+   L +      
Sbjct: 234 GGMDIGGGAMVLGG--ISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGR 291

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              V DSG++Y YL    +      +  E+ +    + P+     +C+ G     +  ++
Sbjct: 292 YGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG--SDAAEL 349

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
              F T+ + F +G+      LTPE Y    +K  G  CLGI    E G     ++GGI
Sbjct: 350 SNKFPTVDMVFENGQK---LSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGI 402


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 146/348 (41%), Gaps = 51/348 (14%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           VG  + F V G   P   G Y   + +G PA+ +++ +DTGSD+ W+ C    + C   P
Sbjct: 63  VGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKEFYVQIDTGSDILWINC----ITCSNCP 118

Query: 91  HP------------LYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSS 137
           H                 +  LV C DPIC+         C   A QC Y  +Y DG  +
Sbjct: 119 HSSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGT 178

Query: 138 LGVLVKDAFAFNYTN-GQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSI 190
            G  V D   F+    GQ +    +  +  GC   Q    +     +DGI G G G  S+
Sbjct: 179 TGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSV 238

Query: 191 VSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           +SQL S+ +   V  HCL GG  GGG L  G+ L  S  +V++ +      +Y+  +  +
Sbjct: 239 ISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEPS--IVYSPLVPS-QPHYNLNLQSI 295

Query: 249 FFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
              G+   +         N   + DSG++  YL +  Y             K++  A   
Sbjct: 296 AVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFV---------KAITAAVSQ 346

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
            + P+  KG + +   + V   F  ++L+F  G +     L PE YL+
Sbjct: 347 FSKPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMV---LNPEHYLM 391


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 97/345 (28%), Positives = 152/345 (44%), Gaps = 49/345 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------VE-APHPLYRPSN-DL 100
           G Y   + +G P+R + + +DTGSD+ W+ C A C+RC      VE  P+     S    
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRKSDLVELTPYDADASSTAKS 141

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR----L 156
           V C D  C+ ++      C   + C Y + Y DG S+ G LV+D    +   G R     
Sbjct: 142 VSCSDNFCSYVNQ--RSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGST 199

Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           N  +  GCG  Q    G S   +DGI+G G+  SS +SQL SQ  ++    HCL    GG
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFDS 266
            +F   ++  S +V  T M S  + +YS  +  +  G     L         +  V+ DS
Sbjct: 260 GIFAIGEVV-SPKVKTTPMLSK-SAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDS 317

Query: 267 GSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           G++  YL    Y  L + +    +EL+  +++++               F   H + +  
Sbjct: 318 GTTLVYLPDAVYNPLMNQILASHQELNLHTVQDS---------------FTCFHYIDRLD 362

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
           R   ++F   K+ +L  + P+ YL    +   C G  NG   GLQ
Sbjct: 363 RFPTVTFQFDKSVSL-AVYPQEYLFQVREDTWCFGWQNG---GLQ 403


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 140/350 (40%), Gaps = 66/350 (18%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LY----R 95
            G Y   + IG PAR Y++ +DTGSD+ W+ C    ++C E P          LY     
Sbjct: 95  VGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC----IQCNECPKKSSLGMELTLYDIKES 150

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            +  LV C+   C +++      C     C Y   YADG SS G  V+D   ++  +G  
Sbjct: 151 LTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDL 210

Query: 155 ---RLNPRLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
                N  +  GC   Q    +S   LDGILG GK  +S++SQL S   +R +  HCL G
Sbjct: 211 ETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDG 270

Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
             GGG    G  +    +V  T +  + T +Y+  +  +  GG      NLP        
Sbjct: 271 LNGGGIFAIGHIV--QPKVNTTPLVPNQT-HYNVNMKAVEVGGY---FLNLPTDVFDVGD 324

Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
               + DSG++  YL  V Y  L S +                     W+       +HD
Sbjct: 325 KKGTIIDSGTTLAYLPEVVYDQLLSKI-------------------FSWQSDLKVHTIHD 365

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYL-------IISNKGNVCLGILN 361
              CF+  + S  DG     F      YL       + S  G  C+G  N
Sbjct: 366 QFTCFQ-YSESLDDGFPAVTFHFENSLYLKVHPHEYLFSYDGLWCIGWQN 414


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 90/347 (25%), Positives = 148/347 (42%), Gaps = 42/347 (12%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCED 105
           Y+  T+ +G P R + + +DTGS +T++ C   C  C +     + P        + C D
Sbjct: 12  YFYTTLKLGTPERTFSVIIDTGSTITYIPCKD-CSHCGKHTAEWFDPDKSTTAKKLACGD 70

Query: 106 PICASLHAPGHHNCEDPA------QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
           P+C         NC  P+      +C Y   YA+  SS G +++D F F  ++      R
Sbjct: 71  PLC---------NCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---R 118

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
           L  GC   +         DGI+G+G   ++  SQL  +K+I +V   C      G L  G
Sbjct: 119 LVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLG 178

Query: 220 D-DLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGL------KNLPVVFDSGSSYT 271
           D  L + +  V+T + +  +  YY+  +  +   G+T         +    V DSG+++T
Sbjct: 179 DVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFT 238

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAP--EDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YL    ++ +   +   +  K L+  P  + +   +CWKG        D+ K F      
Sbjct: 239 YLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAP--DQFKDLDKYFPPAEFV 296

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           F  G   T   L P  YL +S     CLGI +    G     ++GG+
Sbjct: 297 FGGGAKLT---LPPLRYLFLSKPAEYCLGIFDNGNSGA----LVGGV 336


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 151/363 (41%), Gaps = 55/363 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
           G Y   + IG PA+ Y++ +DTGSD+ W+ C    ++C + P          LY      
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESD 133

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG--- 153
           S  LV C+D  C  +       C+    C Y   Y DG S+ G  VKD   ++   G   
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193

Query: 154 -QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            Q  N  +  GCG  Q   +  ++   LDGILG GK  SS++SQL S   ++ +  HCL 
Sbjct: 194 TQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLD 253

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGL 257
           G  GG +F    +    +V  T +  +   Y            +    A+LF  G+  G 
Sbjct: 254 GRNGGGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG- 311

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                + DSG++  YL  + Y+ L   +  +  A  +    +D          + F+   
Sbjct: 312 ----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKD---------YKCFQYSG 358

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
            V + F  +   F   +      + P  YL   ++G  C+G  N A +  +D   +  +G
Sbjct: 359 RVDEGFPNVTFHF---ENSVFLRVYPHDYL-FPHEGMWCIGWQNSA-MQSRDRRNMTLLG 413

Query: 378 DFV 380
           D V
Sbjct: 414 DLV 416


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 104/369 (28%), Positives = 166/369 (44%), Gaps = 43/369 (11%)

Query: 35  SSLLFQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
           S++   + GN +P+  G Y   + IG P++ Y++ +DTGSD+ W+ C A C RC     +
Sbjct: 56  SAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDL 114

Query: 88  EAPHPLY----RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
                LY      ++D V C+D  C+    P    C+   QC Y + Y DG S+ G  V+
Sbjct: 115 GVDLTLYDMKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQ 173

Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
           D   +N  +G       N  +  GCG  Q    G+S   LDGILG G+  SS++SQL S 
Sbjct: 174 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 233

Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
             ++ V  HCL    GG +F   ++ +  +V  T +  +   +Y+  + E+  GG+   +
Sbjct: 234 GKVKKVFSHCLDNVDGGGIFAIGEVVE-PKVNITPLVQN-QAHYNVVMKEIEVGGDPLDV 291

Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
                        + DSG++  Y  +  Y  L          K L + P D  L    + 
Sbjct: 292 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-DLRLHTVEQA 342

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL- 367
              F    +V   F T+ L F    + T++   P  YL    +   C+G  N GA+    
Sbjct: 343 FTCFDYTGNVDDGFPTVTLHFDKSISLTVY---PHEYLFQVKEFEWCIGWQNSGAQTKDG 399

Query: 368 QDLNVIGGI 376
           +DL ++G +
Sbjct: 400 KDLTLLGDL 408


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 162/361 (44%), Gaps = 43/361 (11%)

Query: 43  GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY- 94
           GN +P+  G Y   + IG P++ Y++ +DTGSD+ W+ C A C RC     +     LY 
Sbjct: 145 GNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYD 203

Query: 95  ---RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
                ++D V C+D  C+    P    C+   QC Y + Y DG S+ G  V+D   +N  
Sbjct: 204 MKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRI 262

Query: 152 NGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           +G       N  +  GCG  Q    G+S   LDGILG G+  SS++SQL S   ++ V  
Sbjct: 263 SGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFS 322

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-------- 257
           HCL    GG +F   ++ +  +V  T +  +   +Y+  + E+  GG+   +        
Sbjct: 323 HCLDNVDGGGIFAIGEVVE-PKVNITPLVQN-QAHYNVVMKEIEVGGDPLDVPSDAFESG 380

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                + DSG++  Y  +  Y  L          K L + P D  L    +    F    
Sbjct: 381 DRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-DLRLHTVEQAFTCFDYTG 431

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL-QDLNVIGG 375
           +V   F T+ L F    + T++   P  YL    +   C+G  N GA+    +DL ++G 
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVY---PHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGD 488

Query: 376 I 376
           +
Sbjct: 489 L 489


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 142/349 (40%), Gaps = 59/349 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP---------HPLYRPSN 98
           TG Y   + +G P + Y++ +DTGSD+ W+ C + C +C              P    S 
Sbjct: 81  TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCIS-CEKCPRKSGLGLDLTFYDPKASSSG 139

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
             V C+   CA+ +      C     C+Y + Y DG S+ G  V DA  F+   G    Q
Sbjct: 140 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQ 199

Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-G 211
             N  +  GCG  Q    G+S   LDGILG G+  +S++SQL +   ++ +  HCL    
Sbjct: 200 PGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIK 259

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVV 263
           GGG    G+ +    +V  T + +D   +Y+  +  +  GG T  L        +    +
Sbjct: 260 GGGIFAIGNVV--QPKVKTTPLVAD-MPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTI 316

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD----- 318
            DSG++ TYL  + +        KE+ A    +  +             F NV D     
Sbjct: 317 IDSGTTLTYLPELVF--------KEVMAAIFNKHQD-----------IVFHNVQDFMCFQ 357

Query: 319 ----VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
               V   F T+   F D        + P  Y   +     C+G  NGA
Sbjct: 358 YPGSVDDGFPTITFHFED---DLALHVYPHEYFFPNGNDMYCVGFQNGA 403


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 145/348 (41%), Gaps = 51/348 (14%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           VG  + F V G   P   G Y   + +G PA+ +++ +DTGSD+ W+ C    + C   P
Sbjct: 63  VGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKDFYVQIDTGSDILWINC----ITCSNCP 118

Query: 91  HP------------LYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSS 137
           H                 +  LV C DPIC+         C   A QC Y  +Y DG  +
Sbjct: 119 HSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGT 178

Query: 138 LGVLVKDAFAFNYTN-GQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSI 190
            G  V D   F+    GQ +    +  +  GC   Q    +     +DGI G G G  S+
Sbjct: 179 TGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSV 238

Query: 191 VSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           +SQL S+ +   V  HCL GG  GGG L  G+ L  S  +V++ +      +Y+  +  +
Sbjct: 239 ISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEPS--IVYSPLVPSL-PHYNLNLQSI 295

Query: 249 FFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
              G+   +         N   + DSG++  YL +  Y      +   +S  S       
Sbjct: 296 AVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFS------- 348

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
              P+  KG + +   + V   F  ++L+F  G +     L PE YL+
Sbjct: 349 --KPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMV---LNPEHYLM 391


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 140/350 (40%), Gaps = 66/350 (18%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LY----R 95
            G Y   + IG PAR Y++ +DTGSD+ W+ C    ++C E P          LY     
Sbjct: 95  VGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC----IQCNECPKKSSLGMELTLYDIKES 150

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            +  LV C+   C +++      C     C Y   YADG SS G  V+D   ++  +G  
Sbjct: 151 LTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDL 210

Query: 155 ---RLNPRLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
                N  +  GC   Q    +S   LDGILG GK  +S++SQL S   +R +  HCL G
Sbjct: 211 ETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDG 270

Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
             GGG    G  +    +V  T +  + T +Y+  +  +  GG      NLP        
Sbjct: 271 LNGGGIFAIGHIV--QPKVNTTPLVPNQT-HYNVNMKAVEVGGY---FLNLPTDVFDVGD 324

Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
               + DSG++  YL  V Y  L S +                     W+       +HD
Sbjct: 325 KKGTIIDSGTTLAYLPEVVYDQLLSKI-------------------FSWQSDLKVHTIHD 365

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYL-------IISNKGNVCLGILN 361
              CF+  + S  DG     F      YL       + S  G  C+G  N
Sbjct: 366 QFTCFQ-YSESLDDGFPAVTFHFENSLYLKVHPHEYLFSYDGLWCIGWQN 414


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 98/341 (28%), Positives = 153/341 (44%), Gaps = 32/341 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPIC 108
           GYY   ++IG P + + L +DTGS +T++ C   C  C     P +RP  +      P+ 
Sbjct: 91  GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CKHCGSHQDPKFRP--EASETYQPVK 147

Query: 109 ASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL-GCGY 166
            +       NC+D   QC YE  YA+  +S GVL +D  +F   N   L+P+ A+ GC  
Sbjct: 148 CTWQC----NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFG--NQSELSPQRAIFGCEN 201

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG-DDLYDS 225
           ++         DGI+GLG+G  SI+ QL  +K+I +    C  G G G        +   
Sbjct: 202 DETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISPP 261

Query: 226 SRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVT 277
           + +V+T      + YY+  + E+   G+   L   P VF        DSG++Y YL    
Sbjct: 262 ADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLN--PKVFDGKHGTVLDSGTTYAYLPESA 319

Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
           +      + KE  +      P+     +C+ G     NV  + K F  + + F +G    
Sbjct: 320 FLAFKHAIMKETHSLKRISGPDPHYNDICFSGAE--INVSQLSKSFPVVEMVFGNGHK-- 375

Query: 338 LFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
              L+PE YL   +K  G  CLG+ +    G     ++GGI
Sbjct: 376 -LSLSPENYLFRHSKVRGAYCLGVFSN---GNDPTTLLGGI 412


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 163/363 (44%), Gaps = 54/363 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + IG PA+ Y++ +DTGSD+ W+ C    V C   P          +Y P   
Sbjct: 87  TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
            S +LV C+   C + +     +C   + C+Y + Y DG S+ G  V D   +N  +G  
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202

Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
                N  ++ GCG       G+S   LDGILG G+  SS++SQL +   +R +  HCL 
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
           +  GGG    G+ +    +V  T + SD   +Y+  +  +  GG   GL         + 
Sbjct: 263 TVNGGGIFAIGNVV--QPKVKTTPLVSD-MPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
             + DSG++  Y+    Y+ L +++    +++S ++L++         C      F+   
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-------C------FQYSG 366

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
            V   F  +   F +G    +  ++P  YL  + K   C+G  NG  V  +D   +  +G
Sbjct: 367 SVDDGFPEVTFHF-EGDVSLI--VSPHDYLFQNGKNLYCMGFQNGG-VQTKDGKDMVLLG 422

Query: 378 DFV 380
           D V
Sbjct: 423 DLV 425


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 147/356 (41%), Gaps = 48/356 (13%)

Query: 35  SSLLFQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------ 86
           S++  Q+ GN +P+  G Y   + +G P + Y++ +DTGSD+ W+ C A C  C      
Sbjct: 56  SAIDLQLGGNGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNCPKKSDL 114

Query: 87  ---VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
              +    P    +++ V C    C S +      C     C+Y + Y DG S+ G  V+
Sbjct: 115 GIELSLYSPSSSSTSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVR 174

Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
           D    +   G       N  +  GCG  Q    GA+   LDGILG G+  SS++SQL S 
Sbjct: 175 DHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASS 234

Query: 198 KLIRNVVGHCLSG-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG 256
             ++ V  HCL    GGG    G+ +    R   T+       +Y+  +  +    E   
Sbjct: 235 GKVKRVFAHCLDNINGGGIFAIGEVVQPKVR---TTPLVPQQAHYNVFMKAIEVDNE--- 288

Query: 257 LKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           + NLP            + DSG++  Y   V Y+ L S +    S   L    E  T   
Sbjct: 289 VLNLPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTC-- 346

Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
                  F+   +V   F T+   F D  + T++   P  YL   +    C+G  N
Sbjct: 347 -------FEYDGNVDDGFPTVTFHFEDSLSLTVY---PHEYLFDIDSNKWCVGWQN 392


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 151/357 (42%), Gaps = 42/357 (11%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH------PLY 94
           +H ++   GYY   ++IG PA+ + L +DTGS +T++    PC  C    H      P +
Sbjct: 89  LHDDLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYV----PCSSCTHCGHHQACFDPRF 144

Query: 95  RPSN----DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
           +P N      V C  P C +       +     QC YE  YA+  SS GVL KD   F  
Sbjct: 145 KPDNSSSYQTVSCNSPDCITKMCDARVH-----QCKYERVYAEMSSSKGVLGKDLLGFG- 198

Query: 151 TNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            NG RL P  L  GC   +         DGI+GLG+G  SIV QL     + +    C  
Sbjct: 199 -NGSRLQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYG 257

Query: 210 G--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN------LP 261
           G   GGG +  G  +     +V+     + + YY+  ++E+   G +  + +      L 
Sbjct: 258 GMDEGGGSMVLG-AIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLG 316

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            V DSG++Y YL    +      + ++L +      P+     +C+ G     +   + K
Sbjct: 317 TVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAG--SDSKALGK 374

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            F  +   F+ G  +    L PE YL    K  G  CLG     +       ++GGI
Sbjct: 375 HFPPVDFVFS-GNQKVF--LAPENYLFKHTKVPGAYCLGFFKNQDA----TTLLGGI 424


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 152/372 (40%), Gaps = 61/372 (16%)

Query: 31  NHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-- 86
           + VG  + F V G+  P   G Y   + +G P   + + +DTGSD+ W+ C + C  C  
Sbjct: 78  SSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPH 136

Query: 87  ----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
                      +AP  L   S   V C DPIC+S+       C +  QC Y   Y DG  
Sbjct: 137 SSGLGIDLHFFDAPGSLTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSG 193

Query: 137 SLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  + D F F+   G+ L    +  +  GC   Q      S   +DGI G GKGK S+
Sbjct: 194 TSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSV 253

Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY----YSPG 244
           VSQL S+ +   V  HCL   G GGG    G+ L     +V++ +      Y     S G
Sbjct: 254 VSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLVPSQPHYNLNLLSIG 311

Query: 245 V--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           V        A +F    T G      + D+G++ TYL +  Y         +L   ++  
Sbjct: 312 VNGQMLPLDAAVFEASNTRG-----TIVDTGTTLTYLVKEAY---------DLFLNAISN 357

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL----IISNK 352
           +      P+   G + +     +   F +++L+F  G +     L P+ YL    I    
Sbjct: 358 SVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS---MMLRPQDYLFHYGIYDGA 414

Query: 353 GNVCLGILNGAE 364
              C+G     E
Sbjct: 415 SMWCIGFQKAPE 426


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/328 (28%), Positives = 137/328 (41%), Gaps = 46/328 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-EAPHP-----LYRPS-- 97
           + TG Y   +Y+G P   Y++ +DTGSD+TWL C APC  CV E   P      Y PS  
Sbjct: 32  FVTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNC-APCTSCVTETQLPSIKLTTYDPSRS 90

Query: 98  --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQ 154
             +  + C D  C +       +C     C Y   Y DG S+ G  ++D   F    N  
Sbjct: 91  STDGALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNT 150

Query: 155 RLN--PRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           ++N    +  GCG  Q      S   LDG++G G+   SI SQL S   + N   HCL G
Sbjct: 151 QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQG 210

Query: 211 G--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG---------ETTGLKN 259
              GGG +  G        + +T + S    +Y+ G+  +   G         +TT    
Sbjct: 211 DNQGGGTIVIGS--VSEPNISYTPIVS--RNHYAVGMQNIAVNGRNVTTPASFDTTSTSA 266

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
             V+ DSG++  YL    Y   T  +    + +S   +   + L L W           +
Sbjct: 267 GGVIMDSGTTLAYLVDPAY---TQFVNAVSTFESSMFSSHSQCLQLAW---------CSL 314

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYL 347
           +  F T+ L F  G    +  LTP  YL
Sbjct: 315 QADFPTVKLFFDAGA---VMNLTPRNYL 339


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 145/358 (40%), Gaps = 60/358 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y V + IG P  P    LDTGSDL W QCDAPC RC   P PLY P+       V C
Sbjct: 89  TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C +L +P          C Y   Y DG S+ GVL  + F        R    +A G
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVR---GVAFG 205

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
           CG   +   S     G++G+G+G  S+VSQL   +       +C +         LF G 
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVTRF-----SYCFTPFNATAASPLFLG- 257

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAE------LFFGGETTGLKNLP------------- 261
               S+R+   + ++ +    S G         L   G T G   LP             
Sbjct: 258 ---SSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGD 314

Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
             V+ DSG+++T L    +  L   +   +       A     L LC+    P     +V
Sbjct: 315 GGVIIDSGTTFTALEESAFVALARALASRVRLPLASGA--HLGLSLCFAAASP--EAVEV 370

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
            +    L L F DG      EL  E+Y++      V CLG+++      + ++V+G +
Sbjct: 371 PR----LVLHF-DGAD---MELRRESYVVEDRSAGVACLGMVSA-----RGMSVLGSM 415


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 165/376 (43%), Gaps = 53/376 (14%)

Query: 35  SSLLFQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
           S++   + GN +P+  G Y   + IG P++ Y++ +DTGSD+ W+ C A C RC     +
Sbjct: 60  SAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDL 118

Query: 88  EAPHPLY----RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
                LY      ++D V C+D  C+    P    C+   QC Y + Y DG S+ G  V+
Sbjct: 119 GVDLTLYDMKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQ 177

Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
           D   +N  +G       N  +  GCG  Q    G+S   LDGILG G+  SS++SQL S 
Sbjct: 178 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 237

Query: 198 KLIRNVVGHCLSG-GGGGFLFFGDD--------LYDSSRVVWTSMSSDYTKYYSPGVAEL 248
             ++ V  HCL    GGG    G+         L +S  +V   +S     +Y+  + E+
Sbjct: 238 GKVKKVFSHCLDNVDGGGIFAIGEVVEPKVRFLLMNSVMIVVLFLSR---AHYNVVMKEI 294

Query: 249 FFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
             GG+   +             + DSG++  Y  +  Y  L          K L + P D
Sbjct: 295 EVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-D 345

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
             L    +    F    +V   F T+ L F    + T++   P  YL    +   C+G  
Sbjct: 346 LRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVY---PHEYLFQVKEFEWCIGWQ 402

Query: 361 N-GAEVGL-QDLNVIG 374
           N GA+    +DL ++G
Sbjct: 403 NSGAQTKDGKDLTLLG 418


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 150/363 (41%), Gaps = 55/363 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
           G Y   + IG PA+ Y++ +DTGSD+ W+ C    ++C + P          LY      
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESD 133

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG--- 153
           S  LV C+D  C  +       C+    C Y   Y DG S+ G  VKD   ++   G   
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193

Query: 154 -QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            Q  N  +  GCG  Q   +  ++   LDGILG GK  SS++SQL S   ++ +  HCL 
Sbjct: 194 TQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLD 253

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGL 257
           G  GG +F    +    +V  T +  +   Y            +    A+LF  G+  G 
Sbjct: 254 GRNGGGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKG- 311

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                + DSG++  YL  + Y+ L   +  +  A  +    +D          + F+   
Sbjct: 312 ----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKD---------YKCFQYSG 358

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
            V + F  +   F   +      + P  YL    +G  C+G  N A +  +D   +  +G
Sbjct: 359 RVDEGFPNVTFHF---ENSVFLRVYPHDYL-FPYEGMWCIGWQNSA-MQSRDRRNMTLLG 413

Query: 378 DFV 380
           D V
Sbjct: 414 DLV 416


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 153/343 (44%), Gaps = 49/343 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----V 101
           +  G Y + + IG P R +   +DTGSDL W QC APC+ CVE P P + P+       +
Sbjct: 80  FSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASL 138

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           PC   +C +L++P    C   A C Y+  Y D  SS GVL  + F F   + +   PR++
Sbjct: 139 PCSSAMCNALYSP---LCFQNA-CVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVS 194

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFF 218
            GCG N   G  ++   G++G G+G  S+VSQL S +       +CL+         L+F
Sbjct: 195 FGCG-NMNAGTLFNG-SGMVGFGRGALSLVSQLGSPRF-----SYCLTSFMSPATSRLYF 247

Query: 219 GDDLYDSSRVVWTSMSSDYTKYY-SPGVAELFF---GGETTGLKNLP------------- 261
           G     +S    +S     T +  +P +  ++F    G +     LP             
Sbjct: 248 GAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDG 307

Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
              V+ DSG++ T+L +  Y  +       +        P D T   C+K   P + +  
Sbjct: 308 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-TFDTCFKWPPPPRRMVT 366

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGIL 360
           + +    + L F DG      EL  E Y+++    GN+CL +L
Sbjct: 367 LPE----MVLHF-DGAD---MELPLENYMVMDGGTGNLCLAML 401


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 153/343 (44%), Gaps = 49/343 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----V 101
           +  G Y + + IG P R +   +DTGSDL W QC APC+ CVE P P + P+       +
Sbjct: 83  FSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASL 141

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           PC   +C +L++P    C   A C Y+  Y D  SS GVL  + F F   + +   PR++
Sbjct: 142 PCSSAMCNALYSP---LCFQNA-CVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVS 197

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFF 218
            GCG N   G  ++   G++G G+G  S+VSQL S +       +CL+         L+F
Sbjct: 198 FGCG-NMNAGTLFNG-SGMVGFGRGALSLVSQLGSPRF-----SYCLTSFMSPATSRLYF 250

Query: 219 GDDLYDSSRVVWTSMSSDYTKYY-SPGVAELFF---GGETTGLKNLP------------- 261
           G     +S    +S     T +  +P +  ++F    G +     LP             
Sbjct: 251 GAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDG 310

Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
              V+ DSG++ T+L +  Y  +       +        P D T   C+K   P + +  
Sbjct: 311 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-TFDTCFKWPPPPRRMVT 369

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGIL 360
           + +    + L F DG      EL  E Y+++    GN+CL +L
Sbjct: 370 LPE----MVLHF-DGAD---MELPLENYMVMDGGTGNLCLAML 404


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 152/372 (40%), Gaps = 61/372 (16%)

Query: 31  NHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-- 86
           + VG  + F V G+  P   G Y   + +G P   + + +DTGSD+ W+ C + C  C  
Sbjct: 78  SSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPH 136

Query: 87  ----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
                      +AP  L   S   V C DPIC+S+       C +  QC Y   Y DG  
Sbjct: 137 SSGLGIDLHFFDAPGSLTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSG 193

Query: 137 SLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  + D F F+   G+ L    +  +  GC   Q      S   +DGI G GKGK S+
Sbjct: 194 TSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSV 253

Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY----YSPG 244
           VSQL S+ +   V  HCL   G GGG    G+ L     +V++ +      Y     S G
Sbjct: 254 VSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLVPSQPHYNLNLLSIG 311

Query: 245 V--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           V        A +F    T G      + D+G++ TYL +  Y         +L   ++  
Sbjct: 312 VNGQMLPLDAAVFEASNTRG-----TIVDTGTTLTYLVKEAY---------DLFLNAISN 357

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL----IISNK 352
           +      P+   G + +     +   F +++L+F  G +     L P+ YL    I    
Sbjct: 358 SVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS---MMLRPQDYLFHYGIYDGA 414

Query: 353 GNVCLGILNGAE 364
              C+G     E
Sbjct: 415 SMWCIGFQKAPE 426


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 82/275 (29%), Positives = 122/275 (44%), Gaps = 44/275 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
           G Y   + IG PA+ Y++ +DTGSD+ W+ C    ++C + P          LY      
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESD 133

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG--- 153
           S  LV C+D  C  +       C+    C Y   Y DG S+ G  VKD   ++   G   
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193

Query: 154 -QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            Q  N  +  GCG  Q   +  ++   LDGILG GK  SS++SQL S   ++ +  HCL 
Sbjct: 194 TQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLD 253

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGL 257
           G  GG +F    +    +V  T +  +   Y            +    A+LF  G+  G 
Sbjct: 254 GRNGGGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG- 311

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
                + DSG++  YL  + Y+ L   +KKE + K
Sbjct: 312 ----AIIDSGTTLAYLPEIIYEPL---VKKEPALK 339


>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
          Length = 154

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/148 (43%), Positives = 82/148 (55%), Gaps = 5/148 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G     VVFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEVVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEA 297
           YT++    Y  + S ++  LS  SL+E 
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEEV 149


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 145/358 (40%), Gaps = 60/358 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y V + IG P  P    LDTGSDL W QCDAPC RC   P PLY P+       V C
Sbjct: 89  TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C +L +P          C Y   Y DG S+ GVL  + F        R    +A G
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVR---GVAFG 205

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
           CG   +   S     G++G+G+G  S+VSQL   +       +C +         LF G 
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVTRF-----SYCFTPFNATAASPLFLG- 257

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAE------LFFGGETTGLKNLP------------- 261
               S+R+   + ++ +    S G         L   G T G   LP             
Sbjct: 258 ---SSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGD 314

Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
             V+ DSG+++T L    +  L   +   +       A     L LC+    P     +V
Sbjct: 315 GGVIIDSGTTFTALEERAFVALARALASRVRLPLASGA--HLGLSLCFAAASP--EAVEV 370

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
            +    L L F DG      EL  E+Y++      V CLG+++      + ++V+G +
Sbjct: 371 PR----LVLHF-DGAD---MELRRESYVVEDRSAGVACLGMVSA-----RGMSVLGSM 415


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 158/359 (44%), Gaps = 56/359 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y + M IG PAR Y   LDTGSDL W QC APC+ CV+ P P + P+N      + C 
Sbjct: 90  GEYLMEMGIGTPARFYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPANSSTYRSLGCS 148

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
            P C +L+ P  +       C Y+  Y D  S+ GVL  + F F   + +   PR++ GC
Sbjct: 149 APACNALYYPLCYQ----KTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGC 204

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGDD 221
           G   +   S     G++G G+G  S+VSQL S +       +CL+         L+FG  
Sbjct: 205 G--NLNAGSLANGSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVRSRLYFGAY 257

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV---------------- 262
              +S    T  S+ +    +P +  ++F    G + G   LP+                
Sbjct: 258 ATLNSTNASTVQSTPFI--INPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGT 315

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKEL-SAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           + DSG++ TYL    Y  +       L S   L +  E   L  C++   P +    + +
Sbjct: 316 IIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQ 375

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGL------QDLNVI 373
               L L F DG     +EL  + Y+++  + G +CL +   ++  +      Q+ NV+
Sbjct: 376 ----LVLHF-DGAD---WELPLQNYMLVDPSTGGLCLAMATSSDGSIIGSYQHQNFNVL 426


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 94/336 (27%), Positives = 138/336 (41%), Gaps = 38/336 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
           TG Y V+M +G PAR   +  DTGSDL+W+QC  PC  C E   PL+ P+       VPC
Sbjct: 143 TGNYVVSMGLGTPARDMTVVFDTGSDLSWVQC-TPCSDCYEQKDPLFDPARSSTYSAVPC 201

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P C  L +    +C    +C YE+ Y D   + G L +D      ++   + P    G
Sbjct: 202 ASPECQGLDS---RSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD---VLPGFVFG 255

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
           CG        +   DG++GLG+ K S+ SQ  S+        +CL  S    G+L  G  
Sbjct: 256 CGEQDT--GLFGRADGLVGLGREKVSLSSQAASK--YGAGFSYCLPSSPSAAGYLSLGGP 311

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTYLN 274
              ++R        D   +Y   +  +   G T  ++  P+VF       DSG+  T L 
Sbjct: 312 APANARFTAMETRHDSPSFYYVRLVGVKVAGRT--VRVSPIVFSAAGTVIDSGTVITRLP 369

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
              Y  L S   + +     K AP    L  C+     F     V+    ++AL F  G 
Sbjct: 370 PRVYAALRSAFARSMGRYGYKRAPALSILDTCYD----FTGHTTVR--IPSVALVFAGGA 423

Query: 335 TRTLFELTPEAYLIISNKGNVCLGIL---NGAEVGL 367
                 L     L ++     CL      +GA+ G+
Sbjct: 424 A---VGLDFSGVLYVAKVSQACLAFAPNGDGADAGI 456


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/269 (28%), Positives = 122/269 (45%), Gaps = 35/269 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + IG P + Y + +DTGSD+ W+ C    + C + P          LY P   
Sbjct: 80  TGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC----ISCNKCPRKSDLGIDLRLYDPKGS 135

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-- 153
            S   V C+   CA+ +      C     C+Y + Y DG S+ G  V D+  +N  +G  
Sbjct: 136 SSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDG 195

Query: 154 --QRLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
             +  N  +  GCG  Q    G++   LDGI+G G+  +S++SQL +   ++ +  HCL 
Sbjct: 196 QTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLD 255

Query: 210 G-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
              GGG    GD +    +V  T +  D   +Y+  +  +  GG T  L        +  
Sbjct: 256 TIKGGGIFAIGDVV--QPKVKSTPLVPD-MPHYNVNLESINVGGTTLQLPSHMFETGEKK 312

Query: 261 PVVFDSGSSYTYLNRVTYQ-TLTSIMKKE 288
             + DSG++ TYL  + Y+  L ++  K 
Sbjct: 313 GTIIDSGTTLTYLPELVYKDVLAAVFAKH 341


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 157/351 (44%), Gaps = 32/351 (9%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +DTGS +T++ C   C  C     P +RP +
Sbjct: 81  MRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCST-CRHCGSHQDPKFRPED 139

Query: 99  DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                  P+  +       NC+ D  QC YE  YA+  +S G L +D  +F   N   L+
Sbjct: 140 S--ETYQPVKCTWQC----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFG--NQTELS 191

Query: 158 PRLAL-GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
           P+ A+ GC  ++         DGI+GLG+G  SI+ QL  +K+I +    C  G G G  
Sbjct: 192 PQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGG 251

Query: 217 FFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSG 267
                 +   + +V+T      + YY+  + E+   G+   L   P VF        DSG
Sbjct: 252 AMVLGGISPPADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLN--PKVFDGKHGTVLDSG 309

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++Y YL    +      + KE  +      P+     +C+ G     +V  + K F  + 
Sbjct: 310 TTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAE--IDVSQISKSFPVVE 367

Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
           + F +G       L+PE YL   +K  G  CLG+ +    G     ++GGI
Sbjct: 368 MVFGNGHK---LSLSPENYLFRHSKVRGAYCLGVFSN---GNDPTTLLGGI 412


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 162/363 (44%), Gaps = 54/363 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + IG PA+ Y++ +DTGSD+ W+ C    V C   P          +Y P   
Sbjct: 87  TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
            S +LV C+   C + +     +C   + C+Y + Y DG S+ G  V D   +N  +G  
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202

Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
                N  ++ GCG       G+S   LDGILG G+  SS++SQL +   +R +  HCL 
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
           +  GGG    G+ +    +V  T +  D   +Y+  +  +  GG   GL         + 
Sbjct: 263 TVNGGGIFAIGNVV--QPKVKTTPLVPD-MPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
             + DSG++  Y+    Y+ L +++    +++S ++L++         C      F+   
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-------C------FQYSG 366

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
            V   F  +   F +G    +  ++P  YL  + K   C+G  NG  V  +D   +  +G
Sbjct: 367 SVDDGFPEVTFHF-EGDVSLI--VSPHDYLFQNGKNLYCMGFQNGG-VQTKDGKDMVLLG 422

Query: 378 DFV 380
           D V
Sbjct: 423 DLV 425


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 84/282 (29%), Positives = 126/282 (44%), Gaps = 56/282 (19%)

Query: 45  VYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDL 100
           V P+G   Y + + IG P +P    LDTGSDL W QC APC  C+  P PL+ P  S+  
Sbjct: 95  VRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPAASSSY 153

Query: 101 VP--CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
           VP  C   +C  +    HH+C+ P  C Y   Y DG ++LGV   + F F  ++G++L+ 
Sbjct: 154 VPMRCSGQLCNDIL---HHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSV 210

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--------- 209
            L  GCG   V   S +   GI+G G+   S+VSQL  ++       +CL+         
Sbjct: 211 PLGFGCGTMNV--GSLNNGSGIVGFGRDPLSLVSQLSIRRF-----SYCLTPYTSTRKST 263

Query: 210 ---GGGGGFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP- 261
              G     +F GDD       ++R++ +  +  +  YY P      F G T G + L  
Sbjct: 264 LMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTF--YYVP------FTGVTVGTRRLRI 315

Query: 262 --------------VVFDSGSSYTYLNRVTYQTLTSIMKKEL 289
                         V+ DSG++ T         +    + +L
Sbjct: 316 PLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL 357


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 163/361 (45%), Gaps = 44/361 (12%)

Query: 43  GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY- 94
           GN +P+  G Y   + IG P++ Y++ +DTGSD+ W+ C A C RC     +     LY 
Sbjct: 145 GNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYD 203

Query: 95  ---RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
                ++D V C+D  C+    P    C+   QC Y + Y DG S+ G  V+D   +N  
Sbjct: 204 MKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRI 262

Query: 152 NGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
           +G       N  +  GCG  Q    G+S   LDGILG G+  SS++SQL S   ++ V  
Sbjct: 263 SGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFS 322

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-------- 257
           HCL    GG +F   ++ +  +V  T +  +   +Y+  + E+  GG+   +        
Sbjct: 323 HCLDNVDGGGIFAIGEVVE-PKVNITPLVQN-QAHYNVVMKEIEVGGDPLDVPSDAFESG 380

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                + DSG++  Y  +  Y  L          K L + P D  L    +    F    
Sbjct: 381 DRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-DLRLHTVEQAFTCFDYTG 431

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL-QDLNVIGG 375
           +V   F T+ L F    + T++   P  YL   ++   C+G  N GA+    +DL ++G 
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVY---PHEYL-FQHEFEWCIGWQNSGAQTKDGKDLTLLGD 487

Query: 376 I 376
           +
Sbjct: 488 L 488


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 157/380 (41%), Gaps = 55/380 (14%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           VG  + F V G+  P   G Y   + +G P R + + +DTGSD+ W+ C + C  C +  
Sbjct: 61  VGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSS-CSNCPQTS 119

Query: 91  ---------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGV 140
                          +  LVPC  PIC S        C   + QC Y  +Y DG  + G 
Sbjct: 120 GLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGY 179

Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
            V D F F+   G+ L    +  +  GC   Q    +     +DGI G G+G+ S++SQL
Sbjct: 180 YVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQL 239

Query: 195 HSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
            S  +   V  HCL G   GGG L  G+ L     +V++ +      +Y+  +  +   G
Sbjct: 240 SSHGITPRVFSHCLKGEDSGGGILVLGEIL--EPGIVYSPLVPS-QPHYNLDLQSIAVSG 296

Query: 253 ETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           +   +         N   + D+G++  YL    Y    S +   +S  +          P
Sbjct: 297 QLLPIDPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLA---------TP 347

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGA 363
              KG + +   + V + F  ++ +F  G T     L PE YL+ ++N     L      
Sbjct: 348 TINKGNQCYLVSNSVSEVFPPVSFNFAGGATML---LKPEEYLMYLTNYAGAALWC---- 400

Query: 364 EVGLQDLNVIGGI---GDFV 380
            +G Q +   GGI   GD V
Sbjct: 401 -IGFQKIQ--GGITILGDLV 417


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 93/346 (26%), Positives = 154/346 (44%), Gaps = 53/346 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + IG PA+ Y++ +DTGSD+ W+ C    V C   P          +Y P   
Sbjct: 87  TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
            S +LV C+   C + +     +C   + C+Y + Y DG S+ G  V D   +N  +G  
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202

Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
                N  ++ GCG       G+S   LDGILG G+  SS++SQL +   +R +  HCL 
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
           +  GGG    G+ +    +V  T +  D   +Y+  +  +  GG   GL         + 
Sbjct: 263 TVNGGGIFAIGNVV--QPKVKTTPLVPD-MPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
             + DSG++  Y+    Y+ L +++    +++S ++L++         C      F+   
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-------C------FQYSG 366

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
            V   F  +   F +G    +  ++P  YL  + K   C+G  NG 
Sbjct: 367 SVDDGFPEVTFHF-EGDVSLI--VSPHDYLFQNGKNLYCMGFQNGG 409


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 87/289 (30%), Positives = 133/289 (46%), Gaps = 26/289 (8%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R+ S  +  SS  +F    ++L  Q  G    +G Y VT+ +G P + + L  DTGSDLT
Sbjct: 99  RVDSIHARLSSHGVFQEKQATLPVQ-SGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLT 157

Query: 76  WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELE 130
           W QC+ PC + C +   P   P+       + C    C  L   G  +C  P  C Y+++
Sbjct: 158 WTQCE-PCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSSPT-CLYQVQ 215

Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
           Y DG  S+G    +    + +N   +      GCG  Q     +    G+LGLG+ K S+
Sbjct: 216 YGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNSGLFRGAAGLLGLGRTKLSL 270

Query: 191 VSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDY--TKYYSPGVA 246
            SQ  +QK  + +  +CL  S    G+L FG  +  S  V +T +S D+  T +Y   + 
Sbjct: 271 PSQT-AQKY-KKLFSYCLPASSSSKGYLSFGGQV--SKTVKFTPLSEDFKSTPFYGLDIT 326

Query: 247 ELFFGG-----ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
           EL  GG     + +       V DSG+  T L    Y  L+S  +K ++
Sbjct: 327 ELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPSTAYSALSSAFQKLMT 375


>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
 gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
 gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
 gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
 gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
          Length = 154

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 81/147 (55%), Gaps = 5/147 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
           YT++    Y  + S ++  LS  SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEE 148


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 148/360 (41%), Gaps = 72/360 (20%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y + + IG P   Y   +DTGSDL W QC APCV C + P P +RP+      LVPC 
Sbjct: 90  GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
            P+CA+L  P    C   + C Y+  Y D  S+ GVL  + F F   N  + +   +A G
Sbjct: 149 SPLCAALPYPA---CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFG 205

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG------------- 210
           CG   +         G++GLG+G  S+VSQL   +       +CL+              
Sbjct: 206 CG--NINSGQLANSSGMVGLGRGPLSLVSQLGPSRF-----SYCLTSFLSPEPSRLNFGV 258

Query: 211 ----GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
                G      G  +  +  VV  ++ S Y          +   G + G K LP     
Sbjct: 259 FATLNGTNASSSGSPVQSTPLVVNAALPSLYF---------MSLKGISLGQKRLPIDPLV 309

Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET---LPLCWK 308
                     V  DSG+S T+L +  Y  +    ++EL +      P ++T   L  C+ 
Sbjct: 310 FAINDDGTGGVFIDSGTSLTWLQQDAYDAV----RRELVSVLRPLPPTNDTEIGLETCF- 364

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGL 367
              P+     V      + L F  G   T   + PE Y++I    G +CL ++   +  +
Sbjct: 365 ---PWPPPPSVAVTVPDMELHFDGGANMT---VPPENYMLIDGATGFLCLAMIRSGDATI 418


>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
          Length = 140

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 80/141 (56%), Gaps = 5/141 (3%)

Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFL 216
           +A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G L
Sbjct: 1   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
           +FGD    S  V W  M  +   YYSPG+AEL    +   G      VFDSGS+YT++  
Sbjct: 61  YFGDFNPPSRGVTWVPM-KESXXYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119

Query: 276 VTYQTLTSIMKKELSAKSLKE 296
             Y  + S ++  LS  SL+E
Sbjct: 120 QIYNEIVSKVRGTLSESSLEE 140


>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
          Length = 152

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 81/147 (55%), Gaps = 5/147 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 1   QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSS 60

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+
Sbjct: 61  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 119

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
           YT++    Y  + S ++  LS  SL+E
Sbjct: 120 YTHVPAQIYNEIVSKVRGTLSESSLEE 146


>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
          Length = 149

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 64/148 (43%), Positives = 81/148 (54%), Gaps = 5/148 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEA 297
           YT++    Y  + S ++  LS  SL+E 
Sbjct: 122 YTHVPAQIYNEILSKVRGTLSESSLEEV 149


>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
          Length = 154

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 81/147 (55%), Gaps = 5/147 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
           YT++    Y  + S ++  LS  SL+E
Sbjct: 122 YTHVPAQIYNEILSKVRGTLSESSLEE 148


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 93/310 (30%), Positives = 128/310 (41%), Gaps = 54/310 (17%)

Query: 43  GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-------- 92
           GN  PT  G Y   + IG PA+ Y++ +DTGSD+ W+ C    V C   P          
Sbjct: 71  GNGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNC----VFCDTCPRKSGLGIELT 126

Query: 93  LYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
           LY PS       V C    C + H     +C   A C Y + Y DG S+ G  V D   +
Sbjct: 127 LYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQY 186

Query: 149 NYTNGQR----LNPRLALGCG--YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           N  +G       N  +  GCG       G+S   LDGILG G+  SS++SQL +   +R 
Sbjct: 187 NQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRK 246

Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL----- 257
           V  HCL    GG +F   D+      V T+       +Y+  +  +  GG    L     
Sbjct: 247 VFAHCLDTINGGGIFAIGDVVQPK--VSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIF 304

Query: 258 ---KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
              ++   + DSG++  YL  V Y    +IM K  +                  G  P K
Sbjct: 305 DIGESKGTIIDSGTTLAYLPGVVYN---AIMSKVFAQ----------------YGDMPLK 345

Query: 315 NVHDVKKCFR 324
           N  D  +CFR
Sbjct: 346 NDQDF-QCFR 354


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 116/268 (43%), Gaps = 43/268 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR---- 95
            G Y   + IG P++ Y++ +DTGSD+ W+ C    ++C E P          LY     
Sbjct: 83  VGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNC----IQCRECPRTSSLGMELTLYNIKDS 138

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            S  LVPC++  C  ++      C     C Y   Y DG S+ G  VKD   ++  +G  
Sbjct: 139 VSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDL 198

Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                N  +  GCG  Q   +   S   LDGILG GK  SS++SQL + + ++ +  HCL
Sbjct: 199 QTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL 258

Query: 209 SG-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA------------ELFFGGETT 255
            G  GGG    G  +    +V  T +  +   Y     A            E F  G+  
Sbjct: 259 DGINGGGIFAIGHVV--QPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRK 316

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTS 283
           G      + DSG++  YL  + Y+ L S
Sbjct: 317 G-----AIIDSGTTLAYLPEIVYEPLVS 339


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 151/355 (42%), Gaps = 42/355 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RPSND 99
           G Y   + +G P + Y++ +DTGSD+ W+ C APC +C     +  P  LY      ++ 
Sbjct: 75  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKASSTSK 133

Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR---- 155
            V CED  C+ +       C     C Y + Y DG +S G  VKD    +   G      
Sbjct: 134 NVGCEDAFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAP 191

Query: 156 LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
           L   +  GCG NQ    G +   +DGI+G G+  +S++SQL +   ++ +  HCL    G
Sbjct: 192 LAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNG 251

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFD 265
           G +F   ++   S VV T+       +Y+  +  +   GE   L         +   + D
Sbjct: 252 GGIFAIGEV--ESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIID 309

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++  YL +  Y +L   ++K  + + +K     ET          F    +  K F  
Sbjct: 310 SGTTLAYLPQNLYNSL---IEKITAKQQVKLHMVQETFAC-------FSFTSNTDKAFPV 359

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
           + L F D    +++   P  YL    +   C G  +G        +VI  +GD V
Sbjct: 360 VNLHFEDSLKLSVY---PHDYLFSLREDMYCFGWQSGGMTTQDGADVI-LLGDLV 410


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/334 (28%), Positives = 151/334 (45%), Gaps = 32/334 (9%)

Query: 56  YIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPG 115
           +IG P + + L +DTGS +T++ C++ C +C     P ++P  DL     P+  +   P 
Sbjct: 1   WIGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQP--DLSDTYHPVKCN---PD 54

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASY 174
                +  QC YE +YA+  SS G+L +D  +F   N   L P R   GC   +      
Sbjct: 55  CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFG--NMSELKPQRAVFGCENAETGDLFS 112

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
              DGI+GLG+G  SIV QL  + +I +    C  G   GGG +  G  +   S +V++ 
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG-QISPPSDMVFSH 171

Query: 233 MSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSI 284
              D + YY+  +  L   G+   +   P VF        DSG++Y YL    +      
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN--PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQA 229

Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
           +  EL        P+     +C+ G      + ++ K F ++ + F +G+    + L+PE
Sbjct: 230 ITSELHGLKQIRGPDPNYNDVCFSGAG--SEIPELYKTFPSVDMVFDNGEK---YSLSPE 284

Query: 345 AYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            YL   +K  G  CLG+      G     ++GGI
Sbjct: 285 NYLFKHSKVHGAYCLGVFQN---GKDPTTLLGGI 315


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/334 (28%), Positives = 151/334 (45%), Gaps = 32/334 (9%)

Query: 56  YIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPG 115
           +IG P + + L +DTGS +T++ C++ C +C     P ++P  DL     P+  +   P 
Sbjct: 1   WIGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQP--DLSDTYHPVKCN---PD 54

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASY 174
                +  QC YE +YA+  SS G+L +D  +F   N   L P R   GC   +      
Sbjct: 55  CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFG--NMSELKPQRAVFGCENAETGDLFS 112

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
              DGI+GLG+G  SIV QL  + +I +    C  G   GGG +  G  +   S +V++ 
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG-QISPPSDMVFSH 171

Query: 233 MSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSI 284
              D + YY+  +  L   G+   +   P VF        DSG++Y YL    +      
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN--PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQA 229

Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
           +  EL        P+     +C+ G      + ++ K F ++ + F +G+    + L+PE
Sbjct: 230 ITSELHGLKQIRGPDPNYNDVCFSGAG--SEIPELYKTFPSVDMVFDNGEK---YSLSPE 284

Query: 345 AYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            YL   +K  G  CLG+      G     ++GGI
Sbjct: 285 NYLFKHSKVHGAYCLGVFQN---GKDPTTLLGGI 315


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 152/377 (40%), Gaps = 66/377 (17%)

Query: 31  NHVGSSLLFQVHGNVYP-------TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           + VG  + F V G+  P       T  Y   + +G P   + + +DTGSD+ W+ C + C
Sbjct: 78  SSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-C 136

Query: 84  VRC------------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEY 131
             C             +AP  L   S   V C DPIC+S+       C +  QC Y   Y
Sbjct: 137 SNCPHSSGLGIDLHFFDAPGSLTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRY 193

Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGK 185
            DG  + G  + D F F+   G+ L    +  +  GC   Q      S   +DGI G GK
Sbjct: 194 GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGK 253

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--- 240
           GK S+VSQL S+ +   V  HCL   G GGG    G+ L     +V++ +      Y   
Sbjct: 254 GKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLVPSQPHYNLN 311

Query: 241 -YSPGV--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
             S GV        A +F    T G      + D+G++ TYL +  Y         +L  
Sbjct: 312 LLSIGVNGQMLPLDAAVFEASNTRG-----TIVDTGTTLTYLVKEAY---------DLFL 357

Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL---- 347
            ++  +      P+   G + +     +   F +++L+F  G +     L P+ YL    
Sbjct: 358 NAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS---MMLRPQDYLFHYG 414

Query: 348 IISNKGNVCLGILNGAE 364
           I       C+G     E
Sbjct: 415 IYDGASMWCIGFQKAPE 431


>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
          Length = 154

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 78/141 (55%), Gaps = 5/141 (3%)

Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFL 216
           +A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G L
Sbjct: 9   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 68

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
           + GD    +  V W  M      YYSPG+A LF   +   G      VFDSGS+YTY+  
Sbjct: 69  YVGDFNPPTRGVTWVPMRESLF-YYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYMPA 127

Query: 276 VTYQTLTSIMKKELSAKSLKE 296
             Y  L S ++  LS  SL+E
Sbjct: 128 QIYNELVSKIRGTLSESSLEE 148


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 147/360 (40%), Gaps = 72/360 (20%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y + + IG P   Y   +DTGSDL W QC APCV C + P P +RP+      LVPC 
Sbjct: 90  GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
            P+CA+L  P    C   + C Y+  Y D  S+ GVL  + F F   N  + +   +A G
Sbjct: 149 SPLCAALPYPA---CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFG 205

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG------------- 210
           CG   +         G++GLG+G  S+VSQL   +       +CL+              
Sbjct: 206 CG--NINSGQLANSSGMVGLGRGPLSLVSQLGPSRF-----SYCLTSFLSPEPSRLNFGV 258

Query: 211 ----GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
                G      G  +  +  VV  ++ S Y          +   G + G K LP     
Sbjct: 259 FATLNGTNASSSGSPVQSTPLVVNAALPSLYF---------MSLKGISLGQKRLPIDPLV 309

Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET---LPLCWK 308
                     V  DSG+S T+L +  Y  +    + EL +      P ++T   L  C+ 
Sbjct: 310 FAINDDGTGGVFIDSGTSLTWLQQDAYDAV----RHELVSVLRPLPPTNDTEIGLETCF- 364

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGL 367
              P+     V      + L F  G   T   + PE Y++I    G +CL ++   +  +
Sbjct: 365 ---PWPPPPSVAVTVPDMELHFDGGANMT---VPPENYMLIDGATGFLCLAMIRSGDATI 418


>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
          Length = 154

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/147 (42%), Positives = 81/147 (55%), Gaps = 5/147 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           +R   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   ERDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK-NLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +  G       VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMRESLF-YYSPGLAELLIDNQPIGGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
           YT++    Y  + S ++  LS  SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEE 148


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 157/387 (40%), Gaps = 49/387 (12%)

Query: 5   HNGENLCFPTVRMSSSSSSSSSSSLFNHVG------SSLLFQVHGNVYPTGYYNVTMYIG 58
           H   N+ FP VR     + + ++   +  G      S +   + GN  PT        IG
Sbjct: 23  HANANMVFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDLALGGNGRPTSTGLYYTKIG 82

Query: 59  QPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCEDP 106
                Y++ +DTGSD  W+ C    V C   P          LY P    ++ +VPC+D 
Sbjct: 83  LGPNDYYVQVDTGSDTLWVNC----VGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDE 138

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLAL 162
            C S +      C+    C Y + Y DG ++ G  +KD   F+   G       N  +  
Sbjct: 139 FCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIF 198

Query: 163 GCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
           GCG  Q   +   +   LDGI+G G+  SS++SQL +   ++ V  HCL    GG +F  
Sbjct: 199 GCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAI 258

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSGSSYT 271
            ++      V T+       +Y+  + ++   G+   L             + DSG++  
Sbjct: 259 GEVVQPK--VKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLA 316

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
           YL    Y  L   ++K L+ +S  E    E    C+     + +   +   F T+  +F 
Sbjct: 317 YLPVSIYDQL---LEKTLAQRSGMELYLVEDQFTCFH----YSDEKSLDDAFPTVKFTFE 369

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLG 358
           +G T T +   P  YL    +   C+G
Sbjct: 370 EGLTLTAY---PHDYLFPFKEDMWCIG 393


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 107/372 (28%), Positives = 165/372 (44%), Gaps = 58/372 (15%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-------PL 93
           +HG V   GY+  T+++G PAR + + +DTGS +T++ C A C R    PH       P 
Sbjct: 52  LHGAVKDYGYFYATLHLGTPARQFAVIVDTGSTITYVPC-ASCGRNC-GPHHKDAAFDPA 109

Query: 94  YRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
              S+ ++ C+   C     P    C +  +C Y+  YA+  SS G+LV D       +G
Sbjct: 110 SSSSSAVIGCDSDKCICGRPP--CGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLR--DG 165

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGG 212
                 +  GC   +         DGILGLG  + S+V+QL    +I +V   C  S  G
Sbjct: 166 A---VEVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEG 222

Query: 213 GGFLFFGD---DLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLK------NLPV 262
            G L  GD     YD +      +SS  +  YYS  +  L+ GG+   +K          
Sbjct: 223 DGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGT 282

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA--------PEDETLP----LCWKGR 310
           V DSG+++TYL    +Q    + K+ +SA +L+          P++++      +C+ G 
Sbjct: 283 VLDSGTTFTYLPSEAFQ----LFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGG- 337

Query: 311 RPFKNVHD---VKKCFRTLALSFTDG-KTRTLFELTPEAYLII--SNKGNVCLGILNGAE 364
            P     D   ++K F    L F DG + RT     P  YL +     G  CLG+ +   
Sbjct: 338 APHAGHADQSKLEKVFPVFELQFADGVRLRT----GPLNYLFMHTGEMGAYCLGVFDNGA 393

Query: 365 VGLQDLNVIGGI 376
            G     ++GGI
Sbjct: 394 SG----TLLGGI 401


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 150/344 (43%), Gaps = 34/344 (9%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-CVRCVEAPH------PLYRP----SN 98
           Y NVT  +G P+  + + LDTGSDL WL CD   CVR ++AP        +Y P    ++
Sbjct: 56  YANVT--VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTS 113

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQR 155
             VPC   +C      G       + C Y++ Y ++G SS GVLV+D      N  + + 
Sbjct: 114 TKVPCNSTLCTR----GDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKA 169

Query: 156 LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +  R+  GCG  QV    +H     +G+ GLG    S+ S L  + +  N    C    G
Sbjct: 170 IPARVTFGCG--QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG 227

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
            G + FGD      R    ++   +   Y+  V ++  GG T  L+    VFDSG+S+TY
Sbjct: 228 AGRISFGDKGSVDQRETPLNIRQPHPT-YNITVTKISVGGNTGDLE-FDAVFDSGTSFTY 285

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF--KNVHDVKKCFRTLALSF 330
           L    Y  ++         K  +    +     C+  R P    + H  K  F+  A++ 
Sbjct: 286 LTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNL 345

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           T     +     P   + + +    CL I+      ++D+++IG
Sbjct: 346 TMKGGSSYPVYHPLVVIPMKDTDVYCLAIMK-----IEDISIIG 384


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 159/352 (45%), Gaps = 34/352 (9%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            ++H ++   GYY   ++IG P + + L +DTGS +T++ C + C +C     P ++P  
Sbjct: 1   MRLHDDLLINGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS-CEQCGRHQDPKFQP-- 57

Query: 99  DLVPCEDPICASLHAPGHHNCED-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           DL      +  ++      NC+D   QC YE +YA+  +S GVL +D  +F   N   L 
Sbjct: 58  DLSSTYQSVKCNIDC----NCDDEKQQCVYERQYAEMSTSSGVLGEDIISFG--NLSALA 111

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF- 215
           P R   GC   +         DGI+G+G+G  SIV  L  + +I +    C  G G G  
Sbjct: 112 PQRAVFGCENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGG 171

Query: 216 -LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DS 266
            +  G  +   S +V++      + YY+  + E+   G+   L   P VF        DS
Sbjct: 172 AMVLG-GISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLN--PTVFDGKHGTILDS 228

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G++Y YL    + +    + KEL +      P+     +C+ G     ++  +   F  +
Sbjct: 229 GTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAG--SDISQLSSSFPAV 286

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            + F +G+      L+PE YL   +K  G  CLGI      G     ++GGI
Sbjct: 287 EMVFGNGQK---LLLSPENYLFRHSKVHGAYCLGIFQN---GKDPTTLLGGI 332


>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
          Length = 150

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 78/141 (55%), Gaps = 5/141 (3%)

Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFL 216
           +A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G L
Sbjct: 7   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 66

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
           + GD    +  V W  M      YYSPG+A LF   +   G      VFDSGS+YTY+  
Sbjct: 67  YVGDFNPPTRGVTWVPMRESLF-YYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYVPA 125

Query: 276 VTYQTLTSIMKKELSAKSLKE 296
             Y  L S ++  LS  SL+E
Sbjct: 126 QIYNELVSKIRGTLSESSLEE 146


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 146/348 (41%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P+     
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 231

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             + C  P C+ L   G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 232 ANISCAAPACSDLDTRG---CSG-GNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 285 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 340

Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            F  G      +R+    ++ +   +Y  G+  +  GG+   L ++P         + DS
Sbjct: 341 DFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFTTAGTIVDS 397

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L S     ++A+  K+AP    L  C+     F  +  V     T+
Sbjct: 398 GTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 451

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G      ++     +  ++   VCLG     + G  D+ ++G
Sbjct: 452 SLLFQGGAR---LDVDASGIMYAASVSQVCLGFAANEDGG--DVGIVG 494


>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
          Length = 141

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 79/142 (55%), Gaps = 5/142 (3%)

Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFL 216
           +A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I+ NV+GHCLS  G G L
Sbjct: 1   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
           + GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+YT++  
Sbjct: 61  YVGDFNPPSRGVTWVPMRESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119

Query: 276 VTYQTLTSIMKKELSAKSLKEA 297
             Y  + S ++  LS  SL+E 
Sbjct: 120 QIYNEIVSKVRGTLSEPSLEEV 141


>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
          Length = 154

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/151 (42%), Positives = 81/151 (53%), Gaps = 5/151 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYS G+AEL    +   G      VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSAGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
           YT++    Y  + S ++  LS  SL+E   D
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEEVKGD 152


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 156/362 (43%), Gaps = 52/362 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSND--- 99
           TG Y   + IG P + Y++ +DTGSD+ W+ C   C RC     +     LY P +    
Sbjct: 86  TGLYYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGLGLELTLYDPKDSSTG 144

Query: 100 -LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
             V C+   CA+ +      C     C+Y + Y DG S+ G  V D   F+  +G    +
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 204

Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-G 211
             N  +  GCG  Q    G+S   LDGI+G G+  +S++SQL +   ++ +  HCL    
Sbjct: 205 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN 264

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVV 263
           GGG    G+ +    +V  T +  +   +Y+  +  +  GG    L        +    +
Sbjct: 265 GGGIFAIGNVV--QPKVKTTPLVPN-MPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTI 321

Query: 264 FDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            DSG++ TYL  + Y+ +   +    K+++  +++E        LC      F+ V  V 
Sbjct: 322 IDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF-------LC------FQYVGRVD 368

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI--GD 378
             F  +   F +     ++   P  Y   +     C+G  NG   GLQ  +  G +  GD
Sbjct: 369 DDFPKITFHFENDLPLNVY---PHDYFFENGDNLYCVGFQNG---GLQSKDGKGMVLLGD 422

Query: 379 FV 380
            V
Sbjct: 423 LV 424


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 77/264 (29%), Positives = 116/264 (43%), Gaps = 35/264 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA--------PHPLYRPSN- 98
            G Y   + IG P++ Y++ +DTGSD+ W+ C   C  C           P+ L   +  
Sbjct: 84  VGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMELTPYDLEESTTG 142

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ---- 154
            LV C++  C  ++      C     C Y   Y DG S+ G  VKD   +N  +G     
Sbjct: 143 KLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETT 202

Query: 155 RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
             N  +  GCG  Q   +  +    LDGILG GK  SSI+SQL S + ++ +  HCL G 
Sbjct: 203 AANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGT 262

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGV----------AELFFGGETTGLKN 259
            GG +F    +    +V  T +  +   Y     GV          A++F  G+  G   
Sbjct: 263 NGGGIFAMGHVVQ-PKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG--- 318

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTS 283
              + DSG++  YL  + Y+ L +
Sbjct: 319 --TIIDSGTTLAYLPELIYEPLVA 340


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 158/361 (43%), Gaps = 50/361 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYRP--S 97
           TG Y   + IG P + Y++ +DTGSD+ W+ C    +RC   P           Y P  S
Sbjct: 81  TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGS 136

Query: 98  NDLVPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
              V CE   C +  A G    C   +  C + + Y DG ++ G  V D   +N    NG
Sbjct: 137 GTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196

Query: 154 QRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
           Q    N  +  GCG       G+S   LDGILG G+  SS++SQL + + +R +  HCL 
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
           +  GGG    G+ +    +V  T +  + T +Y+  +  +  GG T  L         + 
Sbjct: 257 TVRGGGIFAIGNVV--QPKVKTTPLVPNVT-HYNVNLQGISVGGATLQLPTSTFDSGDSK 313

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL-CWKGRRPFKNVHDV 319
             + DSG++  YL R  Y+TL + +  +            + LPL  ++    F+    +
Sbjct: 314 GTIIDSGTTLAYLPREVYRTLLAAVFDKY-----------QDLPLHNYQDFVCFQFSGSI 362

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
              F  +  SF    T  ++   P+ YL  +     C+G L+G  V  +D   +  +GD 
Sbjct: 363 DDGFPVITFSFKGDLTLNVY---PDDYLFQNRNDLYCMGFLDGG-VQTKDGKDMLLLGDL 418

Query: 380 V 380
           V
Sbjct: 419 V 419


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 158/361 (43%), Gaps = 50/361 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYRP--S 97
           TG Y   + IG P + Y++ +DTGSD+ W+ C    +RC   P           Y P  S
Sbjct: 81  TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGS 136

Query: 98  NDLVPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
              V CE   C +  A G    C   +  C + + Y DG ++ G  V D   +N    NG
Sbjct: 137 GTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196

Query: 154 QRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
           Q    N  +  GCG       G+S   LDGILG G+  SS++SQL + + +R +  HCL 
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
           +  GGG    G+ +    +V  T +  + T +Y+  +  +  GG T  L         + 
Sbjct: 257 TVRGGGIFAIGNVV--QPKVKTTPLVPNVT-HYNVNLQGISVGGATLQLPTSTFDSGDSK 313

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL-CWKGRRPFKNVHDV 319
             + DSG++  YL R  Y+TL + +  +            + LPL  ++    F+    +
Sbjct: 314 GTIIDSGTTLAYLPREVYRTLLAAVFDKY-----------QDLPLHNYQDFVCFQFSGSI 362

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
              F  +  SF    T  ++   P+ YL  +     C+G L+G  V  +D   +  +GD 
Sbjct: 363 DDGFPVITFSFEGDLTLNVY---PDDYLFQNRNDLYCMGFLDGG-VQTKDGKDMLLLGDL 418

Query: 380 V 380
           V
Sbjct: 419 V 419


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 146/354 (41%), Gaps = 48/354 (13%)

Query: 29  LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           L   +G  + F V G   P   G Y   + +G P R +++ +DTGSD+ W+ C A C  C
Sbjct: 57  LLQSLGGVIDFPVDGTFDPFVVGLYYTKIRLGSPPRDFYVQVDTGSDVLWVSC-ASCNGC 115

Query: 87  -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
                ++     + P + +    V C D  C+         C      C Y  +Y DG  
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175

Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  V D   F+   G  L P     +  GC  +Q      S   +DGI G G+   S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235

Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           +SQL SQ L   V  HCL G  GGGG L  G+ +     +V+T +      +Y+  +  +
Sbjct: 236 ISQLASQGLAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292

Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
              G+   +   P VF          D+G++  YL+   Y             +++  A 
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341

Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
                P+  KG + +     V   F  ++L+F  G   ++F L P+ YLI  N 
Sbjct: 342 SQSVRPVVSKGNQCYVIATSVADIFPPVSLNFAGGA--SMF-LNPQDYLIQQNN 392


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 76/267 (28%), Positives = 116/267 (43%), Gaps = 41/267 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP------------LYR 95
            G Y   + IG P++ Y++ +DTGSD+ W+ C    ++C E P                 
Sbjct: 84  VGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNC----IQCRECPRTSSLGMELTPYDLEES 139

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            +  LV C++  C  ++      C     C Y   Y DG S+ G  VKD   +N  +G  
Sbjct: 140 TTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDL 199

Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                N  +  GCG  Q   +  +    LDGILG GK  SSI+SQL S + ++ +  HCL
Sbjct: 200 ETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL 259

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGV----------AELFFGGETTG 256
            G  GG +F    +    +V  T +  +   Y     GV          A++F  G+  G
Sbjct: 260 DGTNGGGIFAMGHVV-QPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG 318

Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTS 283
                 + DSG++  YL  + Y+ L +
Sbjct: 319 -----TIIDSGTTLAYLPELIYEPLVA 340


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 76/159 (47%), Gaps = 9/159 (5%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y V + IG P  P    LDTGSDL W QCDAPC RC   P PLY P+       V C
Sbjct: 89  TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C +L +P          C Y   Y DG S+ GVL  + F        R    +A G
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVR---GVAFG 205

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           CG   +   S     G++G+G+G  S+VSQL   +  R+
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVTRPRRS 242


>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 154

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/146 (41%), Positives = 81/146 (55%), Gaps = 5/146 (3%)

Query: 155 RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGG 211
           R   ++A GCGY Q   A     P+DGILGLG GK+   +QL   K+I+ NV+GHCLS  
Sbjct: 4   RDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSK 63

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSY 270
           G G L+ GD    +  V W  M      YYSPG+AE+F   +   G      VFDSGS+Y
Sbjct: 64  GKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 122

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKE 296
           T++    Y  + S ++  LS  SL+E
Sbjct: 123 THVPAQIYNEIVSKVRVTLSESSLEE 148


>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
          Length = 154

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 63/148 (42%), Positives = 80/148 (54%), Gaps = 5/148 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A   P  +DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDS S+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSDST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEA 297
           YT++    Y  + S ++  LS  SL+E 
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEEV 149


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 158/361 (43%), Gaps = 50/361 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--S 97
           TG Y   + IG P++ Y++ +DTGSD+ W+ C    +RC   P           Y P  S
Sbjct: 82  TGLYYTQIEIGSPSKGYYVQVDTGSDILWVNC----IRCDGCPTTSGLGIELTQYDPAGS 137

Query: 98  NDLVPCEDPICASLHAPGHHNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNYT--N 152
              V C+   C + ++P       P+    C + + Y DG S+ G  V D+  +N    N
Sbjct: 138 GTTVGCDQEFCVA-NSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGN 196

Query: 153 GQRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
           GQ    N  +  GCG       G+S   LDGILG G+  SS++SQL + + +R +  HCL
Sbjct: 197 GQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256

Query: 209 -SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KN 259
            +  GGG    G+ +    +V  T +  + T +Y+  +  +  GG T  L         +
Sbjct: 257 DTVHGGGIFAIGNVV--QPKVKTTPLVQNVT-HYNVNLQGISVGGATLQLPSSTFDSGDS 313

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              + DSG++  YL R  Y+TL + +  +    +L            ++    F+    +
Sbjct: 314 KGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHN----------YQDFVCFQFSGSI 363

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
              F  +  SF    T  ++   P  YL  +     C+G L+G  V  +D   +  +GD 
Sbjct: 364 DDGFPVVTFSFEGEITLNVY---PHDYLFQNENDLYCMGFLDGG-VQTKDGKDMVLLGDL 419

Query: 380 V 380
           V
Sbjct: 420 V 420


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 85/272 (31%), Positives = 128/272 (47%), Gaps = 31/272 (11%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
           V G+   +G Y V  ++G P + + L +D+GSDL W+QC APC++C     PLY PSN  
Sbjct: 55  VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-APCLQCYAQDTPLYAPSNSS 113

Query: 99  --DLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
             + VPC  P C  + A     C+   P  C YE  YAD   S GV    A+     +  
Sbjct: 114 TFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVF---AYESATVDDV 170

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL---HSQKLIRNVVGHCLSGG 211
           R++ ++A GCG +     S+    G+LGLG+G  S  SQ+   +  K    +V +     
Sbjct: 171 RID-KVAFGCGRDN--QGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTS 227

Query: 212 GGGFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG----------L 257
              +L FGD+L    +D       S S + T YY   + ++  GGE+            L
Sbjct: 228 VSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYV-QIEKVMVGGESLPISHSAWSLDFL 286

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKEL 289
            N   +FDSG++ TY     Y+ + +   K +
Sbjct: 287 GNGGSIFDSGTTVTYWLPPAYRNILAAFDKNV 318


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 146/348 (41%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P+     
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTY 230

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C  L   G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 231 ANVSCAAPACFDLDTRG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 283

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 284 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 339

Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            F  G      +R+    ++ +   +Y  G+  +  GG+   L ++P         + DS
Sbjct: 340 DFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 396

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L S     ++A+  K+AP    L  C+     F  +  V     T+
Sbjct: 397 GTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 450

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G    + ++     +  ++   VCLG     + G  D+ ++G
Sbjct: 451 SLLFQGG---AILDVDASGIMYAASVSQVCLGFAANEDGG--DVGIVG 493


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 160/372 (43%), Gaps = 43/372 (11%)

Query: 35  SSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
           S +  ++ GN +P  TG Y   + IG P   + + +DTGSD+ W+ C   C  C     +
Sbjct: 55  SVIDLELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDI 113

Query: 88  EAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
                LY P    ++ L+ C+ P C++ +      C+    C Y++ Y DG ++ G  V 
Sbjct: 114 GVDLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVN 173

Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
           D        G       N  +  GCG  Q    G+S   LDGILG G+  SS++SQL + 
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233

Query: 198 KLIRNVVGHCL-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA------ELFF 250
             ++ +  HCL S  GGG    G+ +    +      +  +      GV       +L  
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPL 293

Query: 251 GGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPLCWKG 309
           G   T  K   ++ DSG++  YL    Y  L   M+K L A+  LK    D+    C+  
Sbjct: 294 GLFETSYKRGAII-DSGTTLAYLPDSIYLPL---MEKILGAQPDLKLRTVDDQFT-CFVF 348

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGLQ 368
               KNV D    F T+   F +    T++   P  YL        C+G  N GA+   +
Sbjct: 349 D---KNVDD---GFPTVTFKFEESLILTIY---PHEYLFQIRDDVWCVGWQNSGAQS--K 397

Query: 369 DLNVIGGIGDFV 380
           D N +  +GD V
Sbjct: 398 DGNEVTLLGDLV 409


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 154/371 (41%), Gaps = 63/371 (16%)

Query: 44  NVYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----S 97
           +V P+G   Y V + IG P +P    LDTGSDL W QC APC  C+  P PL+ P    S
Sbjct: 93  SVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPGESAS 151

Query: 98  NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL- 156
            + + C   +C+ +    HH CE P  C Y   Y DG  ++GV   + F F  + G RL 
Sbjct: 152 YEPMRCAGQLCSDIL---HHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLM 208

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-- 214
              L  GCG   V   S +   GI+G G+   S+VSQL  ++       +CL+  G G  
Sbjct: 209 TVPLGFGCGSMNV--GSLNNGSGIVGFGRNPLSLVSQLSIRRF-----SYCLTSYGSGRK 261

Query: 215 -FLFFGD---DLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
             L FG     +Y D++  V T  +       +P    +   G T G + L         
Sbjct: 262 STLLFGSLSGGVYGDATGPVQT--TPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFAL 319

Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA-PEDET---LPLCWKGR 310
                  V+ DSG++ T L       +    +++L         PED     +P  W+  
Sbjct: 320 RPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRS 379

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN-KGNVCLGILNGAEVG--- 366
                V   +  F      F D       +L    Y++  + KG +CL + +  + G   
Sbjct: 380 SSTSQVPVPRMVFH-----FQDAD----LDLPRRNYVLDDHRKGRLCLLLADSGDDGSTI 430

Query: 367 ----LQDLNVI 373
                QD+ V+
Sbjct: 431 GNLVQQDMRVL 441


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 132/298 (44%), Gaps = 35/298 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G PAR Y + +DTGS L+WLQC    V C     PL+ PS       + C
Sbjct: 10  SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 69

Query: 104 EDPICASLHAPGHHN--CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
               C+SL     +N  CE  +  C Y   Y D   S+G L +D         Q L P  
Sbjct: 70  TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL--APSQTL-PGF 126

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFG 219
             GCG  Q     +    GILGLG+ K S++ Q+ S+        +CL + GGGGFL  G
Sbjct: 127 VYGCG--QDSEGLFGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLSIG 182

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF--------GGETTGLK----NLPVVFDSG 267
                 S   +T M++D      PG   L+F        GG   G+      +P + DSG
Sbjct: 183 KASLAGSAYKFTPMTTD------PGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSG 236

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFR 324
           +  T L    Y        K +S+K    AP    L  C+KG  +  ++V +V+  F+
Sbjct: 237 TVITRLPMSVYTPFQQAFVKIMSSK-YARAPGFSILDTCFKGNLKDMQSVPEVRLIFQ 293


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 148/363 (40%), Gaps = 67/363 (18%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAPHPLYRPSN 98
           V G+   +G Y V + +G PA+ + L +DTGSDLTW+QC+ P         P P Y  S+
Sbjct: 17  VSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 76

Query: 99  D----LVPCEDPICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFN--Y 150
                 +PC D  C  L AP   +C  + P+ CDY   Y+D   + G+L  +  +     
Sbjct: 77  SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136

Query: 151 TNGQRLN---------PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
            +G+R             +ALGC    V GAS+    G+LGLG+G  S+ +Q     L  
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESV-GASFLGASGVLGLGQGPISLATQTRHTAL-G 194

Query: 202 NVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG 256
            +  +CL           FL  G       R  W  ++  +T       A+ F+    TG
Sbjct: 195 GIFSYCLVDYLRGSNASSFLVMG-------RTRWRKLA--HTPIVRNPAAQSFYYVNVTG 245

Query: 257 LK--------------------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           +                     N   +FDSG++ +YL    Y  +   +   +     +E
Sbjct: 246 VAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQE 305

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
            PE     LC+       NV  ++K    L + F  G    + EL    Y+++  +   C
Sbjct: 306 IPEG--FELCY-------NVTRMEKGMPKLGVEFQGG---AVMELPWNNYMVLVAENVQC 353

Query: 357 LGI 359
           + +
Sbjct: 354 VAL 356


>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
          Length = 148

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 63/147 (42%), Positives = 80/147 (54%), Gaps = 5/147 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYS G+AEL    +   G      VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSAGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
           YT++    Y  + S ++  LS  SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEE 148


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 119/273 (43%), Gaps = 32/273 (11%)

Query: 34  GSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP- 90
           G  L F V G  + Y  G Y   + +G PA+ +++ +DTGSD+ WL C+  C  C ++  
Sbjct: 52  GGILDFSVQGTSDPYLVGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNT-CNNCPKSSG 110

Query: 91  --------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVL 141
                         +  LV C DP+C+         C   A QC Y  +Y DG  + G  
Sbjct: 111 LGIDLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYY 170

Query: 142 VKDAFAFNYTNGQRL----NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLH 195
           V DA  F+   GQ +    +  +  GC   Q      +   +DGI G G G  S+VSQ+ 
Sbjct: 171 VYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVS 230

Query: 196 SQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
           SQ +   V  HCL   G GGG L  G+ L     +V+T +      +Y+  +  +   G+
Sbjct: 231 SQGMAPKVFSHCLKGQGSGGGILVLGEIL--EPNIVYTPLVP-LQPHYNLNLQSIAVNGQ 287

Query: 254 TTGL--------KNLPVVFDSGSSYTYLNRVTY 278
              +         N   + DSG++  YL +  Y
Sbjct: 288 ILPIDQDVFATGNNRGTIVDSGTTLAYLVQEAY 320


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 81/276 (29%), Positives = 129/276 (46%), Gaps = 43/276 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y + + +G P + +   +DTGSDL W+QC APC RC E P PL+ P    S     C
Sbjct: 5   SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQC-APCARCFEQPDPLFIPLASSSYSNASC 63

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            D +C +L  P    C     C Y   Y DG ++ G     AF     NG  L  R+  G
Sbjct: 64  TDSLCDALPRP---TCSMRNTCTYSYSYGDGSNTRGDF---AFETVTLNGSTL-ARIGFG 116

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
           CG+NQ    ++   DG++GLG+G  S+ SQL+S     ++  +CL    + G    + FG
Sbjct: 117 CGHNQ--EGTFAGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPITFG 172

Query: 220 DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP---------------V 262
            +  ++SR  +T +  + D   YY  GV  +     + G + +P               V
Sbjct: 173 -NAAENSRASFTPLLQNEDNPSYYYVGVESI-----SVGNRRVPTPPSAFRIDANGVGGV 226

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
           + DSG++ TY     +  + + +++++S       P
Sbjct: 227 ILDSGTTITYWRLAAFIPILAELRRQISYPEADPTP 262


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 63/189 (33%), Positives = 89/189 (47%), Gaps = 23/189 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLY----R 95
            G Y   + IG P + Y+L +DTGSD+ W+ C    ++C E P          LY     
Sbjct: 80  VGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC----IQCKECPTRSSLGMDLTLYDIKES 135

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            S  LVPC+   C  ++      C     C Y   Y DG S+ G  VKD   ++  +G  
Sbjct: 136 SSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDL 195

Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                N  +  GCG  Q   +  ++   LDGILG GK  SS++SQL S   ++ +  HCL
Sbjct: 196 KTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL 255

Query: 209 SGGGGGFLF 217
           +G  GG +F
Sbjct: 256 NGVNGGGIF 264


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 142/344 (41%), Gaps = 35/344 (10%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P+     
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L   G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 231 ANVSCAAPACSDLDTRG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 283

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 284 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 339

Query: 217 FFGDDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGSSY 270
            FG     ++R+  T M  D    +Y  G+  +  GG       +       + DSG+  
Sbjct: 340 DFGAG-SPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVI 398

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y +L S     +SA+  K+AP    L  C+     F  +  V     T++L F
Sbjct: 399 TRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYD----FAGMSQVA--IPTVSLLF 452

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
             G      ++     +  ++   VCL      + G  D+ ++G
Sbjct: 453 QGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 491


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 90/357 (25%), Positives = 144/357 (40%), Gaps = 59/357 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP---------HPLYRPSN 98
           TG Y   + +G P + Y++ +DTGSD+ W+ C   C +C              P    S 
Sbjct: 84  TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNC-ISCSKCPRKSGLGLDLTFYDPKASSSG 142

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
             V C+   CA+ +      C     C+Y + Y DG S+ G  + DA  F+   G    Q
Sbjct: 143 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQ 202

Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--G 210
             N  +  GCG  Q    G S   LDGILG G+  +S++SQL +    + +  HCL    
Sbjct: 203 PGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIK 262

Query: 211 GGGGF-------------LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
           GGG F              FF   L +    +   M      +Y+  +  +  GG T  L
Sbjct: 263 GGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLV-MILLSRPHYNVNLKSIDVGGTTLQL 321

Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLC 306
                   +    + DSG++ TYL  + ++ +  ++    ++++  +L++        LC
Sbjct: 322 PAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF-------LC 374

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
                 F+    V   F T+   F D     ++   P  Y   +     C+G  NGA
Sbjct: 375 ------FQYSGSVDDGFPTITFHFEDDLALHVY---PHEYFFPNGNDIYCVGFQNGA 422


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 142/357 (39%), Gaps = 57/357 (15%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
           F V G+  P   G Y   + +G P + YF+ +DTGSD+ W+ C +PC  C         +
Sbjct: 77  FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135

Query: 88  EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDA 145
           E  +P    ++  +PC D  C +        C+  D + C Y   Y DG  + G  V D 
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195

Query: 146 FAFNYTNGQRLNPR----LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKL 199
             F+   G          +  GC  +Q    +     +DGI G G+ + S+VSQL+S  +
Sbjct: 196 MYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255

Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGV 245
              V  HCL G   GGG L  G+ +     +V+T +      Y              P  
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPID 313

Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           + LF    T G      + DSG++  YL    Y    + +   +S              L
Sbjct: 314 SSLFTTSNTQG-----TIVDSGTTLAYLADGAYDPFVNAITAAVSP---------SVRSL 359

Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI----ISNKGNVCLG 358
             KG + F     V   F T++L F  G   T   + PE YL+    I N    C+G
Sbjct: 360 VSKGNQCFVTSSSVDSSFPTVSLYFMGGVAMT---VKPENYLLQQASIDNNVLWCIG 413


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 46/365 (12%)

Query: 13  PTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGS 72
           P +    S+SSS+ S          L    G    TG Y VT+ +G PA  Y +  DTGS
Sbjct: 135 PGIHPGHSASSSTPS----------LPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGS 184

Query: 73  DLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYE 128
           D TW+QC    V+C +   PL+ P+       V C D  CA L   G   C     C Y 
Sbjct: 185 DTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADLDTNG---CTG-GHCLYA 240

Query: 129 LEYADGGSSLGVLVKDAF--AFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
           ++Y DG  ++G   +D    A +   G R       GCG        +    G++GLG+G
Sbjct: 241 VQYGDGSYTVGFFAQDTLTIAHDAIKGFR------FGCGEKN--NGLFGKTAGLMGLGRG 292

Query: 187 KSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
           K+S+  Q +++        +CL     G G+L FG     ++  +   ++     +Y  G
Sbjct: 293 KTSLTVQAYNK--YGGAFAYCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVG 350

Query: 245 VAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
           +  +  GG+   +          + DSG+  T L    Y  L+S   K + A+  K+AP 
Sbjct: 351 MTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPG 410

Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
              L  C+     F  + DV+    T++L F  G      ++     +   ++  VCL  
Sbjct: 411 YSILDTCYD----FTGLSDVE--LPTVSLVFQGG---ACLDVDVSGIVYAISEAQVCLAF 461

Query: 360 LNGAE 364
            +  +
Sbjct: 462 ASNGD 466


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 144/350 (41%), Gaps = 41/350 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAP----HPLYRPSNDLVP 102
           Y   + +G P R +++ +DTGSD+ W+ C +    P    +  P     P   P+  L+ 
Sbjct: 90  YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149

Query: 103 CEDPICA-SLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----N 157
           C D  C+  L +          QC Y  +Y DG  + G  V D   F+   G  +    +
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209

Query: 158 PRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGG 213
             +  GC   Q    +     +DGI G G+   S++SQL SQ +   V  HCL G   GG
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGG 269

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFD 265
           G L  G+ +     +V+T +      +Y+  +  ++  G+T  +         N   + D
Sbjct: 270 GILVLGEIV--EPNIVYTPLVPS-QPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIID 326

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++  YL    Y    S +   +S             P   KG + +     +   F  
Sbjct: 327 SGTTLAYLTEAAYDPFISAITSTVSP---------SVSPYLSKGNQCYLTSSSINDVFPQ 377

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGN-VCLGILNGAEVGLQDLNVIG 374
           ++L+F  G +  L    P+ YLI  +  N   L  +   ++  Q++ ++G
Sbjct: 378 VSLNFAGGTSMILI---PQDYLIQQSSINGAALWCVGFQKIQGQEITILG 424


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 146/363 (40%), Gaps = 67/363 (18%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAPHPLYRPSN 98
           V G+   +G Y V + +G PA+ + L +DTGSDLTW+QC+ P         P P Y  S+
Sbjct: 49  VSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 108

Query: 99  D----LVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAF---- 148
                 +PC D  C  L AP   +C    P+ CDY   Y+D   + G+L  +  +     
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168

Query: 149 -------NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
                  N+   +     +ALGC    V GAS+    G+LGLG+G  S+ +Q     L  
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESV-GASFLGASGVLGLGQGPISLATQTRHTAL-G 226

Query: 202 NVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG 256
            +  +CL           FL  G       R  W  ++  +T       A+ F+    TG
Sbjct: 227 GIFSYCLVDYLRGSNASSFLVMG-------RTHWRKLA--HTPIVRNPAAQSFYYVNVTG 277

Query: 257 LK--------------------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           +                     N   +FDSG++ +YL    Y  +   +   +     +E
Sbjct: 278 VAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQE 337

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
            PE     LC+       NV  ++K    L + F  G    + EL    Y+++  +   C
Sbjct: 338 IPEG--FELCY-------NVTRMEKGMPKLGVEFQGG---AVMELPWNNYMVLVAENVQC 385

Query: 357 LGI 359
           + +
Sbjct: 386 VAL 388


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 142/357 (39%), Gaps = 57/357 (15%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
           F V G+  P   G Y   + +G P + YF+ +DTGSD+ W+ C +PC  C         +
Sbjct: 77  FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135

Query: 88  EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDA 145
           E  +P    ++  +PC D  C +        C+  D + C Y   Y DG  + G  V D 
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195

Query: 146 FAFNYTNGQRLNPR----LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKL 199
             F+   G          +  GC  +Q    +     +DGI G G+ + S+VSQL+S  +
Sbjct: 196 MYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255

Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGV 245
              V  HCL G   GGG L  G+ +     +V+T +      Y              P  
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPID 313

Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           + LF    T G      + DSG++  YL    Y    + +   +S              L
Sbjct: 314 SSLFTTSNTQG-----TIVDSGTTLAYLADGAYDPFVNAITAAVSP---------SVRSL 359

Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI----ISNKGNVCLG 358
             KG + F     V   F T++L F  G   T   + PE YL+    I N    C+G
Sbjct: 360 VSKGNQCFVTSSSVDSSFPTVSLYFMGGVAMT---VKPENYLLQQASIDNNVLWCIG 413


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 95/347 (27%), Positives = 145/347 (41%), Gaps = 37/347 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y +  YIG P       +DTGS L WLQC +PC  C     PL+ P    +     C+
Sbjct: 87  GEYLMRFYIGSPPVERLAMVDTGSSLIWLQC-SPCHNCFPQETPLFEPLKSSTYKYATCD 145

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN--PRLAL 162
              C  L  P   +C    QC Y + Y D   S+G+L  +  +F  T G +    P    
Sbjct: 146 SQPCTLLQ-PSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIF 204

Query: 163 GCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLF 217
           GCG  N     + + + GI GLG G  S+VSQL +Q  I +   +CL    S       F
Sbjct: 205 GCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKLKF 262

Query: 218 FGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE--TTGLKNLPVVFDSGSSYTYL 273
             + +  ++ VV T +        YY   +  +  G +  +TG  +  +V DSG+  TYL
Sbjct: 263 GSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYL 322

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y    + +++ L  K L++ P    L  C+  R               +A  FT  
Sbjct: 323 ENTFYNNFVASLQETLGVKLLQDLPSP--LKTCFPNR--------ANLAIPDIAFQFTGA 372

Query: 334 KTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGIGDF 379
                  L P+  LI     N+ CL ++  + +G   +++ G I  +
Sbjct: 373 SV----ALRPKNVLIPLTDSNILCLAVVPSSGIG---ISLFGSIAQY 412


>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
          Length = 154

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/146 (41%), Positives = 80/146 (54%), Gaps = 5/146 (3%)

Query: 155 RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGG 211
           R   ++A GCGY Q   A     P+DGILGLG GK+   +QL   K+I+ NV+GHCLS  
Sbjct: 4   RDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSK 63

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSY 270
           G G L+ GD    +  V W  M      YYSPG+AE+F   +   G      VFDSGS+Y
Sbjct: 64  GKGVLYVGDFNPPTRGVTWVPMRESLF-YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 122

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKE 296
           T++    Y  + S ++  LS  S +E
Sbjct: 123 THVPAQIYSEIVSKVRGTLSESSFEE 148


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 150/355 (42%), Gaps = 42/355 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RPSND 99
           G Y   + +G P + Y++ +DTGSD+ W+ C APC +C     +  P  LY      ++ 
Sbjct: 72  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSK 130

Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNP 158
            V CED  C+ +       C     C Y + Y DG +S G  +KD        G  R  P
Sbjct: 131 NVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAP 188

Query: 159 ---RLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
               +  GCG NQ    G +   +DGI+G G+  +SI+SQL +    + +  HCL    G
Sbjct: 189 LAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 248

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFD 265
           G +F   ++   S VV T+       +Y+  +  +   G+   L         +   + D
Sbjct: 249 GGIFAVGEV--ESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIID 306

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++  YL +  Y +L   ++K  + + +K     ET          F    +  K F  
Sbjct: 307 SGTTLAYLPQNLYNSL---IEKITAKQQVKLHMVQETFAC-------FSFTSNTDKAFPV 356

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
           + L F D    +++   P  YL    +   C G  +G        +VI  +GD V
Sbjct: 357 VNLHFEDSLKLSVY---PHDYLFSLREDMYCFGWQSGGMTTQDGADVI-LLGDLV 407


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/354 (25%), Positives = 146/354 (41%), Gaps = 48/354 (13%)

Query: 29  LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           L   +G  + F V G   P   G Y   + +G P R +++ +DTGSD+ W+ C A C  C
Sbjct: 57  LLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGC 115

Query: 87  -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
                ++     + P + +    + C D  C+         C      C Y  +Y DG  
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175

Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  V D   F+   G  L P     +  GC  +Q      S   +DGI G G+   S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235

Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           +SQL SQ +   V  HCL G  GGGG L  G+ +     +V+T +      +Y+  +  +
Sbjct: 236 ISQLASQGIAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292

Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
              G+   +   P VF          D+G++  YL+   Y             +++  A 
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341

Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
                P+  KG + +     V   F  ++L+F  G   ++F L P+ YLI  N 
Sbjct: 342 SQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGA--SMF-LNPQDYLIQQNN 392


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 150/355 (42%), Gaps = 42/355 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RPSND 99
           G Y   + +G P + Y++ +DTGSD+ W+ C APC +C     +  P  LY      ++ 
Sbjct: 76  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSK 134

Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNP 158
            V CED  C+ +       C     C Y + Y DG +S G  +KD        G  R  P
Sbjct: 135 NVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAP 192

Query: 159 ---RLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
               +  GCG NQ    G +   +DGI+G G+  +SI+SQL +    + +  HCL    G
Sbjct: 193 LAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 252

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFD 265
           G +F   ++   S VV T+       +Y+  +  +   G+   L         +   + D
Sbjct: 253 GGIFAVGEV--ESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIID 310

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++  YL +  Y    S+++K  + + +K     ET          F    +  K F  
Sbjct: 311 SGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-------FSFTSNTDKAFPV 360

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
           + L F D    +++   P  YL    +   C G  +G        +VI  +GD V
Sbjct: 361 VNLHFEDSLKLSVY---PHDYLFSLREDMYCFGWQSGGMTTQDGADVI-LLGDLV 411


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 89/190 (46%), Gaps = 25/190 (13%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
           G Y   + IG P++ Y+L +DTG+D+ W+ C    ++C E P          LY      
Sbjct: 71  GLYYAKIGIGTPSKDYYLQVDTGTDMMWVNC----IQCKECPTRSNLGMDLTLYNIKESS 126

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
           S  LVPC+  +C  ++      C       C Y   Y DG S+ G  VKD   F+  +G 
Sbjct: 127 SGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGD 186

Query: 155 ----RLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
                 N  +  GCG  Q    SY     LDGILG GK   S++SQL S   ++ +  HC
Sbjct: 187 LKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHC 246

Query: 208 LSGGGGGFLF 217
           L+G  GG +F
Sbjct: 247 LNGVNGGGIF 256


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 161/373 (43%), Gaps = 45/373 (12%)

Query: 35  SSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
           S +  ++ GN +P  TG Y   + IG P   + + +DTGSD+ W+ C   C  C     +
Sbjct: 55  SVIDLELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDI 113

Query: 88  EAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
                LY P    ++ L+ C+ P C++ +      C+    C Y++ Y DG ++ G  V 
Sbjct: 114 GVDLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVN 173

Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
           D        G       N  +  GCG  Q    G+S   LDGILG G+  SS++SQL + 
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233

Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELF 249
             ++ +  HCL    GG +F   ++ +  ++  T +  +   Y              +L 
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVE-PKLXNTPVVPNQAHYNVVLNGVKVGDTALDLP 292

Query: 250 FGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPLCWK 308
            G   T  K   ++ DSG++  YL    Y  L   M+K L A+  LK    D+    C+ 
Sbjct: 293 LGLFETSYKRGAII-DSGTTLAYLPESIYLPL---MEKILGAQPDLKLRTVDDQFT-CFV 347

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL 367
                KNV D    F T+   F +    T++   P  YL        C+G  N GA+   
Sbjct: 348 FD---KNVDD---GFPTVTFKFEESLILTIY---PHEYLFQIRDDVWCVGWQNSGAQS-- 396

Query: 368 QDLNVIGGIGDFV 380
           +D N +  +GD V
Sbjct: 397 KDGNEVTLLGDLV 409


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 159/361 (44%), Gaps = 48/361 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTW---LQCDA-PCVRCVEAPHPLYRP--SNDLV 101
           TG Y   + IG P + Y++ +DTGSD+ W   + CD  P    +      Y P  S   V
Sbjct: 82  TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGTTV 141

Query: 102 PCEDPICASLHA-----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQ 154
            CE   C +  A     P   +   P  C + + Y DG S+ G  V D   +N    NGQ
Sbjct: 142 GCEQEFCVANSAASGVPPACPSAASP--CQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQ 199

Query: 155 RL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
               N  +  GCG       G+S   LDGILG G+  +S++SQL + + +R +  HCL  
Sbjct: 200 TTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDT 259

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPV 262
             GG +F   ++     V  T +  + T +Y+  +  +  GG T  L         +   
Sbjct: 260 VRGGGIFAIGNVVQPPIVKTTPLVPNAT-HYNVNLQGISVGGATLQLPTSTFDSGDSKGT 318

Query: 263 VFDSGSSYTYLNRVTYQT-LTSIMKK--ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
           + DSG++  YL R  Y+T LT++  K  +L+ ++ ++        +C      F+    +
Sbjct: 319 IIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYEDF-------IC------FQFSGSL 365

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
            + F  +  SF    T  ++   P  YL  +     C+G L+G  V  +D   +  +GD 
Sbjct: 366 DEEFPVITFSFEGDLTLNVY---PHDYLFQNGNDLYCMGFLDGG-VQTKDGKDMVLLGDL 421

Query: 380 V 380
           V
Sbjct: 422 V 422


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 46/365 (12%)

Query: 13  PTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGS 72
           P +    S+SSS+ S          L    G    TG Y VT+ +G PA  Y +  DTGS
Sbjct: 135 PGIHPGHSASSSTPS----------LPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGS 184

Query: 73  DLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYE 128
           D TW+QC    V+C +   PL+ P+       V C D  CA L   G   C     C Y 
Sbjct: 185 DTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADLDTNG---CTG-GHCLYA 240

Query: 129 LEYADGGSSLGVLVKDAF--AFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
           ++Y DG  ++G   +D    A +   G R       GCG        +    G++GLG+G
Sbjct: 241 VQYGDGSYTVGFFAQDTLTIAHDAIKGFR------FGCGEKN--NGLFGKTAGLMGLGRG 292

Query: 187 KSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
           K+S+  Q +++        +CL     G G+L FG     ++  +   ++     +Y  G
Sbjct: 293 KTSLTVQAYNK--YGGAFAYCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVG 350

Query: 245 VAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
           +  +  GG+   +          + DSG+  T L    Y  L+S   K + A+  K+AP 
Sbjct: 351 MTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPG 410

Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
              L  C+     F  + DV+    T++L F  G      ++     +   ++  VCL  
Sbjct: 411 YSILDTCYD----FTGLSDVE--LPTVSLVFQGG---ACLDVDVSGIVYAISEAQVCLAF 461

Query: 360 LNGAE 364
            +  +
Sbjct: 462 ASNGD 466


>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
          Length = 147

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 78/142 (54%), Gaps = 5/142 (3%)

Query: 159 RLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGF 215
           ++A GCGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G 
Sbjct: 1   KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 60

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLN 274
           L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+YT++ 
Sbjct: 61  LYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVP 119

Query: 275 RVTYQTLTSIMKKELSAKSLKE 296
              Y  + S +   LS  SL+E
Sbjct: 120 AQIYNEIVSKVIGTLSESSLEE 141


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 146/351 (41%), Gaps = 57/351 (16%)

Query: 31  NHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-- 86
           + VG  + F V G+  P   G Y   + +G P   + + +DTGSD+ W+ C + C  C  
Sbjct: 78  SSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPH 136

Query: 87  ----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
                      +AP      S   V C DPIC+S+       C +  QC Y   Y DG  
Sbjct: 137 SSGLGIDLHFFDAPGSFTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSG 193

Query: 137 SLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  + D F F+   G+ L    +  +  GC   Q      S   +DGI G GKGK S+
Sbjct: 194 TSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSV 253

Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY----YSPG 244
           VSQL S+ +   V  HCL   G GGG    G+ L     +V++ +      Y     S G
Sbjct: 254 VSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLLPSQPHYNLNLLSIG 311

Query: 245 V--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           V        A +F    T G      + D+G++ TYL +  Y    + +   +S      
Sbjct: 312 VNGQILPIDAAVFEASNTRG-----TIVDTGTTLTYLVKEAYDPFLNAISNSVS------ 360

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
             +  TL +   G + +     +   F  ++L+F  G +     L P+ YL
Sbjct: 361 --QLVTL-IISNGEQCYLVSTSISDMFPPVSLNFAGGASMM---LRPQDYL 405


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/331 (29%), Positives = 142/331 (42%), Gaps = 42/331 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSND----LVPC 103
           G Y   + +G P   Y + +D+GS LTWLQC APC V C     PLY P        VPC
Sbjct: 106 GNYITRLGLGTPTTTYVMVVDSGSSLTWLQC-APCAVSCHPQAGPLYDPRASSTYAAVPC 164

Query: 104 EDPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
             P CA L A      +C     C Y+  Y DG  S G L KD  + + +      P   
Sbjct: 165 SAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS---FPGFY 221

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
            GCG + V    +    G++GL + K S++SQL     + N   +CL   +    G+L F
Sbjct: 222 YGCGQDNV--GLFGRAAGLIGLARNKLSLLSQLAPS--VGNSFAYCLPTSAAASAGYLSF 277

Query: 219 G--DDLYDSSRVVWTSMSS---DYTKYYSPGVAELFFGGETTGLK-----NLPVVFDSGS 268
           G   D  +  +  +TSM S   D + Y+   +A +   G    +      +LP + DSG 
Sbjct: 278 GSNSDNKNPGKYSYTSMVSSSLDASLYFV-SLAGMSVAGSPLAVPSSEYGSLPTIIDSG- 335

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             T + R+     T++ K   +A +   AP    L  C+KG+     V  V        +
Sbjct: 336 --TVITRLPTPVYTALSKAVGAALAAPSAPAYSILQTCFKGQVAKLPVPAVN-------M 386

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
           +F  G T     LTP   L+  N+   CL  
Sbjct: 387 AFAGGAT---LRLTPGNVLVDVNETTTCLAF 414


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 95/193 (49%), Gaps = 19/193 (9%)

Query: 43  GNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYR 95
           GN  P  TG Y   + +G PA+ +++ +DTGSD+ W+ C A C  C +         LY 
Sbjct: 62  GNGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNC-AGCTACPKKSGLGMDLTLYD 120

Query: 96  P----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           P    +++ VPC D  C   ++     C+    C Y + Y DG ++ G  V D+  F+  
Sbjct: 121 PNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEV 180

Query: 152 NG----QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVV 204
           +G    +  N  +  GCG  Q   +   S   LDGI+G G+  SS++SQL +   ++ + 
Sbjct: 181 SGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIF 240

Query: 205 GHCLSGGGGGFLF 217
            HCL    GG +F
Sbjct: 241 SHCLDSHHGGGIF 253


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/354 (25%), Positives = 146/354 (41%), Gaps = 48/354 (13%)

Query: 29  LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           L   +G  + F V G   P   G Y   + +G P R +++ +DTGSD+ W+ C A C  C
Sbjct: 57  LLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGC 115

Query: 87  -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
                ++     + P + +    + C D  C+         C      C Y  +Y DG  
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175

Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  V D   F+   G  L P     +  GC  +Q      S   +DGI G G+   S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235

Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           +SQL SQ +   V  HCL G  GGGG L  G+ +     +V+T +      +Y+  +  +
Sbjct: 236 ISQLASQGIAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292

Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
              G+   +   P VF          D+G++  YL+   Y             +++  A 
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341

Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
                P+  KG + +     V   F  ++L+F  G   ++F L P+ YLI  N 
Sbjct: 342 SQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGA--SMF-LNPQDYLIQQNN 392


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 155/377 (41%), Gaps = 49/377 (12%)

Query: 29  LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           L   +G  + F V G   P   G Y   + +G P R +++ +DTGSD+ W+ C A C  C
Sbjct: 57  LLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGC 115

Query: 87  -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
                ++     + P + +    + C D  C+         C      C Y  +Y DG  
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175

Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
           + G  V D   F+   G  L P     +  GC  +Q      S   +DGI G G+   S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235

Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           +SQL SQ +   V  HCL G  GGGG L  G+ +     +V+T +      +Y+  +  +
Sbjct: 236 ISQLASQGIAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292

Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
              G+   +   P VF          D+G++  YL+   Y             +++  A 
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341

Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCL 357
                P+  KG + +     V   F  ++L+F  G   ++F L P+ YLI  N  G   +
Sbjct: 342 SQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGA--SMF-LNPQDYLIQQNNVGGTAV 398

Query: 358 GILNGAEVGLQDLNVIG 374
             +    +  Q + ++G
Sbjct: 399 WCIGFQRIQNQGITILG 415


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 156/384 (40%), Gaps = 51/384 (13%)

Query: 9   NLCFPTVRMSSSSSSSSSSSLFNHVG------SSLLFQVHGNVYPTGYYNVTMYIGQPAR 62
           NL FP VR       + ++   +  G      S +   + GN  PT        IG   +
Sbjct: 26  NLVFPVVRKFKGPVENLAAIKAHDAGRRGRFLSVVDVALGGNGRPTSNGLYYTKIGLGPK 85

Query: 63  PYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCEDPICAS 110
            Y++ +DTGSD  W+ C    V C   P          LY P    ++  VPC+D  C S
Sbjct: 86  DYYVQVDTGSDTLWVNC----VGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTS 141

Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGY 166
            +      C     C Y + Y DG ++ G  +KD   F+   G       N  +  GCG 
Sbjct: 142 TYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGS 201

Query: 167 NQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDDL 222
            Q   +   +   LDGI+G G+  SS++SQL +   ++ +  HCL S  GGG    G+ +
Sbjct: 202 KQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGGGIFAIGEVV 261

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSGSSYTYLN 274
                 V T+       +Y+  + ++   G+   L             + DSG++  YL 
Sbjct: 262 QPK---VKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSGTTLAYLP 318

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
              Y  L   ++K L+ +S  +    E    C+     + +   V   F T+  +F +G 
Sbjct: 319 VSIYDQL---LEKILAQRSGMKLYLVEDQFTCFH----YSDEESVDDLFPTVKFTFEEGL 371

Query: 335 TRTLFELTPEAYLIISNKGNVCLG 358
           T T +   P  YL +  +   C+G
Sbjct: 372 TLTTY---PRDYLFLFKEDMWCVG 392


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 150/368 (40%), Gaps = 70/368 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G + + + IG PA  Y   +DTGSDL W QC  PC  C + P P++ P    S   V C
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGC 162

Query: 104 EDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              +C +L      NC ED   C+Y   Y D  S+ G+L  + F F   N       +  
Sbjct: 163 SSGLCNALP---RSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSIS---GIGF 216

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFF 218
           GCG     G  +    G++GLG+G  S++SQL   K       +CL+          LF 
Sbjct: 217 GCGVEN-EGDGFSQGSGLVGLGRGPLSLISQLKETKF-----SYCLTSIEDSEASSSLFI 270

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYS-------PGVAELFFGGETTGLKNLPV--------- 262
           G            S+  + TK  S       P    L   G T G K L V         
Sbjct: 271 GSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAE 330

Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRP 312
                 + DSG++ TYL    ++    ++K+E +++     P D++    L LC+K    
Sbjct: 331 DGTGGMIIDSGTTITYLEETAFK----VLKEEFTSR--MSLPVDDSGSTGLDLCFKLPDA 384

Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL--GILNG----AEV 365
            KN+   K  F                EL  E Y++  S+ G +CL  G  NG      V
Sbjct: 385 AKNIAVPKMIFHFKGAD---------LELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 435

Query: 366 GLQDLNVI 373
             Q+ NV+
Sbjct: 436 QQQNFNVL 443


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/359 (25%), Positives = 154/359 (42%), Gaps = 52/359 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSND----LV 101
           Y   + IG P + Y++ +DTGSD+ W+ C   C RC     +     LY P +      V
Sbjct: 4   YYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----QRLN 157
            C+   CA+ +      C     C+Y + Y DG S+ G  V D   F+  +G    +  N
Sbjct: 63  SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122

Query: 158 PRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGG 214
             +  GCG  Q    G+S   LDGI+G G+  +S++SQL +   ++ +  HCL    GGG
Sbjct: 123 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGG 182

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDS 266
               G+ +    +V  T +  +   +Y+  +  +  GG    L        +    + DS
Sbjct: 183 IFAIGNVV--QPKVKTTPLVPN-MPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 239

Query: 267 GSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           G++ TYL  + Y+ +   +    K+++  +++E        LC      F+ V  V   F
Sbjct: 240 GTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF-------LC------FQYVGRVDDDF 286

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI--GDFV 380
             +   F +     ++   P  Y   +     C+G  NG   GLQ  +  G +  GD V
Sbjct: 287 PKITFHFENDLPLNVY---PHDYFFENGDNLYCVGFQNG---GLQSKDGKGMVLLGDLV 339


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 147/348 (42%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P+     
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 231

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L+    H C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 232 ANVSCAAPACSDLNI---HGCSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 285 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 340

Query: 217 FFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            FG     ++R   T+  ++ +   +Y  G+  +  GG+   L ++P         + DS
Sbjct: 341 DFGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 397

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L       ++A+  K+AP    L  C+     F  +  V     T+
Sbjct: 398 GTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 451

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G      ++     +  ++   VCL      + G  D+ ++G
Sbjct: 452 SLLFQGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 494


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 153/375 (40%), Gaps = 62/375 (16%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH------- 91
           F V G   P    +V MY G     + + +DTGSD+ W+ C+  C  C ++         
Sbjct: 60  FSVQGTSDPN---SVGMY-GXXXXXFNVQIDTGSDILWVNCNT-CSNCPQSSQLGIELNF 114

Query: 92  --PLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAF 148
              +   +  L+PC D IC S        C     QC Y  +Y DG  + G  V DA  F
Sbjct: 115 FDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYF 174

Query: 149 NYTNGQ----RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           N   GQ         +  GC  +Q    +     +DGI G G G  S+VSQL SQ +   
Sbjct: 175 NLIMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPK 234

Query: 203 VVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
           V  HCL   G GGG L  G+ L  S  +V++ +      +Y+  +  +   G+   +   
Sbjct: 235 VFSHCLKGDGNGGGILVLGEILEPS--IVYSPLVPS-QPHYNLNLQSIAVNGQPLPIN-- 289

Query: 261 PVVF-----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
           P VF           D G++  YL +  Y  L + +   +S  + +            KG
Sbjct: 290 PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNS---------KG 340

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE---VG 366
            + +     +   F  ++L+F  G +     L PE YL+ +       G L+GAE   VG
Sbjct: 341 NQCYLVSTSIGDIFPLVSLNFEGGASMV---LKPEQYLMHN-------GYLDGAEMWCVG 390

Query: 367 LQDLNVIGGI-GDFV 380
            Q L     I GD V
Sbjct: 391 FQKLQEGASILGDLV 405


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 75/266 (28%), Positives = 113/266 (42%), Gaps = 41/266 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR----P 96
           G Y   + IG P + Y++ +DTGSD+ W+ C    ++C E P          LY      
Sbjct: 76  GLYYAKIGIGTPTKDYYVQVDTGSDIMWVNC----IQCRECPKTSSLGIDLTLYNINESD 131

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-- 154
           +  LVPC+   C  ++      C     C Y   Y DG S+ G  VKD   +   +G   
Sbjct: 132 TGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLK 191

Query: 155 --RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
               N  +  GCG  Q   +  ++   LDGILG GK  SS++SQL     ++ +  HCL 
Sbjct: 192 TTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLD 251

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA------------ELFFGGETTGL 257
           G  GG +F    +    +V  T +  +   Y     A            ++F  G+  G 
Sbjct: 252 GTNGGGIFVIGHVV-QPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKG- 309

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTS 283
                + DSG++  YL  + Y+ L S
Sbjct: 310 ----AIIDSGTTLAYLPEMVYKPLVS 331


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 92/348 (26%), Positives = 145/348 (41%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C +    L+ P+     
Sbjct: 174 GRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTY 233

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L+  G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 234 ANVSCAAPACSDLYTRG---CSG-GHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 286

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 287 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 342

Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            F  G      +R     ++ +   +Y  G+  +  GG+   L ++P         + DS
Sbjct: 343 DFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFSTAGTIVDS 399

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L S     ++A+  K+AP    L  C+     F  + +V      +
Sbjct: 400 GTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYD----FTGMSEVA--IPKV 453

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G      ++     +  ++   VCLG    A     D+ ++G
Sbjct: 454 SLLFQGG---AYLDVNASGIMYAASLSQVCLGF--AANEDDDDVGIVG 496


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 96/339 (28%), Positives = 142/339 (41%), Gaps = 41/339 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--CE 104
           G Y +  YIG P        DTGSDL W+QC +PC  C     PL++P  S+  +P  C 
Sbjct: 88  GEYLMRFYIGTPPVERLATADTGSDLIWVQC-SPCASCFPQSTPLFQPLKSSTFMPTTCR 146

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLN--PRLA 161
              C +L  P    C    +C Y  +Y D  S S G+L  +   F+   G +    P   
Sbjct: 147 SQPC-TLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSF 205

Query: 162 LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFL 216
            GCG YN +     + L GI+GLG G  S+VSQ+  Q  I +   +CL    S       
Sbjct: 206 FGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLK 263

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------VVFDSGSS 269
           F  + +     VV T M     K + P    L     T   K +P       V+ DSG+ 
Sbjct: 264 FGNESIITGEGVVSTPM---IIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTL 320

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
            TYL    Y    + +++ L+ + +++      LP C+  R  F         F  +A  
Sbjct: 321 LTYLGESFYYNFAASLQESLAVELVQDV--LSPLPFCFPYRDNF--------VFPEIAFQ 370

Query: 330 FTDGKTRTLFELTP-EAYLIISNKGNVCLGILNGAEVGL 367
           FT  +      L P   +++  ++  VCL I   +  G+
Sbjct: 371 FTGARV----SLKPANLFVMTEDRNTVCLMIAPSSVSGI 405


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 120/261 (45%), Gaps = 34/261 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRPSND--- 99
           T  Y   + IG P + Y++ +DTGSD+ W+ C + C RC           LY P +    
Sbjct: 30  TRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCIS-CDRCPRKSGLGLELTLYDPKDSSTG 88

Query: 100 -LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
             V C+   CA+ +      C     C+Y + Y DG S+ G  V D   F+  +G    +
Sbjct: 89  SKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 148

Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-G 211
             N  +  GCG  Q    G+S   LDGI+G G+  +S++SQL +   ++ +  HCL    
Sbjct: 149 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN 208

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------- 261
           GGG    G+ +    +V  T +  +   +Y+  +  +  GG  T LK LP          
Sbjct: 209 GGGIFAIGNVV--QPKVKTTPLVPN-MPHYNVNLKSIDVGG--TALK-LPSHMFDTGEKK 262

Query: 262 -VVFDSGSSYTYLNRVTYQTL 281
             + DSG++ TYL  + Y+ +
Sbjct: 263 GTIIDSGTTLTYLPEIVYKEI 283


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 90/341 (26%), Positives = 140/341 (41%), Gaps = 49/341 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRP----SNDLV 101
           Y   + IG P +P+ + +DTGSD+ W+ C   C +C     +     LY P    S   V
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNC-VSCDKCPTKSGLGIDLALYDPKGSSSGSAV 145

Query: 102 PCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----QR 155
            C++  CA+ +  G     C     C+Y  EY DG S+ G  V D+  +N  +G    + 
Sbjct: 146 SCDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRH 205

Query: 156 LNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
               +  GCG  Q     ++   LDGI+G G+  +S +SQL S   ++ +  HCL    G
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKG 265

Query: 214 GFLFFGDDLYD---SSRVVWTSMSSDYTKYYSPGVA--------ELFFGGETTGLKNLPV 262
           G +F   ++      S  +  +MS       S  VA         +F   E  G      
Sbjct: 266 GGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRG-----T 320

Query: 263 VFDSGSSYTYLNRVTYQ-TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           + DSG++ TYL  + Y+  L ++ +K           +D T     +G   F+    V  
Sbjct: 321 IIDSGTTLTYLPELVYKDILAAVFQKH----------QDITFRTI-QGFLCFEYSESVDD 369

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
            F  +   F D     ++   P  Y   +     CLG  NG
Sbjct: 370 GFPKITFHFEDDLGLNVY---PHDYFFQNGDNLYCLGFQNG 407


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 149/360 (41%), Gaps = 47/360 (13%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS------- 97
           V   G Y   + +G P + Y + +DTGSD+ W+ C  PC +C    +  +R S       
Sbjct: 68  VDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNAS 126

Query: 98  --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
             +  V C+D  C+ +      +C+    C Y + YAD  +S G  ++D        G  
Sbjct: 127 STSKKVGCDDDFCSFISQS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDL 184

Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
               L   +  GCG +Q    G     +DG++G G+  +S++SQL +    + V  HCL 
Sbjct: 185 KTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLD 244

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG---------LKNL 260
              GG + F   + DS +V  T M  +   Y       +  G +  G         ++N 
Sbjct: 245 NVKGGGI-FAVGVVDSPKVKTTPMVPNQMHYNV-----MLMGMDVDGTSLDLPRSIVRNG 298

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
             + DSG++  Y  +V Y +L   +   L+ + +K    +ET        + F    +V 
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETF-------QCFSFSTNVD 348

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
           + F  ++  F D    T++   P  YL    +   C G   G     +   VI  +GD V
Sbjct: 349 EAFPPVSFEFEDSVKLTVY---PHDYLFTLEEELYCFGWQAGGLTTDERSEVI-LLGDLV 404


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 149/365 (40%), Gaps = 40/365 (10%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
           S    +H ++   GYY   ++IG P   + L +DTGS +T++    PC  C    H    
Sbjct: 25  SARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVTYV----PCSSCTHCGHHQAS 80

Query: 96  PSNDLVPCEDPICASLHAPGHHN------------CE-DPAQCDYELEYADGGSSLGVLV 142
            S   + C DP     ++  +              C+ +  QC YE  YA+  +S GVL 
Sbjct: 81  FSTHRLFCRDPRFKPENSSSYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLG 140

Query: 143 KDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
           KD   F      RL  + L+ GC   +         DGI+GLG+G  SIV QL     I 
Sbjct: 141 KDLLDFG--PASRLQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIE 198

Query: 202 NVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
           +    C  G   GGG +  G  +   S +V+       + YY+  + E+   G +  L +
Sbjct: 199 DSFSLCYGGMDEGGGSMVLG-AIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDS 257

Query: 260 ------LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
                    + DSG++Y YL    ++  T  +  +L +    + P+     +C+ G    
Sbjct: 258 NVFNGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAG-- 315

Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLN 371
            +  ++ K F  +   F + +      L PE YL    K  G  CLG     +       
Sbjct: 316 TDTKELGKHFPLVDFVFAENQK---VSLAPENYLFKHTKVPGAYCLGFFKNQDA----TT 368

Query: 372 VIGGI 376
           ++GGI
Sbjct: 369 LLGGI 373


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 149/358 (41%), Gaps = 61/358 (17%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G  L   VH      G + + + IG PA  Y   +DTGSDL W QC  PCV C +   P+
Sbjct: 86  GGDLQVPVHAG---NGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCK-PCVDCFKQSTPV 141

Query: 94  YRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
           + PS+      VPC   +C+ L       C   ++C Y   Y D  S+ GVL  + F   
Sbjct: 142 FDPSSSSTYATVPCSSALCSDLPT---STCTSASKCGYTYTYGDASSTQGVLASETFTLG 198

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
               ++  P +A GCG +   G  +    G++GLG+G  S+VSQL   K       +CL+
Sbjct: 199 --KEKKKLPGVAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLT 250

Query: 210 ----GGGGGFLFFGDDLYDSSR-----------------------VVWTSMSSDYTKYYS 242
               G G   L  G      S                        V  T ++   T+   
Sbjct: 251 SLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITL 310

Query: 243 PGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET 302
           P  A       T G     V+ DSG+S TYL    Y+ L      +++  ++  +  +  
Sbjct: 311 PASAFAIQDDGTGG-----VIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGS--EIG 363

Query: 303 LPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGI 359
           L LC++G  P K V +V+     L L F  G      +L  E Y+++ S  G +CL +
Sbjct: 364 LDLCFQG--PAKGVDEVQ--VPKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTV 414


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 142/368 (38%), Gaps = 55/368 (14%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           R  ++    S   L   V   + F V G  N Y  G Y   + +G PA+ +F+ +DTGSD
Sbjct: 52  RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSD 111

Query: 74  LTWLQCDAPCVRC---------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE---- 120
           + W+ C +PC  C         +E+ +P    +   + C D  C +    G   C+    
Sbjct: 112 ILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNS 170

Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY-- 174
             + C Y   Y DG  + G  V D   F    G       +  +  GC  +Q    +   
Sbjct: 171 QSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKAD 230

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
             +DGI G G+ + S++SQL+S  +   V  HCL G   GGG L  G+ +     +V+T 
Sbjct: 231 RAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPG--LVYTP 288

Query: 233 MSSDYTKY------------YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
           +      Y              P  + LF    T G      + DSG++  YL    Y  
Sbjct: 289 LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQG-----TIVDSGTTLAYLADGAYDP 343

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
             S +   +S              L  KG + F     V   F T+ L F  G       
Sbjct: 344 FVSAIAAAVSP---------SVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG---VAMS 391

Query: 341 LTPEAYLI 348
           + PE YL+
Sbjct: 392 VKPENYLL 399


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 84/296 (28%), Positives = 129/296 (43%), Gaps = 44/296 (14%)

Query: 41  VHGNVYPT-GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND 99
           +   + P+ G Y + +YIG P  P    +DTGSDLTW QC  PC  C +   PL+ P N 
Sbjct: 81  IQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLFDPKNS 139

Query: 100 LV----PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
                  C    C +L      +C    +C +   YADG  + G L  +    + T G+ 
Sbjct: 140 STYRDSSCGTSFCLALGK--DRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKP 197

Query: 156 LN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           ++ P  A GCG++   G       GI+GLG G+ S++SQL S   I  +  +CL      
Sbjct: 198 VSFPGFAFGCGHSS-GGIFDKSSSGIVGLGGGELSLISQLKST--INGLFSYCL------ 248

Query: 215 FLFFGDDLYDSSRVVW--TSMSSDYTKYYSPGVAE-------LFFGGETTGLKNLP---- 261
            L    D   SSR+ +  +   S Y    +P V +       L   G + G K LP    
Sbjct: 249 -LPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGY 307

Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                     ++ DSG++YT+L +  Y  L   +   +  K +++   +    LC+
Sbjct: 308 SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDP--NGIFSLCY 361


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 142/368 (38%), Gaps = 55/368 (14%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           R  ++    S   L   V   + F V G  N Y  G Y   + +G PA+ +F+ +DTGSD
Sbjct: 54  RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSD 113

Query: 74  LTWLQCDAPCVRC---------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE---- 120
           + W+ C +PC  C         +E+ +P    +   + C D  C +    G   C+    
Sbjct: 114 ILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNS 172

Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY-- 174
             + C Y   Y DG  + G  V D   F    G       +  +  GC  +Q    +   
Sbjct: 173 QSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKAD 232

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
             +DGI G G+ + S++SQL+S  +   V  HCL G   GGG L  G+ +     +V+T 
Sbjct: 233 RAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPG--LVYTP 290

Query: 233 MSSDYTKY------------YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
           +      Y              P  + LF    T G      + DSG++  YL    Y  
Sbjct: 291 LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQG-----TIVDSGTTLAYLADGAYDP 345

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
             S +   +S              L  KG + F     V   F T+ L F  G       
Sbjct: 346 FVSAIAAAVSP---------SVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG---VAMS 393

Query: 341 LTPEAYLI 348
           + PE YL+
Sbjct: 394 VKPENYLL 401


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 150/368 (40%), Gaps = 70/368 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G + + + IG PA  Y   +DTGSDL W QC  PC  C + P P++ P    S   V C
Sbjct: 105 SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGC 163

Query: 104 EDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              +C +L      NC ED   C+Y   Y D  S+ G+L  + F F   N       +  
Sbjct: 164 SSGLCNALP---RSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSIS---GIGF 217

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFF 218
           GCG     G  +    G++GLG+G  S++SQL   K       +CL+          LF 
Sbjct: 218 GCGVEN-EGDGFSQGSGLVGLGRGPLSLISQLKETKF-----SYCLTSIEDSEASSSLFI 271

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYS-------PGVAELFFGGETTGLKNLPV--------- 262
           G            ++  + TK  S       P    L   G T G K L V         
Sbjct: 272 GSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSE 331

Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRP 312
                 + DSG++ TYL    ++    ++K+E +++     P D++    L LC+K    
Sbjct: 332 DGTGGMIIDSGTTITYLEETAFK----VLKEEFTSR--MSLPVDDSGSTGLDLCFKLPNA 385

Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL--GILNG----AEV 365
            KN+   K  F                EL  E Y++  S+ G +CL  G  NG      V
Sbjct: 386 AKNIAVPKLIFHFKGAD---------LELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 436

Query: 366 GLQDLNVI 373
             Q+ NV+
Sbjct: 437 QQQNFNVL 444


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 149/361 (41%), Gaps = 47/361 (13%)

Query: 44  NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS------ 97
            V   G Y   + +G P + Y + +DTGSD+ W+ C  PC +C    +  +R S      
Sbjct: 67  RVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNA 125

Query: 98  ---NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
              +  V C+D  C+ +      +C+    C Y + YAD  +S G  ++D        G 
Sbjct: 126 SSTSKKVGCDDDFCSFISQSD--SCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGD 183

Query: 155 R----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                L   +  GCG +Q    G     +DG++G G+  +S++SQL +    + V  HCL
Sbjct: 184 LKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL 243

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG---------LKN 259
               GG + F   + DS +V  T M  +   Y       +  G +  G         ++N
Sbjct: 244 DNVKGGGI-FAVGVVDSPKVKTTPMVPNQMHY-----NVMLMGMDVDGTSLDLPRSIVRN 297

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              + DSG++  Y  +V Y +L   +   L+ + +K    +ET        + F    +V
Sbjct: 298 GGTIVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETF-------QCFSFSTNV 347

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
            + F  ++  F D    T++   P  YL    +   C G   G     +   VI  +GD 
Sbjct: 348 DEAFPPVSFEFEDSVKLTVY---PHDYLFTLEEELYCFGWQAGGLTTDERSEVI-LLGDL 403

Query: 380 V 380
           V
Sbjct: 404 V 404


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 145/345 (42%), Gaps = 45/345 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           TG Y VT+ +G PA  Y +  DTGSD TW+QC+   V C E    L+ P+       + C
Sbjct: 183 TGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISC 242

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P C+ L+  G         C Y ++Y DG  S+G    D    +  +  +       G
Sbjct: 243 AAPACSDLYTKGCSG----GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIK---GFRFG 295

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDD 221
           CG        +    G+LGLG+GK+S+  Q + +     V  HC      G G+L FG  
Sbjct: 296 CGERNE--GLFGEAAGLLGLGRGKTSLPVQAYDK--YGGVFAHCFPARSSGTGYLDFGP- 350

Query: 222 LYDSSRVVWTSMSS-----DYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
              SS  V T +++     +   +Y  G+  +  GG+   L   P VF       DSG+ 
Sbjct: 351 --GSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKL--LSIPPSVFTTAGTIVDSGTV 406

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
            T L    Y +L S     ++A+  K+AP    L  C+     F  +  V     T++L 
Sbjct: 407 ITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYD----FTGMSQVA--IPTVSLL 460

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           F  G +    ++     +  ++    CLG     E    D+ ++G
Sbjct: 461 FQGGAS---LDVDASGIIYAASVSQACLGFAANEED--DDVGIVG 500


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS------- 97
           V   G Y   + +G P + Y + +DTGSD+ W+ C  PC  C    +  +  S       
Sbjct: 68  VDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNC-KPCPECPSKTNLNFHLSLFDVNAS 126

Query: 98  --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
             +  V C+D  C+ +      +C+    C Y + YAD  +S G  ++D        G  
Sbjct: 127 STSKKVGCDDDFCSFISQS--DSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDL 184

Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
               L   +  GCG +Q    G S   +DG++G G+  +S++SQL +    + V  HCL 
Sbjct: 185 QTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLD 244

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG---------LKNL 260
              GG + F   + DS +V  T M  +   Y       +  G +  G         ++N 
Sbjct: 245 NVKGGGI-FAVGVVDSPKVKTTPMVPNQMHYNV-----MLMGMDVDGTALDLPPSIMRNG 298

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
             + DSG++  Y  +V Y +L   +   L+ + +K    ++T        + F    +V 
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEDTF-------QCFSFSENVD 348

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
             F  ++  F D    T++   P  YL    K   C G   G     +   VI  +GD V
Sbjct: 349 VAFPPVSFEFEDSVKLTVY---PHDYLFTLEKELYCFGWQAGGLTTGERTEVI-LLGDLV 404


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 81/253 (32%), Positives = 119/253 (47%), Gaps = 29/253 (11%)

Query: 45  VYPTGY-YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR 95
           + P G+ Y   + +G P  PY + LDTGSDL WL CD  CV C+   +         +Y 
Sbjct: 100 ISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYS 157

Query: 96  PSND----LVPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF- 148
           P+N      V C   +C+ L       C  P+  C Y++ Y +D  SS G LV+D     
Sbjct: 158 PNNSSTSKEVQCSSSLCSHL-----DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLT 212

Query: 149 -NYTNGQRLNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
            N    + +N R+ LGCG +Q  GA  S    +G+ GLG    S+ S L +  LI N   
Sbjct: 213 TNDVQSKPVNARITLGCGKDQ-SGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFS 271

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
            C      G + FGD           ++   +   Y+  + ++  GG  + L ++ V+FD
Sbjct: 272 LCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPT-YNVSITQIGVGGHISDL-DVAVIFD 329

Query: 266 SGSSYTYLNRVTY 278
           SG+S+TYLN   Y
Sbjct: 330 SGTSFTYLNDPAY 342


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 109/408 (26%), Positives = 164/408 (40%), Gaps = 64/408 (15%)

Query: 12  FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVH----GNVYPT--GYYNVTMYIGQPARPYF 65
           FP  + +        ++L  H G  LL  V     GN  PT  G Y   + IG P++ Y+
Sbjct: 44  FPRHQGNGPGGEEHLAALRKHDGRRLLTAVDLPLGGNGIPTDTGLYFTQIGIGTPSKGYY 103

Query: 66  LDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCEDPICASLHA 113
           + +DTGSD+ W+ C    + C   P          LY P    S+  V C    CA+   
Sbjct: 104 VQVDTGSDILWVNC----ISCDSCPRKSGLGIDLTLYDPTASASSKTVTCGQEFCATATN 159

Query: 114 PG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----QRLNPRLALGCG--Y 166
            G   +C   + C Y + Y DG S+ G  V D   ++  +G       N  +  GCG   
Sbjct: 160 GGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANASVTFGCGAKI 219

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSS 226
               G+S   LDGILG G+  SS++SQL S   +  +  HCL    GG +F   ++    
Sbjct: 220 GGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGGIFAIGNVVQPK 279

Query: 227 RVVWTSMSSDYTKYYSPGVAELFFGGET---------TGLKNLPVVFDSGSSYTYLNRVT 277
             V T+       +Y+  +  +  GG T          G  +   + DSG++  YL  V 
Sbjct: 280 --VKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTLAYLPEVV 337

Query: 278 YQTLTSIMKKELSAKSLKEA----------------PE-----DETLPL-CWKGRRPFKN 315
           Y+ + S +       +LK                  PE     D  LPL  +     F+N
Sbjct: 338 YKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHDYLFQN 397

Query: 316 VHDVKKC-FRTLALSFTDGKTRTLF-ELTPEAYLIISNKGNVCLGILN 361
             DV    F++  +   DGK   L  +L     L++ +  N  +G  N
Sbjct: 398 TEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTN 445


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 81/253 (32%), Positives = 119/253 (47%), Gaps = 29/253 (11%)

Query: 45  VYPTGY-YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR 95
           + P G+ Y   + +G P  PY + LDTGSDL WL CD  CV C+   +         +Y 
Sbjct: 123 ISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYS 180

Query: 96  PSND----LVPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF- 148
           P+N      V C   +C+ L       C  P+  C Y++ Y +D  SS G LV+D     
Sbjct: 181 PNNSSTSKEVQCSSSLCSHL-----DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLT 235

Query: 149 -NYTNGQRLNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
            N    + +N R+ LGCG +Q  GA  S    +G+ GLG    S+ S L +  LI N   
Sbjct: 236 TNDVQSKPVNARITLGCGKDQ-SGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFS 294

Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
            C      G + FGD           ++   +   Y+  + ++  GG  + L ++ V+FD
Sbjct: 295 LCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPT-YNVSITQIGVGGHISDL-DVAVIFD 352

Query: 266 SGSSYTYLNRVTY 278
           SG+S+TYLN   Y
Sbjct: 353 SGTSFTYLNDPAY 365


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 109/391 (27%), Positives = 163/391 (41%), Gaps = 70/391 (17%)

Query: 17  MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTW 76
           MS   + +++ S+       L   VH      G + + M IG PA  Y   +DTGSDL W
Sbjct: 87  MSRLVARTATGSVKAAAAPDLQVPVHAG---NGEFLMDMSIGTPALAYAAIVDTGSDLVW 143

Query: 77  LQCDAPCVRCVEAPHPLYRPSN----DLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEY 131
            QC  PCV C     P++ PS+      +PC   +C+ L       C   A+ C Y   Y
Sbjct: 144 TQCK-PCVECFNQSTPVFDPSSSSTYSTLPCSSSLCSDLPT---STCTSAAKDCGYTYTY 199

Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
            D  S+ GVL  + F    T      P +A GCG +   G  +    G++GLG+G  S+V
Sbjct: 200 GDASSTQGVLAAETFTLAKTK----LPGVAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLV 254

Query: 192 SQLHSQKLIRNVVGHCLSGGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTKYYS 242
           SQL   K       +CL+            G       D   ++ +  T +  + ++   
Sbjct: 255 SQLGLGKF-----SYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQ--- 306

Query: 243 PGVAELFFGGETTGLKNLP---------------VVFDSGSSYTYLNRVTYQTLTSIMKK 287
           P    +     T G   +P               V+ DSG+S TYL    Y+ L    KK
Sbjct: 307 PSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPL----KK 362

Query: 288 ELSAKSLKEAPEDET---LPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
             +A+ +K    D +   L LC+K   P   V DV+     L L F  G      +L  E
Sbjct: 363 AFAAQ-MKLPVADGSAVGLDLCFKA--PASGVDDVE--VPKLVLHFDGGAD---LDLPAE 414

Query: 345 AYLII-SNKGNVCLGILNGAEVGLQDLNVIG 374
            Y+++ S  G +CL ++     G + L++IG
Sbjct: 415 NYMVLDSASGALCLTVM-----GSRGLSIIG 440


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 133/339 (39%), Gaps = 48/339 (14%)

Query: 39  FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPH 91
           F + G   P   G Y   + +G P + Y + +DTGSD+ W+ C  PC  C     +  P 
Sbjct: 15  FSLGGTADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPL 73

Query: 92  PLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
            +Y P    +  LV C DP+C          C      C+Y   Y DG +S G  V+DA 
Sbjct: 74  TMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAM 133

Query: 147 AFNYTNGQRL---NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
            +N  +   L     ++  GC   Q      S   +DGI+G G+ + S+ +QL +Q+ I 
Sbjct: 134 QYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIP 193

Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYS------------PGVAELF 249
            V  HCL G   G             + +T +  D   Y              P  AE F
Sbjct: 194 RVFSHCLEGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDF 253

Query: 250 FGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
                TG     V+ DSG++  Y     Y      +++  SA  ++    D    L   G
Sbjct: 254 SSTNDTG-----VIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLV-SG 307

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
           R        +   F  + L+F  G      EL P+ YL+
Sbjct: 308 R--------LSDLFPNVTLNFEGGA----MELQPDNYLM 334


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 147/348 (42%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G+   TG Y VT+ +G PA  Y +  DTGSD TW+QC+   V C +    L+ P+     
Sbjct: 153 GSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTY 212

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             + C  P C+ L+  G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 213 ANISCAAPACSDLYIKG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIK--- 265

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        Y    G+LGLG+GK+S+  Q + +     V  HC      G G+L
Sbjct: 266 GFRFGCGERNE--GLYGEAAGLLGLGRGKTSLPVQAYDK--YGGVFAHCFPARSSGTGYL 321

Query: 217 FFG-DDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            FG   L   S  + T M  D    +Y  G+  +  GG+   L ++P         + DS
Sbjct: 322 DFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGK---LLSIPQSVFTTSGTIVDS 378

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L S     ++ +  K+AP    L  C+     F  + +V     T+
Sbjct: 379 GTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYD----FTGMSEVA--IPTV 432

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G +    ++     +  ++    CLG     E    D+ ++G
Sbjct: 433 SLLFQGGAS---LDVHASGIIYAASVSQACLGFAGNKED--DDVGIVG 475


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 147/363 (40%), Gaps = 70/363 (19%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
           + + IG PA  Y   +DTGSDL W QC  PC  C + P P++ P    S   V C   +C
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59

Query: 109 ASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
            +L      NC ED   C+Y   Y D  S+ G+L  + F F   N       +  GCG  
Sbjct: 60  NALP---RSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSIS---GIGFGCGVE 113

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFGDDLY 223
              G  +    G++GLG+G  S++SQL   K       +CL+          LF G    
Sbjct: 114 N-EGDGFSQGSGLVGLGRGPLSLISQLKETKF-----SYCLTSIEDSEASSSLFIGSLAS 167

Query: 224 DSSRVVWTSMSSDYTKYYS-------PGVAELFFGGETTGLKNLPV-------------- 262
                   S+  + TK  S       P    L   G T G K L V              
Sbjct: 168 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGG 227

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVH 317
            + DSG++ TYL    ++    ++K+E +++     P D++    L LC+K     KN+ 
Sbjct: 228 MIIDSGTTITYLEETAFK----VLKEEFTSR--MSLPVDDSGSTGLDLCFKLPDAAKNIA 281

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL--GILNG----AEVGLQDL 370
             K  F                EL  E Y++  S+ G +CL  G  NG      V  Q+ 
Sbjct: 282 VPKMIFHFKGAD---------LELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNF 332

Query: 371 NVI 373
           NV+
Sbjct: 333 NVL 335


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/298 (29%), Positives = 134/298 (44%), Gaps = 25/298 (8%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           S S S++L N   +S    +  ++ P +G Y +++ IG P   Y    DTGSDLTW QC 
Sbjct: 62  SLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQC- 120

Query: 81  APCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
            PC++C +   P++ P    S   VPC    C   HA    +C     CDY   Y D   
Sbjct: 121 LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---HAVDDGHCGVQGVCDYSYTYGDRTY 177

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           S G L      F        + +  +GCG+    G  +    G++GLG G+ S+VSQ+  
Sbjct: 178 SKGDL-----GFEKITIGSSSVKSVIGCGHASSGGFGFA--SGVIGLGGGQLSLVSQMSQ 230

Query: 197 QKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFG 251
              I     +CL        G + FG++   S   V ++  +S +   YY   +  +  G
Sbjct: 231 TSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIG 290

Query: 252 GE--TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            E      K   V+ DSG++ T L +  Y  + S + K + AK +K+     +L LC+
Sbjct: 291 NERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKD--PHGSLDLCF 346


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 149/377 (39%), Gaps = 59/377 (15%)

Query: 39  FQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
           F V G  N Y  G Y   + +G PA+ YF+ +DTGSD+ W+ C +PC  C         +
Sbjct: 75  FPVEGSANPYMVGLYFTRVKLGNPAKEYFVQIDTGSDILWVAC-SPCTGCPTSSGLNIQL 133

Query: 88  EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCED----PAQCDYELEYADGGSSLGVLVK 143
           E  +P    ++  +PC D  C +    G   C+      + C Y   Y DG  + G  V 
Sbjct: 134 EFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVS 193

Query: 144 DAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQ 197
           D   F+   G          +  GC  +Q      +   +DGI G G+ + S+VSQL+S 
Sbjct: 194 DTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSL 253

Query: 198 KLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSP 243
            +      HCL G   GGG L  G+ +     +V+T +      Y              P
Sbjct: 254 GVSPKTFSHCLKGSDNGGGILVLGEIV--EPGLVFTPLVPSQPHYNLNLESIAVSGQKLP 311

Query: 244 GVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
             + LF    T G      + DSG++  YL    Y              ++  A      
Sbjct: 312 IDSSLFATSNTQG-----TIVDSGTTLVYLVDGAYDPFI---------NAIAAAVSPSVR 357

Query: 304 PLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
            +  KG + F     V   F T  L F  G + T   + PE YL+   +G+V   +L   
Sbjct: 358 SVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMT---VKPENYLL--QQGSVDNNVL--W 410

Query: 364 EVGLQDLNVIGGIGDFV 380
            +G Q    I  +GD V
Sbjct: 411 CIGWQRSQGITILGDLV 427


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 155/350 (44%), Gaps = 48/350 (13%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAPH------PLYRP----S 97
           Y NVT  +G P+  + + LDTGSDL WL CD    CVR ++AP        +Y P    +
Sbjct: 105 YANVT--VGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASST 162

Query: 98  NDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNG 153
           +  VPC   +C  +       C  P + C Y++ Y ++G SS GVLV+D         N 
Sbjct: 163 SSKVPCNSTLCTRV-----DRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS 217

Query: 154 QRLNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           + +  R+ LGCG  Q     +H     +G+ GLG    S+ S L  + +  N    C   
Sbjct: 218 KPIRARITLGCGLVQT--GVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGD 275

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
            G G + FGD      R    ++   +   Y+  V ++  GG T  L+    VFD+G+S+
Sbjct: 276 DGAGRISFGDKGSVDQRETPLNIRQPHPT-YNVTVTQISVGGNTGDLE-FDAVFDTGTSF 333

Query: 271 TYLNRVTYQTLTSIMKKELSAKSL-KEAPEDETLPL--CWKGRRPFKNVHDVKKCFR--T 325
           TYL    Y    +++ +  ++ +L K    D  LP   C+        V   KK F    
Sbjct: 334 TYLTDAPY----TLISESFNSLALDKRYQTDSELPFEYCYA-------VSPNKKSFEYPD 382

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           + L+   G +  ++   P   + I +    CL I+   ++ +   N + G
Sbjct: 383 VNLTMKGGSSYPVYH--PLIVVPIEDTVVYCLAIMKSEDISIIGQNFMTG 430


>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
 gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
          Length = 297

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 91/188 (48%), Gaps = 22/188 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
           TG Y   + IG PA+ Y++ +DTGSD+ W+ C    V C   P          +Y P   
Sbjct: 87  TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142

Query: 97  -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
            S +LV C+   C + +     +C   + C+Y + Y DG S+ G  V D   +N  +G  
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202

Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
                N  ++ GCG       G+S   LDGILG G+  SS++SQL +   +R +  HCL 
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 210 GGGGGFLF 217
              GG +F
Sbjct: 263 TVNGGGIF 270


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 87/189 (46%), Gaps = 23/189 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLY----R 95
            G Y   + IG P + Y+L +DTGSD+ W+ C    ++C E P          LY     
Sbjct: 82  VGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC----IQCKECPTRSNLGMDLTLYDIKES 137

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            S   VPC+   C  ++      C     C Y   Y DG S+ G  VKD   ++  +G  
Sbjct: 138 SSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDL 197

Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                N  +  GCG  Q   +  ++   L GILG GK  SS++SQL S   ++ +  HCL
Sbjct: 198 KTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL 257

Query: 209 SGGGGGFLF 217
           +G  GG +F
Sbjct: 258 NGVNGGGIF 266


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 88/334 (26%), Positives = 153/334 (45%), Gaps = 41/334 (12%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PHPL 93
           F + GN    G Y   + +G P +   + +DTGSD+ W++C +PC  C+       P  +
Sbjct: 71  FPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSI 129

Query: 94  YR----PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
           Y      ++ +  C DP+C    A    +  + A C Y + Y D  +S+G  VKD   + 
Sbjct: 130 YNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNSA-CAYGISYQDKSTSIGAYVKDDMHYV 188

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
              G      +  GC  N + G+   P DGI+G G+   ++ +Q+ +Q+ +  V  HCL 
Sbjct: 189 LQGGNATTSHIFFGCAIN-ITGS--WPADGIMGFGQISKTVPNQIATQRNMSRVFSHCLG 245

Query: 210 GG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF------------FGGETT 255
           G   GGG L FG++  +++ +V+T +  + T +Y+  +  +             F   + 
Sbjct: 246 GEKHGGGILEFGEE-PNTTEMVFTPL-LNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSN 303

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                 V+ DSG+S+  L     + L S +K   +AK     P+ E L   +      K+
Sbjct: 304 STNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKL---GPKLEGLQCFY-----LKS 355

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
              V+  F  + L+F+ G T    +L P+ YL++
Sbjct: 356 GLTVETSFPNVTLTFSGGST---MKLKPDNYLVM 386


>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
          Length = 136

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 73/134 (54%), Gaps = 5/134 (3%)

Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
           QR   ++A GCGY Q   A   P  +DGILGLG GK+   +QL  QK+I  NV+GHCLS 
Sbjct: 3   QRDKKKIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
            G G L+ GD    S  V W  M      YYSPG+AEL    +   G      VFDSGS+
Sbjct: 63  KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 270 YTYLNRVTYQTLTS 283
           YT++    Y  + S
Sbjct: 122 YTHVPAQIYNEIVS 135


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 48/361 (13%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           V G+   +G Y V  ++G P + + L +D+GSDL W+QC +PC +C     PLY PSN  
Sbjct: 54  VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-SPCRQCYAQDSPLYVPSNSS 112

Query: 101 ----VPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
               VPC    C  + A     C+   P  C YE  YAD  SS GV    A+     +G 
Sbjct: 113 TFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVF---AYESATVDGV 169

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL---HSQKLIRNVVGHCLSGG 211
           R++ ++A GCG +     S+    G+LGLG+G  S  SQ+   +  K    +V +     
Sbjct: 170 RID-KVAFGCGSDN--QGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTS 226

Query: 212 GGGFLFFGDDLYDSSR-VVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV-------- 262
               L FGD+L  +   + +T + S+     SP +  +     T G K+LP+        
Sbjct: 227 VSSSLIFGDELISTIHDMQYTPIVSNPK---SPTLYYVQIEKVTVGGKSLPISDSAWEID 283

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                  +FDSG++ TY     Y   + I+    S      A   + L LC +       
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAY---SHILAAFDSGVHYPRAESVQGLDLCVE----LTG 336

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           V   +  F +  + F DG    +F+   E Y +       CL  + G    L   N IG 
Sbjct: 337 VD--QPSFPSFTIEFDDG---AVFQPEAENYFVDVAPNVRCLA-MAGLASPLGGFNTIGN 390

Query: 376 I 376
           +
Sbjct: 391 L 391


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/351 (26%), Positives = 153/351 (43%), Gaps = 45/351 (12%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-EAPHPLYRP--S 97
           V G    +G Y V + IGQP +   L  DTGSDL W++C A C  C   +P  ++ P  S
Sbjct: 73  VSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHS 131

Query: 98  NDLVP--CEDPICASLHAPGH----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           +   P  C DP+C  +  PG     ++    + C YE  YADG  + G+  ++  +   +
Sbjct: 132 STFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTS 191

Query: 152 NGQRLNPR-LALGCGY----NQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNV 203
           +G+    + +A GCG+      V G S++  +G++GLG+G  S  SQL  +   K    +
Sbjct: 192 SGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 251

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG--------- 252
           + + LS     +L  GD     S++ +T + ++     +Y   +  +F  G         
Sbjct: 252 MDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSI 311

Query: 253 -ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP---LCWK 308
            E     N   V DSG++  +L    Y+ + + +K+      +K    DE  P   LC  
Sbjct: 312 WEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQR-----IKLPNADELTPGFDLCVN 366

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
                  V   +K    L   F+ G    +F   P  Y I + +   CL I
Sbjct: 367 ----VSGVTKPEKILPRLKFEFSGG---AVFVPPPRNYFIETEEQIQCLAI 410


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 95/341 (27%), Positives = 151/341 (44%), Gaps = 37/341 (10%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-CVRCVEAPH------PLYRPSNDLVP 102
           Y NVT  +G P+  + + LDTGSDL WL CD   CVR ++AP        +Y P+     
Sbjct: 105 YANVT--VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTS 162

Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPR 159
            + P  ++L   G       + C Y++ Y ++G SS GVLV+D      N  + + +  R
Sbjct: 163 TKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPAR 222

Query: 160 LALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
           +  GCG  QV    +H     +G+ GLG    S+ S L  + +  N    C    G G +
Sbjct: 223 VTFGCG--QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRI 280

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
            FGD      R    ++   +   Y+  V ++  GG T  L+    VFDSG+S+TYL   
Sbjct: 281 SFGDKGSVDQRETPLNIRQPHPT-YNITVTKISVGGNTGDLE-FDAVFDSGTSFTYLTDA 338

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV---KKCFRTLALSFTDG 333
            Y TL S     L+     +  + E          PF+  + +   K  F+  A++ T  
Sbjct: 339 AY-TLISESFNSLALDKRYQTTDSEL---------PFEYCYALSPNKDSFQYPAVNLTMK 388

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
              +     P   + + +    CL I+      ++D+++IG
Sbjct: 389 GGSSYPVYHPLVVIPMKDTDVYCLAIMK-----IEDISIIG 424


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/364 (28%), Positives = 153/364 (42%), Gaps = 50/364 (13%)

Query: 47  PTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVP 102
           P   Y +  YIG P    F   DTGSDL W+QC APC +CV    PL+ P        VP
Sbjct: 88  PITEYLMRFYIGTPPVERFAIADTGSDLIWVQC-APCEKCVPQNAPLFDPRKSSTFKTVP 146

Query: 103 CEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C+   C +L  P    C   + QC Y+  Y D     G+L  ++  F   N     P+L 
Sbjct: 147 CDSQPC-TLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLT 205

Query: 162 LGCGY--NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LSGGGGGFL 216
            GC +  N     S   + G++GLG G  S++SQL  Q  I     +C   LS      +
Sbjct: 206 FGCTFSNNDTVDESKRNM-GLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKM 262

Query: 217 FFGDD--LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------VVFD 265
            FG+D  +     VV T +     K   P    L   G + G K +          ++ D
Sbjct: 263 RFGNDAIVKQIKGVVSTPL---IIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILID 319

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG+S+T L +  Y    +++K+    +++K  P         KG+R         K F  
Sbjct: 320 SGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKR---------KRFPD 370

Query: 326 LALSFTDGKTRT----LFELTPEAYLII-----SNKGNVCLGILNGAEVGLQ-DLNVIGG 375
           +   FT  K R     LFE      L +     S++ +   G  N A++G Q + ++ GG
Sbjct: 371 VVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFG--NHAQIGYQVEYDLQGG 428

Query: 376 IGDF 379
           +  F
Sbjct: 429 MVSF 432


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 78/266 (29%), Positives = 123/266 (46%), Gaps = 25/266 (9%)

Query: 41  VHGNVYPT-GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND 99
           +   + P+ G Y + +YIG P  P    +DTGSDLTW QC  PC  C +   PL+ P N 
Sbjct: 81  IQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLFDPKNS 139

Query: 100 LV----PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
                  C    C +L      +C    +C +   YADG  + G L  +    + T G+ 
Sbjct: 140 STYRDSSCGTSFCLALGK--DRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKP 197

Query: 156 LN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           ++ P  A GCG++   G       GI+GLG G+ S++SQL S   I  +  +CL      
Sbjct: 198 VSFPGFAFGCGHSS-GGIFDKSSSGIVGLGGGELSLISQLKST--INGLFSYCL------ 248

Query: 215 FLFFGDDLYDSSRVVW--TSMSSDYTKYYSPGVAELFFGG--ETTGLKNLPVVFDSGSSY 270
            L    D   SSR+ +  +   S Y    +P    L + G  + T ++   ++ DSG++Y
Sbjct: 249 -LPVSTDSSISSRINFGASGRVSGYGTVSTP--LRLPYKGYSKKTEVEEGNIIVDSGTTY 305

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKE 296
           T+L +  Y  L   +   +  K +++
Sbjct: 306 TFLPQEFYSKLEKSVANSIKGKRVRD 331


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 91/343 (26%), Positives = 136/343 (39%), Gaps = 55/343 (16%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDLV 101
           Y   + +G P + YF+ +DTGSD+ W+ C +PC  C         +E  +P    ++  +
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 175

Query: 102 PCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
           PC D  C +        C+  D + C Y   Y DG  + G  V D   F+   G      
Sbjct: 176 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 235

Query: 160 ----LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--G 211
               +  GC  +Q    +     +DGI G G+ + S+VSQL+S  +   V  HCL G   
Sbjct: 236 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 295

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGLKN 259
           GGG L  G+ +     +V+T +      Y              P  + LF    T G   
Sbjct: 296 GGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQG--- 350

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              + DSG++  YL    Y    + +   +S              L  KG + F     V
Sbjct: 351 --TIVDSGTTLAYLADGAYDPFVNAITAAVSP---------SVRSLVSKGNQCFVTSSSV 399

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLI----ISNKGNVCLG 358
              F T++L F  G   T   + PE YL+    I N    C+G
Sbjct: 400 DSSFPTVSLYFMGGVAMT---VKPENYLLQQASIDNNVLWCIG 439


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 129/300 (43%), Gaps = 25/300 (8%)

Query: 8   ENLCFPTVRMSSSSSSSSSSSLFNHVGSSL---LFQVHGNVYPTGYYNVTMYIGQPARPY 64
           E L     R++S  S  S     NHV  S    L    G+   +G Y VT+ +G P    
Sbjct: 87  EILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDL 146

Query: 65  FLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASL-HAPGHHN 118
            L  DTGSDLTW QC  PCVR C +   P++ PS       V C    C SL  A G+  
Sbjct: 147 SLIFDTGSDLTWTQCQ-PCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAG 205

Query: 119 CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD 178
               + C Y ++Y D   S+G L KD F    ++   +   +  GCG N      +  + 
Sbjct: 206 SCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSD---VFDGVYFGCGENN--QGLFTGVA 260

Query: 179 GILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD-LYDSSRVVWTSMSS 235
           G+LGLG+ K S  SQ  +      +  +CL  S    G L FG   +  S +    S  +
Sbjct: 261 GLLGLGRDKLSFPSQ--TATAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTIT 318

Query: 236 DYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
           D T +Y   +  +  GG+     +T       + DSG+  T L    Y  L S  K ++S
Sbjct: 319 DGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMS 378


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/276 (28%), Positives = 116/276 (42%), Gaps = 30/276 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
           +G Y V + IG P    +L +D+GSD+ W+QC  PC+ C     PL+ P+       VPC
Sbjct: 124 SGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPATSATFSAVPC 182

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C +L   G   C D   CDYE+ Y DG  + G L  +      T  +     +A+G
Sbjct: 183 GSAVCRTLRTSG---CGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVE----GVAIG 235

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLY 223
           CG+       +    G+LGLG G  S+V QL           +CL+  G G L  G    
Sbjct: 236 CGHRNR--GLFVGAAGLLGLGWGPMSLVGQLGGAAG--GAFSYCLASRGAGSLVLGRSEA 291

Query: 224 DSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKN----------LPVVFDSGSSYT 271
                VW  +  +     +Y  G++ +  G E   L+             VV D+G++ T
Sbjct: 292 VPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVT 351

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            L +  Y  L       + A  L  AP    L  C+
Sbjct: 352 RLPQEAYAALRDAFVAAVGA--LPRAPGVSLLDTCY 385


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/337 (29%), Positives = 145/337 (43%), Gaps = 45/337 (13%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
           S +S S+ S+  H+G S+          +  Y VT+ +G PA    L +DTGSDL+W+QC
Sbjct: 98  SRASKSNVSIPTHLGGSV---------DSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQC 148

Query: 80  DAPC--VRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGH-HNCED----PAQCDYE 128
            APC    C     PL+ PS       +PC    C  L   G+  +C       AQC Y 
Sbjct: 149 -APCNSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYA 207

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGK 187
           + Y DG  + GV   +        G  +      GCG++Q  P   Y   DG+LGLG   
Sbjct: 208 ITYGDGSQTTGVYSNETLTM--APGVTVK-DFHFGCGHDQDGPNDKY---DGLLGLGGAP 261

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV 245
            S+V Q  S  +      +CL       GFL  G  + D+S  V+T M  +   +Y   +
Sbjct: 262 ESLVVQTSS--VYGGAFSYCLPAANDQAGFLALGAPVNDASGFVFTPMVREQQTFYVVNM 319

Query: 246 AELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
             +  GGE   +        ++ DSG+  T L    Y  L +  +K ++A  L    E +
Sbjct: 320 TGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGELD 379

Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
           T   C+     F    +V      +AL+F+ G T  L
Sbjct: 380 T---CYN----FTGHSNVT--VPRVALTFSGGATVDL 407


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 150/345 (43%), Gaps = 39/345 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP-------HPLYRPSNDLVPC 103
           Y + + +G P        DTGSDL W+ C +      +A         P    +   + C
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
           +   C +L      +C+  ++C Y+  Y DG  ++GVL  + F+F      GQ   PR+ 
Sbjct: 163 QSNACQALS---QASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVN 219

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLF 217
            GC       A     DG++GLG G  S+VSQL +   I   + +CL           L 
Sbjct: 220 FGC---STASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLN 276

Query: 218 FGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
           FG     S     ++  + SD   YY+  +  +  GG+     +  ++ DSG++ T+L+ 
Sbjct: 277 FGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHDSRIIVDSGTTLTFLDP 336

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKN--VHDVKKCFRTLALSFT 331
                L + +++ +  + ++  P ++ L LC+  +G+    N  + DV        L F 
Sbjct: 337 ALLGPLVTELERRIKLQRVQ--PPEQLLQLCYDVQGKSETDNFGIPDVT-------LRFG 387

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            G   T   L PE    +  +G +CL ++  +E   Q ++++G I
Sbjct: 388 GGAAVT---LRPENTFSLLQEGTLCLVLVPVSES--QPVSILGNI 427


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/361 (26%), Positives = 157/361 (43%), Gaps = 44/361 (12%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYR 95
           +HG V   GY+  T+Y+G PA+ + + +DTGS +T++ C +    C       A  P   
Sbjct: 68  LHGAVKDYGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEAS 127

Query: 96  PSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
            +   + C  P C+     G   C     QC Y   YA+  SS G+L++D  A +  +G 
Sbjct: 128 STASRISCTSPKCSC----GSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALH--DGL 181

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGG 213
              P +  GC   +         DG+ GLG   +S+V+QL    +I +V   C     G 
Sbjct: 182 PGAP-IIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGD 240

Query: 214 GFLFFGD-DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPV-------- 262
           G L  GD ++  S  + +T +  S+ +  YY+  +  L   G+      LPV        
Sbjct: 241 GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQL-----LPVSQSLFDQG 295

Query: 263 ---VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE--APEDETLPLCWKGRRPFKNVH 317
              V DSG+++TY+    ++     ++K   +  LK    P+ +   +C+       ++ 
Sbjct: 296 YGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLE 355

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS--NKGNVCLGILNGAEVGLQDLNVIGG 375
            +   F ++ + F  G   T   L P  YL +   N G  CLG+ +    G     ++GG
Sbjct: 356 ALSSVFPSMEVQFDQG---TSLVLGPLNYLFVHTFNSGKYCLGVFDNGRAG----TLLGG 408

Query: 376 I 376
           I
Sbjct: 409 I 409


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 90/344 (26%), Positives = 142/344 (41%), Gaps = 44/344 (12%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---- 86
           VG  + F V G+  P   G Y   + +G P R + + +DTGSD+ W+ C++ C  C    
Sbjct: 46  VGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTS 104

Query: 87  -VEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGV 140
            +      +  S+      V C DPIC S        C     QC Y  +Y DG  + G 
Sbjct: 105 GLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGY 164

Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
            V D   F+   GQ L    +  +  GC   Q    +     +DGI G G+G+ S++SQL
Sbjct: 165 YVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQL 224

Query: 195 HSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
            ++ +   V  HCL   G GGG L  G+ L     +V++ +      +Y+  +  +   G
Sbjct: 225 STRGITPRVFSHCLKGDGSGGGILVLGEIL--EPGIVYSPLVPS-QPHYNLNLLSIAVNG 281

Query: 253 ETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           +   +         +   + DSG++  YL    Y    S +   +S             P
Sbjct: 282 QLLPIDPAAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSP---------SVTP 332

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
           +  KG + +     V + F   + +F  G +     L PE YLI
Sbjct: 333 ITSKGNQCYLVSTSVSQMFPLASFNFAGGASMV---LKPEDYLI 373


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/349 (27%), Positives = 151/349 (43%), Gaps = 36/349 (10%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
           G+   TG Y VT+ +G P R      DTGSDLTW QC+ PC R C     P++ PS    
Sbjct: 130 GSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCE-PCARYCYHQQEPIFNPSKSTS 188

Query: 101 ---VPCEDPICASLHA-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
              + C  P C  L +  G+      + C Y ++Y D   S+G   +D  A   T+   +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD---V 245

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
                 GCG N      +  + G++GLG+   S+VSQ  +QK  + +  +CL  +    G
Sbjct: 246 FNNFLFGCGQNNR--GLFVGVAGLIGLGRNALSLVSQT-AQKYGK-LFSYCLPSTSSSTG 301

Query: 215 FLFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSG 267
           +L FG     S  V +T   ++S    +Y   +  +  GG       +       + DSG
Sbjct: 302 YLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSG 361

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +  + L    Y  L +  ++++S K  K AP    L  C+   +   +  DV K    + 
Sbjct: 362 TVISRLPPTAYSDLRASFQQQMS-KYPKAAPA-SILDTCYDFSQ--YDTVDVPK----IN 413

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           L F+DG      +L P     I N   VCL     ++    D+ ++G +
Sbjct: 414 LYFSDGAE---MDLDPSGIFYILNISQVCLAFAGNSDA--TDIAILGNV 457


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 154/370 (41%), Gaps = 48/370 (12%)

Query: 33  VGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           VG  + F V+G  + Y  G Y   + +G P R + + +DTGSD+ W+ C++ C  C    
Sbjct: 66  VGGVVDFTVYGTSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNS-CNDCPRTS 124

Query: 91  ---------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGV 140
                     P    +  LV C  PIC SL       C   + QC Y   Y DG  + G 
Sbjct: 125 GLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGY 184

Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
            V D   F+   G  L    +  +  GC   Q    +     +DGI G G+   S+VSQL
Sbjct: 185 YVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQL 244

Query: 195 HSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
            S  +   V  HCL   G GGG L  G+ L     ++++ +    + +Y+  +  +   G
Sbjct: 245 SSLGITPKVFSHCLKGEGDGGGKLVLGEIL--EPNIIYSPLVPSQS-HYNLNLQSISVNG 301

Query: 253 ETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           +   +         N   + DSG++ TYL    Y    S +   +S+          T P
Sbjct: 302 QLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSS---------TTP 352

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI---ISNKGNV-CLGIL 360
           +  KG + +     V + F  ++L+F  G +     L P  YL+    S+   + C+G  
Sbjct: 353 VLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMV---LKPGEYLMHLGFSDGAAMWCIGFQ 409

Query: 361 NGAEVGLQDL 370
             AE G+  L
Sbjct: 410 KVAEPGITIL 419


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 96/341 (28%), Positives = 151/341 (44%), Gaps = 37/341 (10%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-CVRCVEAPH------PLYRPSNDLVP 102
           Y NVT  +G P+  + + LDTGSDL WL CD   CVR ++AP        +Y P+     
Sbjct: 105 YANVT--VGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTS 162

Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPR 159
            + P  ++L   G       + C Y++ Y ++G SS GVLV+D      N  + + +  R
Sbjct: 163 TKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPAR 222

Query: 160 LALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
           + LGCG  QV    +H     +G+ GLG    S+ S L  + +  N    C    G G +
Sbjct: 223 VTLGCG--QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRI 280

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
            FGD      R    ++   +   Y+  V ++   G T  L+    VFDSG+S+TYL   
Sbjct: 281 SFGDKGSVDQRETPLNIRQPHPT-YNITVTKISVEGNTGDLE-FDAVFDSGTSFTYLTDA 338

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV---KKCFRTLALSFTDG 333
            Y TL S     L+     +  + E          PF+  + +   K  F+  A++ T  
Sbjct: 339 AY-TLISESFNSLALDKRYQTTDSEL---------PFEYCYALSPNKDSFQYPAVNLTMK 388

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
              +     P   + + +    CL IL      ++D+++IG
Sbjct: 389 GGSSYPVYHPLVVIPMKDTDVYCLAILK-----IEDISIIG 424


>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
          Length = 142

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 75/138 (54%), Gaps = 5/138 (3%)

Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
           CGY Q   A     P+DGILGLG GK+    QL  QK+I+ N++GHCLS  G G L+ GD
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
               S  V W  M      YYSPG+AEL    +   G      VFDSGS+YT++    Y 
Sbjct: 61  FNPPSRGVTWVPMRESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHIYS 119

Query: 280 TLTSIMKKELSAKSLKEA 297
            + S ++  LS  SL+E 
Sbjct: 120 EIVSKVRGTLSESSLEEV 137


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 126/319 (39%), Gaps = 46/319 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRP----SNDLVPCEDPI 107
           +G P + Y + +DTGSD+ W+ C  PC  C     +  P  +Y P    +  LV C DP+
Sbjct: 8   LGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSLVSCSDPL 66

Query: 108 CASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRL---NPRLALG 163
           C          C      C+Y   Y DG +S G  V+DA  +N  +   L     ++  G
Sbjct: 67  CVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFG 126

Query: 164 CGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
           C   Q      S   +DGI+G G+ + S+ +QL +Q+ I  V  HCL G   G       
Sbjct: 127 CSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVIG 186

Query: 222 LYDSSRVVWTSMSSDYTKYYS------------PGVAELFFGGETTGLKNLPVVFDSGSS 269
                 + +T +  D   Y              P  AE F     TG     V+ DSG++
Sbjct: 187 GIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTG-----VIMDSGTT 241

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
             Y     Y      +++  SA  ++    D    L   GR        +   F  + L+
Sbjct: 242 LAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLV-SGR--------LSDLFPNVTLN 292

Query: 330 FTDGKTRTLFELTPEAYLI 348
           F  G      EL P+ YL+
Sbjct: 293 FEGGA----MELQPDNYLM 307


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 134/301 (44%), Gaps = 35/301 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--CEDP 106
           Y +T+ +G PA+   + +D+GSD++W+QC  PC++C     PL+ P  S+   P  C   
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCK-PCLQCHSQVDPLFDPSLSSTYSPFSCSSA 189

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            CA L   G + C   +QC Y + YADG S+ G    D  A     G         GC +
Sbjct: 190 ACAQLGQDG-NGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL----GSNTISNFQFGCSH 244

Query: 167 NQVPGASYHPL-DGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLY 223
            +   + ++ L DG++GLG G  S+ SQ  +         +CL  +    GFL  G    
Sbjct: 245 VE---SGFNDLTDGLMGLGGGAPSLASQ--TAGTFGTAFSYCLPPTPSSSGFLTLG---A 296

Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSYTYLNRVT 277
            +S  V T M  SS    +Y   +  +  GG      T + +  +V DSG+  T L R  
Sbjct: 297 GTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPRTA 356

Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
           Y  L+S  K  +  K  + AP    +  C+     F     V+    ++AL F+ G    
Sbjct: 357 YSALSSAFKAGM--KQYRPAPPRSIMDTCFD----FSGQSSVR--LPSVALVFSGGAVVN 408

Query: 338 L 338
           L
Sbjct: 409 L 409


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 109/388 (28%), Positives = 169/388 (43%), Gaps = 53/388 (13%)

Query: 17  MSSSSSSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           +S S+++   S+    +   ++  V   V   +G Y V +Y+G P R + + +DTGSDL 
Sbjct: 114 LSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYLGTPPRRFRMIMDTGSDLN 173

Query: 76  WLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGH---HNCEDPAQ--CD 126
           WLQC APC+ C E   P++ P+  +    V C D  C  +  P       C  P    C 
Sbjct: 174 WLQC-APCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCP 232

Query: 127 YELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGK 185
           Y   Y D  ++ G L  +AF  N T +G R    +A GCG+       +H   G+LGLG+
Sbjct: 233 YYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGHRN--RGLFHGAAGLLGLGR 290

Query: 186 GKSSIVSQLHSQKLIRNVVG-----HCL----SGGGGGFLFFGDD-LYDSSRVVWTSM-- 233
           G  S  SQL      R V G     +CL    S  G   +F  DD L    ++ +T+   
Sbjct: 291 GPLSFASQL------RGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLNYTAFAP 344

Query: 234 SSDYTKYYSPGVAELFFGGETTGLKNLPV-----VFDSGSSYTYLNRVTYQTLTSIMKKE 288
           ++D   +Y   +  +  GGE   + +  +     + DSG++ +Y     YQ +       
Sbjct: 345 TTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDR 404

Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYL 347
           +S            L L +    P  NV   +K     L+L F DG     +E   E Y 
Sbjct: 405 MSPSY--------PLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAA---WEFPAENYF 453

Query: 348 I-ISNKGNVCLGILNGAEVGLQDLNVIG 374
           I +  +G +CL +L     G   +++IG
Sbjct: 454 IRLEPEGIMCLAVLGTPRSG---MSIIG 478


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 145/341 (42%), Gaps = 39/341 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--PLYRPSNDLVPCEDPICASLHAP 114
           +G PA  Y + LDTGSDL WL C+  C +CV         + + ++   ++   +   A 
Sbjct: 119 VGTPASSYLVALDTGSDLFWLPCN--CTKCVHGIQLSTGQKIAFNIYDNKESSTSKNVAC 176

Query: 115 GHHNCEDPAQCD--------YELEY-ADGGSSLGVLVKDAFAF---NYTNGQRLNPRLAL 162
               CE   QC         Y++EY ++  S+ G LV+D       N    Q  NP +  
Sbjct: 177 NSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQHANPLITF 236

Query: 163 GCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
           GCG  Q    + GA+    +G+ GLG    S+ S L  Q L  N    C +  G G + F
Sbjct: 237 GCGQVQTGAFLDGAA---PNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFAADGLGRITF 293

Query: 219 GDD--LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
           GD+    D  +  +    S  T  Y+  V ++  GG +  L+    +FD+G+S+TYLN  
Sbjct: 294 GDNNSSLDQGKTPFNIRPSHST--YNITVTQIIVGGNSADLE-FNAIFDTGTSFTYLNNP 350

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK--KCFRTLALSFTDGK 334
            Y+ +T     ++  +    +  D+          PF+  +D++  +      ++ T   
Sbjct: 351 AYKQITQSFDSKIKLQRHSFSNSDDL---------PFEYCYDLRTNQTIEVPNINLTMKG 401

Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
               F + P       N G +CL +L    V +   N + G
Sbjct: 402 GDNYFVMDPIITSGGGNNGVLCLAVLKSNNVNIIGQNFMTG 442


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 142/332 (42%), Gaps = 38/332 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +T  +G P    +   DTGSD+ WLQC+ PC +C     P++ PS       +PC 
Sbjct: 85  GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCS 143

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C   H+    +C D   C Y++ Y D   S G L  D  +   T+G  ++ P++ +G
Sbjct: 144 SKLC---HSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIG 200

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGFLF 217
           CG +   G       GI+GLG G  S+++QL S   I     +CL             L 
Sbjct: 201 CGTDNA-GTFGGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILS 257

Query: 218 FGDDLYDSSRVVWTS--MSSDYTKY------YSPGVAELFFGGETTGLKNL-PVVFDSGS 268
           FGD    S   V ++  +  D   Y      +S G   + FGG + G  +   ++ DSG+
Sbjct: 258 FGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGT 317

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR-----PFKNVH----DV 319
           + T +    Y  L S +   +    + +   ++   LC+  +      P   VH    DV
Sbjct: 318 TLTLIPSDVYTNLESAVVDLVKLDRVDDP--NQQFSLCYSLKSNEYDFPIITVHFKGADV 375

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
           +    +  +  TDG     F+ +P+   I  N
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGN 407


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 152/364 (41%), Gaps = 40/364 (10%)

Query: 39  FQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PH 91
           F V G  + Y  G Y   + +G P + +++ +DTGSD+ W+ C + C  C ++     P 
Sbjct: 54  FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPL 112

Query: 92  PLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
             + P    +  L+ C D  C+         C     QC Y  +Y DG  + G  V D  
Sbjct: 113 NFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLL 172

Query: 147 AFNYTNGQRL---NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
            F+   G  +   +  +  GC  +Q      S   +DGI G G+   S++SQ+ SQ +  
Sbjct: 173 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 232

Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL---- 257
            V  HCL G GGG             +V++ +      +Y+  +  +   G++  +    
Sbjct: 233 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS-QPHYNLNLQSISVNGKSLAIDPEV 291

Query: 258 ----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
                N   + DSG++  YL    Y    S         ++ EA      PL  KG + +
Sbjct: 292 FATSTNRGTIVDSGTTLAYLAEEAYDPFVS---------AITEAVSQSVRPLLSKGTQCY 342

Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQDLNV 372
                VK  F T++L+F  G +     L PE YL+  N  G+  +  +   ++  Q + +
Sbjct: 343 LITSSVKGIFPTVSLNFAGGVS---MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITI 399

Query: 373 IGGI 376
           +G +
Sbjct: 400 LGDL 403


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 29/278 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y ++  +G P+   F  LDTGSD+ WLQC  PC +C E   P++  S       +PC 
Sbjct: 87  GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQ-PCKKCYEQTTPIFDSSKSQTYKTLPCP 145

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C S+       C     C Y + Y DG  SLG L  +      TNG  +  P   +G
Sbjct: 146 SNTCQSVQG---TFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIG 202

Query: 164 CG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG---GGGFLFFG 219
           CG YN +     +   GI+GLG+G  S+++QL      +    +CL  G       L FG
Sbjct: 203 CGRYNAIGIEEKN--SGIVGLGRGPMSLITQLSPSTGGK--FSYCLVPGLSTASSKLNFG 258

Query: 220 DDLYDSSR-VVWTSMSSD--------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           +    S R  V T + S           + +S G   + FG   +G K   ++ DSG++ 
Sbjct: 259 NAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKG-NIIIDSGTTL 317

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           T L    Y  L + + K +  + +++   ++ L LC+K
Sbjct: 318 TALPNGVYSKLEAAVAKTVILQRVRD--PNQVLGLCYK 353


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 152/366 (41%), Gaps = 71/366 (19%)

Query: 42  HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---- 97
           + N  PT  Y V + IG P +P  L LDTGSDL W QC  PCV C + P P +  S    
Sbjct: 26  YDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCK-PCVSCFDQPLPYFDTSRSST 84

Query: 98  NDLVPCE------DP---ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
           N L+PCE      DP   +C  L+       +    C Y   Y D   ++G+L  D F F
Sbjct: 85  NALLPCESTQCKLDPTVTVCVKLN-------QTVQTCAYYTSYGDNSVTIGLLAADKFTF 137

Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
               G  L P +  GCG N   G       GI G G+G  S+ SQL           HC 
Sbjct: 138 --VAGTSL-PGVTFGCGLNNT-GVFNSNETGIAGFGRGPLSLPSQLKVGNF-----SHCF 188

Query: 209 SGGGGG-----FLFFGDDLYDSSR-VVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLP 261
           +   G       L    DL+ + +  V T+    Y K  + P +  L   G T G   LP
Sbjct: 189 TTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLP 248

Query: 262 V--------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDET-LPL 305
           V              + DSG+S T L    YQ    +++ E +A+  L   P + T    
Sbjct: 249 VPESAFALTNGTGGTIIDSGTSITSLPPQVYQ----VVRDEFAAQIKLPVVPGNATGHYT 304

Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL--IISNKGN--VCLGILN 361
           C+    P +   DV K    L L F +G T    +L  E Y+  +  + GN  +CL I  
Sbjct: 305 CFSA--PSQAKPDVPK----LVLHF-EGAT---MDLPRENYVFEVPDDAGNSIICLAINK 354

Query: 362 GAEVGL 367
           G E  +
Sbjct: 355 GDETTI 360


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 152/364 (41%), Gaps = 40/364 (10%)

Query: 39  FQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PH 91
           F V G  + Y  G Y   + +G P + +++ +DTGSD+ W+ C + C  C ++     P 
Sbjct: 69  FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPL 127

Query: 92  PLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
             + P    +  L+ C D  C+         C     QC Y  +Y DG  + G  V D  
Sbjct: 128 NFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLL 187

Query: 147 AFNYTNGQRL---NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
            F+   G  +   +  +  GC  +Q      S   +DGI G G+   S++SQ+ SQ +  
Sbjct: 188 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 247

Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL---- 257
            V  HCL G GGG             +V++ +      +Y+  +  +   G++  +    
Sbjct: 248 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS-QPHYNLNLQSISVNGKSLAIDPEV 306

Query: 258 ----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
                N   + DSG++  YL    Y    S         ++ EA      PL  KG + +
Sbjct: 307 FATSTNRGTIVDSGTTLAYLAEEAYDPFVS---------AITEAVSQSVRPLLSKGTQCY 357

Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQDLNV 372
                VK  F T++L+F  G +     L PE YL+  N  G+  +  +   ++  Q + +
Sbjct: 358 LITSSVKGIFPTVSLNFAGGVS---MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITI 414

Query: 373 IGGI 376
           +G +
Sbjct: 415 LGDL 418


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 88/330 (26%), Positives = 136/330 (41%), Gaps = 56/330 (16%)

Query: 44  NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PHPLYRP-- 96
           + + TG Y   +Y+G P + +++ +DTGSD+ W+ C  PC  C  A     P  ++ P  
Sbjct: 41  DTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNC-VPCTNCKRASNVALPISIFDPEK 99

Query: 97  --SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNY--- 150
             S   + C D  C   +   +  C  +   C Y   Y DG S+ G L+ D  +FN    
Sbjct: 100 STSKTSISCTDEEC---YLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPS 156

Query: 151 --TNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
             +       RL  GCG NQ         DG++G G+ + S+ SQL  Q +  N+  HCL
Sbjct: 157 GNSTATSGTARLTFGCGSNQ---TGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCL 213

Query: 209 SG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
            G   G G L  G        +V+T +    + Y    V  L  G   T +         
Sbjct: 214 QGDNKGSGTLVIGH--IREPGLVYTPIVPKQSHY---NVELLNIGVSGTNVTTPTAFDLS 268

Query: 262 ----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
               V+ DSG++ TYL +  Y         +  AK +++      LP+       F+   
Sbjct: 269 NSGGVIMDSGTTLTYLVQPAYD--------QFQAK-VRDCMRSGVLPVA------FQFFC 313

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
            ++  F  + L F  G       L+P +YL
Sbjct: 314 TIEGYFPNVTLYFAGGAA---MLLSPSSYL 340


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 85/271 (31%), Positives = 123/271 (45%), Gaps = 27/271 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVP 102
           G Y + +YIG P+       DTGSDLTW+QC +PC   +C     PLY P N     L+P
Sbjct: 94  GNYLMRIYIGTPSVERLAIADTGSDLTWVQC-SPCDNTKCFAQNTPLYDPLNSSTFTLLP 152

Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
           C+   C  L     + C D   C Y   Y D   S G L  D+           N ++  
Sbjct: 153 CDSQPCTQLPY-SQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQ-LHYNSKICF 210

Query: 163 GCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
           GCG+ N+          GI+GLG G  S+VSQL  +  I +   +CL   S      L F
Sbjct: 211 GCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSNSKLKF 268

Query: 219 GD-DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGET--TGLKNLPVVFDSGSSYTYL 273
           G+  +   + VV T +    D   YY   +  +  G +T  TG  +  ++ DSGS+ TYL
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDLPFYYL-NLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
               Y    S++K+ ++ +      ED+ +P
Sbjct: 328 EESFYNEFVSLVKETVAVE------EDQYIP 352


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
           Y TG Y   + IG PA  Y++ LDTGS   W+      + C + PH          Y P 
Sbjct: 54  YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 109

Query: 97  ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
              S+  V C+D IC S   P    C    +C Y   YADGG ++G+L  D   ++  Y 
Sbjct: 110 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 164

Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
           NGQ    +  +  GCG  Q      S   +DGI+G G    + +SQL +    + +  HC
Sbjct: 165 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 224

Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
           L S  GGG    G+ +    +V  T +  +   Y+   +  +   G T  L        K
Sbjct: 225 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 282

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
                 DSGS+  YL  + Y         EL      + P D T+   +   + F  +  
Sbjct: 283 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 332

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           V   F  +   F +  T    ++ P  YL+       C G  +    G +D+ ++G
Sbjct: 333 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 385


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 94/340 (27%), Positives = 148/340 (43%), Gaps = 51/340 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + M IG P R Y   LDTGSDL W QC APC+ CV+ P P + P+       + C 
Sbjct: 88  GEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCA 146

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
            P C +L+ P  +       C Y+  Y D  S+ GVL  + F F  TN  R++ P ++ G
Sbjct: 147 SPACNALYYPLCYQ----KVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISFG 201

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGD 220
           CG   +   S     G++G G+G  S+VSQL S +       +CL+         L+FG 
Sbjct: 202 CG--NLNAGSLANGSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVPSRLYFGV 254

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV--------------- 262
               +S    +          +P +  ++F    G + G   LP+               
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGG 314

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            + DSG++ TYL    Y  + +    +++   L    +   L  C++   P +    + +
Sbjct: 315 TIIDSGTTITYLAEPAYDAVRAAFASQITLP-LLNVTDASVLDTCFQWPPPPRQSVTLPQ 373

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGI 359
               L L F DG     +EL  + Y+++  S  G +CL +
Sbjct: 374 ----LVLHF-DGAD---WELPLQNYMLVDPSTGGGLCLAM 405


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
           Y TG Y   + IG PA  Y++ LDTGS   W+      + C + PH          Y P 
Sbjct: 78  YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 133

Query: 97  ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
              S+  V C+D IC S   P    C    +C Y   YADGG ++G+L  D   ++  Y 
Sbjct: 134 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 188

Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
           NGQ    +  +  GCG  Q      S   +DGI+G G    + +SQL +    + +  HC
Sbjct: 189 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 248

Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
           L S  GGG    G+ +    +V  T +  +   Y+   +  +   G T  L        K
Sbjct: 249 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 306

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
                 DSGS+  YL  + Y         EL      + P D T+   +   + F  +  
Sbjct: 307 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 356

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           V   F  +   F +  T    ++ P  YL+       C G  +    G +D+ ++G
Sbjct: 357 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 409


>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
          Length = 142

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 74/138 (53%), Gaps = 5/138 (3%)

Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
           CGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G L+ GD
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
               S  V W  M      YYSPG+AEL    +   G      VFDSGS+YT++    Y 
Sbjct: 61  FNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119

Query: 280 TLTSIMKKELSAKSLKEA 297
            + S +   LS  SL+E 
Sbjct: 120 EIVSKVIGTLSESSLEEV 137


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 144/332 (43%), Gaps = 41/332 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           TG Y V++ +G PA+ Y +  DTGSDL+W+QC  PC  C E   PL+ PS       V C
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVAC 204

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P C  L A G   C   ++C YE++Y D   + G LV+D    + ++     P    G
Sbjct: 205 GAPECQELDASG---CSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFG 258

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
           CG +Q  G  +  +DG+ GLG+ K S+ SQ            +CL  S  G G+L  G  
Sbjct: 259 CG-DQNAGL-FGQVDGLFGLGREKVSLPSQ--GAPSYGPGFTYCLPSSSSGRGYLSLGG- 313

Query: 222 LYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGL------KNLPVVFDSGSSYTYLN 274
               +   +T+++   T  +Y   +  +  GG    +           V DSG+  T L 
Sbjct: 314 -APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLP 372

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTD 332
              Y  L +   + ++    K+AP    L  C+   G R  +          T+ L+F  
Sbjct: 373 PRAYAPLRAAFARSMA--QYKKAPALSILDTCYDFTGHRTAQ--------IPTVELAFAG 422

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           G T +L + T    L +S     CL     A+
Sbjct: 423 GATVSL-DFT--GVLYVSKVSQACLAFAPNAD 451


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 144/332 (43%), Gaps = 41/332 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           TG Y V++ +G PA+ Y +  DTGSDL+W+QC  PC  C E   PL+ PS       V C
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVAC 204

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P C  L A G   C   ++C YE++Y D   + G LV+D    + ++     P    G
Sbjct: 205 GAPECQELDASG---CSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFG 258

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
           CG +Q  G  +  +DG+ GLG+ K S+ SQ            +CL  S  G G+L  G  
Sbjct: 259 CG-DQNAGL-FGQVDGLFGLGREKVSLPSQ--GAPSYGPGFTYCLPSSSSGRGYLSLGG- 313

Query: 222 LYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGL------KNLPVVFDSGSSYTYLN 274
               +   +T+++   T  +Y   +  +  GG    +           V DSG+  T L 
Sbjct: 314 -APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLP 372

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTD 332
              Y  L +   + ++    K+AP    L  C+   G R  +          T+ L+F  
Sbjct: 373 PRAYAPLRAAFARSMA--QYKKAPALSILDTCYDFTGHRTAQ--------IPTVELAFAG 422

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           G T +L + T    L +S     CL     A+
Sbjct: 423 GATVSL-DFT--GVLYVSKVSQACLAFAPNAD 451


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
           Y TG Y   + IG PA  Y++ LDTGS   W+      + C + PH          Y P 
Sbjct: 78  YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 133

Query: 97  ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
              S+  V C+D IC S   P    C    +C Y   YADGG ++G+L  D   ++  Y 
Sbjct: 134 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 188

Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
           NGQ    +  +  GCG  Q      S   +DGI+G G    + +SQL +    + +  HC
Sbjct: 189 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 248

Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
           L S  GGG    G+ +    +V  T +  +   Y+   +  +   G T  L        K
Sbjct: 249 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 306

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
                 DSGS+  YL  + Y         EL      + P D T+   +   + F  +  
Sbjct: 307 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 356

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           V   F  +   F +  T    ++ P  YL+       C G  +    G +D+ ++G
Sbjct: 357 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 409


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
           Y TG Y   + IG PA  Y++ LDTGS   W+      + C + PH          Y P 
Sbjct: 54  YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 109

Query: 97  ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
              S+  V C+D IC S   P    C    +C Y   YADGG ++G+L  D   ++  Y 
Sbjct: 110 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 164

Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
           NGQ    +  +  GCG  Q      S   +DGI+G G    + +SQL +    + +  HC
Sbjct: 165 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 224

Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
           L S  GGG    G+ +    +V  T +  +   Y+   +  +   G T  L        K
Sbjct: 225 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 282

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
                 DSGS+  YL  + Y         EL      + P D T+   +   + F  +  
Sbjct: 283 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 332

Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           V   F  +   F +  T    ++ P  YL+       C G  +    G +D+ ++G
Sbjct: 333 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 385


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 131/333 (39%), Gaps = 53/333 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSND 99
           G Y   + +G PA+ +F+ +DTGSD+ W+ C +PC  C         +E+ +P    +  
Sbjct: 3   GLYFTRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTAS 61

Query: 100 LVPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
            + C D  C +    G   C+      + C Y   Y DG  + G  V D   F     N 
Sbjct: 62  RITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 121

Query: 154 QRLN--PRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
           Q  N    +  GC  +Q    +     +DGI G G+ + S++SQL+S  +   V  HCL 
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLK 181

Query: 210 GG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETT 255
           G   GGG L  G+ +     +V+T +      Y              P  + LF    T 
Sbjct: 182 GSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 239

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
           G      + DSG++  YL    Y    S +   +S              L  KG + F  
Sbjct: 240 G-----TIVDSGTTLAYLADGAYDPFVSAIAAAVSP---------SVRSLVSKGSQCFIT 285

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
              V   F T+ L F  G       + PE YL+
Sbjct: 286 SSSVDSSFPTVTLYFMGG---VAMSVKPENYLL 315


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 78/283 (27%), Positives = 120/283 (42%), Gaps = 33/283 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G + V +Y+G P +   + +DTGSDLTW+Q + PC  C E   P++ PS     + + C 
Sbjct: 23  GEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSE-PCRACFEQADPIFDPSKSSTYNKIACS 81

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              CA L   G   C   A C Y   Y DG  + G   K+      T G+ +      G 
Sbjct: 82  SSACADLL--GTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK----FGA 135

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
                        +GILGLG+G  S+ SQL S  ++ N   +CL     +G     ++FG
Sbjct: 136 SVYNTGTFGDTGGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETSTMYFG 193

Query: 220 DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP----------VVFDSG 267
           D    S  V +T +  ++D+  YY   V  +  GG    +               + DSG
Sbjct: 194 DAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSG 253

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           ++ TYL +  +  L +    ++   +   A     L LC+  R
Sbjct: 254 TTITYLQQEVFNALVAAYTSQVRYPTTTSA---TGLDLCFNTR 293


>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
          Length = 142

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 76/138 (55%), Gaps = 5/138 (3%)

Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
           CGY Q   A     P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G L+ G+
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
               S  V W  M  + + YYSPG+AEL    +   G      VFDSGS+YT +    Y 
Sbjct: 61  FNPPSRGVTWVPM-RESSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYN 119

Query: 280 TLTSIMKKELSAKSLKEA 297
            + S ++  LS  SL+E 
Sbjct: 120 EIVSKVRGTLSESSLEEV 137


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 157/374 (41%), Gaps = 49/374 (13%)

Query: 25  SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV 84
           S +++   VGS +   V G    +G Y V + +G P    +L +D+GSD+ W+QC  PC 
Sbjct: 110 SPTTMTTEVGSEV---VSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCR-PCA 165

Query: 85  RCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGV 140
            C +   PL+ P+       VPC+  +C +L   G   C D   C Y++ Y DG  + GV
Sbjct: 166 ECYQQADPLFDPAASASFTAVPCDSGVCRTLPG-GSSGCADSGACRYQVSYGDGSYTQGV 224

Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
           L  +   F  +   +    +A+GCG+       +    G+LGLG G  S+V QL      
Sbjct: 225 LAMETLTFGDSTPVQ---GVAIGCGHRNR--GLFVGAAGLLGLGWGPMSLVGQLGGAAG- 278

Query: 201 RNVVGHCLSG----GGGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE- 253
                +CL+      G G L FG D       VW  +  ++    +Y  G+  L  GGE 
Sbjct: 279 -GAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGER 337

Query: 254 ---TTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
                GL +L       VV D+G++ T L    Y  L       +    L  AP    L 
Sbjct: 338 LPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGD-LPRAPGVSLLD 396

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSF-TDGKTRTLFELTPEAYLIISNKGNV-CLGILNG 362
            C+           V+    T+AL F  DG   TL    P   L++   G V CL     
Sbjct: 397 TCYD----LSGYASVR--VPTVALYFGRDGAALTL----PARNLLVEMGGGVYCLAFAAS 446

Query: 363 AEVGLQDLNVIGGI 376
           A      L+++G I
Sbjct: 447 AS----GLSILGNI 456


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 146/366 (39%), Gaps = 47/366 (12%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAP-- 90
           F V G   P   G Y   + +G P R +++ +DTGSD+ W+ C +    P    +  P  
Sbjct: 38  FPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLN 97

Query: 91  --HPLYRPSNDLVPCEDPICA-SLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFA 147
              P   P+  L+ C D  C+  L +           C Y  +Y DG  + G  V D   
Sbjct: 98  FFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLH 157

Query: 148 FNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
           F+   G  +    +  +  GC   Q      S   +DGI G G+   S+VSQL SQ +  
Sbjct: 158 FDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISP 217

Query: 202 NVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
               HCL G   GGG L  G+ +     +V+T +      +Y+  +  +   G+T  +  
Sbjct: 218 RAFSHCLKGDDSGGGILVLGEIV--EPNIVYTPLVPS-QPHYNLNMQSISVNGQTLAID- 273

Query: 260 LPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
            P VF          DSG++  YL    Y    S +   +S             P   KG
Sbjct: 274 -PSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSP---------SVRPYLSKG 323

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQ 368
              +     +   F  ++L+F  G +  L    P+ YLI  S+ G   L  +   ++  Q
Sbjct: 324 NHCYLISSSINDIFPQVSLNFAGGASMILI---PQDYLIQQSSIGGAALWCIGFQKIQGQ 380

Query: 369 DLNVIG 374
            + ++G
Sbjct: 381 GITILG 386


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 100/345 (28%), Positives = 147/345 (42%), Gaps = 55/345 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + + IG P   Y   LDTGSDL W QC  PC RC + P P++ P    S   V C 
Sbjct: 106 GEYLIELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTRCYKQPTPIFDPKKSSSFSKVSCG 164

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C++L +     C D   C+Y   Y D   + GVL  + F F  +  +     +  GC
Sbjct: 165 SSLCSALPS---STCSD--GCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGC 219

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD- 220
           G +   G  +    G++GLG+G  S+VSQL  Q+       +CL+         L  G  
Sbjct: 220 GEDN-EGDGFEQASGLVGLGRGPLSLVSQLKEQRF-----SYCLTPIDDTKESVLLLGSL 273

Query: 221 -DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLK-----NLPVVFDSG 267
             + D+  VV T +  +       Y    +  V +     E +  +     N  V+ DSG
Sbjct: 274 GKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSG 333

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVHDVKKCF 323
           ++ TY+ +  Y+ L    KKE  +++  +   D+T    L LC+        V   K  F
Sbjct: 334 TTITYVQQKAYEAL----KKEFISQT--KLALDKTSSTGLDLCFSLPSGSTQVEIPKLVF 387

Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGL 367
                 F  G      EL  E Y+I  SN G  CL +  GA  G+
Sbjct: 388 H-----FKGGD----LELPAENYMIGDSNLGVACLAM--GASSGM 421


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 96/367 (26%), Positives = 154/367 (41%), Gaps = 44/367 (11%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA--PCVRCVEAPHPLYRPSN 98
           V G    +G Y V + +G P +   L  DTGSDL W++C A   C R       L R S 
Sbjct: 79  VSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHST 138

Query: 99  DLVP--CEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
              P  C D  C  +  P HH C      + C YE  Y DG  + G   K+    N ++G
Sbjct: 139 TFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSG 198

Query: 154 QRLNPR-LALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVG 205
           +    + +A GC +      V GAS++   G++GLG+G  S+ SQL  +   K    ++ 
Sbjct: 199 REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMD 258

Query: 206 HCLSGGGGGFLFFGDDLYDSS----RVVWTSMSSD--YTKYYSPGVAELFFGG------- 252
           H +S     +L  G    D +    R+ +T +  +     +Y  G+  +   G       
Sbjct: 259 HDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINP 318

Query: 253 ---ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
                  L N   + DSG++ T+L    Y  + +++K+ +   S  E        LC   
Sbjct: 319 SVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAE--PTPGFDLCV-- 374

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD 369
                NV +++   R   LSF  G   ++F   P  Y + +++   CL +   A +    
Sbjct: 375 -----NVSEIEHP-RLPKLSFKLGGD-SVFSPPPRNYFVDTDEDVKCLAL--QAVMTPSG 425

Query: 370 LNVIGGI 376
            +VIG +
Sbjct: 426 FSVIGNL 432


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/353 (25%), Positives = 141/353 (39%), Gaps = 46/353 (13%)

Query: 29  LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----P 82
           +    G  + F V G   P   G Y   + +G P + +++ +DTGSD+ W+ C++    P
Sbjct: 59  MLQSSGGVIDFSVSGTYDPFLVGLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCP 118

Query: 83  CVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSS 137
               ++ P   + P    +  LV C D ICA         C   + QC Y  +Y DG  +
Sbjct: 119 ATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGT 178

Query: 138 LGVLVKDAF----AFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIV 191
            G  V D        + +     +  +  GC  +Q      S   +DGI G G+   S++
Sbjct: 179 SGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVI 238

Query: 192 SQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
           SQL S+ +   V  HCL G   GGG L  G+ +     VV+T +      +Y+  +  + 
Sbjct: 239 SQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIV--EPNVVYTPLVPS-QPHYNLNLQSIS 295

Query: 250 FGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
             G+   +   P VF          DSG++  YL    Y      +   +S         
Sbjct: 296 VNGQVLPIS--PAVFATSSSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVS--------- 344

Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
             T  +  KG R +     V   F  ++L+F  G +     L  + YLI  N 
Sbjct: 345 QSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGAS---LVLGAQDYLIQQNS 394


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 100/351 (28%), Positives = 148/351 (42%), Gaps = 50/351 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR  F+ LDTGSD+ W+QC APC +C     P++ P+       +PC
Sbjct: 144 SGEYFTRLGVGTPARYVFMVLDTGSDVVWIQC-APCKKCYSQTDPVFNPTKSRSFANIPC 202

Query: 104 EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
             P+C  L +PG   C      C Y++ Y DG  + G    +   F    G R+  R+AL
Sbjct: 203 GSPLCRRLDSPG---CSTKKHICLYQVSYGDGSFTYGEFSTETLTF---RGTRVG-RVAL 255

Query: 163 GCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
           GCG+ N+        L G+        S + +  S+K    +V    S     ++ FGD 
Sbjct: 256 GCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSAS-SKPSYMVFGDS 314

Query: 222 LYDSSRVVWTSMSSDY---TKYY------------SPGVAELFFGGETTGLKNLPVVFDS 266
              S    +T + S+    T YY             PG+    F  ++TG  N  V+ DS
Sbjct: 315 AI-SRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTG--NGGVIIDS 371

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+S T L R  Y  L    +  + A +LK APE      C+          +VK    T+
Sbjct: 372 GTSVTRLTRPAYVALRDAFR--VGASNLKRAPEFSLFDTCFD----LSGKTEVK--VPTV 423

Query: 327 ALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            L F          L    YLI + N G+ C          +  L+++G I
Sbjct: 424 VLHFRGADV----SLPASNYLIPVDNSGSFCFAFAG----TMSGLSIVGNI 466


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 164/373 (43%), Gaps = 39/373 (10%)

Query: 10  LCFPTVRMSSSSSSSS--SSSLFNHVGSSL--LFQVHGNVYPTGYYNVTMYIGQPARPYF 65
           L F  + +SS  + +   +  LF    +++  + Q   N Y  G + + +YIG P     
Sbjct: 24  LLFHVLHLSSIEAQNDGFTIKLFRKTSNNIQNIVQAPINAY-IGQHLMEIYIGTPPIKIT 82

Query: 66  LDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCED 121
             +DTGSDL W+QC APC+ C +   P++ P    + + + C+ P+C  L       C  
Sbjct: 83  GLVDTGSDLIWIQC-APCLGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDT---GVCSP 138

Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHPLDGI 180
             +C+Y   Y D   + GVL +D   F    G+ ++  R   GCG+N   G + H + G+
Sbjct: 139 EKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEM-GL 197

Query: 181 LGLGKGKSSIVSQL--------HSQKLIRNVVGHCLSGG---GGGFLFFGDDLYDSSRVV 229
           +GLG G +S++SQ+         SQ L+  +    +S     G G    G+ +  +  V 
Sbjct: 198 IGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVP 257

Query: 230 WTSMSSDYTKYYSPGVAELFFG-GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
               +S +       V + +F    T G  N+ V  DSG+    L +  Y  + + ++ +
Sbjct: 258 REKDTSYFVTLLGISVEDTYFPMNSTIGKANMLV--DSGTPPILLPQQLYDKVFAEVRNK 315

Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
           ++ K + + P   T  LC++ +    N+      F  +  +      +T    TP+    
Sbjct: 316 VALKPITDDPSLGT-QLCYRTQ---TNLKGPTLTFHFVGANVLLTPIQTFIPPTPQT--- 368

Query: 349 ISNKGNVCLGILN 361
              KG  CL I N
Sbjct: 369 ---KGIFCLAIYN 378


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/303 (30%), Positives = 129/303 (42%), Gaps = 56/303 (18%)

Query: 45  VYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           V P+G   Y V + IG P +P    LDTGSDL W QC APC  C+  P PL+ P    S 
Sbjct: 88  VRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLSQPDPLFAPGQSASY 146

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
           + + C   +C+ +    HH+CE P  C Y   Y DG  ++GV   + F F  + G  L  
Sbjct: 147 EPMRCAGTLCSDIL---HHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTT 203

Query: 159 R---LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-- 213
               L  GCG   V   S +   GI+G G+   S+VSQL  ++       +CL+      
Sbjct: 204 TTVPLGFGCGSVNV--GSLNNGSGIVGFGRNPLSLVSQLSIRRF-----SYCLTSYASRR 256

Query: 214 -GFLFFG---DDLYDSS--RVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
              L FG   D +Y  +  RV  T +     + T YY      + F G T G + L    
Sbjct: 257 QSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYY------VHFTGLTVGARRLRIPE 310

Query: 262 ------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA-PEDET---LPL 305
                       V+ DSG++ T L       +    +++L         PED     +P 
Sbjct: 311 SAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPA 370

Query: 306 CWK 308
            W+
Sbjct: 371 AWR 373


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/380 (25%), Positives = 153/380 (40%), Gaps = 56/380 (14%)

Query: 34  GSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
           G  + F V G   P   G Y   + +G P + + + +DTGSD+ W+ C+  C  C ++  
Sbjct: 59  GGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNT-CSNCPQSSQ 117

Query: 92  ---------PLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVL 141
                     +   +  L+PC DPIC S        C     QC Y  +Y DG  + G  
Sbjct: 118 LGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYY 177

Query: 142 VKDAFAFNYTNGQ----RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLH 195
           V DA  F+   GQ      +  +  GC  +Q    +     +DGI G G G  S+VSQL 
Sbjct: 178 VSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLS 237

Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
           S+ +   V  HCL G G G             +V++ +      +Y+  +  +   G+  
Sbjct: 238 SRGITPKVFSHCLKGDGDGGGVLVLGEILEPSIVYSPLVPS-QPHYNLNLQSIAVNGQLL 296

Query: 256 GLKNLPVVF-----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
            +   P VF           D G++  YL +  Y  L + +   +S  + +         
Sbjct: 297 PIN--PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNS------ 348

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
              KG + +     +   F +++L+F  G +     L PE YL+ +       G L+GAE
Sbjct: 349 ---KGNQCYLVSTSIGDIFPSVSLNFEGGASMV---LKPEQYLMHN-------GYLDGAE 395

Query: 365 ---VGLQDLNVIGGI-GDFV 380
              +G Q       I GD V
Sbjct: 396 MWCIGFQKFQEGASILGDLV 415


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 143/347 (41%), Gaps = 68/347 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPC 103
           +G Y V + IG P   Y   +DTGSDL W QC APC+ C + P P +      +   +PC
Sbjct: 86  SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPC 144

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLAL 162
               CASL +P          C Y+  Y D  S+ GVL  + F F   N  ++    +A 
Sbjct: 145 RSSRCASLSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF 200

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
           GCG   +         G++G G+G  S+VSQL   +       +CL+         L+FG
Sbjct: 201 GCG--SLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF-----SYCLTSYLSATPSRLYFG 253

Query: 220 DDLYDSSRVVWTSMSSDYTK----------YYSPGVAELFF---GGETTGLKNLP----- 261
                    V+ ++SS  T             +P +  ++F      + G K LP     
Sbjct: 254 ---------VYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLV 304

Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
                     V+ DSG+S T+L +  Y+ +   +   +   ++ +   D  L  C++   
Sbjct: 305 FAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMND--TDIGLDTCFQWPP 362

Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL 357
           P     +V      L   F D    TL    PE Y++I S  G +CL
Sbjct: 363 P----PNVTVTVPDLVFHF-DSANMTLL---PENYMLIASTTGYLCL 401


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/339 (25%), Positives = 155/339 (45%), Gaps = 49/339 (14%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PHPL 93
           F + GN    G Y   + +G P +   + +DTGSD+ W++C +PC  C+       P  +
Sbjct: 71  FPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSI 129

Query: 94  YR----PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
           Y      ++ +  C DP+C         +  + A C Y   Y D  +S+G  V+D   + 
Sbjct: 130 YNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNSA-CAYVSSYQDKSASVGAYVRDDMHYV 188

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
              G     R+  GC  N + G+   P+DGI+G G    ++ +Q+ +Q+ +  V  HCL 
Sbjct: 189 LHGGNATTSRIFFGCATN-ITGS--WPVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLG 245

Query: 210 GG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL---------- 257
           G   GGG L FG+   +++ +V+T +  + T +Y+  +  +    +   +          
Sbjct: 246 GEKHGGGILEFGEAP-NTTEMVFTPL-LNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRN 303

Query: 258 --KNLPVVFDSGSSYTYL----NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
              N  V+ DSG+++  L    NR+ +Q + S+   +L        P+ E L   +    
Sbjct: 304 STNNTGVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKL-------GPKLEGLECFY---- 352

Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS 350
             K+   ++  F  + L+F+ G T    +L P+ YL+++
Sbjct: 353 -LKSGLTMETSFPNVTLTFSGGST---MKLKPDNYLVMA 387


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 116/262 (44%), Gaps = 22/262 (8%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
           G+   +G Y VT+ +G P     L  DTGSDLTW QC  PCVR C +   P++ PS    
Sbjct: 96  GSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQ-PCVRTCYDQKEPIFNPSKSTS 154

Query: 101 ---VPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
              V C    C SL  A G+      + C Y ++Y D   S+G L K+ F    TN    
Sbjct: 155 YYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTL--TNSDVF 212

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
           +  +  GCG N      +  + G+LGLG+ K S  SQ  +      +  +CL  S    G
Sbjct: 213 D-GVYFGCGENN--QGLFTGVAGLLGLGRDKLSFPSQ--TATAYNKIFSYCLPSSASYTG 267

Query: 215 FLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGS 268
            L FG   +  S +    S  +D T +Y   +  +  GG+     +T       + DSG+
Sbjct: 268 HLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGT 327

Query: 269 SYTYLNRVTYQTLTSIMKKELS 290
             T L    Y  L S  K ++S
Sbjct: 328 VITRLPPKAYAALRSSFKAKMS 349


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 122/283 (43%), Gaps = 27/283 (9%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----V 101
           Y   YY ++  IG P    +  +DTGSD  W QC  PC  C+    P++ PS       +
Sbjct: 85  YAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCK-PCKPCLNQTSPIFNPSKSSTYKNI 143

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRL 160
            C  PIC         +     +C+YE+ Y D   S G + KD    N  +G  ++ P++
Sbjct: 144 RCSSPICKR-GEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKI 202

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-----GGGGGF 215
            +GCG+      +     GI+G G+G  SIVSQL S   I     +CL+           
Sbjct: 203 VIGCGHKN-SLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISSK 259

Query: 216 LFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKN---LP-----VVFD 265
           L+FGD    S   V ++  + S Y   Y   +     G     LK+   +P      V D
Sbjct: 260 LYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVID 319

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           SGS+ T L    Y  L + +   +  K +K+  +   L LC+K
Sbjct: 320 SGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQ--LSLCYK 360


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/327 (28%), Positives = 142/327 (43%), Gaps = 42/327 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSN----DLVPCED 105
           + VT+  G PA+ Y +  DTGSD++W+QC  PC   C +   P++ P+      +VPC  
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
           P CA+       N      C Y++EY DG SS GVL  +  +   T   R  P  A GCG
Sbjct: 194 PQCAAADGSKCSN----GTCLYKVEYGDGSSSAGVLSHETLSLTST---RALPGFAFGCG 246

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFFGDDLY 223
              +    +  +DG++GLG+G+ S+ SQ  +         +CL       G+L  G    
Sbjct: 247 QTNL--GDFGDVDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTIGPTTP 302

Query: 224 DSS-RVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTYL 273
            S+  V +T+M    DY  +Y   +  +  GG    L   P +F       DSG+  TYL
Sbjct: 303 ASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYI--LPVPPTLFTDDGTFLDSGTILTYL 360

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L    K   +    K AP  +    C+     F     +      ++  F+DG
Sbjct: 361 PPEAYTALRDRFK--FTMTQYKPAPAYDPFDTCYD----FTGQSAI--FIPAVSFKFSDG 412

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGIL 360
              ++F+L+    LI  +     +G L
Sbjct: 413 ---SVFDLSFFGILIFPDDTAPAIGCL 436


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/285 (28%), Positives = 123/285 (43%), Gaps = 38/285 (13%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +T  +G P    +  +DTGSD+ WLQC+ PC  C     P++ PS       +PC 
Sbjct: 85  GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCE-PCQECYNQTTPMFNPSKSSSYKNIPCP 143

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C S+      +C D   C+Y   Y D   S G L  D      TNG  ++ P + +G
Sbjct: 144 SKLCQSME---DTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIG 200

Query: 164 CGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---------GG 211
           CG N +    GAS     GI+G G G +S ++QL S    +    +CL+           
Sbjct: 201 CGTNNILSYEGAS----SGIVGFGSGPASFITQLGSSTGGK--FSYCLTPLFSVTNIQSN 254

Query: 212 GGGFLFFGDDLYDSSRVVWTS--MSSDYTKYY-------SPGVAELFFGGETTGLKNLPV 262
               L FGD    S   V T+  +  D   +Y       S G   +  GG   G     +
Sbjct: 255 ATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNI 314

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           + DSG++ T L +  Y  L S +   +  + + +    +TL LC+
Sbjct: 315 IIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPT--QTLNLCY 357


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/336 (28%), Positives = 139/336 (41%), Gaps = 33/336 (9%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
           G+   +G Y VT+ +G P     L  DTGSDLTW QC  PCVR C +   P++ PS    
Sbjct: 124 GSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQ-PCVRTCYDQKEPIFNPSKSTS 182

Query: 101 ---VPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
              V C    C SL  A G+      + C Y ++Y D   S+G L K+ F    TN    
Sbjct: 183 YYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTL--TNSDVF 240

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
           +  +  GCG N      +  + G+LGLG+ K S  SQ  +      +  +CL  S    G
Sbjct: 241 D-GVYFGCGENNQ--GLFTGVAGLLGLGRDKLSFPSQ--TATAYNKIFSYCLPSSASYTG 295

Query: 215 FLFFGD-DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGS 268
            L FG   +  S +    S  +D T +Y   +  +  GG+     +T       + DSG+
Sbjct: 296 HLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGT 355

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             T L    Y  L S  K ++S            L  C+     FK V   K     +A 
Sbjct: 356 VITRLPPKAYAALRSSFKAKMSKYPTTSGV--SILDTCFD-LSGFKTVTIPK-----VAF 407

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           SF+ G    + EL  +    +     VCL     ++
Sbjct: 408 SFSGG---AVVELGSKGIFYVFKISQVCLAFAGNSD 440


>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
 gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
          Length = 490

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 158/359 (44%), Gaps = 45/359 (12%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
           S    +H ++   GYY   + IG P   + L +D  S   ++              P + 
Sbjct: 20  SARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDRSS---FVSPKTMFCSFFFLQDPRFS 76

Query: 96  P--SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN- 152
           P  S+   P E   C +  + G   C+   +  Y+ +YA+  +S GVL KD  +F+ ++ 
Sbjct: 77  PALSSSYKPLE---CGNECSTGF--CDGSRK--YQRQYAEKSTSSGVLGKDVISFSNSSD 129

Query: 153 --GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
             GQRL      GC   +         DGI+GLG+G  SI+ QL  +  + +V   C  G
Sbjct: 130 LGGQRL----VFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGG 185

Query: 211 ---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP 261
              GGG  +  G        +V+TS     + YY+  +  +  GG    LK         
Sbjct: 186 MDEGGGAMILGG--FQPPKDMVFTSSDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYG 243

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA--PEDETLPLCWKGRRPFKNVHDV 319
            V DSG++Y Y     +Q   S +K+++   SLKE   P+++   +C+ G     NV ++
Sbjct: 244 TVLDSGTTYAYFPGAAFQAFKSAVKEQVG--SLKEVPGPDEKFKDICYAGAG--TNVSNL 299

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
            + F ++   F DG++ T   L+PE YL    K  G  CLG+    +       ++GGI
Sbjct: 300 SQFFPSVDFVFGDGQSVT---LSPENYLFRHTKISGAYCLGVFENGD----PTTLLGGI 351


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 145/317 (45%), Gaps = 41/317 (12%)

Query: 43  GNVYPTG-----YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VE 88
           G+++P+G      Y   + +G P   + + LDTGSDL W+ CD  C++C         ++
Sbjct: 89  GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCAPLSSYHGSLD 146

Query: 89  APHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLV 142
               +Y+PS       +PC   +C+         C +P Q C Y ++Y ++  +S G+L+
Sbjct: 147 RDLGIYKPSESTTSRHLPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLI 201

Query: 143 KDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQK 198
           +D    +   G   +N  + +GCG  Q    SY      DG+LGLG    S+ S L    
Sbjct: 202 EDMLHLDSREGHAPVNASVIIGCGKKQ--SGSYLEGIAPDGLLGLGMADISVPSFLARAG 259

Query: 199 LIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
           L+RN    C      G +FFGD    + +       +   + Y+  V +   G + T   
Sbjct: 260 LVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGA 319

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
               + D+G+S+T L    Y+++T    K+++A   + + +D +   C+    P + + D
Sbjct: 320 GFQALVDTGTSFTSLPLDAYKSITMEFDKQINAS--RASSDDYSFEYCYS-TGPLE-MPD 375

Query: 319 VKKCFRTLALSFTDGKT 335
           V     T+ L+F + K+
Sbjct: 376 VP----TITLTFAENKS 388


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 77/153 (50%), Gaps = 10/153 (6%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y + + +G P +P    LDTGSDL W QCD  C  C+  P PL+ P    S + + C   
Sbjct: 98  YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           +C  +    HH+C  P  C Y   Y DG ++LG    + F F  ++G+  +  L  GCG 
Sbjct: 157 LCGDIL---HHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGT 213

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             V   S +   GI+G G+   S+VSQL  ++ 
Sbjct: 214 MNV--GSLNNASGIVGFGRDPLSLVSQLSIRRF 244


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/334 (29%), Positives = 142/334 (42%), Gaps = 45/334 (13%)

Query: 52  NVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPI 107
           N    +G  A    + +DT S+LTW+QC  PC  C +   PL+ PS+      VPC    
Sbjct: 119 NYVATVGLGAAEATVVVDTASELTWVQCQ-PCESCHDQQDPLFDPSSSPSYAAVPCNSSS 177

Query: 108 CASLH---APGHHNCED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
           C +L    A G   C D     PA C Y L Y DG  S GVL +D        GQ +   
Sbjct: 178 CDALRVAMAAGTSPCADDNEQQPA-CSYALSYRDGSYSRGVLARDKLRL---AGQDIE-G 232

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFL 216
              GCG +   GA +    G++GLG+   S+VSQ   Q     V  +CL     G  G L
Sbjct: 233 FVFGCGTSN-QGAPFGGTSGLMGLGRSHVSLVSQTMDQ--FGGVFSYCLPMRESGSSGSL 289

Query: 217 FFGDD---LYDSSRVVWTSMSSD----YTKYYSPGVAELFFGG---ETTGLKNLPVVFDS 266
             GDD     +S+ +V+T+M SD       +Y   +  +  GG   E+       V+ DS
Sbjct: 290 VLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSAGRVIIDS 349

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y  + +    +L+     +AP    L  C+       N+  +K+  +  
Sbjct: 350 GTIITTLVPSVYNAVRAEFLSQLA--EYPQAPAFSILDTCF-------NLTGLKE-VQVP 399

Query: 327 ALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGI 359
           +L F  +G      +     Y + S+   VCL +
Sbjct: 400 SLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLAL 433


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 77/153 (50%), Gaps = 10/153 (6%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y + + +G P +P    LDTGSDL W QCD  C  C+  P PL+ P    S + + C   
Sbjct: 98  YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           +C  +    HH+C  P  C Y   Y DG ++LG    + F F  ++G+  +  L  GCG 
Sbjct: 157 LCGDIL---HHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGT 213

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             V   S +   GI+G G+   S+VSQL  ++ 
Sbjct: 214 MNV--GSLNNASGIVGFGRDPLSLVSQLSIRRF 244


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 145/317 (45%), Gaps = 41/317 (12%)

Query: 43  GNVYPTG-----YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VE 88
           G+++P+G      Y   + +G P   + + LDTGSDL W+ CD  C++C         ++
Sbjct: 89  GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCAPLSSYHGSLD 146

Query: 89  APHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLV 142
               +Y+PS       +PC   +C+         C +P Q C Y ++Y ++  +S G+L+
Sbjct: 147 RDLGIYKPSESTTSRHLPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLI 201

Query: 143 KDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQK 198
           +D    +   G   +N  + +GCG  Q    SY      DG+LGLG    S+ S L    
Sbjct: 202 EDMLHLDSREGHAPVNASVIIGCGKKQ--SGSYLEGIAPDGLLGLGMADISVPSFLARAG 259

Query: 199 LIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
           L+RN    C      G +FFGD    + +       +   + Y+  V +   G + T   
Sbjct: 260 LVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGA 319

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
               + D+G+S+T L    Y+++T    K+++A   + + +D +   C+    P + + D
Sbjct: 320 GFQALVDTGTSFTSLPLDAYKSITMEFDKQINAS--RASSDDYSFEYCYS-TGPLE-MPD 375

Query: 319 VKKCFRTLALSFTDGKT 335
           V     T+ L+F + K+
Sbjct: 376 VP----TITLTFAENKS 388


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 143/345 (41%), Gaps = 45/345 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y +T+  G P R   +  DTGSD+ WLQC    VRC     PL+ PS       V C
Sbjct: 13  SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSC 72

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            +P C  L   G  +    + C Y + Y DG S++G L  D F    T  Q+       G
Sbjct: 73  TEPACVGLSTRGCSS----STCLYGVFYGDGSSTIGFLAMDTFML--TPAQKFK-NFIFG 125

Query: 164 CGYNQVPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGD 220
           CG N      +    G++GLG+  + S+ SQ+     + NV  +CL  +    G+L  G+
Sbjct: 126 CGQNNT--GLFQGTAGLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATGYLNIGN 181

Query: 221 DLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG-----ETTGLKNLPVVFDSGSSYTYL 273
                    +T+M +D      Y   +  +  GG      +T  +++  + DSG+  T L
Sbjct: 182 PQNTPG---YTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRL 238

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L + ++  ++  +L  AP    L  C+   R    V+ V      + L F   
Sbjct: 239 PPTAYSALKTAVRAAMTQYTL--APAVTILDTCYDFSRTTSVVYPV------IVLHFAGL 290

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
             R    +       + N   VCL     A  G  D  +IG IG+
Sbjct: 291 DVR----IPATGVFFVFNSSQVCL-----AFAGNTDSTMIGIIGN 326


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 140/343 (40%), Gaps = 46/343 (13%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
           F V G   P   G Y   + +G P   + + +DTGSD+ W+ C++ C  C +        
Sbjct: 11  FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQIQL 69

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
               P    ++ ++ C D  C +        C     QC Y  +Y DG  + G  V D  
Sbjct: 70  NFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMM 129

Query: 147 AFN--YTNGQRLNPR--LALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             N  +      N    +  GC  NQ  G    S   +DGI G G+ + S++SQL SQ +
Sbjct: 130 HLNTIFEGSVTTNSTAPVVFGCS-NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGI 188

Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
              V  HCL G   GGG L  G+ +     +V+TS+      +Y+  +  +   G+T  +
Sbjct: 189 APRVFSHCLKGDSSGGGILVLGEIV--EPNIVYTSL-VPAQPHYNLNLQSIAVNGQTLQI 245

Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
                    +   + DSG++  YL    Y    S +   +  +S+  A          +G
Sbjct: 246 DSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVS--------RG 296

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
            + +     V + F  ++L+F  G +     L P+ YLI  N 
Sbjct: 297 NQCYLITSSVTEVFPQVSLNFAGGASMI---LRPQDYLIQQNS 336


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 141/338 (41%), Gaps = 39/338 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y +T+ +G P +   + +DTGSD++W+QC  PC +C     PL+ P    +     C   
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCSSA 191

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            CA L   G + C   +QC Y + Y DG S+ G    D  A   +N  R   +   GC  
Sbjct: 192 ACAQLGQEG-NGCSS-SQCQYTVTYGDGSSTTGTYSSDTLALG-SNAVR---KFQFGC-- 243

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYD 224
           + V        DG++GLG G  S+VSQ  +         +CL  +    GFL  G     
Sbjct: 244 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTFGAAFSYCLPATSSSSGFLTLG---AG 298

Query: 225 SSRVVWTSM--SSDYTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSYTYLNRVTY 278
           +S  V T M  SS    +Y   +  +  GG      T + +   + DSG+  T L    Y
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTIMDSGTVLTRLPPTAY 358

Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
             L+S  K  +  K    AP    L  C+     F     V     T+AL F+ G    +
Sbjct: 359 SALSSAFKAGM--KQYPSAPPSGILDTCFD----FSGQSSVS--IPTVALVFSGGA---V 407

Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            ++  +  ++ ++   +CL     A      L +IG +
Sbjct: 408 VDIASDGIMLQTSNSILCLAF--AANSDDSSLGIIGNV 443


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 87/353 (24%), Positives = 153/353 (43%), Gaps = 43/353 (12%)

Query: 48  TGYYNVTMYI--GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP-----HPLYRPSNDL 100
           T  +   MY+  G P        DTGSDL W+ C +       +      HP    +  L
Sbjct: 95  TRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSL 154

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF----NYTNGQRL 156
           + C+   C +L      +C+  ++C Y+  Y DG  ++GVL  + F+F        GQ  
Sbjct: 155 LSCQSAACQALS---QASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVR 211

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGG 211
            PR++ GC       A     DG++GLG G  S+VSQL +   I     +CL     +  
Sbjct: 212 VPRVSFGCSTGS---AGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAAN 268

Query: 212 GGGFLFFGDD--LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-VVFDSGS 268
               L FG    + D        + S+   YY+  +  +   G+     N   ++ DSG+
Sbjct: 269 SSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRIIVDSGT 328

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKN--VHDVKKCFR 324
           + T+L+    + L + +++ +  +  +  P ++ L LC+  +G+   ++  + DV     
Sbjct: 329 TLTFLDPALLRPLVAELERRI--RLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVT---- 382

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
              L F  G + T   L PE    +  +G +CL ++  +E   Q ++++G I 
Sbjct: 383 ---LRFGGGASVT---LRPENTFSLLEEGTLCLVLVPVSES--QPVSILGNIA 427


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 156/350 (44%), Gaps = 47/350 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
           Y   + +G PA  + + LDTGSDL W+ CD  C++C         ++    +YRP+    
Sbjct: 66  YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 123

Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
              +PC   +C S+  PG   C +P Q C Y ++Y ++  +S G+L++D    NY     
Sbjct: 124 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 178

Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
            +N  + +GCG  Q    + G +    DG+LGLG    S+ S L    L++N    C   
Sbjct: 179 PVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKE 235

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
              G +FFGD    S +           + Y+  V +   G +     +   + DSG+S+
Sbjct: 236 DSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSF 295

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y+  T    K+++A  +    ED T   C+    P + + DV     T+ L+F
Sbjct: 296 TSLPLDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP----TITLTF 347

Query: 331 TDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
              K  +L  + P   L  ++K       CL +L   E +G+   N + G
Sbjct: 348 AADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 393


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 156/350 (44%), Gaps = 47/350 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
           Y   + +G PA  + + LDTGSDL W+ CD  C++C         ++    +YRP+    
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 153

Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
              +PC   +C S+  PG   C +P Q C Y ++Y ++  +S G+L++D    NY     
Sbjct: 154 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 208

Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
            +N  + +GCG  Q    + G +    DG+LGLG    S+ S L    L++N    C   
Sbjct: 209 PVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKE 265

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
              G +FFGD    S +           + Y+  V +   G +     +   + DSG+S+
Sbjct: 266 DSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSF 325

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y+  T    K+++A  +    ED T   C+    P + + DV     T+ L+F
Sbjct: 326 TSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP----TITLTF 377

Query: 331 TDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
              K  +L  + P   L  ++K       CL +L   E +G+   N + G
Sbjct: 378 AADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 423


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 123/287 (42%), Gaps = 43/287 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVPCE 104
           Y VT+ IG PA    + +DTGSDL+W+QC  PC    C     PL+ PS       +PC 
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNASDCYPQKDPLFDPSKSSTFATIPCA 183

Query: 105 DPICASLHAPGHHN-CED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
              C  L   G+ N C +     P QC Y +EY +G  + GV   +  A   +   +   
Sbjct: 184 SDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSSAVVK--- 240

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFL 216
               GCG +Q     Y   DG+LGLG    S+VSQ  S  +      +CL     G GFL
Sbjct: 241 SFRFGCGSDQ--HGPYDKFDGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNSGAGFL 296

Query: 217 FFG---DDLYDSSRVVWTSMSSDYTKYYSPGVAELF---FGGETTGLKNL---PVVF--- 264
             G        +S  V+T M +     +SP +A  +     G + G K L   P VF   
Sbjct: 297 TLGAPNSTNNSNSGFVFTPMHA-----FSPKIATFYVVTLTGISVGGKALDIPPAVFAKG 351

Query: 265 ---DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
              DSG+  T +    Y+ L +  +  ++   L   P D  L  C+ 
Sbjct: 352 NIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLP-PADSALDTCYN 397


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 48/321 (14%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH-------APGH 116
           +DT S+LTW+QC APC  C +   PL+ PS+      VPC+ P C +L          G 
Sbjct: 158 VDTASELTWVQC-APCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGA 216

Query: 117 HNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY 174
             C+   PA C Y L Y DG  S GVL  D  +     G+ ++     GCG +   G  +
Sbjct: 217 PPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL---AGEVIDG-FVFGCGTSN-QGPPF 271

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDD---LYDSSR 227
               G++GLG+ + S+VSQ   Q     V  +CL         G L  GDD     +S+ 
Sbjct: 272 GGTSGLMGLGRSQLSLVSQTVDQ--FGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTP 329

Query: 228 VVWTSMSSD-----YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTYQ 279
           VV+TSM S+        +Y   +  +  GG   E+TG     +V DSG+  T L    Y 
Sbjct: 330 VVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAIV-DSGTVITSLVPSVYN 388

Query: 280 TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTL 338
            + +    +L+     +AP    L  C+       N+  +K+    +L L F DG     
Sbjct: 389 AVRAEFMSQLA--EYPQAPGFSILDTCF-------NMTGLKEVQVPSLTLVF-DGGAEVE 438

Query: 339 FELTPEAYLIISNKGNVCLGI 359
            +     Y + S+   VCL +
Sbjct: 439 VDSGGVLYFVSSDSSQVCLAV 459


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 137/334 (41%), Gaps = 32/334 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--C 103
           +G Y + + +G P + Y + LDTGS L+WLQC    V C     PL+ P  SN   P  C
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYC 176

Query: 104 EDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
               C+ L A   ++  C     C Y   Y D   S+G L +D      T  Q L P   
Sbjct: 177 SSSECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTL--TPSQTL-PSFT 233

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
            GCG  Q     +    GI+GL + K S+++QL  +        +CL   +  GGGFL  
Sbjct: 234 YGCG--QDNEGLFGKAAGIVGLARDKLSMLAQLSPK--YGYAFSYCLPTSTSSGGGFLSI 289

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLN 274
           G     S +      +S     Y   +A +   G   G+      +P + DSG+  T L 
Sbjct: 290 GKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLP 349

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFRTLALSFTDG 333
              Y  L     K +S +  ++AP    L  C+KG  +      +++  F+  A      
Sbjct: 350 ISIYAALREAFVKIMS-RRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGA------ 402

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
                  L     LI ++KG  CL   +  ++ +
Sbjct: 403 ----DLSLRAPNILIEADKGIACLAFASSNQIAI 432


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 85/318 (26%), Positives = 127/318 (39%), Gaps = 50/318 (15%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
           F V G+  P   G Y   + +G P + YF+ +DTGSD+ W+ C +PC  C         +
Sbjct: 77  FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135

Query: 88  EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDA 145
           E  +P    ++  +PC D  C +        C+  D + C Y   Y DG  + G  V D 
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195

Query: 146 FAFNYT--NGQRLN--PRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKL 199
             F+    N Q  N    +  GC  +Q    +     +DGI G G+ + S+VSQL+S  +
Sbjct: 196 MYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255

Query: 200 IRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGV 245
              V  HCL G   GGG L  G+ +     +V+T +      Y              P  
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPID 313

Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           + LF    T G      + DSG++  YL    Y    + +   +S              L
Sbjct: 314 SSLFTTSNTQG-----TIVDSGTTLAYLADGAYDPFVNAITAAVSPS---------VRSL 359

Query: 306 CWKGRRPFKNVHDVKKCF 323
             KG + F     +  CF
Sbjct: 360 VSKGNQCFVTSSRLASCF 377


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 147/340 (43%), Gaps = 51/340 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + M IG P R Y   LDTGSDL W QC APC+ CV+ P P + P+       + C 
Sbjct: 88  GEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCA 146

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
            P C +L+ P  +       C Y+  Y D  S+ GVL  + F F  TN  R++ P ++ G
Sbjct: 147 SPACNALYYPLCYQ----KVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISFG 201

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGD 220
           CG   +         G++G G+G  S+VSQL S +       +CL+         L+FG 
Sbjct: 202 CG--NLNAGLLANGSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVPSRLYFGV 254

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV--------------- 262
               +S    +          +P +  ++F    G + G   LP+               
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGG 314

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            + DSG++ TYL    Y  + +    +++   L    +   L  C++   P +    + +
Sbjct: 315 TIIDSGTTITYLAEPAYDAVRAAFASQITLP-LLNVTDASVLDTCFQWPPPPRQSVTLPQ 373

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGI 359
               L L F DG     +EL  + Y+++  S  G +CL +
Sbjct: 374 ----LVLHF-DGAD---WELPLQNYMLVDPSTGGGLCLAM 405


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/281 (28%), Positives = 128/281 (45%), Gaps = 36/281 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
           Y   + +G PA  + + LDTGSDL W+ CD  C++C         ++    +YRP+    
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 153

Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
              +PC   +C S+  PG   C +P Q C Y ++Y ++  +S G+L++D    NY     
Sbjct: 154 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 208

Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
            +N  + +GCG  Q    + G +    DG+LGLG    S+ S L    L++N    C   
Sbjct: 209 PVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKE 265

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGVAELFFGGETTGLKNLPVVFDSGS 268
              G +FFGD    S +   T     Y K   Y+  V +   G +     +   + DSG+
Sbjct: 266 DSSGRIFFGDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
           S+T L    Y+  T    K+++A  +    ED T   C+  
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA 362


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 92/314 (29%), Positives = 131/314 (41%), Gaps = 37/314 (11%)

Query: 29  LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
           LF   GS  LF   GN     +Y   + IG P   + + LD GSDL W+ CD  C++C  
Sbjct: 88  LFPSQGSQALF--FGNELDWLHY-TWIDIGTPNVSFLVALDAGSDLLWVPCD--CIQCAP 142

Query: 89  APHPLYRPSNDLVPCEDPICASLHAPGHH------------NCEDPAQ-CDYELEYAD-- 133
                Y  S D    E     SL +   H            NC++P   C Y   Y D  
Sbjct: 143 LSASYYNISLDRDLSE--YSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFE 200

Query: 134 GGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPGASYH---PLDGILGLGKG 186
             +S G LV+D        ++T  + L   + LGCG  Q  G S+      DG++GLG G
Sbjct: 201 NTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQ--GGSFFDGAAPDGVMGLGPG 258

Query: 187 KSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGV 245
             S+ S L    LI+N    C      G + FGD  + S +   +  +   Y  Y+  GV
Sbjct: 259 DISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFV-GV 317

Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
                G           + DSGSS+TYL    Y  L S   K+++AK +  + +D     
Sbjct: 318 ESYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRI--SFQDGLWDY 375

Query: 306 CWKGRRPFKNVHDV 319
           C+      + +HD+
Sbjct: 376 CYNASS--QELHDI 387


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 87/365 (23%), Positives = 148/365 (40%), Gaps = 45/365 (12%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAPHP 92
           F V G   P   G Y   + +G P   + + +DTGSD+ W+ C++    P    ++    
Sbjct: 64  FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLN 123

Query: 93  LYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFA 147
            + P    ++ ++ C D  C +        C     QC Y  +Y DG  + G  V D   
Sbjct: 124 FFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMH 183

Query: 148 FN--YTNGQRLNPR--LALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
            N  +      N    +  GC  NQ  G    S   +DGI G G+ + S++SQL SQ + 
Sbjct: 184 LNTIFEGSMTTNSTAPVVFGCS-NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIA 242

Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL- 257
             +  HCL G   GGG L  G+ +     +V+TS+      +Y+  +  +   G+T  + 
Sbjct: 243 PRIFSHCLKGDSSGGGILVLGEIV--EPNIVYTSLVP-AQPHYNLNLQSISVNGQTLQID 299

Query: 258 -------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                   +   + DSG++  YL    Y    S         ++  A       +  +G 
Sbjct: 300 SSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVS---------AITAAIPQSVRTVVSRGN 350

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQD 369
           + +     V   F  ++L+F  G +     L P+ YLI  N  G   +  +   ++  Q 
Sbjct: 351 QCYLITSSVTDVFPQVSLNFAGGASMI---LRPQDYLIQQNSIGGAAVWCIGFQKIQGQG 407

Query: 370 LNVIG 374
           + ++G
Sbjct: 408 ITILG 412


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/298 (27%), Positives = 131/298 (43%), Gaps = 46/298 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           TG Y + M++G P +  +L LDTGSDL+W+QCD PC  C E     Y P +      + C
Sbjct: 168 TGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGSHYYPKDSSTYRNISC 226

Query: 104 EDPIC--ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNPR 159
            DP C   S   P  H   +   C Y  +YADG ++ G    + F  N T  NG+    +
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286

Query: 160 LA---LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----G 211
           +     GCG+       ++   G+LGLG+G  S  SQ+  Q +  +   +CL+       
Sbjct: 287 VVDVMFGCGHWN--KGFFYGASGLLGLGRGPISFPSQI--QSIYGHSFSYCLTDLFSNTS 342

Query: 212 GGGFLFFGDD--LYDSSRVVWTSM-----SSDYTKYYSPGVAELFFGGETTGLKN----- 259
               L FG+D  L ++  + +T++     + D T YY   +  +  GGE   +       
Sbjct: 343 VSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQ-IKSIMVGGEVLDISEQTWHW 401

Query: 260 ----------LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                        + DSGS+ T+     Y  +    +K++  + +  A +D  +  C+
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQI--AADDFVMSPCY 457


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 21/251 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
           TG Y V++ +G P +   L  DTGSDLTW +C A      E   P    S   V C  P+
Sbjct: 131 TGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSA-----AETFDPTKSTSYANVSCSTPL 185

Query: 108 CAS-LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C+S + A G+ +    + C Y ++Y DG  S+G L K+      T+   +      GCG 
Sbjct: 186 CSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD---IFNNFYFGCGQ 242

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDDLYDS 225
           + V G  +    G+LGLG+ K S+VSQ   +     +  +CL S    GFL FG     S
Sbjct: 243 D-VDGL-FGKAAGLLGLGRDKLSVVSQTAPK--YNQLFSYCLPSSSSTGFLSFGSSQSKS 298

Query: 226 SRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQT 280
           ++  +T +SS  + +Y+  +  +  GG+   +          + DSG+  T L    Y  
Sbjct: 299 AK--FTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRLPPAAYSA 356

Query: 281 LTSIMKKELSA 291
           L S  +K +++
Sbjct: 357 LRSAFRKAMAS 367


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 139/339 (41%), Gaps = 39/339 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y ++  +G P    +  +DTGS++ WLQC  PC  C     P++ PS       +PC 
Sbjct: 87  GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ-PCNTCFNQTSPIFNPSKSSSYKNIPCT 145

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLAL 162
              C   +   H +C +    C+Y + Y     S G L  D+   + T+G   L P + +
Sbjct: 146 SSTCKDTNDT-HISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVI 204

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLF 217
           GCG+  V   +     G++G+G+G  S++ Q+ S   + +   +CL            L 
Sbjct: 205 GCGHINVLQDNSQS-SGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYNSDSNSSSKLI 262

Query: 218 FGDDLYDSSRVVWTS---MSSDYTKYYSPGVAELFFG------GETTGLKNLPVVFDSGS 268
           FG+D+  S  +V ++     +    YY   +     G      GE +      ++ DSG+
Sbjct: 263 FGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNILIDSGT 322

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             T L  +    L S + +E+    ++  P D  L LC+       NV D+   F    +
Sbjct: 323 PLTMLPNLFLSKLVSYVAQEVKLPRIE--PPDHHLSLCYNTTGKQLNVPDITAHFNGADV 380

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGIL--NGAEV 365
                 T   FE            G +C G +  NG E+
Sbjct: 381 KLNSNGTFFPFE-----------DGIMCFGFISSNGLEI 408


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 80/279 (28%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P    +L +D+GSD+ W+QC  PC +C     PL+ P+       V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC +L   G     D  +CDY + Y DG  + G L  +      T  Q     +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
           CG+       +    G+LGLG G  S+V QL        V  +CL+    GG G L  G 
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGGAA--GGVFSYCLASRGAGGAGSLVLGR 297

Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGS 268
                   VW  +  ++  + +Y  G+  +  GGE   L++            VV D+G+
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 357

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           + T L R  Y  L       + A  L  +P    L  C+
Sbjct: 358 AVTRLPREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 394


>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 154/352 (43%), Gaps = 50/352 (14%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---APCVRCVEAPH------PLYRP---- 96
           Y NV+  IG P+  Y + LDTGSDL WL CD   + CV+ ++ P        +YRP    
Sbjct: 114 YANVS--IGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASS 171

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ 154
           ++  +PC + +C+         C    + C Y+++Y ++G SS GVLV+D       + Q
Sbjct: 172 TSQTIPCNNTLCSR-----QSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQ 226

Query: 155 R--LNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
              L+ ++  GCG  Q    + GA+    +G+ GLG    S+ S L  +    N    C 
Sbjct: 227 SRALDAKIIFGCGRVQTGSFLDGAA---PNGLFGLGMTNISVPSTLAREGYTSNSFSMCF 283

Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
              G G + FGD           ++   +   Y+  + ++  GG    L+    +FDSG+
Sbjct: 284 GRDGIGRISFGDTGSSGQGETPFNLRQLHPT-YNVSITKINVGGRDADLE-FSAIFDSGT 341

Query: 269 SYTYLNRVTYQTLT---SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           S+TYLN   Y  ++   +I  KE    S+ + P       C++      N+        T
Sbjct: 342 SFTYLNDPAYTLISESFNIGAKEKRYSSISDIP----FEYCYEMSSNQTNLE-----IPT 392

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
           + L    G     F +T    ++I   G    CL I+   +V +   N + G
Sbjct: 393 VNLVMQGGSQ---FNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTG 441


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 91/351 (25%), Positives = 153/351 (43%), Gaps = 45/351 (12%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-EAPHPLYRP--S 97
           V G    +G Y V + IGQP +   L  DTGSDL W++C A C  C   +P  ++ P  S
Sbjct: 74  VSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHS 132

Query: 98  NDLVP--CEDPICASL----HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           +   P  C DP+C  +     AP  ++    + C YE  YADG  + G+  ++  +   +
Sbjct: 133 STFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTS 192

Query: 152 NGQRLNPR-LALGCGY----NQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNV 203
           +G+    + +A GCG+      V G S++  +G++GLG+G  S  SQL  +   K    +
Sbjct: 193 SGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 252

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG--------- 252
           + + LS     +L  G+     S++ +T + ++     +Y   +  +F  G         
Sbjct: 253 MDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSI 312

Query: 253 -ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP---LCWK 308
            E     N   V DSG++  +L    Y+++ + +++      +K    D   P   LC  
Sbjct: 313 WEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR-----VKLPIADALTPGFDLCVN 367

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
                  V   +K    L   F+ G    +F   P  Y I + +   CL I
Sbjct: 368 ----VSGVTKPEKILPRLKFEFSGG---AVFVPPPRNYFIETEEQIQCLAI 411


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 100/352 (28%), Positives = 145/352 (41%), Gaps = 44/352 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VP 102
           TG Y V + +G P +   L  DTGSDLTW QC  PCV+ C     P++ PS       + 
Sbjct: 151 TGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQ-PCVKSCYAQQQPIFDPSTSKTYSNIS 209

Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C    C+SL  A G+      + C Y ++Y D   ++G   KD       +   +     
Sbjct: 210 CTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQND---VFDGFM 266

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
            GCG N      +    G++GLG+   SIV Q  +QK  +    +CL  S G  G L FG
Sbjct: 267 FGCGQNN--KGLFGKTAGLIGLGRDPLSIVQQT-AQKFGK-YFSYCLPTSRGSNGHLTFG 322

Query: 220 D-DLYDSSRVVWTSM------SSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSG 267
           + +   +S+ V   +      SS  T YY   V  +  GG+   +     +N   + DSG
Sbjct: 323 NGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +  T L    Y +L S  K+ +S      AP    L  C+       N   +      ++
Sbjct: 383 TVITRLPSTAYGSLKSAFKQFMS--KYPTAPALSLLDTCYD----LSNYTSIS--IPKIS 434

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
            +F         EL P   LI +    VCL     A  G  D + IG  G+ 
Sbjct: 435 FNFNGNAN---VELDPNGILITNGASQVCL-----AFAGNGDDDSIGIFGNI 478


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 73/266 (27%), Positives = 113/266 (42%), Gaps = 47/266 (17%)

Query: 44  NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSN 98
           +++  G Y   + +G P + +++D+DTGS++ W++C APC  C     V  P   + P  
Sbjct: 34  DIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKC-APCTGCEHSGDVPVPMSTFDPRK 92

Query: 99  DL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY---- 150
                 + C D  C  L+     + E    C Y L Y DG S+ G  + D F FN     
Sbjct: 93  STTKISISCTDAECGVLNKKLQCSPER-LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSD 151

Query: 151 -TNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            +  +    RL  GCG  Q    S   +DG+LG G    S+ +QL  Q +  N+  HCL 
Sbjct: 152 NSTAKSGTARLVFGCGGTQTGSWS---VDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQ 208

Query: 210 GGGGGF-----------------LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
           G   G                  + FG+D Y+   V   ++        +P   +L + G
Sbjct: 209 GDVSGRGSLVIGTIREPDLVYTPMVFGEDHYN---VQLLNIGISGRNVTTPASFDLEYTG 265

Query: 253 ETTGLKNLPVVFDSGSSYTYLNRVTY 278
                    V+ DSG++ TYL +  Y
Sbjct: 266 G--------VIIDSGTTLTYLVQPAY 283


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 79/263 (30%), Positives = 112/263 (42%), Gaps = 38/263 (14%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
           Y TG Y   + IG PA  Y++ LDTGS   W+      + C + PH          Y P 
Sbjct: 78  YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 133

Query: 97  ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
              S+  V C+D IC S   P    C    +C Y   YADGG ++G+L  D   ++  Y 
Sbjct: 134 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 188

Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
           NGQ    +  +  GCG  Q      S   +DGI+G G    + +SQL +    + +  HC
Sbjct: 189 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 248

Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
           L S  GGG    G+ +    +V  T +  +   Y+   +  +   G T  L        K
Sbjct: 249 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 306

Query: 259 NLPVVFDSGSSYTYLNRVTYQTL 281
                 DSGS+  YL  + Y  L
Sbjct: 307 TKGTFIDSGSTLVYLPEIIYSEL 329


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 92/348 (26%), Positives = 145/348 (41%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G P   Y +  DTGSD TW+QC    V C E    L+ P+     
Sbjct: 172 GRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 231

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L+    H C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 232 ANVSCAAPACSDLNI---HGCSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 285 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 340

Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            F  G     S+R+    ++ +   +Y  G+  +  GG+   L ++P         + DS
Sbjct: 341 DFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 397

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L       ++A+  K+AP    L  C+     F  +  V     T+
Sbjct: 398 GTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 451

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G      ++     +  ++   VCL      + G  D+ ++G
Sbjct: 452 SLLFQGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 494


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 42/294 (14%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND--- 99
           G  + TG Y   + +G P R  +L +DTGSD+TWLQC APC  C +    L+ PS+    
Sbjct: 8   GLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQC-APCTNCYKQKDALFNPSSSSSF 66

Query: 100 -LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YTNGQRL 156
            ++ C   +C +L   G  +     +C Y+ +Y DG  ++G LV D    +  +  GQ +
Sbjct: 67  KVLDCSSSLCLNLDVMGCLS----NKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVV 122

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----G 211
              + LGCG++     ++    GILGLG+G  S  + L +    RN+  +CL        
Sbjct: 123 LTNIPLGCGHDN--EGTFGTAAGILGLGRGPLSFPNNLDAST--RNIFSYCLPDRESDPN 178

Query: 212 GGGFLFFGDDLY-----DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---- 262
               L FGD         S + +    +     YY   +  +  GG    L N+P     
Sbjct: 179 HKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNL--LTNIPASVFQ 236

Query: 263 ---------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                    +FDSG++ T L    Y  +    +   +   L  A + +    C+
Sbjct: 237 LDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRA--ATMHLTSAADFKIFDTCY 288


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 154/374 (41%), Gaps = 55/374 (14%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
           F V G+  P   G Y   + +G P   + + +DTGSD+ W+ C++ C  C  +       
Sbjct: 65  FSVEGSSDPLLVGLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNS-CNGCPRSSGLGIQL 123

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
                    S+ LV C DPIC S        C   + QC Y  +Y DG  + G  V ++ 
Sbjct: 124 NFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESM 183

Query: 147 AFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
            F+   GQ +    +  +  GC   Q      S H +DGI G G G  S++SQL ++ + 
Sbjct: 184 YFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGIT 243

Query: 201 RNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
             V  HCL   G GGG L  G+ L     +V++ +      +Y+  +  +   G+T  + 
Sbjct: 244 PKVFSHCLKGEGNGGGILVLGEVL--EPGIVYSPLVPS-QPHYNLYLQSISVNGQTLPID 300

Query: 259 --------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                   N   + DSG++  YL    Y    S         ++  A      P   KG 
Sbjct: 301 PSVFATSINRGTIIDSGTTLAYLVEEAYTPFVS---------AITAAVSQSVTPTISKGN 351

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE---VGL 367
           + +     V + F  ++L+F    +     L PE YL+        LG  +GA    +G 
Sbjct: 352 QCYLVSTSVGEIFPLVSLNFAGSASMV---LKPEEYLM-------HLGFYDGAALWCIGF 401

Query: 368 QDLNV-IGGIGDFV 380
           Q +   +  +GD V
Sbjct: 402 QKVQEGVTILGDLV 415


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P    +L +D+GSD+ W+QC  PC +C     PL+ P+       V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC +L   G     D  +CDY + Y DG  + G L  +      T  Q     +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
           CG+       +    G+LGLG G  S++ QL        V  +CL+    GG G L  G 
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLIGQLGGAA--GGVFSYCLASRGAGGAGSLVLGR 297

Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE----TTGLKNLP------VVFDSGS 268
                   VW  +  ++  + +Y  G+  +  GGE      GL  L       VV D+G+
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGT 357

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           + T L R  Y  L       + A  L  +P    L  C+
Sbjct: 358 AVTRLPREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 394


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 89/366 (24%), Positives = 149/366 (40%), Gaps = 47/366 (12%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
           F V G   P   G Y   + +G P   + + +DTGSD+ W+ C++ C  C +        
Sbjct: 61  FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQIQL 119

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
               P    ++ ++ C D  C +        C     QC Y  +Y DG  + G  V D  
Sbjct: 120 NFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMM 179

Query: 147 AFN--YTNGQRLNPR--LALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             N  +      N    +  GC  NQ  G    S   +DGI G G+ + S++SQL SQ +
Sbjct: 180 HLNTIFEGSVTTNSTAPVVFGCS-NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGI 238

Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
              V  HCL G   GGG L  G+ +     +V+TS+      +Y+  +  +   G+T  +
Sbjct: 239 APRVFSHCLKGDSSGGGILVLGEIV--EPNIVYTSLVP-AQPHYNLNLQSIAVNGQTLQI 295

Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
                    +   + DSG++  YL    Y    S +   +        P+     +  +G
Sbjct: 296 DSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI--------PQ-SVHTVVSRG 346

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQ 368
            + +     V + F  ++L+F  G +     L P+ YLI  N  G   +  +   ++  Q
Sbjct: 347 NQCYLITSSVTEVFPQVSLNFAGGASMI---LRPQDYLIQQNSIGGAAVWCIGFQKIQGQ 403

Query: 369 DLNVIG 374
            + ++G
Sbjct: 404 GITILG 409


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 145/348 (41%), Gaps = 41/348 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P      
Sbjct: 170 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTY 229

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L+    H C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 230 ANVSCAAPACSDLNI---HGCSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 282

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 283 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 338

Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
            F  G     S+R+    ++ +   +Y  G+  +  GG+   L ++P         + DS
Sbjct: 339 DFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 395

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y +L       ++A+  K+AP    L  C+     F  +  V     T+
Sbjct: 396 GTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 449

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           +L F  G      ++     +  ++   VCL      + G  D+ ++G
Sbjct: 450 SLLFQGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 492


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 90/352 (25%), Positives = 146/352 (41%), Gaps = 47/352 (13%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + + +G P  P     DTGSD+ W QC+ PC  C +   P++ PS       V C 
Sbjct: 83  GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCE-PCTNCYQQDLPMFNPSKSTTYRKVSCS 141

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
            P+C+       ++C     C Y + Y D   S G    D      T+G+ +  PR A+G
Sbjct: 142 SPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIG 199

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-----GGGGGFLFF 218
           CG++   G+    + GI+GLG G +S++ Q+ S   +     +CL+      GG   L F
Sbjct: 200 CGHDNA-GSFDANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNF 256

Query: 219 GDDLYDS-SRVVWTS--MSSDYTKYYSPGVAELFFGGETT----------GLKNLPVVFD 265
           G +   S S  V T   +S  +  +YS  +  +  G   T          G  N  ++ D
Sbjct: 257 GSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKAN--IIID 314

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++ T L    Y      +   ++ +   +   ++ L  C++         D K  F  
Sbjct: 315 SGTTLTLLPVDLYHNFAKAISNSINLQRTDD--PNQFLEYCFE-----TTTDDYKVPF-- 365

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
           +A+ F     R    L  E  LI  +   +CL      +    D+++ G I 
Sbjct: 366 IAMHFEGANLR----LQRENVLIRVSDNVICLAFAGAQD---NDISIYGNIA 410


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 75/275 (27%), Positives = 122/275 (44%), Gaps = 24/275 (8%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDLV 101
           Y   + +G P   + + LDTGSDL W+ CD  C++C         ++    +Y+P+    
Sbjct: 100 YYAWVDVGTPTTSFLVALDTGSDLFWVPCD--CIQCAPLSSYRGNLDRDLGIYKPAESTT 157

Query: 102 PCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR-LNP 158
               P    L  PG   C +P Q C Y ++Y ++  +S G+L++D+   N   G   +N 
Sbjct: 158 SRHLPCSHELCQPGS-GCTNPKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNA 216

Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
            + +GCG  Q    + G +    DG+LGLG    S+ S L    L+RN    C      G
Sbjct: 217 SVIIGCGRKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSG 273

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
            +FFGD    S +           + Y+  V +   G +     +   + DSG+S+T L 
Sbjct: 274 RIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQALVDSGTSFTSLP 333

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
              Y+  T+   K+++A  +    ED T   C+  
Sbjct: 334 PDVYKAFTTEFDKQINASRVPY--EDSTWKYCYSA 366


>gi|168021169|ref|XP_001763114.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685597|gb|EDQ71991.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 641

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 102/228 (44%), Gaps = 43/228 (18%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC--VEAPHPLYRPSNDL-VPCEDPI 107
           Y V M +G+  + +   +DTGS  +WL C  P +    V  P+ +Y P  ++ V C  P 
Sbjct: 126 YYVKMRVGKSKKLFHFLIDTGSQPSWLHCKWPAIEKHPVAGPNGMYVPEKEVQVDCRSPE 185

Query: 108 CASLHA-PGHHN-------CEDP--AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           C SL   P + N       C +P   +C Y++ Y D     G  V+D  +     G++L+
Sbjct: 186 CLSLQRIPSNFNNIRNLFPCNEPNDWRCTYDITYLDRSHLRGFYVQDVVSLATLEGEQLD 245

Query: 158 PRLALGCGYNQVPGA-----SYH--------------PL--DGILGLGKGKSSIVSQLHS 196
            ++ LG        A     S+H              PL  DG+LGL KG  S VSQL  
Sbjct: 246 AKITLGYATPNHRAAPFGFCSWHASSDRYGEEELERSPLTTDGLLGLNKGTESFVSQLKR 305

Query: 197 QKLI-RNVVGHCLSG-------GGGGFLFFGD-DLYDSSRVVWTSMSS 235
           Q  I  +VVGHC             GF+FFG   L DS  + W+ M+S
Sbjct: 306 QGAISSHVVGHCFRSLDTTDFETNSGFMFFGKSKLLDSLPITWSPMAS 353


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 85/346 (24%), Positives = 139/346 (40%), Gaps = 47/346 (13%)

Query: 39  FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAPHP 92
           F V G   P   G Y   + +G P + +++ +DTGSD+ W+ C +    P    ++ P  
Sbjct: 70  FPVQGTFNPFLVGLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLT 129

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFA 147
            + P +     LV C D  C +        C     QC Y  +Y DG  + G  V D   
Sbjct: 130 FFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMH 189

Query: 148 FN---YTNG------QRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHS 196
            +    ++G      Q  +  ++  C   Q      S   +DGI G G+ + S++SQL S
Sbjct: 190 LDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLAS 249

Query: 197 QKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET 254
           Q +   V  HCL G   GGG L  G+ +     +V+T +      +Y+  +  +   G+T
Sbjct: 250 QGITPRVFSHCLKGDDSGGGVLVLGEIV--EPNIVYTPLVPS-QPHYNLYLQSISVAGQT 306

Query: 255 TGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
             +         N   + DSG++  YL    Y    S +   +S  +             
Sbjct: 307 LAIDPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLS-------- 358

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
            KG + +     V   F  ++L+F  G +     L P+ YL+  N 
Sbjct: 359 -KGNQCYLVTSSVNDVFPQVSLNFAGGAS---LILNPQDYLLQQNS 400


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 73/239 (30%), Positives = 108/239 (45%), Gaps = 38/239 (15%)

Query: 45  VYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           V P+G   Y V + +G P +P    LDTGSDL W QC APC  C+  P P++ P    S 
Sbjct: 96  VRPSGDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQC-APCASCLPQPDPIFSPGASSSY 154

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF----NYTNGQ 154
           + + C   +C  +    HH+C+ P  C Y   Y DG ++ GV   + F F    +     
Sbjct: 155 EPMRCAGELCNDIL---HHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETT 211

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GG 211
           +L+  L  GCG   +   S +   GI+G G+   S+VSQL  ++       +CL+    G
Sbjct: 212 KLSAPLGFGCG--TMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRF-----SYCLTPYASG 264

Query: 212 GGGFLFFGD---DLYDSSRVVWTSM-----SSDYTKYYSPGVAELFFGGETTGLKNLPV 262
               L FG     +YD++     +        + T YY P      F G T G + L +
Sbjct: 265 RKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVP------FTGVTVGARRLRI 317


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 158/371 (42%), Gaps = 67/371 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y  T+ +G PA+ + +  DTGSDL W+QC  PC  C     P++ P    S   + C 
Sbjct: 38  GDYVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCG 96

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
           D +C SL      +C     CDY   Y DG  + G L  +      T G++L  + +A G
Sbjct: 97  DTLCDSLP---RKSCS--PDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFG 151

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
           CG+  +   S++   G++GLG+G  S VSQL    L  +   +CL     +      +FF
Sbjct: 152 CGH--LNRGSFNDASGLVGLGRGNLSFVSQLG--DLFGHKFSYCLVPWRDAPSKTSPMFF 207

Query: 219 GDDLYDSSRVVWTSMSSDYTKY-YSPGVAELFFGGETTGLKNLPV--------------- 262
           GD+   SS      +   +T   ++P +   ++      LK++ +               
Sbjct: 208 GDE--SSSHSSGKKLHYAFTPMIHNPAMESFYY----VKLKDISIAGRALRIPAGSFDIK 261

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                  +FDSG++ T L    YQ +   ++ ++S   +  +     L LC+       +
Sbjct: 262 PDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGS--SAGLDLCY-------D 312

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNG-AEVGL----- 367
           V   K  ++    +         ++L  E Y I +N     VCL +++   ++G+     
Sbjct: 313 VSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMM 372

Query: 368 -QDLNVIGGIG 377
            Q+  V+  IG
Sbjct: 373 QQNFRVMYDIG 383


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/350 (27%), Positives = 155/350 (44%), Gaps = 47/350 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
           Y   + +G PA  + + LDTGSDL W+ CD  C++C         ++    +YRP+    
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 153

Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
              +PC   +C S+  PG   C +P Q C Y ++Y ++  +S G+L++D    NY     
Sbjct: 154 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 208

Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
            +N  + +GCG  Q    + G +    DG+L LG    S+ S L    L++N    C   
Sbjct: 209 PVNASVIIGCGQKQSGDYLDGIA---PDGLLALGMADISVPSFLARAGLVQNSFSMCFKE 265

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
              G +FFGD    S +           + Y+  V +   G +     +   + DSG+S+
Sbjct: 266 DSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSF 325

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y+  T    K+++A  +    ED T   C+    P + + DV     T+ L+F
Sbjct: 326 TSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP----TITLTF 377

Query: 331 TDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
              K  +L  + P   L  ++K       CL +L   E +G+   N + G
Sbjct: 378 AADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 423


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/352 (25%), Positives = 145/352 (41%), Gaps = 47/352 (13%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + + +G P  P     DTGSD+ W QC  PC  C +   P++ PS       V C 
Sbjct: 83  GEYLMKLSVGTPPFPIIAVADTGSDIIWTQC-VPCTNCYQQDLPMFNPSKSTTYRKVSCS 141

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
            P+C+       ++C     C Y + Y D   S G    D      T+G+ +  PR A+G
Sbjct: 142 SPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIG 199

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-----GGGGGFLFF 218
           CG++   G+    + GI+GLG G +S++ Q+ S   +     +CL+      GG   L F
Sbjct: 200 CGHDNA-GSFDANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNF 256

Query: 219 GDDLYDS-SRVVWTS--MSSDYTKYYSPGVAELFFGGETT----------GLKNLPVVFD 265
           G +   S S  V T   +S  +  +YS  +  +  G   T          G  N  ++ D
Sbjct: 257 GSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKAN--IIID 314

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++ T L    Y      +   ++ +   +   ++ L  C++         D K  F  
Sbjct: 315 SGTTLTLLPVDLYHNFAKAISNSINLQRTDD--PNQFLEYCFE-----TTTDDYKVPF-- 365

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
           +A+ F     R    L  E  LI  +   +CL      +    D+++ G I 
Sbjct: 366 IAMHFEGANLR----LQRENVLIRVSDNVICLAFAGAQD---NDISIYGNIA 410


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/332 (26%), Positives = 140/332 (42%), Gaps = 38/332 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +T  +G P    +   DTGSD+ WLQC+ PC +C     P++ PS       +PC 
Sbjct: 85  GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCL 143

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C   H+    +C D   C Y++ Y D   S G L  D  +   T+G  ++ P+  +G
Sbjct: 144 SKLC---HSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIG 200

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGFLF 217
           CG +   G       GI+GLG G  S+++QL S   I     +CL             L 
Sbjct: 201 CGTDNA-GTFGGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILS 257

Query: 218 FGDDLYDSSRVVWTS--MSSDYTKY------YSPGVAELFFGGETTGLKNL-PVVFDSGS 268
           FGD    S   V ++  +  D   Y      +S G   + FGG + G  +   ++ DSG+
Sbjct: 258 FGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGT 317

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR-----PFKNVH----DV 319
           + T +    Y  L S +   +    + +   ++   LC+  +      P    H    D+
Sbjct: 318 TLTLIPSDVYTNLESAVVDLVKLDRVDDP--NQQFSLCYSLKSNEYDFPIITAHFKGADI 375

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
           +    +  +  TDG     F+ +P+   I  N
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGN 407


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/340 (26%), Positives = 142/340 (41%), Gaps = 50/340 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPC 103
           +G Y V + IG P   Y   +DTGSDL W QC APC+ C   P P +      +   +PC
Sbjct: 86  SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCAAQPTPYFDVKRSATYRALPC 144

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLAL 162
               CA+L +P          C Y+  Y D  S+ GVL  + F F   +  ++    ++ 
Sbjct: 145 RSSRCAALSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISF 200

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
           GCG   +         G++G G+G  S+VSQL   +       +CL+         L+FG
Sbjct: 201 GCG--SLNAGELANSSGMVGFGRGPLSLVSQLGPSRF-----SYCLTSYLSPTPSRLYFG 253

Query: 220 DDLYDSSRVVWTSMSSDYTKY-YSPGVAELFF---GGETTGLKNLP-------------- 261
                +S    +      T +  +P +  ++F    G + G K LP              
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313

Query: 262 -VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            V+ DSG+S T+L +  Y+ +   +   +   ++ +   D  L  C++   P     +V 
Sbjct: 314 GVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMND--TDIGLDTCFQWPPP----PNVT 367

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGI 359
                    F DG   T   L PE Y++I S  G +CL +
Sbjct: 368 VTVPDFVFHF-DGANMT---LPPENYMLIASTTGYLCLAM 403


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 84/303 (27%), Positives = 128/303 (42%), Gaps = 32/303 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSND----LVPCE 104
           Y VT+ +G P     L++DTGSDL+W+QC  PC    C     PL+ P+       VPC 
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
            P+C  L    + +    AQC Y + Y DG  + GV   D    +  +  R       GC
Sbjct: 199 GPVCGGLGI--YASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDAVR---GFFFGC 253

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL 222
           G+ Q   + +   DG+LGLG+ ++S+V Q  +      V  +CL       G+L  G   
Sbjct: 254 GHAQ---SGFTGNDGLLGLGREEASLVEQ--TAGTYGGVFSYCLPTRPSTTGYLTLGGPS 308

Query: 223 YDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNL----PVVFDSGSSYTYLNR 275
             +     T+    S +   YY   +  +  GG+   + +       V D+G+  T L  
Sbjct: 309 GAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPP 368

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
             Y  L S  +  +++     AP    L  C+     F     V      +AL+F+ G T
Sbjct: 369 TAYAALRSAFRSGMASYGYPSAPATGILDTCYN----FSGYGTVT--LPNVALTFSGGAT 422

Query: 336 RTL 338
            TL
Sbjct: 423 VTL 425


>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 529

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/267 (29%), Positives = 119/267 (44%), Gaps = 31/267 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCE 104
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P    ++  VPC 
Sbjct: 114 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLSSPDYGNLKFDVYSPRKSSTSRKVPCS 171

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
             +C          C   +  C Y++EY +D  SS GVLV+D       +G     +  +
Sbjct: 172 SNMCDL-----QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHSKITQAPI 226

Query: 163 GCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
             G  QV   S+      +G+LGLG    S+ S L SQ +  N    C    G G + FG
Sbjct: 227 TFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGEDGHGRINFG 286

Query: 220 DDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTY 278
           D    S+  + T ++   +  YY+  +     GG+T   K    V DSG+S+T L+   Y
Sbjct: 287 DT--GSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTK-FSAVVDSGTSFTALSDPMY 343

Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPL 305
             +TS   K++     K  P D +LP 
Sbjct: 344 TEITSAFDKQVKE---KRNPADSSLPF 367


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 38/345 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P++    
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L   G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 231 ANVSCAAPACSDLDVSG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 283

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 284 GFRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGK--YGGVFAHCLPARSTGTGYL 339

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
            FG     ++        +  T YY  G+  +  GG    L   P VF       DSG+ 
Sbjct: 340 DFGAGSPPATTTTPMLTGNGPTFYYV-GMTGIRVGGRL--LPIAPSVFAAAGTIVDSGTV 396

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
            T L    Y +L S     ++A+  ++A     L  C+     F  +  V     T++L 
Sbjct: 397 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYD----FTGMSQVA--IPTVSLL 450

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           F  G      ++     +   +   VCL      + G  D+ ++G
Sbjct: 451 FQGGAA---LDVDASGIMYTVSASQVCLAFAGNEDGG--DVGIVG 490


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 137/315 (43%), Gaps = 36/315 (11%)

Query: 17  MSSSSSSSSSSSLFNHVGSSLLFQVH-GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           +SS+      +S+   +GSS  FQV       T  + V   +GQP  P    +DTGS L 
Sbjct: 62  ISSARFKYLQNSIDKELGSSN-FQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLL 120

Query: 76  WLQCDAPCVRCV--EAPHPLYRP--SNDLVP--CEDPICASLHAPGHHNCEDPAQCDYEL 129
           W+QC  PC  C      HP++ P  S+  V   C+D  C   +AP  H C    +C YE 
Sbjct: 121 WIQCQ-PCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCR--YAPNGH-CGSSNKCVYEQ 176

Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKS 188
            Y  G  S GVL K+   F   NG  +  + +A GCGY        H   GILGLG   +
Sbjct: 177 VYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESH-FTGILGLGAKPT 235

Query: 189 SIVSQLHSQKLIRNVVGHC---LSGGGGGF--LFFGDD---LYDSSRVVW-TSMSSDYTK 239
           S+  QL S+        +C   L+    G+  L  G+D   L D + + + T  S  Y  
Sbjct: 236 SLAVQLGSK------FSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMN 289

Query: 240 YYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
                V +     E    K       V+ DSG+ YT+L  + Y+ L + +K  L  K  +
Sbjct: 290 LEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILDPKLER 349

Query: 296 EAPEDETLPLCWKGR 310
               D    LC+ GR
Sbjct: 350 FWFRDF---LCYHGR 361


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 134/315 (42%), Gaps = 42/315 (13%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAP---GHHNCE 120
           +DT S+LTW+QC+ PC  C +   PL+ PS+      VPC    C +L          C+
Sbjct: 128 VDTASELTWVQCE-PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACD 186

Query: 121 D-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLD 178
           D PA C Y L Y DG  S GVL  D  +    + Q        GCG  NQ P   +    
Sbjct: 187 DQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQ----GFVFGCGTSNQGP---FGGTS 239

Query: 179 GILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDDL---YDSSRVVWTS 232
           G++GLG+ + S++SQ   Q     V  +CL     G  G L  GDD     +S+ +V+T+
Sbjct: 240 GLMGLGRSQLSLISQTMDQ--FGGVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTA 297

Query: 233 MSSDYTK--YYSPGVAELFFGGETTGLKNL------PVVFDSGSSYTYLNRVTYQTLTSI 284
           M SD  +  +Y   +  +  GGE               + DSG+  T L    Y  + + 
Sbjct: 298 MVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAE 357

Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
              +L+     +A     L  C+        + +V+    +L L F DG      +    
Sbjct: 358 FVSQLA--EYPQAAPFSILDTCFD----LTGLREVQ--VPSLKLVF-DGGAEVEVDSKGV 408

Query: 345 AYLIISNKGNVCLGI 359
            Y++  +   VCL +
Sbjct: 409 LYVVTGDASQVCLAL 423


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 38/345 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P++    
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L   G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 232 ANVSCAAPACSDLDVSG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 285 GFRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGK--YGGVFAHCLPPRSTGTGYL 340

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
            FG     ++        +  T YY  G+  +  GG    L   P VF       DSG+ 
Sbjct: 341 DFGAGSPPATTTTPMLTGNGPTFYYV-GMTGIRVGGRL--LPIAPSVFAAAGTIVDSGTV 397

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
            T L    Y +L S     ++A+  ++A     L  C+     F  +  V     T++L 
Sbjct: 398 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYD----FTGMSQVA--IPTVSLL 451

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           F  G      ++     +   +   VCL      + G  D+ ++G
Sbjct: 452 FQGGAA---LDVDASGIMYTVSASQVCLAFAGNEDGG--DVGIVG 491


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 81/277 (29%), Positives = 123/277 (44%), Gaps = 28/277 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VPCED 105
           Y VT+ IG PAR + +  DTGSDLTW+QC  PC   C +   PL+ PS       VPC  
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCK-PCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
           P C      G         C+Y ++Y D   + G L ++AF  + +        +  GC 
Sbjct: 185 PQCKI--GGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAG--VVFGCS 240

Query: 166 Y---NQVPGASYH-PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
           +   + V GA     + G+LGLG+G SSI+SQ        +V  +CL   G   G+L  G
Sbjct: 241 HEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNS-GDVFSYCLPPRGSSAGYLTIG 299

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSY 270
                 S + +T + +D ++  S  V  L   G +     LP+         V DSG+  
Sbjct: 300 AAAPPQSNLSFTPLVTDNSQLSSVYVVNLV--GISVSGAALPIDASAFYIGTVIDSGTVI 357

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           T++    Y  L    ++ +   ++      E+L  C+
Sbjct: 358 THMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCY 394


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 98/366 (26%), Positives = 159/366 (43%), Gaps = 50/366 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-----LYRPSNDLVP 102
           +G Y V++ +G P +   L  DTGSDLTW++C A    C  + HP     L R S    P
Sbjct: 80  SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNC--SIHPPGSTFLARHSTTFSP 137

Query: 103 --CEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
             C   +C  +  P  + C      + C YE  Y+DG  + G   K+    N ++G+ + 
Sbjct: 138 THCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMK 197

Query: 158 PR-LALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN----VVGHCL 208
            + +A GCG++     + G+S++   G++GLG+G  S  SQL  ++  R+    ++ + L
Sbjct: 198 LKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQL-GRRFGRSFSYCLLDYTL 256

Query: 209 SGGGGGFLFFGDDLY----DSSRVVWTSM--SSDYTKYYSPGVAELFFGG---------- 252
           S     +L  GD +     + S + +T +  + +   +Y   +  +F  G          
Sbjct: 257 SPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVW 316

Query: 253 ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE--APEDETLPLCWKGR 310
               L N   V DSG++ T+L    Y+ + S  K+E+   S     A       LC    
Sbjct: 317 SLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCV--- 373

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDL 370
               NV  V +  R   LS   G   +L+   P  Y I  ++G  CL I    E      
Sbjct: 374 ----NVTGVSRP-RFPRLSLELGG-ESLYSPPPRNYFIDISEGIKCLAI-QPVEAESGRF 426

Query: 371 NVIGGI 376
           +VIG +
Sbjct: 427 SVIGNL 432


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 134/336 (39%), Gaps = 58/336 (17%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSNDL----V 101
           Y   + +G P++ Y++ +DTGSD+ W+ C   C +C     +     LY P++ +    V
Sbjct: 27  YFAKIGLGNPSKDYYVQVDTGSDILWVNC-IGCDKCPTKSDLGIKLTLYDPASSVSATRV 85

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----N 157
            C+D  C S +     +C+    C Y + Y DG S+ G  V DA  F    G       N
Sbjct: 86  SCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSN 145

Query: 158 PRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
             +  GCG  Q    G S   LDGILG                       HCL    GG 
Sbjct: 146 GTVTFGCGAQQSGGLGTSGEALDGILG--------------------AFAHCLDNVNGGG 185

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSG 267
           +F   +L  S +V  T M  +   +Y+  + E+  GG    L             + DSG
Sbjct: 186 IFAIGELV-SPKVNTTPMVPN-QAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSG 243

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++  YL  V Y ++ + ++ +    SL    E     +C      FK   +V   F  + 
Sbjct: 244 TTLAYLPEVVYDSMMNEIRSQQPGLSLHTVEEQF---IC------FKYSGNVDDGFPDIK 294

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
             F D  T T++   P  YL   ++   C G  NG 
Sbjct: 295 FHFKDSLTLTVY---PHDYLFQISEDIWCFGWQNGG 327


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 38/345 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
           G    TG Y VT+ +G PA  Y +  DTGSD TW+QC    V C E    L+ P++    
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234

Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             V C  P C+ L   G   C     C Y ++Y DG  S+G    D    +  +  +   
Sbjct: 235 ANVSCAAPACSDLDVSG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 287

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GCG        +    G+LGLG+GK+S+  Q + +     V  HCL     G G+L
Sbjct: 288 GFRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGK--YGGVFAHCLPARSTGTGYL 343

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
            FG     ++        +  T YY  G+  +  GG    L   P VF       DSG+ 
Sbjct: 344 DFGAGSPPATTTTPMLTGNGPTFYYV-GMTGIRVGGRL--LPIAPSVFAAAGTIVDSGTV 400

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
            T L    Y +L S     ++A+  ++A     L  C+     F  +  V     T++L 
Sbjct: 401 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYD----FTGMSQVA--IPTVSLL 454

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           F  G      ++     +   +   VCL      + G  D+ ++G
Sbjct: 455 FQGGAA---LDVDASGIMYTVSASQVCLAFAGNEDGG--DVGIVG 494


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 85/336 (25%), Positives = 141/336 (41%), Gaps = 36/336 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G+Y + + IG P    +   DTGSDLTW  C  PC  C +  +P++ P        + C+
Sbjct: 70  GHYLMELSIGTPPFKIYGIADTGSDLTWTSC-VPCNNCYKQRNPMFDPQKSTTYRNISCD 128

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
             +C  L       C    +C+Y   YA    + GVL ++    + T G+ +  + +  G
Sbjct: 129 SKLCHKLDT---GVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFG 185

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS----QKLIRNVVGHCLSGGGGGFLFFG 219
           CG+N   G + H + GI+GLG G  S++SQ+ S    ++  + +V           + FG
Sbjct: 186 CGHNNTGGFNDHEM-GIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFG 244

Query: 220 DDLYDSSR-VVWTSM--SSDYTKYY------SPGVAELFFGGETTGLKNLPVVFDSGSSY 270
                S + VV T +    D T Y+      S     L F G +  ++   +  DSG+  
Sbjct: 245 KGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPP 304

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y  + + ++ E++ K + + P D    LC++ +   +           L   F
Sbjct: 305 TILPTQLYDQVVAQVRSEVAMKPVTDDP-DLGPQLCYRTKNNLRG--------PVLTAHF 355

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
                +    L+P    I    G  CLG  N +  G
Sbjct: 356 EGADVK----LSPTQTFISPKDGVFCLGFTNTSSDG 387


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 93/329 (28%), Positives = 139/329 (42%), Gaps = 45/329 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSN----DLVPCED 105
           + VT+  G PA+ Y L  DTGSD++W+QC  PC   C +   P++ P+       VPC  
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSAVPCGH 178

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
           P CA+        C     C Y+++Y DG S+ GVL  +  +       R  P  A GCG
Sbjct: 179 PQCAAAGG----KCSSNGTCLYKVQYGDGSSTAGVLSHETLSL---TSARALPGFAFGCG 231

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLY 223
              +    +  +DG++GLG+G+ S+ SQ  +         +CL       G+L  G    
Sbjct: 232 ETNL--GDFGDVDGLIGLGRGQLSLSSQAAASFGAAFS--YCLPSYNTSHGYLTIGTTTP 287

Query: 224 DSSR--VVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTY 272
            S    V +T+M    DY  +Y   +  +  GG    L   P++F       DSG+  TY
Sbjct: 288 ASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFV--LPVPPILFTRDGTLLDSGTVLTY 345

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL-ALSFT 331
           L    Y  L    K   +    K AP  +    C+       +       F  L +  F+
Sbjct: 346 LPPEAYTALRDRFK--FTMTQYKPAPAYDPFDTCY-------DFAGQNAIFMPLVSFKFS 396

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGIL 360
           DG +   F+L+P   LI  +      G L
Sbjct: 397 DGSS---FDLSPFGVLIFPDDTAPATGCL 422


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/349 (27%), Positives = 151/349 (43%), Gaps = 46/349 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VP 102
           +G Y VT+ +G P R      DTGSDLTW QC+ PCV  C +    ++ PS  L    V 
Sbjct: 144 SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVS 202

Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C+ P C  L  A G+      + C Y + Y DG  S+G   ++  +   T+   +     
Sbjct: 203 CDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQ 259

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
            GCG N      +    G+LGL +   S+VSQ  +QK  + V  +CL  S    G+L FG
Sbjct: 260 FGCGQNN--RGLFGGTAGLLGLARNPLSLVSQT-AQKYGK-VFSYCLPSSSSSTGYLSFG 315

Query: 220 DDLYDSSRVVWT--SMSSDYTKYYSPGVAELFFGGETTGLKNLPV----------VFDSG 267
               DS  V +T   ++SDY  +Y      L   G + G + LP+          + DSG
Sbjct: 316 SGDGDSKAVKFTPSEVNSDYPSFYF-----LDMVGISVGERKLPIPKSVFSTAGTIIDSG 370

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +  + L    Y ++  + ++ +S            L  C+   + +K V   K     + 
Sbjct: 371 TVISRLPPTVYSSVQKVFRELMS--DYPRVKGVSILDTCYDLSK-YKTVKVPK-----II 422

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           L F+ G      +L PE  + +     VCL     ++    ++ +IG +
Sbjct: 423 LYFSGGAE---MDLAPEGIIYVLKVSQVCLAFAGNSDD--DEVAIIGNV 466


>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
          Length = 133

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 68/124 (54%), Gaps = 3/124 (2%)

Query: 176 PLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMS 234
           P+DGILGLG GK+    QL  QK+I  NV+GHCLS  G G L+ GD    S  V W  M 
Sbjct: 8   PVDGILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVTWVPMK 67

Query: 235 SDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
                YYSPG+AE     +   G      VFDSGS+YT++    Y  + S ++  LS  S
Sbjct: 68  ESLF-YYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVRGTLSESS 126

Query: 294 LKEA 297
           L+E 
Sbjct: 127 LEEV 130


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 130/314 (41%), Gaps = 60/314 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPC 103
           +G Y V + IG P   Y   +DTGSDL W QC APC+ C + P P +      +   +PC
Sbjct: 86  SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPC 144

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLAL 162
               CASL +P          C Y+  Y D  S+ GVL  + F F   N  ++    +A 
Sbjct: 145 RSSRCASLSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF 200

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
           GCG   +         G++G G+G  S+VSQL   +       +CL+         L+FG
Sbjct: 201 GCG--SLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF-----SYCLTSYLSATPSRLYFG 253

Query: 220 DDLYDSSRVVWTSMSSDYTK----------YYSPGVAELFF---GGETTGLKNLP----- 261
                    V+ ++SS  T             +P +  ++F      + G K LP     
Sbjct: 254 ---------VYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLV 304

Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
                     V+ DSG+S T+L +  Y+ +   +   +   ++ +   D  L  C++   
Sbjct: 305 FAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMND--TDIGLDTCFQWPP 362

Query: 312 PFKNVHDVKKCFRT 325
           P  NV      FRT
Sbjct: 363 P-PNVTVTVPDFRT 375


>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 77/272 (28%), Positives = 118/272 (43%), Gaps = 24/272 (8%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWL--QCDAPCVRCVEAPH------PLYRPSNDL- 100
           Y NVT  IG PA+ + + LDTGSDL WL   C++ CVR +E          +Y PS    
Sbjct: 90  YANVT--IGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYNPSKSKS 147

Query: 101 ---VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
              V C   +CA       + C  P + C Y + Y   GS S GVLV+D    +   G+ 
Sbjct: 148 SSKVTCNSTLCAL-----RNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEA 202

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
            + R+  GC  +Q+       ++GI+GL     ++ + L    +  +    C    G G 
Sbjct: 203 RDARITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGT 262

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
           + FGD    SS  + T +S   +  +       F  G+ T        FDSG++ T+L  
Sbjct: 263 ISFGDK--GSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTATFDSGTAVTWLIE 320

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
             Y  LT+     +  + L ++  D     C+
Sbjct: 321 PYYTALTTNFHLSVPDRRLSKS-VDSPFEFCY 351


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/306 (29%), Positives = 131/306 (42%), Gaps = 43/306 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
           Y V +  G PA P  + +DTGSD++WLQC  PC   +C     PLY PS+      VPC 
Sbjct: 79  YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137

Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             +C  L A  +   C    QC + + YADG S++G   +D           +      G
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL---APGAIVQNFYFG 194

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFFGDD 221
           CG+ +   A     DG+LGLG+ + S+ ++         V  +CL       GFL  G  
Sbjct: 195 CGHGK--HAVRGLFDGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFLALGAG 246

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------VVFDSGSSYTY 272
             + S  V+T M    T    P  + +   G   G K L          ++ DSG+  T 
Sbjct: 247 -KNPSGFVFTPMG---TVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITG 302

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    Y+ L S  +K + A  L    + +T   C+     +KNV   K     +AL+FT 
Sbjct: 303 LQSTAYRALRSAFRKAMEAYRLLPNGDLDT---CYN-LTGYKNVVVPK-----IALTFTG 353

Query: 333 GKTRTL 338
           G T  L
Sbjct: 354 GATINL 359


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 119/284 (41%), Gaps = 33/284 (11%)

Query: 51  YNVTMYIGQP-ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY--RPSNDL--VPCED 105
           Y + + IG P ++P  L LDTGSD+ W QC+ PC  C   P P +    SN +  V C D
Sbjct: 92  YLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE-PCAECFTQPLPRFDTAASNTVRSVACSD 150

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YTNGQRLNPRLALG 163
           P+C   +A   H C     C Y   Y DG  S G  ++D+F F+     G+   P +  G
Sbjct: 151 PLC---NAHSEHGCFLHG-CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFG 206

Query: 164 CG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
           CG YN   G       GI G G+G  S+ SQL  ++          +     FL    DL
Sbjct: 207 CGMYNA--GRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAKSSPVFLGGAGDL 264

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAE----LFFGGETTGLKNLPV-----------VFDSG 267
              +      +S+ + +   PG       L F G T G   LPV             DSG
Sbjct: 265 --KAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSG 322

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
           +  T      ++ L S    + +    K A ED+ +   W G++
Sbjct: 323 TDITTFPDAVFRQLKSAFIAQAALPVNKTADEDD-ICFSWDGKK 365


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 155/361 (42%), Gaps = 56/361 (15%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           + G    +G Y   + +G PAR  ++ LDTGSD+ W+QC APC++C     P++ P+   
Sbjct: 135 ISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQC-APCIKCYSQTDPVFDPTKSR 193

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
               +PC  P+C  L  PG   C    Q C Y++ Y DG  ++G    +   F    G R
Sbjct: 194 SFANIPCGSPLCRRLDYPG---CSTKKQICLYQVSYGDGSFTVGEFSTETLTF---RGTR 247

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-- 213
           +  R+ LGCG++      +    G+LGLG+G+ S  SQ+   +   +   +CL       
Sbjct: 248 VG-RVVLGCGHDN--EGLFVGAAGLLGLGRGRLSFPSQIG--RRFNSKFSYCLGDRSASS 302

Query: 214 --GFLFFGDDLYDSSRVVWTSMSSD---YTKYYSP------------GVAELFFGGETTG 256
               + FGD    S    +T + S+    T YY              G++   F  ++TG
Sbjct: 303 RPSSIVFGDSAI-SRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG 361

Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
             N  V+ DSG+S T L R  Y  L       + A +LK APE      C+         
Sbjct: 362 --NGGVIIDSGTSVTRLTRAAYVALRDAFL--VGASNLKRAPEFSLFDTCFD----LSGK 413

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGG 375
            +VK    T+ L F          L    YLI + N G+ C      A      L++IG 
Sbjct: 414 TEVK--VPTVVLHFRGADV----PLPASNYLIPVDNSGSFCFAFAGTAS----GLSIIGN 463

Query: 376 I 376
           I
Sbjct: 464 I 464


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/306 (29%), Positives = 131/306 (42%), Gaps = 43/306 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
           Y V +  G PA P  + +DTGSD++WLQC  PC   +C     PLY PS+      VPC 
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171

Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             +C  L A  +   C    QC + + YADG S++G   +D           +      G
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL---APGAIVQNFYFG 228

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFFGDD 221
           CG+ +   A     DG+LGLG+ + S+ ++         V  +CL       GFL  G  
Sbjct: 229 CGHGK--HAVRGLFDGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFLALGAG 280

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------VVFDSGSSYTY 272
             + S  V+T M    T    P  + +   G   G K L          ++ DSG+  T 
Sbjct: 281 -KNPSGFVFTPMG---TVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITG 336

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    Y+ L S  +K + A  L    + +T   C+     +KNV   K     +AL+FT 
Sbjct: 337 LQSTAYRALRSAFRKAMEAYRLLPNGDLDT---CYN-LTGYKNVVVPK-----IALTFTG 387

Query: 333 GKTRTL 338
           G T  L
Sbjct: 388 GATINL 393


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 152/351 (43%), Gaps = 52/351 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP----S 97
           Y   + +G P   + + LDTGSDL WL C+    C+R +E        P  LY P    +
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161

Query: 98  NDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           +  + C D  C      G   C  P+  C Y++ Y++   + G L++D      T  + L
Sbjct: 162 SSSIRCSDKRCF-----GSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHL-ATEDENL 215

Query: 157 NP---RLALGCGYNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-- 210
            P    + LGCG  Q       + ++G+LGLG    S+ S L    +  N    C     
Sbjct: 216 TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVI 275

Query: 211 GGGGFLFFGDDLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
           G  G + FGD  Y D     + S++   +  Y   ++ +   G+   ++ L   FD+GSS
Sbjct: 276 GNVGRISFGDRGYTDQEETPFISVAP--STAYGVNISGVSVAGDPVDIR-LFAKFDTGSS 332

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           +T+L    Y  LT         KS  E  ED   P+      PF+  +D+     T+   
Sbjct: 333 FTHLREPAYGVLT---------KSFDELVEDRRRPV--DPELPFEFCYDLSPNATTIQFP 381

Query: 330 FTD----GKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
             +    G ++ +  L    +   + +GNV  CLG+L    VGL+ +NVIG
Sbjct: 382 LVEMTFIGGSKII--LNNPFFTARTQEGNVMYCLGVLK--SVGLK-INVIG 427


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/390 (27%), Positives = 162/390 (41%), Gaps = 70/390 (17%)

Query: 27  SSLFNHVGSSLL-FQVH-----------GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           +SLF+H  S++   Q H           G    T  Y VT+ IG   +   L +DTGSDL
Sbjct: 109 NSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDL 166

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH----APGHHNCEDPAQCD 126
           TW+QC  PC  C     PL+ PSN      +PC  P C +L     + G  + ++   CD
Sbjct: 167 TWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCD 225

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
           Y+++Y DG  S G L  +      T G+        GCG N      +    G++GL + 
Sbjct: 226 YQIDYGDGSYSRGELGFEKL----TLGKTEIDNFIFGCGRNN--KGLFGGASGLMGLARS 279

Query: 187 KSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY- 241
           + S+VSQ  S  L  +V  +CL     G  G     G D  +   +   S    YT+   
Sbjct: 280 ELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPIS----YTRMIQ 333

Query: 242 SPGVAELFF---GGETTGLKNLPV-----------VFDSGSSYTYLNRVTYQTLTSIMKK 287
           +P ++  +F    G + G  NL V           + DSG+  T L+   Y+   +  +K
Sbjct: 334 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEK 393

Query: 288 ELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEA 345
           + S    +  P    L  C+   G     N+  VK  F        +G    + ++    
Sbjct: 394 QFSG--YRTTPGFSILNTCFNLTGYEEV-NIPTVKFIF--------EGNAEMIVDVEGVF 442

Query: 346 YLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           Y + S+   +CL     A +G +D  +I G
Sbjct: 443 YFVKSDASQICLAF---ASLGYEDQTMIIG 469


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/390 (27%), Positives = 162/390 (41%), Gaps = 70/390 (17%)

Query: 27  SSLFNHVGSSLL-FQVH-----------GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           +SLF+H  S++   Q H           G    T  Y VT+ IG   +   L +DTGSDL
Sbjct: 30  NSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDL 87

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH----APGHHNCEDPAQCD 126
           TW+QC  PC  C     PL+ PSN      +PC  P C +L     + G  + ++   CD
Sbjct: 88  TWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCD 146

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
           Y+++Y DG  S G L  +      T G+        GCG N      +    G++GL + 
Sbjct: 147 YQIDYGDGSYSRGELGFEKL----TLGKTEIDNFIFGCGRNN--KGLFGGASGLMGLARS 200

Query: 187 KSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY- 241
           + S+VSQ  S  L  +V  +CL     G  G     G D  +   +   S    YT+   
Sbjct: 201 ELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPIS----YTRMIQ 254

Query: 242 SPGVAELFF---GGETTGLKNLPV-----------VFDSGSSYTYLNRVTYQTLTSIMKK 287
           +P ++  +F    G + G  NL V           + DSG+  T L+   Y+   +  +K
Sbjct: 255 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEK 314

Query: 288 ELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEA 345
           + S    +  P    L  C+   G     N+  VK  F        +G    + ++    
Sbjct: 315 QFSG--YRTTPGFSILNTCFNLTGYEEV-NIPTVKFIF--------EGNAEMIVDVEGVF 363

Query: 346 YLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           Y + S+   +CL     A +G +D  +I G
Sbjct: 364 YFVKSDASQICLAF---ASLGYEDQTMIIG 390


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 107/350 (30%), Positives = 142/350 (40%), Gaps = 51/350 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---AP---CVRCVEAPHPLYRPSN---- 98
           G Y V+M  G P +   L  DTGSDL WLQC    AP   C +   +  P +  S     
Sbjct: 52  GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATL 111

Query: 99  DLVPCEDPICASLHAPGHH--NCE--DPAQCDYELEYADGGSSLGVLVKD-AFAFNYTNG 153
            +VPC    C  + AP  H  +C    P  C Y  +YADG S+ G L +D A   N T+G
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 171

Query: 154 QRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LS 209
                 +A GCG  NQ  G S+    G++GLG+G+ S  +Q  S  L      +C   L 
Sbjct: 172 GAAVRGVAFGCGTRNQ--GGSFSGTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDLE 227

Query: 210 GGGGG----FLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTG------- 256
           GG  G    FLF G     ++   +T + S+     +Y  GV  +  G            
Sbjct: 228 GGRRGRSSSFLFLGRPERRAA-FAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 286

Query: 257 ---LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKG 309
              L N   V DSGS+ TYL    Y  L S     +    L   P   T    L LC+  
Sbjct: 287 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV---HLPRIPSSATFFQGLELCYNV 343

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
                ++      F  L + F  G +    EL    YL+       CL I
Sbjct: 344 SSS-SSLAPANGGFPRLTIDFAQGLS---LELPTGNYLVDVADDVKCLAI 389


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 80/281 (28%), Positives = 125/281 (44%), Gaps = 44/281 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y V + +G P        DTGSD+ W QC  PC  C +   P++ PS       V C 
Sbjct: 81  GEYLVEISVGTPPFSIVAVADTGSDVIWTQCK-PCSNCYQQNAPMFDPSKSTTYKNVACS 139

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
            P+C+  ++    +C D ++C Y + Y D   S G L  D      T+G+ +  PR  +G
Sbjct: 140 SPVCS--YSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIG 197

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF------LF 217
           CG++   G     + GI+GLG+G +S+V+QL      +    +CL   G G       L 
Sbjct: 198 CGHDNA-GTFNANVSGIVGLGRGPASLVTQLGPATGGK--FSYCLIPIGTGSTNDSTKLN 254

Query: 218 FGDDLYDS-SRVVWTSM--SSDYTKYYS----------------PGVAELFFGGETTGLK 258
           FG +   S S  V T +  S+ Y  +YS                 G ++L  GGE+    
Sbjct: 255 FGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL--GGESN--- 309

Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
              ++ DSG++ TYL      +  S + + +S    ++  E
Sbjct: 310 ---IIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSE 347


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 156/371 (42%), Gaps = 64/371 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP---LYRPSNDLVP-- 102
           +G Y V + +G P +   L  DTGSDL W++C A C  C   P     L R S+   P  
Sbjct: 85  SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA-CRNCSHHPPSSAFLPRHSSSFSPFH 143

Query: 103 CEDPICASL-HAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
           C DP C  L HAP HH C      + C +   YADG  S G   K+       +G  ++ 
Sbjct: 144 CFDPHCRLLPHAP-HHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHL 202

Query: 159 R-LALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSG 210
           + L+ GCG+      V GA ++   G++GLG+G  S  SQL  +   K    ++ + LS 
Sbjct: 203 KGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSP 262

Query: 211 GGGGFLFFGDDLY-----DSSRVVWTSMSSD---YTKYY---------------SPGVAE 247
               FL  G  L+     +++++ +T +  +    T YY               +P V E
Sbjct: 263 PPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWE 322

Query: 248 LFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE--DETLPL 305
           +   G      N   V DSG++ TYL +  Y+ +   +++ +   +  E     D  +  
Sbjct: 323 IDEQG------NGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNA 376

Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV 365
             + RRP             L      G    +F   P  Y + + +G +CL I    E 
Sbjct: 377 SGESRRP---------SLPRLRFRLGGG---AVFAPPPRNYFLETEEGVMCLAI-RAVES 423

Query: 366 GLQDLNVIGGI 376
           G    +VIG +
Sbjct: 424 G-NGFSVIGNL 433


>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
          Length = 320

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 88/188 (46%), Gaps = 22/188 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYRP--S 97
           TG Y   + IG P + Y++ +DTGSD+ W+ C    +RC   P           Y P  S
Sbjct: 81  TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGS 136

Query: 98  NDLVPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
              V CE   C +  A G    C   +  C + + Y DG ++ G  V D   +N    NG
Sbjct: 137 GTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196

Query: 154 QRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
           Q    N  +  GCG       G+S   LDGILG G+  SS++SQL + + +R +  HCL 
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256

Query: 210 GGGGGFLF 217
              GG +F
Sbjct: 257 TVRGGGIF 264


>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
          Length = 127

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/127 (41%), Positives = 71/127 (55%), Gaps = 5/127 (3%)

Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
           CGY Q   A     P+DGILGLG GK+ + +QL   K+I+ NV+GHCLS  G G L+ GD
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
               +  V W  M      YYSPG+AE+F   +   G      VFDSGS+YT++    Y 
Sbjct: 61  FNPPTRGVTWVPMRESLF-YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119

Query: 280 TLTSIMK 286
            + S ++
Sbjct: 120 EIVSKVR 126


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 96/370 (25%), Positives = 139/370 (37%), Gaps = 67/370 (18%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           + G  + +G Y  ++ +G P  P  L +DTGSD+ WLQC  PCV C     PLY P    
Sbjct: 89  ISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCK-PCVHCYRQLSPLYDPRGSS 147

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCD-------YELEYADGGSSLGVLVKDAFAFN 149
                PC  P            C +P  CD       Y + Y D  S+ G L  D   F 
Sbjct: 148 TYAQTPCSPP-----------QCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVF- 195

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
            +N   +   + LGCG++      +    G+LG+ +G +S  +Q+           +CL 
Sbjct: 196 -SNDTSVG-NVTLGCGHDNE--GLFGSAAGLLGVARGNNSFATQVADS--YGRYFAYCLG 249

Query: 209 ----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLP 261
               SG    +L FG    +    V+T + S+  +   YY   V     G   TG  N  
Sbjct: 250 DRTRSGSSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNAS 309

Query: 262 -----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                      VV DSG+S T   R  Y  L        +   +++           +G 
Sbjct: 310 LSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKV---------GRGI 360

Query: 311 RPFKNVHDVKKCFRT----LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
             F   +D++         + L F  G       L PE YL+    G      L  A  G
Sbjct: 361 SVFDACYDLRGVAVADAPGVVLHFAGGAD---VALPPENYLVPEESGRYHCFALEAA--G 415

Query: 367 LQDLNVIGGI 376
              L+VIG +
Sbjct: 416 HDGLSVIGNV 425


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 103/203 (50%), Gaps = 16/203 (7%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +D+GS +T++ C + C +C +   P ++P  
Sbjct: 81  MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQP-- 137

Query: 99  DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           ++     P+  ++      NC+D   QC YE EYA+  SS GVL +D  +F   N  +L 
Sbjct: 138 EMSSTYQPVKCNMDC----NCDDDREQCVYEREYAEHSSSKGVLGEDLISFG--NESQLT 191

Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
           P R   GC   +         DGI+GLG+G  S+V QL  + LI N  G C  G   GGG
Sbjct: 192 PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGG 251

Query: 214 GFLFFGDDLYDSSRVVWTSMSSD 236
             +  G D    S +V+T    D
Sbjct: 252 SMILGGFDY--PSDMVFTDSDPD 272


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 99/347 (28%), Positives = 149/347 (42%), Gaps = 54/347 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V +Y+G P R + + +DTGSDL WLQC APC+ C E   P++ P+  L    V C
Sbjct: 149 SGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPATSLSYRNVTC 207

Query: 104 EDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP 158
            DP C  +  P     C  P    C Y   Y D  ++ G L  +AF  N T     R   
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVD 267

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLSGGG-- 212
            +  GCG++      +H   G+LGLG+G  S  SQL      R V GH    CL   G  
Sbjct: 268 DVVFGCGHSN--RGLFHGAAGLLGLGRGALSFASQL------RAVYGHAFSYCLVDHGSS 319

Query: 213 -GGFLFFGDD--LYDSSRVVWTSMSSDYT----KYYSPGVAELFFGGETTGLKNLP---- 261
            G  + FGDD  L    R+ +T+ +         +Y   +  +  GGE   +        
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379

Query: 262 ------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                  + DSG++ +Y     Y+    ++++    +  K  P     P+      P  N
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYE----VIRRAFVERMDKAYPLVADFPVL----SPCYN 431

Query: 316 VHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
           V  V++      +L F DG    +++   E Y + +   G +CL +L
Sbjct: 432 VSGVERVEVPEFSLLFADG---AVWDFPAENYFVRLDPDGIMCLAVL 475


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 151/367 (41%), Gaps = 57/367 (15%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G  L   VH      G + + + IG PA  Y   +DTGSDL W QC  PCV C +   P+
Sbjct: 81  GGDLQVPVHAG---NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPV 136

Query: 94  YRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
           + PS+      VPC    C+ L       C   ++C Y   Y D  S+ GVL  + F   
Sbjct: 137 FDPSSSSTYATVPCSSASCSDLPT---SKCTSASKCGYTYTYGDSSSTQGVLATETF--- 190

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            T  +   P +  GCG +   G  +    G++GLG+G  S+VSQL   K       +CL+
Sbjct: 191 -TLAKSKLPGVVFGCG-DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLT 243

Query: 210 GGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLK 258
                       G      +    +S V  T +  + ++  +Y   +  +  G     L 
Sbjct: 244 SLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLP 303

Query: 259 NLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           +            V+ DSG+S TYL    Y+ L    KK  +A+    A +   + L   
Sbjct: 304 SSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLC 359

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGL 367
            R P K V  V+     L   F  G      +L  E Y+++    G +CL ++     G 
Sbjct: 360 FRAPAKGVDQVE--VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GS 409

Query: 368 QDLNVIG 374
           + L++IG
Sbjct: 410 RGLSIIG 416


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 151/367 (41%), Gaps = 57/367 (15%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G  L   VH      G + + + IG PA  Y   +DTGSDL W QC  PCV C +   P+
Sbjct: 91  GGDLQVPVHAG---NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPV 146

Query: 94  YRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
           + PS+      VPC    C+ L       C   ++C Y   Y D  S+ GVL  + F   
Sbjct: 147 FDPSSSSTYATVPCSSASCSDLPT---SKCTSASKCGYTYTYGDSSSTQGVLATETF--- 200

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
            T  +   P +  GCG +   G  +    G++GLG+G  S+VSQL   K       +CL+
Sbjct: 201 -TLAKSKLPGVVFGCG-DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLT 253

Query: 210 GGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLK 258
                       G      +    +S V  T +  + ++  +Y   +  +  G     L 
Sbjct: 254 SLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLP 313

Query: 259 NLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           +            V+ DSG+S TYL    Y+ L    KK  +A+    A +   + L   
Sbjct: 314 SSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLC 369

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGL 367
            R P K V  V+     L   F  G      +L  E Y+++    G +CL ++     G 
Sbjct: 370 FRAPAKGVDQVE--VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GS 419

Query: 368 QDLNVIG 374
           + L++IG
Sbjct: 420 RGLSIIG 426


>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 530

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 33/261 (12%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-SNDL-------------VP 102
           IG P   + + LDTGSD+ W+ CD  C+ C       Y     DL             +P
Sbjct: 108 IGTPNVSFLVALDTGSDMFWVPCD--CIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLP 165

Query: 103 CEDPICASLHAPGHHNCED-PAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNP 158
           C   +C       + NC+    +C Y  EY +D  SS G L++D       N  +  +  
Sbjct: 166 CGHQLCNQ-----NSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKNSIQA 220

Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
            + LGCG  Q    + GA+    +G+LGLG G  S+ + L    LIRN +  CL+  G G
Sbjct: 221 SVILGCGRKQSGYFLEGAA---PNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSG 277

Query: 215 FLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
            + FGD  + + R     +  D     Y  GV     G             D+G+S+TYL
Sbjct: 278 RILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYL 337

Query: 274 NRVTYQTLTSIMKKELSAKSL 294
            +  Y+T+ +  +K++ A  +
Sbjct: 338 PKGVYETVVAEFEKQVHATRI 358


>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
          Length = 133

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 69/124 (55%), Gaps = 3/124 (2%)

Query: 176 PLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMS 234
           P+DGILGLG GK+   +QL  QK+I  NV+GHCLS  G G L+ G+    S  V W  M 
Sbjct: 8   PVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRGVTWVPM- 66

Query: 235 SDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
            + + YYSPG+AEL    +   G      VFDSGS+YT +    Y  +   ++  LS  S
Sbjct: 67  RESSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIVPKVRGTLSESS 126

Query: 294 LKEA 297
           L E 
Sbjct: 127 LAEV 130


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 151/365 (41%), Gaps = 57/365 (15%)

Query: 36  SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
           S L  VH      G + + + IG PA  Y   +DTGSDL W QC  PCV C +   P++ 
Sbjct: 62  SRLVPVHAG---NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFD 117

Query: 96  PSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
           PS+      VPC    C+ L       C   ++C Y   Y D  S+ GVL  + F    T
Sbjct: 118 PSSSSTYATVPCSSASCSDLPT---SKCTSASKCGYTYTYGDSSSTQGVLATETF----T 170

Query: 152 NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
             +   P +  GCG +   G  +    G++GLG+G  S+VSQL   K       +CL+  
Sbjct: 171 LAKSKLPGVVFGCG-DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLTSL 224

Query: 212 G---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLKNL 260
                     G      +    +S V  T +  + ++  +Y   +  +  G     L + 
Sbjct: 225 DDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSS 284

Query: 261 P----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                      V+ DSG+S TYL    Y+ L    KK  +A+    A +   + L    R
Sbjct: 285 AFAVQDDGTGGVIVDSGTSITYLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLCFR 340

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGLQD 369
            P K V  V+     L   F  G      +L  E Y+++    G +CL ++     G + 
Sbjct: 341 APAKGVDQVE--VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GSRG 390

Query: 370 LNVIG 374
           L++IG
Sbjct: 391 LSIIG 395


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 157/371 (42%), Gaps = 67/371 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y  T+ +G PA+ + +  DTGSDL W+QC  PC  C     P++ P    S   + C 
Sbjct: 38  GDYVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCG 96

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
           D +C SL      +C     CDY   Y DG  + G L  +      T G++L  + +A G
Sbjct: 97  DTLCDSLP---RKSCS--PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFG 151

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
           CG+  +   S++   G++GLG+G  S VSQL    L  +   +CL     +      +FF
Sbjct: 152 CGH--LNRGSFNDASGLVGLGRGNLSFVSQLG--DLFGHKFSYCLVPWRDAPSKTSPMFF 207

Query: 219 GDDLYDSSRVVWTSMSSDYTKY-YSPGVAELFFGGETTGLKNLPV--------------- 262
           GD+   SS      +   +T   ++P +   ++      LK++ +               
Sbjct: 208 GDE--SSSHSSGKKLHYAFTPMIHNPAMESFYY----VKLKDISIAGRALRIPAGSFDIK 261

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                  +FDSG++ T L    YQ +   ++ ++S   +  +     L LC+       +
Sbjct: 262 PDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGS--SAGLDLCY-------D 312

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNG-AEVGL----- 367
           V   K  ++    +          +L  E Y I +N     VCL +++   ++G+     
Sbjct: 313 VSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMM 372

Query: 368 -QDLNVIGGIG 377
            Q+  V+  IG
Sbjct: 373 QQNFRVMYDIG 383


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 141/348 (40%), Gaps = 38/348 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           TG Y V + +G P + + L  DTGSDLTW++C          P  ++RP        +PC
Sbjct: 113 TGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAG-----ASPPGRVFRPKTSRSWAPIPC 167

Query: 104 EDPICASLHAP-GHHNCEDPAQ-CDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLNPR- 159
               C  L  P    NC  PA  C Y+  Y +G + + G++  ++       G+    + 
Sbjct: 168 SSDTC-KLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKD 226

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGGGGGFL 216
           + LGC  +   G S+   DG+L LG  K S  +Q  ++        +V H       G+L
Sbjct: 227 VVLGCSSSH-DGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYL 285

Query: 217 FFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGG-------ETTGLKNLPVVFDSGS 268
            FG      +    T +  D    +Y   V  +   G       E    K+  V+ DSG+
Sbjct: 286 AFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGN 345

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           + T L    Y+ + + + K L        P  E     W  RRP        +    LA+
Sbjct: 346 TLTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEHC-YNWTARRP-----GAPEIIPKLAV 399

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F  G  R   E   ++Y+I    G  C+G+  G   G   L+VIG I
Sbjct: 400 QFA-GSAR--LEPPAKSYVIDVKPGVKCIGVQEGEWPG---LSVIGNI 441


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 97/345 (28%), Positives = 146/345 (42%), Gaps = 55/345 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + + IG P   Y   LDTGSDL W QC  PC +C + P P++ P    S   V C 
Sbjct: 106 GEYLMELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTQCYKQPTPIFDPKKSSSFSKVSCG 164

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C+++ +     C D   C+Y   Y D   + GVL  + F F  +  +     +  GC
Sbjct: 165 SSLCSAVPS---STCSD--GCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGC 219

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD- 220
           G +   G  +    G++GLG+G  S+VSQL   +       +CL+         L  G  
Sbjct: 220 GEDN-EGDGFEQASGLVGLGRGPLSLVSQLKEPRF-----SYCLTPMDDTKESILLLGSL 273

Query: 221 -DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLK-----NLPVVFDSG 267
             + D+  VV T +  +       Y       V +     E +  +     N  V+ DSG
Sbjct: 274 GKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSG 333

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVHDVKKCF 323
           ++ TY+ +  ++ L    KKE  +++  + P D+T    L LC+        V   K  F
Sbjct: 334 TTITYIEQKAFEAL----KKEFISQT--KLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVF 387

Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGL 367
                 F  G      EL  E Y+I  SN G  CL +  GA  G+
Sbjct: 388 H-----FKGGD----LELPAENYMIGDSNLGVACLAM--GASSGM 421


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 97/365 (26%), Positives = 142/365 (38%), Gaps = 55/365 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP----------------------CVRCVE 88
           Y   + +G P   +    DTGSDL WL+C+                            V 
Sbjct: 82  YLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAVV 141

Query: 89  APHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
             +P    S   V C+ P C +L      N  D   CD+   Y DG S+ G+L  D F F
Sbjct: 142 YFNPFDSSSYSRVGCDGPSCLALATNASCN-GDSHACDFRYSYRDGASATGLLAADTFTF 200

Query: 149 --NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH 206
             N  N       +  GC      G  +   DG++GLG G  S+ SQL  +         
Sbjct: 201 GGNINNDTTSTASIDFGCATGTA-GREFQA-DGMVGLGAGPLSLASQLGRK------FSF 252

Query: 207 CLSG----GGGGFLFFGDDLYDSSRVVWTS----MSSDYTKYYSPGVAELFFGGE----T 254
           CL+          L FG     S     T+     SS+   YY+  +  L   G+    T
Sbjct: 253 CLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGT 312

Query: 255 TGLKNLPVVFDSGSSYTYLNRVTYQT-LTSIMKKELSAKSLKEA-PEDETLPLCWKGRRP 312
           T +    V+ D+G+  T+L+R      LT  + + +    L  A P DETL LC+   R 
Sbjct: 313 TSVSK--VIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDVSR- 369

Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNV 372
              V DV      + L    G    +  LT E   ++  +G +CL ++  +   LQ L+V
Sbjct: 370 ---VKDVDGVIPDVTLVLGGGGGGEV-RLTGEGTFVLVKEGVLCLAVVTTSP-ELQPLSV 424

Query: 373 IGGIG 377
           +G + 
Sbjct: 425 LGNVA 429


>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 104/228 (45%), Gaps = 26/228 (11%)

Query: 29  LFNHVGSSLLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
           L +HV  +  F V     P    Y  T+ IG P R + + +DTGSD+ W+ C + CV C 
Sbjct: 59  LQSHVHGAFSFPVERGTNPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCIS-CVGCP 117

Query: 88  EAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
                 + P    S   + C D  C S      H     +  +Y++EY+DG  + G  + 
Sbjct: 118 LQNVTFFDPGASSSAVKLACSDKRCFS----DLHKKSGCSPLEYKVEYSDGSFTSGYYIS 173

Query: 144 DAFAFNYTNGQRLNPR----LALGC-----GYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
           D  +F       L  +       GC     G   +P  S H   GI+GLGKG+  +VSQL
Sbjct: 174 DLISFETVMSSNLTVKSSAPFVFGCSNLHAGLISLPETSIH---GIVGLGKGRLLVVSQL 230

Query: 195 HSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY 240
            SQ+L   V   CLSGG  GGG +  G++   ++  V+T +    T Y
Sbjct: 231 SSQRLAPEVFSLCLSGGQEGGGVIILGENRLPNT--VYTPLVRSQTHY 276


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 78/280 (27%), Positives = 122/280 (43%), Gaps = 39/280 (13%)

Query: 44  NVYPTGYYNVTMY----IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR---- 95
            + P  Y+    Y    IG P+  + + LD+GSDL W+ C+  CV+C       Y     
Sbjct: 86  TISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLLWIPCN--CVQCAPLSSAYYSSLAT 143

Query: 96  -------PS----NDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYA-DGGSSLGVLV 142
                  PS    + + PC   +C S  A     CE P  QC Y + YA +  SS G+LV
Sbjct: 144 KDLNEFDPSASTTSKVFPCSHKLCESAPA-----CESPKEQCPYTVTYASENTSSSGLLV 198

Query: 143 KDAF--AFNYTNGQRLNPRLALGCGYNQVPGASYHPL--DGILGLGKGKSSIVSQLHSQK 198
           +D    A++      +  R+ +GCG  Q  G     +  DG++GLG G+ S+ S L    
Sbjct: 199 EDVLHLAYSANASSSVKARVVVGCGEKQ-SGEFLKGIAPDGVMGLGPGEISVPSFLAKAG 257

Query: 199 LIRNVVGHCLSGGGGGFLFFGD---DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
           L+RN    C      G ++FGD       S+R +     +++  Y+  GV     G    
Sbjct: 258 LMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFL--PYKNEFVAYFV-GVEVCCVGNSCL 314

Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
              +   + DSG S+T+L    Y+ +   +   ++A   K
Sbjct: 315 KQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKK 354


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 99/347 (28%), Positives = 149/347 (42%), Gaps = 54/347 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V +Y+G P R + + +DTGSDL WLQC APC+ C E   P++ P+  L    V C
Sbjct: 149 SGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASLSYRNVTC 207

Query: 104 EDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP 158
            DP C  +  P     C  P    C Y   Y D  ++ G L  +AF  N T     R   
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVD 267

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLSGGG-- 212
            +  GCG++      +H   G+LGLG+G  S  SQL      R V GH    CL   G  
Sbjct: 268 DVVFGCGHSN--RGLFHGAAGLLGLGRGALSFASQL------RAVYGHAFSYCLVDHGSS 319

Query: 213 -GGFLFFGDD--LYDSSRVVWTSMSSDYT----KYYSPGVAELFFGGETTGLKNLP---- 261
            G  + FGDD  L    R+ +T+ +         +Y   +  +  GGE   +        
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379

Query: 262 ------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                  + DSG++ +Y     Y+    ++++    +  K  P     P+      P  N
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYE----VIRRAFVERMDKAYPLVADFPVL----SPCYN 431

Query: 316 VHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
           V  V++      +L F DG    +++   E Y + +   G +CL +L
Sbjct: 432 VSGVERVEVPEFSLLFADG---AVWDFPAENYFVRLDPDGIMCLAVL 475


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/285 (28%), Positives = 118/285 (41%), Gaps = 39/285 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
           +G Y V + IG P    +L +D+GSD+ W+QC  PC+ C     PL+ P++      V C
Sbjct: 122 SGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPASSATFSAVSC 180

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC +L   G   C D   C+YE+ Y DG  + G L  +      T  +     +A+G
Sbjct: 181 GSAICRTLRTSG---CGDSGGCEYEVSYGDGSYTKGTLALETLTLGGTAVE----GVAIG 233

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---------G 214
           CG+       +    G+LGLG G  S+V QL           +CL+  GG         G
Sbjct: 234 CGHRNR--GLFVGAAGLLGLGWGPMSLVGQLGGAA--GGAFSYCLASRGGSGSGAADAAG 289

Query: 215 FLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGE----TTGLKNLP------V 262
            L  G         VW  +  +     +Y  GV+ +  G E      GL  L       V
Sbjct: 290 SLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGV 349

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           V D+G++ T L +  Y  L       + A  L  AP    L  C+
Sbjct: 350 VMDTGTAVTRLPQEAYAALRDAFVGAVGA--LPRAPGVSLLDTCY 392


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 88/292 (30%), Positives = 124/292 (42%), Gaps = 43/292 (14%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPT--GY------YNVTMYIGQPARPYFLDLDTGSDLTWLQ 78
           SS +N+V   L  Q      PT  GY      Y +T+ IG PA    + +DTGSD++W+Q
Sbjct: 99  SSRYNNVAKEL--QQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQ 156

Query: 79  CDAPCV--RCVEAPHPLYRPSNDLV----PCEDPICASLHAPGHHNCEDPAQCDYELEYA 132
           C APC    C      L+ P+         C    CA L   G+   +  +QC Y ++Y 
Sbjct: 157 C-APCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNGCLK--SQCQYIVKYG 213

Query: 133 DGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVS 192
           DG ++ G    D  +   ++  +       GC +          LDG++GLG    S+VS
Sbjct: 214 DGSNTAGTYGSDTLSLTSSDAVK---SFQFGCSHRAA--GFVGELDGLMGLGGDTESLVS 268

Query: 193 QLHSQKLIRNVVGHCL---SGGGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           Q  +         +CL   S  GGGFL  G      SSR   T M     ++  P    +
Sbjct: 269 Q--TAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPM----VRFSVPTFYGV 322

Query: 249 FFGGETTG--LKNLPV-------VFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
           F  G T    + N+P        V DSG+  T L    YQ L +  KKE+ A
Sbjct: 323 FLQGITVAGTMLNVPASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKA 374


>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 117/278 (42%), Gaps = 30/278 (10%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWL--QCDAPCVRCVEAPH------------PLYR 95
           Y NVT  IG PA+ + + LDTGSDL WL   C++ CVR +E                +Y 
Sbjct: 112 YANVT--IGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYN 169

Query: 96  PS----NDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGS-SLGVLVKDAFAFN 149
           PS    +  V C   +CA       + C  P + C Y + Y   GS S GVLV+D    +
Sbjct: 170 PSISTSSSKVTCNSTLCAL-----RNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIHMS 224

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
              G+  + R+  GC   Q+       ++GI+GL     ++ + L    +  +    C  
Sbjct: 225 TEEGEARDARITFGCSETQLGLFQEVAVNGIMGLAMADIAVPNMLVKAGVASDSFSMCFG 284

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
             G G + FGD    SS    T +    +  +       F  G+ T       +FDSG++
Sbjct: 285 PNGKGTISFGDK--GSSDQHETPLGGTISPLFYDVSITKFKVGKVTVETKFSAIFDSGTA 342

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            T+L    Y  LT+     +  + L  A  D T   C+
Sbjct: 343 VTWLLDPYYTALTTNFHLSVPDRRLP-ANVDSTFEFCY 379


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 161/367 (43%), Gaps = 40/367 (10%)

Query: 14  TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           TV++   SS  SS+++ + V + +      N Y  G Y + +YIG P       +DTGSD
Sbjct: 34  TVKLIRKSSHLSSNNIQDIVQAPI------NAY-IGQYLMELYIGTPPIKISGTVDTGSD 86

Query: 74  LTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYEL 129
           L W+QC  PC+ C    +P++ P    +   + C+ P+C   + P    C    +CDY  
Sbjct: 87  LIWVQC-VPCLGCYNQINPMFDPLKSSTYTNISCDSPLC---YKPYIGECSPEKRCDYTY 142

Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKS 188
            YAD   + GVL ++        G+ ++ + +  GCG+N     + H + G++GLG G +
Sbjct: 143 GYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEM-GLIGLGGGPT 201

Query: 189 SIVSQL--------HSQKLIRNVVGHCLSGG---GGGFLFFGDDLYDSSRVVWTS-MSSD 236
           S+VSQ+         SQ L+  +    +S     G G    G+ +  +  V     M+S 
Sbjct: 202 SLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSY 261

Query: 237 YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           Y       V + +    +T ++   ++ DSG+    L +  Y  +   +K ++  + + +
Sbjct: 262 YVTLLGISVEDTYLPMNST-IEKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITD 320

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
            P      LC++ +   K    +   F    L  T    +T    TPE       KG  C
Sbjct: 321 DPSLGPQ-LCYRTQTNLKG-PTLTYHFEGANLLLT--PIQTFIPPTPET------KGVFC 370

Query: 357 LGILNGA 363
           L I N A
Sbjct: 371 LAITNCA 377


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 76/251 (30%), Positives = 109/251 (43%), Gaps = 22/251 (8%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLH 112
           IG P   Y    DTGSDLTW QC  PC++C +   P++ P    S   VPC    C   H
Sbjct: 86  IGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---H 141

Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGA 172
           A    +C     CDY   Y D   S G L      F        + +  +GCG+    G 
Sbjct: 142 AVDDGHCGVQGVCDYSYTYGDRTYSKGDL-----GFEKITIGSSSVKSVIGCGHASSGGF 196

Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC----LSGGGGGFLFFGDDLYDSSRV 228
            +    G++GLG G+ S+VSQ+     I     +C    LS   G   F  + +     V
Sbjct: 197 GFA--SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGV 254

Query: 229 VWTSMSSDYT-KYYSPGVAELFFGGE--TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIM 285
           V T + S  T  YY   +  +  G E      K   V+ DSG++ ++L +  Y  + S +
Sbjct: 255 VSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSL 314

Query: 286 KKELSAKSLKE 296
            K + AK +K+
Sbjct: 315 LKVVKAKRVKD 325


>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 529

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 76/254 (29%), Positives = 108/254 (42%), Gaps = 23/254 (9%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPICASLH 112
           IG P+  + + LD GSDL W+ CD  C+ C       Y    R  N+  P      +S H
Sbjct: 106 IGTPSTSFLVALDAGSDLLWVPCD--CIHCAPLSASFYSNLDRDLNEYSPSRS--LSSKH 161

Query: 113 APGHH-------NCE--DPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRL-- 160
               H       NC+     QC Y + Y +D  SS G+LV+D F     +G   N  +  
Sbjct: 162 LSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTSNSSVQA 221

Query: 161 --ALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
              +GCG  Q  G       DG++GLG G+SS+ S L    LIR+    C +    G LF
Sbjct: 222 PVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNEDDSGRLF 281

Query: 218 FGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVT 277
           FGD      +     +       Y  GV     G     + +    FDSG+S+T+L    
Sbjct: 282 FGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVTSFNAQFDSGTSFTFLPGHA 341

Query: 278 YQTLTSIMKKELSA 291
           Y  +     K+++A
Sbjct: 342 YGAIAEEFDKQVNA 355


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 151/368 (41%), Gaps = 59/368 (16%)

Query: 33  VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
           V  +L   VH      G + + M IG PA  Y   +DTGSDL W QC  PCV C     P
Sbjct: 87  VAPALQVPVHAG---NGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCK-PCVECFNQSTP 142

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
           ++ PS+      +PC   +C+ L +     C   A+C Y   Y D  S+ GVL  + F  
Sbjct: 143 VFDPSSSSTYAALPCSSTLCSDLPS---SKCTS-AKCGYTYTYGDSSSTQGVLAAETFTL 198

Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
             T      P +A GCG +   G  +    G++GLG+G  S+VSQL   K       +CL
Sbjct: 199 AKTK----LPDVAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKF-----SYCL 248

Query: 209 SGGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGL 257
           +            G      +    +S V  T +  + ++  +Y   +  L  G     L
Sbjct: 249 TSLDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITL 308

Query: 258 KNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            +            V+ DSG+S TYL    Y+ L    KK  +A+    A +   + L  
Sbjct: 309 PSSAFAVQDDGTGGVIVDSGTSITYLELQGYRAL----KKAFAAQMKLPAADGSGIGLDT 364

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGILNGAEVG 366
               P   V  V+       L   D       +L  E Y+++ S  G +CL ++     G
Sbjct: 365 CFEAPASGVDQVEVPKLVFHLDGAD------LDLPAENYMVLDSGSGALCLTVM-----G 413

Query: 367 LQDLNVIG 374
            + L++IG
Sbjct: 414 SRGLSIIG 421


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 93/320 (29%), Positives = 133/320 (41%), Gaps = 38/320 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+ IG PA    + +DTGSDL+W+QC  PC    C     PL+ PS+      VPC+
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 176

Query: 105 DPICASLHAPGH-HNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
              C  L A  + H C   A   C+Y +EY +  ++ GV   +           +     
Sbjct: 177 SDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVADFG 233

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
            GCG +Q     Y   DG+LGLG    S+VSQ  SQ        +CL  + GG GFL  G
Sbjct: 234 FGCGDHQ--HGPYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFLALG 289

Query: 220 -----DDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGS 268
                     ++  ++T M        +Y   +  +  GG    +     +  +V DSG+
Sbjct: 290 APNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSSGMVIDSGT 349

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             T L    Y  L S  +  +S   L        L  C+     F    +V     T+AL
Sbjct: 350 VITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYD----FTGHTNVT--VPTIAL 403

Query: 329 SFTDGKTRTLFELTPEAYLI 348
           +F+ G T  L   TP   L+
Sbjct: 404 TFSGGATIDL--ATPAGVLV 421


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 82/251 (32%), Positives = 110/251 (43%), Gaps = 33/251 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V +  G PAR Y + +DTGS L+WLQC    V C     PL+ PS       + C
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 174

Query: 104 EDPICASLHAPGHHN--CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
               C+SL     +N  CE  +  C Y   Y D   S+G L +D         Q L P  
Sbjct: 175 TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL--APSQTL-PGF 231

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFG 219
             GCG  Q     +    GILGLG+ K S++ Q+ S+        +CL + GGGGFL  G
Sbjct: 232 VYGCG--QDSDGLFGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLSIG 287

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF--------GGETTGLK----NLPVVFDSG 267
                 S   +T M++D      PG   L+F        GG   G+      +P + DSG
Sbjct: 288 KASLAGSAYKFTPMTTD------PGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSG 341

Query: 268 SSYTYLNRVTY 278
           +  T L    Y
Sbjct: 342 TVITRLPMSVY 352


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 15/155 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G + + + IG PA  Y   +DTGSDL W QC  PC  C + P P++ P    S   +PC 
Sbjct: 95  GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCK-PCKDCFDQPTPIFDPKKSSSFSKLPCS 153

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +CA+L      +C D   C+Y   Y D  S+ GVL  + FAF    G     ++  GC
Sbjct: 154 SDLCAALPI---SSCSD--GCEYLYSYGDYSSTQGVLATETFAF----GDASVSKIGFGC 204

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           G +   G+ +    G++GLG+G  S++SQL   K 
Sbjct: 205 GEDN-DGSGFSQGAGLVGLGRGPLSLISQLGEPKF 238


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 84/290 (28%), Positives = 121/290 (41%), Gaps = 35/290 (12%)

Query: 29  LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
           LF  +GS   F  +GN     +Y   + IG P   + + LD GSDL+W+ CD  C++C  
Sbjct: 83  LFPSLGSHTFF--YGNDLDWLHY-TWIDIGTPNVSFLVALDAGSDLSWVPCD--CIQCAP 137

Query: 89  APHPLYRP-SNDL-------------VPCEDPICASLHAPGHH--NCEDPAQCDYELEYA 132
               LY+P   DL             + C   +C      G H  N +DP  C Y  +YA
Sbjct: 138 LSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCEL----GSHCKNLKDP--CPYIADYA 191

Query: 133 D-GGSSLGVLVKDAFAF------NYTNGQRLNPRLALGCGYNQVPG-ASYHPLDGILGLG 184
           D   SS G LV+D          + +  +R+   + LGCG  Q  G       DG++GLG
Sbjct: 192 DPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLG 251

Query: 185 KGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
            G  S+ S L    LIR     C    G G + FGD  + S +      +      Y   
Sbjct: 252 PGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIE 311

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
           V     G           + DSG+S+TYL    Y  +     K+++A+ +
Sbjct: 312 VESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRI 361


>gi|218185382|gb|EEC67809.1| hypothetical protein OsI_35378 [Oryza sativa Indica Group]
          Length = 344

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 79/140 (56%), Gaps = 29/140 (20%)

Query: 237 YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           +  YYSPG A L+F   + G+  + V+                      K  LS+ SL++
Sbjct: 63  FGNYYSPGSATLYFDRHSLGMNPMDVI----------------------KGGLSSTSLEQ 100

Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
              D +LPLCWKG++ F++V DVKK F++L L+F +     + E+ PE +LI++  GNVC
Sbjct: 101 V-SDPSLPLCWKGQKAFESVSDVKKEFKSLQLNFGNN---AVMEIPPENFLIVTEYGNVC 156

Query: 357 LGILNGAEVGLQDLNVIGGI 376
           LGIL+G+ +   + N+IG I
Sbjct: 157 LGILHGSRL---NFNIIGDI 173



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 3/52 (5%)

Query: 100 LVPCEDPICASLHAPGHH---NCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
           +V  +DP+  +LH  G     N   P QCDYE++YADG S++G L+ D F+ 
Sbjct: 1   MVRADDPLFVALHEDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSL 52


>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
 gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
          Length = 575

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 98/342 (28%), Positives = 149/342 (43%), Gaps = 44/342 (12%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS----NDLVPCEDPICASLH 112
           +G P+  + + LDTGSDL WL C+  C  C +    +Y PS    +  VPC  P+C    
Sbjct: 127 VGTPSSKFLVALDTGSDLFWLPCE--CKLCAKNGSTMYSPSLSSTSKTVPCGHPLCERPD 184

Query: 113 APGHHNCEDPAQCDYELEY--ADGGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGY 166
           A      +  + C YE++Y  A+ GSS GVLV+D            G+ +   +  GCG 
Sbjct: 185 ACATAG-KSSSSCPYEVKYVSANTGSS-GVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQ 242

Query: 167 NQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGD- 220
            Q    + GA+     G++GLG  K S+ S L S  L+  +    C S  G G + FGD 
Sbjct: 243 VQTGAFLRGAA---AGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDA 299

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
              D +     +  S    YY+  V  +    +   ++   VV DSG+S+TYL+   Y  
Sbjct: 300 GSPDQAETPLIAAGSLQPSYYNISVGAITVDSKAMAVEFTAVV-DSGTSFTYLDDPAYTF 358

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWK---GRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
           LT+     +S  S       E    C++   G+   K         R  A+S T  K   
Sbjct: 359 LTTNFNSRVSEASETYGSGYEKFEFCYRLSPGQTSMK---------RLPAMSLTT-KGGA 408

Query: 338 LFELT-PEAYLIISNKG------NVCLGILNGAEVGLQDLNV 372
           +F +T P   ++ S  G        CLGI+  + +  +D  +
Sbjct: 409 VFPITWPIIPVLASTNGGPYHPIGYCLGIIKTSILSTEDATI 450


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 148/358 (41%), Gaps = 60/358 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y + ++IG P + Y L LDTGSDL W+QC  PC+ C E   P Y P    S + + C
Sbjct: 189 SGEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKESSSFENITC 247

Query: 104 EDPICASLHAPGHHN-CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG---QRL 156
            DP C  + +P     C+D  Q C Y   Y D  ++ G    + F  N T  NG   Q+ 
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307

Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS------ 209
              +  GCG +N+     +H   G+LGLG+G  S  SQL S      + GH  S      
Sbjct: 308 VENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFASQLQS------IYGHSFSYCLVDR 358

Query: 210 ---GGGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNL 260
                    L FG+D  L     + +TS      +    +Y  G+  +   GE   +   
Sbjct: 359 NSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEE 418

Query: 261 P----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                       + DSG++ TY     Y+ +     K++    L E       PL     
Sbjct: 419 TWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEG----FPPL----- 469

Query: 311 RPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
           +P  NV  ++K       + F+DG    +++   E Y I      VCL IL   +  L
Sbjct: 470 KPCYNVSGIEKMELPDFGILFSDG---AMWDFPVENYFIQIEPDLVCLAILGTPKSAL 524


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 114/273 (41%), Gaps = 26/273 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VP 102
           TG Y V + +G PA  + +  DTGSD TW+QC  PCV  C +   PL+ P+       + 
Sbjct: 162 TGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQ-PCVAYCYQQKEPLFTPTKSATYANIS 220

Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
           C    C+ L   G         C Y ++Y DG  ++G   +D     Y   +        
Sbjct: 221 CTSSYCSDLDTRGCSG----GHCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFR----F 272

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGD 220
           GCG        +    G++GLG+GK+S+  Q + +     V  +C+  +  G GFL FG 
Sbjct: 273 GCGEKNR--GLFGKAAGLMGLGRGKTSVPVQAYDK--YSGVFAYCIPATSSGTGFLDFGP 328

Query: 221 DLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGSSYTYLN 274
               ++    T M  D    +Y  G+  +  GG       T   +   + DSG+  T L 
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
              Y+ L S   K +     K AP    L  C+
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCY 421


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 90/333 (27%), Positives = 136/333 (40%), Gaps = 36/333 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y V++ +G P R   +  DTGSDL+W+QC  PC  C +   PL+ PS       VPC
Sbjct: 135 TANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCK-PCDGCYQQHDPLFDPSQSTTYSAVPC 193

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL--- 160
               C  L +    +C    +C YE+ Y D   + G L +D      ++    + +L   
Sbjct: 194 GAQECRRLDS---GSCSS-GKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEF 249

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFF 218
             GCG +      +   DG+ GLG+ + S+ SQ  ++        +CL  S    G+L  
Sbjct: 250 VFGCGDDDT--GLFGKADGLFGLGRDRVSLASQAAAK--YGAGFSYCLPSSSTAEGYLSL 305

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYT 271
           G     ++R       SD   +Y   +  +   G T  ++  P VF       DSG+  T
Sbjct: 306 GSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRT--VRVSPAVFRTPGTVIDSGTVIT 363

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y  L S     +   S K AP    L  C+     F   + V+    ++AL F 
Sbjct: 364 RLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYD----FTGRNKVQ--IPSVALLFD 417

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
            G T     L     L ++NK   CL   +  +
Sbjct: 418 GGAT---LNLGFGEVLYVANKSQACLAFASNGD 447


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 150/361 (41%), Gaps = 51/361 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC-----DAPCVRCVEAPHPLYRPSNDL-- 100
           TG Y V   +G PA+P+ L  DTGSDLTW++C      +P    + +P  ++RP+N    
Sbjct: 107 TGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPR-VFRPANSKSW 165

Query: 101 --VPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NG 153
             +PC    C S       NC      PA C Y+  Y D  S+ GV+  DA     + +G
Sbjct: 166 APIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSG 225

Query: 154 QRLNPRL---ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHC 207
                +L    LGC      G S+   DG+L LG    S  S+  ++   +    +V H 
Sbjct: 226 SDRKAKLQEVVLGC-TTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHL 284

Query: 208 LSGGGGGFLFFGD--DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-------- 257
                  +L FG     +  SR     + +    +Y+  V  +   G+   +        
Sbjct: 285 APRNATSYLTFGPVGAAHSPSRTPLL-LDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVK 343

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
           KN   + DSG+S T L    Y+ + + + K+L+   +     D           PF+  +
Sbjct: 344 KNGGAILDSGTSLTILATPAYKAVVAALSKQLA--RVPRVTMD-----------PFEYCY 390

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTP--EAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           +     R  A+   + +      L P  ++Y+I +  G  C+G+  G   G   ++VIG 
Sbjct: 391 NWTATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPG---VSVIGN 447

Query: 376 I 376
           I
Sbjct: 448 I 448


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 100/353 (28%), Positives = 143/353 (40%), Gaps = 51/353 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+ IG PA    + +DTGSDL+W+QC  PC    C     PL+ PS+      VPC+
Sbjct: 91  YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 149

Query: 105 DPICASLHAPGH-HNCED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
              C  L A  + H C        A C+Y +EY +  ++ GV   +           +  
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 206

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GCG +Q     Y   DG+LGLG    S+VSQ  SQ        +CL  + GG GFL
Sbjct: 207 DFGFGCGDHQH--GPYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFL 262

Query: 217 FFGDDLYDSSRVVWTSMS-------SDYTKYYSPGVAELFFGGETTGLK----NLPVVFD 265
             G     SS    + +S            +Y   +  +  GG    +     +  +V D
Sbjct: 263 TLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVID 322

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG+  T L    Y  L S  +  +S   L        L  C+     F    +V     T
Sbjct: 323 SGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD----FTGHANVT--VPT 376

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
           ++L+F+ G T    +L   A +++      CL     A  G    N IG IG+
Sbjct: 377 ISLTFSGGAT---IDLAAPAGVLVDG----CL-----AFAGAGTDNAIGIIGN 417


>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 523

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 128/296 (43%), Gaps = 37/296 (12%)

Query: 29  LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
           LF   GS ++F   GN +   +Y   + +G P+ P+ + LD GSDL W+ CD  C++C  
Sbjct: 84  LFPSEGSQVIF--FGNEFNWLHY-TWIDLGTPSVPFLVALDVGSDLLWVPCD--CIQCAP 138

Query: 89  APHPLYRPSNDLVPCEDPICASLHAP---GHHNC---------EDPAQCDYELEY-ADGG 135
                Y   +  +   +P  +S       GH  C          DP  C Y+ +Y +D  
Sbjct: 139 LSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDP--CTYKRDYYSDNT 196

Query: 136 SSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGK 187
           S+ G +++D         +     L   +  GCG  Q    + GA+    DG++GLG G 
Sbjct: 197 STSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAA---PDGVMGLGPGN 253

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVA 246
            S+ + L  + L+RN    C    G G + FGDD   + +   +  +  ++  Y+  GV 
Sbjct: 254 ISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFI-GVE 312

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS----LKEAP 298
               G           + DSGSS+TYL    Y+ +     K++   +    L+E P
Sbjct: 313 SFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELP 368


>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like, partial [Cucumis sativus]
          Length = 408

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/269 (29%), Positives = 122/269 (45%), Gaps = 37/269 (13%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPSNDL----VP 102
           IG P+  + + LD GSDL W+ C+  C++C          ++     YRPS+      + 
Sbjct: 109 IGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166

Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF-----NYTNGQR 155
           C   +C S       +C+ P Q C Y ++Y  +  SS G+L++D         N +N   
Sbjct: 167 CSHNLCDS-----GQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221

Query: 156 LNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
             P + LGCG  Q  G  +   P DG+ GLG G+ S++S L  ++L++N    C +  G 
Sbjct: 222 QAP-VILGCGMKQSGGYLSGVAP-DGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGS 279

Query: 214 GFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
           G +FFGD+   S +   +  +   Y  Y   GV             +   + DSG+S+TY
Sbjct: 280 GRIFFGDEGPASQQTTSFVPLDGKYETYIV-GVEACCIENSCLKQTSFKALIDSGTSFTY 338

Query: 273 LNRVTYQTLTSIMKKEL---SAKSLKEAP 298
           L    Y+ +     K L   SA S K  P
Sbjct: 339 LPEEAYENIVIEFDKRLNTTSAVSFKGYP 367


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 125/299 (41%), Gaps = 47/299 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---AP---CVRCVEAPHPLYRPSN---- 98
           G Y V+M  G P +   L  DTGSDL WLQC    AP   C +   +  P +  S     
Sbjct: 51  GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATL 110

Query: 99  DLVPCEDPICASLHAPGHH----NCEDPAQCDYELEYADGGSSLGVLVKD-AFAFNYTNG 153
            +VPC    C  + AP  H    +   P  C Y  +YADG S+ G L +D A   N T+G
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 170

Query: 154 QRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LS 209
                 +A GCG  NQ  G S+    G++GLG+G+ S  +Q  S  L      +C   L 
Sbjct: 171 GAAVRGVAFGCGTRNQ--GGSFSGTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDLE 226

Query: 210 GGGGG----FLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTG------- 256
           GG  G    FLF G     ++   +T + S+     +Y  GV  +  G            
Sbjct: 227 GGRRGRSSSFLFLGRPERRAA-FAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 285

Query: 257 ---LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWK 308
              L N   V DSGS+ TYL    Y  L S     +    L   P   T    L LC+ 
Sbjct: 286 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV---HLPRIPSSATFFQGLELCYN 341


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 147/370 (39%), Gaps = 66/370 (17%)

Query: 42  HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN--- 98
           + N  P   Y V + IG P +P  L LDTGSDL W QC  PC  C         PSN   
Sbjct: 406 YANGVPDTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCR-PCPVCFSRALGPLDPSNSST 464

Query: 99  -DLVPCEDPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--G 153
            D++PC  P+C +L   + G HN  +   C Y   YADG  + G L  + F F   +  G
Sbjct: 465 FDVLPCSSPVCDNLTWSSCGKHNWGN-QTCVYVYAYADGSITTGHLDAETFTFAAADGTG 523

Query: 154 QRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           Q   P LA GCG +N   G       GI G G+G  S+ SQL           HC +   
Sbjct: 524 QATVPDLAFGCGLFNN--GIFTSNETGIAGFGRGALSLPSQLKVDNF-----SHCFTAIT 576

Query: 213 GG-----FLFFGDDLYDSS--RVVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLPV 262
           G       L    +LY  +   V  T +  +++    YY      L   G T G   LP+
Sbjct: 577 GSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYY------LSLKGITVGSTRLPI 630

Query: 263 ---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                          + DSG+  T L +  Y+ +      ++    +  A       LC+
Sbjct: 631 PESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLP-VDNATSSSLSRLCF 689

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGN--VCLGILNGAE 364
               P +   DV K    L L F +G T    +L  E Y+    + G    CL I  G  
Sbjct: 690 SFSVPRRAKPDVPK----LVLHF-EGAT---LDLPRENYMFEFEDAGGSVTCLAINAG-- 739

Query: 365 VGLQDLNVIG 374
               DL +IG
Sbjct: 740 ---DDLTIIG 746


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 121/283 (42%), Gaps = 47/283 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G + + + IG PA  Y   +DTGSDL W QC  PC  C + P P++ P    S   +PC 
Sbjct: 95  GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCS 153

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L      +C D   C+Y   Y D  S+ GVL  + F F    G     ++  GC
Sbjct: 154 SDLCVALPI---SSCSD--GCEYRYSYGDHSSTQGVLATETFTF----GDASVSKIGFGC 204

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFGD 220
           G +   G +Y    G++GLG+G  S++SQL   K       +CL+      G   L  G 
Sbjct: 205 GEDN-RGRAYSQGAGLVGLGRGPLSLISQLGVPKF-----SYCLTSIDDSKGISTLLVGS 258

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------------VFD 265
           +    S  + T +  + ++   P    L   G + G   LP+               + D
Sbjct: 259 EATVKS-AIPTPLIQNPSR---PSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIID 314

Query: 266 SGSSYTYLNRVTYQTL----TSIMKKELSAKSLKEAPEDETLP 304
           SG++ TYL    +  L     S MK ++ A    E     TLP
Sbjct: 315 SGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLP 357


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/277 (28%), Positives = 115/277 (41%), Gaps = 35/277 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P    +L +D+GSD+ W+QC  PC +C     PL+ P+       V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC +L   G     D  +CDY + Y DG  + G L  +      T  Q     +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
           CG+       +    G+LGLG G  S+V QL        V  +CL+    GG G L  G 
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGGAA--GGVFSYCLASRGAGGAGSLVLG- 296

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGSSY 270
                 R          + +Y  G+  +  GGE   L++            VV D+G++ 
Sbjct: 297 ------RTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 350

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           T L R  Y  L       + A  L  +P    L  C+
Sbjct: 351 TRLPREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 385


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/351 (24%), Positives = 141/351 (40%), Gaps = 55/351 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + + IG PA+P+   +DTGSDL W QC  PC +C     P++ P    S   +PC 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L +P   N      C Y   Y DG  + G +  +   F    G    P +  GC
Sbjct: 152 SQLCQALSSPTCSN----NFCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGC 203

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
           G N   G       G++G+G+G  S+ SQL   K       +C++  G       + L  
Sbjct: 204 GENN-QGFGQGNGAGLVGMGRGPLSLPSQLDVTKF-----SYCMTPIGSSTP--SNLLLG 255

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP----------------VVFD 265
           S     T+ S + T   S  +   ++    G + G   LP                ++ D
Sbjct: 256 SLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIID 315

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++ TY     YQ++      +++   +  +       LC++      N+        T
Sbjct: 316 SGTTLTYFVNNAYQSVRQEFISQINLPVVNGS--SSGFDLCFQTPSDPSNLQ-----IPT 368

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             + F  G      EL  E Y I  + G +CL + + +    Q +++ G I
Sbjct: 369 FVMHFDGGD----LELPSENYFISPSNGLICLAMGSSS----QGMSIFGNI 411


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 146/352 (41%), Gaps = 44/352 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G + + + +G PA PY   +DTGSDL W QC  PCV C     P++ P+       +PC 
Sbjct: 114 GEFLMDLSVGTPALPYAAIVDTGSDLVWTQCK-PCVECFNQTTPVFDPAASSTYAALPCS 172

Query: 105 DPICASLHAPGHHNCEDPAQCD----YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
             +CA L      +    +       Y   Y D  S+ GVL  + F    T  ++  P +
Sbjct: 173 SALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETF----TLARQKVPGV 228

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
           A GCG +   G  +    G++GLG+G  S+VSQL   +    +     + G    L    
Sbjct: 229 AFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPLLLGSA 287

Query: 221 DLYDSSRVVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVF 264
               +S     + ++   K  S P    +   G T G   L                V+ 
Sbjct: 288 AGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIV 347

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV-HDVKKCF 323
           DSG+S TYL    Y+ L       +S  ++  +  +  L LC++G  P   V  DV+   
Sbjct: 348 DSGTSITYLELRAYRALRKAFVAHMSLPTVDAS--EIGLDLCFQG--PAGAVDQDVQVQV 403

Query: 324 RTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGILNGAEVGLQDLNVIG 374
             L L F  G      +L  E Y+++ S  G +CL ++       + L++IG
Sbjct: 404 PKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTVMAS-----RGLSIIG 447


>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 532

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/269 (29%), Positives = 122/269 (45%), Gaps = 37/269 (13%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPSNDL----VP 102
           IG P+  + + LD GSDL W+ C+  C++C          ++     YRPS+      + 
Sbjct: 109 IGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166

Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF-----NYTNGQR 155
           C   +C S       +C+ P Q C Y ++Y  +  SS G+L++D         N +N   
Sbjct: 167 CSHNLCDS-----GQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221

Query: 156 LNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
             P + LGCG  Q  G  +   P DG+ GLG G+ S++S L  ++L++N    C +  G 
Sbjct: 222 QAP-VILGCGMKQSGGYLSGVAP-DGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGS 279

Query: 214 GFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
           G +FFGD+   S +   +  +   Y  Y   GV             +   + DSG+S+TY
Sbjct: 280 GRIFFGDEGPASQQTTSFVPLDGKYETYIV-GVEACCIENSCLKQTSFKALIDSGTSFTY 338

Query: 273 LNRVTYQTLTSIMKKEL---SAKSLKEAP 298
           L    Y+ +     K L   SA S K  P
Sbjct: 339 LPEEAYENIVIEFDKRLNTTSAVSFKGYP 367


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 121/283 (42%), Gaps = 47/283 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G + + + IG PA  Y   +DTGSDL W QC  PC  C + P P++ P    S   +PC 
Sbjct: 95  GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCS 153

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L      +C D   C+Y   Y D  S+ GVL  + F F    G     ++  GC
Sbjct: 154 SDLCVALPI---SSCSD--GCEYRYSYGDHSSTQGVLATETFTF----GDASVSKIGFGC 204

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFGD 220
           G +   G +Y    G++GLG+G  S++SQL   K       +CL+      G   L  G 
Sbjct: 205 GEDNR-GRAYSQGAGLVGLGRGPLSLISQLGVPKF-----SYCLTSIDDSKGISTLLVGS 258

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------------VFD 265
           +    S  + T +  + ++   P    L   G + G   LP+               + D
Sbjct: 259 EATVKS-AIPTPLIQNPSR---PSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIID 314

Query: 266 SGSSYTYLNRVTYQTL----TSIMKKELSAKSLKEAPEDETLP 304
           SG++ TYL    +  L     S MK ++ A    E     TLP
Sbjct: 315 SGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLP 357


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 132/328 (40%), Gaps = 37/328 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRP----SNDLVPC 103
           G Y   M +G PA+PY + +DTGS LTWLQC +PC V C     P++ P    S   V C
Sbjct: 115 GNYVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSSYAAVSC 173

Query: 104 EDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
             P C  L     +   C     C Y+  Y D   S+G L KD  +F    G    P   
Sbjct: 174 SSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF----GANSVPNFY 229

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGD 220
            GCG +      +    G++GL + K S++ QL     +     +CL S    G+L  G 
Sbjct: 230 YGCGQDN--EGLFGRSAGLMGLARNKLSLLYQL--APTLGYSFSYCLPSTSSSGYLSIGS 285

Query: 221 DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
             Y+     +T M S+       +       VA       ++   +LP + DSG+  T L
Sbjct: 286 --YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRL 343

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFR---TLALS 329
               Y  L+  +   +   S K A     L  C++G+    + V  V   F    TL LS
Sbjct: 344 PTSVYTALSKAVAAAMKG-STKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLS 402

Query: 330 F------TDGKTRTLFELTPEAYLIISN 351
                   DG T  L      +  II N
Sbjct: 403 AGNLLVDVDGATTCLAFAPARSAAIIGN 430


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/162 (34%), Positives = 83/162 (51%), Gaps = 13/162 (8%)

Query: 44  NVYPTGY---YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           N+ P+ Y   + V   +GQPA P    +DTGS++ W++C APC RC +   PL  PS   
Sbjct: 89  NLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRC-APCKRCTQQNGPLLDPSKSS 147

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQR 155
               +PC + +C   +AP  + C    QC Y L YA G SS GVL  +   F+ ++ G  
Sbjct: 148 TYASLPCTNTMCH--YAPSAY-CNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVN 204

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
             P +  GC +            G+ GLGKG +S V+++ S+
Sbjct: 205 AVPSVVFGCSHENGDYKD-RRFTGVFGLGKGITSFVTRMGSK 245


>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 531

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 169/390 (43%), Gaps = 69/390 (17%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
           +S++  +   F+    ++  ++ G++Y   Y NV+  +G P   + + LDTGSDL WL C
Sbjct: 76  ASNNEDTPVTFDGGNLTVSIKLLGSLY---YANVS--VGTPPSSFLVALDTGSDLFWLPC 130

Query: 80  D--APCVRCVE-------APHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQ-C 125
           +    C+R +E        P  LY P    ++  + C D  C      G   C  P   C
Sbjct: 131 NCGTTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCF-----GSKKCSSPKSIC 185

Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP---RLALGCGYNQVP-GASYHPLDGIL 181
            Y++ Y++   + G L++D      T  + L P    + LGCG  Q       + ++G+L
Sbjct: 186 PYQISYSNSTGTTGTLLQDVLHL-ATEDENLTPVKTNVTLGCGQKQTGLFQRNNSVNGVL 244

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLY-DSSRVVWTSMSSDYT 238
           GLG    S+ S L    +  +    C     G  G + FGD  Y D     + S++   +
Sbjct: 245 GLGIKGYSVPSLLAKANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAP--S 302

Query: 239 KYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
             Y   V  +  GG+  G + L   FD+GSS+T+L    Y  LT         KS  +  
Sbjct: 303 TAYGLNVTGVSVGGDPVGTR-LFAKFDTGSSFTHLMEPAYGVLT---------KSFDDLV 352

Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN------- 351
           ED+  P+      PF+  +D+     ++   F +            + +I++N       
Sbjct: 353 EDKRRPV--DPELPFEFCYDLSPNATSIEFPFVE------MTFVGGSKIILNNPFFTART 404

Query: 352 -----KGNV--CLGILNGAEVGLQDLNVIG 374
                +GNV  CLG+L    VGL+ +NVIG
Sbjct: 405 QARHGEGNVMYCLGVLK--SVGLK-INVIG 431


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 81/318 (25%), Positives = 134/318 (42%), Gaps = 37/318 (11%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPH-----PLYRP----SND 99
           Y NV+  +G P+  + + LDTGS+L WL CD + CV  + +P       +Y P    +++
Sbjct: 63  YANVS--VGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVDLNIYSPNTSSTSE 120

Query: 100 LVPCEDPICASLHAPGHHNC-EDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR-- 155
            VPC   +C+         C  D + C Y++ Y ++G S+ G +V+D       + Q   
Sbjct: 121 KVPCNSTLCSQTQ---RDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISDDSQSKA 177

Query: 156 LNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           ++ ++  GCG  +V   S+      +G+ GLG    S+ S L            C S  G
Sbjct: 178 VDAKITFGCG--KVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNG 235

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
            G + FGD           +     +  Y+  + +   GG+ + L     +FDSG+S+TY
Sbjct: 236 IGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLV-YSAIFDSGTSFTY 294

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           LN   Y  +      E   K +KE     T       + PF   +D++       L F+ 
Sbjct: 295 LNDPAYTLI-----AESFNKLVKETRRSST-------QVPFDYCYDIRSFISAQILPFSC 342

Query: 333 GKTRTLFELTPEAYLIIS 350
                     P   L++S
Sbjct: 343 AYANQTEPTIPAVTLVMS 360


>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
 gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
 gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
 gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
          Length = 528

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/262 (28%), Positives = 112/262 (42%), Gaps = 31/262 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-----RPSNDLVPCEDP----- 106
           IG P+  + + LDTGS+L W+ C+  CV+C       Y     +  N+  P         
Sbjct: 106 IGTPSVSFLVALDTGSNLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163

Query: 107 ICASLHAPGHHNCEDPA-QCDYELEYADGG-SSLGVLVKDAFAFNYTNGQRL-------N 157
           +C+        +CE P  QC Y + Y  G  SS G+LV+D     Y    RL        
Sbjct: 164 LCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVK 223

Query: 158 PRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
            R+ +GCG  Q    + G +    DG++GLG  + S+ S L    L+RN    C      
Sbjct: 224 ARVVIGCGKKQSGDYLDGVA---PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDS 280

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
           G ++FG D+  S +     +  D  KY  Y  GV     G       +     DSG S+T
Sbjct: 281 GRIYFG-DMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 339

Query: 272 YLNRVTYQTLTSIMKKELSAKS 293
           YL    Y+ +   + + ++A S
Sbjct: 340 YLPEEIYRKVALEIDRHINATS 361


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/325 (27%), Positives = 132/325 (40%), Gaps = 38/325 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y   + +G PA  Y + +DTGS LTWLQC    V C     PLY P        VPC 
Sbjct: 132 GNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCS 191

Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L A       C     C Y+  Y D   S+G L +D  +F    G    P    
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSF----GSGSYPNFYY 247

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDD 221
           GCG +      +    G++GL + K S++ QL     +     +CL +    G+L  G  
Sbjct: 248 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTPASTGYLSIGP- 302

Query: 222 LYDSSRVVWTSMSS---DYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYL 273
            Y S    +T M+S   D + Y+   ++ +  GG    +      +LP + DSG+  T L
Sbjct: 303 -YTSGHYSYTPMASSSLDASLYFV-TLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRL 360

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L+  +   +    ++ AP    L  C++G+     V  V       A++F  G
Sbjct: 361 PTAVYTALSKAVAAAM--VGVQSAPAFSILDTCFQGQASQLRVPAV-------AMAFAGG 411

Query: 334 KTRTLFELTPEAYLIISNKGNVCLG 358
            T    +L  +  LI  +    CL 
Sbjct: 412 AT---LKLATQNVLIDVDDSTTCLA 433


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 90/347 (25%), Positives = 131/347 (37%), Gaps = 40/347 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSN----DLV 101
           TG Y V++ +G PAR   +  DTGSDL+W+QC  PC    C     PL+ PS+      V
Sbjct: 82  TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYHQQDPLFAPSSSSTFSAV 140

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-------NGQ 154
            C +P C         +  D  +C YE+ Y D   ++G L  D      T       N  
Sbjct: 141 RCGEPECPRARQSCSSSPGDD-RCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNS 199

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGG 211
              P    GCG N      +   DG+ GLG+GK S+ SQ   +        +CL   S  
Sbjct: 200 NKLPGFVFGCGENNT--GLFGKADGLFGLGRGKVSLSSQAAGK--YGEGFSYCLPSSSSN 255

Query: 212 GGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP------VV 263
             G+L  G      +   +T M   S+   +Y   +  +   G    + + P      ++
Sbjct: 256 AHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLI 315

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            DSG+  T L    Y  L +     +     K AP    L  C+     F    +     
Sbjct: 316 VDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYD----FTAHANATVSI 371

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL---NGAEVGL 367
             +AL F  G T     +     L ++     CL      NG   G+
Sbjct: 372 PAVALVFAGGAT---ISVDFSGVLYVAKVAQACLAFAPNGNGRSAGI 415


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/353 (28%), Positives = 143/353 (40%), Gaps = 51/353 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+ IG PA    + +DTGSDL+W+QC  PC    C     PL+ PS+      VPC+
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 229

Query: 105 DPICASLHAPGH-HNCED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
              C  L A  + H C        A C+Y +EY +  ++ GV   +           +  
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 286

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GCG +Q     Y   DG+LGLG    S+VSQ  SQ        +CL  + GG GFL
Sbjct: 287 DFGFGCGDHQH--GPYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFL 342

Query: 217 FFGDDLYDSSRVVWTSMS-------SDYTKYYSPGVAELFFGGETTGLK----NLPVVFD 265
             G     SS    + +S            +Y   +  +  GG    +     +  +V D
Sbjct: 343 TLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVID 402

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG+  T L    Y  L S  +  +S   L        L  C+     F    +V     T
Sbjct: 403 SGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD----FTGHANVT--VPT 456

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
           ++L+F+ G T    +L   A +++      CL     A  G    N IG IG+
Sbjct: 457 ISLTFSGGAT---IDLAAPAGVLVDG----CL-----AFAGAGTDNAIGIIGN 497


>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
          Length = 506

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 111/263 (42%), Gaps = 35/263 (13%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-----RPSNDLVPCEDP----- 106
           IG P+  + + LDTGSDL W+ C+  CV+C       Y     +  N+  P         
Sbjct: 106 IGTPSVSFLVALDTGSDLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163

Query: 107 ICASLHAPGHHNCEDPA-QCDYELEYADGG-SSLGVLVKDAFAFNYTNGQRL-------N 157
           +C+        +CE P  QC Y + Y  G  SS G+LV+D     Y    RL        
Sbjct: 164 LCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVK 223

Query: 158 PRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
            R+ +GCG  Q    + G +    DG++GLG  + S+ S L    L+RN    C      
Sbjct: 224 ARVVIGCGKKQSGDYLDGVA---PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDS 280

Query: 214 GFLFFGD---DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           G ++FGD    +  S+  +    +S Y      GV     G       +     DSG S+
Sbjct: 281 GRIYFGDMGPSIQQSTPFLQLENNSGYIV----GVEACCIGNSCLKQTSFTTFIDSGQSF 336

Query: 271 TYLNRVTYQTLTSIMKKELSAKS 293
           TYL    Y+ +   + + ++A S
Sbjct: 337 TYLPEEIYRKVALEIDRHINATS 359


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 145/344 (42%), Gaps = 56/344 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G Y + + IG P   +   +DTGSDL W QC+ PC +C   P P++ P +      +PCE
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPCE 152

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C  L +   +N E    C Y   Y DG ++ G +  + F F  ++     P +A GC
Sbjct: 153 SQYCQDLPSETCNNNE----CQYTYGYGDGSTTQGYMATETFTFETSS----VPNIAFGC 204

Query: 165 -----GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFL 216
                G+ Q  GA      G++G+G G  S+ SQL   +       +C++  G      L
Sbjct: 205 GEDNQGFGQGNGA------GLIGMGWGPLSLPSQLGVGQF-----SYCMTSYGSSSPSTL 253

Query: 217 FFG---DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------VV 263
             G     + + S       SS    YY   +  +  GG+  G+ +            ++
Sbjct: 254 ALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMI 313

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            DSG++ TYL +  Y  +      +++  ++ E+     L  C++       V       
Sbjct: 314 IDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDES--SSGLSTCFQQPSDGSTVQ-----V 366

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
             +++ F  G    +  L  +  LI   +G +CL + + +++G+
Sbjct: 367 PEISMQFDGG----VLNLGEQNILISPAEGVICLAMGSSSQLGI 406


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 121/277 (43%), Gaps = 27/277 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +T  +G P    +  +DTGSD+ WLQC  PC +C +   P++ PS       +PC 
Sbjct: 85  GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQC-KPCEQCYKQTTPIFNPSKSSSYKNIPCS 143

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C S+    + +C     C+Y + ++D   S G L  +    + T G  ++ P+  +G
Sbjct: 144 SNLCQSVR---YTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIG 200

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
           CG+N   G       GI+GLG G  S+ +QL S   I     +CL            L F
Sbjct: 201 CGHNN-RGMFQGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLNF 257

Query: 219 GDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLP------VVFDSGSSY 270
           GD    S   V ++  +  D   +Y   +     G +    + L       ++ DSG++ 
Sbjct: 258 GDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTTL 317

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           T L    Y  L S + + +    + +   ++ L LC+
Sbjct: 318 TLLPSHVYTNLESAVAQLVKLDRVDDP--NQLLNLCY 352


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 140/357 (39%), Gaps = 62/357 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y   + +G PAR  ++ LDTGSD+ WLQC APC RC     P++ P        +PC
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPC 197

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
             P C  L + G +       C Y++ Y DG  ++G    +   F  N   G      +A
Sbjct: 198 SSPHCRRLDSAGCNTRRK--TCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VA 249

Query: 162 LGCGYNQ-------------------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           LGCG++                     PG + H  +      K    +V +  S K    
Sbjct: 250 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFN-----QKFSYCLVDRSASSKPSSV 304

Query: 203 VVGHCLSGGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
           V G+        F  L     L     V    +S   T+   PGVA   F  +  G  N 
Sbjct: 305 VFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRV--PGVAASLFKLDQIG--NG 360

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            V+ DSG+S T L R  Y  +    +  + AK+LK AP+      C+       N+++VK
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKALKRAPDFSLFDTCFD----LSNMNEVK 414

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
               T+ L F          L    YLI +   G  C          +  L++IG I
Sbjct: 415 --VPTVVLHFRGADV----SLPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNI 461


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 142/324 (43%), Gaps = 55/324 (16%)

Query: 17  MSSSSSSSSSSSLFNHVGSS-LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           +SS+      +S+   +GSS     VH  +  T  + V   +GQP  P F  +DTGS L 
Sbjct: 34  ISSARFKYLQNSIVKELGSSDFQVDVHQAI-KTSLFFVNFSVGQPPVPQFTIMDTGSSLL 92

Query: 76  WLQCDAPCVRCV--EAPHPLYRP--SNDLVP--CEDPICASLHAPGHHNCEDPAQCDYEL 129
           W+QC  PC  C      HP++ P  S+  V   C+D  C   +AP  H   +  +C YE 
Sbjct: 93  WIQCH-PCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCR--YAPNGHCSSN--KCVYEQ 147

Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKS 188
            Y  G  S GVL K+   F   NG  +  + +A GCG+            GILGLG   +
Sbjct: 148 VYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHENGEQLE-SEFTGILGLGAKPT 206

Query: 189 SIVSQLHSQKLIRNVVGHC---LSGGGGGF--LFFGDD---LYDSSRVVWTSMSSDYTKY 240
           S+  QL S+        +C   L+    G+  L  G+D   L D + + + + +      
Sbjct: 207 SLAVQLGSK------FSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETEN------ 254

Query: 241 YSPGVAELFFGGETTGLKNL---PVVF-----------DSGSSYTYLNRVTYQTLTSIMK 286
              G+  +   G + G K L   PVVF           D+G+ YT+L  + Y+ L + +K
Sbjct: 255 ---GIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYNEIK 311

Query: 287 KELSAKSLKEAPEDETLPLCWKGR 310
             L  K  +    D    LC+ GR
Sbjct: 312 SILDPKLERFWFRDF---LCYHGR 332


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 121/278 (43%), Gaps = 27/278 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDP 106
           Y ++  IG P    +  +DT +D  W QC+ PC  C     P++ PS       +PC  P
Sbjct: 89  YIISFLIGTPPFQLYGVMDTANDNIWFQCN-PCKPCFNTTSPMFDPSKSSTYKTIPCSSP 147

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCG 165
            C ++    H + +D   C+Y   Y     S G L  D    N  N   ++ + + +GCG
Sbjct: 148 KCKNVEN-THCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCG 206

Query: 166 Y-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
           + N+ P   Y  + G +GLG+G  S +SQL+S   I     +CL     + G  G L FG
Sbjct: 207 HRNKGPLEGY--VSGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISGKLHFG 262

Query: 220 D-DLYDSSRVVWTSMSSDYTKY------YSPGVAELFFGGETTGLKNL-PVVFDSGSSYT 271
           D  +      V T +++    Y       S G   + F   T+   NL   + DSG++ T
Sbjct: 263 DKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGTTLT 322

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
            L    Y  L SI+   +  +  K    ++   LC+K 
Sbjct: 323 ILPENVYSRLESIVTSMVKLERAKSP--NQQFKLCYKA 358


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 95/348 (27%), Positives = 148/348 (42%), Gaps = 38/348 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           TG Y V + +G PA+ + L  DTGS+LTW++C          P  ++RP    S   VPC
Sbjct: 88  TGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAG----GASPPGLVFRPEASKSWAPVPC 143

Query: 104 EDPICASLHAP-GHHNCEDPAQ-CDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLNPR- 159
               C  L  P    NC   A  C Y+  Y +G + +LGV+  D+       G+    + 
Sbjct: 144 SSDTC-KLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQD 202

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGGGGGFL 216
           + LGC      G S+  +DG+L LG  K S  S+  ++        +V H       G+L
Sbjct: 203 VVLGCSSTH-DGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYL 261

Query: 217 FFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGL-------KNLPVVFDSGS 268
            FG      +    T +  D    +Y   V  +   G+   +       K+  V+ DSG+
Sbjct: 262 AFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSGT 321

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           + T L    Y+ + + + K L+     + P  E    C+    P     ++ K    LA+
Sbjct: 322 TLTVLATPAYKAVVAALTKLLAGVPKVDFPPFEH---CYNWTAPRPGAPEIPK----LAV 374

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            FT G  R   E   ++Y+I    G  C+G+  G   G   ++VIG I
Sbjct: 375 QFT-GCAR--LEPPAKSYVIDVKPGVKCIGLQEGEWPG---VSVIGNI 416


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 96/346 (27%), Positives = 144/346 (41%), Gaps = 58/346 (16%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH 112
           IG PA  Y   +DTGSDL W QC  PCV C +   P++ PS+      VPC    C+ L 
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231

Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGA 172
                 C   ++C Y   Y D  S+ GVL  + F    +      P +  GCG +   G 
Sbjct: 232 T---SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGCG-DTNEGD 283

Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---------GGFLFFGDDLY 223
            +    G++GLG+G  S+VSQL   K       +CL+            G      +   
Sbjct: 284 GFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLTSLDDTNNSPLLLGSLAGISEASA 338

Query: 224 DSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLKNLP----------VVFDSGSSYT 271
            +S V  T +  + ++  +Y   +  +  G     L +            V+ DSG+S T
Sbjct: 339 AASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSIT 398

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDE--TLPLCWKGRRPFKNVHDVKKCFRTLALS 329
           YL    Y+ L    KK  +A+    A +     L LC+  R P K V  V+     L   
Sbjct: 399 YLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLCF--RAPAKGVDQVE--VPRLVFH 450

Query: 330 FTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGLQDLNVIG 374
           F  G      +L  E Y+++    G +CL ++     G + L++IG
Sbjct: 451 FDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GSRGLSIIG 488


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 151/363 (41%), Gaps = 60/363 (16%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           + G    +G Y   + +G P R  ++ LDTGSD+ W+QC APC RC     P++ P    
Sbjct: 116 ISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQC-APCKRCYAQSDPVFDPRKSR 174

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
               + C  P+C  L +PG   C    Q C Y++ Y DG  + G    +   F  T    
Sbjct: 175 SFASIACRSPLCHRLDSPG---CNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR--- 228

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGG 211
              R+ALGCG++      +    G+LGLG+G+ S  SQ  + +   +   +CL    +  
Sbjct: 229 -VARVALGCGHDN--EGLFVGAAGLLGLGRGRLSFPSQ--TGRRFNHKFSYCLVDRSASS 283

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSD---YTKYY------------SPGVAELFFGGETTG 256
               + FGD    S    +T + S+    T YY             PG+    F  + TG
Sbjct: 284 KPSSMVFGDSAV-SRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTG 342

Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFK 314
             N  V+ DSG+S T L R  Y       +    A +LK AP+      C+   G+   K
Sbjct: 343 --NGGVIIDSGTSVTRLTRPAYIAFRDAFRA--GASNLKRAPQFSLFDTCFDLSGKTEVK 398

Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVI 373
            V  V   FR   +S           L    YLI +   GN CL         +  L++I
Sbjct: 399 -VPTVVLHFRGADVS-----------LPASNYLIPVDTSGNFCLAFAG----TMGGLSII 442

Query: 374 GGI 376
           G I
Sbjct: 443 GNI 445


>gi|62954896|gb|AAY23265.1| Similar to probable aspartic proteinase (EC 3.4.23.-) - barley
          [Oryza sativa Japonica Group]
 gi|77548965|gb|ABA91762.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa
          Japonica Group]
 gi|125576451|gb|EAZ17673.1| hypothetical protein OsJ_33214 [Oryza sativa Japonica Group]
          Length = 96

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 34/53 (64%), Positives = 44/53 (83%)

Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA 89
          ++F +HGNVYP+G + VTM IG P +PYFLD+DTGSDLTW++CDAPC  C +A
Sbjct: 30 MVFPLHGNVYPSGRFFVTMNIGVPEKPYFLDIDTGSDLTWVECDAPCQSCHQA 82


>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
          Length = 530

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 88/338 (26%), Positives = 139/338 (41%), Gaps = 38/338 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
           +G P + + + LDTGSDL WL  QCD   P           Y PS    +  VPC    C
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQFC 181

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                     C   +QC Y++ Y     SS G LV+D    +  +   Q L  ++  GCG
Sbjct: 182 EL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCG 236

Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
             QV   S+      +G+ GLG    SI S L  + L  N    C S  G G + FGD  
Sbjct: 237 --QVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQG 294

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
                     ++  +   Y+  ++E+  G   T L+    +FD+G+S+TYL    Y  +T
Sbjct: 295 SSDQEETPLDVNPQHPT-YTISISEMTVGNSLTDLE-FSTIFDTGTSFTYLADPAYTYIT 352

Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR--TLFE 340
                ++ A   + A +          R PF+  +D+      +       +T   ++F 
Sbjct: 353 QSFHAQVHAN--RHAAD---------SRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401

Query: 341 LTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGGI 376
           +  E  +I   +     CL I+  A++ +   N + G+
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGL 439


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 139/353 (39%), Gaps = 59/353 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + + IG PA+P+   +DTGSDL W QC  PC +C     P++ P    S   +PC 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L +P   N      C Y   Y DG  + G +  +   F    G    P +  GC
Sbjct: 152 SQLCQALQSPTCSN----NSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGC 203

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDL 222
           G N   G       G++G+G+G  S+ SQL   K       +C++  G           L
Sbjct: 204 GENN-QGFGQGNGAGLVGMGRGPLSLPSQLDVTKF-----SYCMTPIGSSNSSTLLLGSL 257

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP----------------VV 263
            +S     T+ S + T   S  +   ++    G + G   LP                ++
Sbjct: 258 ANS----VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGII 313

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            DSG++ TY     YQ +      +++   +  +       LC++      N+       
Sbjct: 314 IDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGS--SSGFDLCFQMPSDQSNLQ-----I 366

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            T  + F  G       L  E Y I  + G +CL + + +    Q +++ G I
Sbjct: 367 PTFVMHFDGGD----LVLPSENYFISPSNGLICLAMGSSS----QGMSIFGNI 411


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 79/279 (28%), Positives = 115/279 (41%), Gaps = 33/279 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPHPLYRPSN----DLVPCED 105
           Y VT+ +G PA    L++DTGSD++W+QC   P   C     PL+ P+       VPC  
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 201

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
             C+ L    + N     QC Y + Y DG ++ GV   D      +N  +       GCG
Sbjct: 202 ASCSQLAL--YSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALK---GFLFGCG 256

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---------KLIRNVVGHCLSGGGGGFL 216
           + Q     +  +DG+LGLG+   S+VSQ  S             +N VG+   GG     
Sbjct: 257 HAQQ--GLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTA 314

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTY 272
            F       S     + S+D T YY   +A +  GG+   +         V D+G+  T 
Sbjct: 315 GF-------STTPLLTASNDPT-YYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTR 366

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
           L    Y  L S  +  ++      AP    L  C+   R
Sbjct: 367 LPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTR 405


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 76/165 (46%), Gaps = 18/165 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y V   IG P       LDTGSDL W QCDAPC RC   P PLY P+  +    V C
Sbjct: 97  TATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSC 156

Query: 104 EDPICASLHA---------PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
              +C +L +                +   C Y   Y DG S+ GVL  + F F    G 
Sbjct: 157 GSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFG--AGT 214

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
            ++  LA GCG + + G       G++G+G+G  S+VSQL   K 
Sbjct: 215 TVH-DLAFGCGTDNLGGTDNS--SGLVGMGRGPLSLVSQLGVTKF 256


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 96/354 (27%), Positives = 138/354 (38%), Gaps = 45/354 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
           +G Y   + +G P     L LDT SDLTWLQC  PC RC     P++ P +     E   
Sbjct: 135 SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMSF 193

Query: 108 -CASLHAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             A   A G     D  +  C Y + Y DG +++G  +++   F    G RL PR+++GC
Sbjct: 194 NAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTF--AGGVRL-PRISIGC 250

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDL 222
           G++   G    P  GILGLG+G  S  +Q+         +   LSG G     L FG   
Sbjct: 251 GHDN-KGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSSTLTFGAGA 309

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------------VVFD 265
            D+S  V  S +        P    +   G + G   +P                 V+ D
Sbjct: 310 VDTSPPV--SFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGGVIVD 367

Query: 266 SGSSYTYLNRVTYQTLTSIMKK-ELSAKSLKEAPEDETLPLCWK-GRRPFKNVHDVKKCF 323
           SG++ T L R  Y       +   +    +           C+  G R  K V  V   F
Sbjct: 368 SGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMHF 427

Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
                           +L P+ YLI + + G VC      A  G   +++IG I
Sbjct: 428 ----------AGSVEVKLQPKNYLIPVDSMGTVCFAF---AATGDHSVSIIGNI 468


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 92/349 (26%), Positives = 139/349 (39%), Gaps = 46/349 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR  ++ LDTGSD+ WLQC APC +C     P++ P+       +PC
Sbjct: 126 SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQADPVFDPTKSRTYAGIPC 184

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C  L +PG +N      C Y++ Y DG  + G    +   F  T       R+ALG
Sbjct: 185 GAPLCRRLDSPGCNNKNK--VCQYQVSYGDGSFTFGDFSTETLTFRRTRVT----RVALG 238

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVS--QLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
           CG++   G        +       S  V   +  +QK    +V    S      +F    
Sbjct: 239 CGHDN-EGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGDSA 297

Query: 222 LYDSSRVVWTSMSSDYTKYY-----------SP--GVAELFFGGETTGLKNLPVVFDSGS 268
           +  ++R      +     +Y           SP  G++   F  +  G  N  V+ DSG+
Sbjct: 298 VSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAG--NGGVIIDSGT 355

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
           S T L R  Y  L    +  + A  LK A E      C+        + +VK    T+ L
Sbjct: 356 SVTRLTRPAYIALRDAFR--VGASHLKRAAEFSLFDTCFD----LSGLTEVK--VPTVVL 407

Query: 329 SFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            F          L    YLI + N G+ C          +  L++IG I
Sbjct: 408 HFRGADV----SLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNI 448


>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
 gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
          Length = 530

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 88/338 (26%), Positives = 139/338 (41%), Gaps = 38/338 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
           +G P + + + LDTGSDL WL  QCD   P           Y PS    +  VPC    C
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQFC 181

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                     C   +QC Y++ Y     SS G LV+D    +  +   Q L  ++  GCG
Sbjct: 182 EL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCG 236

Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
             QV   S+      +G+ GLG    SI S L  + L  N    C S  G G + FGD  
Sbjct: 237 --QVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQG 294

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
                     ++  +   Y+  ++E+  G   T L+    +FD+G+S+TYL    Y  +T
Sbjct: 295 SSDQEETPLDVNPQHPT-YTISISEITVGNSLTDLE-FSTIFDTGTSFTYLADPAYTYIT 352

Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR--TLFE 340
                ++ A   + A +          R PF+  +D+      +       +T   ++F 
Sbjct: 353 QSFHAQVHAN--RHAAD---------SRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401

Query: 341 LTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGGI 376
           +  E  +I   +     CL I+  A++ +   N + G+
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGL 439


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 146/356 (41%), Gaps = 53/356 (14%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND--- 99
           GN+Y   Y NV+  IG P   + + LDTGSDL WL C+  C +C     P Y    D   
Sbjct: 101 GNLY---YANVS--IGTPGLYFLVALDTGSDLFWLPCE--CTKC-----PTYLTKRDNGK 148

Query: 100 ---------------LVPCEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVK 143
                           VPC   +C       +    + + C Y+  Y ++  SS G LV+
Sbjct: 149 FWLNHYSSNASSTSIRVPCSSSLCEL----ANQCSSNKSSCPYQTHYLSENSSSAGYLVQ 204

Query: 144 DAFAFNYTNGQRLNP---RLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKL 199
           D      T+  +L P   ++ LGCG  Q    S     +G++GLG GK S+ S L SQ L
Sbjct: 205 DILHMA-TDDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGL 263

Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
             +    C    G G + FGD      R    + +S     Y+  + ++      T + +
Sbjct: 264 TTDSFSMCFGYYGYGRIDFGDIGPVGQRETPFNPAS---LSYNVTILQIIVTNRPTNV-H 319

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
           L  + DSG+S+TYL    Y  +T  M    +A  L+    D   P  +  R     +   
Sbjct: 320 LTAIIDSGASFTYLTDPFYSIITENMD---AAMELERIKSDSDFPFEYCYRLSLATI--- 373

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
              F+   L+FT    R    +T    +   +   +CL I+   ++ +   N  GG
Sbjct: 374 ---FQQPNLNFTMEGGRKFDVITSYVSVDTDDGPALCLAIVKSTDINVIGHNFFGG 426


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 139/353 (39%), Gaps = 59/353 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + + IG PA+P+   +DTGSDL W QC  PC +C     P++ P    S   +PC 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L +P   N      C Y   Y DG  + G +  +   F    G    P +  GC
Sbjct: 152 SQLCQALQSPTCSN----NSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGC 203

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDL 222
           G N   G       G++G+G+G  S+ SQL   K       +C++  G           L
Sbjct: 204 GENN-QGFGQGNGAGLVGMGRGPLSLPSQLDVTKF-----SYCMTPIGSSTSSTLLLGSL 257

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP----------------VV 263
            +S     T+ S + T   S  +   ++    G + G   LP                ++
Sbjct: 258 ANS----VTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGII 313

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            DSG++ TY     YQ +      +++   +  +       LC++      N+       
Sbjct: 314 IDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGS--SSGFDLCFQMPSDQSNLQ-----I 366

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            T  + F  G       L  E Y I  + G +CL + + +    Q +++ G I
Sbjct: 367 PTFVMHFDGGD----LVLPSENYFISPSNGLICLAMGSSS----QGMSIFGNI 411


>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
          Length = 530

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 88/338 (26%), Positives = 139/338 (41%), Gaps = 38/338 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
           +G P + + + LDTGSDL WL  QCD   P           Y PS    +  VPC    C
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQFC 181

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                     C   +QC Y++ Y     SS G LV+D    +  +   Q L  ++  GCG
Sbjct: 182 EL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCG 236

Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
             QV   S+      +G+ GLG    SI S L  + L  N    C S  G G + FGD  
Sbjct: 237 --QVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQG 294

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
                     ++  +   Y+  ++E+  G   T L+    +FD+G+S+TYL    Y  +T
Sbjct: 295 SSDQEETPLDVNPQHPT-YTISISEITVGNSLTDLE-FSTIFDTGTSFTYLADPAYTYIT 352

Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR--TLFE 340
                ++ A   + A +          R PF+  +D+      +       +T   ++F 
Sbjct: 353 QSFHAQVHAN--RHAAD---------SRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401

Query: 341 LTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGGI 376
           +  E  +I   +     CL I+  A++ +   N + G+
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGL 439


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 83/273 (30%), Positives = 116/273 (42%), Gaps = 38/273 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+  G P+ P  L +DTGSD++W+QC  PC   +C     PL+ PS       + C 
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQC-TPCNSTKCYPQKDPLFDPSKSSTYAPIACN 189

Query: 105 DPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL- 162
              C  L    H+ C     QC Y +EYADG  S GV   +           L P + + 
Sbjct: 190 TDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT--------LAPGITVE 241

Query: 163 ----GCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGF 215
               GCG +Q  P   Y   DG+LGLG    S+V Q  S  +      +CL       GF
Sbjct: 242 DFHFGCGRDQRGPSDKY---DGLLGLGGAPVSLVVQTSS--VYGGAFSYCLPALNSEAGF 296

Query: 216 LFFGDDLY-DSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGS 268
           L  G     + S  V+T M     Y  +Y   +  +  GG+   +        ++ DSG+
Sbjct: 297 LVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGT 356

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
             T L    Y  L + ++K L A  L   P D+
Sbjct: 357 VDTELPETAYNALEAALRKALKAYPL--VPSDD 387


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 144/340 (42%), Gaps = 28/340 (8%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           + G V  TG   V  Y     + Y L +DTGS  T++ C   C RC E  H  Y     +
Sbjct: 29  LRGGVLGTGTL-VAEYALADGQTYDLIVDTGSARTYVPCKG-CARCGEHAHGYYDYDRSM 86

Query: 101 ----VPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
               + C +   A+L        C+   +C Y + YA+G SS G +V+D           
Sbjct: 87  EFERLDCGEASDATLCEETMKGTCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGT--- 143

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--G 213
           L+  LA GC   +         DG+ G G+G +++ +QL S  LI NV   C+ G G  G
Sbjct: 144 LSAMLAFGCEEAETNAIYEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANG 203

Query: 214 GFLFFG--DDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGE-TTGLKNLPVVFDSGS 268
           G L  G  D   D+  +  T + +D     +++   +    G      L +     DSG+
Sbjct: 204 GVLTLGRFDFGADAPALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGT 263

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLK--EAPEDETLPLCWKGRRPFKNV----HDVKKC 322
           ++T++ R  + +  + +  + +   L+    P+ +   +C+       N+      V + 
Sbjct: 264 TFTFVPRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEW 323

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGIL 360
           F  L +++  G + T   L PE YL    +N    C+GI 
Sbjct: 324 FPPLTIAYEGGVSLT---LGPENYLFAHETNSAAFCVGIF 360


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 151/379 (39%), Gaps = 53/379 (13%)

Query: 33  VGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---- 86
           VG  + F V G  + Y  G Y   + +G P R + + +DTGSD+ W+ C++ C  C    
Sbjct: 46  VGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTS 104

Query: 87  -VEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGV 140
            +      +  S+     LV C DPIC S        C     QC Y  +Y DG  + G 
Sbjct: 105 GLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGY 164

Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
            V D   F+   G+ L    +  +  GC   Q    +     +DGI G G+G+ S++SQL
Sbjct: 165 YVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQL 224

Query: 195 HSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET 254
            +  +   V  HCL G G G             +V++ +      +Y+  +  +   G+ 
Sbjct: 225 STHGITPRVFSHCLKGEGIGGGILVLGEILEPGMVYSPLVPS-QPHYNLNLQSIAVNGKL 283

Query: 255 TGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
             +   P VF          DSG++  YL    Y    S +   +S             P
Sbjct: 284 LPID--PSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPS---------VTP 332

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI---ISNKGNVCLGILN 361
           +  KG + +     V + F   + +F  G +     L PE YLI    S  G+V   I  
Sbjct: 333 IISKGNQCYLVSTSVSQMFPLASFNFAGGASMV---LKPEDYLIPFGPSQGGSVMWCI-- 387

Query: 362 GAEVGLQDLNVIGGIGDFV 380
               G Q +  +  +GD V
Sbjct: 388 ----GFQKVQGVTILGDLV 402


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 132/318 (41%), Gaps = 50/318 (15%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPICASLHAPGHHNCEDPA 123
           +DTGSDL W QC APC+ C + P P +      +   +PC    CASL +P         
Sbjct: 1   MDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFK----K 55

Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILG 182
            C Y+  Y D  S+ GVL  + F F   N  ++    +A GCG   +         G++G
Sbjct: 56  MCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCG--SLNAGDLANSSGMVG 113

Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTSMSSDYTK 239
            G+G  S+VSQL   +       +CL+         L+FG     SS    +      T 
Sbjct: 114 FGRGPLSLVSQLGPSRF-----SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTP 168

Query: 240 Y-YSPGVAELFF---GGETTGLKNLP---------------VVFDSGSSYTYLNRVTYQT 280
           +  +P +  ++F      + G K LP               V+ DSG+S T+L +  Y+ 
Sbjct: 169 FVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA 228

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
           +   +   +   ++ +   D  L  C++   P     +V      L   F D    TL  
Sbjct: 229 VRRGLVSAIPLPAMND--TDIGLDTCFQWPPP----PNVTVTVPDLVFHF-DSANMTLL- 280

Query: 341 LTPEAYLII-SNKGNVCL 357
             PE Y++I S  G +CL
Sbjct: 281 --PENYMLIASTTGYLCL 296


>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 545

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 125/291 (42%), Gaps = 44/291 (15%)

Query: 37  LLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---- 91
           L F    + Y +G  Y   + +G P   + + LDTGSDL W+ CD  C +C   P     
Sbjct: 95  LTFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCD--CRQCATIPSANAT 152

Query: 92  ----PLYRP-------SNDLVPCEDPICASLHAPGHHNCEDPA---QCDYELEYADGG-S 136
               P  RP       +++ V C++P+C      G  N    A    C YE++Y     S
Sbjct: 153 GPDAPPLRPYSPRRSSTSEQVACDNPLC------GRRNGCSAATNGSCPYEVQYVSANTS 206

Query: 137 SLGVLVKDAFAFNYTN------GQRLNPRLALGCGYNQVPGASYH----PLDGILGLGKG 186
           S GVLV+D              G+ L   +  GCG  Q  GA        +DG++GLG G
Sbjct: 207 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQT-GAFLDDGGGAVDGLMGLGMG 265

Query: 187 KSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGD-DLYDSSRVVWTSMSSDYTKYYSPG 244
           K S+ S L +  L+  +    C    G G + FGD      +   +T  S + T  Y+  
Sbjct: 266 KVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPT--YNVS 323

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
              +  G E+   +    V DSG+S+TYL+   Y  L +    ++S + + 
Sbjct: 324 FTSIGIGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVN 373


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 134/311 (43%), Gaps = 39/311 (12%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNC---- 119
           +DT S+LTW+QC APC  C +   PL+ P++     ++PC    C +L            
Sbjct: 142 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200

Query: 120 --EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHP 176
             E P+ C Y L Y DG  S GVL  D  +     G+ ++     GCG  NQ P   +  
Sbjct: 201 GGEQPS-CSYTLSYRDGSYSQGVLAHDKLSL---AGEVIDG-FVFGCGTSNQGP---FGG 252

Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDDL---YDSSRVVW 230
             G++GLG+ + S++SQ   Q     V  +CL        G L  GDD     +S+ +V+
Sbjct: 253 TSGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 310

Query: 231 TSMSSDYTK--YYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
           T+M SD  +  +Y   +  +  GG+        V+ DSG+  T L    Y  + +    +
Sbjct: 311 TTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQ 370

Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
            +     +AP    L  C+     F+ V        +L   F +G      + +   Y +
Sbjct: 371 FA--EYPQAPGFSILDTCFN-LTGFREVQ-----IPSLKFVF-EGNVEVEVDSSGVLYFV 421

Query: 349 ISNKGNVCLGI 359
            S+   VCL +
Sbjct: 422 SSDSSQVCLAL 432


>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
 gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 543

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 124/291 (42%), Gaps = 44/291 (15%)

Query: 37  LLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---- 91
           L F    + Y +G  Y   + +G P   + + LDTGSDL W+ CD  C +C   P     
Sbjct: 93  LTFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCD--CRQCATIPSANGT 150

Query: 92  ----PLYRP-------SNDLVPCEDPICASLHAPGHHNCEDPA---QCDYELEYADGG-S 136
               P  RP       ++  V C++P+C      G  N    A    C YE++Y     S
Sbjct: 151 GQDAPSLRPYSPRRSSTSKQVACDNPLC------GQRNGCSAATNGSCPYEVQYVSANTS 204

Query: 137 SLGVLVKDAFAFNYTN------GQRLNPRLALGCGYNQVPGASYH----PLDGILGLGKG 186
           S GVLV+D              G+ L   +  GCG  Q  GA        +DG++GLG G
Sbjct: 205 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQT-GAFLDGGGGAVDGLMGLGMG 263

Query: 187 KSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGD-DLYDSSRVVWTSMSSDYTKYYSPG 244
           K S+ S L +  L+  +    C    G G + FGD      +   +T  S + T  Y+  
Sbjct: 264 KVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPT--YNVS 321

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
              +  G E+   +    V DSG+S+TYL+   Y  L +    ++S + + 
Sbjct: 322 FTSIGVGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVN 371


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 136/313 (43%), Gaps = 38/313 (12%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRP---- 96
           G+ Y +  Y  T+ +G PA P  L LDTGS LTW+QC  PC   +C     PL+ P    
Sbjct: 121 GSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCK-PCNSSQCYPQRLPLFDPNTSS 179

Query: 97  SNDLVPCEDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           S   VPC+   C +L A      C       C YE+ Y  G +  G    DA        
Sbjct: 180 SYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTL---GP 236

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGG 211
             +  R   GCG++Q  G  +   DG+LGLG+   S+  Q  +++    V  HCL  +G 
Sbjct: 237 GAIVKRFHFGCGHHQQRG-KFDMADGVLGLGRLPQSLAWQASARR-GGGVFSHCLPPTGV 294

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKNLP-------V 262
             GFL  G   +D+S  V+T + +  D   +Y      +   G+   L ++P       V
Sbjct: 295 STGFLALGAP-HDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQ---LLDIPPAVFREGV 350

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           + DSG+  + L    Y  L +  +  ++   L  AP    L  C+     F    +V   
Sbjct: 351 ITDSGTVLSALQETAYTALRTAFRSAMAEYPL--APPVGHLDTCFN----FTGYDNVT-- 402

Query: 323 FRTLALSFTDGKT 335
             T++L+F  G T
Sbjct: 403 VPTVSLTFRGGAT 415


>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
 gi|219887047|gb|ACL53898.1| unknown [Zea mays]
 gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 416

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 70/258 (27%), Positives = 108/258 (41%), Gaps = 19/258 (7%)

Query: 47  PTGYYNVTMYIGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SN 98
           P+  +   + +G P + + + LDTGSDL WL  QCD   P           Y P    ++
Sbjct: 3   PSSLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTS 62

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QR 155
             VPC    C          C    QC Y++ Y   G SS G LV+D    +  N   Q 
Sbjct: 63  KAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 117

Query: 156 LNPRLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           L  ++ LGCG  Q          +G+ GLG  + S+ S L  + L  N    C    G G
Sbjct: 118 LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIG 177

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
            + FGD            ++  +   Y+  ++ +  G + T + +   +FD+G+S+TYL 
Sbjct: 178 RISFGDQESSDQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLA 235

Query: 275 RVTYQTLTSIMKKELSAK 292
              Y  +T     ++ A 
Sbjct: 236 DPAYTYITQSFHAQVQAN 253


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 79/279 (28%), Positives = 115/279 (41%), Gaps = 33/279 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPHPLYRPSN----DLVPCED 105
           Y VT+ +G PA    L++DTGSD++W+QC   P   C     PL+ P+       VPC  
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 190

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
             C+ L    + N     QC Y + Y DG ++ GV   D      +N  +       GCG
Sbjct: 191 ASCSQLAL--YSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALK---GFLFGCG 245

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---------KLIRNVVGHCLSGGGGGFL 216
           + Q     +  +DG+LGLG+   S+VSQ  S             +N VG+   GG     
Sbjct: 246 HAQQ--GLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTA 303

Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTY 272
            F       S     + S+D T YY   +A +  GG+   +         V D+G+  T 
Sbjct: 304 GF-------STTPLLTASNDPT-YYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTR 355

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
           L    Y  L S  +  ++      AP    L  C+   R
Sbjct: 356 LPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTR 394


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 128/307 (41%), Gaps = 35/307 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSND----LV 101
           T  Y VT  +G P     L++DTGSDL+W+QC  PC    C     PL+ P+       V
Sbjct: 134 TSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCK-PCAAPSCYRQKDPLFDPAQSSSYAAV 192

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKD--AFAFNYTNGQRLNPR 159
           PC    CA L    + +    AQC Y + Y DG ++ GV   D    A N T    L   
Sbjct: 193 PCGRSACAGLGI--YASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFL--- 247

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLF 217
              GCG+ Q  G  +  +DG+LG G+ + S+V Q  +      V  +CL       G+L 
Sbjct: 248 --FGCGHAQ-SGGLFTGIDGLLGFGREQPSLVQQ--TAGAYGGVFSYCLPTKSSTTGYLT 302

Query: 218 FGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYT 271
            G     +     T +  S +   YY   +  +  GG+   +         V D+G+  T
Sbjct: 303 LGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVIT 362

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y  L S  +  ++  S   AP    L  C+     F     V     ++AL+F+
Sbjct: 363 RLPPAAYAALRSAFRSGMA--SYPSAPPIGILDTCYS----FAGYGTVN--LTSVALTFS 414

Query: 332 DGKTRTL 338
            G T TL
Sbjct: 415 SGATMTL 421


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 134/311 (43%), Gaps = 39/311 (12%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNC---- 119
           +DT S+LTW+QC APC  C +   PL+ P++     ++PC    C +L            
Sbjct: 141 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199

Query: 120 --EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHP 176
             E P+ C Y L Y DG  S GVL  D  +     G+ ++     GCG  NQ P   +  
Sbjct: 200 GGEQPS-CSYTLSYRDGSYSQGVLAHDKLSL---AGEVIDG-FVFGCGTSNQGP---FGG 251

Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDDL---YDSSRVVW 230
             G++GLG+ + S++SQ   Q     V  +CL        G L  GDD     +S+ +V+
Sbjct: 252 TSGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 309

Query: 231 TSMSSDYTK--YYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
           T+M SD  +  +Y   +  +  GG+        V+ DSG+  T L    Y  + +    +
Sbjct: 310 TTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQ 369

Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
            +     +AP    L  C+     F+ V        +L   F +G      + +   Y +
Sbjct: 370 FA--EYPQAPGFSILDTCFN-LTGFREVQ-----IPSLKFVF-EGNVEVEVDSSGVLYFV 420

Query: 349 ISNKGNVCLGI 359
            S+   VCL +
Sbjct: 421 SSDSSQVCLAL 431


>gi|356546446|ref|XP_003541637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 160

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 54/75 (72%), Gaps = 1/75 (1%)

Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
           +LP+CWK  + FK++HDV   F+ +AL FT  K  +L +L PE+YLI++  G VCLGIL+
Sbjct: 58  SLPICWKDTKTFKSLHDVTSNFKPIALRFTKSK-NSLLQLQPESYLIVTKHGKVCLGILD 116

Query: 362 GAEVGLQDLNVIGGI 376
           G E+GL + N+IG I
Sbjct: 117 GTEIGLGNTNIIGDI 131


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 130/314 (41%), Gaps = 33/314 (10%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQ---VHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
           +R+   SS  ++S   +  G +  F    + G    +G Y   + +G P +  ++ LDTG
Sbjct: 3   IRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTG 62

Query: 72  SDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDY 127
           SD+ WLQC APC  C     P++ P    S   V C  P+C  L +PG   C     C Y
Sbjct: 63  SDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPG---CNQRQTCLY 118

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGILGLGKG 186
           ++ Y DG  + G  V +   F  T  +    ++ALGCG+ N+        L G+   G  
Sbjct: 119 QVSYGDGSYTTGEFVTETLTFRRTKVE----QVALGCGHDNEGLFVGAAGLLGLGRGGLS 174

Query: 187 KSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY----- 241
             S   +  +QK    +V    S      +F    +  ++R      +     +Y     
Sbjct: 175 FPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELL 234

Query: 242 ------SP--GVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
                 +P  G+    F  + TG  N  V+ D G+S T LN+  Y  L    +    A S
Sbjct: 235 GISVGGTPVSGITASHFKLDRTG--NGGVIIDCGTSVTRLNKPAYIALRDAFRA--GASS 290

Query: 294 LKEAPEDETLPLCW 307
           LK APE      C+
Sbjct: 291 LKSAPEFSLFDTCY 304


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 76/160 (47%), Gaps = 11/160 (6%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----L 100
           V   G + + + IG P R +   +DTGSDL W QC  PC +C +   P++ P        
Sbjct: 105 VAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYK 163

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLNPR 159
           + C   +C +L       C     C+Y   Y D  S+ GVL  + F F + T  Q   P 
Sbjct: 164 ISCSSELCGALPT---STCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPG 219

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           L  GCG N   G  +    G++GLG+G  S+VSQL  QK 
Sbjct: 220 LGFGCG-NDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKF 258


>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
          Length = 829

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 66/376 (17%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMY----IGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEA 89
           S L F      Y  G +    +    +G P   + + LDTGSDL WL C+   CVR VE+
Sbjct: 82  SPLTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGVES 141

Query: 90  -----PHPLY----RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEY-ADGGSSL 138
                   +Y      ++  V C   +C          C    + C YE+ Y ++G S+ 
Sbjct: 142 NGEKIAFNIYDLKGSSTSQTVLCNSNLCEL-----QRQCPSSDSICPYEVNYLSNGTSTT 196

Query: 139 GVLVKDAFAF--NYTNGQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVS 192
           G LV+D      +    +  + R+  GCG  Q    + GA+    +G+ GLG G  S+ S
Sbjct: 197 GFLVEDVLHLITDDDETKDADTRITFGCGQVQTGAFLDGAAP---NGLFGLGMGNESVPS 253

Query: 193 QLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPG 244
            L  + L  N    C    G G + FGD+         +S+    T +        Y+  
Sbjct: 254 ILAKEGLTSNSFSMCFGSDGLGRITFGDN---------SSLVQGKTPFNLRALHPTYNIT 304

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           V ++  GG    L+    +FDSG+S+T+LN   Y+ +T+     +  +    +  DE   
Sbjct: 305 VTQIIVGGNAADLE-FHAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDEL-- 361

Query: 305 LCWKGRRPFKNVHDV---KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGI 359
                  PF+  +D+   K     + L+   G       L  +  + IS +G   +CLG+
Sbjct: 362 -------PFEYCYDLSSNKTVELPINLTMKGGDNY----LVTDPIVTISGEGVNLLCLGV 410

Query: 360 LNGAEVGLQDLNVIGG 375
           L    V +   N + G
Sbjct: 411 LKSNNVNIIGQNFMTG 426


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 143/352 (40%), Gaps = 44/352 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VP 102
           TG Y V + +G P +   L  DTGSDLTW QC  PCV+ C     P++ PS       + 
Sbjct: 151 TGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQ-PCVKSCYAQQQPIFDPSASKTYSNIS 209

Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C    C+ L  A G+      + C Y ++Y D   ++G   KD       +   +     
Sbjct: 210 CTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQND---VFDGFM 266

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
            GCG N      +    G++GLG+   SIV Q  +QK  +    +CL  S G  G L FG
Sbjct: 267 FGCGQNNR--GLFGKTAGLIGLGRDPLSIVQQ-TAQKFGK-YFSYCLPTSRGSNGHLTFG 322

Query: 220 D-DLYDSSRVVWTSM------SSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSG 267
           + +   +S+ V   +      SS    +Y   V  +  GG+   +     +N   + DSG
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +  T L    Y +L S  K+ +S      AP    L  C+       N   +      ++
Sbjct: 383 TVITRLPSTVYGSLKSTFKQFMS--KYPTAPALSLLDTCYD----LSNYTSIS--IPKIS 434

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
            +F         +L P   LI +    VCL     A  G  D + IG  G+ 
Sbjct: 435 FNFNGNAN---VDLEPNGILITNGASQVCL-----AFAGNGDDDTIGIFGNI 478


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 95/346 (27%), Positives = 142/346 (41%), Gaps = 57/346 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSN----DLVPC 103
           G Y +T+ IG P  PY    DTGSDL W QC APC  +C E P PLY P++     ++PC
Sbjct: 110 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC 168

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLNPRLAL 162
              +     A           C Y   Y  G ++ GV   + F F  +   Q   P +A 
Sbjct: 169 NSSLSMCAGALAGAAPPPGCACMYNQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAF 227

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
           GC  +    + ++   G++GLG+G  S+VSQL + +       +CL+        F D  
Sbjct: 228 GC--SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRF-----SYCLTP-------FQDTN 273

Query: 223 YDSSRVVWTSMSSDYTKYYS-PGVAE-----------LFFGGETTGLKNLPV-------- 262
             S+ ++  S + + T   S P VA            L   G + G K LP+        
Sbjct: 274 STSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLK 333

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
                  + DSG++ T L    YQ + + +K  ++     +  +   L LC+    P   
Sbjct: 334 PDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSA 393

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
              V     ++ L F DG    L    P    +IS  G  CL + N
Sbjct: 394 PPAV---LPSMTLHF-DGADMVL----PADSYMISGSGVWCLAMRN 431


>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 530

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 71/246 (28%), Positives = 108/246 (43%), Gaps = 18/246 (7%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDLVPCEDPIC 108
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P       + P  
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD--CIKCAPLASPDYGDLKFDMYSPRKSSTSRKVPCS 162

Query: 109 ASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
           +SL  P          C Y ++Y ++  SS GVLV+D       +GQ    +  +  G  
Sbjct: 163 SSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQSKITQAPITFGCG 222

Query: 168 QVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
           QV   S+      +G+LGLG    S+ S L S+ +  N    C    G G + FGD    
Sbjct: 223 QVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDGHGRINFGDT--G 280

Query: 225 SSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTS 283
           SS  + T ++      YY+  +     GG++   K    V DSG+S+T L+   Y  +TS
Sbjct: 281 SSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTK-FSAVVDSGTSFTALSDPMYTEITS 339

Query: 284 IMKKEL 289
               ++
Sbjct: 340 TFNAQV 345


>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 529

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 113/268 (42%), Gaps = 45/268 (16%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-----RPSNDLVP--------- 102
           IG P+  + + LDTGSDL W+ C+  CV+C       Y     +  N+  P         
Sbjct: 106 IGTPSVSFLVALDTGSDLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVF 163

Query: 103 -CEDPICASLHAPGHHNCEDPA-QCDYELEYADGG-SSLGVLVKDAFAFNYTNGQRL--- 156
            C   +C S       +C+ P  QC Y ++Y  G  SS G+LV+D     Y    RL   
Sbjct: 164 LCSHKLCGS-----ASDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218

Query: 157 ----NPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
                 R+ +GCG  Q    + G +    DG++GLG  + S+ S L    L+RN    C 
Sbjct: 219 SSSVKARVVVGCGKKQSGDYLDGVA---PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCF 275

Query: 209 SGGGGGFLFFGD---DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
                G ++FGD    +  S+  +    +S Y      GV     G       +     D
Sbjct: 276 DEEDSGRIYFGDMGPSIQQSAPFLQLENNSGYIV----GVEACCIGNSCLKQTSFTTFID 331

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKS 293
           SG S+TYL    Y+ +   + + ++A S
Sbjct: 332 SGQSFTYLPEEIYRKVALEIDRHINATS 359


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 123/287 (42%), Gaps = 23/287 (8%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           S S S++L N   ++    +   + P +G Y +++ IG P   Y    DTGSDL W QC 
Sbjct: 62  SLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC- 120

Query: 81  APCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
            PC++C +   P++ P    S   VPC    C ++      +C     CDY   Y D   
Sbjct: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAID---DSHCGAQGVCDYSYTYGD--- 174

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
                 K    F        + +  +GCG+       +    G++GLG G+ S+VSQ+  
Sbjct: 175 --QTYTKGDLGFEKITIGSSSVKSVIGCGHESG--GGFGFASGVIGLGGGQLSLVSQMSQ 230

Query: 197 QKLIRNVVGHC----LSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFG 251
              I     +C    LS   G   F  + +     VV T + S +   YY   +  +  G
Sbjct: 231 TSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIG 290

Query: 252 GE--TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
            E      K   V+ DSG++ ++L +  Y  + S + K + AK +K+
Sbjct: 291 NERHMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKD 337


>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 498

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 84/334 (25%), Positives = 136/334 (40%), Gaps = 32/334 (9%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
           +G P + + + LDTGSDL WL  QCD   P           Y P    ++  VPC    C
Sbjct: 115 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVPCNSNFC 174

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                     C    QC Y++ Y   G SS G LV+D    +  N   Q L  ++ LGCG
Sbjct: 175 DL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 229

Query: 166 YNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
             Q          +G+ GLG  + S+ S L  + L  N    C    G G + FGD    
Sbjct: 230 QTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESS 289

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
                   ++  +   Y+  ++ +  G + T + +   +FD+G+S+TYL    Y  +T  
Sbjct: 290 DQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLADPAYTYITQS 347

Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE-LTP 343
              ++ A   + A +          R PF+  +D+ +    +        T ++F  + P
Sbjct: 348 FHAQVQAN--RHAADS---------RIPFEYCYDLSEARFPIPDIILRTVTGSMFPVIDP 396

Query: 344 EAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
              + I     V CL I+   ++ +   N + G+
Sbjct: 397 GQVISIQEHEYVYCLAIVKSMKLNIIGQNFMTGL 430


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 151/352 (42%), Gaps = 52/352 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G P +  ++ LDTGSD+ WLQC  PC +C      ++ PS       +PC
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCK-PCTKCYSQTDQIFDPSKSKSFAGIPC 185

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C  L +PG     +   C Y++ Y DG  + G    +   F     +   PR+A+G
Sbjct: 186 YSPLCRRLDSPGCSLKNN--LCQYQVSYGDGSFTFGDFSTETLTFR----RAAVPRVAIG 239

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFG 219
           CG++      +    G+LGLG+G  S  +Q  ++    N   +CL+          + FG
Sbjct: 240 CGHDN--EGLFVGAAGLLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAKPSSIVFG 295

Query: 220 DD-LYDSSRVVWTSMSSDYTKYY-----------SP--GVAELFFGGETTGLKNLPVVFD 265
           D  +  ++R      +     +Y           +P  G++  FF  ++TG  N  V+ D
Sbjct: 296 DSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTG--NGGVIID 353

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG+S T L R  Y +L    +  + A  LK APE      C+        + +VK    T
Sbjct: 354 SGTSVTRLTRPAYVSLRDAFR--VGASHLKRAPEFSLFDTCYD----LSGLSEVK--VPT 405

Query: 326 LALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           + L F          L    YL+ + N G+ C          +  L++IG I
Sbjct: 406 VVLHFRGADV----SLPAANYLVPVDNSGSFCFAFAG----TMSGLSIIGNI 449


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 154/350 (44%), Gaps = 44/350 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y +T+YIG P        DTGSDL W+QC +PC  C     PL+ P    +     C+
Sbjct: 90  GEYLMTLYIGTPPVERLAIADTGSDLIWVQC-SPCQNCFPQDTPLFEPLKSSTFKAATCD 148

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLN-PRLAL 162
              C S+  P    C    QC Y   Y D   ++GV+  +  +F  T + Q ++ P    
Sbjct: 149 SQPCTSV-PPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIF 207

Query: 163 GCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
           GCG YN     +   + G++GLG G  S+VSQL  Q  I     +CL   S      L F
Sbjct: 208 GCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQ--IGYKFSYCLLPFSSNSTSKLKF 265

Query: 219 GDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------VVFDSGSSY 270
           G + +  ++ VV T +     K   P    L     T G K +P       ++ DSG+  
Sbjct: 266 GSEAIVTTNGVVSTPL---IIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVL 322

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TYL +  Y    + +++ LS +S ++      LP  +K   P++++         +A  F
Sbjct: 323 TYLEQTFYNNFVASLQEVLSVESAQD------LPFPFKFCFPYRDM-----TIPVIAFQF 371

Query: 331 TDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
           T         L P+  LI + ++  +CL ++  +   L  +++ G +  F
Sbjct: 372 TGASV----ALQPKNLLIKLQDRNMLCLAVVPSS---LSGISIFGNVAQF 414


>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
 gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
          Length = 499

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 105/247 (42%), Gaps = 19/247 (7%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
           +G P + + + LDTGSDL WL  QCD   P           Y P    ++  VPC    C
Sbjct: 114 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVPCNSNFC 173

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                     C    QC Y++ Y   G SS G LV+D    +  N   Q L  ++ LGCG
Sbjct: 174 DL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 228

Query: 166 YNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
             Q          +G+ GLG  + S+ S L  + L  N    C    G G + FGD    
Sbjct: 229 QTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQGSS 288

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
                  +++  +   Y+  ++ +  G + T L +   +FD+G+S+TYL    Y  +T  
Sbjct: 289 DQEETPLNINQQHPT-YAITISGITIGNKPTDL-DFITIFDTGTSFTYLADPAYTYITQS 346

Query: 285 MKKELSA 291
              ++ A
Sbjct: 347 FHAQVQA 353


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 93/328 (28%), Positives = 130/328 (39%), Gaps = 42/328 (12%)

Query: 58  GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV---------PCEDPIC 108
           G PA    + +DTGSDLTW+QC  PC  C     PL+ P+              C D + 
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 213

Query: 109 ASLHAPGH--HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           A+   PG          +C Y L Y DG  S GVL  D  A     G  L      GCG 
Sbjct: 214 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---GGASLGG-FVFGCGL 269

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFF--GD 220
           +      +    G++GLG+ + S+VSQ  S+     V  +CL    SG   G L    GD
Sbjct: 270 SNR--GLFGGTAGLMGLGRTELSLVSQTASR--YGGVFSYCLPAATSGDASGSLSLGGGD 325

Query: 221 DLYDSSR----VVWTSMSSDYTK--YYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYT 271
           D   S R    V +T M +D  +  +Y   V     GG      GL    V+ DSG+  T
Sbjct: 326 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVIT 385

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y+ + +   ++  A     AP    L  C+          +VK    TL L   
Sbjct: 386 RLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYD----LTGHDEVKVPLLTLRL--- 438

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGI 359
           +G      +     +++  +   VCL +
Sbjct: 439 EGGADVTVDAAGMLFVVRKDGSQVCLAM 466


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 130/314 (41%), Gaps = 33/314 (10%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQ---VHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
           +R+   SS  ++S   +  G +  F    + G    +G Y   + +G P +  ++ LDTG
Sbjct: 90  IRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTG 149

Query: 72  SDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDY 127
           SD+ WLQC APC  C     P++ P    S   V C  P+C  L +PG   C     C Y
Sbjct: 150 SDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPG---CNQRQTCLY 205

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGILGLGKG 186
           ++ Y DG  + G  V +   F  T  +    ++ALGCG+ N+        L G+   G  
Sbjct: 206 QVSYGDGSYTTGEFVTETLTFRRTKVE----QVALGCGHDNEGLFVGAAGLLGLGRGGLS 261

Query: 187 KSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY----- 241
             S   +  +QK    +V    S      +F    +  ++R      +     +Y     
Sbjct: 262 FPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELL 321

Query: 242 ------SP--GVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
                 +P  G+    F  + TG  N  V+ D G+S T LN+  Y  L    +    A S
Sbjct: 322 GISVGGTPVSGITASHFKLDRTG--NGGVIIDCGTSVTRLNKPAYIALRDAFRA--GASS 377

Query: 294 LKEAPEDETLPLCW 307
           LK APE      C+
Sbjct: 378 LKSAPEFSLFDTCY 391


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 95/350 (27%), Positives = 137/350 (39%), Gaps = 55/350 (15%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSND---- 99
           + T  Y     +G P +     +DTGS L W QC A C+R  CV    P +  S+     
Sbjct: 81  WATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTA-CLRKVCVRQDLPYFNASSSGSFA 139

Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
            VPC+D  CA  +    H C     C + + Y  GG  +G L  DAF F     Q     
Sbjct: 140 PVPCQDKACAGNYL---HFCALDGTCTFRVTYGAGGI-IGFLGTDAFTF-----QSGGAT 190

Query: 160 LALGC-GYNQVPGASY-HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
           LA GC  + +       H   G++GLG+G+ S+ SQ  +++    +  +  + G    LF
Sbjct: 191 LAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLF 250

Query: 218 FGDDLYDSS------RVVWTSMSSDY---TKYYSPGVAELFFGGETT------------- 255
            G     S        + +     DY   T YY P V      GET              
Sbjct: 251 VGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITV--GETKLAIPSTAFDLQEV 308

Query: 256 --GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE-TLPLCWKGRRP 312
             G     V+ DSGS +T L    Y+ L   + ++L+   +    ED+  + LC      
Sbjct: 309 EEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVA---- 364

Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
                D+ +   TL L F+ G       L PE Y     K   C+ I+ G
Sbjct: 365 ---RGDLDRVVPTLVLHFSGGAD---MALPPENYWAPLEKSTACMAIVRG 408


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 57/350 (16%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPIC 108
           V   +G+P  P  + +DTGSDL W+QC  PC  C     P++ PS       +  + PIC
Sbjct: 93  VNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 151

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYN 167
            +     +++     QC Y   YADG +S G L  +   F  ++ G      +  GCG++
Sbjct: 152 PNSPQKKYNHLN---QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 208

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
              G       GILGL  G  SIVS+L S+        +C+     G LF  D  Y  ++
Sbjct: 209 N-RGRFDGQQSGILGLSAGDQSIVSRLGSR------FSYCI-----GDLF--DPHYTHNQ 254

Query: 228 VVW---TSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVFDSGS 268
           +V      M    T +++  G   +   G + G   L                VV DSG+
Sbjct: 255 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 314

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTL 326
           + T+L +  +  L++ +++ +     +      T+P  LC+KGR     V++  + F  L
Sbjct: 315 TATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKGR-----VNEDLRGFPEL 367

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           A  F +G       L   +  +  N+   CL +L   E  L+++  + GI
Sbjct: 368 AFHFAEGAD---LVLDANSLFVQKNQDVFCLAVL---ESNLKNIGSVIGI 411


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 57/350 (16%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPIC 108
           V   +G+P  P  + +DTGSDL W+QC  PC  C     P++ PS       +  + PIC
Sbjct: 61  VNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYN 167
            +     +++     QC Y   YADG +S G L  +   F  ++ G      +  GCG++
Sbjct: 120 PNSPQKKYNHLN---QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 176

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
              G       GILGL  G  SIVS+L S+        +C+     G LF  D  Y  ++
Sbjct: 177 N-RGRFDGQQSGILGLSAGDQSIVSRLGSR------FSYCI-----GDLF--DPHYTHNQ 222

Query: 228 VVW---TSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVFDSGS 268
           +V      M    T +++  G   +   G + G   L                VV DSG+
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTL 326
           + T+L +  +  L++ +++ +     +      T+P  LC+KGR     V++  + F  L
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKGR-----VNEDLRGFPEL 335

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           A  F +G       L   +  +  N+   CL +L   E  L+++  + GI
Sbjct: 336 AFHFAEGAD---LVLDANSLFVQKNQDVFCLAVL---ESNLKNIGSVIGI 379


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 148/348 (42%), Gaps = 53/348 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + +++G P + + L LDTGSDL W+QC  PC  C E   P Y P +      + C
Sbjct: 192 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYACFEQNGPYYDPKDSSSFKNITC 250

Query: 104 EDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQ-----RL 156
            DP C  + +P     C+   Q C Y   Y D  ++ G    + F  N T  +     ++
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310

Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
              +  GCG +N+     +H   G+LGLG+G  S  +QL  Q L  +   +CL     + 
Sbjct: 311 VENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFATQL--QSLYGHSFSYCLVDRNSNS 365

Query: 211 GGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
                L FG+D  L     + +TS      +    +Y   +  +  GGE   +       
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHL 425

Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
                   + DSG++ TY     Y+ +     KE   + +K  P  ET P      +P  
Sbjct: 426 SAQGGGGTIIDSGTTLTYFAEPAYEII-----KEAFMRKIKGFPLVETFPPL----KPCY 476

Query: 315 NVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
           NV  V+K      A+ F DG    +++   E Y I I  +  VCL IL
Sbjct: 477 NVSGVEKMELPEFAILFADG---AMWDFPVENYFIQIEPEDVVCLAIL 521


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 57/350 (16%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPIC 108
           V   +G+P  P  + +DTGSDL W+QC  PC  C     P++ PS       +  + PIC
Sbjct: 61  VNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYN 167
            +     +++     QC Y   YADG +S G L  +   F  ++ G      +  GCG++
Sbjct: 120 PNSPQKKYNHLN---QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 176

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
              G       GILGL  G  SIVS+L S+        +C+     G LF  D  Y  ++
Sbjct: 177 N-RGRFDGQQSGILGLSAGDQSIVSRLGSR------FSYCI-----GDLF--DPHYTHNQ 222

Query: 228 VVW---TSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVFDSGS 268
           +V      M    T +++  G   +   G + G   L                VV DSG+
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTL 326
           + T+L +  +  L++ +++ +     +      T+P  LC+KGR     V++  + F  L
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKGR-----VNEDLRGFPEL 335

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           A  F +G       L   +  +  N+   CL +L   E  L+++  + GI
Sbjct: 336 AFHFAEGAD---LVLDANSLFVQKNQDVFCLAVL---ESNLKNIGSVIGI 379


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 139/357 (38%), Gaps = 62/357 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y   + +G PAR  ++ LDTGSD+ WLQC APC RC     P++ P        +PC
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPC 197

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
             P C  L + G +       C Y++ Y DG  ++G    +   F  N   G      +A
Sbjct: 198 SSPHCRRLDSAGCNTRRK--TCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VA 249

Query: 162 LGCGYNQ-------------------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           LGCG++                     PG + H  +      K    +V +  S K    
Sbjct: 250 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFN-----QKFSYCLVDRSASSKPSSV 304

Query: 203 VVGHCLSGGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
           V G+        F  L     L     V    +S   T+   PGV    F  +  G  N 
Sbjct: 305 VFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRV--PGVTASLFKLDQIG--NG 360

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            V+ DSG+S T L R  Y  +    +  + AK+LK AP       C+       N+++VK
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKRAPNFSLFDTCFD----LSNMNEVK 414

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
               T+ L F     R    L    YLI +   G  C          +  L++IG I
Sbjct: 415 --VPTVVLHF----RRADVSLPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNI 461


>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
          Length = 530

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 110/257 (42%), Gaps = 30/257 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVP---------- 102
           IG P   + + LD GSDL W+ CD  C++C       Y    R  N   P          
Sbjct: 106 IGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 163

Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF----AFNYTNGQRL 156
           C   +C S       NC+ P Q C Y + Y ++  SS G+L++D        +  +   +
Sbjct: 164 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 218

Query: 157 NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
              + +GCG  Q  G      P DG++GLG G+ S+ S L    L++N    C +    G
Sbjct: 219 RAPVIIGCGMRQTGGYLDGVAP-DGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSG 277

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
            +FFGD    + +      S    + Y  GV     G       +   + DSG+S+T+L 
Sbjct: 278 RIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLP 337

Query: 275 RVTYQTLTSIMKKELSA 291
             +Y+ +     K+++A
Sbjct: 338 DESYRNVVDEFDKQVNA 354


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 133/333 (39%), Gaps = 38/333 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSN----DLV 101
           TG Y V++ +G PAR   +  DTGSDL+W+QC  PC    C +   PL+ PS+      V
Sbjct: 151 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYKQQDPLFAPSDSSTFSAV 209

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--------NYTNG 153
            C    C +  + G    +D  +C YE+ Y D   + G L  D            +  N 
Sbjct: 210 RCGARECRARQSCGGSPGDD--RCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEND 267

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SG 210
            +L P    GCG N      +   DG+ GLG+GK S+ SQ   +        +CL   S 
Sbjct: 268 NKL-PGFVFGCGENNT--GLFGQADGLFGLGRGKVSLSSQAAGK--FGEGFSYCLPSSSS 322

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYT--KYYSPGVAELFFGGETTGLKN----LPVVF 264
              G+L  G  +   +   +T M +  T   +Y   +  +   G    + +    LP++ 
Sbjct: 323 SAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIV 382

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG+  T L    Y+ L +     +     K AP    L  C+     F    +      
Sbjct: 383 DSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYD----FTAHANATVSIP 438

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
            +AL F  G T     +     L ++     CL
Sbjct: 439 AVALVFAGGAT---ISVDFSGVLYVAKVAQACL 468


>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 511

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 110/257 (42%), Gaps = 30/257 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVP---------- 102
           IG P   + + LD GSDL W+ CD  C++C       Y    R  N   P          
Sbjct: 87  IGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 144

Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF----AFNYTNGQRL 156
           C   +C S       NC+ P Q C Y + Y ++  SS G+L++D        +  +   +
Sbjct: 145 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 199

Query: 157 NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
              + +GCG  Q  G      P DG++GLG G+ S+ S L    L++N    C +    G
Sbjct: 200 RAPVIIGCGMRQTGGYLDGVAP-DGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSG 258

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
            +FFGD    + +      S    + Y  GV     G       +   + DSG+S+T+L 
Sbjct: 259 RIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLP 318

Query: 275 RVTYQTLTSIMKKELSA 291
             +Y+ +     K+++A
Sbjct: 319 DESYRNVVDEFDKQVNA 335


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 126/294 (42%), Gaps = 39/294 (13%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           +R   ++ ++  +SL +  G        G  + +G Y   + +G P+    L +DTGSDL
Sbjct: 50  LRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDL 109

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ----CD 126
            WLQC +PC RC      ++ P        VPC  P C +L  PG   C+        C 
Sbjct: 110 VWLQC-SPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPG---CDSGGAAGGGCR 165

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
           Y + Y DG SS G L  D  AF   N   +N  + LGCG +      +    G+LG+G+G
Sbjct: 166 YMVAYGDGSSSTGDLATDKLAF--ANDTYVN-NVTLGCGRDNE--GLFDSAAGLLGVGRG 220

Query: 187 KSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-- 239
           K SI +Q+       +V  +CL           +L FG      S   +T++ S+  +  
Sbjct: 221 KISISTQV--APAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPS-TAFTALLSNPRRPS 277

Query: 240 YYSPGVAELFFGGE-TTGLKNLP-----------VVFDSGSSYTYLNRVTYQTL 281
            Y   +A    GGE  TG  N             VV DSG++ +   R  Y  L
Sbjct: 278 LYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAAL 331


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 74/278 (26%), Positives = 112/278 (40%), Gaps = 25/278 (8%)

Query: 42  HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL 100
           +G    TG Y V + +G PA  + +  DTGSD TW+QC  PCV  C     PL+ P+   
Sbjct: 87  YGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSA 145

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C    C+ L+  G   C     C Y ++Y DG  ++G   +D     Y   +  
Sbjct: 146 TYANISCSSSYCSDLYVSG---CSG-GHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNF 201

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
                 GCG        +    G+LGLG+GK+S+  Q + +     V  +CL  +  G G
Sbjct: 202 R----FGCGEKNR--GLFGRAAGLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTG 253

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET-----TGLKNLPVVFDSGSS 269
           FL  G     ++  +   +      +Y  G+  +  GG       +       + DSG+ 
Sbjct: 254 FLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTV 313

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            T L    Y  L S   K +       AP    L  C+
Sbjct: 314 ITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCY 351


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 118/284 (41%), Gaps = 33/284 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y  T+ +G P R + + +DTGSDLTW+QC +PC +C      L+ P+       + C 
Sbjct: 11  GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGKCYSQNDALFLPNTSTSFTKLACG 69

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C  L  P  +       C Y   Y DG  + G  V D    +  NGQ+   P  A G
Sbjct: 70  SALCNGLPFPMCNQ----TTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFG 125

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGGGGGFLFFGD 220
           CG++     S+   DGILGLG+G  S  SQL S    K    +V           L FGD
Sbjct: 126 CGHDN--EGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGD 183

Query: 221 D----LYDSSRVVWTSMSSDYTKYYSP-----------GVAELFFGGETTGLKNLPVVFD 265
                L D   +   +     T YY              ++   F  ++ G      +FD
Sbjct: 184 AAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG--TIFD 241

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
           SG++ T L    Y+ + + M     A S ++  +   L LC  G
Sbjct: 242 SGTTVTQLAEAAYKEVLAAMNASTMAYS-RKIDDISRLDLCLSG 284


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 73/278 (26%), Positives = 111/278 (39%), Gaps = 25/278 (8%)

Query: 42  HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL 100
           +G    TG Y V + +G PA  + +  DTGSD TW+QC  PCV  C     PL+ P+   
Sbjct: 152 YGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSA 210

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C    C+ L+  G         C Y ++Y DG  ++G   +D     Y   +  
Sbjct: 211 TYANISCSSSYCSDLYVSGCSG----GHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNF 266

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
                 GCG        +    G+LGLG+GK+S+  Q + +     V  +CL  +  G G
Sbjct: 267 R----FGCGEKNR--GLFGRAAGLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTG 318

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET-----TGLKNLPVVFDSGSS 269
           FL  G     ++  +   +      +Y  G+  +  GG       +       + DSG+ 
Sbjct: 319 FLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTV 378

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            T L    Y  L S   K +       AP    L  C+
Sbjct: 379 ITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCY 416


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 87/181 (48%), Gaps = 20/181 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P R  ++ +D+GSD+ W+QC+ PC +C     P++ P++      V C
Sbjct: 131 SGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSYAGVSC 189

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C+ +   G H      +C YE+ Y DG  + G L  +   F    G+ L   +A+G
Sbjct: 190 ASTVCSHVDNAGCHE----GRCRYEVSYGDGSYTKGTLALETLTF----GRTLIRNVAIG 241

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---GGFLFFGD 220
           CG++      +    G+LGLG G  S V QL  Q        +CL   G    G L FG 
Sbjct: 242 CGHHN--QGMFVGAAGLLGLGSGPMSFVGQLGGQA--GGTFSYCLVSRGIQSSGLLQFGR 297

Query: 221 D 221
           +
Sbjct: 298 E 298


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 79/167 (47%), Gaps = 16/167 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G + + + IG P R +   +DTGSDL W QC  PC +C +   P++ P        + C 
Sbjct: 364 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYKISCS 422

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLNPRLALG 163
             +C +L       C     C+Y   Y D  S+ GVL  + F F + T  Q   P L  G
Sbjct: 423 SELCGALPTS---TCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFG 478

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           CG N   G  +    G++GLG+G  S+VSQL  QK       +CL+ 
Sbjct: 479 CG-NDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKF-----AYCLTA 519


>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 26/255 (10%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPICASLH 112
           IG P   + + LD GSDL W+ CD  C++C       Y    R  N+  P       S H
Sbjct: 119 IGTPHVSFLVALDAGSDLLWVPCD--CLQCAPLSASYYSSLDRDLNEYSPSHS--STSKH 174

Query: 113 APGHH-------NCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF--AFNYTNGQRLNPR-- 159
               H       NC  P Q C Y ++Y  +  SS G+LV+D    A N  N    + R  
Sbjct: 175 LSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLASNGDNALSYSVRAP 234

Query: 160 LALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
           + +GCG  Q  G      P DG++GLG  + S+ S L    LIRN    C      G +F
Sbjct: 235 VVIGCGMKQSGGYLDGVAP-DGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRIF 293

Query: 218 FGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
           FGD    + +   + ++  +YT Y   GV     G       +   + D+G+S+T+L   
Sbjct: 294 FGDQGPTTQQSTPFLTLDGNYTTYVV-GVEGFCVGSSCLKQTSFRALVDTGTSFTFLPNG 352

Query: 277 TYQTLTSIMKKELSA 291
            Y+ +T    ++++A
Sbjct: 353 VYERITEEFDRQVNA 367


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 71/133 (53%), Gaps = 15/133 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN---- 98
           G +Y +G Y V + +G PAR  F+ +DTGSDL WLQC  PC  C +   P++ P N    
Sbjct: 121 GLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSF 179

Query: 99  DLVPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
             +PC  P+C +L     H+C       ++C Y++ Y DG  S+G    D F    T  +
Sbjct: 180 QRIPCLSPLCKALEI---HSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSK 235

Query: 155 RLNPRLALGCGYN 167
            ++  +A GCG++
Sbjct: 236 AMS--VAFGCGFD 246


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 139/357 (38%), Gaps = 62/357 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y   + +G PAR  ++ LDTGSD+ WLQC APC RC     P++ P        +PC
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPC 197

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
             P C  L + G +       C Y++ Y DG  ++G    +   F  N   G      +A
Sbjct: 198 SSPHCRRLDSAGCNTRRK--TCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VA 249

Query: 162 LGCGYNQ-------------------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
           LGCG++                     PG + H  +      K    +V +  S K    
Sbjct: 250 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFN-----QKFSYCLVDRSASSKPSSV 304

Query: 203 VVGHCLSGGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
           V G+        F  L     L     V    +S   T+   PGV    F  +  G  N 
Sbjct: 305 VFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRV--PGVTASLFKLDQIG--NG 360

Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            V+ DSG+S T L R  Y  +    +  + AK+LK AP+      C+       N+++VK
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKRAPDFSLFDTCFD----LSNMNEVK 414

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
               T+ L F          L    YLI +   G  C          +  L++IG I
Sbjct: 415 --VPTVVLHFRGADV----SLPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNI 461


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 77/259 (29%), Positives = 106/259 (40%), Gaps = 22/259 (8%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV- 101
           G    T  Y +T+ IG PA    + +DTGSD++W+QC  PC +C      L+ PS     
Sbjct: 123 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSASSTY 181

Query: 102 ---PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
               C    C  L      N    +QC Y + Y DG S+ G    D      T G     
Sbjct: 182 SPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTL----TLGSNAIK 237

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GC  ++  G S    DG++GLG    S+VSQ  +         +CL  + G  GFL
Sbjct: 238 GFQFGCSQSESGGFSDQ-TDGLMGLGGDAQSLVSQ--TAGTFGKAFSYCLPPTPGSSGFL 294

Query: 217 FFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSY 270
             G      S  V T M  S+    YY   +  +  GG+     T + +   V DSG+  
Sbjct: 295 TLG--AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMDSGTVI 352

Query: 271 TYLNRVTYQTLTSIMKKEL 289
           T L    Y  L+S  K  +
Sbjct: 353 TRLPPTAYSALSSAFKAGM 371


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 136/357 (38%), Gaps = 47/357 (13%)

Query: 48  TGYYNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LV 101
           +G Y +   IG P RP    L +DTGSDL W QC  PC  C + P PL+ PS       V
Sbjct: 84  SGEYLIHFNIGTP-RPQRVALTMDTGSDLVWTQC-TPCPVCFDQPFPLFDPSVSSTFRAV 141

Query: 102 PCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR- 159
            C DPIC          C     +C Y   Y D   + G + KD F F   NG+   P  
Sbjct: 142 ACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVA 201

Query: 160 ---LALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
              LA GCG YN    AS     GI G G+G  S+ SQL   +    +  H  +      
Sbjct: 202 VSGLAFGCGDYNTGVFASNE--SGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTS 259

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV---------- 262
             F     +  R   +         +SP     ++    G T G   LPV          
Sbjct: 260 AVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKD 319

Query: 263 -----VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                V DSG+  T      ++ L +    +L         E   L LC++  +  K V 
Sbjct: 320 GSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNL-LCFQRPKGGKQVP 378

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
             K  F    L+  D       +L  E Y+       V   ++NGAEV   D+ +IG
Sbjct: 379 VPKLIFH---LASAD------MDLPRENYIPEDTDSGVMCLMINGAEV---DMVLIG 423


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 71/133 (53%), Gaps = 15/133 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN---- 98
           G +Y +G Y V + +G PAR  F+ +DTGSDL WLQC  PC  C +   P++ P N    
Sbjct: 46  GLLYGSGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSF 104

Query: 99  DLVPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
             +PC  P+C +L     H+C       ++C Y++ Y DG  S+G    D F    T  +
Sbjct: 105 QRIPCLSPLCKALEV---HSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSK 160

Query: 155 RLNPRLALGCGYN 167
            ++  +A GCG++
Sbjct: 161 AMS--VAFGCGFD 171


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 95/328 (28%), Positives = 141/328 (42%), Gaps = 41/328 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSN----DLVPCED 105
           + VT+  G PA+ Y L +DTGSD++W+QC  PC   C +   P++ P+       VPC  
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQC-LPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
           P CA+        C +   C Y++ Y DG S+ GVL  +  + + T   R  P  A GCG
Sbjct: 220 PQCAAAGG----KCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSST---RDLPGFAFGCG 272

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL- 222
              +    +  +DG++GLG+G  S+ SQ  +         +CL       G+L  G    
Sbjct: 273 QTNL--GEFGGVDGLVGLGRGALSLPSQ--AAATFGATFSYCLPSYDTTHGYLTMGSTTP 328

Query: 223 ---YDSSRVVWTSM--SSDYTKYYSPGVAELFFGG-----ETTGLKNLPVVFDSGSSYTY 272
               D   V +T+M    DY   Y   V  +  GG       T       +FDSG+  TY
Sbjct: 329 AASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDSGTILTY 388

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    Y +L    K   +    K AP  +    C+     F   + +      +A  F+D
Sbjct: 389 LPPEAYASLRDRFK--FTMTQYKPAPAYDPFDTCYD----FTGHNAIF--MPAVAFKFSD 440

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGIL 360
           G    +F+L+P A LI  +      G L
Sbjct: 441 GA---VFDLSPVAILIYPDDTAPATGCL 465


>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 260

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 92/184 (50%), Gaps = 18/184 (9%)

Query: 30  FNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA 89
           +NH+  +    ++G++   GYY   +YIG P + + L +DTGS++T++ C      C + 
Sbjct: 29  YNHLHPNARMPLYGDILSYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKH 88

Query: 90  PHPLYRPSNDLVPCEDPICASLHAP--GHHNCEDP---AQCDYELEYADGGSSLGVLVKD 144
             P ++  +          +S + P   H +C+     +QC Y++ Y DG  S GVL +D
Sbjct: 89  EDPAFQTES----------SSTYQPVNCHPSCDCDYLRSQCSYKMHYGDGSYSRGVLAED 138

Query: 145 AFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
             +F   N     P RL  GC  + +        DGI+GLG+G+S+IV QL  + +I + 
Sbjct: 139 IISFG--NESEFAPQRLVFGCELDAIGSLYSLRADGIIGLGRGRSTIVDQLVDKGVISDS 196

Query: 204 VGHC 207
              C
Sbjct: 197 FSLC 200


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 143/345 (41%), Gaps = 57/345 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
           +G Y + + IG PA      +DTGSDL W QC+ PC +C   P P++ P +      +PC
Sbjct: 93  SGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPC 151

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           E   C  L +   +N      C Y   Y DG S+ G +  + F F  ++     P +A G
Sbjct: 152 ESQYCQDLPSESCYN-----DCQYTYGYGDGSSTQGYMATETFTFETSS----VPNIAFG 202

Query: 164 C-----GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF--- 215
           C     G+ Q  GA      G++G+G G  S+ SQL   +       +C++  G      
Sbjct: 203 CGEDNQGFGQGNGA------GLIGMGWGPLSLPSQLGVGQF-----SYCMTSSGSSSPST 251

Query: 216 LFFG---DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
           L  G     + + S       SS    YY   +  +  GG+  G+ +            +
Sbjct: 252 LALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGM 311

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           + DSG++ TYL +  Y  +      +++   + E+     L  C++       V      
Sbjct: 312 IIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDES--SSGLSTCFQLPSDGSTVQ----- 364

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
              +++ F  G    +  L  E  LI   +G +CL + + ++ G+
Sbjct: 365 VPEISMQFDGG----VLNLGEENVLISPAEGVICLAMGSSSQQGI 405


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 147/379 (38%), Gaps = 79/379 (20%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC------------VRCVEAPHPL 93
           Y  G Y+V   +G P++ + L  DTGSDLTW+ C   C            +R     H  
Sbjct: 7   YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 66

Query: 94  YRPSNDLVPCEDPICAS--LHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
              S   +PC   +C    +      NC  P   C Y+  Y+DG ++LG    +      
Sbjct: 67  LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 126

Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGH 206
             G+++    + +GC      G S+   DG++GLG  K S   +   +   K    +V H
Sbjct: 127 KEGRKMKLHNVLIGCS-ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 185

Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFFGGETTGLK 258
                   +L FG     S   +  +M+  YT+        +Y+  +  +  GG    + 
Sbjct: 186 LSHKNVSNYLTFGSS--RSKEALLNNMT--YTELVLGMVNSFYAVNMMGISIGG---AML 238

Query: 259 NLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            +P            + DSGSS T+L    YQ + + ++  L                  
Sbjct: 239 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL-----------------L 281

Query: 308 KGRRPFKNVHDVKKCFRT----------LALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
           K R+   ++  ++ CF +          L   F DG     FE   ++Y+I +  G  CL
Sbjct: 282 KFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAE---FEPPVKSYVISAADGVRCL 338

Query: 358 GILNGAEVGLQDLNVIGGI 376
           G ++ A  G    +V+G I
Sbjct: 339 GFVSVAWPG---TSVVGNI 354


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 130/330 (39%), Gaps = 36/330 (10%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRP----S 97
           G  Y  G Y   M +G PA+PY + +DTGS LTWLQC +PC V C     P++ P    S
Sbjct: 129 GTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSS 187

Query: 98  NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
              V C  P C  L     +   C     C Y+  Y D   S+G L KD  +F    G  
Sbjct: 188 YAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF----GSN 243

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
             P    GCG +      +    G++GL + K S++ QL     +     +CL       
Sbjct: 244 SVPNFYYGCGQDN--EGLFGRSAGLMGLARNKLSLLYQL--APTLGYSFSYCLPSSSSSG 299

Query: 216 LFFGDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
                  Y+  +  +T M S        + K     VA       ++   +LP + DSG+
Sbjct: 300 YLSI-GSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGT 358

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             T L    Y  L+  +   +  K  K A     L  C+ G+     V  V       ++
Sbjct: 359 VITRLPTTVYDALSKAVAGAM--KGTKRADAYSILDTCFVGQASSLRVPAV-------SM 409

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLG 358
           +F+ G      +L+ +  L+  +    CL 
Sbjct: 410 AFSGGAA---LKLSAQNLLVDVDSSTTCLA 436


>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
 gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 500

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 104/247 (42%), Gaps = 19/247 (7%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
           +G P + + + LDTGSDL WL  QCD   P           Y P    ++  VPC    C
Sbjct: 115 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVPCNSNFC 174

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                     C    QC Y++ Y   G SS G LV+D    +  N   Q L  ++ LGCG
Sbjct: 175 DL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 229

Query: 166 YNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
             Q          +G+ GLG  + S+ S L  + L  N    C    G G + FGD    
Sbjct: 230 QTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESS 289

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
                   ++  +   Y+  ++ +  G + T + +   +FD+G+S+TYL    Y  +T  
Sbjct: 290 DQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLADPAYTYITQS 347

Query: 285 MKKELSA 291
              ++ A
Sbjct: 348 FHAQVQA 354


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 101/350 (28%), Positives = 153/350 (43%), Gaps = 62/350 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y V +Y+G P R + + +DTGSDL WLQC APC+ C +   P++ P    S   V C
Sbjct: 147 SGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFDQRGPVFDPMASTSYRNVTC 205

Query: 104 EDPICASLHAPGH-HNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLN 157
            D  C  +  P     C     DP  C Y   Y D  ++ G L  +AF  N T +  R  
Sbjct: 206 GDTRCGLVSPPAAPRTCRSSRSDP--CPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRV 263

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLSGGG- 212
             + LGCG+       +H   G+LGLG+G  S  SQL      R V GH    CL   G 
Sbjct: 264 DGVVLGCGHRN--RGLFHGAAGLLGLGRGPLSFASQL------RAVYGHAFSYCLVDHGS 315

Query: 213 --GGFLFFGDD--LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
             G  + FGDD  L    ++ +T+   S+    +Y   +  +  GGE   + ++P     
Sbjct: 316 AVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGE---MLDIPSNTWG 372

Query: 262 ---------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
                     + DSG++ +Y     Y+ +    ++    +  K  P     P+      P
Sbjct: 373 VSKEDGSGGTIIDSGTTLSYFPEPAYKAI----RQAFVDRMDKAYPLIADFPVL----SP 424

Query: 313 FKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
             NV  V++      +L F DG    +++   E Y I +  +G +CL +L
Sbjct: 425 CYNVSGVERVEVPEFSLLFADG---AVWDFPAENYFIRLDTEGIMCLAVL 471


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 147/379 (38%), Gaps = 79/379 (20%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC------------VRCVEAPHPL 93
           Y  G Y+V   +G P++ + L  DTGSDLTW+ C   C            +R     H  
Sbjct: 78  YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137

Query: 94  YRPSNDLVPCEDPICAS--LHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
              S   +PC   +C    +      NC  P   C Y+  Y+DG ++LG    +      
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197

Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGH 206
             G+++    + +GC      G S+   DG++GLG  K S   +   +   K    +V H
Sbjct: 198 KEGRKMKLHNVLIGCS-ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256

Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFFGGETTGLK 258
                   +L FG     S   +  +M+  YT+        +Y+  +  +  GG    + 
Sbjct: 257 LSHKNVSNYLTFGSS--RSKEALLNNMT--YTELVLGMVNSFYAVNMMGISIGG---AML 309

Query: 259 NLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            +P            + DSGSS T+L    YQ + + ++  L                  
Sbjct: 310 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL-----------------L 352

Query: 308 KGRRPFKNVHDVKKCFRT----------LALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
           K R+   ++  ++ CF +          L   F DG     FE   ++Y+I +  G  CL
Sbjct: 353 KFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAE---FEPPVKSYVISAADGVRCL 409

Query: 358 GILNGAEVGLQDLNVIGGI 376
           G ++ A  G    +V+G I
Sbjct: 410 GFVSVAWPG---TSVVGNI 425


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/347 (28%), Positives = 142/347 (40%), Gaps = 58/347 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSN----DLVPC 103
           G Y +T+ IG P  PY    DTGSDL W QC APC  +C E P PLY P++     ++PC
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC 170

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLNPRLAL 162
              +     A           C Y   Y  G ++ GV   + F F  +   Q   P +A 
Sbjct: 171 NSSLSMCAGALAGAAPPPGCACMYYQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAF 229

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
           GC  +    + ++   G++GLG+G  S+VSQL + +       +CL+        F D  
Sbjct: 230 GC--SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRF-----SYCLTP-------FQDTN 275

Query: 223 YDSSRVVWTSMSSDYTKYYS-PGVAE-----------LFFGGETTGLKNLPV-------- 262
             S+ ++  S + + T   S P VA            L   G + G K LP+        
Sbjct: 276 STSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLK 335

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET-LPLCWKGRRPFK 314
                  + DSG++ T L    YQ + + +K +L          D T L LC+    P  
Sbjct: 336 PDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTS 395

Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
               V     ++ L F DG    L    P    +IS  G  CL + N
Sbjct: 396 APPAV---LPSMTLHF-DGADMVL----PADSYMISGSGVWCLAMRN 434


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 133/336 (39%), Gaps = 43/336 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y V + +G PA+ + + +DTGS L+WLQC    + C     P++ PS       +PC
Sbjct: 110 SGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPC 169

Query: 104 -----EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
                     ++L+APG  N      C Y+  Y D   S+G L +D      T  +  + 
Sbjct: 170 SSSQCSSLKSSTLNAPGCSNAT--GACVYKASYGDTSFSIGYLSQDVLTL--TPSEAPSS 225

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG------ 212
               GCG +      +    GI+GL   K S++ QL   K   N   +CL          
Sbjct: 226 GFVYGCGQDN--QGLFGRSSGIIGLANDKISMLGQL--SKKYGNAFSYCLPSSFSAPNSS 281

Query: 213 --GGFLFFGDDLYDSSRVVWTSMSSDYT--KYYSPGVAELFFGGETTGLK----NLPVVF 264
              GFL  G     SS   +T +  +      Y   +  +   G+  G+     N+P + 
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTII 341

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCF 323
           DSG+  T L    Y  L       +S K   +AP    L  C+KG  +    V +++  F
Sbjct: 342 DSGTVITRLPVAVYNALKKSFVLIMS-KKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIF 400

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
           R  A            EL     L+   KG  CL I
Sbjct: 401 RGGA----------GLELKAHNSLVEIEKGTTCLAI 426


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/293 (28%), Positives = 130/293 (44%), Gaps = 41/293 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           TG Y + M++G P +  +L LDTGSDL+W+QCD PC  C E   P Y P+       + C
Sbjct: 167 TGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGPHYNPNESSSYRNISC 225

Query: 104 EDPICASLHAPG--HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNPR 159
            DP C  + +P    H   +   C Y  +YADG ++ G    + F  N T  NG+     
Sbjct: 226 YDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKH 285

Query: 160 LA---LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----G 211
           +     GCG+       +H   G+LGLG+G  S  SQL  Q +  +   +CL+       
Sbjct: 286 VVDVMFGCGHWN--KGFFHGAGGLLGLGRGPLSFPSQL--QSIYGHSFSYCLTDLFSNTS 341

Query: 212 GGGFLFFGDD--LYDSSRVVWTSM-----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
               L FG+D  L +   + +T +     + D T YY   +  +  GGE   +       
Sbjct: 342 VSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYL-QIKSIVVGGEVLDIPEKTWHW 400

Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                   + DSGS+ T+     Y  +    +K++  + +  A +D  +  C+
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQI--AADDFIMSPCY 451


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/274 (27%), Positives = 112/274 (40%), Gaps = 42/274 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P    +L +D+GSD+ W+QC  PC +C     PL+ P+       V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC +L   G     D  +CDY + Y DG  + G L  +      T  Q     +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLY 223
           CG+       +    G+LGLG G  S+V QL        V  +CL+  G G         
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGGAA--GGVFSYCLASRGAG--------- 288

Query: 224 DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGSSYTYL 273
                      S  + +Y  G+  +  GGE   L++            VV D+G++ T L
Sbjct: 289 --------GAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRL 340

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            R  Y  L       + A  L  +P    L  C+
Sbjct: 341 PREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 372


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/294 (28%), Positives = 125/294 (42%), Gaps = 39/294 (13%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           +R   ++ ++  +SL +  G        G  + +G Y   + +G P+    L +DTGSDL
Sbjct: 50  LRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDL 109

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ----CD 126
            WLQC +PC RC      ++ P        VPC  P C +L  PG   C+        C 
Sbjct: 110 VWLQC-SPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPG---CDSGGAAGGGCR 165

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
           Y + Y DG SS G L  D  AF   N   +N  + LGCG +      +    G+LG+ +G
Sbjct: 166 YMVAYGDGSSSTGELATDKLAF--ANDTYVN-NVTLGCGRDNE--GLFDSAAGLLGVARG 220

Query: 187 KSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-- 239
           K SI +Q+       +V  +CL           +L FG      S   +T++ S+  +  
Sbjct: 221 KISISTQV--APAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPS-TAFTALLSNPRRPS 277

Query: 240 YYSPGVAELFFGGE-TTGLKNLP-----------VVFDSGSSYTYLNRVTYQTL 281
            Y   +A    GGE  TG  N             VV DSG++ +   R  Y  L
Sbjct: 278 LYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAAL 331


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 126/291 (43%), Gaps = 47/291 (16%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
           G    +G Y V++ +G P +   L  DTGSDLTW QC  PC R C     P++ PS    
Sbjct: 123 GATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQ-PCARYCYNQKDPVFVPSQSTT 181

Query: 101 ---VPCEDPICASLHA-----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
              + C  P C+ L +     PG   C     C Y ++Y D   S+G   K+      T+
Sbjct: 182 YSNISCSSPDCSQLESGTGNQPG---CSAARACIYGIQYGDQSFSVGYFAKETLTLTSTD 238

Query: 153 GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SG 210
              +      GCG N      +    G++GLG+ K SIV Q  +QK    V  +CL  + 
Sbjct: 239 ---VIENFLFGCGQNNR--GLFGSAAGLIGLGQDKISIVKQT-AQKY-GQVFSYCLPKTS 291

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPV---- 262
              G+L FG      + + +T ++  +      GVA  F+G +  G+K     +P+    
Sbjct: 292 SSTGYLTFGGGGGGGA-LKYTPITKAH------GVAN-FYGVDIVGMKVGGTQIPISSSV 343

Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                 + DSG+  T L    Y  L S  +K ++     +APE   L  C+
Sbjct: 344 FSTSGAIIDSGTVITRLPPDAYSALKSAFEKGMA--KYPKAPELSILDTCY 392


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 12/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           + G    +G Y   + +G P R  ++ LDTGSD+ W+QC  PC +C     PL+ P+   
Sbjct: 143 ISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQC-LPCAKCYGQTDPLFNPAASS 201

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               VPC  P+C  L   G   C +   C+Y++ Y DG  ++G    +   F    GQ +
Sbjct: 202 TYRKVPCATPLCKKLDISG---CRNKRYCEYQVSYGDGSFTVGDFSTETLTF---RGQVI 255

Query: 157 NPRLALGCGYN 167
             R+ALGCG++
Sbjct: 256 R-RVALGCGHD 265


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 90/349 (25%), Positives = 140/349 (40%), Gaps = 61/349 (17%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G+Y + + IG P    +   DTGSDLTW  C  PC +C +  +P++ P        + C+
Sbjct: 23  GHYLMEVSIGTPPFKIYGIADTGSDLTWTSC-VPCNKCYKQRNPIFDPQKSTSYRNISCD 81

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
             +C  L       C     C+Y   YA    + GVL ++    + T G+ +  + +  G
Sbjct: 82  SKLCHKLDT---GVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFG 138

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF----LFFG 219
           CG+N   G +   + GI+GLG G  S +SQ+ S            S GG  F    + F 
Sbjct: 139 CGHNNTGGFNDREM-GIIGLGGGPVSFISQIGS------------SFGGKRFSQCLVPFH 185

Query: 220 DDLYDSSR-------------VVWTSM--SSDYTKYY------SPGVAELFFGGETT-GL 257
            D+  SS+             VV T +    D T Y+      S G   L F G ++  +
Sbjct: 186 TDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSV 245

Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
           +   V  DSG+  T L    Y  L + ++ E++ K +     D    LC++ +   +   
Sbjct: 246 EKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTND-LDLGPQLCYRTKNNLRG-- 302

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
                   L   F  G  +    L P    +    G  CLG  N +  G
Sbjct: 303 ------PVLTAHFEGGDVK----LLPTQTFVSPKDGVFCLGFTNTSSDG 341


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 73/142 (51%), Gaps = 14/142 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G P +  ++ LDTGSD+ W+QC APC +C     P++ P    S   + C
Sbjct: 171 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISC 229

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C  L +PG   C     C Y++ Y DG  + G    +   F    G R+ P++ALG
Sbjct: 230 RSPLCLRLDSPG---CNSRQSCLYQVAYGDGSFTFGEFSTETLTF---RGTRV-PKVALG 282

Query: 164 CGYNQVPGASYHPLDGILGLGK 185
           CG++      +    G+LGLG+
Sbjct: 283 CGHDNE--GLFVGAAGLLGLGR 302


>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
          Length = 519

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 145/350 (41%), Gaps = 59/350 (16%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVEA-------PHPLYRPS--- 97
           Y NV+  +G PA  + + LDTGSDL WL C+  + C+R ++        P  LY P+   
Sbjct: 103 YANVS--VGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160

Query: 98  -NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
            +  + C D  C              + C Y+++Y    + + G L +D      T  + 
Sbjct: 161 TSSSIRCSDDRCFGSSRCSSPA----SSCPYQIQYLSKDTFTTGTLFEDVLHL-VTEDEG 215

Query: 156 LNP---RLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
           L P    + LGCG NQ     S   ++G+LGLG    S+ S L   K+  N    C    
Sbjct: 216 LEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNI 275

Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
               G + FGD  Y       T           P V E+  GG+  G++ L  +FD+G+S
Sbjct: 276 IDVVGRISFGDKGY-------TDQMETPLLPTEPSVTEVSVGGDAVGVQ-LLALFDTGTS 327

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFR 324
           +T+L    Y  +T      ++ K     PE            PF+  +D+        F 
Sbjct: 328 FTHLLEPEYGLITKAFDDHVTDKRRPIDPE-----------LPFEFCYDLSPNKTTILFP 376

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
            +A++F  G    +F   P   L I N    CLGIL   +     +N+IG
Sbjct: 377 RVAMTFEGGS--QMFLRNP---LFIDNSAMYCLGILKSVDF---KINIIG 418


>gi|399218365|emb|CCF75252.1| unnamed protein product [Babesia microti strain RI]
          Length = 535

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 70/313 (22%), Positives = 132/313 (42%), Gaps = 32/313 (10%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G      ++G ++   YY + ++IG P    ++ LDTGS L  + C   C++C    +P 
Sbjct: 163 GKKFKIPIYGTLHDFAYYFIKIFIGTPPSVQWVVLDTGSSLLGITC-GNCIQCGNHQNPN 221

Query: 94  YRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN- 152
           Y P       +   C  ++      C+   +C +   Y++G    G    D  +F+ ++ 
Sbjct: 222 YEPYESATAIK---CTDVNQCKLKGCD---ECRFMQHYSEGSFISGDYYTDVISFDKSSP 275

Query: 153 GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           G + N    LGC   +         +GI G+     SI+SQL  +  I N+   CLS  G
Sbjct: 276 GYKFN---NLGCVLYENKLIYNQRANGIFGMSPNDDSIISQLFKRPEIDNIFSICLSDEG 332

Query: 213 GGFLFFGDD-----LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
           G  +  G +     + ++S + WT +++D   Y    +  + +  +   + N     DSG
Sbjct: 333 GELIIGGIEPELFNIKNNSEMAWTRLNTDNNYYIH--INSMSYLSDHVEITNTKFSIDSG 390

Query: 268 SSYTYLNRVTYQTLTS------IMKKELSAKSL-------KEAPEDETLPLCWKGRRPFK 314
           ++ T L    Y+++ +       M +E+    L       ++ P+D    +  +     K
Sbjct: 391 TTNTVLMEKMYKSIVNGVMNICFMDREIEGYDLDIGVTVIQKKPDDIVDLMIEREENVTK 450

Query: 315 -NVHDVKKCFRTL 326
             +HD + C R +
Sbjct: 451 CEIHDDEICSRNI 463


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 55/153 (35%), Positives = 74/153 (48%), Gaps = 12/153 (7%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y  T+ +G P R + + +DTGSDLTW+QC +PC  C      L+ P+       + C 
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGTCYSQNDSLFIPNTSTSFTKLACG 59

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C  L  P  +       C Y   Y DG  S G  V D    +  NGQ+   P  A G
Sbjct: 60  TELCNGLPYPMCNQ----TTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFG 115

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           CG++     S+   DGILGLG+G  S  SQL +
Sbjct: 116 CGHDNE--GSFAGADGILGLGQGPLSFPSQLKT 146


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 138/344 (40%), Gaps = 43/344 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +   +G P+       DTGSDL+WLQC  PC  C     PL+ P+       VPCE
Sbjct: 86  GEYLMRFSLGTPSVERLAIFDTGSDLSWLQC-TPCKTCYPQEAPLFDPTQSSTYVDVPCE 144

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT---NGQRLNPRLA 161
              C +L       C    QC Y  +Y     ++G L  D  +F+ T    G    P+  
Sbjct: 145 SQPC-TLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV 203

Query: 162 LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLF 217
            GC  Y+          +G +GLG G  S+ SQL  Q  I +   +C+   S    G L 
Sbjct: 204 FGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGKLK 261

Query: 218 FGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGET--TGLKNLPVVFDSGSSYTYL 273
           FG  +  ++ VV T   ++  Y  YY   +  +  G +   TG     ++ DS    T+L
Sbjct: 262 FG-SMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHL 320

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
            +  Y    S +K+ ++     E  ED   P  +  R P  N++     F      FT  
Sbjct: 321 EQGIYTDFISSVKEAINV----EVAEDAPTPFEYCVRNP-TNLN-----FPEFVFHFTGA 370

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGIL---------NGAEVGLQ 368
                  L P+   I  +   VC+ ++         N A+V  Q
Sbjct: 371 DVV----LGPKNMFIALDNNLVCMTVVPSKGISIFGNWAQVNFQ 410


>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 544

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 77/272 (28%), Positives = 113/272 (41%), Gaps = 38/272 (13%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-------------VPC 103
           +G P   + + LDTGSDL WL C+  C  CV           DL             VPC
Sbjct: 119 VGTPPLWFLVALDTGSDLFWLPCN--CTSCVRGLKTQNGKVIDLNIYELDKSSTRKNVPC 176

Query: 104 EDPICASL--HAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNP 158
              +C     H+ G       + C YE+EY ++  SS G LV+D       N Q   ++ 
Sbjct: 177 NSNMCKQTQCHSSG-------SSCRYEVEYLSNDTSSSGFLVEDVLHLITDNDQTKDIDT 229

Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           ++ +GCG  Q    + GA+    +G+ GLG    S+ S L  + LI +    C    G G
Sbjct: 230 QITIGCGQVQTGVFLNGAA---PNGLFGLGMENVSVPSILAQKGLISDSFSMCFGSDGSG 286

Query: 215 FLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
            + FGD    D  +  +    S  T  Y+  + ++  GG          +FDSG+S+TYL
Sbjct: 287 RITFGDTGSSDQGKTPFNLRESHPT--YNVTITQIIVGGYAAD-HEFHAIFDSGTSFTYL 343

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           N   Y  ++      + A        D  LP 
Sbjct: 344 NDPAYTLISEKFNSLVKANRHSPLSPDSDLPF 375


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 78/280 (27%), Positives = 115/280 (41%), Gaps = 34/280 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G P +  ++ LDTGSD+ W+QC APC +C     P++ P    S   + C
Sbjct: 144 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISC 202

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C  L +PG   C     C Y++ Y DG  + G    +   F    G R+ P++ALG
Sbjct: 203 RSPLCLRLDSPG---CNSRQSCLYQVAYGDGSFTFGEFSTETLTF---RGTRV-PKVALG 255

Query: 164 CGYNQ-------------VPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCLS 209
           CG++                G    P    L  G+  S  +V +  S K    V G    
Sbjct: 256 CGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQSAV 315

Query: 210 GGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
                F  L     L     +  T +S    +    G+    F  +T G  N  V+ DSG
Sbjct: 316 SRTAVFTPLITNPKLDTFYYLELTGISVGGARV--AGITASLFKLDTAG--NGGVIIDSG 371

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           +S T L R  Y +L    +    A  LK AP+      C+
Sbjct: 372 TSVTRLTRRAYVSLRDAFRA--GAADLKRAPDYSLFDTCF 409


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 134/323 (41%), Gaps = 38/323 (11%)

Query: 58  GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLH 112
           G PA    + +DTGSDLTW+QC  PC  C     PL+ P+       V C    C ASL 
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACAASLK 255

Query: 113 A----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
           A    PG     +  +C Y L Y DG  S GVL  D  A     G  L+     GCG + 
Sbjct: 256 AATGTPGSCGGGNE-RCYYALAYGDGSFSRGVLATDTVAL---GGASLDG-FVFGCGLSN 310

Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL-- 222
                +    G++GLG+ + S+VSQ  +      V  +CL    SG   G L  G D   
Sbjct: 311 R--GLFGGTAGLMGLGRTELSLVSQ--TALRYGGVFSYCLPATTSGDASGSLSLGGDASS 366

Query: 223 -YDSSRVVWTSMSSDYTK--YYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRV 276
             +++ V +T M +D  +  +Y   V     GG      GL    V+ DSG+  T L   
Sbjct: 367 YRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPS 426

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR 336
            Y+ + +   ++ +A     AP    L  C+          +VK    TL L   +G   
Sbjct: 427 VYRGVRAEFTRQFAAAGYPTAPGFSILDTCYD----LTGHDEVKVPLLTLRL---EGGAE 479

Query: 337 TLFELTPEAYLIISNKGNVCLGI 359
              +     +++  +   VCL +
Sbjct: 480 VTVDAAGMLFVVRKDGSQVCLAM 502


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 90/343 (26%), Positives = 138/343 (40%), Gaps = 68/343 (19%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPIC 108
           G + + + IG P   Y   +DTGSDL W QC  PC +C + P P++ P       +    
Sbjct: 98  GEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPSPIFDPKKSSSFSKLSCS 156

Query: 109 ASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
           + L  A    +C D   C+Y   Y D  S+ G +  + F F    G+   P +  GCG +
Sbjct: 157 SQLCKALPQSSCSD--SCEYLYTYGDYSSTQGTMATETFTF----GKVSIPNVGFGCGED 210

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
              G  +    G++GLG+G  S+VSQL   K       +CL+          DD   S+ 
Sbjct: 211 N-EGDGFTQGSGLVGLGRGPLSLVSQLKEAKF-----SYCLTS--------IDDTKTSTL 256

Query: 228 VVWTSMSSDYTKY-----------YSPGVAELFFGGETTGLKNLPV-------------- 262
           ++ +  S + T               P    L   G + G   LP+              
Sbjct: 257 LMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGG 316

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVH 317
            + DSG++ TYL    +     ++KKE +++     P D +    L LC+        + 
Sbjct: 317 LIIDSGTTITYLEESAFD----LVKKEFTSQ--MGLPVDNSGATGLELCYNLPSDTSELE 370

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGI 359
             K     L L FT        EL  E Y+I  S+ G +CL +
Sbjct: 371 VPK-----LVLHFTGAD----LELPGENYMIADSSMGVICLAM 404


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 143/329 (43%), Gaps = 40/329 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y VTM +G  ++   + +DTGSDLTW+QC+ PC+ C     P+++P    S   V C   
Sbjct: 65  YIVTMGLG--SKNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 107 ICASLH----APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
            C SL       G     +P+ C+Y + Y DG  + G L  +A +F    G         
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF----GGVSVSDFVF 177

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFG 219
           GCG N      +  + G++GLG+   S+VSQ ++      V  +CL     G  G L  G
Sbjct: 178 GCGRNN--KGLFGGVSGLMGLGRSYLSLVSQTNAT--FGGVFSYCLPTTEAGSSGSLVMG 233

Query: 220 DD---LYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSY 270
           ++     +++ + +T M S+   + +Y   +  +  GG          N  ++ DSG+  
Sbjct: 234 NESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVI 293

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y+ L +   K+ +      AP    L  C+          +V     T++L F
Sbjct: 294 TRLPSSVYKALKAEFLKKFTG--FPSAPGFSILDTCFN----LTGYDEVS--IPTISLRF 345

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGI 359
            +G  +   + T   Y++  +   VCL +
Sbjct: 346 -EGNAQLNVDATGTFYVVKEDASQVCLAL 373


>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
 gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 95/353 (26%), Positives = 154/353 (43%), Gaps = 55/353 (15%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVEA-------PHPLYRPS--- 97
           Y NV+  +G PA  + + LDTGSDL WL C+  + C+R ++        P  LY P+   
Sbjct: 103 YANVS--VGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160

Query: 98  -NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
            +  + C D  C              + C Y+++Y    + + G L +D      T  + 
Sbjct: 161 TSSSIRCSDDRCFGSSRCSSPA----SSCPYQIQYLSKDTFTTGTLFEDVLHL-VTEDEG 215

Query: 156 LNP---RLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
           L P    + LGCG NQ     S   ++G+LGLG    S+ S L   K+  N    C    
Sbjct: 216 LEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNI 275

Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
               G + FGD  Y + ++    + ++ +  Y+  V E+  GG+  G++ L  +FD+G+S
Sbjct: 276 IDVVGRISFGDKGY-TDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQ-LLALFDTGTS 333

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFR 324
           +T+L    Y  +T      ++ K     PE            PF+  +D+        F 
Sbjct: 334 FTHLLEPEYGLITKAFDDHVTDKRRPIDPE-----------LPFEFCYDLSPNKTTILFP 382

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGN---VCLGILNGAEVGLQDLNVIG 374
            +A++F  G    +F   P    I+ N+ N    CLGIL   +     +N+IG
Sbjct: 383 RVAMTFEGGS--QMFLRNP--LFIVWNEDNSAMYCLGILKSVDF---KINIIG 428


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 102/399 (25%), Positives = 160/399 (40%), Gaps = 63/399 (15%)

Query: 5   HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
            + E + +   R+S +    SS S  + V    L    G++  +G Y V + +G P R  
Sbjct: 102 QDKERVKYINSRISKNLGQDSSVSELDSV---TLPAKSGSLIGSGNYFVVVGLGTPKRDL 158

Query: 65  FLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLH-APGHH- 117
            L  DTGSDLTW QC+ PC R C +    ++ PS       + C   +C  L  A G+  
Sbjct: 159 SLIFDTGSDLTWTQCE-PCARSCYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEP 217

Query: 118 NCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP 176
            C    + C Y ++Y D   S+G   ++  +   T+   +      GCG N      +  
Sbjct: 218 GCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD---IVDNFLFGCGQNN--QGLFGG 272

Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS 234
             G++GLG+   S V Q  +  + R +  +CL  +    G L FG           T+  
Sbjct: 273 SAGLIGLGRHPISFVQQ--TAAVYRKIFSYCLPATSSSTGRLSFG---------TTTTSY 321

Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLPV----------VFDSGSSYTYLNRVTYQT 280
             YT + +      F+G + TG+      LPV          + DSG+  T L    Y  
Sbjct: 322 VKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVITRLPPTAYTA 381

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
           L S  ++ +S      A E   L  C+   G   F            +  SF  G T   
Sbjct: 382 LRSAFRQGMS--KYPSAGELSILDTCYDLSGYEVFS--------IPKIDFSFAGGVT--- 428

Query: 339 FELTPEAYLIISNKGNVCLGI-LNGAEVGLQDLNVIGGI 376
            +L P+  L +++   VCL    NG +    D+ + G +
Sbjct: 429 VQLPPQGILYVASAKQVCLAFAANGDD---SDVTIYGNV 464


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 96/343 (27%), Positives = 139/343 (40%), Gaps = 35/343 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS----NDLVPC 103
           +G Y V + +G P + Y + LDTGS L+WLQC    V C     PLY PS       + C
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSC 181

Query: 104 EDPICASLHAPGHHN--CE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
               C+ L A   ++  CE D   C Y   Y D   S+G L +D      T+ Q L P+ 
Sbjct: 182 ASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTL--TSSQTL-PQF 238

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
             GCG  Q     +    GI+GL + K S+++QL ++    +   +CL     G    G 
Sbjct: 239 TYGCG--QDNQGLFGRAAGIIGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGF 294

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT---------GLKNLPVVFDSGSSYT 271
               S        +   T   +P +  L     T           +  +P + DSG+  T
Sbjct: 295 LSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVIT 354

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y  L     K +S K  K AP    L  C+KG    K++  V +    + + F 
Sbjct: 355 RLPMSMYAALRQAFVKIMSTKYAK-APAYSILDTCFKGS--LKSISAVPE----IKMIFQ 407

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
            G   T   L   + LI ++KG  CL        G   + +IG
Sbjct: 408 GGADLT---LRAPSILIEADKGITCLAF--AGSSGTNQIAIIG 445


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 137/328 (41%), Gaps = 43/328 (13%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCE 104
           V+    Y + + +G P       +DTGS++TW QC  PCV C E   P++ PS      E
Sbjct: 59  VFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQC-LPCVHCYEQNAPIFDPSKSSTFKE 117

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
                     GH        C YE++Y D   ++G L  +    + T+G+  + P   +G
Sbjct: 118 K------RCDGH-------SCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIG 164

Query: 164 CGYNQVPGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG-DD 221
           CG+N    + + P   G++GL  G SS+++Q+  +     ++ +C SG G   + FG + 
Sbjct: 165 CGHNN---SWFKPSFSGMVGLNWGPSSLITQMGGEY--PGLMSYCFSGQGTSKINFGANA 219

Query: 222 LYDSSRVVWTSMSSDYTK---YY------SPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
           +     VV T+M     K   YY      S G   +   G T       +V DSG++ TY
Sbjct: 220 IVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTY 279

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
              V+Y  L     + +        P    + LC+          D    F  + + F+ 
Sbjct: 280 F-PVSYCNLVRQAVEHVVTAVRAADPTGNDM-LCYNS--------DTIDIFPVITMHFSG 329

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGIL 360
           G    L +     Y+  +N G  CL I+
Sbjct: 330 GVDLVLDKY--NMYMESNNGGVFCLAII 355


>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
 gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
          Length = 649

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 162/371 (43%), Gaps = 40/371 (10%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPA-RPYFLDLDTGSDLTWLQCDAPCVRC-VEAPHPLYRP 96
           F +HG+V   GYY   + +G P+ R + + +DTGS LT++ C A C +C        + P
Sbjct: 100 FPLHGSVKEHGYYYANIALGDPSPRTFQVIVDTGSTLTYVPC-ATCAKCGTHTGGTRFDP 158

Query: 97  SNDLVPCEDPICASLHAPG---HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
           +   + C++  C +   PG           +C Y   YA+G    G LV+D   F     
Sbjct: 159 TGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYSRTYAEGSGVSGDLVRDKMHFGGDIA 218

Query: 154 QRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGK-SSIVSQLHSQKLIRNVVGHCL-S 209
              N  L +  G       + H    DG++GLG  + +SI +QL     +  V   C  S
Sbjct: 219 PATNGTLDVVFGCTNAESGTIHDQEADGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGS 278

Query: 210 GGGGGFLFFGD--DLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL-KNLPV-- 262
             GGG L FG       +  +V+T M  +  +  YY    A +  G        +L V  
Sbjct: 279 FEGGGALSFGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGY 338

Query: 263 --VFDSGSSYTYLNRVTYQTLTSIMKKELSA-----KSLKEAP-EDETLP--LCWKGR-- 310
             V DSG+++TY+    +    + +   ++      K L + P  D + P  +C++    
Sbjct: 339 GTVMDSGTTFTYVPTKVFHATAAALDAAVTTNAKPEKKLAKVPGPDPSYPDDVCFQREGA 398

Query: 311 ---RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEV 365
               P   + ++ + +  L ++F DG+  +L  L P  YL +  K  G  CLG+++  + 
Sbjct: 399 TEIEPIVTMANLGEYYPPLTIAF-DGEGASLV-LPPSNYLFVHGKKPGAFCLGVMDNKQQ 456

Query: 366 GLQDLNVIGGI 376
           G     +IGGI
Sbjct: 457 G----TLIGGI 463


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 146/358 (40%), Gaps = 57/358 (15%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-------------VEAPHPLYRP 96
           Y NV+  +G P+  + + LDTGSDL WL C+  C  C             +    P    
Sbjct: 105 YANVS--VGTPSLDFLVALDTGSDLFWLPCE--CSSCFTYLNTSNGGKFMLNHYSPNDST 160

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNGQR 155
           ++  VPC   +C       +    +   C YE+ Y     SS+G LV+D      T+   
Sbjct: 161 TSSTVPCTSSLC-------NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA-TDDSL 212

Query: 156 LNP---RLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
           L P   ++  GCG  Q    A+    +G++GLG  K S+ S L  Q L  N    C    
Sbjct: 213 LKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGAD 272

Query: 212 GGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           G G + FGD    D  +  + +M     + Y+     +  GGE   +     +FDSG+S+
Sbjct: 273 GYGRIDFGDTGPADQKQTPFNTMLE--YQSYNVTFNVINVGGEPNDVP-FTAIFDSGTSF 329

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV---KKCFRTLA 327
           TYL    Y T+T  M   +  K              +    PF+  +++    K F+ L 
Sbjct: 330 TYLTEPAYSTITKQMDAGMKLKRYS----------LFGPNFPFEYCYEIPPGAKEFQYLT 379

Query: 328 LSFT----DGKTRT-LFELTP----EAYLIISNKGNV-CLGILNGAEVGLQDLNVIGG 375
           L+FT    D  T T +F   P       +I     +V CL I    ++ L   N + G
Sbjct: 380 LNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDIDLIGQNFMTG 437


>gi|357461293|ref|XP_003600928.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355489976|gb|AES71179.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 295

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/251 (28%), Positives = 107/251 (42%), Gaps = 57/251 (22%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
            G Y V++ IG P + + + +DTGSDLTW              + LY+  N+ V     +
Sbjct: 15  VGGYTVSLKIGYPGQSFDVFIDTGSDLTW------------DKYKLYKLHNNFVYVRIKL 62

Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
                                  Y DG  + G LV+D      ++     P+        
Sbjct: 63  AI---------------------YVDGLQTKGFLVQDNIPLESSDRTLQRPKCT---NIL 98

Query: 168 QVPGASYHPL-DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYD 224
           +V      P+  GILGLG G++SI+SQL S+ LI+NVVGHC SG  G GG          
Sbjct: 99  KVTDKKPKPISKGILGLGHGETSILSQLKSKGLIKNVVGHCFSGKEGQGG---------- 148

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
                  +   D    Y    A L F  + T +K+L ++FDSG++ +  N   ++ L   
Sbjct: 149 -------NTKIDLEGRYFSEPANLIFDEKLTFIKDLQLIFDSGTTLSAFNSKDHKVLVD- 200

Query: 285 MKKELSAKSLK 295
            + E+S   LK
Sbjct: 201 PENEVSKDYLK 211


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 85/195 (43%), Gaps = 21/195 (10%)

Query: 14  TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           ++R S +   S  S +  ++ SS+ + +    Y    Y +   IG PA   +   D+GS 
Sbjct: 64  SIRTSGARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSS 123

Query: 74  LTWLQCDAP-CVRCVEAPHPLYRPSNDLV----PCEDPICASLHAPGHHNCEDPAQ-CDY 127
           L WLQC  P C  C     PL+ PS  +      C    C       +  C+ P Q C Y
Sbjct: 124 LVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKY 183

Query: 128 ELEYADGGSSLGVLVKDAFAF--------NYTNGQRLNPRLALGCGYNQVPGASYHPLDG 179
             +Y D   + GV+  D F F        NYT       R+  GCGYN      ++P  G
Sbjct: 184 HEDYLDDSYTEGVISTDIFTFPEHISGFGNYT------LRIIFGCGYNNSDPQHFYP-PG 236

Query: 180 ILGLGKGKSSIVSQL 194
           ++GL   K+S+V Q+
Sbjct: 237 LVGLTNNKASLVGQM 251


>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
 gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
          Length = 478

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/336 (25%), Positives = 135/336 (40%), Gaps = 26/336 (7%)

Query: 40  QVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVEAPHPLYRPS 97
           +V+G V  TG    +  +   A+ + L +DTGS  T+L C   A C       +  Y  S
Sbjct: 24  EVYGEVLETGVLVASFELA-GAQTFELIVDTGSSRTYLPCKGCASCGAHEAGRYYDYDAS 82

Query: 98  NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
            D    E   CA +       C     C Y++ Y +G  S G LV+D  +   + G   N
Sbjct: 83  ADFSRVECSACAGIGG----KCGTSGVCRYDVHYLEGSGSEGYLVRDVVSLGGSVG---N 135

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG------- 210
             +  GC   ++        DG+ G G+   ++ +QL S  +I ++   C+ G       
Sbjct: 136 ATVVFGCEERELGSIKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGE 195

Query: 211 GGGGFLFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
             GG L  G  D   D+  +V+T M S    Y     +         G + +  + DSG+
Sbjct: 196 HVGGLLTLGNFDFGADAPALVYTPMVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGT 255

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSL-KEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           SYTY+    +     + +       L K AP ++   LC+ G         V + F  L 
Sbjct: 256 SYTYVPGNMHARFLQLAEDAARESGLEKVAPPEDYPDLCF-GNSGGLGWSTVSEYFPALK 314

Query: 328 LSFTDGKTRTLFELTPEAYLII--SNKGNVCLGILN 361
           + +  G  R    L+PE YL     N    C+GIL 
Sbjct: 315 IEY-HGSAR--LTLSPETYLYWHQKNASAFCVGILE 347


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 99/358 (27%), Positives = 135/358 (37%), Gaps = 63/358 (17%)

Query: 40  QVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPS 97
           QVH     T  Y  +  IG P +     +DTGSDL W QC   C+   C +   P Y  S
Sbjct: 78  QVH---RATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLS 134

Query: 98  NDL----VPCEDP--ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
                  VPC D    CA   A G H C     C +   Y   G  +G L  ++FAF   
Sbjct: 135 QSSTFVPVPCADKAGFCA---ANGVHLCGLDGSCTFIASYG-AGRVIGSLGTESFAF--- 187

Query: 152 NGQRLNPRLALGC-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
             +     LA GC    ++   + +   G++GLG+G+ S+VSQ+ + +    +  +  S 
Sbjct: 188 --ESGTTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSS 245

Query: 211 GGGGFLFFGDDLYDSSR---VVWTSMSSDY---TKYYSPGVAELFFGGETTGLKNLP--- 261
           G    LF G           + +     DY   T YY P        G T G   LP   
Sbjct: 246 GASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLP------LEGITVGKTRLPAVN 299

Query: 262 -----------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
                            V+ D+GS  T L    Y+ L   +  +L   SL  APED  L 
Sbjct: 300 STTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLE 359

Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
           LC   R  F+ V         L   F  G       +   +Y    +K   C+ IL G
Sbjct: 360 LC-VAREGFQKV------VPALVFHFGGGAD---MAVPAASYWAPVDKAAACMMILEG 407


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 146/379 (38%), Gaps = 79/379 (20%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC------------VRCVEAPHPL 93
           Y  G Y V   +G P++ + L  DTGSDLTW+ C   C            +R     H  
Sbjct: 78  YGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137

Query: 94  YRPSNDLVPCEDPICAS--LHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
              S   +PC   +C    +      NC  P   C Y+  Y+DG ++LG    +      
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197

Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGH 206
             G+++    + +GC      G S+   DG++GLG  K S   +   +   K    +V H
Sbjct: 198 KEGRKMKLHNVLIGCS-ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256

Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFFGGETTGLK 258
                   +L FG     S   +  +M+  YT+        +Y+  +  +  GG    + 
Sbjct: 257 LSHKNVSNYLTFGSS--RSKEALLNNMT--YTELVLGMVNSFYAVNMMGISIGG---AML 309

Query: 259 NLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            +P            + DSGSS T+L    YQ + + ++  L                  
Sbjct: 310 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL-----------------L 352

Query: 308 KGRRPFKNVHDVKKCFRT----------LALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
           K R+   ++  ++ CF +          L   F DG     FE   ++Y+I +  G  CL
Sbjct: 353 KFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAE---FEPPVKSYVISAADGVRCL 409

Query: 358 GILNGAEVGLQDLNVIGGI 376
           G ++ A  G    +V+G I
Sbjct: 410 GFVSVAWPG---TSVVGNI 425


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 98/351 (27%), Positives = 151/351 (43%), Gaps = 50/351 (14%)

Query: 41  VHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN- 98
           V   VY   G + + M IG P+  +   LDTGSDLTW QC  PC  C   P P+Y PS  
Sbjct: 104 VEAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCK-PCTDCYPQPTPIYDPSQS 162

Query: 99  ---DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
                VPC   +C +L     ++C   A C+Y   Y D  S+ G+L  ++F       Q 
Sbjct: 163 STYSKVPCSSSMCQALPM---YSCSG-ANCEYLYSYGDQSSTQGILSYESFTL---TSQS 215

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
           L P +A GCG  +  G  +    G++G G+G  S++SQL   + + N   +CL     S 
Sbjct: 216 L-PHIAFGCG-QENEGGGFSQGGGLVGFGRGPLSLISQLG--QSLGNKFSYCLVSITDSP 271

Query: 211 GGGGFLFFGDDLYDSSRVVWTS---MSSDYTKYYSPGVAELFFGGETTGLKNLP------ 261
                LF G     +++ V ++    S     +Y   +  +  GG+   + +        
Sbjct: 272 SKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLD 331

Query: 262 ----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET-LPLCWKGRRPFKNV 316
               V+ DSG++ TYL +  Y     + K  +S+ +L +       L LC++ +      
Sbjct: 332 GTGGVIIDSGTTVTYLEQSGYDV---VKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTS 388

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL--NGAEV 365
           H     F T+   F        F L  E Y+   + G  CL +L  NG  +
Sbjct: 389 H-----FPTITFHFEGAD----FNLPKENYIYTDSSGIACLAMLPSNGMSI 430


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 81/274 (29%), Positives = 126/274 (45%), Gaps = 24/274 (8%)

Query: 55  MYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-RPSND---LVPCEDPICAS 110
           + IG P    ++ LDTGSDL W+QC+ PC  C +   P+Y R  +D    + C +P C S
Sbjct: 97  LSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCVS 155

Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLV--KDAFAFNYTNGQRLNPRLALGCGYNQ 168
           L   G   C D   C Y+  YADG  + G+L   K AF  +Y++  +   ++  GCG   
Sbjct: 156 LGREGQ--CSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDK-TAQVGFGCGLQN 212

Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG----GGGGFLFFGDDLY- 223
           +   + +   G+LGLG G  S+VSQL +   +     +C         GGFL FGD  Y 
Sbjct: 213 LNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDATYL 272

Query: 224 --DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----VVFDSGSSYTYLNRV 276
             D + +V              GV E      ++  +  P     V+ DSGS+ +     
Sbjct: 273 NGDMTPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVFPPE 332

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            Y+ + + +  +L  K    +P   + P C++G+
Sbjct: 333 VYEVVRNAVVDKLK-KGYNISPLTSS-PDCFEGK 364


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 12/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           + G    +G Y   + +GQPA+P+++ LDTGSD+ WLQC  PC  C +   P++ P +  
Sbjct: 145 ISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSS 203

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               +PCE   C +L   G   C   ++C Y++ Y DG  ++G  V +   F   N   +
Sbjct: 204 SFASLPCESQQCQALETSG---CR-ASKCLYQVSYGDGSFTVGEFVTETLTFG--NSGMI 257

Query: 157 NPRLALGCGYN 167
           N  +A+GCG++
Sbjct: 258 N-DVAVGCGHD 267


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 158/388 (40%), Gaps = 54/388 (13%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           +R+ + +SS++  S    V  + +    G    T  Y VT+ +G   +   L +DTGSDL
Sbjct: 106 LRIKAMTSSTTEQS----VSETQIPLTSGIKLETLNYIVTVELG--GKNMSLIVDTGSDL 159

Query: 75  TWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAP-------GHHNCEDPA 123
           TW+QC  PC  C     PLY P    S   V C    C  L A        G  N     
Sbjct: 160 TWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKT 218

Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGL 183
            C+Y + Y DG  + G L  ++     T  +     L  GCG N      +    G++GL
Sbjct: 219 TCEYVVSYGDGSYTRGDLASESIVLGDTKLE----NLVFGCGRNN--KGLFGGASGLMGL 272

Query: 184 GKGKSSIVSQLHSQKLIRNVVGHC---LSGGGGGFLFFGDDL---YDSSRVVWTSMSSD- 236
           G+   S+VSQ  + K    V  +C   L  G  G L FG+D     +S+ V +T +  + 
Sbjct: 273 GRSSVSLVSQ--TLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNP 330

Query: 237 -YTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
               +Y   +     GG    LK L     ++ DSG+  T L    Y+ + +   K+ S 
Sbjct: 331 QLRSFYILNLTGASIGG--VELKTLSFGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSG 388

Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
                AP    L  C+       +  D+     T+ + F +G      ++T   Y +  +
Sbjct: 389 --FPSAPGYSILDTCFN----LTSYEDIS--IPTIKMIF-EGNAELEVDVTGVFYFVKPD 439

Query: 352 KGNVCLGILNGAEVGLQDLNVIGGIGDF 379
              VCL +       L   N +G IG++
Sbjct: 440 ASLVCLAL-----ASLSYENEVGIIGNY 462


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 154/360 (42%), Gaps = 56/360 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + +Y+G P R + + +DTGSDL WLQC APC+ C E   P++ P+       V C
Sbjct: 146 SGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTC 204

Query: 104 EDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP 158
            D  C  +  P     C  PA+  C Y   Y D  ++ G L  ++F  N T     R   
Sbjct: 205 GDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 264

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-------GG 211
            +  GCG+       +H   G+LGLG+G  S  SQL      R V GH  S         
Sbjct: 265 GVVFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQL------RAVYGHTFSYCLVEHGSD 316

Query: 212 GGGFLFFGDD--LYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
            G  + FG+D  +    ++ +T+    SS    +Y   +  +  GG+   + +       
Sbjct: 317 AGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGK 376

Query: 262 -----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
                 + DSG++ +Y     YQ +      +L ++     P+   L  C+       NV
Sbjct: 377 DGSGGTIIDSGTTLSYFVEPAYQVIRQAF-VDLMSRLYPLIPDFPVLNPCY-------NV 428

Query: 317 HDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIG 374
             V++     L+L F DG    +++   E Y + +   G +CL +      G   +++IG
Sbjct: 429 SGVERPEVPELSLLFADG---AVWDFPAENYFVRLDPDGIMCLAVRGTPRTG---MSIIG 482


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 123/281 (43%), Gaps = 36/281 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC--VEAPHPLYRPSNDLVPCED 105
           +G Y + M IG PA      +DTGSDL W +C+ PC  C       P    +   V C+ 
Sbjct: 39  SGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCN-PCTDCSTSSIYDPSSSSTYSKVLCQS 97

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
            +C     P   +C +   C+Y   Y D  S+ G+L  + F+    + Q L P +  GCG
Sbjct: 98  SLC---QPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSI---SSQSL-PNITFGCG 150

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDD 221
           ++      +  + G++G G+G  S+VSQL     + N   +CL           LF G+ 
Sbjct: 151 HDN---QGFDKVGGLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLFIGNT 205

Query: 222 LYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGET----TGLKNLP------VVFDSGSS 269
               +  V ++  + S  T +Y   +  +  GG++    TG  ++       ++ DSG++
Sbjct: 206 ASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTT 265

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            T+L +  Y  +     KE    S+     D  L LC+  +
Sbjct: 266 LTFLQQTAYDAV-----KEAMVSSINLPQADGQLDLCFNQQ 301


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 94/366 (25%), Positives = 149/366 (40%), Gaps = 51/366 (13%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPL 93
           S+ L    G++  +  Y V + +G P R   L  DTGSDLTW QC+ PC   C +    +
Sbjct: 30  STTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAI 88

Query: 94  YRPSNDL----VPCEDPICASLHAPG-HHNCEDP--AQCDYELEYADGGSSLGVLVKDAF 146
           + PS       + C   +C  L + G    C     A C Y+ +Y D  +S+G L ++  
Sbjct: 89  FDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERL 148

Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH 206
               T+   +      GCG  Q     ++   G++GLG+   SIV Q  S      +  +
Sbjct: 149 TITATD---IVDDFLFGCG--QDNEGLFNGSAGLMGLGRHPISIVQQTSSN--YNKIFSY 201

Query: 207 CL--SGGGGGFLFFGDDLYDSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLPV 262
           CL  +    G L FG     ++ +++T +S  S    +Y   +  +  GG       LP 
Sbjct: 202 CLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGG-----TKLPA 256

Query: 263 V-----------FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
           V            DSG+  T L    Y  L S  ++ +    +  A E   L  C+    
Sbjct: 257 VSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPV--ANEAGLLDTCYD-LS 313

Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI-LNGAEVGLQDL 370
            +K +   +  F      F+ G T    EL     L + ++  VCL    NG++    D+
Sbjct: 314 GYKEISVPRIDFE-----FSGGVT---VELXHRGILXVESEQQVCLAFAANGSD---NDI 362

Query: 371 NVIGGI 376
            V G +
Sbjct: 363 TVFGNV 368


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 144/366 (39%), Gaps = 60/366 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----DAPCVRCVEAPHPLYRPSNDL--- 100
           TG Y V + +G PA+P+ L  DTGSDLTW++C     +        P  ++RP+      
Sbjct: 101 TGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWS 160

Query: 101 -VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRL 156
            +PC+   C S       NC  P   C Y+  Y D  S+ GV+  D+   + +  +G R 
Sbjct: 161 PLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRK 220

Query: 157 NP--RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGG 211
                + LGC      G S+   DG+L LG    S  S+  S+   +    +V H     
Sbjct: 221 AKLQEVVLGCT-TSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRN 279

Query: 212 GGGFLFFGDDLYDSS------RVVWTSMSSDYTK-YYSPGVAELFFGGETTGL------- 257
              FL FG+            R     +    T+ +Y   V  +   GE   +       
Sbjct: 280 ATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDF 339

Query: 258 -KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
            KN   + DSG+S T L    Y  +   + K+ +                     P  N+
Sbjct: 340 RKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGV-------------------PRVNM 380

Query: 317 HDVKKCFRTLALSFTDGKTRTLFE----LTP--EAYLIISNKGNVCLGILNGAEVGLQDL 370
              + C+    +S    +    F     L P  ++Y+I +  G  C+G++ GA  G   +
Sbjct: 381 DPFEYCYNWTGVSAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPG---V 437

Query: 371 NVIGGI 376
           +VIG I
Sbjct: 438 SVIGNI 443


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 146/352 (41%), Gaps = 52/352 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---------- 100
           Y   + +G P   + + LDTGSDL W+ CD  C+ C  AP   YR + D           
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCD--CIEC--APLAGYRETLDRDLGIYKPAES 198

Query: 101 -----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNG 153
                +PC   +C     P    C  P Q C Y  +Y  +  +S G+L++D    +    
Sbjct: 199 TTSRHLPCSHELC-----PPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRES 253

Query: 154 QR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
              +   + +GCG  Q    SY      DG+LGLG    S+ S L    L+RN    C  
Sbjct: 254 HAPVKASVVIGCGRKQ--SGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFK 311

Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGVAELFFGGETTGLKNLPVVFDSG 267
              G  +FFGD      +   T     Y KY  Y+  V +   G +     +   + DSG
Sbjct: 312 EDSGR-IFFGDQGVSIQQS--TPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSG 368

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +S+T L    Y+ +     K++ A  + +  ED +   C+    P K + DV     T+ 
Sbjct: 369 TSFTALPLNVYKAVAVEFDKQVHAPRITQ--EDASFEYCYSA-SPLK-MPDVP----TVT 420

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNV---CLGILNGAE-VGLQDLNVIGG 375
           L+F   K+   F+      ++   +G+V   CL +    E +G+   N + G
Sbjct: 421 LTFAANKS---FQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNFLTG 469


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 88/351 (25%), Positives = 132/351 (37%), Gaps = 60/351 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVPCED 105
           +G Y + + +G P + +   +DTGSDL W+QC  PC +C     P+Y P  S+       
Sbjct: 1   SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCK-PCSQCYSQSDPIYDPSASSTFAKTSC 59

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLNPRLALGC 164
              +    P          C Y  +Y D  S+ G    +      + G  +  P    GC
Sbjct: 60  STSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGC 119

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
           G  ++   S+    GI+GLG+GK S+ +QL S   I N   +CL            L FG
Sbjct: 120 G--RLNSGSFGGAAGIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSKTSPLIFG 175

Query: 220 DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGL-------------KNLPV-- 262
                 S  + T +  +S  + YY  G+  +  GG+   L             K L V  
Sbjct: 176 SSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235

Query: 263 --------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
                   +FDSG++ T L+   Y  + S     +S            LP        F 
Sbjct: 236 LEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVS------------LPTVDASSSGFD 283

Query: 315 NVHDVKKC----FRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGI 359
             +DV K     F  L L+F   K    F    + Y +I +      CL +
Sbjct: 284 LCYDVSKSKNFKFPALTLAFKGTK----FSPPQKNYFVIVDTAETVACLAM 330


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 12/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           + G    +G Y   + +GQPA+P+++ LDTGSD+ WLQC  PC  C +   P++ P +  
Sbjct: 145 ISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSS 203

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               +PCE   C +L   G   C   ++C Y++ Y DG  ++G  V +   F   N   +
Sbjct: 204 SFASLPCESQQCQALETSG---CR-ASKCLYQVSYGDGSFTVGEFVIETLTFG--NSGMI 257

Query: 157 NPRLALGCGYN 167
           N  +A+GCG++
Sbjct: 258 N-NVAVGCGHD 267


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/336 (27%), Positives = 130/336 (38%), Gaps = 82/336 (24%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           T  Y V + +G P RP  L LDTGSDL W QC APC  C +   PL  P+       +PC
Sbjct: 83  TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFDQGIPLLDPAASSTYAALPC 141

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-------L 156
             P C +L      +C     C Y   Y D   ++G +  D F F   NG+R        
Sbjct: 142 GAPRCRALP---FTSCGG-RSCVYVYHYGDKSVTVGKIATDRFTFG-DNGRRNGDGSLPA 196

Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
             RL  GCG +N+  G       GI G G+G+ S+ SQL++                  F
Sbjct: 197 TRRLTFGCGHFNK--GVFQSNETGIAGFGRGRWSLPSQLNATS----------------F 238

Query: 216 LFFGDDLYDSSRVVWT---SMSSDYTKYYS-----------PGVAELFF---GGETTGLK 258
            +    ++DS   + T   + ++ Y+  +S           P    L+F    G + G  
Sbjct: 239 SYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKT 298

Query: 259 NLPV--------VFDSGSSYTYLNRVTYQTLTSIMKKE------------------LSAK 292
            LPV        + DSG+S T L    Y+ + +    +                  L   
Sbjct: 299 RLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVS 358

Query: 293 SLKEAPEDETLPLC-WKGRRPFKNVHDVKKCFRTLA 327
           +L   P   +L  C W  R P  + H    C RT A
Sbjct: 359 ALWRRPAVPSLTRCTW--RAPTGSSHAATTCSRTSA 392


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 152/368 (41%), Gaps = 62/368 (16%)

Query: 48  TGYYNVTMYIGQP-----ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR----PSN 98
           +G Y   + +G P     +    L  D GSD+TWLQC  PC RC   P P+Y      S 
Sbjct: 122 SGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQC-MPCFRCYHQPGPVYNRLKSSSA 180

Query: 99  DLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
             V C  P C +L + G   C +   +C Y++EY DG SS G    +   F    G R+ 
Sbjct: 181 SDVGCYAPACRALGSSG--GCVQFLNECQYKVEYGDGSSSAGDFGVETLTF--PPGVRV- 235

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG--- 214
           P +A+GCG +   G    P  GILGLG+G  S  SQ+  +        +CL+G G G   
Sbjct: 236 PGVAIGCGSDN-QGLFPAPAAGILGLGRGSLSFPSQIAGR--YGRSFSYCLAGQGTGGRS 292

Query: 215 -FLFFGDDL-------YDSSRVVWTSMSSDYTKYYSPGVAELFFGG------ETTGLKNL 260
             L FG             S     + S  YT YY  G+  +  GG        + L+  
Sbjct: 293 STLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYV-GLVGISVGGVRVRGVTESDLRLD 351

Query: 261 P------VVFDSGSSYTYLNRVTYQTLTSIMK----KELSAKSLKEAPEDETLPLCWKGR 310
           P      V+ DSG++ T L+   Y       +    KEL   S            C+   
Sbjct: 352 PSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPS--PGGPFAFFDTCYSSV 409

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGILNGAEVGLQ 368
           R       V K    +++ F  G      +L P+ YLI   SNKG +C      A  G +
Sbjct: 410 R-----GRVMKKVPAVSMHFAGG---VEVKLPPQNYLIPVDSNKGTMCFAF---AGSGDR 458

Query: 369 DLNVIGGI 376
            +++IG I
Sbjct: 459 GVSIIGNI 466


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 132/315 (41%), Gaps = 49/315 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+ +G P+    L +DTGSDL+W+QC  PC    C     PL+ PS       +PC 
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWVQCQ-PCNSTTCYPQKDPLFDPSKSSTYAPIPCN 182

Query: 105 DPICASLHAPGH----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
              C  L   G+     + +  AQC + + Y DG  + GV   +  A        L P +
Sbjct: 183 TDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLA--------LAPGV 234

Query: 161 AL-----GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG----- 210
           A+     GCG++Q    +    DG+LGLG    S+V Q  S  +      +CL       
Sbjct: 235 AVKDFRFGCGHDQ--DGANDKYDGLLGLGGAPESLVVQTAS--VYGGAFSYCLPALNNQV 290

Query: 211 ---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----VV 263
                GG       + ++S  V+T M  +   +Y   +  +  GGE   +        ++
Sbjct: 291 GFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSGGMI 350

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            DSG+  T L    Y  L +  +K ++A  L    E +T   C+     F    +V    
Sbjct: 351 IDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGELDT---CYD----FSGYSNVT--L 401

Query: 324 RTLALSFTDGKTRTL 338
             +AL+F+ G T  L
Sbjct: 402 PKVALTFSGGATIDL 416


>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 525

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 107/262 (40%), Gaps = 38/262 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPS----NDLVP 102
           IG P   + + LD GSD+ W+ CD  C+ C          ++     YRPS    +  +P
Sbjct: 111 IGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLP 168

Query: 103 CEDPICASLHAPGHHNCE---DPAQCDYELEYADGG-SSLGVLVKDAFAF----NYTNGQ 154
           C   +C       H  C+   DP  C YE++YA    SS G + +D         +    
Sbjct: 169 CGHKLCDV-----HSFCKGSKDP--CPYEVQYASANTSSSGYVFEDKLHLTSDGKHAEQN 221

Query: 155 RLNPRLALGCGYNQVPGASYHPL--DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
            +   + LGCG  Q  G   H    DG+LGLG G  S+ S L    LI+N    CL    
Sbjct: 222 SVQASIILGCGRKQT-GDYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICLDENE 280

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
            G + FGD       V   S        Y  GV     G           + DSGSS+T+
Sbjct: 281 SGRIIFGDQ----GHVTQHSTPFLPIIAYMVGVESFCVGSLCLKETRFQALIDSGSSFTF 336

Query: 273 LNRVTYQTLTSIMKKELSAKSL 294
           L    YQ + +   K+++A  +
Sbjct: 337 LPNEVYQKVVTEFDKQVNASRI 358


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/321 (27%), Positives = 134/321 (41%), Gaps = 46/321 (14%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH------APGHH 117
           +DT S+LTW+QC APC  C +   PL+ PS+      VPC    C +L       + G  
Sbjct: 168 VDTASELTWVQC-APCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226

Query: 118 NCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGAS 173
            C+      A C Y L Y DG  S GVL  D  +     G+ ++     GCG +   G  
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL---AGEVIDG-FVFGCGTSN-QGPP 281

Query: 174 YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDD---LYDSSR 227
           +    G++GLG+ + S+VSQ   Q     V  +CL        G L  GDD     +S+ 
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQTMDQ--FGGVFSYCLPLKESDSSGSLVIGDDSSVYRNSTP 339

Query: 228 VVWTSMSSDYTK--YYSPGVAELFFGGETT-------GLKNLPVVFDSGSSYTYLNRVTY 278
           +V+ SM SD  +  +Y   +  +  GG+         G      + DSG+  T L    Y
Sbjct: 340 IVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIY 399

Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
             + +    + +     +AP    L  C+        + +V+    +L L F DG     
Sbjct: 400 NAVKAEFLSQFA--EYPQAPGFSILDTCFN----MTGLREVQ--VPSLKLVF-DGGVEVE 450

Query: 339 FELTPEAYLIISNKGNVCLGI 359
            +     Y + S+   VCL +
Sbjct: 451 VDSGGVLYFVSSDSSQVCLAM 471


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 65/126 (51%), Gaps = 8/126 (6%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + IG PAR  ++ LDTGSD+TWLQC APC  C     PL+ P    S   VPC
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQC-APCADCYAQSDPLFDPALSSSYATVPC 251

Query: 104 EDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           + P C +L A   HN      + C YE+ Y DG  ++G    +       +G      +A
Sbjct: 252 DSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVA 310

Query: 162 LGCGYN 167
           +GCG++
Sbjct: 311 IGCGHD 316


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/276 (25%), Positives = 117/276 (42%), Gaps = 43/276 (15%)

Query: 40  QVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
           ++   V P  G + + + IG P   Y   +DTGSDL W QC  PC +C + P P++ P  
Sbjct: 85  EIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPTPIFDPKK 143

Query: 99  DLVPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                +    + L  A     C D   C+Y   Y D  S+ G+L  +   F    G+   
Sbjct: 144 SSSFSKLSCSSKLCEALPQSTCSD--GCEYLYGYGDYSSTQGMLASETLTF----GKVSV 197

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL------IRNVVGHCLSGG 211
           P +A GCG +   G+ +    G++GLG+G  S+VSQL   K       + +     L  G
Sbjct: 198 PEVAFGCGEDN-EGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMG 256

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------- 262
               +   D    ++ ++  S    +  YY      L   G + G  +LP+         
Sbjct: 257 SLASVKASDSEIKTTPLIQNSAQPSF--YY------LSLEGISVGDTSLPIKKSTFSLQE 308

Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
                 + DSG++ TYL +  +     ++ KE +++
Sbjct: 309 DGSGGLIIDSGTTITYLEQSAFD----LVAKEFTSQ 340


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 124/293 (42%), Gaps = 51/293 (17%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV--EAPHPLYRPSN----DLVP 102
           G YN+ + +G P   + + +DTGS+L W QC APC RC     P P+ +P+       +P
Sbjct: 89  GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLP 147

Query: 103 CEDPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C    C  L        C   A C Y   Y  G ++ G L  +      T G    P++A
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATETL----TVGDGTFPKVA 202

Query: 162 LGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFL 216
            GC   N V  +S     GI+GLG+G  S+VSQL   +       +CL    + GG   +
Sbjct: 203 FGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAVGRF-----SYCLRSDMADGGASPI 252

Query: 217 FFGDDLYDSSRVVWTS---MSSDY----TKYYS--PGVA----ELFFGGETTGLKNLPV- 262
            FG     + R V  S   + + Y    T YY    G+A    EL   G T G     + 
Sbjct: 253 LFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLG 312

Query: 263 ---VFDSGSSYTYLNRVTY----QTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
              + DSG++ TYL +  Y    Q   S M           AP D  L LC+K
Sbjct: 313 GGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK 363


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 23/252 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y +T+ +G PA    + +DTGSD++W+QC  PC +C     PL+ P    +     C   
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            CA L   G + C   +QC Y + Y DG S+ G    D  A     G         GC  
Sbjct: 187 ACAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVKSFQFGC-- 239

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
           + V        DG++GLG G  S+VSQ  +   +     +CL  +    GFL        
Sbjct: 240 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297

Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
            +S  V T M  SS    +Y   +  +  GG    +     +   V DSG+  T L    
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 357

Query: 278 YQTLTSIMKKEL 289
           Y  L+S  K  +
Sbjct: 358 YSALSSAFKAGM 369


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/274 (29%), Positives = 125/274 (45%), Gaps = 24/274 (8%)

Query: 55  MYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-RPSND---LVPCEDPICAS 110
           + IG P    ++ LDTGSDL W+QC+ PC  C +   P+Y R  +D    + C +P C S
Sbjct: 110 LSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCLS 168

Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLV--KDAFAFNYTNGQRLNPRLALGCGYNQ 168
           L   G   C D   C Y+  YADG  + G+L   K AF  +Y++  +   ++  GCG   
Sbjct: 169 LGREGQ--CSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDK-TAQVGFGCGLQN 225

Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG----GGGGFLFFGDDLY- 223
           +   +     G+LGLG G  S+VSQL +   +     +C         GGFL FGD  Y 
Sbjct: 226 LNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYL 285

Query: 224 --DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----VVFDSGSSYTYLNRV 276
             D + +V              GV E      ++  +  P     V+ DSGS+ +     
Sbjct: 286 NGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPE 345

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            Y+ + + +  +L  K    +P   + P C++G+
Sbjct: 346 VYEVVRNAVVDKLK-KGYNISPLTSS-PDCFEGK 377


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 152/350 (43%), Gaps = 40/350 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
           G++  +G Y VT+ +G P + + L  DTGSDLTW QC+ PCV+ C      ++ PS    
Sbjct: 145 GSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCE-PCVKSCYNQKEAIFNPSQSTS 203

Query: 101 ---VPCEDPICASL-HAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
              + C   +C SL  A G+  NC   + C Y ++Y D   S+G   K+  +   T+   
Sbjct: 204 YANISCGSTLCDSLASATGNIFNCAS-STCVYGIQYGDSSFSIGFFGKEKLSLTATD--- 259

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGG 213
           +      GCG N      +    G+LGLG+ K S+VSQ  + +    +  +CL  S    
Sbjct: 260 VFNDFYFGCGQNN--KGLFGGAAGLLGLGRDKLSLVSQ--TAQRYNKIFSYCLPSSSSST 315

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DS 266
           GFL FG     S+     +  S  + +Y   +  +  GG    +   P VF       DS
Sbjct: 316 GFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAIS--PSVFSTAGTIIDS 373

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
           G+  T L    Y  L+S  +K +S      AP    L  C+     F N HD     + +
Sbjct: 374 GTVITRLPPAAYSALSSTFRKLMS--QYPAAPALSILDTCFD----FSN-HDTISVPK-I 425

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            L F+ G    + ++       +++   VCL     ++    D+ + G +
Sbjct: 426 GLFFSGG---VVVDIDKTGIFYVNDLTQVCLAFAGNSDA--SDVAIFGNV 470


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 131/307 (42%), Gaps = 37/307 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+  G P+ P  L +DTGSD++W+QC APC    C     PL+ PS       + C 
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQC-APCNSTECYPQKDPLFDPSKSSTYAPIACG 183

Query: 105 DPICASLHAPGHHNCED-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              C  L     + C     QC Y +EY DG S+ GV   +   F    G  +      G
Sbjct: 184 ADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITF--APGITVK-DFHFG 240

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFG-- 219
           CG++Q  G S    DG+LGLG    S+V Q  S  +      +CL       GFL  G  
Sbjct: 241 CGHDQR-GPS-DKFDGLLGLGGAPESLVVQTAS--VYGGAFSYCLPALNSEAGFLALGVR 296

Query: 220 -DDLYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYT 271
                ++S  V+T M     D T Y    +  +  GG+   +        ++ DSG+  T
Sbjct: 297 PSAATNTSAFVFTPMWHLPMDATSYMV-NMTGISVGGKPLDIPRSAFRGGMLIDSGTIVT 355

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y  L + ++K  +A  +  + + +T   C+     F    +V      +AL+F+
Sbjct: 356 ELPETAYNALNAALRKAFAAYPMVASEDFDT---CYN----FTGYSNVT--VPRVALTFS 406

Query: 332 DGKTRTL 338
            G T  L
Sbjct: 407 GGATIDL 413


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/399 (24%), Positives = 158/399 (39%), Gaps = 62/399 (15%)

Query: 5   HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
            + E + +   R+S +    SS      + S+ L    G++  +G Y V + +G P R  
Sbjct: 103 QDKERVKYINSRLSKNLGQDSS---VEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDL 159

Query: 65  FLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHN- 118
            L  DTGSDLTW QC+ PC R C +    ++ PS       + C   +C  L     ++ 
Sbjct: 160 SLIFDTGSDLTWTQCE-PCARSCYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDP 218

Query: 119 -CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP 176
            C    + C Y ++Y D   S+G   ++      T+   +      GCG N      +  
Sbjct: 219 GCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD---VVDNFLFGCGQNN--QGLFGG 273

Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS 234
             G++GLG+   S V Q  ++   R +  +CL  +    G L FG           T   
Sbjct: 274 SAGLIGLGRHPISFVQQTAAK--YRKIFSYCLPSTSSSTGHLSFGP--------AATGRY 323

Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLPV----------VFDSGSSYTYLNRVTYQT 280
             YT + +      F+G + T +      LPV          + DSG+  T L    Y  
Sbjct: 324 LKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVITRLPPTAYGA 383

Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
           L S  ++ +S      A E   L  C+   G + F           T+  SF  G T   
Sbjct: 384 LRSAFRQGMS--KYPSAGELSILDTCYDLSGYKVFS--------IPTIEFSFAGGVT--- 430

Query: 339 FELTPEAYLIISNKGNVCLGI-LNGAEVGLQDLNVIGGI 376
            +L P+  L +++   VCL    NG +    D+ + G +
Sbjct: 431 VKLPPQGILFVASTKQVCLAFAANGDD---SDVTIYGNV 466


>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Brachypodium distachyon]
          Length = 509

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/288 (26%), Positives = 120/288 (41%), Gaps = 38/288 (13%)

Query: 34  GSSLLFQVHGNV---YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           G SLL    GN    +    +   + +G P   + + LDTGSDL W+ CD  C RC    
Sbjct: 63  GESLLSFADGNSTTRHAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCD--CKRCAPIA 120

Query: 91  H---------PLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGV 140
           +         P    ++  V C   +C   +A G+ N      C Y ++Y     SS GV
Sbjct: 121 NTSELLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGN----GSCPYTVKYVSANTSSSGV 176

Query: 141 LVKDAFAFNYTN-----------GQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGK 185
           LV+D       +           G+ +  R+  GCG  Q    + GA+   ++G+LGLG 
Sbjct: 177 LVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAA---MEGLLGLGM 233

Query: 186 GKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
            + S+ S L +  L+  +    C S  G G + FG+     ++     + S     Y+  
Sbjct: 234 DRVSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNIS 293

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
           V  +   G+         V DSG+S+TYLN   Y  L +    ++  K
Sbjct: 294 VTAVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREK 341


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 127/287 (44%), Gaps = 26/287 (9%)

Query: 21  SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           S +SS++ +FN + + +     G     G Y VT+ +G P + + L  DTGSDLTW QC+
Sbjct: 107 SMNSSTTGVFNEMKTRVPTTHFG-----GGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCE 161

Query: 81  APCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
                C       + P+       + C    C S+       C     C Y ++Y   G 
Sbjct: 162 PCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGT-GY 220

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           ++G L  +      ++   +     +GCG     G  +    G+LGLG+   ++ SQ  S
Sbjct: 221 TVGFLATETLTITPSD---VFENFVIGCGERN--GGRFSGTAGLLGLGRSPVALPSQTSS 275

Query: 197 QKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG-- 252
               +N+  +CL  S    G L FG  +  +++  +T ++S   + Y   V+ +  GG  
Sbjct: 276 T--YKNLFSYCLPASSSSTGHLSFGGGVSQAAK--FTPITSKIPELYGLDVSGISVGGRK 331

Query: 253 ---ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
              + +  +    + DSG++ TYL    +  L+S  ++ ++  +L +
Sbjct: 332 LPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTK 378


>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 362

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 32/212 (15%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            +++ ++   GYY   ++IG P + + L +D+GS +T++ C + C +C +    L  P +
Sbjct: 80  MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQVMLSSPKD 138

Query: 99  D---LVPCE-----------------DPICASLHAPGHHNCE-----DPAQCDYELEYAD 133
               LV C+                  P  +S + P   N +     D  QC YE EYA+
Sbjct: 139 QILCLVSCKVQIFKISYGLFDEDPKFQPELSSTYQPVKCNMDCNCDDDKEQCVYEREYAE 198

Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVS 192
             SS GVL +D  +F   N   L P R   GC   +         DGI+GLG+G  S+V 
Sbjct: 199 HSSSKGVLGEDLISFG--NESHLTPQRAVFGCKTVETGDLYSQRADGIIGLGQGDLSLVG 256

Query: 193 QLHSQKLIRNVVGHCLSG---GGGGFLFFGDD 221
           QL  + LI N  G C  G   GGG  +  G D
Sbjct: 257 QLVDKGLISNSFGLCYGGLDVGGGSMIVGGFD 288


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 124/293 (42%), Gaps = 51/293 (17%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV--EAPHPLYRPSN----DLVP 102
           G YN+ + +G P   + + +DTGS+L W QC APC RC     P P+ +P+       +P
Sbjct: 89  GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLP 147

Query: 103 CEDPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C    C  L        C   A C Y   Y  G ++ G L  +      T G    P++A
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATETL----TVGDGTFPKVA 202

Query: 162 LGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFL 216
            GC   N V  +S     GI+GLG+G  S+VSQL   +       +CL    + GG   +
Sbjct: 203 FGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAVGRF-----SYCLRSDMADGGASPI 252

Query: 217 FFGD--DLYDSSRVVWTSMSSD-----YTKYYS--PGVA----ELFFGGETTGLKNLPV- 262
            FG    L + S V  T +  +      T YY    G+A    EL   G T G     + 
Sbjct: 253 LFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLG 312

Query: 263 ---VFDSGSSYTYLNRVTY----QTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
              + DSG++ TYL +  Y    Q   S M           AP D  L LC+K
Sbjct: 313 GGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK 363


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 76/275 (27%), Positives = 115/275 (41%), Gaps = 36/275 (13%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPHPLYRPSNDLVP---------- 102
           IG P   + + LDTGSDL W+ C+  C  C     E+  P     N   P          
Sbjct: 117 IGTPNVQFLVVLDTGSDLLWIPCE--CESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174

Query: 103 CEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSL-GVLVKDAFAF-NYTNGQRLNPR 159
           C DP+C          C  P  QC YE+ Y    +S  G L +D   F   + G  +   
Sbjct: 175 CSDPLCEM-----SSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP 229

Query: 160 LALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
           + LGCG  Q    + GA+ +   G++GLG    S+ ++L S   + +    C+S GG G 
Sbjct: 230 VYLGCGKVQTGSLLKGAAPN---GLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGT 286

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL--FFGGETTGLKNLPVVFDSGSSYTYL 273
           L FGD+   + R   T +           + E+     G T  L     +FD+G+S+TYL
Sbjct: 287 LTFGDEGPAAQRT--TPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYL 344

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           ++  Y         ++S     + P      LC++
Sbjct: 345 SKTVYPQFVQAYDAQMSLPKWND-PRFSKWDLCYQ 378


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/311 (26%), Positives = 129/311 (41%), Gaps = 51/311 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y ++  IG P    F  +DTGSDL WLQC+ PC +C     P++ P    S   +PC 
Sbjct: 86  GEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCE-PCKQCYPQITPIFDPSLSSSYQNIPCL 144

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C S+             CD            G L  +    + T G  ++ P+  +G
Sbjct: 145 SDTCHSMRT---------TSCDVR----------GYLSVETLTLDSTTGYSVSFPKTMIG 185

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----------GGGG 213
           CGY    G  + P  GI+GLG G  S+ SQL +   I     +CL             G 
Sbjct: 186 CGYRNT-GTFHGPSSGIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGD 242

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
             + +GD    +  V   + S  Y   + +S G   + FGG T G     ++ DSG+++T
Sbjct: 243 AAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFT 302

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK-----GRRPFKNVH----DVKKC 322
           +L    Y    S + + ++ + +++   + T  LC+         P    H    D+K  
Sbjct: 303 FLPYDVYYRFESAVAEYINLEHVEDP--NGTFKLCYNVAYHGFEAPLITAHFKGADIKLY 360

Query: 323 FRTLALSFTDG 333
           + +  +  +DG
Sbjct: 361 YISTFIKVSDG 371


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/173 (29%), Positives = 85/173 (49%), Gaps = 20/173 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           T  + V + +G P + +++  D  +D TWLQC  PC++C + P  ++ PS      L+ C
Sbjct: 184 TSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQ-PCIKCYDQPDSIFDPSQSSSYTLLSC 242

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           E   C  L    + +C D   C Y + Y DG ++ GVL+ +  +F  +       R++LG
Sbjct: 243 ETKHCNLL---PNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWVD---RVSLG 296

Query: 164 C-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
           C   NQ P   +   DG  GLG+G  S  S++++  +      +CL     G+
Sbjct: 297 CSNKNQGP---FVGSDGTFGLGRGSLSFPSRINASSM-----SYCLVESKDGY 341


>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 520

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/251 (27%), Positives = 104/251 (41%), Gaps = 23/251 (9%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCDAPCVRCVEAPH---------PLYRPSNDLVPCED 105
           +G P + + + LDTGSDL WL  QCD  C     A           P    ++  VPC  
Sbjct: 115 VGTPGQTFMVALDTGSDLFWLPCQCDG-CTPPATAASGSFQATFYIPGMSSTSKAVPCNS 173

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLAL 162
             C          C    QC Y++ Y   G SS G LV+D    +  N   Q L  ++ L
Sbjct: 174 NFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIML 228

Query: 163 GCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
           GCG  Q          +G+ GLG  + S+ S L  + L  N    C    G G + FGD 
Sbjct: 229 GCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQ 288

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTL 281
                      ++  +   Y+  ++ +  G + T + +   +FD+G+S+TYL    Y  +
Sbjct: 289 ESSDQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLADPAYTYI 346

Query: 282 TSIMKKELSAK 292
           T     ++ A 
Sbjct: 347 TQSFHAQVQAN 357


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 145/351 (41%), Gaps = 62/351 (17%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------------ 86
             ++G+      Y   + +G P +     +DTGSD+ W +C   C  C            
Sbjct: 76  LMLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKL-CQGCSSKKNVIVCSSI 134

Query: 87  -VEAPHPLYRPSNDLVP----CEDPICASLHA-PGHHNCEDPAQCDYELEYADGGSSLGV 140
            ++ P  LY P   +      C DP+C+   +  G++N      C Y++ Y D  SS G+
Sbjct: 135 IMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNN-----SCAYDISYEDTSSSTGI 189

Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
             +D     +     LN  + LGC    + G    P+DGI+G G+ K S+ +QL +Q   
Sbjct: 190 YFRDVVHLGHK--ASLNTTMFLGCA-TSISG--LWPVDGIMGFGRSKVSVPNQLAAQAGS 244

Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYS------------PGVA 246
            N+  HCLSG   GGG L  G +  +   +V+T M ++   Y              P  A
Sbjct: 245 YNIFYHCLSGEKEGGGILVLGKN-DEFPEMVYTPMLANDIVYNVKLVSLSVNSKALPIEA 303

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
             F    T G  N   + DSG+S       T+ +    +  +  +K     P   T PL 
Sbjct: 304 SEFEYNATVG--NGGTIIDSGTS-----SATFPSKALALFVKAVSKFTTAIP---TAPLE 353

Query: 307 WKGRRPFKNVHD---VKKCFRTLALSFTDGKTRTLFELTPEAYL--IISNK 352
             G   F ++ D   V+  F  + L F  G T    ELT   YL  ++S K
Sbjct: 354 SSGSPCFISISDRNSVEVDFPNVTLKFDGGAT---MELTAHNYLEAVVSRK 401


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 145/367 (39%), Gaps = 55/367 (14%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G  + +G Y   + +G P     + +DTGSDL WLQC  PC  C     PLY P    ++
Sbjct: 80  GVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQC-VPCRHCYRQVTPLYDPRSSSTH 138

Query: 99  DLVPCEDPICAS-LHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
             +PC  P C   L  PG   C+     C Y + Y DG +S G L  D   F        
Sbjct: 139 RRIPCASPRCRDVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH- 194

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SG 210
              + LGCG++ V         G+LG+G+G+ S  +QL       +V  +CL      + 
Sbjct: 195 --NVTLGCGHDNV--GLLESAAGLLGVGRGQLSFPTQL--APAYGHVFSYCLGDRLSRAQ 248

Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLP------ 261
            G  +L FG      S   +T + ++  +   YY   V     G   TG  N        
Sbjct: 249 NGSSYLVFGRTPEPPS-TAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPA 307

Query: 262 -----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL--KEAPEDETLPLCWKGRRPFK 314
                +V DSG++ +   R  Y  +        +A     K A +      C+  R    
Sbjct: 308 TGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGA 367

Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN-----VCLGILNGAEVGLQD 369
               V+    ++ L F  G    L    P+A  +I  +G       CLG L  A+ G   
Sbjct: 368 PAAAVR--VPSIVLHFAGGADMAL----PQANYLIPVQGGDRRTYFCLG-LQAADDG--- 417

Query: 370 LNVIGGI 376
           LNV+G +
Sbjct: 418 LNVLGNV 424


>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
 gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
          Length = 475

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 116/264 (43%), Gaps = 24/264 (9%)

Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPL-DGILG 182
           +C Y   YA+  SS G +V+DAF F      +   R+  GC  N   G  Y  L DGI+G
Sbjct: 6   KCYYSRTYAERSSSEGWMVEDAFGFP---DDQPPVRMVFGC-ENGETGEIYRQLADGIMG 61

Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD-DLYDSSRVVWTSMSSD-YTKY 240
           +G   ++  SQL ++ +I +V   C      G L  GD  +   +  V+T + ++ +  Y
Sbjct: 62  MGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNLHLHY 121

Query: 241 YSPGVAELFFGGETTGL------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
           Y+  +  +   G    L      +   VV DSG+++TYL    +  + + +     +  L
Sbjct: 122 YNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYALSHGL 181

Query: 295 KEAP--EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
           +  P  + +   +CWKG     N   ++  F +    F D        L P  YL +S  
Sbjct: 182 QSTPGADPQYNDICWKGAP--DNFQGLENHFPSAEFVFGDNAR---LSLPPLRYLFVSRP 236

Query: 353 GNVCLGILNGAEVGLQDLNVIGGI 376
           G  CLG+ +    G     +IGG+
Sbjct: 237 GEYCLGVFDNGGSG----TLIGGV 256


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 91/356 (25%), Positives = 139/356 (39%), Gaps = 60/356 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR  ++ LDTGSD+ WLQC APC +C      ++ P+       +PC
Sbjct: 115 SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQTDHVFDPTKSRTYAGIPC 173

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C  L +PG  N      C Y++ Y DG  + G    +   F     +    R+ALG
Sbjct: 174 GAPLCRRLDSPGCSN--KNKVCQYQVSYGDGSFTFGDFSTETLTFR----RNRVTRVALG 227

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQ-----LHSQKLIRNVVGHCL----SGGGGG 214
           CG++          +G+     G   +        + + +   +   +CL    +     
Sbjct: 228 CGHDN---------EGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPS 278

Query: 215 FLFFGDDLYDSSRVVWTSMSSD---YTKYYSPGVAELFFGGETTGLK----------NLP 261
            + FGD    S    +T +  +    T YY   +     G    GL           N  
Sbjct: 279 SVIFGDSAV-SRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGG 337

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           V+ DSG+S T L R  Y  L    +  + A  LK APE      C+        + +VK 
Sbjct: 338 VIIDSGTSVTRLTRPAYIALRDAFR--IGASHLKRAPEFSLFDTCFD----LSGLTEVK- 390

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
              T+ L F          L    YLI + N G+ C          +  L++IG I
Sbjct: 391 -VPTVVLHFRGADV----SLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNI 437


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 88/312 (28%), Positives = 121/312 (38%), Gaps = 49/312 (15%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSNDL--- 100
           + T  Y     IG P +     +DTGSDL W QC   C+R  C     P Y  S      
Sbjct: 85  WATLQYVAEYLIGDPPQRAEALIDTGSDLVWTQCST-CLRKVCARQALPYYNSSASSTFA 143

Query: 101 -VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
            VPC   ICA+ +    H C+  A C     Y   G   G L  +AFAF     Q     
Sbjct: 144 PVPCAARICAA-NDDIIHFCDLAAGCSVIAGYG-AGVVAGTLGTEAFAF-----QSGTAE 196

Query: 160 LALGC-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
           LA GC  + ++   + H   G++GLG+G+ S+VSQ  + K    +  +  + G  G LF 
Sbjct: 197 LAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCLTPYFHNNGATGHLFV 256

Query: 219 GDDLYDSSRVVWTSMSSDYTK-------YYSPGVAELFFGGETTGLKNLP---------- 261
           G             M++ + K       YY P +      G T G   LP          
Sbjct: 257 GASASLGGH--GDVMTTQFVKGPKGSPFYYLPLI------GLTVGETRLPIPATVFDLRE 308

Query: 262 ---------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
                    V+ DSGS +T L    Y  L S +   L+   +   P+ +   LC   R  
Sbjct: 309 VAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDV 368

Query: 313 FKNVHDVKKCFR 324
            + V  V   FR
Sbjct: 369 GRVVPAVVFHFR 380


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 134/347 (38%), Gaps = 57/347 (16%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPHPLYRPSNDL---- 100
           +P   Y V +  G P +   L LDTGSD+TW QC   P   C     PL+ PS       
Sbjct: 83  FPFTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFAS 142

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN--- 157
           +PC  P C +    G  N      C+Y + Y DG  S G + ++ F F    G+  +   
Sbjct: 143 LPCSSPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAV 202

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
           P L  GCG+    G       GI G G+G  S+ SQL           HC +        
Sbjct: 203 PGLVFGCGHANR-GVFTSNETGIAGFGRGSLSLPSQLKVGNF-----SHCFT-------- 248

Query: 218 FGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG--GETTG---LKNLPVVFDSGSSYTY 272
                        T   +       PGVA       G   G    ++ P   +SG+S T 
Sbjct: 249 -----------TITGSKTSAVLLGLPGVAPPSASPLGRRRGSYRCRSTPRSSNSGTSITS 297

Query: 273 LNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPL-CWKG--RRPFKNVHDVKKCFRTLAL 328
           L   TY+ +    ++E +A+  L   P + T P  C+    R P  +V        T+AL
Sbjct: 298 LPPRTYRAV----REEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVP-------TMAL 346

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGN----VCLGILNGAEVGLQDLN 371
            F     R   E      +   + GN    +CL ++ G E+ L ++ 
Sbjct: 347 HFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIEGGEIILGNIQ 393


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P+       VPC 
Sbjct: 68  LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 125

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
             +C   +A     C   +  C Y ++Y +D  SS GVLV+D       + Q   +   +
Sbjct: 126 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 180

Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
             GCG  QV   S+      +G+LGLG    S+ S L S+ L  N    C    G G + 
Sbjct: 181 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 238

Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
           FGD    D  ++   V+         YY+  +  +  G ++   +    + DSG+S+T L
Sbjct: 239 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 292

Query: 274 NRVTYQTLTSIMKKEL 289
           +   Y  +TS    ++
Sbjct: 293 SDPMYTQITSSFDAQI 308


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 158/374 (42%), Gaps = 68/374 (18%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G PA    + LDTGSD+ W+QC APC RC E   P++ P    
Sbjct: 119 VSGLAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSS 177

Query: 97  SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           S   V C   +C  L + G   C+     C Y++ Y DG  + G  V +   F    G R
Sbjct: 178 SYGAVGCGAALCRRLDSGG---CDLRRGACMYQVAYGDGSVTAGDFVTETLTF--AGGAR 232

Query: 156 LNPRLALGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           +  R+ALGCG+ N+    +   L G+   G    + +S+ + +     +V    SG G  
Sbjct: 233 V-ARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAA 291

Query: 215 -------FLFFGDDLYDSSRVVWTSMSSD---YTKYY------------SPGVAELFFGG 252
                   + FG     +S   +T M  +    T YY             PGVAE     
Sbjct: 292 PGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAE----- 346

Query: 253 ETTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL-PL 305
             + L+  P      V+ DSG+S T L R +Y  L    +   +A  L+ +P   +L   
Sbjct: 347 --SDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAA-AAGGLRLSPGGFSLFDT 403

Query: 306 CWK--GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNG 362
           C+   GRR  K          T+++ F  G       L PE YLI + ++G  C     G
Sbjct: 404 CYDLGGRRVVK--------VPTVSMHFAGGAEAA---LPPENYLIPVDSRGTFCFA-FAG 451

Query: 363 AEVGLQDLNVIGGI 376
            + G   +++IG I
Sbjct: 452 TDGG---VSIIGNI 462


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/222 (31%), Positives = 97/222 (43%), Gaps = 28/222 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V +  G P   +   +DT SDL W+QC  PCV C     P++ P    S  +VPC 
Sbjct: 90  GEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQ-PCVSCYRQLDPVFNPKLSSSYAVVPCT 148

Query: 105 DPICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              CA L     H C  +D   C Y  +Y+  G + G L  D  A     G  +   +  
Sbjct: 149 SDTCAQLDG---HRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI----GGDVFHAVVF 201

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
           GC  + V G +     G++GLG+G  S+VSQL   + +     +CL        G L  G
Sbjct: 202 GCSDSSVGGPAAQA-SGLVGLGRGPLSLVSQLSVHRFM-----YCLPPPMSRTSGKLVLG 255

Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTG 256
              D + + S  V  +MSS   Y  YY   +  L  G +T G
Sbjct: 256 AGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPG 297


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 121/301 (40%), Gaps = 31/301 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y +T+ +G PA    + +DTGSD++W+QC  PC +C     PL+ P    +     C   
Sbjct: 52  YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 110

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            CA L   G + C   +QC Y + Y DG S+ G    D  A     G         GC  
Sbjct: 111 DCAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGC-- 163

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
           + V        DG++GLG G  S+VSQ  +   +     +CL  +    GFL        
Sbjct: 164 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 221

Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
            +S  V T M  SS    +Y   +  +  GG    +     +   V DSG+  T L    
Sbjct: 222 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 281

Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
           Y  L+S  K  +  K    A     L  C+     F     V     ++AL F+ G   +
Sbjct: 282 YSALSSAFKAGM--KQYPPAQPSGILDTCFD----FSGQSSVS--IPSVALVFSGGAVVS 333

Query: 338 L 338
           L
Sbjct: 334 L 334


>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
 gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 90/348 (25%), Positives = 145/348 (41%), Gaps = 51/348 (14%)

Query: 54  TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SNDL 100
           T+ +G P   + + LDTGSDL W+ CD  C RC          +    +Y P    ++  
Sbjct: 100 TVELGTPGVKFMVALDTGSDLFWVPCD--CSRCAPTHGASYASDFELSIYNPRESSTSKK 157

Query: 101 VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR--L 156
           V C + +CA       + C    + C Y + Y    +S  G+LVKD       +G R  +
Sbjct: 158 VTCNNDMCAQ-----RNRCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFV 212

Query: 157 NPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
              +  GCG  QV   S+  +   +G+ GLG  K S+ S L  + LI +    C    G 
Sbjct: 213 EAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCFGHDGI 270

Query: 214 GFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
           G + FGD    D  ++   V  +  +         V  +    E T L      FDSG+S
Sbjct: 271 GRISFGDKGSPDQEETPFNVNPAHPTYNVTVTQARVGTMLIDVEFTAL------FDSGTS 324

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL--CWKGRRPFKNVHDVKKCFRTLA 327
           +TY+    Y   + + +K  S    K  P D  +P   C+    P  N   V     +++
Sbjct: 325 FTYMVDPAY---SRVSEKFHSLARDKRRPPDPRIPFEYCYD-MSPDANASLVP----SMS 376

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           L+   G+  T+++  P   +   N+   CL ++   E+ +   N + G
Sbjct: 377 LTMKGGRHFTVYD--PIIVISTQNEIVYCLAVVKSTELNIIGQNFMTG 422


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P+       VPC 
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLQSPNYGSLKFDVYSPAQSTTSRKVPCS 162

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
             +C   +A     C   +  C Y ++Y +D  SS GVLV+D       + Q   +   +
Sbjct: 163 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 217

Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
             GCG  QV   S+      +G+LGLG    S+ S L S+ L  N    C    G G + 
Sbjct: 218 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 275

Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
           FGD    D  ++   V+         YY+  +  +  G ++   +    + DSG+S+T L
Sbjct: 276 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 329

Query: 274 NRVTYQTLTSIMKKEL 289
           +   Y  +TS    ++
Sbjct: 330 SDPMYTQITSSFDAQI 345


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 106/388 (27%), Positives = 152/388 (39%), Gaps = 60/388 (15%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM+  S + +   L +   + +    + +  P   Y + + IG P +P  L LDTGSDL 
Sbjct: 56  RMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLV 115

Query: 76  WLQCDAPCVRCVEAPHPLY---RPSNDLVPCEDPICASLHAPGHHNC--EDPAQCDYELE 130
           W QC  PC  C     P Y   R S   +P  D     L  P    C  +    C +   
Sbjct: 116 WTQCQ-PCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLD-PSVTMCVNQTVQTCAFSYS 173

Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
           Y D  +++G L  D    ++  G  + P +  GCG N   G       GI G G+G  S+
Sbjct: 174 YGDKSATIGFL--DVETVSFVAGASV-PGVVFGCGLNNT-GIFRSNETGIAGFGRGPLSL 229

Query: 191 VSQLHSQKLIRNVVGHCLSGGGG----GFLF-FGDDLYDSSRVVWTSMSSDYTKYYS-PG 244
            SQL           HC +   G      LF    DLY + R   T  ++   K  + P 
Sbjct: 230 PSQLKVGNF-----SHCFTAVSGRKPSTVLFDLPADLYKNGR--GTVQTTPLIKNPAHPT 282

Query: 245 VAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTLTSIMKKELS 290
              L   G T G   LPV              + DSG+++T L    Y+    ++  E +
Sbjct: 283 FYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYR----LVHDEFA 338

Query: 291 AK-SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
           A   L   P +ET PL      P      V K    L L F +G T     L  E Y+  
Sbjct: 339 AHVKLPVVPSNETGPLLCFSAPPLGKAPHVPK----LVLHF-EGAT---MHLPRENYVFE 390

Query: 350 SNKG---NVCLGILNGAEVGLQDLNVIG 374
           +  G   ++CL I+ G      ++ +IG
Sbjct: 391 AKDGGNCSICLAIIEG------EMTIIG 412


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 140/294 (47%), Gaps = 46/294 (15%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           S + SS +F  + ++    +  ++ PTG  Y VT+ +G P + + L  DTGSDLTW QC+
Sbjct: 114 SMNPSSGVFKEMQTT----IPASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCE 169

Query: 81  APCV-RCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCED--PAQCDYELEYAD 133
            PC+  C     P + P+       V C    C  L A G++  +D     C Y ++Y  
Sbjct: 170 -PCLGGCFPQNQPKFDPTTSTSYKNVSCSSEFC-KLIAEGNYPAQDCISNTCLYGIQYGS 227

Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
            G ++G L  +  A   ++  +       GC  ++    +++   G+LGLG+   ++ SQ
Sbjct: 228 -GYTIGFLATETLAIASSDVFK---NFLFGC--SEESRGTFNGTTGLLGLGRSPIALPSQ 281

Query: 194 LHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG 251
             ++   +N+  +CL  S    G L FG ++  +++          +   SP + +L +G
Sbjct: 282 TTNK--YKNLFSYCLPASPSSTGHLSFGVEVSQAAK----------STPISPKLKQL-YG 328

Query: 252 GETTGL----KNLPV-------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
             T G+    + LP+       + DSG+++T+L   TY  L S  ++ ++  +L
Sbjct: 329 LNTVGISVRGRELPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTL 382


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 151/349 (43%), Gaps = 60/349 (17%)

Query: 54  TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SNDL 100
           T+ +G P   + + LDTGSDL W+ CD  C +C          E    +Y P    +N  
Sbjct: 110 TVKLGTPGMRFMVALDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNKK 167

Query: 101 VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQRL 156
           V C + +CA       + C    + C Y + Y    +S  G+L++D         N +R+
Sbjct: 168 VTCNNSLCAQ-----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERV 222

Query: 157 NPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
              +  GCG  QV   S+  +   +G+ GLG  K S+ S L  + L+ +    C    G 
Sbjct: 223 EAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGV 280

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYTY 272
           G + FGD           +++  +   Y+  V  +  G  TT + +    +FD+G+S+TY
Sbjct: 281 GRISFGDKGSSDQEETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFTY 337

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFRTLA 327
           L    Y T++     +  A+  + +P+          R PF+  +D+          +L+
Sbjct: 338 LVDPMYTTVSESFHSQ--AQDKRHSPD---------SRIPFEYCYDMSNDANASLIPSLS 386

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
           L+       T+     +  ++IS +G +  CL I+  +E     LN+IG
Sbjct: 387 LTMKGNSHFTI----NDPIIVISTEGELVYCLAIVKSSE-----LNIIG 426


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 23/252 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y +T+ +G PA    + +DTGSD++W+QC  PC +C     PL+ P    +     C   
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            CA L   G + C   +QC Y + Y DG S+ G    D  A     G         GC  
Sbjct: 187 DCAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGC-- 239

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
           + V        DG++GLG G  S+VSQ  +   +     +CL  +    GFL        
Sbjct: 240 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297

Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
            +S  V T M  SS    +Y   +  +  GG    +     +   V DSG+  T L    
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 357

Query: 278 YQTLTSIMKKEL 289
           Y  L+S  K  +
Sbjct: 358 YSALSSAFKAGM 369


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 23/252 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y +T+ +G PA    + +DTGSD++W+QC  PC +C     PL+ P    +     C   
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 256

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            CA L   G + C   +QC Y + Y DG S+ G    D  A     G         GC  
Sbjct: 257 DCAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGC-- 309

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
           + V        DG++GLG G  S+VSQ  +   +     +CL  +    GFL        
Sbjct: 310 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 367

Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
            +S  V T M  SS    +Y   +  +  GG    +     +   V DSG+  T L    
Sbjct: 368 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 427

Query: 278 YQTLTSIMKKEL 289
           Y  L+S  K  +
Sbjct: 428 YSALSSAFKAGM 439


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 123/286 (43%), Gaps = 47/286 (16%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
           Y V + +G PA    L +DTGSD++W+QC  PC  CV A  P + P +      +PC   
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 197

Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP----RLA 161
            C +++      C    + C + ++Y DG  S G+L  +  A N  N     P     + 
Sbjct: 198 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNIT 257

Query: 162 LGCG---YNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GG 212
           LGC       +P GAS     G+LG+ +   S  SQL S+   +    HC          
Sbjct: 258 LGCADIDREGLPTGAS-----GLLGMDRRPISFPSQLSSRYARK--FSHCFPDKIAHLNS 310

Query: 213 GGFLFFGDDLYDSSRVVWT------SMSSDYTKYYSPGVAELFFGGETTGL--KNLPV-- 262
            G +FFG+    S  + +T      ++ S    YY  G+  +        L  KN  +  
Sbjct: 311 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDK 370

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
                  + DSG+++TYL +  +Q     M++E  A++   A  D+
Sbjct: 371 VTGSGGTIIDSGTAFTYLKKPAFQA----MRREFLARTSHLAKVDD 412


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 80/164 (48%), Gaps = 12/164 (7%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
           Y + + IG P    +   DTGSDL WLQC  PC  C +  +P++   +      + C   
Sbjct: 59  YLMELSIGTPPVKIYAQADTGSDLIWLQC-IPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117

Query: 107 ICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGC 164
            C+ L++    +C  D   C Y   Y DG  + GVL ++      T G+ +  + +  GC
Sbjct: 118 SCSKLYS---TSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGC 174

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
           G+N   GA      GI+GLG+G  S+VSQ+ S  L  N+   CL
Sbjct: 175 GHNN-NGAFNDKEMGIIGLGRGPLSLVSQIGS-SLGGNMFSQCL 216


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/281 (29%), Positives = 126/281 (44%), Gaps = 38/281 (13%)

Query: 21  SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           SS S++S      GS +   V G    +G Y V + +G P R  ++ +D+GSD+ W+QC 
Sbjct: 16  SSGSTASYGVEDFGSEV---VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCK 72

Query: 81  APCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
            PC +C     PL+ P++      V C   +C  +   G ++     +C YE+ Y DG S
Sbjct: 73  -PCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNS----GRCRYEVSYGDGSS 127

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLH 195
           + G L  +      T G+ +   +A+GCG+ NQ     +    G+LGLG G  S V QL 
Sbjct: 128 TKGTLALETL----TLGRTVVQNVAIGCGHMNQ---GMFVGAAGLLGLGGGSMSFVGQLS 180

Query: 196 SQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFF 250
            ++   N   +CL        GFL FG +        W  +  +     YY  G++ L  
Sbjct: 181 RER--GNAFSYCLVSRVTNSNGFLEFGSEAMPVG-AAWIPLIRNPHSPSYYYIGLSGLGV 237

Query: 251 GG----------ETTGLKNLPVVFDSGSSYTYLNRVTYQTL 281
           G           E T L N  VV D+G++ T    V Y+  
Sbjct: 238 GDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAF 278


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 151/350 (43%), Gaps = 60/350 (17%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SND 99
            T+ +G P   + + LDTGSDL W+ CD  C +C          E    +Y P    +N 
Sbjct: 107 TTVKLGTPGMRFMVALDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKISTTNK 164

Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQR 155
            V C + +CA       + C    + C Y + Y    +S  G+L++D         N +R
Sbjct: 165 KVTCNNSLCAQ-----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER 219

Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +   +  GCG  QV   S+  +   +G+ GLG  K S+ S L  + L+ +    C    G
Sbjct: 220 VEAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDG 277

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYT 271
            G + FGD           +++  +   Y+  V  +  G  TT + +    +FD+G+S+T
Sbjct: 278 VGRISFGDKGSSDQEETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFT 334

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFRTL 326
           YL    Y T++     +  A+  + +P+          R PF+  +D+          +L
Sbjct: 335 YLVDPMYTTVSESFHSQ--AQDKRHSPD---------SRIPFEYCYDMSNDANASLIPSL 383

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
           +L+       T+     +  ++IS +G +  CL I+  +E     LN+IG
Sbjct: 384 SLTMKGNSHFTI----NDPIIVISTEGELVYCLAIVKSSE-----LNIIG 424


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 71/227 (31%), Positives = 97/227 (42%), Gaps = 26/227 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V + IG P   +   +DT SDL WLQC  PCV C     P++ P    S  +VPC 
Sbjct: 86  GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ-PCVSCYRQLDPIFNPRLSSSYAVVPCS 144

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              C+ L   GH   ED  Q C Y  +Y+    + G L  D  A     G  +   + LG
Sbjct: 145 SDTCSQLD--GHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV----GGNVFHAVVLG 198

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL-------IRNVVGHCLSGGGGGFL 216
           C  + V G       G++GL +G  S++SQL  ++        +    G  + G G G  
Sbjct: 199 CSDSSVGGPPPQ-ASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSRTPGKLVLGAGAG-- 255

Query: 217 FFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKNLP 261
              D + + S  V  +MSS   Y  YY      L  G +T G    P
Sbjct: 256 --ADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRP 300


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 161/368 (43%), Gaps = 74/368 (20%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G PA P  + LDTGSD+ WLQC APC RC E    ++ P    S + V C
Sbjct: 137 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYEQSGQVFDPRRSRSYNAVGC 195

Query: 104 EDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
             P+C  L + G   C+   + C Y++ Y DG  + G    +   F    G R+  R+AL
Sbjct: 196 AAPLCRRLDSGG---CDLRRSACLYQVAYGDGSVTAGDFATETLTF--AGGARV-ARVAL 249

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--------SGGGGG 214
           GCG++      +    G+LGLG+G  S  +Q+ S++  R+   +CL        +     
Sbjct: 250 GCGHDNE--GLFVAAAGLLGLGRGSLSFPTQI-SRRYGRS-FSYCLVDRTSSANTASRSS 305

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFF----------GGETTGLKNLP--- 261
            + FG      S  V ++++S +T        E F+          G    G+ N     
Sbjct: 306 TVTFG------SGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRL 359

Query: 262 --------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
                   V+ DSG+S T L R  Y  L    +   +A  L+ +P   +L         F
Sbjct: 360 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRG--AAAGLRLSPGGFSL---------F 408

Query: 314 KNVHDV--KKCFR--TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQ 368
              +D+  +K  +  T+++ F  G       L PE YLI + +KG  C     G + G  
Sbjct: 409 DTCYDLSGRKVVKVPTVSMHFAGGAEAA---LPPENYLIPVDSKGTFCFA-FAGTDGG-- 462

Query: 369 DLNVIGGI 376
            +++IG I
Sbjct: 463 -VSIIGNI 469


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 97/354 (27%), Positives = 145/354 (40%), Gaps = 52/354 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + +++G P + + L LDTGSDL W+QC  PC+ C E   P Y P +      + C
Sbjct: 192 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISC 250

Query: 104 EDPICASLHAPGHHN-CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NGQ---RL 156
            DP C  + +P   N C+   Q C Y   Y DG ++ G    + F  N T  NG+   + 
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310

Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
              +  GCG +N+     +H   G+LGLGKG  S  SQ+  Q L      +CL     + 
Sbjct: 311 VENVMFGCGHWNR---GLFHGAAGLLGLGKGPLSFASQM--QSLYGQSFSYCLVDRNSNA 365

Query: 211 GGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
                L FG+D  L     + +TS           +Y   +  +    E   +       
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHL 425

Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
                   + DSG++ TY     Y+ +     KE   + +K     E LP      +P  
Sbjct: 426 SSEGAGGTIIDSGTTLTYFAEPAYEII-----KEAFVRKIKGYELVEGLPPL----KPCY 476

Query: 315 NVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
           NV  ++K       + F DG    ++    E Y I  +   VCL IL      L
Sbjct: 477 NVSGIEKMELPDFGILFADG---AVWNFPVENYFIQIDPDVVCLAILGNPRSAL 527


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 138/352 (39%), Gaps = 55/352 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSNDLV----PCE 104
           Y +T+ +G PA    + +DTGSD++W+QC APC    C      L+ P+         C 
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQC-APCAAQSCSSQKDKLFDPAKSATYSAFSCS 188

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              CA L   G + C + + C Y ++Y D  ++ G    D      ++  +       GC
Sbjct: 189 SAQCAQLGGEG-NGCLN-SHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVK---NFQFGC 243

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDD 221
            +          LDG++GLG    S+VSQ  +         +CL   S   GGFL  G  
Sbjct: 244 SHRA--NGFVGQLDGLMGLGGDTESLVSQ--TAATYGKAFSYCLPPSSSSAGGFLTLGAA 299

Query: 222 L--YDSSRVVWTSMSSDYTKYYSPGVAELFFGGET-TGLK-NLPV-------VFDSGSSY 270
                SSR   T +     ++  P    +F    T  G K N+P        V DSG+  
Sbjct: 300 AGGTSSSRYSRTPL----VRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDSGTVI 355

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW------KGRRPFKNVH------- 317
           T L    YQ L +  KKE+  K+   A     L  C+        R P   +        
Sbjct: 356 TQLPPTAYQALRTAFKKEM--KAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRGAVM 413

Query: 318 --DVKKCFRTLALSFT----DGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
             DV   F    L+FT    DG T  L  +    + ++ + G   LG   GA
Sbjct: 414 DLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGA 465


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P+       VPC 
Sbjct: 82  LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 139

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
             +C   +A     C   +  C Y ++Y +D  SS GVLV+D       + Q   +   +
Sbjct: 140 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 194

Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
             GCG  QV   S+      +G+LGLG    S+ S L S+ L  N    C    G G + 
Sbjct: 195 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 252

Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
           FGD    D  ++   V+         YY+  +  +  G ++   +    + DSG+S+T L
Sbjct: 253 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 306

Query: 274 NRVTYQTLTSIMKKEL 289
           +   Y  +TS    ++
Sbjct: 307 SDPMYTQITSSFDAQI 322


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 133/352 (37%), Gaps = 63/352 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LV 101
           T  Y     IG P +     +DTGS+L W QC   C    C +   P Y  S       V
Sbjct: 81  TRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAV 140

Query: 102 PCED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
           PC D   +CA   A G H C     C +   Y   GS  G L  +AF F     Q    +
Sbjct: 141 PCADSAKLCA---ANGVHLCGLDGSCTFAASYG-AGSVFGSLGTEAFTF-----QSGAAK 191

Query: 160 LALGC-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
           L  GC    ++   + +   G++GLG+G+ S+VSQ  + K    +  +  + G    LF 
Sbjct: 192 LGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFV 251

Query: 219 GDDLYDS------SRVVWTSMSSDY---TKYYSPGVAELFFGGETTGLKNLP-------- 261
           G     S      + + +     DY   T YY P V      G + G   LP        
Sbjct: 252 GASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLV------GISVGETKLPIPSAAFEL 305

Query: 262 -----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                      V+ D+GS  T L    Y  L+  + ++L+ +SL + P D  L LC    
Sbjct: 306 RRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLN-RSLVQPPADTGLDLCVA-- 362

Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
                  DV K    L   F  G       ++  +Y    +K   C+ I  G
Sbjct: 363 -----RQDVDKVVPVLVFHFGGGAD---MAVSAGSYWGPVDKSTACMLIEEG 406


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 87/303 (28%), Positives = 123/303 (40%), Gaps = 54/303 (17%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQ-VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           RM++ S + S+  L     S+ +    + +  P   Y V M IG P +P  L LDTGSDL
Sbjct: 75  RMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDL 134

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ----CD 126
           TW QC APCV C     P + PS  +    +PC+  IC  L      +C + +     C 
Sbjct: 135 TWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLT---WSSCGEQSWGNGICV 190

Query: 127 YELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILG 182
           Y   YAD   + G L  D F+F   ++  G    P L  GCG +N   G       GI G
Sbjct: 191 YAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNN--GIFVSNETGIAG 248

Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-----FLFFGDDLYDSSR-----VVWTS 232
             +G  S+ +QL           +C +   G      FL    +LY  +      VV   
Sbjct: 249 FSRGALSMPAQLKVDNF-----SYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV--- 300

Query: 233 MSSDYTKYYSPGVAELFFG--GETTGLKNLPV---------------VFDSGSSYTYLNR 275
            S+   +Y+S  +   +    G T G   LP+               + DSG+  T L  
Sbjct: 301 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360

Query: 276 VTY 278
             Y
Sbjct: 361 AVY 363


>gi|168025647|ref|XP_001765345.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683398|gb|EDQ69808.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 879

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 84/292 (28%), Positives = 134/292 (45%), Gaps = 37/292 (12%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC- 79
           S+  SS  FN +  + +F +   V   +  ++V M +G P + +   +DTGS  TW+ C 
Sbjct: 197 STRGSSLPFNFLYYTCVFGIGPRVLMESEEFHVEMKLGVPPKKFHFHMDTGSRDTWVYCQ 256

Query: 80  -----DAPCVRCVEAPHPLYRPSND--LVPC---EDPICASLHAPGHH-NCEDPAQCDYE 128
                D P +     P+  + P ++   + C      +C+      H  N  D   C  +
Sbjct: 257 VSRNLDEPPIEL--GPNGKFEPRDESSYIQCIGHTASLCSEYQYEPHLCNSVDKYHCVND 314

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGK 185
           L YAD  +  GVLV ++   +  +   ++      C    +  AS HP    DGI+GLG 
Sbjct: 315 LNYADDSTYSGVLVNESLMVSTIDNSDMDAMGLFWC----INEAS-HPFTGTDGIIGLGN 369

Query: 186 GKSSIVSQLHSQKLI-RNVVGHCLSGGGG--GFLFFGDDL---YDSSRVVW---TSMSSD 236
            K ++  Q  + K+I +NV+G CL+ G G  G++  G +    ++ S  VW   T MSS 
Sbjct: 370 CKKTLGDQWTTNKVISQNVLGVCLAKGPGPVGYISLGVNFKKKFEESTSVWSKLTPMSSA 429

Query: 237 YTKYYSPGVAELFFGGET---TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIM 285
               YS  +A + F  +T   T   NL   FD+GS   YL  V Y+ L  ++
Sbjct: 430 GECAYSSPLASISFHDKTFVFTSETNLG--FDTGSDMMYLEAVIYEPLLDML 479


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 87/303 (28%), Positives = 123/303 (40%), Gaps = 54/303 (17%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQ-VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           RM++ S + S+  L     S+ +    + +  P   Y V M IG P +P  L LDTGSDL
Sbjct: 49  RMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDL 108

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ----CD 126
           TW QC APCV C     P + PS  +    +PC+  IC  L      +C + +     C 
Sbjct: 109 TWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLT---WSSCGEQSWGNGICV 164

Query: 127 YELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILG 182
           Y   YAD   + G L  D F+F   ++  G    P L  GCG +N   G       GI G
Sbjct: 165 YAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNN--GIFVSNETGIAG 222

Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-----FLFFGDDLYDSSR-----VVWTS 232
             +G  S+ +QL           +C +   G      FL    +LY  +      VV   
Sbjct: 223 FSRGALSMPAQLKVDNF-----SYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV--- 274

Query: 233 MSSDYTKYYSPGVAELFFG--GETTGLKNLPV---------------VFDSGSSYTYLNR 275
            S+   +Y+S  +   +    G T G   LP+               + DSG+  T L  
Sbjct: 275 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 334

Query: 276 VTY 278
             Y
Sbjct: 335 AVY 337


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 154/372 (41%), Gaps = 59/372 (15%)

Query: 37  LLFQVHGNVYPTG-----YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
           L F   G + PTG      Y   + +G P   + + LDTGSDL W+ CD  C+ C  AP 
Sbjct: 189 LSFSKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCD--CIEC--APL 244

Query: 92  PLYRPSND-----LVPCEDPICASLHAPGHH-------NCEDPAQ-CDYELEY-ADGGSS 137
             Y  S D       P E     S H P  H       +C +  Q C Y  +Y  +  +S
Sbjct: 245 SGYHGSLDRDLGIYKPAES--TTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTS 302

Query: 138 LGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQ 193
            G+LV+D    +       +   + +GCG  Q    SY      DG+LGLG    S+ S 
Sbjct: 303 SGLLVEDILHLDSRESHAPVKASVIIGCGRKQ--SGSYLDGIAPDGLLGLGMADISVPSF 360

Query: 194 LHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYT------KYYSPGVAE 247
           L    L+RN    C +   G  +FFGD      + V T  S+ +       + Y+  V +
Sbjct: 361 LARAGLVRNSFSMCFTKDSGR-IFFGD------QGVSTQQSTPFVPLYGKLQTYTVNVDK 413

Query: 248 LFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
              G +     +   + DSG+S+T L    Y+ +     K+++A  L +  E  +   C+
Sbjct: 414 SCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQ--EATSFDYCY 471

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV---CLGILNGAE 364
               P      V     T+ L+F   K+   F+     +L+   +G V   CL ++   E
Sbjct: 472 SA-SPL-----VMPDVPTVTLTFAGNKS---FQPVNPTFLLHDEEGAVAGFCLAVVQSPE 522

Query: 365 -VGLQDLNVIGG 375
            +G+   N + G
Sbjct: 523 PIGIIAQNFLLG 534


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 139/327 (42%), Gaps = 38/327 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y VTM +G       +D  TGSDLTW+QC+ PC+ C     P+++P    S   V C   
Sbjct: 65  YIVTMGLGSTNMTVIID--TGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 107 ICASLH-APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            C SL  A G+      +P+ C+Y + Y DG  + G L  +  +F    G         G
Sbjct: 122 TCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF----GGVSVSDFVFG 177

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
           CG N      +  + G++GLG+   S+VSQ ++      V  +CL     G  G L  G+
Sbjct: 178 CGRNN--KGLFGGVSGLMGLGRSYLSLVSQTNAT--FGGVFSYCLPTTESGASGSLVMGN 233

Query: 221 D---LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTY 272
           +     + + + +T M  +   + +Y   +  +   G   +     N  V+ DSG+  T 
Sbjct: 234 ESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITR 293

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    Y+ L ++  K+ +      AP    L  C+          +V     T+++ F +
Sbjct: 294 LPSSVYKALKALFLKQFTG--FPSAPGFSILDTCFN----LTGYDEVS--IPTISMHF-E 344

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
           G      + T   Y++  +   VCL +
Sbjct: 345 GNAELKVDATGTFYVVKEDASQVCLAL 371


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 123/286 (43%), Gaps = 47/286 (16%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
           Y V + +G PA    L +DTGSD++W+QC  PC  CV A  P + P +      +PC   
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 196

Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP----RLA 161
            C +++      C    + C + ++Y DG  S G+L  +  A N  N     P     + 
Sbjct: 197 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNIT 256

Query: 162 LGCG---YNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GG 212
           LGC       +P GAS     G+LG+ +   S  SQL S+   +    HC          
Sbjct: 257 LGCADIDREGLPTGAS-----GLLGMDRRPISFPSQLSSRYARK--FSHCFPDKIAHLNS 309

Query: 213 GGFLFFGDDLYDSSRVVWT------SMSSDYTKYYSPGVAELFFGGETTGL--KNLPV-- 262
            G +FFG+    S  + +T      ++ S    YY  G+  +        L  KN  +  
Sbjct: 310 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDK 369

Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
                  + DSG+++TYL +  +Q     M++E  A++   A  D+
Sbjct: 370 VTGSGGTIIDSGTAFTYLKKPAFQA----MRREFLARTSHLAKVDD 411


>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
 gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
          Length = 492

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 77/271 (28%), Positives = 121/271 (44%), Gaps = 26/271 (9%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRC--VEAPH--PLYRPSNDLVPCEDPICASLH 112
           IG P   + + LD+GSDL W+ CD  CV+C  + A H   L R  ++  P +      L 
Sbjct: 104 IGTPHVSFMVALDSGSDLFWVPCD--CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLS 161

Query: 113 APGHH------NCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRLA--- 161
              H       NC++P Q C Y + Y  +  SS G+LV+D           LN  +    
Sbjct: 162 C-SHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPV 220

Query: 162 -LGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
            +GCG  Q  G      P DG+LGLG  + S+ S L    LI+N    C +    G +FF
Sbjct: 221 IIGCGMKQSGGYLDGVAP-DGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIFF 279

Query: 219 GDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVT 277
           GD    + +   +  ++ +YT Y   GV     G       +   + DSG+S+T+L    
Sbjct: 280 GDQGPATQQSAPFLKLNGNYTTYIV-GVEVCCVGTSCLKQSSFSALVDSGTSFTFLPDDV 338

Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           ++ +      +++A   + + E  +   C+K
Sbjct: 339 FEMIAEEFDTQVNAS--RSSFEGYSWKYCYK 367


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 147/387 (37%), Gaps = 68/387 (17%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM+  S + ++  L +   + +    + N  PT  Y V + IG P +P  L LDTGSDL 
Sbjct: 47  RMALRSKARAARRLSSSASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLI 106

Query: 76  WLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-----QCD 126
           W QC  PC  C +   P + P    +  L  C+  +C  L      +C  P       C 
Sbjct: 107 WTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCV 162

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
           Y   Y D   + G L  D F F         P +A GCG +N   G       GI G G+
Sbjct: 163 YTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGCGLFNN--GVFKSNETGIAGFGR 218

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDDLYDSSRVVWTSM-----SS 235
           G  S+ SQL           HC +   G       L    DLY S R    S       +
Sbjct: 219 GPLSLPSQLKVGNF-----SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPA 273

Query: 236 DYTKYYSPGVAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTL 281
           + T YY      L   G T G   LPV              + DSG++ T L    Y+ +
Sbjct: 274 NPTFYY------LSLKGITVGSTRLPVPESEFTLKNGTGGTIIDSGTAMTSLPTRVYRLV 327

Query: 282 TSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFEL 341
                 ++    +     D    L      P +    V K    L L F +G T    +L
Sbjct: 328 RDAFAAQVKLPVVSGNTTDPYFCL----SAPLRAKPYVPK----LVLHF-EGAT---MDL 375

Query: 342 TPEAYLI-ISNKGN--VCLGILNGAEV 365
             E Y+  + + G+  +CL I+ G EV
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGEV 402


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 90/212 (42%), Gaps = 33/212 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V + +G P   +   +DT SDL W QC  PCV+C +   P++ P    S  +VPC 
Sbjct: 86  GEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQ-PCVKCYKQLDPVFNPVASTSYAVVPCN 144

Query: 105 DPICASLHAPGHHNC------EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
              C  L     H C      +D   C Y   Y    ++ G+L  D  A     G  +  
Sbjct: 145 SDTCDELDT---HRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAI----GDDVFR 197

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGF 215
            +  GC  + V G     + G++GLG+G  S+VSQL  ++ +     +CL        G 
Sbjct: 198 GVVFGCSSSSVGGPPPQ-VSGVVGLGRGALSLVSQLSVRRFM-----YCLPPPVSRSAGR 251

Query: 216 LFFGDDLYDSSR------VVWTSMSSDYTKYY 241
           L  G D   + R      VV  S  S Y  YY
Sbjct: 252 LVLGADAAATVRNASERVVVPMSTGSRYPSYY 283


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 101/388 (26%), Positives = 156/388 (40%), Gaps = 57/388 (14%)

Query: 4   SHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQ--VHGNVYPTGYYNVTMYIGQPA 61
           + + E + F   R+++  S+S+S++     G SL+      G    +G Y V + +G PA
Sbjct: 58  TKDEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPA 117

Query: 62  RPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---------NDLVPCEDPICASLH 112
           + + + +DTGS L+WLQC    + C     P++ PS              C     ++L+
Sbjct: 118 KYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLN 177

Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGA 172
           APG  N      C Y+  Y D   S+G L +D      T     +     GCG +     
Sbjct: 178 APGCSNAT--GACVYKASYGDTSFSIGYLSQDVLTL--TPSAAPSSGFVYGCGQDN--QG 231

Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--------SGGGGGFLFFGDDLYD 224
            +    GI+GL   K S++ QL ++    N   +CL        +    GFL  G     
Sbjct: 232 LFGRSAGIIGLANDKLSMLGQLSNK--YGNAFSYCLPSSFSAQPNSSVSGFLSIGASSLS 289

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETT--------GLK----NLPVVFDSGSSYTY 272
           SS   +T +  +      P +  L+F G TT        G+     N+P + DSG+  T 
Sbjct: 290 SSPYKFTPLVKN------PKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITR 343

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFRTLALSFT 331
           L    Y  L       +S K   +AP    L  C+KG  +    V +++  FR  A    
Sbjct: 344 LPVAIYNALKKSFVMIMS-KKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGA---- 398

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGI 359
                   EL     L+   KG  CL I
Sbjct: 399 ------GLELKVHNSLVEIEKGTTCLAI 420


>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 537

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/288 (27%), Positives = 119/288 (41%), Gaps = 37/288 (12%)

Query: 29  LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
           L      +L F++ G+++   Y  V   +G P   + + LDTGSDL W+ CD  C +C  
Sbjct: 90  LLTFASGNLTFRLEGSLH---YAEVA--VGTPNATFLVALDTGSDLFWVPCD--CKQCAP 142

Query: 89  APH-------PLYRP-------SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADG 134
             +       P  RP       ++  V CE  +C   +A           C Y + Y   
Sbjct: 143 IANASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAG-NSSTSCPYTVRYVSA 201

Query: 135 G-SSLGVLVKDAFAFNYTNG----QRLNPRLALGCGYNQ----VPGASYHPLDGILGLGK 185
             SS GVLV+D    +          +   + LGCG  Q    + GA+   +DG+LGLG 
Sbjct: 202 NTSSSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAA---VDGLLGLGM 258

Query: 186 GKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
            K S+ S LH+  L+  +    C S  G G + FGD           ++ + +  Y    
Sbjct: 259 DKVSVPSVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISV 318

Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
            A    G E         + DSG+S+TYLN   Y  L +    E+  +
Sbjct: 319 TAMSVSGKEVAA--EFAAIVDSGTSFTYLNDPAYTELATGFNSEVRER 364


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 95/340 (27%), Positives = 140/340 (41%), Gaps = 49/340 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y +   IG+P    + ++DTGSDL W++C +PC  C   P PLY P    S+  +PC 
Sbjct: 85  GKYIMQFSIGEPPLLIWAEVDTGSDLMWVKC-SPCNGCNPPPSPLYDPARSRSSGKLPCS 143

Query: 105 DPICASL---HAPGHHNCEDPAQCDYELEYADGG--SSLGVLVKDAFAFNYTNGQRLNPR 159
             +C +L           +DP  C Y   Y   G  S+ GVL  + F F   +G   N  
Sbjct: 144 SQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFG--DGYVAN-N 200

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR------NVVGHCLSGGGG 213
           ++ G   + + G+ +    G++GLG+G  S+VSQL + +         NV    L G   
Sbjct: 201 VSFGRS-DTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSLA 259

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------VV 263
                  D+  SS  + T+   D   +Y   +  +  GG    +K+            V 
Sbjct: 260 ALDTSAGDV--SSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVF 317

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
           FDSG+  T L    YQ +   +  E+  + L     D+T   C+       N   V +  
Sbjct: 318 FDSGAIDTSLKDAAYQVVRQAITSEI--QRLGYDAGDDT---CFVA----ANQQAVAQ-M 367

Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGN----VCLGI 359
             L L F DG       L    YL  S KG     VC+ I
Sbjct: 368 PPLVLHFDDGAD---MSLNGRNYLKTSTKGPSEVLVCMAI 404


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 147/387 (37%), Gaps = 68/387 (17%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM+  S + ++  L +   + +    + N  PT  Y V + IG P +P  L LDTGSDL 
Sbjct: 47  RMALRSKARAARRLSSSASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLI 106

Query: 76  WLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-----QCD 126
           W QC  PC  C +   P + P    +  L  C+  +C  L      +C  P       C 
Sbjct: 107 WTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCV 162

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
           Y   Y D   + G L  D F F         P +A GCG +N   G       GI G G+
Sbjct: 163 YTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGCGLFNN--GVFKSNETGIAGFGR 218

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDDLYDSSRVVWTSM-----SS 235
           G  S+ SQL           HC +   G       L    DLY S R    S       +
Sbjct: 219 GPLSLPSQLKVGNF-----SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPA 273

Query: 236 DYTKYYSPGVAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTL 281
           + T YY      L   G T G   LPV              + DSG++ T L    Y+ +
Sbjct: 274 NPTFYY------LSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYRLV 327

Query: 282 TSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFEL 341
                 ++    +     D    L      P +    V K    L L F +G T    +L
Sbjct: 328 RDAFAAQVKLPVVSGNTTDPYFCL----SAPLRAKPYVPK----LVLHF-EGAT---MDL 375

Query: 342 TPEAYLI-ISNKGN--VCLGILNGAEV 365
             E Y+  + + G+  +CL I+ G EV
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGEV 402


>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 520

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 155/369 (42%), Gaps = 61/369 (16%)

Query: 37  LLFQVHGNVYPT-----GYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           LLF  HG+   +     G+ + T   IG P+  + + LD GSDL W+ CD  CV+C    
Sbjct: 76  LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLS 133

Query: 91  HPLY----RPSNDLVPCEDPICASLHAPGHH-------NCEDPAQ-CDYELEY-ADGGSS 137
              Y    R  N+  P      +S H    H       NC+   Q C Y + Y ++  SS
Sbjct: 134 SSYYSNLDRDLNEYSPSRS--LSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSS 191

Query: 138 LGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIV 191
            G+LV+D        + +N     P + LGCG  Q  G      P DG+LGLG G+SS+ 
Sbjct: 192 SGLLVEDILHLQSGGSLSNSSVQAP-VVLGCGMKQSGGYLDGVAP-DGLLGLGPGESSVP 249

Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFGDD---LYDSSRVVWTSMSSDYTKYYSPGVAEL 248
           S L    LI +    C +    G +FFGD    +  S+  +   +   Y+ Y   GV   
Sbjct: 250 SFLAKSGLIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFL--PLDGLYSTYII-GVESC 306

Query: 249 FFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK---------------S 293
             G     + +  V  DSG+S+T+L    Y  +     ++++                 S
Sbjct: 307 CVGNSCLKMTSFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPS 366

Query: 294 LKEAPEDETLPLCWKGRRPFKNVHDVKKCFR--------TLALSFTDGKTRTLFELTPEA 345
            +E P+  +L L ++    F  V+D    F          LA+  T+G   T+ +     
Sbjct: 367 SQELPKVPSLTLTFQQNNSFV-VYDPVFVFYGNEGVIGFCLAIQPTEGDMGTIGQNFMTG 425

Query: 346 YLIISNKGN 354
           Y ++ ++GN
Sbjct: 426 YRLVFDRGN 434


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 73/253 (28%), Positives = 111/253 (43%), Gaps = 33/253 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P R  ++ +D+GSD+ W+QC  PC +C     P++ P++      V C
Sbjct: 137 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCTQCYHQSDPVFDPADSASFTGVSC 195

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  L   G H      +C YE+ Y DG  + G L  +   F    G+ +   +A+G
Sbjct: 196 SSSVCDRLENAGCH----AGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRSVAIG 247

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---GGFLFFGD 220
           CG+       +    G+LGLG G  S V QL  Q        +CL   G    G L FG 
Sbjct: 248 CGHRNR--GMFVGAAGLLGLGGGSMSFVGQLGGQT--GGAFSYCLVSRGTDSSGSLVFGR 303

Query: 221 DLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG----------ETTGLKNLPVVFDSGS 268
           +   +    W  +  +     +Y  G+A L  GG            T L +  VV D+G+
Sbjct: 304 EALPAG-AAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGT 362

Query: 269 SYTYLNRVTYQTL 281
           + T L  + YQ  
Sbjct: 363 AVTRLPTLAYQAF 375


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 106/393 (26%), Positives = 163/393 (41%), Gaps = 58/393 (14%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQ 78
           SS + S  S  ++    L+  +   V   +G Y + ++IG P + + L LDTGSDL W+Q
Sbjct: 164 SSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQ 223

Query: 79  CDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYA 132
           C  PC  C E   P Y P + +    + C DP C  + +P     C+   Q C Y   Y 
Sbjct: 224 C-VPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYG 282

Query: 133 DGGSSLGVLVKDAFAFNYTNGQ------RLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
           D  ++ G    + F  N T+        R    +  GCG +N+     +H   G+LGLG+
Sbjct: 283 DSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNR---GLFHGAAGLLGLGR 339

Query: 186 GKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGD--DLYDSSRVVWTSM----S 234
           G  S  SQL  Q L  +   +CL            L FG+  DL     + +TS+     
Sbjct: 340 GPLSFSSQL--QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKE 397

Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLP------VVFDSGSSYTYLNRVTYQTLTSI 284
           +    +Y   +  +F GGE   +     NL        + DSG++ +Y +   Y+ +   
Sbjct: 398 NPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII--- 454

Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTP 343
             KE   + +K     E  P+      P  NV    +  F    + F DG    ++    
Sbjct: 455 --KEAFLRKVKGYKLVEDFPIL----HPCYNVSGTDELNFPEFLIQFADG---AVWNFPV 505

Query: 344 EAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGG 375
           E Y I I     VCL +L   +     L++IG 
Sbjct: 506 ENYFIRIQQLDIVCLAMLGTPKSA---LSIIGN 535


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 91/353 (25%), Positives = 144/353 (40%), Gaps = 69/353 (19%)

Query: 40  QVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
           ++   V P  G + + + IG P   Y   LDTGSDL W QC  PC +C     P++ P  
Sbjct: 85  EIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCK-PCTQCFHQSTPIFDPKK 143

Query: 99  DLVPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                +    + L  A    +C +   C+Y   Y D  S+ G+L  +   F    G+   
Sbjct: 144 SSSFSKLSCSSQLCEALPQSSCNN--GCEYLYSYGDYSSTQGILASETLTF----GKASV 197

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
           P +A GCG +   G+ +    G++GLG+G  S+VSQL   K       +CL+        
Sbjct: 198 PNVAFGCGADN-EGSGFSQGAGLVGLGRGPLSLVSQLKEPKF-----SYCLTT------- 244

Query: 218 FGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFF---GGETTGLKNLPV---- 262
             DD   S+ ++ +  S + +          +SP     ++    G + G   LP+    
Sbjct: 245 -VDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKST 303

Query: 263 -----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCW 307
                      + DSG++ TYL    +    +++ KE +AK     P D +    L +C+
Sbjct: 304 FSLQDDGSGGLIIDSGTTITYLEESAF----NLVAKEFTAK--INLPVDSSGSTGLDVCF 357

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGI 359
                  N+   K  F        DG      EL  E Y+I  S+ G  CL +
Sbjct: 358 TLPSGSTNIEVPKLVFH------FDGAD---LELPAENYMIGDSSMGVACLAM 401


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 99/348 (28%), Positives = 148/348 (42%), Gaps = 45/348 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+  G PA P  L +DTGSDL+W+QC  PC    C     P++ PS       VPC 
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQ-PCNSSTCYPQKDPVFDPSASSTYAPVPCG 180

Query: 105 DPICASLHAPGHHN-CEDPAQ----CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
              C  L    + N C + +     C Y ++Y +G +++GV   +    +      +N  
Sbjct: 181 SEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVN-N 239

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLF 217
            + GCG  Q     +   DG+LGLG    S+VSQ  +         +CL  G    GFL 
Sbjct: 240 FSFGCGLVQ--KGVFDLFDGLLGLGGAPESLVSQ--TTGTYGGAFSYCLPAGNSTAGFLA 295

Query: 218 FGDDLY---DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------DSGS 268
            G       +++   +T +    T +Y   +  +  GG+   ++  P VF      DSG+
Sbjct: 296 LGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIE--PTVFAGGMIIDSGT 353

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             T L    Y  L +  +  +SA  L    +DE L  C+     F    +V     T+AL
Sbjct: 354 IVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYD----FTGNTNVT--VPTVAL 407

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           +F  G T  L    P   L+     + CL  + GA  G  D  +IG +
Sbjct: 408 TFEGGVTIDLD--VPSGVLL-----DGCLAFVAGASDG--DTGIIGNV 446


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 106/393 (26%), Positives = 163/393 (41%), Gaps = 58/393 (14%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQ 78
           SS + S  S  ++    L+  +   V   +G Y + ++IG P + + L LDTGSDL W+Q
Sbjct: 164 SSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQ 223

Query: 79  CDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYA 132
           C  PC  C E   P Y P + +    + C DP C  + +P     C+   Q C Y   Y 
Sbjct: 224 C-VPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYG 282

Query: 133 DGGSSLGVLVKDAFAFNYTNGQ------RLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
           D  ++ G    + F  N T+        R    +  GCG +N+     +H   G+LGLG+
Sbjct: 283 DSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNR---GLFHGAAGLLGLGR 339

Query: 186 GKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGD--DLYDSSRVVWTSM----S 234
           G  S  SQL  Q L  +   +CL            L FG+  DL     + +TS+     
Sbjct: 340 GPLSFSSQL--QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKE 397

Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLP------VVFDSGSSYTYLNRVTYQTLTSI 284
           +    +Y   +  +F GGE   +     NL        + DSG++ +Y +   Y+ +   
Sbjct: 398 NPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII--- 454

Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTP 343
             KE   + +K     E  P+      P  NV    +  F    + F DG    ++    
Sbjct: 455 --KEAFLRKVKGYKLVEDFPIL----HPCYNVSGTDELNFPEFLIQFADG---AVWNFPV 505

Query: 344 EAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGG 375
           E Y I I     VCL +L   +     L++IG 
Sbjct: 506 ENYFIRIQQLDIVCLAMLGTPKSA---LSIIGN 535


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P+       VPC 
Sbjct: 41  LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 98

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
             +C   +A     C   +  C Y ++Y +D  SS GVLV+D       + Q   +   +
Sbjct: 99  SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 153

Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
             GCG  QV   S+      +G+LGLG    S+ S L S+ L  N    C    G G + 
Sbjct: 154 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 211

Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
           FGD    D  ++   V+         YY+  +  +  G ++   +    + DSG+S+T L
Sbjct: 212 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 265

Query: 274 NRVTYQTLTSIMKKEL 289
           +   Y  +TS    ++
Sbjct: 266 SDPMYTQITSSFDAQI 281


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 71/260 (27%), Positives = 110/260 (42%), Gaps = 36/260 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P    +L +D+GSD+ W+QC  PC+ C     PL+ P+       V C
Sbjct: 168 SGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCK-PCLECYVQADPLFDPATSATFSGVSC 226

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC  L      + E    C+YE+ YADG  + G L  +      T  +     + +G
Sbjct: 227 GSAICRILPTSACGDGE-LGGCEYEVSYADGSYTKGALALETLTLGGTAVE----GVVIG 281

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---------- 213
           CG+       +    G++GLG G  S+V QL  +  +     +CL+  GG          
Sbjct: 282 CGHRNR--GLFVGAAGLMGLGWGPMSLVGQLGGE--VGGAFSYCLASRGGYGSGAADDDA 337

Query: 214 GFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGE----TTGLKNLP------ 261
           G+L  G         VW  +  +     +Y  G++ +  G E      GL  L       
Sbjct: 338 GWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397

Query: 262 VVFDSGSSYTYLNRVTYQTL 281
           VV D+G++ T L +  Y  L
Sbjct: 398 VVMDTGTTVTRLPQEAYAAL 417


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 156/363 (42%), Gaps = 63/363 (17%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
           Y + +Y+G P R + + +DTGSDL WLQC APC+ C E   P++ P+       + C DP
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNLTCGDP 204

Query: 107 ICASL---HAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT---NGQRLNP 158
            C  +    AP    C  P +  C Y   Y D  +S G L  ++F  N T      R++ 
Sbjct: 205 RCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVD- 263

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG-----HCLSGGGG 213
            +  GCG+       +H   G+LGLG+G  S  SQL      R V G     +CL   G 
Sbjct: 264 GVVFGCGHRNR--GLFHGAAGLLGLGRGPLSFASQL------RAVYGGHTFSYCLVDHGS 315

Query: 214 GF---LFFGDD----LYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP-- 261
                + FG+D    L    R+ +T+    SS    +Y   +  +  GGE   + +    
Sbjct: 316 DVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWD 375

Query: 262 --------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
                    + DSG++ +Y     YQ +       +S  S    P+   L  C+      
Sbjct: 376 ASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSG-SYPPVPDFPVLSPCY------ 428

Query: 314 KNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLN 371
            NV  V++     L+L F DG    +++   E Y I +   G +CL +L     G   ++
Sbjct: 429 -NVSGVERPEVPELSLLFADG---AVWDFPAENYFIRLDPDGIMCLAVLGTPRTG---MS 481

Query: 372 VIG 374
           +IG
Sbjct: 482 IIG 484


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 89/306 (29%), Positives = 123/306 (40%), Gaps = 60/306 (19%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVY----PTGYYNVTMYIGQPARPYFLDLDTG 71
           RM++ S + S+  L     S+   +V    Y    P   Y V M IG P +P  L LDTG
Sbjct: 75  RMAARSKARSARLLSGRAASA---RVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTG 131

Query: 72  SDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ--- 124
           SDLTW QC APCV C     P + PS  +    +PC+  IC  L      +C + +    
Sbjct: 132 SDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLT---WSSCGEQSWGNG 187

Query: 125 -CDYELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRLALGCG-YNQVPGASYHPLDG 179
            C Y   YAD   + G L  D F+F   ++  G    P L  GCG +N   G       G
Sbjct: 188 ICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNN--GIFVSNETG 245

Query: 180 ILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-----FLFFGDDLYDSSR-----VV 229
           I G  +G  S+ +QL           +C +   G      FL    +LY  +      VV
Sbjct: 246 IAGFSRGALSMPAQLKVDNF-----SYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 300

Query: 230 WTSMSSDYTKYYSPGVAELFFG--GETTGLKNLPV---------------VFDSGSSYTY 272
               S+   +Y+S  +   +    G T G   LP+               + DSG+  T 
Sbjct: 301 ---QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTM 357

Query: 273 LNRVTY 278
           L    Y
Sbjct: 358 LPEAVY 363


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 77/279 (27%), Positives = 112/279 (40%), Gaps = 28/279 (10%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G+   TG Y VT   G PA+   L +DTGSD+TW+QC  PC  C     P++ P    S 
Sbjct: 130 GSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCK-PCSDCYSQVDPIFEPQQSSSY 188

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             + C    C  L    H        C YE+ Y DG  S G   ++      T G    P
Sbjct: 189 KHLSCLSSACTELTTMNHCRL---GGCVYEINYGDGSRSQGDFSQETL----TLGSDSFP 241

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGG 213
             A GCG+       +    G+LGLG+   S  SQ  S+        +CL     S   G
Sbjct: 242 SFAFGCGHTNT--GLFKGSAGLLGLGRTALSFPSQTKSK--YGGQFSYCLPDFVSSTSTG 297

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV-----VFDSGS 268
            F      +  ++  V    +S+Y  +Y  G+  +  GGE   +    +     + DSG+
Sbjct: 298 SFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGT 357

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
             T L    Y  L +  + +   ++L  A     L  C+
Sbjct: 358 VITRLVPQAYDALKTSFRSK--TRNLPSAKPFSILDTCY 394


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 129/297 (43%), Gaps = 32/297 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y + + IG P  P     DTGSDL W QC+ PC  C +   PL+ P        V C 
Sbjct: 84  GEYLMNISIGTPPVPILAIADTGSDLIWTQCN-PCEDCYQQTSPLFDPKESSTYRKVSCS 142

Query: 105 DPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LAL 162
              C +L      +C  D   C Y + Y D   + G +  D      +  + ++ R + +
Sbjct: 143 SSQCRALE---DASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMII 199

Query: 163 GCGYNQVPGASYHPL-DGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGF 215
           GCG+      ++ P   GI+GLG G +S+VSQL  +K I     +CL      +G     
Sbjct: 200 GCGHENT--GTFDPAGSGIIGLGGGSTSLVSQL--RKSINGKFSYCLVPFTSETGLTSKI 255

Query: 216 LFFGDDLYDSSRVVWTSM-SSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSG 267
            F  + +     VV TSM   D   YY       S G  ++ F     G     +V DSG
Sbjct: 256 NFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSG 315

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           ++ T L    Y  L S++   + A+ +++   D  L LC++    FK V D+   F+
Sbjct: 316 TTLTLLPSNFYYELESVVASTIKAERVQDP--DGILSLCYRDSSSFK-VPDITVHFK 369


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 82/169 (48%), Gaps = 14/169 (8%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G  L+  V      +G Y   + +G PA    L LDT SDLTWLQC  PC RC     P+
Sbjct: 117 GRGLVAPVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQ-PCRRCYPQSGPV 175

Query: 94  YRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADG----GSSLGVLVKDA 145
           + P +      +  + P C +L   G  + +    C Y ++Y DG     +S+G LV++ 
Sbjct: 176 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKR-GTCIYTVQYGDGHGSTSTSVGDLVEET 234

Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
             F    G      L++GCG++   G    P  GILGLG+G+ SI  Q+
Sbjct: 235 LTF---AGGVRQAYLSIGCGHDN-KGLFGAPAAGILGLGRGQISIPHQI 279


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 80/289 (27%), Positives = 126/289 (43%), Gaps = 24/289 (8%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R+ S  +  SS  +F    ++ L    G     G Y VT+ +G P + + L  DTGSD+T
Sbjct: 96  RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 155

Query: 76  WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGH---HNCEDPAQCDY 127
           W QC+ PCV+ C +   P   PS       + C   +C  L A G     +C   + C Y
Sbjct: 156 WTQCE-PCVKTCYKQKEPRLNPSTSTSYKNISCSSALC-KLVASGKKFSQSCSS-STCLY 212

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGK 187
           +++Y DG  S+G    +    + +N   +      GCG  Q     +    G+LGLG+ K
Sbjct: 213 QVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNNGLFGGAAGLLGLGRTK 267

Query: 188 SSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV 245
            ++ SQ  + K  + +  +CL  S    G+L  G  +  S +    S   D T +Y   +
Sbjct: 268 LALPSQ--TAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDI 325

Query: 246 AELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
             L  GG    +         V DSG+  T L+   Y  L+S  +  ++
Sbjct: 326 TGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 374


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 88/186 (47%), Gaps = 21/186 (11%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLV 101
           + T  Y VTM +G   +   + +DTGSDLTW+QC+ PC+ C     P+++PS       +
Sbjct: 140 FQTLNYIVTMELG--GQDMTVIIDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSI 196

Query: 102 PCEDPICASLHAPGHH--NCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
           PC    C SL     +   CE +P+ C Y + Y DG  + G L  +  +F    G     
Sbjct: 197 PCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF----GGISVS 252

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGF 215
               GCG N      +  + G++GLG+   S++SQ +S      V  +CL     G  G 
Sbjct: 253 NFVFGCGKNN--KGLFGGVSGLMGLGRSNLSLISQTNST--FGGVFSYCLPPTDAGASGS 308

Query: 216 LFFGDD 221
           L  G++
Sbjct: 309 LAMGNE 314


>gi|403222804|dbj|BAM40935.1| aspartyl(acid) protease [Theileria orientalis strain Shintoku]
          Length = 509

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/353 (23%), Positives = 143/353 (40%), Gaps = 44/353 (12%)

Query: 40  QVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR---- 95
           +V GN++   YY V + IG P     L +DTGS L  + C   C  C     P Y     
Sbjct: 69  KVFGNLHKFAYYYVYVGIGNPKTKQMLIIDTGSQLINVAC-GKCKECGNHLLPNYELGAS 127

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
            ++ L+ C+   C ++       C     C +   Y++G +  G +V D  +F+      
Sbjct: 128 VTHKLIDCDSEFCKAVEGK----CGLDESCLFNESYSEGSNVEGKVVGDLISFDIKKDSS 183

Query: 156 LNPRL--ALGCGYNQVPGASYHPLDGILGLGKG-KSSIVSQ--LHSQKLI---------- 200
                   +GC  N+         +GILGL K  K +++S     +Q  I          
Sbjct: 184 YLSTFFNYIGCVTNESQLIKSQITNGILGLAKSDKPTLISHEYFETQSFIEKYLTDHFRP 243

Query: 201 -RNVVGHCLSGGGGGFLFFGDD------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
            + +   CLS  GG     G D      + ++++++W  +    +++Y   V +  F   
Sbjct: 244 MKKIFSLCLSENGGVMTLGGVDDQLNLKIKNTTQLIWAPLVK--SEFYIIKVLDASFQEN 301

Query: 254 TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
               KN   V D+G++ + L +  +  +  I +  L     K + E +T   C   ++  
Sbjct: 302 KIEFKNKNFVLDTGTTISTLEKEVFNKIHKIFEG-LCEDITKLSNEKKTSSKCTVDKKTG 360

Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNV------CLGI 359
           K          ++ L+F +G     FE T ++Y+I  +NK  V      CLGI
Sbjct: 361 KMCFSDISKLPSIVLTFENGSN---FEWTSDSYMINRTNKRTVNDYSWWCLGI 410


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 79/288 (27%), Positives = 125/288 (43%), Gaps = 22/288 (7%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R+ S  +  SS  +F    ++ L    G     G Y VT+ +G P + + L  DTGSD+T
Sbjct: 84  RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 143

Query: 76  WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE--DPAQCDYE 128
           W QC+ PCV+ C +   P   PS       + C   +C  L A G    +    + C Y+
Sbjct: 144 WTQCE-PCVKTCYKQKEPRLNPSTSTSYKNISCSSALC-KLVASGKKFSQSCSSSTCLYQ 201

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKS 188
           ++Y DG  S+G    +    + +N   +      GCG  Q     +    G+LGLG+ K 
Sbjct: 202 VQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNNGLFGGAAGLLGLGRTKL 256

Query: 189 SIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA 246
           ++ SQ  + K  + +  +CL  S    G+L  G  +  S +    S   D T +Y   + 
Sbjct: 257 ALPSQ--TAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDIT 314

Query: 247 ELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
            L  GG    +         V DSG+  T L+   Y  L+S  +  ++
Sbjct: 315 GLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 362


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 161/385 (41%), Gaps = 55/385 (14%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           + +SSS  + SS     + S + FQ       T  Y VTM +G  ++   + +DTGSDLT
Sbjct: 94  KRTSSSQIADSSETQVPLTSGIKFQ-------TLNYIVTMGLG--SQNMSVIVDTGSDLT 144

Query: 76  WLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDPICASLHAPGHHNCEDP---AQCDYE 128
           W+QC+ PC  C     PL++PS       + C    C SL      +  DP   A CDY 
Sbjct: 145 WVQCE-PCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGS--DPSTSATCDYV 201

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKS 188
           + Y DG  + G L  +   F    G         GCG N      +    G++GLG+ + 
Sbjct: 202 VNYGDGSYTSGELGIEKLGF----GGISVSNFVFGCGRNN--KGLFGGASGLMGLGRSEL 255

Query: 189 SIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDD---LYDSSRVVWTSM--SSDYTK 239
           S++SQ ++      V  +CL      G  G L  G+      + + + +T M  +   + 
Sbjct: 256 SMISQTNAT--FGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSN 313

Query: 240 YYSPGVAELFFGG-----ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
           +Y   +  +  GG     + +   N  V+ DSG+  + L    Y+ L +   ++ S    
Sbjct: 314 FYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSG--F 371

Query: 295 KEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN 354
             AP    L  C+     +  V+       T+++ F +G      + T   YL+  +   
Sbjct: 372 PSAPGFSILDTCFN-LTGYDQVN-----IPTISMYF-EGNAELNVDATGIFYLVKEDASR 424

Query: 355 VCLGILNGAEVGLQDLNVIGGIGDF 379
           VCL +       L D   +G IG++
Sbjct: 425 VCLAL-----ASLSDEYEMGIIGNY 444


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 82/280 (29%), Positives = 112/280 (40%), Gaps = 49/280 (17%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           T  Y V + +G P RP  L LDTGSDL W QC APC  C     PL  P+       +PC
Sbjct: 89  TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFHQGLPLLDPAASSTYAALPC 147

Query: 104 EDPICASL----------HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
             P C +L           + G+ N      C Y   Y D   ++G +  D F F   NG
Sbjct: 148 GAPRCRALPFTSCGGGGRSSWGNGN----RSCAYIYHYGDKSVTVGEIATDRFTFGGDNG 203

Query: 154 ---QRL-NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH-- 206
               RL   RL  GCG +N+  G       GI G G+G+ S+ SQL+             
Sbjct: 204 DGDSRLPTRRLTFGCGHFNK--GVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFE 261

Query: 207 ------CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
                  L G     L +    + S  V  T +  + ++   P +  L   G + G   L
Sbjct: 262 SKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQ---PSLYFLSLKGISVGKTRL 318

Query: 261 PV--------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
            V        + DSG+S T L    Y+     +K E +A+
Sbjct: 319 AVPEAKLRSTIIDSGASITTLPEAVYEA----VKAEFAAQ 354


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 72/258 (27%), Positives = 113/258 (43%), Gaps = 38/258 (14%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
           +G P   + + LDTGSDL W+ CD  C++C     P        +Y P+       VPC 
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 162

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
             +C   +A     C   +  C Y ++Y +D  SS GVLV+D       + Q   +   +
Sbjct: 163 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 217

Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
             GCG  QV   S+      +G+LGLG    S+ S L S+ L  N    C    G G + 
Sbjct: 218 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 275

Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
           FGD    D  ++   V+         YY+  +  +  G ++   +    + DSG+S+T L
Sbjct: 276 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 329

Query: 274 NRVTYQTLTSIMKKELSA 291
           +   Y  +TS    ++ +
Sbjct: 330 SDPMYTQITSSFDAQIRS 347


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 118/283 (41%), Gaps = 37/283 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-----EAPHPLYRPSNDLVP 102
           +G Y V++ IG P +   L  DTGSDL W++C +PC  C       A    +  +   + 
Sbjct: 83  SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKC-SPCRNCSHRSPGSAFFARHSTTYSAIH 141

Query: 103 CEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNG--QRLN 157
           C  P C  +  P  + C      + C Y+  YAD  ++ G   K+A   N + G  ++LN
Sbjct: 142 CYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLN 201

Query: 158 PRLALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSG 210
             L+ GCG+      + GAS+    G++GLG+   S  SQL  +   K    ++ + LS 
Sbjct: 202 -GLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSP 260

Query: 211 GGGGFLFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV------ 262
               FL  G   ++  S + +  S +       SP    +   G       LP+      
Sbjct: 261 PPTSFLTIGGAQNVAVSKKGIM-SFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWS 319

Query: 263 ---------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
                    + DSG++ T++    Y  +    KK +   S  E
Sbjct: 320 IDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAE 362


>gi|357152658|ref|XP_003576193.1| PREDICTED: F-box/FBD/LRR-repeat protein At5g22660-like
           [Brachypodium distachyon]
          Length = 594

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 57/101 (56%), Gaps = 8/101 (7%)

Query: 89  APHPLYRPS--NDLVPCEDPICASLHAP--GHHNCE-DPAQCDYELEYADGGSSLGVLVK 143
            PH LY+P   N L+ C D  C  +H       +C  DP QCDYE+EY +G +S+GVL+ 
Sbjct: 382 VPHDLYKPRRMNKLL-CGDERCVKVHKDLDIEQDCTLDPNQCDYEIEYTNGENSMGVLLA 440

Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG 184
           D F+   T   RLN  LA GCGY    G    P+DG+L +G
Sbjct: 441 DTFSLPTTTNDRLN--LAFGCGYGHQGGQEVTPVDGVLRIG 479


>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
          Length = 431

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 120/320 (37%), Gaps = 61/320 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
            G Y   + IG PAR Y++ ++    LT                     +  LV C+   
Sbjct: 95  VGLYYAKIGIGTPARDYYVQME----LTLYDIKESL-------------TGKLVSCDQDF 137

Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK---DAFAFNYTNGQRLNPRLA--L 162
           C +++      C     C Y   YADG SS G  VK    A  +N       NP L   L
Sbjct: 138 CYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLNNNPLLEVPL 197

Query: 163 GCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGGFLFFGD 220
            C   Q    +S   LDGILG GK  +S++SQL S   +R +  HCL G  GGG    G 
Sbjct: 198 RCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGH 257

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSS 269
            +    +V  T +  + T +Y+  +  +  GG      NLP            + DSG++
Sbjct: 258 IV--QPKVNTTPLVPNQT-HYNVNMKAVEVGGY---FLNLPTDVFDVGDKKGTIIDSGTT 311

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
             YL  V Y  L S +                     W+       +HD   CF+  + S
Sbjct: 312 LAYLPEVVYDQLLSKI-------------------FSWQSDLKVHTIHDQFTCFQ-YSES 351

Query: 330 FTDGKTRTLFELTPEAYLII 349
             DG     F      YL +
Sbjct: 352 LDDGFPAVTFHFENSLYLKV 371


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 67/133 (50%), Gaps = 13/133 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           V G    +G Y   + IG P R  ++ LDTGSD+ W+QC+ PC  C     P++ PS+ +
Sbjct: 144 VSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSV 202

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               V C+  +C+ L A   H       C YE+ Y DG  ++G    +   F  T+ Q  
Sbjct: 203 SFSTVGCDSAVCSQLDANDCHG----GGCLYEVSYGDGSYTVGSYATETLTFGTTSIQ-- 256

Query: 157 NPRLALGCGYNQV 169
              +A+GCG++ V
Sbjct: 257 --NVAIGCGHDNV 267


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/350 (23%), Positives = 140/350 (40%), Gaps = 55/350 (15%)

Query: 54  TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------- 100
           T+ IG P   + + LDTGSDL W+ CD  C RC  +    +    DL             
Sbjct: 103 TVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAASDSTAFASDFDLNVYNPNGSSTSKK 160

Query: 101 VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR--L 156
           V C + +C          C    + C Y + Y    +S  G+LV+D       +     +
Sbjct: 161 VTCNNSLCTH-----RSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLV 215

Query: 157 NPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
              +  GCG  Q+   S+  +   +G+ GLG  K S+ S L  +    +    C    G 
Sbjct: 216 EANVIFGCG--QIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 273

Query: 214 GFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
           G + FGD   +D     +    S  T  Y+  V ++  G     ++    +FDSG+S+TY
Sbjct: 274 GRISFGDKGSFDQDETPFNLNPSHPT--YNITVTQVRVGTTVIDVE-FTALFDSGTSFTY 330

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK-----KCFRTLA 327
           L   TY  LT     ++  +  +              R PF+  +D+          +++
Sbjct: 331 LVDPTYTRLTESFHSQVQDRRHRS-----------DSRIPFEYCYDMSPDANTSLIPSVS 379

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
           L+   G    ++    +  +IIS +  +  CL ++  AE+ +   N + G
Sbjct: 380 LTMGGGSHFAVY----DPIIIISTQSELVYCLAVVKSAELNIIGQNFMTG 425


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 75/270 (27%), Positives = 111/270 (41%), Gaps = 28/270 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV----PCE 104
           G Y + + IG P  P    +DTGSDLTW QC  PC  C +   P + P N        C 
Sbjct: 90  GEYIMNLSIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPFFDPKNSSTYRDSSCG 148

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C +L      +C +  +C +   YADG  + G L  +      T G+ ++ P  A G
Sbjct: 149 TSFCLALG--NDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFG 206

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGFLF 217
           C +        H   GI+GLG  + S++SQL S   I     +CL      S       F
Sbjct: 207 CVHRSGGIFDEHS-SGIVGLGVAELSMISQLKST--INGRFSYCLLPVFTDSSMSSRINF 263

Query: 218 FGDDLYDSSRVVWTS--MSSDYTKYY-------SPGVAELFFGG--ETTGLKNLPVVFDS 266
               +   +  V T   M    T YY       S G   L + G  +   ++   ++ DS
Sbjct: 264 GRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDS 323

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           G++YTYL    Y  L   +   +  K +++
Sbjct: 324 GTTYTYLPLEFYVKLEESVAHSIKGKRVRD 353


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 145/363 (39%), Gaps = 64/363 (17%)

Query: 42  HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----S 97
           + N  PT  Y V + IG P +P  L LDTGSDL W QC  PC  C +   P + P    +
Sbjct: 26  YDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSST 84

Query: 98  NDLVPCEDPICASLHAPGHHNCEDPA-----QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
             L  C+  +C  L      +C  P       C Y   Y D   + G L  D F F    
Sbjct: 85  LSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG 141

Query: 153 GQRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
                P +A GCG +N   G       GI G G+G  S+ SQL           HC +  
Sbjct: 142 ASV--PGVAFGCGLFNN--GVFKSNETGIAGFGRGPLSLPSQLKVGNF-----SHCFTTI 192

Query: 212 GGG-----FLFFGDDLYDSSR-VVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLPV-- 262
            G       L    DL+ + +  V T+    Y K  + P +  L   G T G   LPV  
Sbjct: 193 TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 252

Query: 263 ------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDET-LPLCWK 308
                       + DSG+S T L    YQ    +++ E +A+  L   P + T    C+ 
Sbjct: 253 SAFALTNGTGGTIIDSGTSITSLPPQVYQ----VVRDEFAAQIKLPVVPGNATGHYTCFS 308

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL--IISNKGN--VCLGILNGAE 364
              P +   DV K    L L F +G T    +L  E Y+  +  + GN  +CL I  G E
Sbjct: 309 A--PSQAKPDVPK----LVLHF-EGAT---MDLPRENYVFEVPDDAGNSIICLAINKGDE 358

Query: 365 VGL 367
             +
Sbjct: 359 TTI 361


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 83/167 (49%), Gaps = 11/167 (6%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PA    L +DTGSD+TWLQC  PC RC     P++ P +      +  
Sbjct: 131 SGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMGY 189

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLNPRLAL 162
           + P C +L   G  + +    C Y + Y D GS ++G  +++   F    G    P +++
Sbjct: 190 DAPDCQALGRSGGGDAKR-MTCVYAVGYGDDGSTTVGDFIEETLTF---AGGVQVPHMSI 245

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
           GCG++   G    P  GILGLG+G+ S  SQ+ +         +CL+
Sbjct: 246 GCGHDN-KGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLA 291


>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
          Length = 802

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 158/385 (41%), Gaps = 60/385 (15%)

Query: 35  SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE----AP 90
           SS   +++G    TGY+  T+ IG P   + + +DTGS  T++ C  PC  C +    AP
Sbjct: 122 SSAGLELNGKARDTGYFYATVLIGTPGHQFEVIVDTGSTYTFVTC-YPCASCGQHGSNAP 180

Query: 91  HPLYRPSN-DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
           +   + S+ + VPC               C     C+Y+ ++++     G +V D     
Sbjct: 181 YDAAKSSSYERVPCGSGCIFGA-------CRASGLCEYDEKFSEDSQVGGHVVSDVID-- 231

Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL----IRNVVG 205
              G    PR+  GC   +         +G++ LG+ ++ +  QL  +           G
Sbjct: 232 -VGGSLGTPRIHFGCNSLETNMLKTQKANGMIALGRAEAGLHRQLKKKAYPPGSYDGTFG 290

Query: 206 HCL-SGGGGGFLFFG---DDLYDS--SRVVWTS----MSSDYTKYYSPGVAELFFGGETT 255
            CL S  GGG L  G   +  Y +  +R   TS    +    ++YY+  V  +F     T
Sbjct: 291 LCLGSFEGGGVLSLGKLPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEVHRMFV--RNT 348

Query: 256 GLKN-------------LPVVFDSGSSYTYLNRVTYQTLTSIMKKEL----SAKSLKEAP 298
            LK                 V DSG++YTYL+   +    S ++ ++     A   +   
Sbjct: 349 ELKKPSGAELMEAFRAGYGTVLDSGTTYTYLHEDVFIPFISEIEDKVVNDHGANFFRVRG 408

Query: 299 EDETLP--LCWKGRRPFKNVHD--VKKCFRTLALSFTDGKTRTL-FELTPEAYLIIS-NK 352
            D   P  +CW+     K + +  V   F T  L+F       L  E  PE YL +  N+
Sbjct: 409 GDPNYPNDVCWRSLNENKQLSESNVNYLFPTFNLTFIGVNEEELPIEFLPENYLFVHPNE 468

Query: 353 GNV-CLGILNGAEVGLQDLNVIGGI 376
            N  C+G+ +  + G    ++IGGI
Sbjct: 469 PNAFCVGVFDNGQQG----SIIGGI 489


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/289 (27%), Positives = 127/289 (43%), Gaps = 24/289 (8%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R+ S  +  SS  +F    ++ L    G     G Y VT+ +G P + + L  DTGSD+T
Sbjct: 36  RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95

Query: 76  WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGH---HNCEDPAQCDY 127
           W QC+ PCV+ C +   P   PS       + C   +C  L A G     +C   + C Y
Sbjct: 96  WTQCE-PCVKTCYKQKEPRLNPSTSTSYKNISCSSALC-KLVASGKKFSQSCSS-STCLY 152

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGK 187
           +++Y DG  S+G    +    + +N   +      GCG  Q     +    G+LGLG+ K
Sbjct: 153 QVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNNGLFGGAAGLLGLGRTK 207

Query: 188 SSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV 245
            ++ SQ  + K  + +  +CL  S    G+L  G  +  S +    S   D T +Y   +
Sbjct: 208 LALPSQ--TAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDI 265

Query: 246 AELFFGGETTGLK----NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
             L  GG    +     +   V DSG+  T L+   Y  L+S  +  ++
Sbjct: 266 TGLSVGGRQLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 314


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 121/276 (43%), Gaps = 30/276 (10%)

Query: 52  NVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPI 107
           N  + IG   +   + +DTGSDLTW+QCD PC+ C     P++      S + + C    
Sbjct: 132 NYIVTIGLGNQNMTVIIDTGSDLTWVQCD-PCMSCYSQQGPVFNPSNSSSYNSLLCNSST 190

Query: 108 CASLH--APGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           C +L         CE  +P+ C++ + Y DG  + G L  +  +F    G         G
Sbjct: 191 CQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSF----GGISVSNFVFG 246

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
           CG N      +  + GI+GLG+   S++SQ ++      V  +CL     G  G L  G+
Sbjct: 247 CGRNN--KGLFGGVSGIMGLGRSNLSMISQTNTT--FGGVFSYCLPTTDSGASGSLVIGN 302

Query: 221 D---LYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTY 272
           +     + + + +TSM S+   + +Y   +  +  GG   + T   N  ++ DSG+  T 
Sbjct: 303 ESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITR 362

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           L    Y  L +   K+ S   +  AP    L  C+ 
Sbjct: 363 LAPSLYNALKAEFLKQFSGYPI--APALSILDTCFN 396


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 65/126 (51%), Gaps = 13/126 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + IG P R  ++ LDTGSD+ W+QC+ PC  C     P++ PS+ +    V C
Sbjct: 5   SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSVSFSTVGC 63

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           +  +C+ L A   H       C YE+ Y DG  ++G    +   F  T+ Q     +A+G
Sbjct: 64  DSAVCSQLDANDCHG----GGCLYEVSYGDGSYTVGSYATETLTFGTTSIQ----NVAIG 115

Query: 164 CGYNQV 169
           CG++ V
Sbjct: 116 CGHDNV 121


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 58/363 (15%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-----RCVEAPHPLYRPSNDL 100
           Y T  Y   + +G PA+ + + +DTGS+LTW+ C          R   A       S   
Sbjct: 79  YGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADES---KSFKT 135

Query: 101 VPCEDPICAS--LHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           V C    C    ++      C  P+  C Y+  YADG ++ GV  K+      TNG+   
Sbjct: 136 VGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMAR 195

Query: 158 -PRLALGCGYNQVPGASYHPLDGILGLGKGK---SSIVSQLHSQKLIRNVVGHCLSGGGG 213
            P   +GC  +   G S+   DG+LGL       +S  + L+  K    +V H  +    
Sbjct: 196 LPGHLIGCS-SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVS 254

Query: 214 GFLFFGDDLYDSSRVVWTSMSS----DYTK---YYSPGVAELFFGGETTGLKNLP----- 261
            +L FG     SSR   T+       D T+   +Y+  V  +  G +   + ++P     
Sbjct: 255 NYLIFG-----SSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYD---MLDIPSQVWD 306

Query: 262 ------VVFDSGSSYTYLNRVTY-QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
                  + DSG+S T L    Y Q +T + +  +  K +K  PE   +  C+     F 
Sbjct: 307 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK--PEGVPIEYCFSFTSGF- 363

Query: 315 NVHDVKKCFRTLALSF-TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
           NV  + +      L+F   G  R  FE   ++YL+ +  G  CLG ++    G    NVI
Sbjct: 364 NVSKLPQ------LTFHLKGGAR--FEPHRKSYLVDAAPGVKCLGFVSA---GTPATNVI 412

Query: 374 GGI 376
           G I
Sbjct: 413 GNI 415


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/286 (29%), Positives = 120/286 (41%), Gaps = 35/286 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y V M +G P + Y + +DTGS  +WLQC    + C     P++ PS       VPC
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPC 159

Query: 104 -----EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                     A+L+ P    C   +  C Y+  Y D   SLG L +D      T  Q L+
Sbjct: 160 SSSQCSSLKSATLNEP---TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQTLS 214

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-------SG 210
                GCG  Q     +   DGI+GL   + S++SQL  +    N   +CL       + 
Sbjct: 215 -SFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGK--YGNAFSYCLPTSFSTPNS 269

Query: 211 GGGGFLFFG-DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVV 263
              GFL  G   L  SS   +T +  + +    Y   +  +   G   G+      +P +
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTI 329

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
            DSG+  T L    Y TL +     LS K  ++AP    L  C+KG
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILS-KKYQQAPGISLLDTCFKG 374


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 139/357 (38%), Gaps = 60/357 (16%)

Query: 47  PTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY---RPSNDLVPC 103
           P   Y + + IG P +P  L LDTGS L W QC  PC  C     P Y   R S   +P 
Sbjct: 31  PMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALPS 89

Query: 104 EDPICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
            D     L  P    C  +    C Y   Y D  +++G L  D    ++  G  + P + 
Sbjct: 90  CDSTQCKLD-PSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAGASV-PGVV 145

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG----GFLF 217
            GCG N   G       GI G G+G  S+ SQL           HC +   G      LF
Sbjct: 146 FGCGLNNT-GIFRSNETGIAGFGRGPLSLPSQLKVGNF-----SHCFTAVSGRKPSTVLF 199

Query: 218 -FGDDLYDSSRVVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLPV------------- 262
               DLY + R   T  ++   K  + P    L   G T G   LPV             
Sbjct: 200 DLPADLYKNGR--GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGG 257

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPLCWKGRRPFKNVHDVK 320
            + DSG+++T L    Y+    ++  E +A   L   P +ET PL      P      V 
Sbjct: 258 TIIDSGTAFTSLPPRVYR----LVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVP 313

Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKG---NVCLGILNGAEVGLQDLNVIG 374
           K    L L F +G T     L  E Y+  +  G   ++CL I+ G      ++ +IG
Sbjct: 314 K----LVLHF-EGAT---MHLPRENYVFEAKDGGNCSICLAIIEG------EMTIIG 356


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/248 (28%), Positives = 106/248 (42%), Gaps = 24/248 (9%)

Query: 59  QPARPYFLDLDTGSDLTWLQC-DAPCVRCVEAPHPLYRPS----NDLVPCEDPICASL-- 111
           +P     + LDT SD+ W+QC   P  +C      LY PS    ++   C  P C  L  
Sbjct: 177 RPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGP 236

Query: 112 HAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVP 170
           +A G  +  + A QC Y + Y DG ++ G LV D  + + T+     P+   GC +    
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS---QVPKFEFGCSHAARG 293

Query: 171 GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRV 228
             S     GI+ LG+G  S+VSQ  ++     V  +C   +    GF   G     SSR 
Sbjct: 294 SFSRSKTAGIMALGRGVQSLVSQTSTK--YGQVFSYCFPPTASHKGFFVLGVPRRSSSRY 351

Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS---SYTYLNRV---TYQTLT 282
             T M       Y   +  +   G+   L   P VF +G+   S T + R+    YQ L 
Sbjct: 352 AVTPMLKT-PMLYQVRLEAIAVAGQR--LDVPPTVFAAGAALDSRTVITRLPPTAYQALR 408

Query: 283 SIMKKELS 290
           S  + ++S
Sbjct: 409 SAFRDKMS 416


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 106/388 (27%), Positives = 151/388 (38%), Gaps = 60/388 (15%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM+  S + +   L +   + +    + +  P   Y + + IG P +P  L LDTGS L 
Sbjct: 56  RMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLV 115

Query: 76  WLQCDAPCVRCVEAPHPLY---RPSNDLVPCEDPICASLHAPGHHNC--EDPAQCDYELE 130
           W QC  PC  C     P Y   R S   +P  D     L  P    C  +    C Y   
Sbjct: 116 WTQCQ-PCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLD-PSVTMCVNQTVQTCAYSYS 173

Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
           Y D  +++G L  D    ++  G  + P +  GCG N   G       GI G G+G  S+
Sbjct: 174 YGDKSATIGFL--DVETVSFVAGASV-PGVVFGCGLNNT-GIFRSNETGIAGFGRGPLSL 229

Query: 191 VSQLHSQKLIRNVVGHCLSGGGG----GFLF-FGDDLYDSSRVVWTSMSSDYTKYYS-PG 244
            SQL           HC +   G      LF    DLY + R   T  ++   K  + P 
Sbjct: 230 PSQLKVGNF-----SHCFTAVSGRKPSTVLFDLPADLYKNGR--GTVQTTPLIKNPAHPT 282

Query: 245 VAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTLTSIMKKELS 290
              L   G T G   LPV              + DSG+++T L    Y+    ++  E +
Sbjct: 283 FYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYR----LVHDEFA 338

Query: 291 AK-SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
           A   L   P +ET PL      P      V K    L L F +G T     L  E Y+  
Sbjct: 339 AHVKLPVVPSNETGPLLCFSAPPLGKAPHVPK----LVLHF-EGAT---MHLPRENYVFE 390

Query: 350 SNKG---NVCLGILNGAEVGLQDLNVIG 374
           +  G   ++CL I+ G      ++ +IG
Sbjct: 391 AKDGGNCSICLAIIEG------EMTIIG 412


>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 521

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 125/291 (42%), Gaps = 33/291 (11%)

Query: 37  LLFQVHGNVYPT-----GYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
           LLF  HG+   +     G+ + T   IG P+  + + LD GSDL W+ CD  CV+C    
Sbjct: 77  LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLS 134

Query: 91  HPLY----RPSNDLVPCEDPICASLHAPGHH-------NCEDPAQ-CDYELEY-ADGGSS 137
              Y    R  N+  P      +S H    H       NC+   Q C Y + Y ++  SS
Sbjct: 135 SSYYSNLDRDLNEYSPSRS--LSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSS 192

Query: 138 LGVLVKDAFAFN---YTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVS 192
            G+LV+D          +   +   + LGCG  Q  G      P DG+LGLG G+SS+ S
Sbjct: 193 SGLLVEDILHLQSGGTLSNSSVQAPVVLGCGMKQSGGYLDGVAP-DGLLGLGPGESSVPS 251

Query: 193 QLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR-VVWTSMSSDYTKYYSPGVAELFFG 251
            L    LI      C +    G +FFGD    S +   +  +   Y+ Y   GV     G
Sbjct: 252 FLAKSGLIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYII-GVESCCIG 310

Query: 252 GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKEL--SAKSLKEAPED 300
                + +     DSG+S+T+L    Y  +T    +++  S  S + +P +
Sbjct: 311 NSCLKMTSFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWE 361


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 160/391 (40%), Gaps = 63/391 (16%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           R  +S SSS   +L   + +++     G    +G Y + +Y+G P R + + +DTGSDL 
Sbjct: 119 RTPASPSSSPRRALSERMVATV---ESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLN 175

Query: 76  WLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPG-HHNCEDPAQ--CDYE 128
           WLQC APC+ C +   P++ P+       V C D  C  +  P     C  P +  C Y 
Sbjct: 176 WLQC-APCLDCFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYY 234

Query: 129 LEYADGGSSLGVLVKDAFAFNYT--NGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
             Y D  ++ G L  ++F  N T     R    +  GCG +N+     +H   G+LGLG+
Sbjct: 235 YWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVFGCGHWNR---GLFHGAAGLLGLGR 291

Query: 186 GKSSIVSQLHSQKLIRNVVGH----CLSGGGGGF---LFFGDDLYDSSR--------VVW 230
           G  S  SQL      R V GH    CL   G      + FG+D   +            +
Sbjct: 292 GPLSFASQL------RAVYGHTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAF 345

Query: 231 TSMSSDYTKYYSPGVAELFFGGETTGLKN------------LPVVFDSGSSYTYLNRVTY 278
              SS    +Y   +  +  GGE   + +               + DSG++ +Y     Y
Sbjct: 346 APASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAY 405

Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRT 337
           Q +       +  +S    P+   L  C+       NV  V +     L+L F DG    
Sbjct: 406 QVIRQAFIDRM-GRSYPLIPDFPVLSPCY-------NVSGVDRPEVPELSLLFADG---A 454

Query: 338 LFELTPEAYLI-ISNKGNVCLGILNGAEVGL 367
           +++   E Y I +   G +CL +L     G+
Sbjct: 455 VWDFPAENYFIRLDPDGIMCLAVLGTPRTGM 485


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/286 (29%), Positives = 120/286 (41%), Gaps = 35/286 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y V M +G P + Y + +DTGS  +WLQC    + C     P++ PS       VPC
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPC 159

Query: 104 -----EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
                     A+L+ P    C   +  C Y+  Y D   SLG L +D      T  Q L+
Sbjct: 160 SSSQCSSLKSATLNEP---TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQTLS 214

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-------SG 210
                GCG  Q     +   DGI+GL   + S++SQL  +    N   +CL       + 
Sbjct: 215 -SFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGK--YGNAFSYCLPTSFSTPNS 269

Query: 211 GGGGFLFFG-DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVV 263
              GFL  G   L  SS   +T +  + +    Y   +  +   G   G+      +P +
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTI 329

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
            DSG+  T L    Y TL +     LS K  ++AP    L  C+KG
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILS-KKYQQAPGISLLDTCFKG 374


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 58/363 (15%)

Query: 46  YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-----RCVEAPHPLYRPSNDL 100
           Y T  Y   + +G PA+ + + +DTGS+LTW+ C          R   A       S   
Sbjct: 101 YGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADES---KSFKT 157

Query: 101 VPCEDPICAS--LHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           V C    C    ++      C  P+  C Y+  YADG ++ GV  K+      TNG+   
Sbjct: 158 VGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMAR 217

Query: 158 -PRLALGCGYNQVPGASYHPLDGILGLGKGK---SSIVSQLHSQKLIRNVVGHCLSGGGG 213
            P   +GC  +   G S+   DG+LGL       +S  + L+  K    +V H  +    
Sbjct: 218 LPGHLIGCS-SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVS 276

Query: 214 GFLFFGDDLYDSSRVVWTSMSS----DYTK---YYSPGVAELFFGGETTGLKNLP----- 261
            +L FG     SSR   T+       D T+   +Y+  V  +  G +   + ++P     
Sbjct: 277 NYLIFG-----SSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYD---MLDIPSQVWD 328

Query: 262 ------VVFDSGSSYTYLNRVTY-QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
                  + DSG+S T L    Y Q +T + +  +  K +K  PE   +  C+     F 
Sbjct: 329 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK--PEGVPIEYCFSFTSGF- 385

Query: 315 NVHDVKKCFRTLALSF-TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
           NV  + +      L+F   G  R  FE   ++YL+ +  G  CLG ++    G    NVI
Sbjct: 386 NVSKLPQ------LTFHLKGGAR--FEPHRKSYLVDAAPGVKCLGFVSA---GTPATNVI 434

Query: 374 GGI 376
           G I
Sbjct: 435 GNI 437


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 91/192 (47%), Gaps = 16/192 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VP 102
           +G Y VT+ +G P R      DTGSDLTW QC+ PCV  C +    ++ PS  L    V 
Sbjct: 86  SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVS 144

Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
           C+ P C  L  A G+      + C Y + Y DG  S+G   ++  +   T+   +     
Sbjct: 145 CDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQ 201

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
            GCG N      +    G+LGL +   S+VSQ  +QK  + V  +CL  S    G+L FG
Sbjct: 202 FGCGQNNR--GLFGGTAGLLGLARNPLSLVSQT-AQKYGK-VFSYCLPSSSSSTGYLSFG 257

Query: 220 DDLYDSSRVVWT 231
               DS  V +T
Sbjct: 258 SGDGDSKAVKFT 269


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 110/403 (27%), Positives = 164/403 (40%), Gaps = 65/403 (16%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQVH--------------GNVYPTGYYNVTMYIGQP 60
           V+ + S SS++  SLF +  S+ +FQ H              G  +  G Y  ++ +G P
Sbjct: 54  VKANPSPSSAAQKSLFPY--SAHIFQQHTKNPAALRSSTTTLGRKF--GEYYTSIKLGSP 109

Query: 61  ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCED-PICASLHAPG 115
            +   L +DTGS+LTWLQC  PC  C  +   +Y  +       V C +  +C++     
Sbjct: 110 GQEAILIVDTGSELTWLQC-LPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSSQGT 168

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR--LNPRLALGCGYNQ---VP 170
           +  C   +QC +   Y DG  S G L  D        G +       A GC       VP
Sbjct: 169 YAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVP 228

Query: 171 -GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GGGGFLFFGDDLYD 224
            GAS     GILGL  GK ++  QL  +   +    HC           G +FFG+    
Sbjct: 229 TGAS-----GILGLNAGKMALPMQLGQRFGWK--FSHCFPDRSSHLNSTGVVFFGNAELP 281

Query: 225 SSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRV 276
             +V +TS+    S    K+Y   +  +        L  LP    V+ DSGSS++   R 
Sbjct: 282 HEQVQYTSVALTNSELQRKFYHVALKGVSINSHE--LVFLPRGSVVILDSGSSFSSFVRP 339

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDE--TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
            +  L     K     SLK    D    L  C+K      ++ ++ +   +L+L F DG 
Sbjct: 340 FHSQLREAFLKH-RPPSLKHLEGDSFGDLGTCFKVSN--DDIDELHRTLPSLSLVFEDGV 396

Query: 335 T---RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           T    ++  L P A     N   +C    +G   G   +NVIG
Sbjct: 397 TIGIPSIGVLLPVARF--QNHVKMCFAFEDG---GPNPVNVIG 434


>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
 gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
          Length = 523

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 75/270 (27%), Positives = 113/270 (41%), Gaps = 37/270 (13%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR------------PSNDLVPCE 104
           +G P   + + LDTGSDL W+ CD  C+ C     P YR             ++  VPC 
Sbjct: 110 LGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSSTSRKVPCS 167

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
             +C    A    +   P    Y +EY +D  SS GVLV+D       Y   + +   + 
Sbjct: 168 SNLCDLQSACRSASSSCP----YSIEYLSDNTSSTGVLVEDVLYLITEYGQPKIVTAPIT 223

Query: 162 LGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
            GCG  Q      S  P +G+LGLG    S+ S L S+ +  N    C    G G + FG
Sbjct: 224 FGCGRIQTGSFLGSAAP-NGLLGLGMDSISVPSLLASEGVAANSFSMCFGDDGRGRINFG 282

Query: 220 D----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
           D    D  ++   ++         YY+  +     G ++    N   + DSG+S+T L+ 
Sbjct: 283 DTGSSDQQETPLNIYKQ-----NPYYNISITGAMVGSKSFN-TNFNAIVDSGTSFTALSD 336

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
             Y  +TS    ++  K  +    D +LP 
Sbjct: 337 PMYSEITSSFNSQVQDKPTQ---LDSSLPF 363


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 110/262 (41%), Gaps = 36/262 (13%)

Query: 41  VHGNVYPT---GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRP 96
           V   V PT   G + +T+ IG P  P+    DTGSDL W QC APC R C + P PLY P
Sbjct: 72  VSAPVSPTTVPGEFLMTLAIGTPPLPFLAIADTGSDLIWTQC-APCSRQCFQQPTPLYNP 130

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--GQ 154
           S+       P  +SL       C     C Y + Y  G + +     + F F  +    Q
Sbjct: 131 SSSTTFSALPCNSSLGL-----CAPACACMYNMTYGSGWTYV-FQGTETFTFGSSTPADQ 184

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
              P +A GC  N   G +     G++GLG+G  S+VSQL + K    +  +  +     
Sbjct: 185 VRVPGIAFGCS-NASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCLTPYQDTNSTST 243

Query: 215 FLFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLPV---------- 262
            L       + + VV ++  ++S  + YY      L   G + G   LP+          
Sbjct: 244 LLLGPSASLNDTGVVSSTPFVASPSSIYY-----YLNLTGISLGTTALPIPPNAFSLKAD 298

Query: 263 -----VFDSGSSYTYLNRVTYQ 279
                + DSG++ T L    YQ
Sbjct: 299 GTGGLIIDSGTTITMLGNTAYQ 320


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/348 (24%), Positives = 149/348 (42%), Gaps = 49/348 (14%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------ 100
            T+ +G P + + + LDTGSDL W+ CD  C RC       Y    +L            
Sbjct: 105 TTVSLGTPGKKFLVALDTGSDLFWVPCD--CSRCAPTEGTTYASDFELSIYNPKGSSTSR 162

Query: 101 -VPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR- 155
            V C++ +CA      H N C    + C Y + Y    +S  G+LV+D       + ++ 
Sbjct: 163 KVTCDNSLCA------HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQE 216

Query: 156 -LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
            +   +  GCG  QV   S+  +   +G+ GLG  K S+ S L  +    +    C    
Sbjct: 217 FVEAYVTFGCG--QVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPD 274

Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
           G G + FGD           ++++ +   Y+  V ++  G     L +   +FDSG+S+T
Sbjct: 275 GIGRISFGDKGSPDQEETPFNLNALHPT-YNITVTQVRVGTTLIDL-DFTALFDSGTSFT 332

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTLALS 329
           YL    Y   T+++K   S       P D  +P   C+    P +N   +     +++L+
Sbjct: 333 YLVDPIY---TNVLKSFHSQAQDSRRPPDSRIPFEFCYD-MSPGENTSLIP----SMSLT 384

Query: 330 FTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
              G    ++    +  +IIS++  +  C+ ++  AE+ +   N + G
Sbjct: 385 MKGGSQFPVY----DPIIIISSQSELIYCMAVVRSAELNIIGQNFMTG 428


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
           + G    +G Y   + IG+PAR  ++ LDTGSD+ WLQC  PC  C     P++ PS+  
Sbjct: 138 ISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSS 196

Query: 99  --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
             + + C+ P C +L      N    A C YE+ Y DG  ++G    +      T G  L
Sbjct: 197 SYEPLSCDTPQCNALEVSECRN----ATCLYEVSYGDGSYTVGDFATETL----TIGSTL 248

Query: 157 NPRLALGCGYN 167
              +A+GCG++
Sbjct: 249 VQNVAVGCGHS 259


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 126/291 (43%), Gaps = 27/291 (9%)

Query: 27  SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
           +S   H+ SS+ F     +  + Y  V + IG P +   L  DTGS L W QC  PC  C
Sbjct: 109 TSSVEHMKSSVPFYGLSKITASDYI-VNVGIGTPKKEMPLIFDTGSGLIWTQCK-PCKAC 166

Query: 87  VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
                P++ P+       +PC   +C S+       C  P +C Y   Y D  SS G L 
Sbjct: 167 YPK-VPVFDPTKSASFKGLPCSSKLCQSI----RQGCSSP-KCTYLTAYVDNSSSTGTLA 220

Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
            +  +F++      N  + +GC  +QV G S     GI+GL +   S+ SQ  +  +   
Sbjct: 221 TETISFSHLKYDFKN--ILIGCS-DQVSGESLGE-SGIMGLNRSPISLASQ--TANIYDK 274

Query: 203 VVGHCL--SGGGGGFLFFGDDLYDSSR---VVWTSMSSDY-TKYYSPGVAELFFGGETTG 256
           +  +C+  + G  G L FG  + +  R   V  T+ SSDY  K     V       + + 
Sbjct: 275 LFSYCIPSTPGSTGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASA 334

Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
            K +    DSG+  T L    Y  L S+ ++ +    L +  +D+ L  C+
Sbjct: 335 FK-IASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLD--QDDFLDTCY 382


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 83/345 (24%), Positives = 139/345 (40%), Gaps = 62/345 (17%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y + + IG P    + + DTGSDL W QC  PC +C +  +P++ P    S   + C   
Sbjct: 60  YLMELSIGTPPIKIYAEADTGSDLVWFQC-IPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCG 165
            C  L +       D   C+Y   YAD   + GVL ++      T G+ +  + +  GCG
Sbjct: 119 SCNKLDS--SLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCG 176

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---------- 215
           +N   G +   + G++GLG+G  S++SQ          +G  L  GG  F          
Sbjct: 177 HNN-SGFNDREM-GLIGLGRGPLSLISQ----------IGSSLGAGGNMFSQCLVPFNTD 224

Query: 216 ------LFF--GDDLYDSSRVVWTSMSSDYTKYYSP----GVAEL---FFGGETTG-LKN 259
                 + F  G ++  +  V    +S D T Y++      V ++   F  G + G +  
Sbjct: 225 PSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITK 284

Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
             ++ DSG++ TYL    Y  L   ++ +++ +  +     +   LC++           
Sbjct: 285 GNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRI----DGYELCYQTPTNLNG---- 336

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
                TL + F  G       LTP    I     N C  + +  E
Sbjct: 337 ----PTLTIHFEGGDVL----LTPAQMFIPVQDDNFCFAVFDTNE 373


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 127/280 (45%), Gaps = 28/280 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y ++  +G P    +  +DTGSD+ WLQC+ PC +C     P + PS       + C 
Sbjct: 85  GDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCE-PCEQCYNQTTPKFNPSKSSSYKNISCS 143

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             +C S+      +C D   C+Y + Y +   S G L  +      T G+ ++ P+  +G
Sbjct: 144 SKLCQSVR---DTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIG 200

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQL-------HSQKLIRNVVGHCLSGGGGGFL 216
           CG N + G+      G++GLG G +S+++QL        S  L+R  +       G   L
Sbjct: 201 CGTNNI-GSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKL 259

Query: 217 FFGDDLYDSSRVVWTS--MSSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSG 267
            FGD    S   V ++  +  D++ +Y       S G   + F G + G++   ++ DS 
Sbjct: 260 NFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIIIDSS 319

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           +  T++    Y  L S +   ++ + + +   ++   LC+
Sbjct: 320 TIVTFVPSDVYTKLNSAIVDLVTLERVDDP--NQQFSLCY 357


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/174 (33%), Positives = 78/174 (44%), Gaps = 13/174 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
           LL  ++G       Y V + IG P     P ++  DTGSDL+W QC+ PC  C    P+P
Sbjct: 90  LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 148

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
            + PS       + C DP+C  L           A C +   Y DGG+  G LV D F F
Sbjct: 149 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 207

Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
                  G +L   +A GC + +   A      GIL LG GK S V+QL   + 
Sbjct: 208 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRF 261


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/169 (34%), Positives = 77/169 (45%), Gaps = 13/169 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
           LL  ++G       Y V + IG P     P ++  DTGSDL+W QC+ PC  C    P+P
Sbjct: 109 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 167

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
            + PS       + C DP+C  L           A C +   Y DGG+  G LV D F F
Sbjct: 168 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 226

Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
                  G +L   +A GC + +   A      GIL LG GK S V+QL
Sbjct: 227 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 275


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/169 (34%), Positives = 77/169 (45%), Gaps = 13/169 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
           LL  ++G       Y V + IG P     P ++  DTGSDL+W QC+ PC  C    P+P
Sbjct: 108 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 166

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
            + PS       + C DP+C  L           A C +   Y DGG+  G LV D F F
Sbjct: 167 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 225

Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
                  G +L   +A GC + +   A      GIL LG GK S V+QL
Sbjct: 226 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 274


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/174 (33%), Positives = 78/174 (44%), Gaps = 13/174 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
           LL  ++G       Y V + IG P     P ++  DTGSDL+W QC+ PC  C    P+P
Sbjct: 88  LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 146

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
            + PS       + C DP+C  L           A C +   Y DGG+  G LV D F F
Sbjct: 147 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 205

Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
                  G +L   +A GC + +   A      GIL LG GK S V+QL   + 
Sbjct: 206 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRF 259


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 143/364 (39%), Gaps = 55/364 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---APCVRCVEAPHPLYR--PSNDLVP 102
           TG Y V   +G PA+P+ L  DTGSDLTW++C    A       +P  ++R   S    P
Sbjct: 98  TGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAP 157

Query: 103 --CEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
             C    C S       NC  PA  C Y+  Y DG ++ GV+  D+     ++G      
Sbjct: 158 IACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGG 217

Query: 160 ------------LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVV 204
                       + LGC      G S+   DG+L LG    S  S+  ++   +    +V
Sbjct: 218 DSSGGRRAKLQGVVLGCAAT-YDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLV 276

Query: 205 GHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL------- 257
            H        +L FG      +      +    T +Y+  V  ++  GE   +       
Sbjct: 277 DHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDV 336

Query: 258 -KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
            +N   + DSG+S T L    Y+ + + + K L+   L     D           PF+  
Sbjct: 337 DRNGGAILDSGTSLTILATPAYRAVVTALSKHLAG--LPRVTMD-----------PFEYC 383

Query: 317 HDVKKC----FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNV 372
           ++           + + F  G  R   E   ++Y+I +  G  C+G+  G+  G   ++V
Sbjct: 384 YNWTDAGALEIPKMEVHFA-GSAR--LEPPAKSYVIDAAPGVKCIGVQEGSWPG---VSV 437

Query: 373 IGGI 376
           IG I
Sbjct: 438 IGNI 441


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/174 (33%), Positives = 78/174 (44%), Gaps = 13/174 (7%)

Query: 37  LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
           LL  ++G       Y V + IG P     P ++  DTGSDL+W QC+ PC  C    P+P
Sbjct: 87  LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 145

Query: 93  LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
            + PS       + C DP+C  L           A C +   Y DGG+  G LV D F F
Sbjct: 146 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 204

Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
                  G +L   +A GC + +   A      GIL LG GK S V+QL   + 
Sbjct: 205 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRF 258


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 141/364 (38%), Gaps = 61/364 (16%)

Query: 21  SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           SS+S   SL  H G  L          T  Y V++ +G P R   +  DTGSDL+W+QC 
Sbjct: 167 SSASKGVSLPAHRGLRL---------GTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCK 217

Query: 81  APCVRCVEAPHPLYRPSN----DLVPCEDPICASLHAPGHHNCED-----PAQCDYELEY 131
            PC  C +   PL+ PS       VPC           G   C D       +C YE+ Y
Sbjct: 218 -PCNNCYKQHDPLFDPSQSTTYSAVPC-----------GAQECLDSGTCSSGKCRYEVVY 265

Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
            D   + G L +D      ++ Q        GCG +      +   DG+ GLG+ + S+ 
Sbjct: 266 GDMSQTDGNLARDTLTLGPSSDQLQG--FVFGCGDDDT--GLFGRADGLFGLGRDRVSLA 321

Query: 192 SQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAE 247
           SQ  ++        +CL  S    G+L  G          +T+M   SD   +Y   +  
Sbjct: 322 SQAAAR--YGAGFSYCLPSSWRAEGYLSLG-SAAAPPHAQFTAMVTRSDTPSFYYLDLVG 378

Query: 248 LFFGGETTGLKNLPVVF-------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
           +   G T  ++  P VF       DSG+  T L    Y  L S     +  +  K AP  
Sbjct: 379 IKVAGRT--VRVAPAVFKAPGTVIDSGTVITRLPSRAYSALRSSFAGFM--RRYKRAPAL 434

Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
             L  C+     F     V+    ++AL F  G T     L     L ++N+   CL   
Sbjct: 435 SILDTCYD----FTGRTKVQ--IPSVALLFDGGAT---LNLGFGGVLYVANRSQACLAFA 485

Query: 361 NGAE 364
           +  +
Sbjct: 486 SNGD 489


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 138/360 (38%), Gaps = 59/360 (16%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---PLYRPSND----LVPC 103
           +++T+ IG P +P  L +DTGSDL W QC       V A H   P+Y P        +PC
Sbjct: 91  HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPC 150

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            D +C         NC    +C YE  Y    +++GVL  + F F       L  RL  G
Sbjct: 151 SDRLCQEGQF-SFKNCTSKNRCVYEDVYGS-AAAVGVLASETFTFGARRAVSL--RLGFG 206

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
           CG   +   S     GILGL     S+++QL  Q+       +CL+         L FG 
Sbjct: 207 CG--ALSAGSLIGATGILGLSPESLSLITQLKIQRF-----SYCLTPFADKKTSPLLFG- 258

Query: 221 DLYDSSR------VVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLPV--------- 262
            + D SR      +  T++ S+  K   YY P V      G + G K L V         
Sbjct: 259 AMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLV------GISLGHKRLAVPAASLAMRP 312

Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
                 + DSGS+  YL    ++ +   +   +         ED  L      R     +
Sbjct: 313 DGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAM 372

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             V+     L L F  G    L     + Y      G +CL +  G       +++IG +
Sbjct: 373 EAVQ--VPPLVLHFDGGAAMVLPR---DNYFQEPRAGLMCLAV--GKTTDGSGVSIIGNV 425


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/357 (24%), Positives = 143/357 (40%), Gaps = 53/357 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSND----LVPC 103
           G Y + + IG P  PY    DTGSDL W QC APC  +C   P PLY PS+     ++PC
Sbjct: 90  GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 148

Query: 104 ED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRL 160
                +CA+  A           C Y + Y  G +S+     + F F  T  G    P +
Sbjct: 149 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGHARVPGI 207

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFL 216
           A GC      G +     G++GLG+G+ S+VSQL   K       +CL+          L
Sbjct: 208 AFGCS-TASSGFNASSASGLVGLGRGRLSLVSQLGVPKF-----SYCLTPYQDTNSTSTL 261

Query: 217 FFGDDL-------YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
             G            S+  V +  ++    +Y   +  +  G  TT L   P        
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLG--TTALSIPPDAFSLNAD 319

Query: 262 ----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
               ++ DSG++ T L    YQ + + +   ++  +  +   D  L LC+       +  
Sbjct: 320 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPT-TDGSADTGLDLCFM----LPSST 374

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
                  ++ L F          L  ++Y++  + G  CL + N  +    ++N++G
Sbjct: 375 SAPPAMPSMTLHFNGAD----MVLPADSYMMSDDSGLWCLAMQNQTD---GEVNILG 424


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 148/355 (41%), Gaps = 48/355 (13%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           V G    +G Y   + +G PA+  +L LDTGSD+ W+QC+ PC  C +   P++ P++  
Sbjct: 152 VSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSS 210

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C  P C+ L      +     +C Y++ Y DG  ++G L  D   F   N  ++
Sbjct: 211 TYKSLTCSAPQCSLLETSACRS----NKCLYQVSYGDGSFTVGELATDTVTFG--NSGKI 264

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGG 212
           N  +ALGCG++      +    G+LGLG G  SI +Q+ +         +CL    SG  
Sbjct: 265 N-NVALGCGHDN--EGLFTGAAGLLGLGGGVLSITNQMKATSF-----SYCLVDRDSGKS 316

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
               F    L           +     +Y  G++    GGE   L +            V
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           + D G++ T L    Y +L     K L+    K +        C+     F ++  VK  
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLK-LTVNLKKGSSSISLFDTCYD----FSSLSTVK-- 429

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             T+A  FT GK+    +L  + YLI + + G  C      +      L++IG +
Sbjct: 430 VPTVAFHFTGGKS---LDLPAKNYLIPVDDSGTFCFAFAPTSS----SLSIIGNV 477


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 61/361 (16%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + + IG P  P     DTGSDLTWLQ   PC +C     P++ PSN      +PC 
Sbjct: 78  GEYMMNLSIGTPPFPILAIADTGSDLTWLQ-SKPCDQCYPQKGPIFDPSNSTTFHKLPCT 136

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C +L      +C DP  C Y   Y D   + G L  D       + Q  N  +A GC
Sbjct: 137 TAPCNALDESA-RSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRN--VAFGC 193

Query: 165 GYNQVPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCL------------SGG 211
           G     G ++      +    G + S VSQL     I     +CL               
Sbjct: 194 GTRN--GGNFDEQGSGIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLENEISSQPSDSP 249

Query: 212 GGGFLFFGDD-LYDSSR---VVWTS---MSSDYTKYYSPGVAELFFG------------- 251
               + FGD+ ++ SS    VV+ +   ++ + + YY   +  +  G             
Sbjct: 250 ATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKT 309

Query: 252 -----GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
                G  + ++   ++ DSG++ T+L    Y  L + + +E+  + + +  ++    LC
Sbjct: 310 ASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDV-KNSMFSLC 368

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
           +K  +    +  +K  FR  A            EL P    + + +G VC  +L   +VG
Sbjct: 369 FKSGKEEVELPLMKVHFRGGA----------DVELKPVNTFVRAEEGLVCFTMLPTNDVG 418

Query: 367 L 367
           +
Sbjct: 419 I 419


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 122/286 (42%), Gaps = 42/286 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y ++  +G P  P +  +DT SD+ W+QC   C  C     P++ PS       +PC 
Sbjct: 86  GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQL-CETCYNDTSPMFDPSYSKTYKNLPCS 144

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C S+      + ++   C++ + Y DG  S G L+ +       N   ++ PR  +G
Sbjct: 145 STTCKSVQGTSCSS-DERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIG 203

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGD- 220
           C  N     S+  + GI+GLG G  S+V QL S   I     +CL+        L FGD 
Sbjct: 204 CIRNT--NVSFDSI-GIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLKFGDA 258

Query: 221 -----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDSG 267
                D   S+R+V+     D+ K+Y   +     G      ++          ++ DSG
Sbjct: 259 AMVSGDGTVSTRIVF----KDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSG 314

Query: 268 SSYTYLNRVTYQTLTS----IMKKELSAKSLKEAPEDETLPLCWKG 309
           +++T L    Y  L S    ++K E +   LK+        LC+K 
Sbjct: 315 TTFTVLPDDVYSKLESAVADVVKLERAEDPLKQ------FSLCYKS 354


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 131/327 (40%), Gaps = 39/327 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y   + +G P+  Y + +DTGS LTWLQC    V C     PL+ P        V C 
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCS 191

Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L A       C     C Y+  Y D   S+G L  D  +F    G    P    
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSF----GSTRYPSFYY 247

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDD 221
           GCG +      +    G++GL + K S++ QL     +     +CL +    G+L  G  
Sbjct: 248 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP- 302

Query: 222 LYDS----SRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTY 272
            Y++    S     S S D + Y+   ++ +  GG    +      +LP + DSG+  T 
Sbjct: 303 -YNTGHYYSYTPMASSSLDASLYFI-TLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITR 360

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    +  L+  + + ++    + AP    L  C++G+     V        T+A++F  
Sbjct: 361 LPTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFEGQASQLRV-------PTVAMAFAG 411

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
           G +    +LT    LI  +    CL  
Sbjct: 412 GAS---MKLTTRNVLIDVDDSTTCLAF 435


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 128/326 (39%), Gaps = 41/326 (12%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           S  SSS+   +GSSL          T  Y +++ +G PA    + +DTGSD++W+QC+ P
Sbjct: 108 SKVSSSVPTKLGSSL---------DTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCN-P 157

Query: 83  CVR--CVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
           C    C      L+ P+       V C    CA L   G+       +C Y ++Y DG +
Sbjct: 158 CPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGST 217

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           + G   +D      +           GC +  V        DG++GLG G  S+VSQ  +
Sbjct: 218 TNGTYSRDTLTL--SGASDAVKGFQFGCSH--VESGFSDQTDGLMGLGGGAQSLVSQ--T 271

Query: 197 QKLIRNVVGHCL---SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
                N   +CL   SG  G     G              S     +Y   + ++  GG+
Sbjct: 272 AAAYGNSFSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGK 331

Query: 254 TTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
             GL   P VF      DSG+  T L    Y  L+S  K  +  K  + AP    L  C 
Sbjct: 332 QLGLS--PSVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGM--KQYRSAPARSILDTC- 386

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDG 333
                F      +    T+AL F+ G
Sbjct: 387 -----FDFAGQTQISIPTVALVFSGG 407


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/352 (24%), Positives = 142/352 (40%), Gaps = 43/352 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSND----LVPC 103
           G Y + + IG P  PY    DTGSDL W QC APC  +C   P PLY PS+     ++PC
Sbjct: 30  GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 88

Query: 104 ED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRL 160
                +CA+  A           C Y + Y  G +S+     + F F  T  G    P +
Sbjct: 89  NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGHARVPGI 147

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL---- 216
           A GC      G +     G++GLG+G+ S+VSQL   K    +  +  +      L    
Sbjct: 148 AFGCS-TASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNSTSTLLLGPS 206

Query: 217 --FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------V 262
               G     S+  V +  ++    +Y   +  +  G  TT L   P            +
Sbjct: 207 ASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLG--TTALSIPPDAFSLNADGTGGL 264

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           + DSG++ T L    YQ + + +   ++  +  +   D  L LC+       +       
Sbjct: 265 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPT-TDGSADTGLDLCFM----LPSSTSAPPA 319

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
             ++ L F          L  ++Y++  + G  CL + N  +    ++N++G
Sbjct: 320 MPSMTLHFNGAD----MVLPADSYMMSDDSGLWCLAMQNQTD---GEVNILG 364


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 153/353 (43%), Gaps = 43/353 (12%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G  + +G Y V + IG P +  +L +DTGSD+ W+QC +PC  C +    ++ P    S 
Sbjct: 6   GLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSF 64

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             + C  P C  L      + ++  +C Y++ Y DG  ++G L  D+F+    +  R +P
Sbjct: 65  RRLSCSTPQCKLLDVKACASTDN--RCLYQVSYGDGSFTVGDLASDSFS---VSRGRTSP 119

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
            +  GCG++      +    G+LGLG GK S  SQL S+K    +V           L F
Sbjct: 120 -VVFGCGHDN--EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLF 176

Query: 219 GDD-LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVF 264
           GD  L  S+   +T +  +     +Y  G++ +  GG    + +             V+ 
Sbjct: 177 GDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVII 236

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG+S T L    Y  +    +   + + L  A +      C+     F  +  V     
Sbjct: 237 DSGTSVTRLPTYAYTVMRDAFRS--ATQKLPRAADFSLFDTCYD----FSALTSVT--IP 288

Query: 325 TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           T++  F  G +    +L P  YL+ +   G  C      ++  L DL++IG I
Sbjct: 289 TVSFHFEGGAS---VQLPPSNYLVPVDTSGTFCFAF---SKTSL-DLSIIGNI 334


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 86/298 (28%), Positives = 114/298 (38%), Gaps = 53/298 (17%)

Query: 16  RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
           RM+  S + ++  L +   + +    + N  PT  Y V + IG P +P  L LDTGSDL 
Sbjct: 47  RMALRSKARAARRLSSSASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLI 106

Query: 76  WLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-----QCD 126
           W QC  PC  C +   P + P    +  L  C+  +C  L      +C  P       C 
Sbjct: 107 WTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCV 162

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
           Y   Y D   + G L  D F F         P +A GCG +N   G       GI G G+
Sbjct: 163 YTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGCGLFNN--GVFKSNETGIAGFGR 218

Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDDLYDSSRVVWTSM-----SS 235
           G  S+ SQL           HC +   G       L    DLY S R    S       +
Sbjct: 219 GPLSLPSQLKVGNF-----SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPA 273

Query: 236 DYTKYYSPGVAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQ 279
           + T YY      L   G T G   LPV              + DSG++ T L    Y+
Sbjct: 274 NPTFYY------LSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYR 325


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 57/159 (35%), Positives = 76/159 (47%), Gaps = 11/159 (6%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSND----LVPC 103
           G Y + + IG P  PY    DTGSDL W QC APC  +C   P PLY PS+     ++PC
Sbjct: 88  GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 146

Query: 104 ED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRL 160
                +CA+  A           C Y + Y  G +S+     + F F  T  GQ   P +
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGQSRVPGI 205

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           A GC      G +     G++GLG+G+ S+VSQL   K 
Sbjct: 206 AFGCS-TASSGFNASSASGLVGLGRGRLSLVSQLGVPKF 243


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/351 (23%), Positives = 141/351 (40%), Gaps = 55/351 (15%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------ 100
            T+ IG P   + + LDTGSDL W+ CD  C RC       +    DL            
Sbjct: 98  TTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAATDSSAFASDFDLNVYNPNGSSTSK 155

Query: 101 -VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR-- 155
            V C + +C  +H      C    + C Y + Y    +S  G+LV+D       +     
Sbjct: 156 KVTCNNSLC--MH---RSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDL 210

Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +   +  GCG  Q+   S+  +   +G+ GLG  K S+ S L  +    +    C    G
Sbjct: 211 VEANVIFGCG--QIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDG 268

Query: 213 GGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
            G + FGD   +D     +    S  T  Y+  V ++  G     ++    +FDSG+S+T
Sbjct: 269 IGRISFGDKGSFDQDETPFNLNPSHPT--YNITVTQVRVGTTLIDVE-FTALFDSGTSFT 325

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK-----KCFRTL 326
           YL   TY  LT     ++  +  +              R PF+  +D+          ++
Sbjct: 326 YLVDPTYTRLTESFHSQVQDRRHRS-----------DSRIPFEYCYDMSPDANTSLIPSV 374

Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
           +L+   G    ++    +  +IIS +  +  CL ++  AE+ +   N + G
Sbjct: 375 SLTMGGGSHFAVY----DPIIIISTQSELVYCLAVVKTAELNIIGQNFMTG 421


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 167/376 (44%), Gaps = 50/376 (13%)

Query: 17  MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGY-YNVTMYIGQPARPYFLDLDTGSDLT 75
           M++ ++SSS SS+           V   ++P G  Y + + +G P + +    DTGSDL 
Sbjct: 26  MAARANSSSWSSMAGTT------DVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLV 79

Query: 76  WLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYA 132
           W+Q + PC  C       P    +   + C   +CA L  PG  +CE   + C Y  EY 
Sbjct: 80  WVQSE-PCTGCSGGTIFDPRQSSTFREMDCSSQLCAEL--PG--SCEPGSSTCSYSYEYG 134

Query: 133 DGGSSLGVLVKDAFAFNYT-NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
             G + G   +D  +   T +G +  P  A+GCG   +  + +  +DG++GLG+G  S+ 
Sbjct: 135 S-GETEGEFARDTISLGTTSDGSQKFPSFAVGCG---MVNSGFDGVDGLVGLGQGPVSLT 190

Query: 192 SQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL------YDSSRVVWTSMSSDYTKYY 241
           SQL +   I +   +CL    S      L FG           S+++  T  S  Y  YY
Sbjct: 191 SQLSAA--IDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKI--TPPSDTYPTYY 246

Query: 242 SPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
              V  +   G+T G     ++ DSG++ TY+    Y  + S M+  ++   +  +    
Sbjct: 247 LLTVNGIAVAGQTMGSPGTTII-DSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS--SM 303

Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN-VCLGIL 360
            L LC+  R   +N       F  L +        T+   +   +L++ + G+ VCL + 
Sbjct: 304 GLDLCYD-RSSNRNYK-----FPALTIRLAGA---TMTPPSSNYFLVVDDSGDTVCLAM- 353

Query: 361 NGAEVGLQDLNVIGGI 376
            G+  GL  +++IG +
Sbjct: 354 -GSASGLP-VSIIGNV 367


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 91/332 (27%), Positives = 135/332 (40%), Gaps = 43/332 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +  YIG P        DT SDL W+QC +PC  C     PL+ P        + C+
Sbjct: 88  GEYLMRFYIGTPPVERLAIADTASDLIWVQC-SPCETCFPQDTPLFEPHKSSTFANLSCD 146

Query: 105 DPICAS---LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRL 160
              C S    + P   N      C Y   Y DG S+ GVL  ++  F     Q +  P+ 
Sbjct: 147 SQPCTSSNIYYCPLVGNL-----CLYTNTYGDGSSTKGVLCTESIHF---GSQTVTFPKT 198

Query: 161 ALGCGYNQ-VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFL 216
             GCG N        + + GI+GLG G  S+VSQL  Q  I +   +CL   +      L
Sbjct: 199 IFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKL 256

Query: 217 FFGDDLYDSSR-VVWTSMSSD--YTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGS 268
            FG+D   +   VV T +  D  Y  YY   +  +  G +     TT   N  ++ D G+
Sbjct: 257 KFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGT 316

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
             TYL    Y    +++++ L    + E  +D   P  +     F N  ++   F  +  
Sbjct: 317 VLTYLEVNFYHNFVTLLREAL---GISETKDDIPYPFDFC----FPNQANIT--FPKIVF 367

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
            FT  K   +F      +    +   +CL +L
Sbjct: 368 QFTGAK---VFLSPKNLFFRFDDLNMICLAVL 396


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 148/355 (41%), Gaps = 48/355 (13%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           V G    +G Y   + +G PA+  +L LDTGSD+ W+QC+ PC  C +   P++ P++  
Sbjct: 152 VSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSS 210

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C  P C+ L      +     +C Y++ Y DG  ++G L  D   F   N  ++
Sbjct: 211 TYKSLTCSAPQCSLLETSACRS----NKCLYQVSYGDGSFTVGELATDTVTFG--NSGKI 264

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGG 212
           N  +ALGCG++      +    G+LGLG G  SI +Q+ +         +CL    SG  
Sbjct: 265 N-NVALGCGHDN--EGLFTGAAGLLGLGGGVLSITNQMKATSF-----SYCLVDRDSGKS 316

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
               F    L           +     +Y  G++    GGE   L +            V
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
           + D G++ T L    Y +L     K L+    K +        C+     F ++  VK  
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLK-LTVNLKKGSSSISLFDTCYD----FSSLSTVK-- 429

Query: 323 FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             T+A  FT GK+    +L  + YLI + + G  C      +      L++IG +
Sbjct: 430 VPTVAFHFTGGKS---LDLPAKNYLIPVDDSGTFCFAFAPTSS----SLSIIGNV 477


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 107/401 (26%), Positives = 164/401 (40%), Gaps = 61/401 (15%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQVH--------------GNVYPTGYYNVTMYIGQP 60
           V+ + S SS++  SLF +  S+ +FQ H              G  +  G Y  ++ +G P
Sbjct: 54  VKANPSPSSAAQKSLFPY--SAHIFQQHTKNPAALRSSTTTLGRKF--GEYYTSIKLGSP 109

Query: 61  ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCED-PICASLHAPG 115
            +   L +DTGS+LTWL+C  PC  C  +   +Y  +  +    V C +  +C++     
Sbjct: 110 GQEAILIVDTGSELTWLKC-LPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSSQGT 168

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR--LNPRLALGCGYNQ---VP 170
           +  C   +QC +   Y DG  S G L  D        G +       A GC       VP
Sbjct: 169 YAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVP 228

Query: 171 -GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GGGGFLFFGDDLYD 224
            GAS     GILGL  GK ++  QL  +   +    HC           G +FFG+    
Sbjct: 229 TGAS-----GILGLNAGKMALPMQLGQRFGWK--FSHCFPDRSSHLNSTGVVFFGNAELP 281

Query: 225 SSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGL--KNLPVVFDSGSSYTYLNRVTY 278
             +V +TS+    S    K+Y   +  +        L  +   V+ DSGSS++   R  +
Sbjct: 282 HEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVVILDSGSSFSSFVRPFH 341

Query: 279 QTLTSIMKKELSAKSLKEAPEDE--TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT- 335
             L     K     SLK    D    L  C+K      ++ ++ +   +L+L F DG T 
Sbjct: 342 SQLREAFLKH-RPPSLKHLEGDSFGDLGTCFKVSN--DDIDELHRTLPSLSLVFEDGVTI 398

Query: 336 --RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
              ++  L P A     N   +C    +G   G   +NVIG
Sbjct: 399 GIPSIGVLLPVARY--QNHVKMCFAFEDG---GPNPVNVIG 434


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 82/305 (26%), Positives = 125/305 (40%), Gaps = 37/305 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
           Y VT+ +G P     +++DTGSD++W+QC  PC    C      L+ P+       VPC 
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C+ L       C   +QC Y + Y DG ++ GV   D  A     G  +   L  GC
Sbjct: 202 ADACSELRIY-EAGCSG-SQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FGC 256

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL 222
           G+ Q     +  +DG+L LG+   S+ SQ  +      V  +CL       G+L  G   
Sbjct: 257 GHAQA--GMFAGIDGLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLGGPT 312

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYL 273
             +S    T +    T + +P    +   G + G + + V         V D+G+  T L
Sbjct: 313 -SASGFATTGL---LTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRL 368

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L S  +  ++      AP +  L  C+     F     V     T+AL+F+ G
Sbjct: 369 PPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYD----FSRYGVVT--LPTVALTFSGG 422

Query: 334 KTRTL 338
            T  L
Sbjct: 423 ATLAL 427


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 121/282 (42%), Gaps = 30/282 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCE 104
           T  Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C 
Sbjct: 79  TSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCG 136

Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             +C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + G
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFG 193

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL- 222
           C  +      +  +DG+LG+G G  S++ Q        +   +CL        FF     
Sbjct: 194 CNMDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTG 250

Query: 223 YDSSRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGS 268
           Y S   V T     YTK  +     ELFF         GE  GL         VVFDSGS
Sbjct: 251 YFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGS 310

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
             +Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLKRG---AAEEESERNCYDMR 349


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 74/157 (47%), Gaps = 12/157 (7%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
           G+   TG Y VT+ +G P R      DTGSDLTW QC+ PC R C     P++ PS    
Sbjct: 130 GSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCE-PCARYCYHQQEPIFNPSKSTS 188

Query: 101 ---VPCEDPICASLHA-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
              + C  P C  L +  G+      + C Y ++Y D   S+G   +D  A   T+   +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD---V 245

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
                 GCG N      +  + G++GLG+   S++S+
Sbjct: 246 FNNFLFGCGQNNR--GLFVGVAGLIGLGRNALSLMSK 280


>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 145/350 (41%), Gaps = 61/350 (17%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLY----RPSNDLVPC 103
           +G P   + + LDTGSDL WL C+  C +CV         +    +Y      ++  V C
Sbjct: 107 VGTPPLSFLVALDTGSDLFWLPCN--CTKCVHGIGLSNGEKIAFNIYDLKGSSTSQPVLC 164

Query: 104 EDPICASLHAPGHHNC-EDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPR 159
              +C          C      C YE+ Y ++G S+ G LV+D      +    +  + R
Sbjct: 165 NSSLCEL-----QRQCPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDDDKTKDADTR 219

Query: 160 LALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
           +  GCG  Q    + GA+    +G+ GLG    S+ S L  + L  N    C    G G 
Sbjct: 220 ITFGCGQVQTGAFLDGAAP---NGLFGLGMSNESVPSILAKEGLTSNSFSMCFGSDGLGR 276

Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKNLPVVFDSG 267
           + FGD+         +S+    T +        Y+  V ++  G +   L+    +FDSG
Sbjct: 277 ITFGDN---------SSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLE-FHAIFDSG 326

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           +S+TYLN   Y+ +T+    E+  +    +  +E          PF+  +++    +T+ 
Sbjct: 327 TSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNEL---------PFEYCYELSPN-QTVE 376

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
           LS           L  +  + +S +G   +CLG+L    V +   N + G
Sbjct: 377 LSINLTMKGGDNYLVTDPIVTVSGEGINLLCLGVLKSNNVNIIGQNFMTG 426


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 90/332 (27%), Positives = 132/332 (39%), Gaps = 36/332 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y V + +G P   + +  DTGSD TW+QC    V C +    L+ P+       V C
Sbjct: 160 TANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSC 219

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            DP CA L A G   C +   C Y ++Y DG  ++G   KD  A      Q        G
Sbjct: 220 ADPACADLDASG---C-NAGHCLYGIQYGDGSYTVGFFAKDTLAV----AQDAIKGFKFG 271

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFF--G 219
           CG        +    G+LGLG+G +SI  Q + +        +CL  S    G+L F   
Sbjct: 272 CGEKNR--GLFGQTAGLLGLGRGPTSITVQAYEK--YGGSFSYCLPASSAATGYLEFGPL 327

Query: 220 DDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGETTG------LKNLPVVFDSGSSYTY 272
                 S    T M +D    +Y  G+  +  GG+  G        N   + DSG+  T 
Sbjct: 328 SPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTLVDSGTVITR 387

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    Y  L+S     ++A   K+A     L  C+     F  +  V     T++L F  
Sbjct: 388 LPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYD----FTGLSQVS--LPTVSLVFQG 441

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
           G      +L     +   ++  VCLG  +  +
Sbjct: 442 G---ACLDLDASGIVYAISQSQVCLGFASNGD 470


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 91/319 (28%), Positives = 124/319 (38%), Gaps = 68/319 (21%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
           Y  T+  G PA P  + +DTGSDLTWLQC  PC   +C     PL+ PS+      VPC 
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCK-PCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170

Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              C  L A  +   C +   C + + Y DG S++GV  KD              +L L 
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKD--------------KLTLA 216

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIV-------------SQLHSQKLIRNVVGHCLSG 210
                 PGA     D   G G  KSS+                L +Q        +CL  
Sbjct: 217 ------PGAIVK--DFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPA 268

Query: 211 GGG--GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------- 261
                GFL FG    + S  V+T M     +   P  + +   G T G K L        
Sbjct: 269 VNSKPGFLAFGAG-RNPSGFVFTPMGRVPGQ---PTFSTVTLAGITVGGKKLDLRPSAFS 324

Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
             ++ DSG+  T L    Y+ L +  ++ + A  L     D    L       +KNV   
Sbjct: 325 GGMIVDSGTVVTVLQSTVYRALRAAFREAMKAYRLVHGDLDTCYDLTG-----YKNVVVP 379

Query: 320 KKCFRTLALSFTDGKTRTL 338
           K     +AL+F+ G T  L
Sbjct: 380 K-----IALTFSGGATINL 393


>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
 gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
          Length = 455

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 127/290 (43%), Gaps = 48/290 (16%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SND 99
            T+ +G P   + + LDTGSDL W+ CD  C +C          E    +Y P    +N 
Sbjct: 109 TTVKLGTPGMRFMVALDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNK 166

Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQR 155
            V C + +CA       + C    + C Y + Y    +S  G+L++D         N +R
Sbjct: 167 KVTCNNSLCAQ-----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER 221

Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +   +  GCG  QV   S+  +   +G+ GLG  K S+ S L  + L+ +    C    G
Sbjct: 222 VEAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDG 279

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYT 271
            G + FGD           +++  +   Y+  V  +  G  TT + +    +FD+G+S+T
Sbjct: 280 VGRISFGDKGSSDQEETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFT 336

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           YL    Y T++       SA+  + +P+          R PF+  +D+++
Sbjct: 337 YLVDPMYTTVSE------SAQDKRHSPD---------SRIPFEYCYDMRE 371


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 141/354 (39%), Gaps = 52/354 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + +++G P + + L LDTGSDL W+QC  PC+ C E   P Y P +      + C
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISC 252

Query: 104 EDPICASLHAPGHHN-CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NGQ---RL 156
            DP C  + AP     C+   Q C Y   Y DG ++ G    + F  N T  NG    + 
Sbjct: 253 HDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKH 312

Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
              +  GCG +N+     +H   G+LGLGKG  S  SQ+  Q L      +CL     + 
Sbjct: 313 VENVMFGCGHWNR---GLFHGAAGLLGLGKGPLSFASQM--QSLYGQSFSYCLVDRNSNA 367

Query: 211 GGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
                L FG+D  L     + +TS           +Y   +  +    E   +       
Sbjct: 368 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHL 427

Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
                   + DSG++ TY     Y+ +     +++    L E      LP      +P  
Sbjct: 428 SSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEG-----LP----PLKPCY 478

Query: 315 NVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
           NV  ++K       + F D     ++    E Y I  +   VCL IL      L
Sbjct: 479 NVSGIEKMELPDFGILFAD---EAVWNFPVENYFIWIDPEVVCLAILGNPRSAL 529


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 99/368 (26%), Positives = 148/368 (40%), Gaps = 70/368 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + +Y+G P R + + +DTGSDL WLQC APC+ C E   P++ P+       V C
Sbjct: 148 SGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTC 206

Query: 104 EDPICASL------HAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
            D  C  +       A     C  P +  C Y   Y D  ++ G L  ++F  N T    
Sbjct: 207 GDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 266

Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLS 209
            R    +  GCG+       +H   G+LGLG+G  S  SQL      R V GH    CL 
Sbjct: 267 SRRVDGVVFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQL------RAVYGHTFSYCLV 318

Query: 210 GGG---GGFLFFGDDLYDSSRVVWTSMSSDYTK-------------YYSPGVAELFFGGE 253
             G   G  + FG+D  D +  +       YT              +Y   +  +  GGE
Sbjct: 319 DHGSDVGSKVVFGED--DDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGE 376

Query: 254 TTGLKNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
              + +             + DSG++ +Y     YQ +       +S +S    PE   L
Sbjct: 377 LLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMS-RSYPLVPEFPVL 435

Query: 304 PLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGN---VCLGI 359
             C+       NV  V++     L+L F DG    +++   E Y I  +      +CL +
Sbjct: 436 SPCY-------NVSGVERPEVPELSLLFADG---AVWDFPAENYFIRLDPDGGSIMCLAV 485

Query: 360 LNGAEVGL 367
           L     G+
Sbjct: 486 LGTPRTGM 493


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/305 (26%), Positives = 124/305 (40%), Gaps = 37/305 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
           Y VT+ +G P     +++DTGSD++W+QC  PC    C      L+ P+       VPC 
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C+ L       C   +QC Y + Y DG ++ GV   D  A     G  +   L  GC
Sbjct: 202 ADACSELRIY-EAGCSG-SQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FGC 256

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL 222
           G+ Q     +  +DG+L LG+   S+ SQ  +      V  +CL       G+L  G   
Sbjct: 257 GHAQA--GMFAGIDGLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLGGP- 311

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYL 273
              S     + +   T + +P    +   G + G + + V         V D+G+  T L
Sbjct: 312 ---SSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRL 368

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L S  +  ++      AP +  L  C+     F     V     T+AL+F+ G
Sbjct: 369 PPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYD----FSRYGVVT--LPTVALTFSGG 422

Query: 334 KTRTL 338
            T  L
Sbjct: 423 ATLAL 427


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 82/305 (26%), Positives = 128/305 (41%), Gaps = 32/305 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
           Y VT  +G P     +++DTGSDL+W+QC  PC     C     PL+ P+       VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+CA L       C   AQC Y + Y DG ++ GV   D    + ++  +       G
Sbjct: 199 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 254

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
           CG+ Q     ++ +DG+LGLG+ + S+V Q  +      V  +CL       G+L  G  
Sbjct: 255 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTAGYLTLGLG 310

Query: 222 LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYL 273
               +   +++     S +   YY   +  +  GG+   +         V D+G+  T L
Sbjct: 311 GPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRL 370

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L S  +  +++     AP +  L  C+     F     V      +AL+F  G
Sbjct: 371 PPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN----FAGYGTVT--LPNVALTFGSG 424

Query: 334 KTRTL 338
            T  L
Sbjct: 425 ATVML 429


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 152/353 (43%), Gaps = 43/353 (12%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G  + +G Y V + IG P +  +L +DTGSD+ W+QC +PC  C +    ++ P    S 
Sbjct: 6   GLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSF 64

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
             + C  P C  L      + ++  +C Y++ Y DG  ++G L  D+F     +  R +P
Sbjct: 65  RRLSCSTPQCKLLDVKACASTDN--RCLYQVSYGDGSFTVGDLASDSF---LVSRGRTSP 119

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
            +  GCG++      +    G+LGLG GK S  SQL S+K    +V           L F
Sbjct: 120 -VVFGCGHDN--EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLF 176

Query: 219 GDD-LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVF 264
           GD  L  S+   +T +  +     +Y  G++ +  GG    + +             V+ 
Sbjct: 177 GDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVII 236

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG+S T L    Y  +    +   + + L  A +      C+     F  +  V     
Sbjct: 237 DSGTSVTRLPTYAYTVMRDAFRS--ATQKLPRAADFSLFDTCYD----FSALTSVT--IP 288

Query: 325 TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           T++  F  G +    +L P  YL+ +   G  C      ++  L DL++IG I
Sbjct: 289 TVSFHFEGGAS---VQLPPSNYLVPVDTSGTFCFAF---SKTSL-DLSIIGNI 334


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 143/360 (39%), Gaps = 69/360 (19%)

Query: 51  YNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV----PCED 105
           YNV  + IG P +P    +D   +L W QC   C RC +   PL+ P+        PC  
Sbjct: 66  YNVANFTIGTPPQPASAIIDVAGELVWTQCSM-CSRCFKQDLPLFVPNASSTFRPEPCGT 124

Query: 106 PICASLHAPGHHNCEDPAQCDYE--LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             C S+      NC     C YE  +    GG +LG++  D FA            L  G
Sbjct: 125 DACKSIPT---SNCSS-NMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS-----LGFG 175

Query: 164 C----GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
           C    G + + G S     G++GLG+  SS+VSQ++  K    +  H    G    L  G
Sbjct: 176 CVVASGIDTMGGPS-----GLIGLGRAPSSLVSQMNITKFSYCLTPH--DSGKNSRLLLG 228

Query: 220 DDLY-------DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--KNLPVVFDSGSSY 270
                       ++  V TS   D ++YY   +  +  G     L      V+  + +  
Sbjct: 229 SSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPM 288

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS- 329
           ++L    YQ L    KKE++ K++  AP    L       +PF        CF    LS 
Sbjct: 289 SFLVDSAYQAL----KKEVT-KAVGAAPTATPL-------QPF------DLCFPKAGLSN 330

Query: 330 -------FTDGKTRTLFELTPEAYLII--SNKGNVCLGILNGAEVGL----QDLNVIGGI 376
                  FT  +      + P  YLI     KG VC+ IL+ + +      ++LN++G +
Sbjct: 331 ASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSL 390


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 90/384 (23%), Positives = 152/384 (39%), Gaps = 64/384 (16%)

Query: 18  SSSSSSSSSSSLFNHVGSSLLFQVHGNVY-----PTGYYNVTMY---IGQPARPYFLDLD 69
           SS S  +S  ++++H    +L Q   N Y     P+  Y V +    IG+P  P    +D
Sbjct: 52  SSLSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLAVMD 111

Query: 70  TGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYE 128
           TGS LTW+ C  PC  C +   P++ PS      +    ++L     + C+    +C Y 
Sbjct: 112 TGSSLTWVMCH-PCSSCSQQSVPIFDPS------KSSTYSNLSCSECNKCDVVNGECPYS 164

Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYN---QVPGASYHPLDGILGLG 184
           +EY   GSS G+  ++       +   +  P L  GCG        G  Y  ++G+ GLG
Sbjct: 165 VEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLG 224

Query: 185 KGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVW---TSMSSDYTK-- 239
            G+ S++     +        +C+           +  Y  +R+V     +M  D T   
Sbjct: 225 SGRFSLLPSFGKK------FSYCIGN-------LRNTNYKFNRLVLGDKANMQGDSTTLN 271

Query: 240 ----YYSPGVAELFFGGETTGL-----------KNLPVVFDSGSSYTYLNRVTYQTLTSI 284
                Y   +  +  GG    +            N  V+ DSG+ +T+L +  ++ L S 
Sbjct: 272 VINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVL-SF 330

Query: 285 MKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELT 342
             + L    L  A +D+  P  LC+ G      V      F  +   F +G    + +L 
Sbjct: 331 EVENLLEGVLVLAQQDKHNPYTLCYSGV-----VSQDLSGFPLVTFHFAEG---AVLDLD 382

Query: 343 PEAYLIISNKGNVCLGILNGAEVG 366
             +  I + +   C+ +L G   G
Sbjct: 383 VTSMFIQTTENEFCMAMLPGNYFG 406


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 150/393 (38%), Gaps = 81/393 (20%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC---------DAPCVRCVEAPHPL----- 93
           TG Y V   +G PA+P+ L  DTGSDLTW++C                + AP P      
Sbjct: 84  TGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT 143

Query: 94  YRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDA--F 146
           +RP        +PC    C          C  PA  C Y+  Y DG ++ G +  D+   
Sbjct: 144 FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATI 203

Query: 147 AFNYTNGQRLNPR-LALGC--GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLI 200
           A +    ++   R + LGC   YN   G S+   DG+L LG    S  S+  S+   +  
Sbjct: 204 ALSGRAARKAKLRGVVLGCTTSYN---GQSFLASDGVLSLGYSNISFASRAASRFGGRFS 260

Query: 201 RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSS------------------------- 235
             +V H        +L FG +   SSR     ++S                         
Sbjct: 261 YCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLD 320

Query: 236 -DYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTS 283
                +Y+  V  +   GE   L  +P            + DSG+S T L +  Y+ + +
Sbjct: 321 HRTRPFYAVTVKGVSVAGE---LLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVA 377

Query: 284 IMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTP 343
            + K L+   L     D     C+    P  +  DV      LA+ F  G  R   E   
Sbjct: 378 ALSKRLAG--LPRVTMDP-FDYCYNWTSP--SGSDVAAPLPMLAVHFA-GSAR--LEPPA 429

Query: 344 EAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           ++Y+I +  G  C+G+  G   G   L+VIG I
Sbjct: 430 KSYVIDAAPGVKCIGLQEGPWPG---LSVIGNI 459


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 109/268 (40%), Gaps = 34/268 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           T  Y V + +G P RP  L LDTGSDL W QC APC  C +   P+  P+       +PC
Sbjct: 81  TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQC-APCRDCFDQDLPVLDPAASSTYAALPC 139

Query: 104 EDPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP- 158
               C +L   + G     +   C Y   Y D   ++G +  D F F  +  +G+ L+  
Sbjct: 140 GAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGGF 215
           RL  GCG+    G       GI G G+G+ S+ SQL+          +C +         
Sbjct: 200 RLTFGCGHLN-KGVFQSNETGIAGFGRGRWSLPSQLNVTSF-----SYCFTSMFESKSSL 253

Query: 216 LFFGDD---LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------VF 264
           +  G     LY  +       +        P +  L   G + G   LPV        + 
Sbjct: 254 VTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTII 313

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAK 292
           DSG+S T L    Y+     +K E +A+
Sbjct: 314 DSGASITTLPEEVYEA----VKAEFAAQ 337


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
           + G    +G Y   + IG PAR  ++ LDTGSD+ WLQC  PC  C     P++ PS+  
Sbjct: 141 ISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSS 199

Query: 99  --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
             + + C+ P C +L      N    A C YE+ Y DG  ++G    +      T G  L
Sbjct: 200 SYEPLSCDTPQCNALEVSECRN----ATCLYEVSYGDGSYTVGDFATETL----TIGSTL 251

Query: 157 NPRLALGCGYN 167
              +A+GCG++
Sbjct: 252 VQNVAVGCGHS 262


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 110/268 (41%), Gaps = 32/268 (11%)

Query: 39  FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
           F V G   P+  G Y   + +G P R  ++ +DTGSD+ W+ C + C  C +        
Sbjct: 63  FPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQL 121

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD-- 144
               P    ++ L+ C D  C S       +C     QC Y  +Y DG  + G  V D  
Sbjct: 122 NYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLM 181

Query: 145 --AFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
             A  F  T     +  +  GC   Q      S   +DGI G G+   S++SQL SQ + 
Sbjct: 182 HFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIA 241

Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL- 257
             V  HCL G   GGG L  G+ +     +V++ +      +Y+  +  +   G+   + 
Sbjct: 242 PRVFSHCLKGDNSGGGVLVLGEIV--EPNIVYSPLVPS-QPHYNLNLQSISVNGQIVRIA 298

Query: 258 -------KNLPVVFDSGSSYTYLNRVTY 278
                   N   + DSG++  YL    Y
Sbjct: 299 PSVFATSNNRGTIVDSGTTLAYLAEEAY 326


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 10/178 (5%)

Query: 21  SSSSSSSSLFNHVGSSLLFQVHGNVYPT---GYYNVTMYIGQPARPYFLDLDTGSDLTWL 77
           S S+ + S  +++ ++ +  +  +V P      +   + IG P  P  L +DTGSDLTW+
Sbjct: 55  SKSTPAPSRLDNLWTTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWI 114

Query: 78  QCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHA-PGHHNCEDPAQCDYELEYADGGS 136
           QC  PC +C     P + PS           ++ HA P     E    C Y L Y D  +
Sbjct: 115 QC-LPC-KCYPQTIPFFHPSRSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRYRDFSN 172

Query: 137 SLGVLVKDAFAFNYTN-GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
           + G+L K+   F  ++ G    P +  GCG +      Y    G+LGLG G  SIV++
Sbjct: 173 TRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNSGFTQY---SGVLGLGPGTFSIVTR 227


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 130/326 (39%), Gaps = 41/326 (12%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           S  SSS+   +GSSL          T  Y +++ +G PA    + +DTGSD++W+QC+ P
Sbjct: 108 SKVSSSVPTKLGSSL---------DTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCN-P 157

Query: 83  CVR--CVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
           C    C      L+ P+       V C    CA L   G+       +C Y ++Y DG +
Sbjct: 158 CPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGST 217

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           + G   +D      +           GC +  +        DG++GLG G  S+VSQ  +
Sbjct: 218 TNGTYSRDTLTL--SGASDAVKGFQFGCSH--LESGFSDQTDGLMGLGGGAQSLVSQ--T 271

Query: 197 QKLIRNVVGHCL-SGGGGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE 253
                N   +CL    G            +S  V T M  S     +Y   + ++  GG+
Sbjct: 272 AAAYGNSFSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGK 331

Query: 254 TTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
             GL   P VF      DSG+  T L    Y  L+S  K  +  K  + AP    L  C 
Sbjct: 332 QLGLS--PSVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGM--KQYRSAPARSILDTC- 386

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDG 333
                F      +    T+AL F+ G
Sbjct: 387 -----FDFAGQTQISIPTVALVFSGG 407


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 76/152 (50%), Gaps = 16/152 (10%)

Query: 51  YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVP---CE 104
           Y +   IG P RP    L++DTGSD+ W QC  PC  C   P P +  S +D V    C 
Sbjct: 92  YLIHFGIGTP-RPQQVALEVDTGSDVVWTQCR-PCFDCFTQPLPRFDTSASDTVHGVLCT 149

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
           DPIC +L     H C     C Y++ Y D   ++G L KD+F F+   G ++  P L  G
Sbjct: 150 DPICRALRP---HACFL-GGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFG 205

Query: 164 CG-YNQVPGASYHPLDGILGLGKGKSSIVSQL 194
           CG YN   G  +    GI G G+G  S+  QL
Sbjct: 206 CGQYNT--GNFHSNETGIAGFGRGPLSLPRQL 235


>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 417

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 88/356 (24%), Positives = 146/356 (41%), Gaps = 72/356 (20%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE---APHP------LYRP----SND 99
            T+ +G P   + + LDTGSDL W+ CD  C RC     +P+       +Y P    ++ 
Sbjct: 6   TTVQLGTPGTKFMVALDTGSDLFWVPCD--CSRCAPTEGSPYASDFELSVYSPKKSSTSK 63

Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADG-GSSLGVLVKDAFAFNYTN--GQR 155
            VPC + +CA         C +    C Y + Y     S+ G+L++D       N   + 
Sbjct: 64  TVPCNNSLCAQ-----RDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHSEP 118

Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +   +  GCG  QV   S+  +   +G+ GLG  + S+ S L  + L+ N    C S  G
Sbjct: 119 IQAYITFGCG--QVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDG 176

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKNLPVVF 264
            G + FGD           S+  + T +        Y+  V  +   G T    ++  +F
Sbjct: 177 VGRINFGDK---------GSLEQEETPFNLNQLHPNYNITVTSIRV-GTTLIDADITALF 226

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG+S++Y     Y  L++    +         P           R PF+  +++     
Sbjct: 227 DSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNP-----------RIPFEYCYNMSP--- 272

Query: 325 TLALSFTDGKTRTLFELTP----EAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
               S T G + T+    P    +  ++IS +  +  CL ++  AE     LN+IG
Sbjct: 273 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNELIYCLAVVKSAE-----LNIIG 323


>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 880

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 112/266 (42%), Gaps = 42/266 (15%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPS----NDLVP 102
           IG P   + + LD GSD+ W+ CD  C+ C          ++     YRPS    +  +P
Sbjct: 111 IGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLP 168

Query: 103 CEDPICASLHAPGHHNCE---DPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNGQR--- 155
           C   +C       H  C+   DP  C Y ++Y+    SS G + +D      +NG+    
Sbjct: 169 CGHKLCDV-----HSVCKGSKDP--CPYAVQYSSANTSSSGYVFEDKLHLT-SNGKHAEQ 220

Query: 156 --LNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
             +   + LGCG  Q    + GA     DG+LGLG G  S+ S L    LI+N    C  
Sbjct: 221 NSVQASIILGCGRKQTGEYLRGAGP---DGVLGLGPGNISVPSLLAKAGLIQNSFSICFE 277

Query: 210 GGGGGFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
               G + FGD  + +     +  +   +  Y   GV     G           + DSGS
Sbjct: 278 ENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIV-GVESFCVGSLCLKETRFQALIDSGS 336

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSL 294
           S+T+L    YQ +     K+++A S+
Sbjct: 337 SFTFLPNEVYQKVVIEFDKQVNATSI 362


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 70/255 (27%), Positives = 113/255 (44%), Gaps = 32/255 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +T+ +G P + + + +DTGSDL W+QC  PC  C + P P + PS         C 
Sbjct: 37  GEYLMTLTLGSPPQSFDVIVDTGSDLNWVQC-LPCRVCYQQPGPKFDPSKSRSFRKAACT 95

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
           D +C ++ A     C     C Y+  Y D  ++ G L  +  + N   G +  P  A GC
Sbjct: 96  DNLC-NVSALPLKACAANV-CQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGC 153

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LSGGGGGFLFFGDD 221
           G   +   ++    G++GLG+G  S+ SQL       N   +C   L+      L FG  
Sbjct: 154 GTQNL--GTFAGAAGLVGLGQGPLSLNSQLS--HTFANKFSYCLVSLNSLSASPLTFG-S 208

Query: 222 LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------------DS 266
           +  ++ + +TS+  ++ +  YY   +  +  GG+   L   P VF             DS
Sbjct: 209 IAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLA--PSVFAIDQSTGRGGTIIDS 266

Query: 267 GSSYTYLNRVTYQTL 281
           G++ T L    Y  +
Sbjct: 267 GTTITMLTLPAYSAV 281


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 86/325 (26%), Positives = 128/325 (39%), Gaps = 36/325 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y   + +G PA  Y + +DTGS LTWLQC    V C     P++ P        V C 
Sbjct: 129 GNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCS 188

Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L A       C     C Y+  Y D   S+G L KD  +F    G    P    
Sbjct: 189 SSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSF----GSGSFPGFYY 244

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGD 220
           GCG +      +    G++GL K K S++ QL     +     +CL  S    G+L  G 
Sbjct: 245 GCGQDNE--GLFGRSAGLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIGS 300

Query: 221 DLYDSSRVVWTSMSS---DYTKYYSP----GVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
             Y+  +  +T M+S   D + Y+       VA        +  ++LP + DSG+  T L
Sbjct: 301 --YNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRL 358

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y  L+  +   +++ + +       L  C++G      V  V        ++F  G
Sbjct: 359 PPNVYTALSRAVAAAMASAAPRAP-TYSILDTCFRGSAAGLRVPRVD-------MAFAGG 410

Query: 334 KTRTLFELTPEAYLIISNKGNVCLG 358
            T     L+P   LI  +    CL 
Sbjct: 411 AT---LALSPGNVLIDVDDSTTCLA 432


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 101/343 (29%), Positives = 148/343 (43%), Gaps = 44/343 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPSND----LVPC 103
           + V + +G PA+P  L  DTGSDL+W+QC  PC     C     PL+ PS       V C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            +P CA   A G    ED   C Y + Y DG S+ GVL +D  A   +      P    G
Sbjct: 208 GEPQCA---AAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFP---FG 261

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG-D 220
           CG   +    +  +DG+LGLG+G+ S+ SQ  +      V  +CL  S    G+L  G  
Sbjct: 262 CGTRNL--GDFGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLTIGAT 317

Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYT 271
              D+    +T+M     +  +Y   +  +  GG    L   P VF       DSG+  T
Sbjct: 318 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYI--LPVPPAVFTRGGTLLDSGTVLT 375

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
           YL    Y+ L    +  L+ +    AP ++ L  C+     F    +V      ++  F 
Sbjct: 376 YLPAQAYELLRD--RFRLTMERYTPAPPNDVLDACYD----FAGESEV--IVPAVSFRFG 427

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           DG    +FEL     +I  ++   CL      + G   L++IG
Sbjct: 428 DGA---VFELDFFGVMIFLDENVGCLA-FAAMDAGGLPLSIIG 466


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 160/376 (42%), Gaps = 75/376 (19%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G P  P  + LDTGSD+ WLQC APC RC +    ++ P    
Sbjct: 137 VSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASH 195

Query: 97  SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           S   V C  P+C  L + G   C+     C Y++ Y DG  + G    +   F   +G R
Sbjct: 196 SYGAVDCAAPLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATETLTF--ASGAR 250

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------- 208
           + PR+ALGCG++      +    G+LGLG+G  S  SQ+ S++  R+   +CL       
Sbjct: 251 V-PRVALGCGHDNE--GLFVAAAGLLGLGRGSLSFPSQI-SRRFGRS-FSYCLVDRTSSS 305

Query: 209 --SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFF----GGETTGLKNLP- 261
             +      + FG      S  V  S ++ +T        E F+     G + G   +P 
Sbjct: 306 ASATSRSSTVTFG------SGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPG 359

Query: 262 ----------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
                           V+ DSG+S T L R  Y  L    +   +A  L+ +P   +L  
Sbjct: 360 VAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRA--AAAGLRLSPGGFSL-- 415

Query: 306 CWKGRRPFKNVHDVK--KCFR--TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
                  F   +D+   K  +  T+++ F  G       L PE YLI + ++G  C    
Sbjct: 416 -------FDTCYDLSGLKVVKVPTVSMHFAGGAEAA---LPPENYLIPVDSRGTFCFA-F 464

Query: 361 NGAEVGLQDLNVIGGI 376
            G + G   +++IG I
Sbjct: 465 AGTDGG---VSIIGNI 477


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 121/282 (42%), Gaps = 30/282 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCE 104
           T  Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C 
Sbjct: 79  TSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCG 136

Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             +C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    G
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFG 193

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL- 222
           C  +      +  +DG+LG+G G  S++ Q   +    +   +CL        FF     
Sbjct: 194 CNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTG 250

Query: 223 YDSSRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGS 268
           Y S   V T     YTK  +     ELFF         GE  GL         VVFDSGS
Sbjct: 251 YFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 310

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
             +Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 349


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/157 (35%), Positives = 79/157 (50%), Gaps = 12/157 (7%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE---AP--HPLYRPSNDLVP 102
           T  Y + + +G P RP  L LDTGSDL W QC APC+ C E   AP   P    ++  +P
Sbjct: 87  TNEYLMHVSVGTPPRPVALTLDTGSDLVWTQC-APCLDCFEQGAAPVLDPAASSTHAALP 145

Query: 103 CEDPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNP 158
           C+ P+C +L   + G  +  D + C Y   Y D   ++G L  D+F F  +   G     
Sbjct: 146 CDAPLCRALPFTSCGGRSWGDRS-CVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAAR 204

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
           R+  GCG+    G       GI G G+G+ S+ SQL+
Sbjct: 205 RVTFGCGHIN-KGIFQANETGIAGFGRGRWSLPSQLN 240


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 69/153 (45%), Gaps = 10/153 (6%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
           Y + + IG+P  P+    DTGSDLTW QC  PC  C     P+Y PS       +PC   
Sbjct: 71  YLMELAIGKPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPLPCSSA 129

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
            C  + +    NC   + C Y   Y DG  S G+L  +      ++       +A GCG 
Sbjct: 130 TCLPIWS---RNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGT 186

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           +   G       G +GLG+G  S+++QL   K 
Sbjct: 187 DN--GGDSLNSTGTVGLGRGTLSLLAQLGVGKF 217


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 144/339 (42%), Gaps = 63/339 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + +  G P +   + +DTGSDL W QC  PC  C  A   ++ P    + D V C 
Sbjct: 78  GEYLIDISFGSPPQKASVIVDTGSDLIWTQC-LPCETCNAAASVIFDPVKSSTYDTVSCA 136

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C+SL      +C     C Y+  Y DG S+ G L  +      T G    P +A GC
Sbjct: 137 SNFCSSLP---FQSCT--TSCKYDYMYGDGSSTSGALSTET----VTVGTGTIPNVAFGC 187

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFGDD 221
           G+  +   S+    GI+GLG+G  S++SQ  S  +      +CL   G      +  GD 
Sbjct: 188 GHTNL--GSFAGAAGIVGLGQGPLSLISQASS--ITSKKFSYCLVPLGSTKTSPMLIGDS 243

Query: 222 LYDSSRVVWTSM---SSDYTKYYS-------PGVAELF----FGGETTGLKNLPVVFDSG 267
              +  V +T++   +++ T YY+        G A  +    F  + +G      + DSG
Sbjct: 244 A-AAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGF--ILDSG 300

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++ TYL    +  L + +K E+        PE +             +++ +  CF T  
Sbjct: 301 TTLTYLETGAFNALVAALKAEV------PFPEAD------------GSLYGLDYCFSTAG 342

Query: 328 LSFTDGKTRTL------FELTPE-AYLIISNKGNVCLGI 359
           ++     T T       +EL PE  ++ +   G++CL +
Sbjct: 343 VANPTYPTMTFHFKGADYELPPENVFVALDTGGSICLAM 381


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 64/180 (35%), Positives = 84/180 (46%), Gaps = 15/180 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V +  G PAR Y + +DTGS L+WLQC    V C     PL+ PS       + C
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 174

Query: 104 EDPICASLHAPGHHN--CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
               C+SL     +N  CE  +  C Y   Y D   S+G L +D         Q L P  
Sbjct: 175 TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL--APSQTL-PGF 231

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFG 219
             GCG  Q     +    GILGLG+ K S++ Q+ S+        +CL + GGGGFL  G
Sbjct: 232 VYGCG--QDSDGLFGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLSIG 287


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 112/282 (39%), Gaps = 37/282 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G P R  ++ LDTGSD+ WLQC +PC +C     P++ P        +PC
Sbjct: 107 SGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQC-SPCRKCYSQSDPIFNPYKSKSFAGIPC 165

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+C  L + G         C Y++ Y DG  + G    +   F    G ++  ++ALG
Sbjct: 166 SSPLCRRLDSSGCSTRRH--TCLYQVSYGDGSFTTGDFATETLTF---RGNKI-AKVALG 219

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR--NVVGHCL----SGGGGGFLF 217
           CG++         L        G         SQ  IR  +   +CL    +      + 
Sbjct: 220 CGHHN------EGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMV 273

Query: 218 FGDDLYDS-SRVVWTSMSSDYTKYYSPGVAELFFGG-ETTGLK----------NLPVVFD 265
           FGD      +R      +     +Y  G+  +  GG    G+           N  V+ D
Sbjct: 274 FGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIID 333

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           SG+S T L R  Y  L    +  + A+ LK  PE      C+
Sbjct: 334 SGTSVTRLTRPAYTALRDAFR--VGARHLKRGPEFSLFDTCY 373


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 93/362 (25%), Positives = 153/362 (42%), Gaps = 62/362 (17%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           V G    +G Y   + +G PA+  +L LDTGSD+ W+QC+ PC  C +   P++ P++  
Sbjct: 152 VSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCSDCYQQSDPVFNPTSSS 210

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C  P C+ L      +     +C Y++ Y DG  ++G L  D   F   N  ++
Sbjct: 211 TYKSLTCSAPQCSLLETSACRS----NKCLYQVSYGDGSFTVGELATDTVTFG--NSGKI 264

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGG 212
           N  +ALGCG++      +    G+LGLG G  SI +Q+ +         +CL    SG  
Sbjct: 265 ND-VALGCGHDN--EGLFTGAAGLLGLGGGALSITNQMKATSF-----SYCLVDRDSGKS 316

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
               F    L           +     +Y  G++    GG+   + +            V
Sbjct: 317 SSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGV 376

Query: 263 VFDSGSSYTYLNRVTYQT-------LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
           + D G++ T L    Y +       LT+ +KK  S+ SL +         C+     F +
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDT--------CYD----FSS 424

Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIG 374
           +  VK    T+A  FT GK+    +L  + YLI + + G  C      +      L++IG
Sbjct: 425 LSSVK--VPTVAFHFTGGKS---LDLPAKNYLIPVDDNGTFCFAFAPTSS----SLSIIG 475

Query: 375 GI 376
            +
Sbjct: 476 NV 477


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 143/358 (39%), Gaps = 51/358 (14%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRC 86
           VG  + F V G   P   G Y   + +G P R + + +DTGSD+ W+ C +    P    
Sbjct: 64  VGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE 123

Query: 87  VEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
           ++     + P    S  LV C D  C S +      C     C Y  +Y DG  + G  +
Sbjct: 124 LQIQLSFFDPGVSSSASLVSCSDRRCYS-NFQTESGCSPNNLCSYSFKYGDGSGTSGFYI 182

Query: 143 KDAFAFNYTNGQRL----NPRLALGCGYNQVPGASYHP---LDGILGLGKGKSSIVSQLH 195
            D  +F+      L    +     GC  N   G    P   +DGI GLG+G  S++SQL 
Sbjct: 183 SDFMSFDTVITSTLAINSSAPFVFGCS-NLQTGDLQRPRRAVDGIFGLGQGSLSVISQLA 241

Query: 196 SQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
            Q L   V  HCL G   GGG +  G         V+T +      +Y+  +  +   G+
Sbjct: 242 VQGLAPRVFSHCLKGDKSGGGIMVLGQ--IKRPDTVYTPLVPS-QPHYNVNLQSIAVNGQ 298

Query: 254 TTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
              +   P VF          D+G++  YL    Y             +++  A      
Sbjct: 299 ILPID--PSVFTIATGDGTIIDTGTTLAYLPDEAYSPFI---------QAIANAVSQYGR 347

Query: 304 PLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL-IISNKGNV--CLG 358
           P+ ++  + F+        F  ++LSF  G +     L P AYL I S+ G+   C+G
Sbjct: 348 PITYESYQCFEITAGDVDVFPEVSLSFAGGASMV---LRPHAYLQIFSSSGSSIWCIG 402


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           V G    +G Y   + IG+P  P ++ LDTGSD++W+QC APC  C E   P++ P++  
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPIFEPTSSA 199

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + CE   C SL      N      C YE+ Y DG  ++G  V +      T+    
Sbjct: 200 SFTSLSCETEQCKSLDVSECRN----GTCLYEVSYGDGSYTVGDFVTETVTLGSTSLG-- 253

Query: 157 NPRLALGCGYN 167
              +A+GCG+N
Sbjct: 254 --NIAIGCGHN 262


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 114/268 (42%), Gaps = 28/268 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV----PCE 104
           G Y +++ +G P        DTGSDL W QC  PC RC +   PL+ P +        C+
Sbjct: 93  GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCK-PCERCYKQVDPLFDPKSSKTYRDFSCD 151

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C+ L       C     C Y+  Y D   ++G +  D    + T G  ++ P+  +G
Sbjct: 152 ARQCSLLD---QSTCSGNI-CQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIG 207

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
           CG+    G       GI+GLG G  S++SQ+ S   +     +CL       G    L F
Sbjct: 208 CGHEN-DGTFSDKGSGIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNF 264

Query: 219 GDDLYDSSRVVWT-------SMSSDY---TKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
           G +   S   V +       +MSS Y    +  S G   + FG  + G     ++ DSG+
Sbjct: 265 GSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSGT 324

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           + T +    +  L++ +  ++  +  ++
Sbjct: 325 TLTIVPDDFFSNLSTAVGNQVEGRRAED 352


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 112/265 (42%), Gaps = 21/265 (7%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-VEAPHPLYRPSNDLVPCEDPICA 109
           +   ++ G P +  FL +DTGS LTW QC  PC  C  +  +P YRP+  +    D +C 
Sbjct: 58  FMAEIHFGSPQKKQFLHMDTGSSLTWTQC-FPCSDCYAQKIYPKYRPAASIT-YRDAMCE 115

Query: 110 SLHAPGH-HNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNG--QRLNPRLALGC 164
             H   + H   DP    C Y+  Y D  +  G L ++    +  +G  +R++  +  GC
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVH-GVYFGC 174

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
             N +   SY    GILGLG GK SI+ +  S+      +G          L  GD    
Sbjct: 175 --NTLSDGSYFTGTGILGLGVGKYSIIGEFGSK--FSFCLGEISEPKASHNLILGDGANV 230

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
                  +++  +T +    +  +  G E T    + V  D+GS+ ++L+   Y      
Sbjct: 231 QGHPTVINITEGHTIF---QLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKFVDA 287

Query: 285 MKKELSAKSLKEAPEDETLPLCWKG 309
               + ++ L   P      LC+K 
Sbjct: 288 FDDLIGSRPLSYEPT-----LCYKA 307


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 98/354 (27%), Positives = 151/354 (42%), Gaps = 51/354 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y + + +G P R  +L +DTGSD+ WLQC APCV C      ++ P    +   + C
Sbjct: 34  SGEYFIRVSVGTPPRGMYLVMDTGSDILWLQC-APCVSCYHQCDEVFDPYKSSTYSTLGC 92

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--GQRLNPRLA 161
               C +L   G        +C Y+++Y DG  S G    DA + N T+  GQ +  ++ 
Sbjct: 93  NSRQCLNLDVGGCVG----NKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIP 148

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFL 216
           LGCG++      +    G+LGLGKG  S  +Q++S+   R    +CL+G          L
Sbjct: 149 LGCGHDNE--GYFVGAAGLLGLGKGPLSFPNQINSENGGR--FSYCLTGRDTDSTERSSL 204

Query: 217 FFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG----------ETTGLKNLPVVF 264
            FGD     + V +T  +S+   + +Y   +  +  GG          +   L N  V+ 
Sbjct: 205 IFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVII 264

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-F 323
           DSG+S T L    Y +L    +   S   L    E      C+       N+ D+     
Sbjct: 265 DSGTSVTRLQNAAYASLREAFRAGTS--DLVLTTEFSLFDTCY-------NLSDLSSVDV 315

Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            T+ L F  G      +L    YL+ + N    CL     A  G    ++IG I
Sbjct: 316 PTVTLHFQGGAD---LKLPASNYLVPVDNSSTFCL-----AFAGTTGPSIIGNI 361


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 74/275 (26%), Positives = 118/275 (42%), Gaps = 38/275 (13%)

Query: 14  TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           T+ +    S++SSS +FN   + L       V+ T  Y + + IG P       LDTGS+
Sbjct: 31  TIDLIHRRSNASSSRVFN---TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSE 87

Query: 74  LTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYAD 133
             W QC  PCV C     P++ PS      E      +    H +      C YEL Y  
Sbjct: 88  HIWTQC-LPCVHCYNQTAPIFDPSKSSTFKE------IRCDTHDH-----SCPYELVYGG 135

Query: 134 GGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYHP-LDGILGLGKGKSSIV 191
              + G LV +    + T+GQ  + P   +GCG N    + + P   G++GL +G  S++
Sbjct: 136 KSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN---SGFKPGFAGVVGLDRGPKSLI 192

Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
           +Q+  +     ++ +C +G G   + FG + +     VV T++   + K   PG   L  
Sbjct: 193 TQMGGEY--PGLMSYCFAGKGTSKINFGANAIVAGDGVVSTTV---FVKTAKPGFYYLNL 247

Query: 251 GGETTGLKNLP------------VVFDSGSSYTYL 273
              + G   +             +V DSGS+ TY 
Sbjct: 248 DAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF 282


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 132/327 (40%), Gaps = 39/327 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y   + +G P+  Y + +DTGS LTWLQC    V C     PL+ P    +   V C 
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCS 191

Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L A       C     C Y+  Y D   S+G L  D  +F  T+     P    
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS----YPSFYY 247

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDD 221
           GCG +      +    G++GL + K S++ QL     +     +CL +    G+L  G  
Sbjct: 248 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP- 302

Query: 222 LYDS----SRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTY 272
            Y++    S     S S D + Y+   ++ +  GG    +      +LP + DSG+  T 
Sbjct: 303 -YNTGHYYSYTPMASSSLDASLYFI-TLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITR 360

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    +  L+  + + ++    + AP    L  C++G+     V        T+ ++F  
Sbjct: 361 LPTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFEGQASQLRV-------PTVVMAFAG 411

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
           G +    +LT    LI  +    CL  
Sbjct: 412 GAS---MKLTTRNVLIDVDDSTTCLAF 435


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 55/154 (35%), Positives = 71/154 (46%), Gaps = 11/154 (7%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDP 106
           Y + + IG P  P+    DTGSDLTW QC  PC  C     P+Y      S   VPC   
Sbjct: 93  YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPIYDTAVSSSFSPVPCASA 151

Query: 107 ICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
            C  + +    NC    + C Y   Y DG  S GVL  +   F    G  +   +A GCG
Sbjct: 152 TCLPIWS--SRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVG-GIAFGCG 208

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
            +   G SY+   G +GLG+G  S+V+QL   K 
Sbjct: 209 VDN-GGLSYNS-TGTVGLGRGSLSLVAQLGVGKF 240


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 67/132 (50%), Gaps = 12/132 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G P+ P  + LDTGSD+ WLQC APC RC +   P++ P    
Sbjct: 130 VSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQC-APCRRCYDQSGPVFDPRRSS 188

Query: 97  SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           S   V C  P+C  L + G   C+     C Y++ Y DG  + G    +   F    G R
Sbjct: 189 SYGAVDCAAPLCRRLDSGG---CDLRRRACLYQVAYGDGSVTAGDFATETLTF--AGGAR 243

Query: 156 LNPRLALGCGYN 167
           +  R+ALGCG++
Sbjct: 244 VA-RVALGCGHD 254


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 101/343 (29%), Positives = 147/343 (42%), Gaps = 44/343 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPSND----LVPC 103
           + V + +G PA+P  L  DTGSDL+W+QC  PC     C     PL+ PS       V C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            +P CA   A G    ED   C Y + Y DG S+ GVL +D  A   +      P    G
Sbjct: 203 GEPQCA---AAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFP---FG 256

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG-D 220
           CG   +    +  +DG+LGLG+G+ S+ SQ  +      V  +CL  S    G+L  G  
Sbjct: 257 CGTRNL--GDFGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLTIGAT 312

Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYT 271
              D+    +T+M     +  +Y   +  +  GG    L   P VF       DSG+  T
Sbjct: 313 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYV--LPVPPAVFTRGGTLLDSGTVLT 370

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
           YL    Y  L    +  L+ +    AP ++ L  C+     F    +V      ++  F 
Sbjct: 371 YLPAQAYALLRD--RFRLTMERYTPAPPNDVLDACYD----FAGESEV--VVPAVSFRFG 422

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
           DG    +FEL     +I  ++   CL      + G   L++IG
Sbjct: 423 DGA---VFELDFFGVMIFLDENVGCLA-FAAMDTGGLPLSIIG 461


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/275 (26%), Positives = 118/275 (42%), Gaps = 38/275 (13%)

Query: 14  TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           T+ +    S++SSS +FN   + L       V+ T  Y + + IG P       LDTGS+
Sbjct: 25  TIDLIHRRSNASSSRVFN---TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSE 81

Query: 74  LTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYAD 133
             W QC  PCV C     P++ PS      +      +    H +      C YEL Y  
Sbjct: 82  HIWTQC-LPCVHCYNQTAPIFDPS------KSSTFKEIRCDTHDH-----SCPYELVYGG 129

Query: 134 GGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYHP-LDGILGLGKGKSSIV 191
              + G LV +    + T+GQ  + P   +GCG N    + + P   G++GL +G  S++
Sbjct: 130 KSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN---SGFKPGFAGVVGLDRGPKSLI 186

Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
           +Q+  +     ++ +C +G G   + FG + +     VV T++   + K   PG   L  
Sbjct: 187 TQMGGEY--PGLMSYCFAGKGTSKINFGANAIVAGDGVVSTTV---FVKTAKPGFYYLNL 241

Query: 251 GGETTGLKNLP------------VVFDSGSSYTYL 273
              + G   +             +V DSGS+ TY 
Sbjct: 242 DAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF 276


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 149/351 (42%), Gaps = 59/351 (16%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + ++IG P + Y L LDTGSDL W+QC  PC  C E   P Y P        + C
Sbjct: 87  SGEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCHDCFEQNGPYYDPKESSSFRNIGC 145

Query: 104 EDPICASLHAPGHH-NCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTN--GQRLNPR 159
            DP C  + +P     C+   Q C Y   Y D  ++ G    + F  N T+  G+    R
Sbjct: 146 HDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKR 205

Query: 160 LA---LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
           +     GCG +N+     +H   G+LGLG+G  S  SQL  Q L  +   +CL       
Sbjct: 206 VENVMFGCGHWNR---GLFHGASGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDT 260

Query: 216 -----LFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
                L FG+  DL +   + +T++     +    +Y   +  +  GGE   + N+P   
Sbjct: 261 NVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGE---VLNIPEST 317

Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
                      + DSG++ +Y     YQ +     K+   K +K  P  +  P+      
Sbjct: 318 WNMTSDGVGGTIVDSGTTLSYFTEPAYQII-----KDAFVKKVKGYPIVQDFPIL----D 368

Query: 312 PFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
           P  NV  V+K       + F DG    ++    E Y I +  +  VCL IL
Sbjct: 369 PCYNVSGVEKIDLPDFGILFADG---AVWNFPVENYFIRLDPEEVVCLAIL 416


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 138/346 (39%), Gaps = 48/346 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           T  Y +T+  G P +   +  DTGS++ W+QC    V C     PL+ P+       + C
Sbjct: 13  TANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISC 72

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
               C  L + G   C   + C Y + Y DG S++G L  + F      G   N     G
Sbjct: 73  TSAACTGLSSRG---CSG-STCVYGVTYGDGSSTVGFLATETFTL--AAGNVFN-NFIFG 125

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
           CG N      +    G++GLG+   S+ SQL +   + N+  +CL  +    G+L  G+ 
Sbjct: 126 CGQNN--QGLFTGAAGLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNIGNP 181

Query: 222 LYDSSRVVWTSMSSDYTKYY------SPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
           L         + S   T Y+      S G   L     +T  +++  + DSG+  T L  
Sbjct: 182 LRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLAL--SSTVFQSVGTIIDSGTVITRLPP 239

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
             Y  L +  +  ++  +   A     L  C+   R           F T+ L +T    
Sbjct: 240 TAYGALRTAFRAAMTQYT--RAAAASILDTCYDFSR------TTTVTFPTIKLHYTG--- 288

Query: 336 RTLFELTPEA---YLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
             L    P A   Y+I S++  VCL     A  G  D   IG IG+
Sbjct: 289 --LDVTIPGAGVFYVISSSQ--VCL-----AFAGNSDSTQIGIIGN 325


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 137/327 (41%), Gaps = 38/327 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDP 106
           Y VT+ IG   R   + +DTGSDLTW+QC  PC  C     PL+ PS       + C   
Sbjct: 67  YIVTVEIG--GRNMTVIVDTGSDLTWVQCQ-PCRLCYNQQDPLFNPSGSPSYQTILCNSS 123

Query: 107 ICASL-HAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            C SL +A G+      +   C+Y + Y DG  + G L  +      T+          G
Sbjct: 124 TCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVS----NFIFG 179

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
           CG N      +    G++GLGK   S+VSQ  +  +   V  +CL   +    G L  G 
Sbjct: 180 CGRNN--KGLFGGASGLMGLGKSDLSLVSQ--TSAIFEGVFSYCLPTTAADASGSLILGG 235

Query: 221 D---LYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTY 272
           +     +++ + +T M ++     +Y   +  +  GG   +    +   ++ DSG+  T 
Sbjct: 236 NSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITR 295

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
           L    Y+ L +   K+ S      AP    L  C+       N +D +    T+ + F +
Sbjct: 296 LPPPVYRDLKAEFLKQFSG--FPSAPPFSILDTCFN-----LNGYD-EVDIPTIRMQF-E 346

Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
           G      ++T   Y + ++   VCL +
Sbjct: 347 GNAELTVDVTGIFYFVKTDASQVCLAL 373


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 128/329 (38%), Gaps = 42/329 (12%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCE 104
           V+    Y + + +G P      ++DTGSDL W QC  PC  C     P++ PS      E
Sbjct: 55  VFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQC-MPCPNCYTQFAPIFDPSKSSTFKE 113

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
                      H N      C YE+ YAD   S G+L  +      T+G+  +    ++G
Sbjct: 114 KRC--------HGN-----SCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIG 160

Query: 164 CGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
           CG N      PG +     GI+GL  G SS++SQ+     I  ++ +C S  G   + FG
Sbjct: 161 CGLNNSNLMTPGYAASS-SGIVGLNMGPSSLISQMDLP--IPGLISYCFSSQGTSKINFG 217

Query: 220 DDLY---DSSRVVWTSMSSDYTKYY------SPGVAELFFGGETTGLKNLPVVFDSGSSY 270
            +     D +      +  D   YY      S G   +   G     ++  +  DSG++Y
Sbjct: 218 TNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTY 277

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           TYL       +   +   + A +    P  E L LC+          D  + F  + L F
Sbjct: 278 TYLPTSYCNLVREAVAASVVAANQVPDPSSENL-LCYN--------WDTMEIFPVITLHF 328

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGI 359
             G    L +     Y+     G  CL I
Sbjct: 329 AGGADLVLDKY--NMYVETITGGTFCLAI 355


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 148/357 (41%), Gaps = 71/357 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + ++IG P R + L LDTGSDL W+QC  PC  C     P Y P        + C
Sbjct: 189 SGEYFMDVFIGTPPRHFSLILDTGSDLNWIQC-VPCYDCFVQNGPYYDPKESSSFKNIGC 247

Query: 104 EDPICASLHAPGHHNCEDPAQ--------CDYELEYADGGSSLGVLVKDAFAFNYTN--G 153
            DP C  + +P      DP Q        C Y   Y D  ++ G    + F  N T+  G
Sbjct: 248 HDPRCHLVSSP------DPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAG 301

Query: 154 QRLNPRLA---LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
           +    R+     GCG +N+     +H   G+LGLG+G  S  SQL  Q L  +   +CL 
Sbjct: 302 KSEFKRVENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLV 356

Query: 210 GGG-----GGFLFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLK 258
                      L FG+  DL +   V +TS+     +    +Y   +  +  GGE   + 
Sbjct: 357 DRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIP 416

Query: 259 NLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
                         + DSG++ +Y    +Y+ +     K+   K +K  P  +  P+   
Sbjct: 417 EETWHLSPEGAGGTIVDSGTTLSYFAEPSYEII-----KDAFVKKVKGYPVIKDFPIL-- 469

Query: 309 GRRPFKNVHDVKKC----FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
              P  NV  V+K     FR L   F DG    ++    E Y I +  +  VCL IL
Sbjct: 470 --DPCYNVSGVEKMELPEFRIL---FEDG---AVWNFPVENYFIKLEPEEIVCLAIL 518


>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
          Length = 290

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 86/202 (42%), Gaps = 21/202 (10%)

Query: 39  FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
           F V G   P+  G Y   + +G P R  ++ +DTGSD+ W+ C + C  C +        
Sbjct: 63  FPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQL 121

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD-- 144
               P    ++ L+ C D  C S       +C     QC Y  +Y DG  + G  V D  
Sbjct: 122 NYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLM 181

Query: 145 --AFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
             A  F  T     +  +  GC   Q      S   +DGI G G+   S++SQL SQ + 
Sbjct: 182 HFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIA 241

Query: 201 RNVVGHCLSG--GGGGFLFFGD 220
             V  HCL G   GGG L  G+
Sbjct: 242 PRVFSHCLKGDNSGGGVLVLGE 263


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/342 (24%), Positives = 143/342 (41%), Gaps = 43/342 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + M IG P     +  DTGSDLTW+QC  PC  C     PL+ PS       + C 
Sbjct: 92  GEYFMKMSIGTPLVEVIVIADTGSDLTWVQC-LPCDPCYRQKSPLFDPSRSSSYRHMLCG 150

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ--RLNPRLAL 162
              C +L         D   C+Y   Y D   + G L  + F    T+ +   L+P +  
Sbjct: 151 SRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSP-IVF 209

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCL------SGGGGGF 215
           GCG     G ++  L   +    G + S+VSQL S  +I+    +CL      S      
Sbjct: 210 GCGTGN--GGTFDELGSGIVGLGGGALSLVSQLSS--IIKGKFSYCLVPLSEQSNVTSKI 265

Query: 216 LFFGDDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGE----TTGLKN-----LPVVFD 265
            F  D +    +VV T + S     YY   +  +  G +    T GL N       V+ D
Sbjct: 266 KFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIID 325

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SG++ T+L+   +  L  ++++ + A+ + +        +C      F++  D+      
Sbjct: 326 SGTTLTFLDSEFFTELERVLEETVKAERVSDP--RGLFSVC------FRSAGDID--LPV 375

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
           +A+ F D   +    L P    + +++  +C  +++  ++G+
Sbjct: 376 IAVHFNDADVK----LQPLNTFVKADEDLLCFTMISSNQIGI 413


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 143/358 (39%), Gaps = 51/358 (14%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRC 86
           VG  + F V G   P   G Y   + +G P R + + +DTGSD+ W+ C +    P    
Sbjct: 64  VGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE 123

Query: 87  VEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
           ++     + P    S  LV C D  C S +      C     C Y  +Y DG  + G  +
Sbjct: 124 LQIQLSFFDPGVSSSASLVSCSDRRCYS-NFQTESGCSPNNLCSYSFKYGDGSGTSGYYI 182

Query: 143 KDAFAFNYTNGQRL----NPRLALGCGYNQVPGASYHP---LDGILGLGKGKSSIVSQLH 195
            D  +F+      L    +     GC  N   G    P   +DGI GLG+G  S++SQL 
Sbjct: 183 SDFMSFDTVITSTLAINSSAPFVFGCS-NLQSGDLQRPRRAVDGIFGLGQGSLSVISQLA 241

Query: 196 SQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
            Q L   V  HCL G   GGG +  G         V+T +      +Y+  +  +   G+
Sbjct: 242 VQGLAPRVFSHCLKGDKSGGGIMVLGQ--IKRPDTVYTPLVPS-QPHYNVNLQSIAVNGQ 298

Query: 254 TTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
              +   P VF          D+G++  YL    Y             +++  A      
Sbjct: 299 ILPID--PSVFTIATGDGTIIDTGTTLAYLPDEAYSPFI---------QAVANAVSQYGR 347

Query: 304 PLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL-IISNKGNV--CLG 358
           P+ ++  + F+        F  ++LSF  G +     L P AYL I S+ G+   C+G
Sbjct: 348 PITYESYQCFEITAGDVDVFPQVSLSFAGGASMV---LGPRAYLQIFSSSGSSIWCIG 402


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  TW+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTTWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLKRG---AAEEESERNCYDMR 268


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 74/156 (47%), Gaps = 12/156 (7%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
           Y + + IG P  P+    DTGSDLTW QC  PC  C     P+Y PS       VPC   
Sbjct: 66  YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 124

Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLNP-RLALG 163
            C  L      NC +P+  C Y   Y+DG  S+G+L  +     +   GQ ++   +A G
Sbjct: 125 TC--LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFG 182

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           CG +   G       G +GLG+G  S+++QL   K 
Sbjct: 183 CGTDN--GGDSLNSTGTVGLGRGTLSLLAQLGVGKF 216


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 93/352 (26%), Positives = 142/352 (40%), Gaps = 53/352 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
           Y VT+ +G   R   + +DTGSDL+W+QC  PC RC     P++ PS       V C  P
Sbjct: 135 YIVTVELG--GRKMTVIVDTGSDLSWVQCQ-PCKRCYNQQDPVFNPSTSPSYRTVLCSSP 191

Query: 107 ICASLH-APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            C SL  A G+      +P  C+Y + Y DG  + G L  +    +  N   +N     G
Sbjct: 192 TCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTE--HLDLGNSTAVN-NFIFG 248

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
           CG N      +    G++GLG+   S++SQ  +  +   V  +CL        G L  G 
Sbjct: 249 CGRNN--QGLFGGASGLVGLGRSSLSLISQ--TSAMFGGVFSYCLPITETEASGSLVMGG 304

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP--------VVFDSGSS 269
           +    S V   +    YT+         +F    G T G   +         ++ DSG+ 
Sbjct: 305 N----SSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTV 360

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLA 327
            T L    YQ L     K+ S      AP    L  C+   G +  + + ++K  F    
Sbjct: 361 ITRLPPSIYQALKDEFVKQFSG--FPSAPAFMILDTCFNLSGYQEVE-IPNIKMHF---- 413

Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
               +G      ++T   Y + ++   VCL I       L   N +G IG++
Sbjct: 414 ----EGNAELNVDVTGVFYFVKTDASQVCLAI-----ASLSYENEVGIIGNY 456


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/244 (27%), Positives = 101/244 (41%), Gaps = 25/244 (10%)

Query: 68  LDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSNDLV----PCEDPICASL--HAPGHHNC 119
           +DT SD+ W+QC APC +  C      LY P+  ++    PC  P C SL  +A G    
Sbjct: 178 VDTASDVPWVQC-APCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGA 236

Query: 120 EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLD 178
            +   C Y + Y DG  + G  V D    N      ++ +   GC +  + PG+  +   
Sbjct: 237 GNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS-KFQFGCSHALLRPGSFNNKTA 295

Query: 179 GILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSD 236
           G + LG+G  S+ SQ        NV  +CL  +G   GFL  G   + +SR   T M   
Sbjct: 296 GFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPM--- 352

Query: 237 YTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYLNRVTYQTLTSIMKK 287
                +P +  +   G     + LPV           DS +  T L    Y  L +  + 
Sbjct: 353 LKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRA 412

Query: 288 ELSA 291
           ++ A
Sbjct: 413 QMRA 416


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + + +G P R  ++ +D+GSD+ W+QC  PC +C     P++ P++      VPC
Sbjct: 139 SGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQ-PCTQCYHQTDPVFDPADSASFMGVPC 197

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  +   G H       C YE+ Y DG  + G L  +   F    G+ +   +A+G
Sbjct: 198 SSSVCERIENAGCH----AGGCRYEVMYGDGSYTKGTLALETLTF----GRTVVRNVAIG 249

Query: 164 CGYN 167
           CG+ 
Sbjct: 250 CGHR 253


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/346 (25%), Positives = 144/346 (41%), Gaps = 39/346 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVPCE 104
           Y +T+ +G P R      DTGSDL W++C           AP   + PS       V C+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR----- 159
              C +L   G   C+D + C Y   Y DG ++ GVL  + F F+   G   +PR     
Sbjct: 161 TDACEAL---GRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFD-DGGSGRSPRQVRVG 216

Query: 160 -LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGF 215
            +  GC       A   P DG++GLG G  S+V+QL     +     +CL   S      
Sbjct: 217 GVKFGC---STATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVNASSA 273

Query: 216 LFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG-LKNLPVVFDSGSSYTY 272
           L FG   D+ +        ++ D   YY+  +  +  G +T     +  ++ DSG++ T+
Sbjct: 274 LNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRIIVDSGTTLTF 333

Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSF 330
           L+      +   + + ++   ++    D  L LC+   GR       +  +    L L F
Sbjct: 334 LDPSLLGPIVDELSRRITLPPVQS--PDGLLQLCYNVAGRE-----VEAGESIPDLTLEF 386

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             G       L PE   +   +G +CL I+   E   Q ++++G +
Sbjct: 387 GGGAA---VALKPENAFVAVQEGTLCLAIVATTE--QQPVSILGNL 427


>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLKRG---AAEEESERNCYDMR 268


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/289 (26%), Positives = 122/289 (42%), Gaps = 54/289 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G YN+ + +G P   + +  DTGSDL W QC APC +C + P P ++P++      +PC 
Sbjct: 84  GGYNMNISVGTPLLTFSVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCT 142

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C  L  P      +   C Y  +Y  G ++ G L  +        G    P +A GC
Sbjct: 143 SSFCQFL--PNSIRTCNATGCVYNYKYGSGYTA-GYLATETLKV----GDASFPSVAFGC 195

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD- 220
                 G S     GI GLG+G  S++ QL   +       +CL   S  G   + FG  
Sbjct: 196 STENGVGNS---TSGIAGLGRGALSLIPQLGVGRF-----SYCLRSGSAAGASPILFGSL 247

Query: 221 -DLYDSS--RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------------- 262
            +L D +     + +  + +  YY   +      G T G  +LPV               
Sbjct: 248 ANLTDGNVQSTPFVNNPAVHPSYYYVNLT-----GITVGETDLPVTTSTFGFTQNGLGGG 302

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET--LPLCWK 308
            + DSG++ TYL +  Y+    ++K+   +++      + T  L LC+K
Sbjct: 303 TIVDSGTTLTYLAKDGYE----MVKQAFLSQTADVTTVNGTRGLDLCFK 347


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 12/132 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G PA P  + LDTGSD+ WLQC APC RC +    ++ P    
Sbjct: 132 VSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYDQSGQVFDPRRSR 190

Query: 97  SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           S   V C  P+C  L + G   C+     C Y++ Y DG  + G    +   F    G R
Sbjct: 191 SYGAVGCSAPLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATETLTF--AGGAR 245

Query: 156 LNPRLALGCGYN 167
           +  R+ALGCG++
Sbjct: 246 VA-RIALGCGHD 256


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 106/264 (40%), Gaps = 38/264 (14%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPH------PLYRPSND----LVPCEDPICASLHAPGHH 117
           +DT SD+ W+QC APC     APH       LY PS        PC  P C +L  P  +
Sbjct: 160 IDTASDVPWVQC-APC----PAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNL-GPYAN 213

Query: 118 NCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASY 174
            C  PA  QC Y ++Y DG +S G  + D    N             GC +  + PG+  
Sbjct: 214 GCT-PAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFS 272

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRVVWTS 232
           +   GI+ LG+G  S+ +Q  ++    +V  +CL       GF   G     +SR   T 
Sbjct: 273 NKTSGIMALGRGAQSLPTQ--TKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAVTP 330

Query: 233 MSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYLNRVTYQTLTS 283
           M        +P +  +         K LPV         V DS +  T L    Y  L +
Sbjct: 331 M---LRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRA 387

Query: 284 IMKKELSAKSLKEAPEDETLPLCW 307
               E+  ++ + A   E L  C+
Sbjct: 388 AFVAEM--RAYRAAAPKEHLDTCY 409


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/181 (32%), Positives = 86/181 (47%), Gaps = 20/181 (11%)

Query: 24  SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
           +SS  LFN + +     VH   Y      + + IG P    +  +DTGSDL WLQC  PC
Sbjct: 37  NSSQVLFNRITAQTPVSVHHYDYL-----MELSIGTPPVKTYAQVDTGSDLIWLQC-IPC 90

Query: 84  VRCVEAPHPLYRP------SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGS 136
             C +  +P++ P      SN     E   C+ L++    +C  D   C+Y   Y D   
Sbjct: 91  TNCYKQLNPMFDPQSSSTYSNIAYGSES--CSKLYS---TSCSPDQNNCNYTYSYEDDSI 145

Query: 137 SLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
           + GVL ++      T G+ +  + +  GCG+N   G       GI+GLG+G  S+VSQ+ 
Sbjct: 146 TEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN-NGVFNDKEMGIIGLGRGPLSLVSQIG 204

Query: 196 S 196
           S
Sbjct: 205 S 205


>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 525

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 85/357 (23%), Positives = 147/357 (41%), Gaps = 67/357 (18%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE---APHP------LYRP----SND 99
            T+ +G P   + + LDTGSDL W+ CD  C RC     +P+       +Y P    ++ 
Sbjct: 114 TTVQLGTPGTKFMVALDTGSDLFWVPCD--CSRCAPTEGSPYASDFELSVYSPKKSSTSK 171

Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADG-GSSLGVLVKDAFAFN--YTNGQR 155
            VPC + +CA         C +    C Y + Y     S+ G+L++D       + + + 
Sbjct: 172 TVPCNNNLCAQ-----RDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSEP 226

Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +   +  GCG  QV   S+  +   +G+ GLG  + S+ S L  + L+ N    C S  G
Sbjct: 227 IQAYITFGCG--QVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDG 284

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKNLPVVF 264
            G + FGD           S+  + T +        Y+  V  +   G T    ++  +F
Sbjct: 285 VGRINFGDK---------GSLEQEETPFNLNQLHPNYNITVTSIRV-GTTLIDADITALF 334

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
           DSG+S++Y     Y  L++    +         P           R PF+  +++     
Sbjct: 335 DSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNP-----------RIPFEYCYNMSP--- 380

Query: 325 TLALSFTDGKTRTLFELTP----EAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
               S T G + T+    P    +  ++IS +  +  CL ++  AE+ +   N + G
Sbjct: 381 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNELIYCLAVVKSAELNIIGQNFMTG 437


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 144/360 (40%), Gaps = 48/360 (13%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G SLL    G    T  Y  ++ +G PA    ++LDTGSD +W+QC  PC  C E   P+
Sbjct: 123 GVSLLAN-WGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCK-PCADCYEQRDPV 180

Query: 94  YRPSN----DLVPCEDPICASLHAPGHHNCEDPA---QCDYELEYADGGSSLGVLVKDAF 146
           + P+       VPC    C  L +              C YE+ Y D   ++G L +D  
Sbjct: 181 FDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTL 240

Query: 147 AFNYTNGQRLN---PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
             + +         P    GCG++     ++  +DG+LGLG GK+S+ SQ+ ++      
Sbjct: 241 TLSPSPSPSPADTVPGFVFGCGHSN--AGTFGEVDGLLGLGLGKASLPSQVAAR--YGAA 296

Query: 204 VGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKN 259
             +CL  S    G+L FG     ++   +T M +  D T YY      L   G     + 
Sbjct: 297 FSYCLPSSPSAAGYLSFGGAAARAN-AQFTEMVTGQDPTSYY------LNLTGIVVAGRA 349

Query: 260 LPV-----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
           + V           + DSG++++ L    Y  L S  +  +     K AP       C+ 
Sbjct: 350 IKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYD 409

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGL 367
               F     V+     + L F DG T     L P   L   N     CL  +   ++G+
Sbjct: 410 ----FTGHETVR--IPAVELVFADGAT---VHLHPSGVLYTWNDVAQTCLAFVPNHDLGI 460


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 115/272 (42%), Gaps = 38/272 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-------LYRPSND---- 99
           Y +++ +G PA    + +DTGSD++W+QC+ PC     AP P       L+ P+      
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCE-PC----PAPSPCHAHAGALFDPAASSTYA 189

Query: 100 LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
              C    CA L   G  N C+  ++C Y ++Y DG ++ G    D    + ++  R   
Sbjct: 190 AFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVR--- 246

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GC + ++        DG++GLG    S+VSQ  ++        +CL       GFL
Sbjct: 247 GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAAR--YGKSFSYCLPATPASSGFL 304

Query: 217 FF----GDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
                       +SR   T M  S     YY   + ++  GG+  GL   P VF      
Sbjct: 305 TLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS--PSVFAAGSLV 362

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           DSG+  T L    Y  L+S  +  ++  +  E
Sbjct: 363 DSGTVITRLPPAAYAALSSAFRAGMTRYARAE 394


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P R  ++ +D+GSD+ W+QC  PC +C     P++ P++      V C
Sbjct: 198 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCTQCYHQSDPVFDPADSASFTGVSC 256

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  L   G H      +C YE+ Y DG  + G L  +   F    G+ +   +A+G
Sbjct: 257 SSSVCDRLENAGCH----AGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRSVAIG 308

Query: 164 CGYN 167
           CG+ 
Sbjct: 309 CGHR 312


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 12/124 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G PA+ Y++ LDTGSD+ W+QC  PC  C +   P++ P    S   + C
Sbjct: 156 SGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQ-PCSDCYQQSDPIFTPAASSSYSPLTC 214

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           +   C SL      N     QC Y++ Y DG  + G  V +  +F    G      +ALG
Sbjct: 215 DSQQCNSLQMSSCRN----GQCRYQVNYGDGSFTFGDFVTETMSF---GGSGTVNSIALG 267

Query: 164 CGYN 167
           CG++
Sbjct: 268 CGHD 271


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 12/124 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR +++ LDTGSD+ WLQC  PC  C +   P++ P+       V C
Sbjct: 17  SGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTC 75

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           +   C+SL      +C    QC Y++ Y DG  + G    ++ +F  +   +    +ALG
Sbjct: 76  QSQQCSSLE---MSSCRS-GQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK---NVALG 128

Query: 164 CGYN 167
           CG++
Sbjct: 129 CGHD 132


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 118/292 (40%), Gaps = 62/292 (21%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPC-------ED 105
           V   IGQP  P +  +DTGS LTW+QC+ PC+ C +   PLY PS+             D
Sbjct: 112 VNFSIGQPPVPQYAVMDTGSSLTWIQCE-PCINCHQQKGPLYNPSSSSTYVSCSDFDRTD 170

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY-TNGQRLNPRLALGC 164
               + H          + C+Y   YAD  ++ G   ++   F    +G  +   +  GC
Sbjct: 171 TTFTATHG---------SDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGC 221

Query: 165 GYN--QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF----F 218
           G+N  Q+PG + +   G+ GLG   SSI+S+L                 G GF +     
Sbjct: 222 GHNNTQLPGPTGYA-SGVFGLGDSGSSIISKL-----------------GFGFSYCIGNI 263

Query: 219 GDDLYDSSRVVW---TSMSSDYTKYYSPGVAELFFGGETTGLKNL---PVVF-------- 264
           GD LY   R+       +    T     G+  +   G + G + L   P+VF        
Sbjct: 264 GDPLYGFHRLTLGNKLKIEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGI 323

Query: 265 ------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
                 DSG++ +Y+ R  Y  +   +   LS    +       L LC+ G+
Sbjct: 324 SSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGK 375


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/289 (26%), Positives = 122/289 (42%), Gaps = 54/289 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G YN+ + +G P   + +  DTGSDL W QC APC +C + P P ++P++      +PC 
Sbjct: 84  GGYNMNISVGTPLLTFPVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCT 142

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
              C  L  P      +   C Y  +Y  G ++ G L  +        G    P +A GC
Sbjct: 143 SSFCQFL--PNSIRTCNATGCVYNYKYGSGYTA-GYLATETLKV----GDASFPSVAFGC 195

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD- 220
                 G S     GI GLG+G  S++ QL   +       +CL   S  G   + FG  
Sbjct: 196 STENGVGNS---TSGIAGLGRGALSLIPQLGVGRF-----SYCLRSGSAAGASPILFGSL 247

Query: 221 -DLYDSS--RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------------- 262
            +L D +     + +  + +  YY      +   G T G  +LPV               
Sbjct: 248 ANLTDGNVQSTPFVNNPAVHPSYY-----YVNLTGITVGETDLPVTTSTFGFTQNGLGGG 302

Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET--LPLCWK 308
            + DSG++ TYL +  Y+    ++K+   +++      + T  L LC+K
Sbjct: 303 TIVDSGTTLTYLAKDGYE----MVKQAFLSQTANVTTVNGTRGLDLCFK 347


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 72/290 (24%), Positives = 110/290 (37%), Gaps = 27/290 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--------SNDL 100
           G Y   M +G PA+ Y + +DTGS LTWLQC    V C     P++ P         +  
Sbjct: 119 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCS 178

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
            P  D +  +   P    C     C Y+  Y D   S+G L KD  +F  T+     P  
Sbjct: 179 APQCDALTTATLNPS--TCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNF 232

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
             GCG +      +    G++GL + K S++ QL     +     +CL        +   
Sbjct: 233 YYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSGYLSI 288

Query: 221 DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
             Y+  +  +T M+         + K     VA        +   +LP + DSG+  T L
Sbjct: 289 GSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRL 348

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
               Y  L+  +   +  K    A     L  C++G+     V  V   F
Sbjct: 349 PTDVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQASRLRVPQVSMAF 396


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 12/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
           V G    +G Y   + +G PA+  ++ LDTGSD+ W+QC  PC  C +   P++ P++  
Sbjct: 154 VSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQC-LPCSECYQQSDPIFDPTSSS 212

Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C DP CASL      +     +C Y++ Y DG  ++G    D   F  +   ++
Sbjct: 213 TFKSLTCSDPKCASLDVSACRS----NKCLYQVSYGDGSFTVGNYATDTVTFGESG--KV 266

Query: 157 NPRLALGCGYN 167
           N  +ALGCG++
Sbjct: 267 N-DVALGCGHD 276


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 64/131 (48%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           V G    +G Y   + IG+P  P ++ LDTGSD++W+QC APC  C E   P + P++  
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPXFEPTSSA 199

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + CE   C SL      N      C YE+ Y DG  ++G  V +      T+    
Sbjct: 200 SFTSLSCETEQCKSLDVSECRN----GTCLYEVSYGDGSYTVGDFVTETVTLGSTSLG-- 253

Query: 157 NPRLALGCGYN 167
              +A+GCG+N
Sbjct: 254 --NIAIGCGHN 262


>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
          Length = 518

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 138/356 (38%), Gaps = 49/356 (13%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP---- 96
           Y NV++  G PA  + + LDTGSDL WL C+    C+  ++        P  LY P    
Sbjct: 92  YANVSL--GTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNAST 149

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           ++  + C D  C      G   C  P   C Y++  +    + G L++D      T  + 
Sbjct: 150 TSSSIRCSDKRCF-----GSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHL-VTEDED 203

Query: 156 LNP---RLALGCGYNQVPGASYH-PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
           L P    + LGCG NQ         ++G+LGL   + S+ S L    +  N    C    
Sbjct: 204 LKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRI 263

Query: 211 -GGGGFLFFGDDLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
               G + FGD  Y D       S+ +  +  Y   V  +  GG    +  L  +FD+GS
Sbjct: 264 ISVVGRISFGDKGYTDQEETPLVSLET--STAYGVNVTGVSVGGVPVDVP-LFALFDTGS 320

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNV-----HDVKK 321
           S+T L    Y   T      +     K  P D   P   C+  R    N      H   K
Sbjct: 321 SFTLLLESAYGVFTKAFDDLMED---KRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSK 377

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
           C+      F     R   +   +  +  SN+G    CLGIL    + +   N++ G
Sbjct: 378 CYNPCRDDF-----RWRIQNDSQESVSYSNEGTKMYCLGILKSINLNIIGQNLMSG 428


>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 530

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 138/356 (38%), Gaps = 49/356 (13%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP---- 96
           Y NV++  G PA  + + LDTGSDL WL C+    C+  ++        P  LY P    
Sbjct: 104 YANVSL--GTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNAST 161

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           ++  + C D  C      G   C  P   C Y++  +    + G L++D      T  + 
Sbjct: 162 TSSSIRCSDKRCF-----GSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHL-VTEDED 215

Query: 156 LNP---RLALGCGYNQVPGASYH-PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
           L P    + LGCG NQ         ++G+LGL   + S+ S L    +  N    C    
Sbjct: 216 LKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRI 275

Query: 211 -GGGGFLFFGDDLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
               G + FGD  Y D       S+ +  +  Y   V  +  GG    +  L  +FD+GS
Sbjct: 276 ISVVGRISFGDKGYTDQEETPLVSLET--STAYGVNVTGVSVGGVPVDVP-LFALFDTGS 332

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNV-----HDVKK 321
           S+T L    Y   T      +     K  P D   P   C+  R    N      H   K
Sbjct: 333 SFTLLLESAYGVFTKAFDDLMED---KRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSK 389

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
           C+      F     R   +   +  +  SN+G    CLGIL    + +   N++ G
Sbjct: 390 CYNPCRDDF-----RWRIQNDSQESVSYSNEGTKMYCLGILKSINLNIIGQNLMSG 440


>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/271 (25%), Positives = 114/271 (42%), Gaps = 33/271 (12%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-------------PHPLYRPS-NDLVP 102
           +G P   + + LDTGSDL WL CD  C+ CV                + L + S ++ V 
Sbjct: 111 VGTPPLWFLVALDTGSDLFWLPCD--CISCVHGGLRTRTGKILKFNTYDLDKSSTSNEVS 168

Query: 103 CEDPICASLHAPGHHNCEDP-AQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNP 158
           C +    S        C    + C Y+++Y ++  SS G +V+D       + Q    + 
Sbjct: 169 CNN----STFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDDQTKDADT 224

Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
           R+A GCG  Q    + GA+    +G+ GLG    S+ S L  + LI N    C      G
Sbjct: 225 RIAFGCGQVQTGVFLNGAA---PNGLFGLGMDNISVPSILAREGLISNSFSMCFGSDSAG 281

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
            + FGD      R    ++   +   Y+  + ++        L+    +FDSG+S+TY+N
Sbjct: 282 RITFGDTGSPDQRKTPFNVRKLHPT-YNITITKIIVEDSVADLE-FHAIFDSGTSFTYIN 339

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
              Y  +  +   ++ AK       D  +P 
Sbjct: 340 DPAYTRIGEMYNSKVKAKRHSSQSPDSNIPF 370


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/132 (35%), Positives = 67/132 (50%), Gaps = 12/132 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G P  P  + LDTGSD+ WLQC APC RC +    ++ P    
Sbjct: 137 VSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASH 195

Query: 97  SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           S   V C  P+C  L + G   C+     C Y++ Y DG  + G    +   F   +G R
Sbjct: 196 SYGAVDCAAPLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATETLTF--ASGAR 250

Query: 156 LNPRLALGCGYN 167
           + PR+ALGCG++
Sbjct: 251 V-PRVALGCGHD 261


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 12/124 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR +++ LDTGSD+ WLQC  PC  C +   P++ P+       V C
Sbjct: 158 SGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTC 216

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           +   C+SL      +C    QC Y++ Y DG  + G    ++ +F  +   +    +ALG
Sbjct: 217 QSQQCSSLEM---SSCRS-GQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK---NVALG 269

Query: 164 CGYN 167
           CG++
Sbjct: 270 CGHD 273


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 149/380 (39%), Gaps = 56/380 (14%)

Query: 34  GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
           G  L+  V      +G Y   + +G PA    L LDT SDLTWLQC  PC RC     P+
Sbjct: 124 GRGLVAPVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQ-PCRRCYPQSGPV 182

Query: 94  YRPSNDLVPCE----DPICASLHAPGHHNCEDPAQCDYELEYADG------GSSLGVLVK 143
           + P +     E     P C +L   G  + +    C Y + Y DG       +S+G LV+
Sbjct: 183 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKR-GTCIYTVLYGDGDGHGSTSTSVGDLVE 241

Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH-------- 195
           +   F    G      L++GCG++   G    P  GILGL +G+ SI  Q+         
Sbjct: 242 ETLTF---AGGVRQAYLSIGCGHDN-KGLFGAPAAGILGLSRGQISIPHQIAFLGYNASF 297

Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSS---RVVWTSMSSDYTKYYSPGVAELFFGG 252
           S  L+  + G    G     L FG    D+S       T ++ +   +Y   +  +  GG
Sbjct: 298 SYCLVDFISG---PGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGG 354

Query: 253 -ETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSA-KSLKEAPE 299
               G+               V+ DSG++ T L R  Y       +   +    +     
Sbjct: 355 VRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGP 414

Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALS--FTDGKTRTLFELTPEAYLI-ISNKGNVC 356
                 C+           ++ C +  A+S  F  G       L P+ YLI + ++G VC
Sbjct: 415 SGLFDTCYT----VGGRAGLRHCVKVPAVSMHFAGG---VELSLQPKNYLITVDSRGTVC 467

Query: 357 LGILNGAEVGLQDLNVIGGI 376
                 A  G + ++VIG I
Sbjct: 468 FAF---AGTGDRSVSVIGNI 484


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/124 (37%), Positives = 63/124 (50%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y   M IG P R Y+L+LDTGSD+TW+QC APC  C     P+Y PSN      V C
Sbjct: 42  SGEYFARMGIGSPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYC 100

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C +L    +  C+    C Y + Y D  +S G L  ++F     N       +A G
Sbjct: 101 GSALCQALD---YSACQG-MGCSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFG 155

Query: 164 CGYN 167
           CG++
Sbjct: 156 CGHS 159


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q   +    +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q   +    +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/355 (25%), Positives = 142/355 (40%), Gaps = 50/355 (14%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---------NDLVPC 103
           V++ IG P +P  L LDTGS L+W+QC    ++    P P  + +           L+PC
Sbjct: 68  VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLPC 127

Query: 104 EDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
             PIC     P      +C+    C Y   YADG  + G LV++ F F+ +      P +
Sbjct: 128 NHPICKP-RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS---TPPV 183

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
            LGC              GILG+ +G+ S +SQ    K    V     S   G  LF+  
Sbjct: 184 ILGCAQASTEN------RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTG--LFYLG 235

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP------------- 261
           D  +SS+  + +M +      SP +  L +      +K      N+P             
Sbjct: 236 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQ 295

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            + DSGS  TYL    Y+ +   + + + A   K     +   +C+          +V +
Sbjct: 296 TMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDA----GVTAEVGR 351

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
               ++  F +G    +F    E  L    KG  C+GI     +G+   N+IG +
Sbjct: 352 RIGGISFEFDNGV--EIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGS-NIIGTV 403


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 64/124 (51%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y   M IG P R Y+L+LDTGSD+TW+QC APC  C     P+Y PSN      V C
Sbjct: 9   SGEYFARMGIGNPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYC 67

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C +L    +  C+    C Y + Y D  +S G L  ++F     +   +   +A G
Sbjct: 68  GSALCQALD---YSACQG-MGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMR-NIAFG 122

Query: 164 CGYN 167
           CG++
Sbjct: 123 CGHS 126


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q   +    +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 166/376 (44%), Gaps = 50/376 (13%)

Query: 17  MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGY-YNVTMYIGQPARPYFLDLDTGSDLT 75
           M++ ++SSS SS+           V   ++P G  Y + + +G P + +    DTGSDL 
Sbjct: 26  MAARANSSSWSSMAGTT------DVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLV 79

Query: 76  WLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYA 132
           W+Q + PC  C       P    +   + C   +C  L  PG  +CE   + C Y  EY 
Sbjct: 80  WVQSE-PCTGCSGGTIFDPRQSSTFREMDCSSQLCTEL--PG--SCEPGSSACSYSYEYG 134

Query: 133 DGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
             G + G   +D  +   T+ G +  P  A+GCG   +  + +  +DG++GLG+G  S+ 
Sbjct: 135 S-GETEGEFARDTISLGTTSGGSQKFPSFAVGCG---MVNSGFDGVDGLVGLGQGPVSLT 190

Query: 192 SQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL------YDSSRVVWTSMSSDYTKYY 241
           SQL +   I +   +CL    S      L FG           S+++  T  S  Y  YY
Sbjct: 191 SQLSAA--IDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKI--TPPSDTYPTYY 246

Query: 242 SPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
              V  +   G+T G     ++ DSG++ TY+    Y  + S M+  ++   +  +    
Sbjct: 247 LLTVNGIAVAGQTMGSPGTTII-DSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS--SM 303

Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN-VCLGIL 360
            L LC+  R   +N       F  L +        T+   +   +L++ + G+ VCL + 
Sbjct: 304 GLDLCYD-RSSNRNYK-----FPALTIRLAGA---TMTPPSSNYFLVVDDSGDTVCLAM- 353

Query: 361 NGAEVGLQDLNVIGGI 376
            G+  GL  +++IG +
Sbjct: 354 -GSAGGLP-VSIIGNV 367


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/376 (23%), Positives = 146/376 (38%), Gaps = 87/376 (23%)

Query: 44  NVYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWLQCDA--PCVRC---------VEAP 90
           +++P  Y  Y+V++  G P +      DTGS L W  C A   C RC         +   
Sbjct: 123 SLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKF 182

Query: 91  HPLYRPSNDLVPCEDPICASLHAPGH----HNCEDPA-QCD-----YELEYADGGSSLGV 140
            P    S  +V C +P CA +  P       NC   + +C      Y L+Y  G ++ G+
Sbjct: 183 VPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA-GI 241

Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
           L+ +          +  P   +GC    V     H   GI G G+G  S+ SQ+  ++  
Sbjct: 242 LLSETLDLE----NKRVPDFLVGCSVMSV-----HQPAGIAGFGRGPESLPSQMRLKRF- 291

Query: 201 RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------------------- 239
                HCL   G     F D    S  V+ +   SD +K                     
Sbjct: 292 ----SHCLVSRG-----FDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFR 342

Query: 240 -YYSPGVAELFFGGETTGLK----------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
            YY   +  +  GG+               N   + DSGS++T+L++  ++ +   ++K+
Sbjct: 343 EYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQ 402

Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC--FRTLALSFTDGKTRTLFELTPEAY 346
           L      +  E ++      G RP  N+   ++   F  + L F  G       L  E Y
Sbjct: 403 LVKYPRAKDVEAQS------GLRPCFNIPKEEESAEFPDVVLKFKGGGK---LSLAAENY 453

Query: 347 L-IISNKGNVCLGILN 361
           L +++++G VCL ++ 
Sbjct: 454 LAMVTDEGVVCLTMMT 469


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 123/312 (39%), Gaps = 58/312 (18%)

Query: 25  SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----- 79
           S++S+F     S L     N+   G Y V++  G PA PY L LDT +DLTW+ C     
Sbjct: 106 SATSMFELPMRSAL-----NIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRR 160

Query: 80  ---------------DAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCE 120
                          D    +     +  YRP+       + C    CA L    ++ C+
Sbjct: 161 KGKHYGRTMSVGAGDDGAAAKEARRKN-WYRPAKSSSWRRIRCSQKECALLP---YNTCQ 216

Query: 121 DPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHP 176
            P++   C Y  +  DG  ++G+  K+      ++G+    P L LGC   +  G S   
Sbjct: 217 SPSKAESCSYYQQMQDGTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEA-GGSVDA 275

Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDD---LYDSSRV 228
            DG+L LG G+ S    +H+ K        CL     S     +L FG +   +   +  
Sbjct: 276 HDGVLSLGNGEMSFA--VHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME 333

Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------VVFDSGSSYTYLNRVTY 278
                + D    Y P V  +F GGE   +              V+ D+ +S T L    Y
Sbjct: 334 TDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393

Query: 279 QTLTSIMKKELS 290
             +TS + + LS
Sbjct: 394 AAVTSALDRHLS 405


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 126/298 (42%), Gaps = 38/298 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y + + IG P     +  DTGSDL W+QC  PC  C +   P++ P        V CE
Sbjct: 92  GEYFMRISIGTPPIEVLVIADTGSDLIWVQCQ-PCQECYKQKSPIFNPKQSSTYRRVLCE 150

Query: 105 DPICASLHAPGHHNCEDPA---QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
              C +L++     C        C Y   Y D   ++G L  + F    TN       LA
Sbjct: 151 TRYCNALNS-DMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSI--QELA 207

Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGF 215
            GCG N   G       GI+GLG G  S++SQL ++  I N   +CL      S    G 
Sbjct: 208 FGCG-NSNGGNFDEVGSGIVGLGGGSLSLISQLGTK--IDNKFSYCLVPILEKSNFSLGK 264

Query: 216 LFFGDDLYDSSRVVWTS---MSSDYTKYYSPGVAELFFGGETTGLKNLP---------VV 263
           + FGD+ + S    + S   +S +   +Y   +  +  G E    +N           ++
Sbjct: 265 IVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNII 324

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR----RPFKNVH 317
            DSG++ T+L+   Y  L  +++K +  + + +   +    +C++ +     P   VH
Sbjct: 325 IDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDP--NGIFSICFRDKIGIELPIITVH 380


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 120/281 (42%), Gaps = 39/281 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--C 103
           +G Y   + +G PAR  ++  DTGSD++WLQC +PC +C     P++ P  S+   P  C
Sbjct: 78  SGDYFARIGVGTPARSVYMVADTGSDVSWLQC-SPCRKCYRQQDPIFNPSLSSSFKPLAC 136

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC  L   G   C    +C Y++ Y DG  ++G    +  +F    G+     +A+G
Sbjct: 137 ASSICGKLKIKG---CSRKNECMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMG 189

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
           CG N      +H   G+LGLG+G  S  SQ  +     +V  +CL    S      +F  
Sbjct: 190 CGRNN--QGLFHGAAGLLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAASLVFGP 245

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------------VVFDS 266
             + + +R      +     YY  G+A +   G      N+P             V+ DS
Sbjct: 246 SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPV---NIPPDAFAMGSRGTGGVIVDS 302

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           G   T ++R+T    T++     S  +   AP       C+
Sbjct: 303 G---TAISRLTTPAYTALRDAFRSLVTFPSAPGISLFDTCY 340


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 67/129 (51%), Gaps = 13/129 (10%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G    +G Y   + +GQP++P+++ LDTGSD+ WLQC  PC  C +   P++ P    S 
Sbjct: 149 GTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCK-PCSDCYQQSDPIFDPTASSSY 207

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
           + + C+   C  L      N     +C Y++ Y DG  ++G  V +  +F    G     
Sbjct: 208 NPLTCDAQQCQDLEMSACRN----GKCLYQVSYGDGSFTVGEYVTETVSF----GAGSVN 259

Query: 159 RLALGCGYN 167
           R+A+GCG++
Sbjct: 260 RVAIGCGHD 268


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   L++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFSFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 93/208 (44%), Gaps = 26/208 (12%)

Query: 28  SLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
           +L +H  ++ LF   GN      + V +  G P + + L LDTGS +TW QC  PCVRC+
Sbjct: 145 NLKDHTPNNKLFDEDGN------FLVDVAFGTPPQKFTLILDTGSSITWTQCK-PCVRCL 197

Query: 88  EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFA 147
           +A    + PS           ASL               Y + Y D  +S+G    D   
Sbjct: 198 KASRRHFDPS-----------ASLTYSLGSCIPSTVGNTYNMTYGDKSTSVGNYGCDTMT 246

Query: 148 FNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
             +++   + P+   GCG N   G      DG+LGLG+G+ S VSQ  S+   + V  +C
Sbjct: 247 LEHSD---VFPKFQFGCGRNN-EGDFGSGADGMLGLGQGQLSTVSQTASK--FKKVFSYC 300

Query: 208 L--SGGGGGFLFFGDDLYDSSRVVWTSM 233
           L      G  LF       SS + +TS+
Sbjct: 301 LPEEDSIGSLLFGEKATSQSSSLKFTSL 328


>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 510

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 81/336 (24%), Positives = 131/336 (38%), Gaps = 34/336 (10%)

Query: 57  IGQPARPYFLDLDTGSDLTWL--QCDA--PCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
           +G P   + + LDTGSDL WL  QCD   P           Y PS    +  VPC    C
Sbjct: 108 VGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGASGSASFYIPSMSSTSQAVPCNSDFC 167

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
                    +C   + C Y++ Y     SS G LV+D    +  +   Q L  ++  GCG
Sbjct: 168 DH-----RKDCSTTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDNHPQILKAQIMFGCG 222

Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
             QV   S+      +G+ GLG    S+ S L  + L  +    C    G G + FGD  
Sbjct: 223 --QVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCFGRDGIGRISFGDQG 280

Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
                     ++  +   Y+  +  +  G E   L+    +FD+G+++TYL    Y  +T
Sbjct: 281 SSDQEETPLDINQKHPT-YAITITGITVGTEPMDLE-FSTIFDTGTTFTYLADPAYTYIT 338

Query: 283 SIMKKELSAKSLKEAPEDETLPL--CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
                ++ A        D  +P   C+        +      FRT+      G    + +
Sbjct: 339 QSFHTQVRA---NRHAADTRIPFEYCYDLSSSEARIQTPGVSFRTVG-----GSLFPVID 390

Query: 341 LTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
           L  +   I  ++   CL I+   ++ +   N + G+
Sbjct: 391 LG-QVISIQQHEYVYCLAIVKSTKLNIIGQNFMTGV 425


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 123/312 (39%), Gaps = 58/312 (18%)

Query: 25  SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----- 79
           S++S+F     S L     N+   G Y V++  G PA PY L LDT +DLTW+ C     
Sbjct: 106 SATSMFELPMRSAL-----NIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRR 160

Query: 80  ---------------DAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCE 120
                          D    +     +  YRP+       + C    CA L    ++ C+
Sbjct: 161 KGKHYGRTMSVGAGDDGAAAKEARRKN-WYRPAKSSSWRRIRCSQKECALLP---YNTCQ 216

Query: 121 DPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHP 176
            P++   C Y  +  DG  ++G+  K+      ++G+    P L LGC   +  G S   
Sbjct: 217 SPSKAESCSYYQQMQDGTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEA-GGSVDA 275

Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDD---LYDSSRV 228
            DG+L LG G+ S    +H+ K        CL     S     +L FG +   +   +  
Sbjct: 276 HDGVLSLGNGEMSFA--VHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME 333

Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGSSYTYLNRVTY 278
                + D    Y P V  +F GGE   +              V+ D+ +S T L    Y
Sbjct: 334 TDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393

Query: 279 QTLTSIMKKELS 290
             +TS + + LS
Sbjct: 394 AAVTSALDRHLS 405


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 150/371 (40%), Gaps = 71/371 (19%)

Query: 36  SLLFQVHGNVYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-----CVRCV- 87
           +L  +V    YP  Y  Y+V   +G P +   L LDTGS L W  C  P     C  C  
Sbjct: 57  TLTGKVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTF 116

Query: 88  ----EAPHPLY-RPSNDLV---PCEDPICASLHAPGHHNCEDPAQCDYE-LEYADGGSSL 138
                   P+Y R  +  V   PC  P C  +      NC    +C Y  LEY   GS+ 
Sbjct: 117 SGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFG-SDLNCSTTKRCPYYGLEYGL-GSTT 174

Query: 139 GVLVKDAFAFNYTNGQRLNPRLALGCGY--NQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           G LV D    +  N  R+ P    GC    N+ P       +GI G G+G +SI +QL  
Sbjct: 175 GQLVSDVLGLSKLN--RI-PDFLFGCSLVSNRQP-------EGIAGFGRGLASIPAQLGL 224

Query: 197 QKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSR--VVWTSMS-----SDYTKYYSPGVA 246
            K    +V H        G   L  G    D++   V +   +     S Y++YY   ++
Sbjct: 225 TKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLS 284

Query: 247 ELFFGGETTGLKNLPV---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
           ++  GG     K++P+               + DSGS++T++ R+ +  +   ++K ++ 
Sbjct: 285 KILVGG-----KDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTK 339

Query: 292 -KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS 350
            K  KE  +   L  C+      ++  DV K    L  SF  G      +L    Y  + 
Sbjct: 340 YKRAKEIEDSSGLGPCYNITG--QSEVDVPK----LTFSFKGGAN---MDLPLTDYFSLV 390

Query: 351 NKGNVCLGILN 361
             G VC+ +L 
Sbjct: 391 TDGVVCMTVLT 401


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 97/346 (28%), Positives = 148/346 (42%), Gaps = 43/346 (12%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+ IG PA    + +DTGSDL+W+QC  PC    C     PLY P+       VPC+
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNSSSCYPQKDPLYDPTASSTYAPVPCD 185

Query: 105 DPICASLHAPGH-HNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
              C  L    + H C + +    C Y +EY +  +++GV   +       + Q      
Sbjct: 186 SKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---SPQVSVKDF 242

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFF 218
             GCG  Q    ++   DG+LGLG    S+VSQ  + +       +CL  G    GFL  
Sbjct: 243 GFGCGLVQ--QGTFDLFDGLLGLGGAPESLVSQ--TAETYGGAFSYCLPPGNSTTGFLAL 298

Query: 219 G--DDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSY 270
           G   +  D++  ++T + S  +   +Y   +  +  GG+   +        ++ DSG+  
Sbjct: 299 GAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMIIDSGTII 358

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
           T L    Y  L +  +  +SA  L     D+ L  C+     F  + +V     T+AL+F
Sbjct: 359 TGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN----FTGIANVT--VPTVALTF 412

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
             G T  L    P   LI       CL    GA  G  D+ +IG +
Sbjct: 413 DGGATIDLD--VPSGVLI-----QDCLAFAGGASDG--DVGIIGNV 449


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 71/151 (47%), Gaps = 12/151 (7%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
           Y + + IG P  P+    DTGSDLTW QC  PC  C     P+Y PS       VPC   
Sbjct: 77  YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 135

Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLN-PRLALG 163
            C  L      NC  P+  C Y   Y+DG  S G+L  +     +   GQ ++   +A G
Sbjct: 136 TC--LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFG 193

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
           CG +   G       G +GLG+G  S+++QL
Sbjct: 194 CGTDN--GGDSLNSTGTVGLGRGTLSLLAQL 222


>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Brachypodium distachyon]
          Length = 436

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 87/195 (44%), Gaps = 31/195 (15%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSN-- 98
           G Y +T+ +G P+R Y+L   TGSD+ W+    PC  C + P P        LY P N  
Sbjct: 74  GLYCITVKLGNPSRHYYLAFHTGSDVMWV----PCSSCTDCPTPDDIGFSLDLYDPKNSS 129

Query: 99  --DLVPCEDPICASLHAPGHHNCEDP----AQCDYELEYADGG-SSLGVLVKDAFAFNYT 151
               + C D  CA     GH  C        QC Y   YADG  ++ G  V D   F+  
Sbjct: 130 TSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSDDIHFDIF 189

Query: 152 NGQR----LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
            G       +  +  GC  ++   + +   DG++G GK   S++SQL+SQ  + +    C
Sbjct: 190 MGNESFASSSASVIFGCSKSR---SGHLQADGVIGFGKDAPSLISQLNSQG-VSHAFSRC 245

Query: 208 L--SGGGGGFLFFGD 220
           L  S  GGG L   +
Sbjct: 246 LDDSDDGGGVLILDE 260


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGAMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 90/223 (40%), Gaps = 27/223 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V + IG P   +   +DT SDL W QC  PC  C     P++ P    +   +PC 
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145

Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L  H  GH   +D   C Y   Y+   ++ G L  D        G+     +A 
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFG 219
           GC  +   GA      G++GLG+G  S+VSQL  ++       +CL        G L  G
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRF-----AYCLPPPASRIPGKLVLG 253

Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL 257
              D   +++  +   M  D  Y  YY   +  L  G  T  L
Sbjct: 254 ADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSL 296


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 120/281 (42%), Gaps = 39/281 (13%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--C 103
           +G Y   + +G PAR  ++  DTGSD++WLQC +PC +C     P++ P  S+   P  C
Sbjct: 11  SGDYFARIGVGTPARSVYMVADTGSDVSWLQC-SPCRKCYRQQDPIFNPSLSSSFKPLAC 69

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              IC  L   G   C    +C Y++ Y DG  ++G    +  +F    G+     +A+G
Sbjct: 70  ASSICGKLKIKG---CSRKNKCMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMG 122

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
           CG N      +H   G+LGLG+G  S  SQ  +     +V  +CL    S      +F  
Sbjct: 123 CGRNN--QGLFHGAAGLLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAASLVFGP 178

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------------VVFDS 266
             + + +R      +     YY  G+A +   G      N+P             V+ DS
Sbjct: 179 SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPV---NIPPDAFAMGSRGTGGVIVDS 235

Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           G   T ++R+T    T++     S  +   AP       C+
Sbjct: 236 G---TAISRLTTPAYTALRDAFRSLVTFPSAPGISLFDTCY 273


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGAMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 30/292 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
           G Y   M +G PA+ Y + +DTGS LTWLQC    V C     P++ P            
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCS 184

Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
              C D   A+L+     +C     C Y+  Y D   S+G L KD  +F  T+     P 
Sbjct: 185 AQQCSDLTTATLNP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 237

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
              GCG +      +    G++GL + K S++ QL     +     +CL +       + 
Sbjct: 238 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 293

Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
               Y+  +  +T M+S        + K     VA       ++   +LP + DSG+  T
Sbjct: 294 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 353

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            L    Y  L+  +   +  K    A     L  C++G+     V +V   F
Sbjct: 354 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 403


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 30/292 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
           G Y   M +G PA+ Y + +DTGS LTWLQC    V C     P++ P            
Sbjct: 127 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCS 186

Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
              C D   A+L+     +C     C Y+  Y D   S+G L KD  +F  T+     P 
Sbjct: 187 AQQCSDLTTATLNP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 239

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
              GCG +      +    G++GL + K S++ QL     +     +CL +       + 
Sbjct: 240 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 295

Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
               Y+  +  +T M+S        + K     VA       ++   +LP + DSG+  T
Sbjct: 296 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 355

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            L    Y  L+  +   +  K    A     L  C++G+     V +V   F
Sbjct: 356 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 405


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 120/294 (40%), Gaps = 35/294 (11%)

Query: 29  LFNHVGSSLLFQVHGNVYPTGYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
           LF   GS  LF   GN +  G+ + T   IG P   + + LD GSDL W+ CD  C++C 
Sbjct: 84  LFPSEGSDALFL--GNEF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCA 137

Query: 88  EAPHPLY----RPSNDLVP----------CEDPICASLHAPGHHNCEDPAQCDYELEY-A 132
                 Y    R  N+  P          C D +C         + +DP  C Y   Y +
Sbjct: 138 PLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCE--LGSDCKSSKDP--CPYLASYYS 193

Query: 133 DGGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPGASYHPL-DGILGLGKGK 187
           +  SS G+L++D         + +   +   + +GCG  Q    S     DG++GLG G 
Sbjct: 194 ENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGD 253

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVA 246
            S+ S L    L+RN    C      G + FGD  L       +  +   +  Y    V 
Sbjct: 254 LSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIE-VE 312

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAP 298
               G  +        + DSG+S+T+L    Y+ +     K+++A   S K +P
Sbjct: 313 GYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 366


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 134/363 (36%), Gaps = 78/363 (21%)

Query: 1   MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQP 60
           M S  NG +        S      S++S+F     S L     N+   G Y V++ IG P
Sbjct: 80  MGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSAL-----NIAHVGMYLVSVRIGTP 134

Query: 61  ARPYFLDLDTGSDLTWLQCDAPCVR-------------------CVEAPHPLYRPSND-- 99
           A PY L LDT +DLTW+ C     +                     EA    YRP+    
Sbjct: 135 ALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEASKNWYRPAKSSS 194

Query: 100 --LVPCEDPICASLHAPGHHNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
              + C    CA L    ++ C+ P++   C Y  +  DG  ++G+  K+      ++G+
Sbjct: 195 WRRIRCSQKECAVLP---YNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVTVSDGR 251

Query: 155 RLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----- 208
               P L LGC   +  G S    DG+L LG G  S    +H+ K        CL     
Sbjct: 252 MAKLPGLILGCSVLEA-GGSVDAHDGVLSLGNGDMSFA--VHAAKRFGQRFSFCLLSANS 308

Query: 209 SGGGGGFLFFG-------------DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
           S     +L FG             D LY+           D    Y   V  +  GGE  
Sbjct: 309 SRDASSYLTFGPNPAVMGPGTMETDILYN----------VDVKPAYGAQVTGVLVGGERL 358

Query: 256 GLKNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
            + +            V+ D+ +S T L    Y  +T+ + + LS   L    E E    
Sbjct: 359 DIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLS--HLPRVYELEGFEY 416

Query: 306 CWK 308
           C+K
Sbjct: 417 CYK 419


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 116/289 (40%), Gaps = 40/289 (13%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSN----DLVPC 103
           G Y++ + +G P   +   +DTGSDLTW QC APC   C   P PLY P+       +PC
Sbjct: 94  GAYHMILSVGTPPLAFPAIIDTGSDLTWTQC-APCTTACFAQPTPLYDPARSSTFSKLPC 152

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR---- 159
             P+C +L  P      +   C Y+  YA G ++ G L  D  A    +G          
Sbjct: 153 ASPLCQAL--PSAFRACNATGCVYDYRYAVGFTA-GYLAADTLAIGDGDGDGDASSSFAG 209

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGF 215
           +A GC  +   G       GI+GLG+   S++SQ+   +       +CL      G    
Sbjct: 210 VAFGC--STANGGDMDGASGIVGLGRSALSLLSQIGVGRF-----SYCLRSDADAGASPI 262

Query: 216 LF------FGDDLYDSSRVVWTSMSSDYTKYY-------SPGVAELFFGGETTGLKNL-- 260
           LF       GD +  ++ +     +     YY       + G  +L     T G      
Sbjct: 263 LFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGA 322

Query: 261 -PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
             V+ DSG+++TYL    Y  L      + +    + +       LC++
Sbjct: 323 GGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE 371


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 139/350 (39%), Gaps = 53/350 (15%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------ 100
            T+ +G P   + + LDTGSDL W+ CD  C +C       Y    +L            
Sbjct: 103 TTVELGTPGMKFMVALDTGSDLFWVPCD--CSKCAPTQGVAYASDFELSIYDPKQSSTSK 160

Query: 101 -VPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQ 154
            V C + +CA      H N C    + C Y + Y    +S  G+LV+D        +N +
Sbjct: 161 KVTCNNNLCA------HRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQE 214

Query: 155 RLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
            +   +  GCG  QV   S+      +G+ GLG  + S+ S L  + L  +    C    
Sbjct: 215 SIKAYVTFGCG--QVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHD 272

Query: 212 GGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
           G G + FGD    D     + S  S  +  Y+  V ++  G     + +   +FDSG+S+
Sbjct: 273 GVGRISFGDKGSPDQEETPFNSNPSHPS--YNISVTQVRVGTTLVDV-DFTALFDSGTSF 329

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV-----KKCFRT 325
           TYL    Y  ++     +   K     P           R PF+  +D+          +
Sbjct: 330 TYLINPIYAMVSENFHAQAQDKRRPPDP-----------RIPFEYCYDMSPGANSSLIPS 378

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
           ++L+       T+F+  P   +   N+   CL I+   E+ +   N + G
Sbjct: 379 MSLTMKGRGHFTVFD--PIIVITTQNELVYCLAIVKSTELNIIGQNFMTG 426


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y  ++ +G PA+   +++DTGS ++W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 120/294 (40%), Gaps = 35/294 (11%)

Query: 29  LFNHVGSSLLFQVHGNVYPTGYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
           LF   GS  LF   GN +  G+ + T   IG P   + + LD GSDL W+ CD  C++C 
Sbjct: 74  LFPSEGSDALFL--GNEF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCA 127

Query: 88  EAPHPLY----RPSNDLVP----------CEDPICASLHAPGHHNCEDPAQCDYELEY-A 132
                 Y    R  N+  P          C D +C         + +DP  C Y   Y +
Sbjct: 128 PLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCE--LGSDCKSSKDP--CPYLASYYS 183

Query: 133 DGGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPGASYHPL-DGILGLGKGK 187
           +  SS G+L++D         + +   +   + +GCG  Q    S     DG++GLG G 
Sbjct: 184 ENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGD 243

Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVA 246
            S+ S L    L+RN    C      G + FGD  L       +  +   +  Y    V 
Sbjct: 244 LSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIE-VE 302

Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAP 298
               G  +        + DSG+S+T+L    Y+ +     K+++A   S K +P
Sbjct: 303 GYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 356


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 123/308 (39%), Gaps = 42/308 (13%)

Query: 14  TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           T ++ S  S+++++ L      S      G  + +G Y   + +G P     + +DTGSD
Sbjct: 61  TAQLESLHSATAAADLLRSPVMS------GVPFDSGEYFAVIGVGDPPTHALVVIDTGSD 114

Query: 74  LTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLHAPGHHNCE-DPAQCDY 127
           L WLQC  PC RC     PLY P N      +PC  P C   L  PG   C+     C Y
Sbjct: 115 LIWLQC-LPCRRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLRYPG---CDARTGGCVY 170

Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGK 187
            + Y DG +S G L  D       +  R++  + LGCG++           G+LG G+G+
Sbjct: 171 MVVYGDGSASSGDLATDTLVL--PDDTRVH-NVTLGCGHDNE--GLLASAAGLLGAGRGQ 225

Query: 188 SSIVSQLHSQKLIRNVVGHCL------SGGGGGFLFFGD--DLYDSSRVVWTSMSSDYTK 239
            S  +QL       +V  +CL      +     +L FG   +L  ++     +     + 
Sbjct: 226 LSFPTQL--APAYGHVFSYCLGDRMSRARNSSSYLVFGRTPELPSTAFTPLRTNPRRPSL 283

Query: 240 YYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKE 288
           YY   V     G    G  N             VV DSG++ +   R  Y  +       
Sbjct: 284 YYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSH 343

Query: 289 LSAKSLKE 296
            +A  ++ 
Sbjct: 344 AAAAGMRR 351


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 147/357 (41%), Gaps = 71/357 (19%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y + +++G P + + L LDTGSDL W+QC  PC  C E   P Y P    S   + C
Sbjct: 178 SGEYFIDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYECFEQNGPHYDPGQSSSYRNIGC 236

Query: 104 EDPICASLHAPGHHNCEDPAQ--------CDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
            D  C  + +P      DP Q        C Y   Y D  ++ G    + F  N T    
Sbjct: 237 HDSRCHLVSSP------DPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSG 290

Query: 155 ----RLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
               R    +  GCG +N+     +H   G+LGLG+G  S  SQL  Q L  +   +CL 
Sbjct: 291 KPELRRVENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLV 345

Query: 209 ----SGGGGGFLFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLK 258
                      L FG+  DL     + +T++     +    +Y   +  +  GGE     
Sbjct: 346 DRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVV--- 402

Query: 259 NLP-------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           N+P              + DSG++ +Y     YQ    ++K+   AK +K  P  +  P+
Sbjct: 403 NIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQ----VIKEAFMAK-VKGYPVVKDFPV 457

Query: 306 CWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
                 P  NV  V++       + F+DG    ++    E Y I I  +  VCL IL
Sbjct: 458 L----EPCYNVTGVEQPDLPDFGIVFSDG---AVWNFPVENYFIEIEPREVVCLAIL 507


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/160 (35%), Positives = 73/160 (45%), Gaps = 16/160 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSNDLVP 102
           T  Y V + +G P RP  L LDTGSDL W QC APC+ C     +    P    ++  V 
Sbjct: 91  TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQC-APCLNCFDQGAIPVLDPAASSTHAAVR 149

Query: 103 CEDPICASL--HAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAF----NYTNGQR 155
           C+ P+C +L   + G          C Y   Y D   ++G L  D F F    N   G  
Sbjct: 150 CDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGV 209

Query: 156 LNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQL 194
              RL  GCG +N+  G       GI G G+G+ S+ SQL
Sbjct: 210 SERRLTFGCGHFNK--GIFQANETGIAGFGRGRWSLPSQL 247


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 30/292 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
           G Y   M +G PA+ Y + +DTGS LTWLQC    V C     P++ P            
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCS 184

Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
              C D   A+L+     +C     C Y+  Y D   S+G L KD  +F  T+     P 
Sbjct: 185 AQQCSDLTTATLNP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 237

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
              GCG +      +    G++GL + K S++ QL     +     +CL +       + 
Sbjct: 238 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 293

Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
               Y+  +  +T M+S        + K     VA       ++   +LP + DSG+  T
Sbjct: 294 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 353

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            L    Y  L+  +   +  K    A     L  C++G+     V +V   F
Sbjct: 354 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 403


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 90/223 (40%), Gaps = 27/223 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V + IG P   +   +DT SDL W QC  PC  C     P++ P    +   +PC 
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145

Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L  H  GH   +D   C Y   Y+   ++ G L  D        G+     +A 
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFG 219
           GC  +   GA      G++GLG+G  S+VSQL  ++       +CL        G L  G
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRF-----AYCLPPPASRIPGKLVLG 253

Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL 257
              D   +++  +   M  D  Y  YY   +  L  G  T  L
Sbjct: 254 ADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSL 296


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 134/367 (36%), Gaps = 82/367 (22%)

Query: 1   MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQP 60
           M S  NG +        S      S++S+F     S L     N+   G Y V++ IG P
Sbjct: 79  MGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSAL-----NIAHVGMYLVSVRIGTP 133

Query: 61  ARPYFLDLDTGSDLTWLQC-----------------------DAPCVRCVEAPHPLYRPS 97
           A PY L LDT +DLTW+ C                       +       EA    YRP+
Sbjct: 134 ALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAKKEASKNWYRPA 193

Query: 98  ND----LVPCEDPICASLHAPGHHNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNY 150
                  + C    CA L    ++ C+ P++   C Y  +  DG  ++G+  K+      
Sbjct: 194 KSSSWRRIRCSQKECAVLP---YNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVTV 250

Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
           ++G+    P L LGC   +  G S    DG+L LG G  S    +H+ K        CL 
Sbjct: 251 SDGRMAKLPGLILGCSVLEA-GGSVDAHDGVLSLGNGDMSFA--VHAAKRFGQRFSFCLL 307

Query: 209 ----SGGGGGFLFFG-------------DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG 251
               S     +L FG             D LY+           D    Y   V  +  G
Sbjct: 308 SANSSRDASSYLTFGPNPAVMGPGTMETDILYN----------VDVKPAYGAKVTGVLVG 357

Query: 252 GETTGLKNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
           GE   + +            V+ D+ +S T L    Y  +T+ + + LS   L    E E
Sbjct: 358 GERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLS--HLPRVYELE 415

Query: 302 TLPLCWK 308
               C+K
Sbjct: 416 GFEYCYK 422


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 65/130 (50%), Gaps = 13/130 (10%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           V G    +G Y V + +G P R  ++ +D+GSD+ W+QC  PC  C +   P++ P+   
Sbjct: 127 VSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQ-PCSECYQQSDPVFDPAGSA 185

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C+  +C  L   G   C D  +C YE+ Y DG  + G L  +   F    G+ L
Sbjct: 186 TYAGISCDSSVCDRLDNAG---CND-GRCRYEVSYGDGSYTRGTLALETLTF----GRVL 237

Query: 157 NPRLALGCGY 166
              +A+GCG+
Sbjct: 238 IRNIAIGCGH 247


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 114/292 (39%), Gaps = 30/292 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
           G Y   M +G PA+ Y + +DTGS LTWLQC    V C     P++ P            
Sbjct: 127 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCS 186

Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
              C D   A+L      +C     C Y+  Y D   S+G L KD  +F  T+     P 
Sbjct: 187 AQQCSDLTTATLSP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 239

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
              GCG +      +    G++GL + K S++ QL     +     +CL +       + 
Sbjct: 240 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 295

Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
               Y+  +  +T M+S        + K     VA       ++   +LP + DSG+  T
Sbjct: 296 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 355

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
            L    Y  L+  +   +  K    A     L  C++G+     V +V   F
Sbjct: 356 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 405


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 66/136 (48%), Gaps = 11/136 (8%)

Query: 93  LYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
           LY P    +++ VPC D  C   ++     C+    C Y + Y DG ++ G  V D+  F
Sbjct: 48  LYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTF 107

Query: 149 NYTNG----QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
           +  +G    +  N  +  GCG  Q   +   S   LDGI+G G+  SS++SQL +   ++
Sbjct: 108 DEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVK 167

Query: 202 NVVGHCLSGGGGGFLF 217
            +  HCL    GG +F
Sbjct: 168 RIFSHCLDSHHGGGIF 183


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 110/268 (41%), Gaps = 32/268 (11%)

Query: 39  FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
           F V G   P+  G Y   + +G P R +++ +DTGSD+ W+ C + C  C +        
Sbjct: 63  FPVKGTFDPSQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQL 121

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD-- 144
               P    ++ L+ C D  C S       +C     QC Y  +Y DG  + G  V D  
Sbjct: 122 NYFDPRSSSTSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLM 181

Query: 145 --AFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
             A  F  T     +  +  GC   Q      S   +DGI G G+   S++SQL  Q + 
Sbjct: 182 HFAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIA 241

Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL- 257
             V  HCL G   GGG L  G+ +     +V++ +      +Y+  +  +   G+   + 
Sbjct: 242 PRVFSHCLKGDNSGGGVLVLGEIV--EPNIVYSPLVQS-QPHYNLNLQSISVNGQIVPIA 298

Query: 258 -------KNLPVVFDSGSSYTYLNRVTY 278
                   N   + DSG++  YL    Y
Sbjct: 299 PAVFATSNNRGTIVDSGTTLAYLAEEAY 326


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 129/323 (39%), Gaps = 35/323 (10%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPS----NDLVPCEDPICASLHAPGHHN--CE- 120
           LDTGS L+WLQC    V C     PLY PS       + C    C+ L A   ++  CE 
Sbjct: 3   LDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCET 62

Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGI 180
           D   C Y   Y D   S+G L +D      T+ Q L P+   GCG  Q     +    GI
Sbjct: 63  DSNACLYTASYGDTSFSIGYLSQDLLTL--TSSQTL-PQFTYGCG--QDNQGLFGRAAGI 117

Query: 181 LGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY 240
           +GL + K S+++QL ++    +   +CL     G    G     S        +   T  
Sbjct: 118 IGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDS 175

Query: 241 YSPGVAELFFGGETT---------GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
            +P +  L     T           +  +P + DSG+  T L    Y  L     K +S 
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMST 235

Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
           K  K AP    L  C+KG    K++  V +    + + F  G   T   L   + LI ++
Sbjct: 236 KYAK-APAYSILDTCFKGS--LKSISAVPE----IKMIFQGGADLT---LRAPSILIEAD 285

Query: 352 KGNVCLGILNGAEVGLQDLNVIG 374
           KG  CL     +  G   + +IG
Sbjct: 286 KGITCLAFAGSS--GTNQIAIIG 306


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 91/355 (25%), Positives = 141/355 (39%), Gaps = 50/355 (14%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---------NDLVPC 103
           V++ IG P +P  L LDTGS L+W+QC    V+    P P  + +           L+PC
Sbjct: 68  VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLPC 127

Query: 104 EDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
             PIC     P      +C+    C Y   YADG  + G LV++ F F+ +      P +
Sbjct: 128 NHPICKP-RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS---TPPV 183

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
            LGC              GILG+  G+ S +SQ    K    V     S   G  LF+  
Sbjct: 184 ILGCAQASTEN------RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTG--LFYLG 235

Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP------------- 261
           D  +SS+  + +M +      SP +  L +      +K      N+P             
Sbjct: 236 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQ 295

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            + DSGS  TYL    Y+ +   + + + A   K     +   +C+          +V +
Sbjct: 296 TMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDA----GVTAEVGR 351

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
               ++  F +G    +F    E  L    KG  C+GI     +G+   N+IG +
Sbjct: 352 RIGGISFEFDNG--VEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGS-NIIGTV 403


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 73/155 (47%), Gaps = 10/155 (6%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G Y ++  +G P    +  +DTGSD+ WLQC  PC +C      ++ PS      ++P  
Sbjct: 84  GEYLISYSVGIPPFQLYGIIDTGSDMIWLQCK-PCEKCYNQTTRIFDPSKSNTYKILPFS 142

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C S+      + ++   C+Y + Y DG  S G L  +      TNG  +   R  +G
Sbjct: 143 STTCQSVEDTSCSS-DNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIG 201

Query: 164 CGYNQVPGASYH-PLDGILGLGKGKSSIVSQLHSQ 197
           CG N     S+     GI+GLG G  S+++QL  +
Sbjct: 202 CGRNNT--VSFEGKSSGIVGLGNGPVSLINQLRRR 234


>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
          Length = 566

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 84/200 (42%), Gaps = 28/200 (14%)

Query: 33  VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRC 86
           VG  + F V G   P   G Y   + +G P R + + +DTGSD+ W+ C +    P    
Sbjct: 112 VGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE 171

Query: 87  VEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
           ++     + P    S  LV C D  C S +      C     C Y  +Y DG  + G  +
Sbjct: 172 LQIQLSFFDPGVSSSASLVSCSDRRCYS-NFQTESGCSPNNLCSYSFKYGDGSGTSGYYI 230

Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
            D    N  +G    PR A+               DGI GLG+G  S++SQL  Q L   
Sbjct: 231 SDFMCSNLQSGDLQRPRRAV---------------DGIFGLGQGSLSVISQLAVQGLAPR 275

Query: 203 VVGHCLSG--GGGGFLFFGD 220
           V  HCL G   GGG +  G 
Sbjct: 276 VFSHCLKGDKSGGGIMVLGQ 295


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 88/358 (24%), Positives = 143/358 (39%), Gaps = 62/358 (17%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
           +++ IG P +   + LDTGS L+W+QC     +    P   + P    S   +PC  P+C
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131

Query: 109 ASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
                P      +C+    C Y   YADG  + G LVK+   F+ T    + P L LGC 
Sbjct: 132 KP-RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITPPLILGCA 187

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-------GFLFF 218
                        GILG+ +G+ S VSQ    K       +C+            G  + 
Sbjct: 188 TESSDDR------GILGMNRGRLSFVSQAKISKF-----SYCIPPKSNRPGFTPTGSFYL 236

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG----GETTGLKNLPV------------ 262
           GD+  +S    + S+ +       P +  L +     G   GLK L +            
Sbjct: 237 GDNP-NSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 295

Query: 263 ---VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              + DSGS +T+L    Y  + + +   +  +  K      T  +C+ G     NV  +
Sbjct: 296 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG-----NVAMI 350

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
            +    L   FT G    +  L P+  ++++  G + C+GI   + +G    N+IG +
Sbjct: 351 PRLIGDLVFVFTRG----VEILVPKERVLVNVGGGIHCVGIGRSSMLGAAS-NIIGNV 403


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 93/346 (26%), Positives = 134/346 (38%), Gaps = 61/346 (17%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLHA----PGH-- 116
           +DTGSDLTW+QC  PC  C     PL+ PS       VPC    C ASL A    PG   
Sbjct: 181 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239

Query: 117 -----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG 171
                       +C Y L Y DG  S GVL  D  A     G  ++     GCG +    
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL---GGASVDG-FVFGCGLSNR-- 293

Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL---YD 224
             +    G++GLG+ + S+VSQ   +     V  +CL    SG   G L  G D     +
Sbjct: 294 GLFGGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAGSLSLGGDTSSYRN 351

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYL 273
           ++ V +T M +D      P     +F   T                  V+ DSG+  T L
Sbjct: 352 ATPVSYTRMIAD------PAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 405

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y+ + +   ++  A+    AP    L  C+          +VK    TL L   +G
Sbjct: 406 APSVYRAVRAEFARQFGAERYPAAPPFSLLDACYN----LTGHDEVKVPLLTLRL---EG 458

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
                 +     ++   +   VCL +   A +  +D   I  IG++
Sbjct: 459 GADMTVDAAGMLFMARKDGSQVCLAM---ASLSFEDQTPI--IGNY 499


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 56/164 (34%), Positives = 76/164 (46%), Gaps = 11/164 (6%)

Query: 44  NVYPT-GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSND-- 99
            + PT G Y +T+ IG P   Y    DTGSDL W QC APC  +C + P PLY PS+   
Sbjct: 78  QISPTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQC-APCSSQCFQQPTPLYNPSSSTT 136

Query: 100 --LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--GQR 155
             ++PC   +     A           C Y + Y  G +S+     + F F  +    Q 
Sbjct: 137 FAVLPCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSGWTSV-YQGSETFTFGSSTPANQT 195

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             P +A GC  N   G +     G++GLG+G  S+VSQL   K 
Sbjct: 196 GVPGIAFGCS-NASGGFNTSSASGLVGLGRGSLSLVSQLGVPKF 238


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 93/346 (26%), Positives = 134/346 (38%), Gaps = 61/346 (17%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLHA----PGH-- 116
           +DTGSDLTW+QC  PC  C     PL+ PS       VPC    C ASL A    PG   
Sbjct: 180 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238

Query: 117 -----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG 171
                       +C Y L Y DG  S GVL  D  A     G  ++     GCG +    
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL---GGASVDG-FVFGCGLSNR-- 292

Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL---YD 224
             +    G++GLG+ + S+VSQ   +     V  +CL    SG   G L  G D     +
Sbjct: 293 GLFGGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAGSLSLGGDTSSYRN 350

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYL 273
           ++ V +T M +D      P     +F   T                  V+ DSG+  T L
Sbjct: 351 ATPVSYTRMIAD------PAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 404

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
               Y+ + +   ++  A+    AP    L  C+          +VK    TL L   +G
Sbjct: 405 APSVYRAVRAEFARQFGAERYPAAPPFSLLDACYN----LTGHDEVKVPLLTLRL---EG 457

Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
                 +     ++   +   VCL +   A +  +D   I  IG++
Sbjct: 458 GADMTVDAAGMLFMARKDGSQVCLAM---ASLSFEDQTPI--IGNY 498


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 74/287 (25%), Positives = 116/287 (40%), Gaps = 37/287 (12%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL- 100
           G++  +  Y V + +G P R   L  DTGSDLTW QC+ PC   C +    ++ PS    
Sbjct: 128 GSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAIFDPSKSSS 186

Query: 101 ---VPCEDPICASLHAPG-HHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
              + C   +C  L + G    C      C Y ++Y D  +S+G L ++      T+   
Sbjct: 187 YINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD--- 243

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGG 213
           +      GCG  Q     +    G++GLG+   S V Q  S  +   +  +CL  +    
Sbjct: 244 IVDDFLFGCG--QDNEGLFSGSAGLIGLGRHPISFVQQTSS--IYNKIFSYCLPSTSSSL 299

Query: 214 GFLFFGDDLYDSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLPVV-------- 263
           G L FG     ++ + +T +S  S    +Y   +  +  GG       LP V        
Sbjct: 300 GHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGG-----TKLPAVSSSTFSAG 354

Query: 264 ---FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
               DSG+  T L    Y  L S  ++ +    +  A ED     C+
Sbjct: 355 GSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPV--ANEDGLFDTCY 399


>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
          Length = 191

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 14/162 (8%)

Query: 6   NGENLCFPTVRMSSSSS--SSSSSSLFNHVGSSLLFQVHGNVYPT--GYYNVTMYIGQPA 61
           NG+NL F   R  ++ S              SS+ F + GN  PT  G Y   + +G P 
Sbjct: 21  NGDNLVFQVERRKTTLSGIKHHDHHRRGRFLSSVDFNLGGNGLPTRTGLYFTKLGLGSPK 80

Query: 62  RPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPICASLH 112
           + Y++ +DTGSD+ W+ C   C RC           LY P    +++L+ C+   C+S +
Sbjct: 81  KDYYVQVDTGSDILWVNC-VECSRCPTKSQIGMDLTLYDPKGSHTSELISCDHEFCSSTY 139

Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
                 C     C Y + Y DG ++ G  V+D   F+  NG 
Sbjct: 140 DGPIPGCRAETPCPYSITYGDGSATTGYYVRDYLTFDRINGN 181


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 66/131 (50%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SN 98
           V G    +G Y + + IG+P    ++ LDTGSD++W+QC APC  C +   P++ P  SN
Sbjct: 139 VSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPISSN 197

Query: 99  DLVP--CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
              P  C++P C SL      N      C YE+ Y DG  ++G    +      T G   
Sbjct: 198 SYSPIRCDEPQCKSLDLSECRN----GTCLYEVSYGDGSYTVGEFATETV----TLGSAA 249

Query: 157 NPRLALGCGYN 167
              +A+GCG+N
Sbjct: 250 VENVAIGCGHN 260


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 94/353 (26%), Positives = 142/353 (40%), Gaps = 66/353 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G Y + +++G P R + L +DTGSDLTWLQC  PC  C +   P++ PS      ++PC 
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC-KPCKACFDQSGPVFDPSQSTSFKIIPCN 227

Query: 105 DPICASLHAPGHHNCED------PAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLN 157
              C  +    H  C D      P  C Y   Y D   + G L  ++ + + ++    L 
Sbjct: 228 AAACDLV---VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLE 284

Query: 158 PR-LALGCGYNQVPGASYHPLDGILGL------GKGKSSIVSQLHSQKLIRNV----VGH 206
            R + +GCG++             LG        + +SS + Q  S  L+       V  
Sbjct: 285 IRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSS 344

Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---- 262
            +S G G  L    D    +  V T+ S + T YY      L   G     + LP+    
Sbjct: 345 AISFGAGFALSRHFDQMRFTPFVRTNNSVE-TFYY------LGIQGIKIDQELLPIPAER 397

Query: 263 -----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--G 309
                      + DSG++ TYLNR  Y+ + S     L+  S   A   + L +C+   G
Sbjct: 398 FAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAF---LARISYPRADPFDILGICYNATG 454

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN--KGNVCLGIL 360
           R            F TL++ F +G      +L  E Y I  +  +   CL IL
Sbjct: 455 RTAVP--------FPTLSIVFQNGAE---LDLPQENYFIQPDPQEAKHCLAIL 496


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 86/304 (28%), Positives = 119/304 (39%), Gaps = 59/304 (19%)

Query: 51  YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHP----LYRPSNDLVPCE 104
           Y + + IG P RP    L LDTGSDL W QC   C  C   P P    L   +   VPC 
Sbjct: 100 YLIHLSIGTP-RPQRVALTLDTGSDLVWTQC--ACHVCFAQPFPTFDALASQTTLAVPCS 156

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY---TNGQRLN---- 157
           DPIC S   P      +   C Y  +YAD   + G +V+D F F      NG + +    
Sbjct: 157 DPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVA 216

Query: 158 -PRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
            P +  GCG YN+  G       GI G  +G  S+ SQL   +       HC +      
Sbjct: 217 VPNVRFGCGQYNK--GIFKSNESGIAGFSRGPMSLPSQLKVARF-----SHCFTAIADAR 269

Query: 216 ---LFFG-----DDL--YDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPV-- 262
              +F G     D+L  + +  V  T  + S+ + YY      L   G T G   LP+  
Sbjct: 270 TSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYY------LTLKGITVGKTRLPLNA 323

Query: 263 ---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                          + DSG+    L    Y++L +     +      E+  D    LC+
Sbjct: 324 LAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCF 383

Query: 308 KGRR 311
           +  R
Sbjct: 384 EAAR 387


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 139/331 (41%), Gaps = 47/331 (14%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P R  ++ +D+GSD+ W+QC  PC RC +   P++ P++      V C
Sbjct: 140 SGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCK-PCSRCYQQSDPVFDPADSSSFAGVSC 198

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  L   G   C +  +C YE+ Y DG  + G L  +      T GQ +   +A+G
Sbjct: 199 GSDVCDRLENTG---C-NAGRCRYEVSYGDGSYTKGTLALETL----TVGQVMIRDVAIG 250

Query: 164 CGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFG 219
           CG+ NQ     +    G+LGLG G  S + QL  Q        +CL     G  G L FG
Sbjct: 251 CGHTNQ---GMFIGAAGLLGLGGGSMSFIGQLGGQT--GGAFSYCLVSRGTGSTGALEFG 305

Query: 220 DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG-------ETTGLKNL---PVVFDSG 267
                     W S+  +     +Y  G+A +  GG       ET  L       VV D+G
Sbjct: 306 RGALPVG-ATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTG 364

Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
           ++ T      Y         + S  +L  AP       C+     F++V        T++
Sbjct: 365 TAVTRFPTAAYVAFRDSFTAQTS--NLPRAPGVSIFDTCYD-LNGFESVR-----VPTVS 416

Query: 328 LSFTDGKTRTLFELTPEAYLI-ISNKGNVCL 357
             F+DG   T   L    +LI +   G  CL
Sbjct: 417 FYFSDGPVLT---LPARNFLIPVDGGGTFCL 444


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 120/279 (43%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q   +    +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 63/120 (52%), Gaps = 15/120 (12%)

Query: 55  MYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRP----SNDLVPCEDPI 107
           M +GQP +P F  LDTGSD+TWLQC  PC     C E   P++ P    S + V C+   
Sbjct: 1   MRVGQPQQPSFFVLDTGSDVTWLQC-LPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQ 59

Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
           C  L   G +       C Y++EY DG  ++G L  +   F ++N     P +++GCG++
Sbjct: 60  CQLLDEAGCNV----NSCIYKVEYGDGSFTIGELATETLTFVHSNSI---PNISIGCGHD 112


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 74/280 (26%), Positives = 120/280 (42%), Gaps = 32/280 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKE-APEDETLPLCWKGR 310
           Y+         S++++ +    LK  A E+E+   C+  R
Sbjct: 233 YIP----DRALSVLRQRIRELLLKRGAAEEESERNCYDMR 268


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 81/163 (49%), Gaps = 14/163 (8%)

Query: 44  NVYPTG---YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-PLYRPS-- 97
           N++P+     + V   +GQP  P    +DTGS L W+QC APC  C +    P++ PS  
Sbjct: 92  NLHPSASEPLFLVNFSMGQPPVPQLAIMDTGSSLLWIQC-APCKSCSQQIIGPMFDPSIS 150

Query: 98  --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQ 154
              D + C++ IC   +AP    C+  +QC Y   Y +G  S+GV+  +   F  ++ G+
Sbjct: 151 STYDSLSCKNIICR--YAPSGE-CDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGR 207

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
                +  GC +            G+ GLG G +S+V+Q+ S+
Sbjct: 208 NAVNNVLFGCSHRN-GNYKDRRFTGVFGLGSGITSVVNQMGSK 249


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 13/124 (10%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P R  ++ +D+GSD+ W+QC+ PC +C     P++ P++      V C
Sbjct: 133 SGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSFSGVSC 191

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C+ +     H      +C YE+ Y DG  + G L  +   F    G+ L   +A+G
Sbjct: 192 ASTVCSHVDNAACHE----GRCRYEVSYGDGSYTKGTLALETITF----GRTLIRNVAIG 243

Query: 164 CGYN 167
           CG++
Sbjct: 244 CGHH 247


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 69/156 (44%), Gaps = 14/156 (8%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V + IG P   +   +DT SDL W QC  PC  C     P++ P    +   +PC 
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145

Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L  H  GH   +D   C Y   Y+   ++ G L  D        G+     +A 
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQK 198
           GC  +   GA      G++GLG+G  S+VSQL  ++
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRR 234


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 118/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y  ++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 119/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 118/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y  ++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 128/320 (40%), Gaps = 67/320 (20%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR---CVEAPHPLYR 95
            + H NV  T    V++ +G P +   + LDTGS+L+WL C     R      +  P   
Sbjct: 77  LRFHHNVSLT----VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRAS 132

Query: 96  PSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
            +   VPC    C S   P    C+   ++C   L YADG SS G L  D FA    +G 
Sbjct: 133 STFAAVPCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVG--SGP 190

Query: 155 RLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
            L  R A GC    ++  P        G+LG+ +G  S VSQ  +++       +C+S  
Sbjct: 191 PL--RAAFGCMSSAFDSSPDGVAS--AGLLGMNRGALSFVSQASTRRF-----SYCISDR 241

Query: 211 GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG---------GETTGLKNL 260
              G L  G  DL        T +  +YT  Y P +   +F          G   G K+L
Sbjct: 242 DDAGVLLLGHSDLP-------TFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHL 294

Query: 261 PV---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
           P+               + DSG+ +T+L    Y  L +   ++  A+ L  A +D +   
Sbjct: 295 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQ--ARPLLPALDDPSF-- 350

Query: 306 CWKGRRPFKNVHDVKKCFRT 325
                  F+   D   CFR 
Sbjct: 351 ------AFQEAFDT--CFRV 362


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 121/281 (43%), Gaps = 32/281 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G PA+   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFTFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----------SGGGGGFL 216
           +      +  +DG+LG+G G+ S++ Q        +   +CL          S   G F 
Sbjct: 116 DSFGANEFGNVDGLLGMGAGQMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172

Query: 217 FFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSS 269
             G      + V +T M +    T+ +   +  +   GE  GL         VVFDSGS 
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSE 232

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            +Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 LSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 270


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 92/379 (24%), Positives = 143/379 (37%), Gaps = 69/379 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD----------APCVRCVEAPHPLYRPSN 98
           G Y V   +G PA+P+ L  DTGSDLTW++C           +       +P   +RP  
Sbjct: 93  GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEK 152

Query: 99  DL----VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAF----- 148
                 +PC    C+         C  P + C Y+  Y DG ++ G +  ++        
Sbjct: 153 SKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSS 212

Query: 149 -----NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLI 200
                N     +L   L LGC      G S+   DG+L LG    S  S   S+   +  
Sbjct: 213 SSSSKNKVKKAKLQ-GLVLGC-TGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFS 270

Query: 201 RNVVGHCLSGGGGGFLFFGDDLYDS----------SRVVWTSMSSDYTKYYSPGVAELFF 250
             +V H        +L FG +   S          +R     + S    +Y   +  +  
Sbjct: 271 YCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISV 330

Query: 251 GGETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
            GE   L  +P           V+ DSG+S T L +  Y+ + + + K+L+       P 
Sbjct: 331 DGE---LLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLA-----RFPR 382

Query: 300 DETLPL--CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
               P   C+    P +   D       LA+ F  G  R   E   ++Y+I +  G  C+
Sbjct: 383 VAMDPFEYCYNWTSPSRK--DEGDDLPKLAVHFA-GSAR--LEPPSKSYVIDAAPGVKCI 437

Query: 358 GILNGAEVGLQDLNVIGGI 376
           G+  G   G   ++VIG I
Sbjct: 438 GVQEGPWPG---ISVIGNI 453


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 13/133 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
           V G    +G Y   + +G P R  ++ LDTGSD+ W+QC+ PC  C     P++ PS   
Sbjct: 147 VSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE-PCRECYSQADPIFNPSYSA 205

Query: 99  --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               V C+  +C+ L A   H+      C YE  Y DG  S G    +   F  T+    
Sbjct: 206 SFSTVGCDSAVCSQLDAYDCHS----GGCLYEASYGDGSYSTGSFATETLTFGTTS---- 257

Query: 157 NPRLALGCGYNQV 169
              +A+GCG+  V
Sbjct: 258 VANVAIGCGHKNV 270


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 82/277 (29%), Positives = 119/277 (42%), Gaps = 38/277 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
           Y VT+ IG P R + +  DTGSDLTW+QC  PC    C     PL+ PS       VPC 
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQC-LPCPDSSCYPQQEPLFDPSKSSTYVDVPCS 180

Query: 105 DPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR---L 160
            P C   H  G       A  C+Y ++Y D   + G L ++ F  +  +   L P    +
Sbjct: 181 APEC---HIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPS--PLAPAATGV 235

Query: 161 ALGCG--YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN---VVGHCL--SGGGG 213
             GC   Y  V   +   + G+LGLG+G SSI+SQ  +++ I +   V  +CL   G   
Sbjct: 236 VFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQ--TRRSINSGGGVFSYCLPPRGSST 293

Query: 214 GFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------V 262
           G+L  G          S + +T + +  ++  S  V  L          ++P        
Sbjct: 294 GYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGA 353

Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
           V DSG+  T++    Y  L    +  L   S K  PE
Sbjct: 354 VIDSGTVVTHMPAAAYYPLRDEFR--LHMGSYKMLPE 388


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 84/191 (43%), Gaps = 24/191 (12%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
            + H NV  T    +++ +G P +   + +DTGS+L+WL C+      +  P+P + P  
Sbjct: 58  LRFHHNVSLT----ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATI--PYPFFNPNI 111

Query: 97  --SNDLVPCEDPICASLHA--PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
             S   + C  P C +     P   +C+    C   L YAD  SS G L  D F F    
Sbjct: 112 SSSYTPISCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGF---- 167

Query: 153 GQRLNPRLALGCGYNQVPGASYHPLD--GILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
           G   NP +  GC  +     S    +  G++G+  G  S+VSQL   K       +C+SG
Sbjct: 168 GSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKF-----SYCISG 222

Query: 211 GG-GGFLFFGD 220
               G L  G+
Sbjct: 223 SDFSGILLLGE 233


>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 134/341 (39%), Gaps = 43/341 (12%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-------EAPHPLYRPS----NDLVPCED 105
           +G P   + + LDTGSDL WL C   C  C         AP   Y PS    +  VPC  
Sbjct: 104 VGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSSTSQAVPCNS 161

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNY--TNGQRLNPRLAL 162
             C          C   + C Y++ Y     SS G LV+D    +   T+ Q L  ++  
Sbjct: 162 DFCGL-----RKECSKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTHPQFLKAQIMF 216

Query: 163 GCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
           GCG  +V   S+      +G+ GLG    S+ S L  + L  N    C    G G + FG
Sbjct: 217 GCG--EVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGRDGIGRISFG 274

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQ 279
           D            ++  +   Y+  +  +  G     L+ +  +FD+G+S+TYL    Y 
Sbjct: 275 DQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLE-VSTIFDTGTSFTYLADPAYT 332

Query: 280 TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC---FRTLALSFTDGKTR 336
            +T     ++ A   + A +          R PF+  +D+       +T ++S       
Sbjct: 333 YITDGFHSQVQAN--RHAADS---------RIPFEYCYDLSSSEARIQTPSISLRTVGGS 381

Query: 337 TLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
               + P   + I     V CL I+   ++ +   N + G+
Sbjct: 382 LFPAIDPGQVISIQQHEYVYCLAIVKSTKLNIIGQNFMTGV 422


>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 134/341 (39%), Gaps = 43/341 (12%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-------EAPHPLYRPS----NDLVPCED 105
           +G P   + + LDTGSDL WL C   C  C         AP   Y PS    +  VPC  
Sbjct: 104 VGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSSTSQAVPCNS 161

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNY--TNGQRLNPRLAL 162
             C          C   + C Y++ Y     SS G LV+D    +   T+ Q L  ++  
Sbjct: 162 DFCGL-----RKECSKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTHPQFLKAQIMF 216

Query: 163 GCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
           GCG  +V   S+      +G+ GLG    S+ S L  + L  N    C    G G + FG
Sbjct: 217 GCG--EVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGRDGIGRISFG 274

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQ 279
           D            ++  +   Y+  +  +  G     L+ +  +FD+G+S+TYL    Y 
Sbjct: 275 DQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLE-VSTIFDTGTSFTYLADPAYT 332

Query: 280 TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC---FRTLALSFTDGKTR 336
            +T     ++ A   + A +          R PF+  +D+       +T ++S       
Sbjct: 333 YITDGFHSQVQAN--RHAADS---------RIPFEYCYDLSSSEARIQTPSISLRTVGGS 381

Query: 337 TLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
               + P   + I     V CL I+   ++ +   N + G+
Sbjct: 382 LFPAIDPGQVISIQQHEYVYCLAIVKSTKLNIIGQNFMTGV 422


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 139/350 (39%), Gaps = 52/350 (14%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQC--DAPCVRCVEAPHPLYRPSNDLVPCEDPICAS 110
           VT+ IG P +P  + LDTGS L+W+QC    P     +   P    S  ++PC  P+C  
Sbjct: 90  VTLPIGTPPQPQQMVLDTGSQLSWIQCHNKTPPTASFD---PSLSSSFYVLPCTHPLCKP 146

Query: 111 LHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
              P       C+    C Y   YADG  + G LV++  AF+ +   +  P L LGC   
Sbjct: 147 -RVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPS---QTTPPLILGC--- 199

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFGDDLYD 224
               +      GILG+  G+ S   Q    K    V     +       G  + G++  +
Sbjct: 200 ---SSESRDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNN-PN 255

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP-------------VVFD 265
           S+R  + SM +       P +  L +     G++      N+P              + D
Sbjct: 256 SARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVD 315

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SGS +T+L  V Y  +   + + L  +  K         +C+ G     N  ++ +    
Sbjct: 316 SGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDG-----NAMEIGRLLGD 370

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIG 374
           +A  F  G    +  + P+  ++    G V C+GI     +G    N+IG
Sbjct: 371 VAFEFEKG----VEIVVPKERVLADVGGGVHCVGIGRSERLGAAS-NIIG 415


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 113/291 (38%), Gaps = 37/291 (12%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPH--PLYRPSNDL----VPCEDPICASLHAPGHHNCED 121
           +D+GSD+ W+QC  PC   V  P   PL+ P+       VPC    CA L  P    C  
Sbjct: 85  IDSGSDVPWVQCQ-PCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARL-GPYRRGCLA 142

Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGIL 181
            +QC + + YA+G ++ G    D       +  R       GC +        + + G L
Sbjct: 143 NSQCQFGITYANGATATGTYSSDDLTLGPYDVVR---GFLFGCAHADQGSTFSYDVAGTL 199

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRV---VWTSMSSD 236
            LG G  S V Q  SQ     V  +C+  S    GF+ FG     ++ V   V T + S 
Sbjct: 200 ALGGGSQSFVQQTASQ--YSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVSTPLLSS 257

Query: 237 YTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYLNRVTYQTLTSIMKK 287
            T   SP    +         + LPV         V DS +  + +    YQ L +  + 
Sbjct: 258 ST--MSPTFYRVLLRSIIVAGRPLPVPPTVFSASSVIDSATVISRIPPTAYQALRAAFRS 315

Query: 288 ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
            ++    + AP    L  C+     F  V  +     ++AL F  G T  L
Sbjct: 316 AMTM--YRPAPPVSILDTCYD----FSGVRSIT--LPSIALVFDGGATVNL 358


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 13/133 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
           V G    +G Y   + +G P R  ++ LDTGSD+ W+QC+ PC  C     P++ PS   
Sbjct: 147 VSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE-PCRECYSQADPIFNPSYSA 205

Query: 99  --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               V C+  +C+ L A   H+      C YE  Y DG  S G    +   F  T+    
Sbjct: 206 SFSTVGCDSAVCSQLDAYDCHS----GGCLYEASYGDGSYSTGSFATETLTFGTTS---- 257

Query: 157 NPRLALGCGYNQV 169
              +A+GCG+  V
Sbjct: 258 VANVAIGCGHKNV 270


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 118/280 (42%), Gaps = 27/280 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + + IG P    +   DTGSDL W QC  PC+ C +  +P++ PS       V CE
Sbjct: 89  GEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCE 147

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLAL 162
              C  L      +C  P + CD+   Y DG  + GV+  +    N  +GQ  +   +  
Sbjct: 148 SQQCRLLDT---VSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVF 204

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS-----QKLIRNVVGHCLSGGGGGFLF 217
           GCG+N     + + + G+ G G    S+ SQ+ S     +K  + +V           + 
Sbjct: 205 GCGHNNSGTFNENEM-GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263

Query: 218 FGDDLYDS-SRVVWTSM-SSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSGS 268
           FG +   S S VV T + + D   YY       S G     F   +       V  D+G+
Sbjct: 264 FGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 323

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
             T L R  Y  L   +K+ +  + +++   D    LC++
Sbjct: 324 PPTLLPRDFYNRLVQGVKEAIPMEPVQDP--DLQPQLCYR 361


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 88/181 (48%), Gaps = 20/181 (11%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + +G P R  ++ +D+GSD+ W+QC  PC +C     PL+ P++      V C
Sbjct: 40  SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMGVSC 98

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  +   G ++     +C YE+ Y DG  + G L  +   F    G+ +   +A+G
Sbjct: 99  SSAVCDRVENAGCNS----GRCRYEVSYGDGSYTKGTLALETLTF----GRTVVRNVAIG 150

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---GGFLFFGD 220
           CG++      +    G+LGLG G  S + QL  Q    N   +CL   G    GFL FG 
Sbjct: 151 CGHSNR--GMFVGAAGLLGLGGGSMSFMGQLSGQT--GNAFSYCLVSRGTNTNGFLEFGS 206

Query: 221 D 221
           +
Sbjct: 207 E 207


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 140/370 (37%), Gaps = 54/370 (14%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--------VRCVEAPHPLYRPSNDL 100
           G Y V   +G PA+P+ L  DTGSDLTW++C  P               P   +RP +  
Sbjct: 95  GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSR 154

Query: 101 ----VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
               + C    C          C  P + C Y+  Y DG ++ G +  ++     +  + 
Sbjct: 155 TWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREE 214

Query: 156 LNPR---LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLS 209
              +   L LGC  +   G S+   DG+L LG    S  S   S+   +    +V H   
Sbjct: 215 RKAKLKGLVLGCSSSYT-GPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSP 273

Query: 210 GGGGGFLFFGDDLYDSS-------------RVVWTSMSSD--YTKYYSPGVAELFFGGET 254
                +L FG +   SS             R   T +  D     +Y   +  +   GE 
Sbjct: 274 RNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEF 333

Query: 255 TGLKNLP--------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
             +            V+ DSG+S T L +  Y+ + + + K L+   L     D     C
Sbjct: 334 LKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAG--LPRVTMDP-FEYC 390

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
           +    P     DV      +A+ F  G  R   E   ++Y+I +  G  C+G+  G   G
Sbjct: 391 YNWTSPSGKDADV--AVPKMAVHFA-GAAR--LEPPGKSYVIDAAPGVKCIGLQEGPWPG 445

Query: 367 LQDLNVIGGI 376
              ++VIG I
Sbjct: 446 ---ISVIGNI 452


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 63/223 (28%), Positives = 89/223 (39%), Gaps = 27/223 (12%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y V + IG P   +   +DT SDL W QC  PC  C     P++ P    +   +PC 
Sbjct: 87  GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145

Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C  L  H  GH   +D   C Y   Y+   ++ G L  D        G+     +A 
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFG 219
           GC  +   GA      G++GLG+G  S+VSQL  ++       +CL        G L  G
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRF-----AYCLPPPASRIPGKLVLG 253

Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL 257
              D   +++  +   M  D  Y  YY   +  L  G     L
Sbjct: 254 ADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSL 296


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 114/272 (41%), Gaps = 38/272 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-------LYRPSND---- 99
           Y +++ +G PA    + +DTGSD++W+QC+ PC     AP P       L+ P+      
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCE-PC----PAPSPCHAHAGALFDPAASSTYA 162

Query: 100 LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
              C    CA L   G  N C+  ++C Y ++Y DG ++ G    D    + ++  R   
Sbjct: 163 AFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVR--- 219

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
               GC + ++        DG++GLG    S VSQ  ++        +CL       GFL
Sbjct: 220 GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAAR--YGKSFFYCLPATPASSGFL 277

Query: 217 FF----GDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
                       +SR   T M  S     YY   + ++  GG+  GL   P VF      
Sbjct: 278 TLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS--PSVFAAGSLV 335

Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
           DSG+  T L    Y  L+S  +  ++  +  E
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARAE 367


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 69/162 (42%), Gaps = 16/162 (9%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G    TG Y VT   G PA+   L +DTGSDLTW+QC  PC  C      ++ P    S 
Sbjct: 129 GTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCK-PCADCYSQVDAIFEPKQSSSY 187

Query: 99  DLVPCEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
             +PC    C  L      +   P     C YE+ Y DG SS G   ++       + Q 
Sbjct: 188 KTLPCLSATCTELIT--SESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDSFQ- 244

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
                A GCG+       +    G+LGLG+   S  SQ  S+
Sbjct: 245 ---NFAFGCGHTNT--GLFKGSSGLLGLGQNSLSFPSQSKSK 281


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 110/264 (41%), Gaps = 33/264 (12%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
           + G    +G Y   + IG+P+ P ++ LDTGSD+ W+QC APC  C     P++ P++  
Sbjct: 134 ISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQC-APCADCYHQADPIFEPASST 192

Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
               + C+   C SL      N      C YE+ Y DG  ++G  V +      T G   
Sbjct: 193 SYSPLSCDTKQCQSLDVSECRN----NTCLYEVSYGDGSYTVGDFVTETI----TLGSAS 244

Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGG 213
              +A+GCG+N      +    G+LGLG GK S  SQ+++         +CL        
Sbjct: 245 VDNVAIGCGHNN--EGLFIGAAGLLGLGGGKLSFPSQINASSF-----SYCLVDRDSDSA 297

Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----------NLPVV 263
             L F   L   +       + +   +Y  G+  L  GGE   +           N  ++
Sbjct: 298 STLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGII 357

Query: 264 FDSGSSYTYLNRVTYQTLTSIMKK 287
            DSG++ T L    Y  L     K
Sbjct: 358 IDSGTAVTRLQTAAYNALRDAFVK 381


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 126/312 (40%), Gaps = 41/312 (13%)

Query: 68  LDTGSDLTWLQC----DAPCVRCVEAPH-PLYRPSNDLVPCEDPICASLHAPGHHNCEDP 122
           LD+ SD+ W+QC      PC   V++ + P   PS+    C  P C +L  P  + C + 
Sbjct: 163 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTAL-GPYANGCAN- 220

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILG 182
            QC Y + Y DG S+ G  + D    +  N          GC + +  G+      GI+ 
Sbjct: 221 NQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVS---GFKFGCSHAEQ-GSFDARAAGIMA 276

Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS--SDYT 238
           LG G  S++SQ  S+    N   +C+  +    GF   G     SSR V T M       
Sbjct: 277 LGGGPESLLSQTASR--YGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAA 334

Query: 239 KYYSPGVAELFFGGETTGLKNLPVVFDSGS------SYTYLNRVTYQTLTSIMKKELSAK 292
            +Y   +  +  GG+  G+   P VF +GS      + T L    YQ L S  +  ++  
Sbjct: 335 TFYGVLLRTITVGGQRLGVA--PAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTM- 391

Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
             + AP    L  C+     F  V +++     ++L F       +  L P   L     
Sbjct: 392 -YRSAPPKGYLDTCYD----FTGVVNIR--LPKISLVF---DRNAVLPLDPSGILF---- 437

Query: 353 GNVCLGILNGAE 364
            N CL   + A+
Sbjct: 438 -NDCLAFTSNAD 448


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 137/329 (41%), Gaps = 38/329 (11%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
           V + IG P     L +DT SDL WLQC  PC+ C     P++ PS       +    S +
Sbjct: 87  VNISIGSPPVTQLLHMDTASDLLWLQC-RPCINCYAQSLPIFDPSRSYTHRNESCRTSQY 145

Query: 113 A-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL---ALGCGYNQ 168
           + P          C+Y + Y DG  S G+L K+   FN    +  +  L     GCG++ 
Sbjct: 146 SMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDN 205

Query: 169 VPGASYHPL--DGILGLGKGKSSIVSQLHSQ------KLIRNVVGH-CLSGGGGGFLFFG 219
                  PL   GILGLG G+ S+V +  ++       L      H  L  G  G    G
Sbjct: 206 YG----EPLVGTGILGLGYGEFSLVHRFGTKFSYCFGSLDDPSYPHNVLVLGDDGANILG 261

Query: 220 D----DLYDS-SRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
           D    ++Y+    V   ++S D      P    +F     TGL     + D+G+S T L 
Sbjct: 262 DTTPLEIYNGFYYVTIEAISVD--GIILPIDPWVFNRNHQTGLGG--TIIDTGNSLTSLV 317

Query: 275 RVTYQTLTSIMKKELSAK-SLKEAPEDETLPL-CWKGRRPFKNVHDVKKCFRTLALSFTD 332
              Y+ L + ++     + +  +  +D+   + C+ G    +++  V+  F  +   F+D
Sbjct: 318 EEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLE-RDL--VESGFPIVTFHFSD 374

Query: 333 GKTRTL------FELTPEAYLIISNKGNV 355
           G   +L       +L+P  + +    GN+
Sbjct: 375 GAELSLDVKSVFMKLSPNVFCLAVTPGNM 403


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 87/358 (24%), Positives = 142/358 (39%), Gaps = 62/358 (17%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
           +++ IG P +   + LDTGS L+W+QC     +    P   + P    S   +PC  P+C
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131

Query: 109 ASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
                P      +C+    C Y   YADG  + G LVK+   F+ T    + P L LGC 
Sbjct: 132 KP-RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITPPLILGCA 187

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-------GFLFF 218
                        GILG+ +G+ S VSQ    K       +C+            G  + 
Sbjct: 188 TESSDDR------GILGMNRGRLSFVSQAKISKF-----SYCIPPKSNRPGFTPTGSFYL 236

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG----GETTGLKNLPV------------ 262
           GD+  +S    + S+ +       P +  L +     G   GLK L +            
Sbjct: 237 GDNP-NSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 295

Query: 263 ---VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              + DSGS +T+L    Y  + + +   +  +  K      T  +C+ G     NV  +
Sbjct: 296 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG-----NVAMI 350

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
            +    L   FT G    +    P+  ++++  G + C+GI   + +G    N+IG +
Sbjct: 351 PRLIGDLVFVFTRG----VEIFVPKERVLVNVGGGIHCVGIGRSSMLGAAS-NIIGNV 403


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 130/325 (40%), Gaps = 49/325 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICAS 110
           Y + + +G P      ++DTGSDL W QC  PC  C     P++ PSN     E      
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPSNSSTFKE------ 113

Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQV 169
                   C   + C Y++ YAD   S G L  +    + T+G+  + P   +GCG+N  
Sbjct: 114 ------KRCNGNS-CHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS- 165

Query: 170 PGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSR 227
             + + P   G++GL  G SS+++Q+  +     ++ +C +  G   + FG + +     
Sbjct: 166 --SWFKPTFSGMVGLSWGPSSLITQMGGEY--PGLMSYCFASQGTSKINFGTNAIVAGDG 221

Query: 228 VVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------VVFDSGSSYTYLNR 275
           VV T+M     K   PG+  L     + G  ++             ++ DSG++ TY   
Sbjct: 222 VVSTTMFLTTAK---PGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF-P 277

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
           V+Y  L                P    + LC+          D    F  + + F+ G  
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDM-LCYYT--------DTIDIFPVITMHFSGGAD 328

Query: 336 RTLFELTPEAYLIISNKGNVCLGIL 360
             L +     Y+    +G  CL I+
Sbjct: 329 LVLDKY--NMYIETITRGTFCLAII 351


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 73/310 (23%), Positives = 125/310 (40%), Gaps = 56/310 (18%)

Query: 41  VHGNVYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAP------ 90
           V   +YP  Y  Y  ++ +G P +P  + LDTGS L+W+ C +   C  C  +P      
Sbjct: 79  VRTALYPHSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAM 138

Query: 91  ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQ------CDYELEYADGGSSLGVL 141
              HP    S+ LV C +P C  +H+     C           C   L     GS+ G+L
Sbjct: 139 AVFHPKNSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTSGLL 198

Query: 142 VKDAFAFNYTNGQRLNP---RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQK 198
           + D    + ++           A+GC    V    + P  G+ G G+G  S+ SQL   K
Sbjct: 199 ISDTLRLSPSSSSSAPAPFRNFAIGCSIVSV----HQPPSGLAGFGRGAPSVPSQLKVPK 254

Query: 199 LIRNVVGHCL-------SGGGGGFLFFGDDLYDSSRVVWT----------SMSSDYTKYY 241
                  +CL       +    G L  GD +  + +   T          +    Y+ YY
Sbjct: 255 F-----SYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYY 309

Query: 242 SPGVAELFFGGETTGLKN---LP-----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
              +  +  GG+   L +   +P      + DSG+++TYL+   ++ + + M+  +  + 
Sbjct: 310 YLALTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRY 369

Query: 294 LKEAPEDETL 303
            +  P ++ L
Sbjct: 370 NRSRPVEDAL 379


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 82/188 (43%), Gaps = 17/188 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           TG Y V   +G PA+P+ L  DTGSDLTW++C        +AP  ++R +       + C
Sbjct: 109 TGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIAC 168

Query: 104 EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT-------NGQR 155
               C S       NC  PA  C Y+  Y DG ++ GV+  D+     +        G+R
Sbjct: 169 SSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRR 228

Query: 156 LNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGG 211
              + + LGC      G S+   DG+L LG    S  S+  ++   +    +V H     
Sbjct: 229 AKLQGVVLGC-TASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 287

Query: 212 GGGFLFFG 219
              +L FG
Sbjct: 288 ATSYLTFG 295


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 60/198 (30%), Positives = 90/198 (45%), Gaps = 22/198 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
           Y VTM +G  ++   + +DT SDLTW+QC+ PC+ C     P+++P    S   V C   
Sbjct: 65  YIVTMGLG--SKNMTVIIDTRSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 107 ICASLH----APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
            C SL       G     +P+ C+Y + Y DG  + G L  +A +F    G         
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSF----GGVSVSDFVF 177

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFG 219
           GCG N      +  + G++GLG+   S+VSQ ++      V  +CL     G  G L  G
Sbjct: 178 GCGRNNK--GLFGGVSGLMGLGRSYLSLVSQTNA--TFGGVFSYCLPTTEAGSSGSLVMG 233

Query: 220 DDLYDSSRVVWTSMSSDY 237
           ++    S+    S  S Y
Sbjct: 234 NEFSQISQKKKNSYGSRY 251


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y   + +G+PAR  ++ LDTGSD+TWLQC  PC  C     P+Y PS       V C
Sbjct: 160 SGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQ-PCADCYAQSDPVYDPSVSTSYATVGC 218

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           + P C  L A    N      C YE+ Y DG  ++G    +      +        +A+G
Sbjct: 219 DSPRCRDLDAAACRNST--GSCLYEVAYGDGSYTVGDFATETLTLGDSAPVS---NVAIG 273

Query: 164 CGYN 167
           CG++
Sbjct: 274 CGHD 277


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 85/200 (42%), Gaps = 30/200 (15%)

Query: 58  GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV---------PCEDPIC 108
           G PA    + +DTGSDLTW+QC  PC  C     PL+ P+              C D + 
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 161

Query: 109 ASLHAPGHHNCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           A+   PG          +C Y L Y DG  S GVL  D  A     G  L      GCG 
Sbjct: 162 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---GGASLG-GFVFGCGL 217

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFF--GD 220
           +      +    G++GLG+ + S+VSQ  S+     V  +CL    SG   G L    GD
Sbjct: 218 SNR--GLFGGTAGLMGLGRTELSLVSQTASR--YGGVFSYCLPAATSGDASGSLSLGGGD 273

Query: 221 DLYDSSR----VVWTSMSSD 236
           D   S R    V +T M +D
Sbjct: 274 DAASSYRNTTPVAYTRMIAD 293


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 9/124 (7%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y + + +G PA   ++ LDTGSD+ WLQC +PC  C     P++ P+       VPC
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQC-SPCKVCYNQSDPVFNPAKSKTFATVPC 191

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  L             C Y++ Y DG  ++G    +   F   +G R++  +ALG
Sbjct: 192 GSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF---HGARVD-HVALG 247

Query: 164 CGYN 167
           CG++
Sbjct: 248 CGHD 251


>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 242

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 61/240 (25%), Positives = 106/240 (44%), Gaps = 19/240 (7%)

Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
           SS GVL +D  +F   +  +   R   GC  ++         DGI+GLG+G+ SI+ QL 
Sbjct: 3   SSSGVLGEDIVSFGRESELKAQ-RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLV 61

Query: 196 SQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
            + +I +    C  G   GGG  +  G  +   S +V++      + YY+  + E+   G
Sbjct: 62  EKGVINDSFSLCYGGMDIGGGAMVLGG--VPTPSDMVFSRSDPLRSPYYNIELKEIHVAG 119

Query: 253 ETTGLKNL------PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
           +   + +         V DSG++Y YL    +      +  ++ +      P+     +C
Sbjct: 120 KALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDIC 179

Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAE 364
           + G R  +NV  + + F  + + F +G+      LTPE YL   +K  G  CLG+    +
Sbjct: 180 FAGAR--RNVSKLHEVFPDVDMVFGNGQK---LSLTPENYLFRHSKVDGAYCLGVFQNGK 234


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 74/161 (45%), Gaps = 15/161 (9%)

Query: 15  VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
           +R   ++ ++  +SL +  G        G  + +G Y   + +G P+    L +DTGSDL
Sbjct: 50  LRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDL 109

Query: 75  TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ----CD 126
            WLQC +PC RC      ++ P        VPC  P C +L  PG   C+        C 
Sbjct: 110 VWLQC-SPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPG---CDSGGAAGGGCR 165

Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
           Y + Y DG SS G L  D  AF   N   +N  + LGCG +
Sbjct: 166 YMVAYGDGSSSTGDLATDKLAF--ANDTYVN-NVTLGCGRD 203


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 130/325 (40%), Gaps = 49/325 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICAS 110
           Y + + +G P      ++DTGSDL W QC  PC  C     P++ PSN     E      
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPSNSSTFKE------ 113

Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQV 169
                   C   + C Y++ YAD   S G L  +    + T+G+  + P   +GCG+N  
Sbjct: 114 ------KRCNGNS-CHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS- 165

Query: 170 PGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSR 227
             + + P   G++GL  G SS+++Q+  +     ++ +C +  G   + FG + +     
Sbjct: 166 --SWFKPTFSGMVGLSWGPSSLITQMGGEY--PGLMSYCFASQGTSKINFGTNAIVAGDG 221

Query: 228 VVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------VVFDSGSSYTYLNR 275
           VV T+M     K   PG+  L     + G  ++             ++ DSG++ TY   
Sbjct: 222 VVSTTMFLTTAK---PGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF-P 277

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
           V+Y  L                P    + LC+          D    F  + + F+ G  
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDM-LCYYT--------DTIDIFPVITMHFSGGAD 328

Query: 336 RTLFELTPEAYLIISNKGNVCLGIL 360
             L +     Y+    +G  CL I+
Sbjct: 329 LVLDKY--NMYIETITRGTFCLAII 351


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 93/353 (26%), Positives = 141/353 (39%), Gaps = 66/353 (18%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G Y + +++G P R + L +DTGSDLTWLQC  PC  C +   P++ PS      ++PC 
Sbjct: 85  GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC-KPCKACFDQSGPVFDPSQSTSFKIIPCN 143

Query: 105 DPICASLHAPGHHNCED------PAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLN 157
              C  +    H  C D      P  C Y   Y D   + G L  ++ + + ++    L 
Sbjct: 144 AAACDLV---VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLE 200

Query: 158 PR-LALGCGYNQVPGASYHPLDGILGL------GKGKSSIVSQLHSQKLIRNV----VGH 206
            R + +GCG++             LG        + +SS + Q  S  L+       V  
Sbjct: 201 IRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSS 260

Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---- 262
            +S G G  L    D    +  V T+ S + T YY      L   G     + LP+    
Sbjct: 261 AISFGAGFALSRHFDQMKFTPFVRTNNSVE-TFYY------LGIQGIKIDQELLPIPAER 313

Query: 263 -----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--G 309
                      + DSG++ TYLNR  Y+ + S     L+  S   A   + L +C+   G
Sbjct: 314 FAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAF---LARISYPRADPFDILGICYNATG 370

Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN--KGNVCLGIL 360
           R            F  L++ F +G      +L  E Y I  +  +   CL IL
Sbjct: 371 RAAVP--------FPALSIVFQNGAE---LDLPQENYFIQPDPQEAKHCLAIL 412


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 9/129 (6%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DL 100
           V   G Y +   +G P       +DTGSD+ WLQC+ PC  C +   P++ PS       
Sbjct: 85  VASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE-PCEDCYKQTTPIFDPSKSKTYKT 143

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PR 159
           +PC    C SL    +  C     C+Y ++Y DG  S G L  +      T+G  ++ P+
Sbjct: 144 LPCSSNTCESLR---NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPK 200

Query: 160 LALGCGYNQ 168
             +GCG+N 
Sbjct: 201 TVIGCGHNN 209


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 118/280 (42%), Gaps = 27/280 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y + + IG P    +   DTGSDL W QC  PC+ C +  +P++ PS       V CE
Sbjct: 89  GEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCE 147

Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLAL 162
              C  L      +C  P + CD+   Y DG  + GV+  +    N  +GQ  +   +  
Sbjct: 148 SQQCRLLDT---VSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVF 204

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS-----QKLIRNVVGHCLSGGGGGFLF 217
           GCG+N     + + + G+ G G    S+ SQ+ S     +K  + +V           + 
Sbjct: 205 GCGHNNSGTFNENEM-GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263

Query: 218 FGDDLYDS-SRVVWTSM-SSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSGS 268
           FG +   S S VV T + + D   YY       S G     F   +       V  D+G+
Sbjct: 264 FGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 323

Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
             T L R  Y  L   +K+ +  + +++   D    LC++
Sbjct: 324 PPTLLPRDFYNRLVQGVKEAIPMEPVQDP--DLQPQLCYR 361


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 127/312 (40%), Gaps = 42/312 (13%)

Query: 65  FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE 120
           FL +DTGSD+TW+QCD PC +C +    L++P+       +PC   +C  L +   H+C 
Sbjct: 2   FLLIDTGSDITWIQCD-PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQS-FSHSCL 59

Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHPLDG 179
           + + C+Y + Y D  ++ G    +       +   ++ P  A GCG+     A+    +G
Sbjct: 60  N-SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGH-----ANKGLFNG 113

Query: 180 ILGL-GKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL---YDSSRVVWT 231
             GL G GKSSI     +      V  +CL    S    G L FG+     YD       
Sbjct: 114 AAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLV 173

Query: 232 SMSSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKK 287
             SS  ++Y+      +   G   G + LP    V+ DSG+  +   +  Y+ L     +
Sbjct: 174 DSSSGPSQYF------VSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQ 227

Query: 288 ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
            L    L+ A        C++       V D+      + L F D        L+P   L
Sbjct: 228 ILPG--LQTAVSVAPFDTCFR----VSTVDDIN--IPLITLHFRDDAE---LRLSPVHIL 276

Query: 348 IISNKGNVCLGI 359
              + G +C   
Sbjct: 277 YPVDDGVMCFAF 288


>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 320

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/255 (29%), Positives = 107/255 (41%), Gaps = 38/255 (14%)

Query: 131 YADGGSSLGVLVKDAFAFNYTNGQR----LNPRLALGCGYNQVP--GASYHPLDGILGLG 184
           Y DG S+ G LVKD    +   G R     N  +  GCG  Q    G S   +DGI+G G
Sbjct: 2   YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61

Query: 185 KGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
           +  SS +SQL SQ  ++    HCL    GG +F   ++  S +V  T M S  + +YS  
Sbjct: 62  QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVV-SPKVKTTPMLSK-SAHYSVN 119

Query: 245 VAELFFGGETTGLK--------NLPVVFDSGSSYTYLNRVTYQ-TLTSIMKK--ELSAKS 293
           +  +  G     L         +  V+ DSG++  YL    Y   L  I+    EL+  +
Sbjct: 120 LNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHT 179

Query: 294 LKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKG 353
           ++E+               F   H   K  R   ++F   K+ +L  + P  YL    + 
Sbjct: 180 VQES---------------FTCFHYTDKLDRFPTVTFQFDKSVSL-AVYPREYLFQVRED 223

Query: 354 NVCLGILNGAEVGLQ 368
             C G  NG   GLQ
Sbjct: 224 TWCFGWQNG---GLQ 235


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 152/343 (44%), Gaps = 47/343 (13%)

Query: 51  YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEA-PHP--LYRPSND----LV 101
           Y V++ IG P RP  + L  DTGSDLTW+ C+  C  C +  PHP  ++R ++      +
Sbjct: 119 YFVSIRIGTP-RPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTI 177

Query: 102 PCEDPICASLHAPGHHN---CEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
           PC    C  +    + +   C +P A C ++  Y +G  ++GV   +       + +++ 
Sbjct: 178 PCSSDDC-KIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIR 236

Query: 158 P-RLALGC--GYNQVPGASYHPLDGILGLGKGKSSI---VSQLHSQKLIRNVVGHCLSGG 211
              + +GC   +N+  G      DG++GLG  K S+   ++++   K    +V H  S  
Sbjct: 237 LFDVLIGCTESFNETNGFP----DGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSN 292

Query: 212 GGGFLFFGD-DLYDSSRVVWTSMSSDYTKYYSP-GVAELFFGGE----TTGLKNLP---- 261
              FL FGD       ++  T +   Y   + P  V+ +  GG     ++ + N+     
Sbjct: 293 HKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVGG 352

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED--ETLPLCWKGRRPFKNVHDV 319
           ++ DSG+S T L    Y  +   + K +  K  K  P +  E    C      F++    
Sbjct: 353 MIVDSGTSLTMLAGEAYDKVVDAL-KPIFDKHKKVVPIELPELNNFC------FEDKGFD 405

Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
           +     L + F DG    +F+   ++Y+I   +G  CLGI+  
Sbjct: 406 RAAVPRLLIHFADG---AIFKPPVKSYIIDVAEGIKCLGIIKA 445


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 121/281 (43%), Gaps = 32/281 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   L++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P  + GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----------SGGGGGFL 216
           +      +  +DG+LG+G G  S++ Q        +   +CL          S   G F 
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172

Query: 217 FFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSS 269
             G      + V +T M +    T+ +   +  +   GE  GL         VVFDSGS 
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSE 232

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            +Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 LSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 270


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 144/354 (40%), Gaps = 74/354 (20%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSND----LVP 102
           G Y +T+ IG P   Y    DTGSDL W QC APC   +C   P PLY P++     ++P
Sbjct: 90  GEYLMTLSIGTPPLSYPAIADTGSDLIWTQC-APCSGDQCFAQPAPLYNPASSTTFGVLP 148

Query: 103 CEDPI--CASLHA-----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-Q 154
           C   +  CA + A     PG         C Y   Y  G ++ GV   + F F      Q
Sbjct: 149 CNSSLSMCAGVLAGKAPPPG-------CACMYNQTYGTGWTA-GVQGSETFTFGSAAADQ 200

Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
              P +A GC  +    + ++   G++GLG+G  S+VSQL + +       +CL+     
Sbjct: 201 ARVPGIAFGC--SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRF-----SYCLTP---- 249

Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYS-PGVAE-----------LFFGGETTGLKNLPV 262
              F D    S+ ++  S + + T   S P VA            L   G + G K L +
Sbjct: 250 ---FQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSI 306

Query: 263 ---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
                          + DSG++ T L    YQ + + ++  ++  ++ +  +   L LC+
Sbjct: 307 SPDAFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAI-DGSDSTGLDLCY 365

Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
               P            ++ L F DG    L    P    +IS  G  CL + N
Sbjct: 366 ALPTP----TSAPPAMPSMTLHF-DGADMVL----PADSYMISGSGVWCLAMRN 410


>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
          Length = 378

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/299 (26%), Positives = 131/299 (43%), Gaps = 36/299 (12%)

Query: 93  LYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF 146
           +YRP+       +PC   +C S+  PG   C +P Q C Y ++Y ++  +S G+L++D  
Sbjct: 8   IYRPAESTTSRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62

Query: 147 AFNYTNGQ-RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
             NY      +N  + +GCG  Q    + G +    DG+LGLG    S+ S L    L++
Sbjct: 63  HLNYREDHVPVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQ 119

Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP 261
           N    C      G +FFGD    S +           + Y+  V +   G +     +  
Sbjct: 120 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 179

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
            + DSG+S+T L    Y+  T    K+++A  +    ED T   C+    P + + DV  
Sbjct: 180 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP- 234

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
              T+ L+F   K  +L  + P   L  ++K       CL +L   E +G+   N + G
Sbjct: 235 ---TITLTFAADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 286


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 118/279 (42%), Gaps = 30/279 (10%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y  ++ +G P++   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
           +      +  +DG+LG+G G  S++ Q        +   +CL        FF     Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172

Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
              V T     YTK  +     ELFF         GE  GL         VVFDSGS  +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
           Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268


>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
          Length = 335

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 105/246 (42%), Gaps = 28/246 (11%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR------------PSNDLVPCE 104
           +G P   + + LDTGSDL W+ CD  C+ C     P YR             ++  VPC 
Sbjct: 94  LGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSSTSRKVPCS 151

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR---LNPRL 160
             +C    A    +   P    Y ++Y +D  SS GVLV+D        G++   +   +
Sbjct: 152 SNLCDEQSACRSASSSCP----YSIQYLSDNTSSTGVLVEDVLYLVTEYGRQPKIVTAPI 207

Query: 161 ALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKL-IRNVVGHCLSGGGGGFLF 217
             GCG  Q      +  P +G+LGLG    S+ S L SQ +   N    C +  G G + 
Sbjct: 208 TFGCGRTQTGSFLGTAAP-NGLLGLGMDTISVPSLLASQGVAAANSFSMCFAQDGHGRIN 266

Query: 218 FGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVT 277
           FGD      +    +M      YY+  +     G ++   K    + DSG+S+T L+   
Sbjct: 267 FGDTGSSDQQETPLNMYKQ-NPYYNISITGATVGSKSIHTK-FNAIVDSGTSFTALSDPM 324

Query: 278 YQTLTS 283
           Y  +TS
Sbjct: 325 YTQITS 330


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 136/346 (39%), Gaps = 52/346 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPIC 108
           Y V   IG PA+P  + LDT +D  W+ C   CV C  +    P    S+  + CE P C
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSG-CVGCSSSVLFDPSKSSSSRTLQCEAPQC 146

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
                P   +C     C + + Y  GGS++   L +D           + P    GC  N
Sbjct: 147 KQAPNP---SCTVSKSCGFNMTY--GGSTIEAYLTQDTLTL----ASDVIPNYTFGC-IN 196

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLY 223
           +  G S  P  G++GLG+G  S++SQ  SQ L ++   +CL    S    G L  G    
Sbjct: 197 KASGTSL-PAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGPK-N 252

Query: 224 DSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPV-----VFDSGSSYT 271
              R+  T +  +       Y       V        T+ L   P      +FDSG+ YT
Sbjct: 253 QPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYT 312

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y  + +  ++ +   +       +T   C+ G   F +V      F    ++ T
Sbjct: 313 RLVEPAYVAVRNEFRRRVKNANATSLGGFDT---CYSGSVVFPSVT-----FMFAGMNVT 364

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD-LNVIGGI 376
                    L P+  LI S+ GN+    +  A V +   LNVI  +
Sbjct: 365 ---------LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASM 401


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 100/264 (37%), Gaps = 28/264 (10%)

Query: 68  LDTGSDLTWLQCDA-PCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDP 122
           LDT SD+TW+QC   P   C      LY P    S+ +  C  P C  L  P  + C + 
Sbjct: 173 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYANGCTNN 231

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY-HPLDGIL 181
            QC Y + Y DG S+ G  + D          R       GC +      S+     GI+
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPATAVR---SFQFGCSHGVQGSFSFGSSAAGIM 288

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG-GGFLFFGDDLYDSSRVVWTSMSSDYT-- 238
            LG G  S+VSQ  +      V  HC       GF   G     + R V T M  +    
Sbjct: 289 ALGGGPESLVSQ--TAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIP 346

Query: 239 -KYYSPGVAELFFGGETTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSA 291
             +Y   +  +   G+   +   P VF      DS ++ T L    YQ L    +  ++ 
Sbjct: 347 PTFYMVRLEAIAVAGQRIAVP--PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAM 404

Query: 292 KSLKEAPEDETLPLCW--KGRRPF 313
              + AP    L  C+   G R F
Sbjct: 405 --YQPAPPKGPLDTCYDMAGVRSF 426


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 121/281 (43%), Gaps = 32/281 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
           Y +++ +G P++   +++DTGS  +W+ C+  C  C   P    +  +     V C   +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58

Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           C    +  H  + E+   C + + Y DG +S G+L +D   F  ++ Q++ P    GC  
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFTFGCNM 115

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----------SGGGGGFL 216
           +      +  +DG+LG+G G+ S++ Q        +   +CL          S   G F 
Sbjct: 116 DSFGANEFGNVDGLLGMGAGQMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172

Query: 217 FFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSS 269
             G      + V +T M +    T+ +   +  +   GE  GL         VVFDSGS 
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSE 232

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            +Y+       L+  +++ L  +    A E+E+   C+  R
Sbjct: 233 LSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 270


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 66/143 (46%), Gaps = 15/143 (10%)

Query: 34  GSSLLFQVHGNVYP-----TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
           G+SL   + G V       +G Y   + IG PAR  ++ LDTGSD+TW+QC  PC  C +
Sbjct: 147 GASLAAAIQGPVVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQ-PCADCYQ 205

Query: 89  APHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKD 144
              P++ PS       V C+ P C  L      N      C YE+ Y DG  ++G    +
Sbjct: 206 QSDPVFDPSLSASYAAVSCDSPRCRDLDTAACRNAT--GACLYEVAYGDGSYTVGDFATE 263

Query: 145 AFAFNYTNGQRLNPRLALGCGYN 167
                 +        +A+GCG++
Sbjct: 264 TLTLGDSTPVT---NVAIGCGHD 283


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 136/346 (39%), Gaps = 52/346 (15%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPIC 108
           Y V   IG PA+P  + LDT +D  W+ C   CV C  +    P    S+  + CE P C
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSG-CVGCSSSVLFDPSKSSSSRTLQCEAPQC 146

Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
                P   +C     C + + Y  GGS++   L +D           + P    GC  N
Sbjct: 147 KQAPNP---SCTVSKSCGFNMTY--GGSTIEAYLTQDTLTL----ASDVIPNYTFGC-IN 196

Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLY 223
           +  G S  P  G++GLG+G  S++SQ  SQ L ++   +CL    S    G L  G    
Sbjct: 197 KASGTSL-PAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGPK-N 252

Query: 224 DSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLP-----VVFDSGSSYT 271
              R+  T +  +       Y       V        T+ L   P      +FDSG+ YT
Sbjct: 253 QPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYT 312

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
            L    Y  + +  ++ +   +       +T   C+ G   F +V      F    ++ T
Sbjct: 313 RLVEPAYVAVRNEFRRRVKNANATSLGGFDT---CYSGSVVFPSVT-----FMFAGMNVT 364

Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD-LNVIGGI 376
                    L P+  LI S+ GN+    +  A V +   LNVI  +
Sbjct: 365 ---------LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASM 401


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 80/165 (48%), Gaps = 16/165 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
           Y VT  +G P     +++DTGSDL+W+QC  PC     C     PL+ P+       VPC
Sbjct: 48  YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+CA L       C   AQC Y + Y DG ++ GV   D    + ++  +       G
Sbjct: 107 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 162

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
           CG+ Q     ++ +DG+LGLG+ + S+V Q  +      V  +CL
Sbjct: 163 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCL 203


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SN 98
           V G    +G Y + + IG+P    ++ LDTGSD++W+QC APC  C +   P++ P  SN
Sbjct: 139 VSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPVSSN 197

Query: 99  DLVP--CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
              P  C+ P C SL      N      C YE+ Y DG  ++G    +      T G   
Sbjct: 198 SYSPIRCDAPQCKSLDLSECRN----GTCLYEVSYGDGSYTVGEFATETV----TLGTAA 249

Query: 157 NPRLALGCGYN 167
              +A+GCG+N
Sbjct: 250 VENVAIGCGHN 260


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 100/264 (37%), Gaps = 28/264 (10%)

Query: 68  LDTGSDLTWLQCDA-PCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDP 122
           LDT SD+TW+QC   P   C      LY P    S+ +  C  P C  L  P  + C + 
Sbjct: 148 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYANGCTNN 206

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY-HPLDGIL 181
            QC Y + Y DG S+ G  + D          R       GC +      S+     GI+
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPATAVR---SFQFGCSHGVQGSFSFGSSAAGIM 263

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG-GGFLFFGDDLYDSSRVVWTSMSSDYT-- 238
            LG G  S+VSQ  +      V  HC       GF   G     + R V T M  +    
Sbjct: 264 ALGGGPESLVSQ--TAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIP 321

Query: 239 -KYYSPGVAELFFGGETTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSA 291
             +Y   +  +   G+   +   P VF      DS ++ T L    YQ L    +  ++ 
Sbjct: 322 PTFYMVRLEAIAVAGQRIAVP--PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAM 379

Query: 292 KSLKEAPEDETLPLCW--KGRRPF 313
              + AP    L  C+   G R F
Sbjct: 380 --YQPAPPKGPLDTCYDMAGVRSF 401


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 138/328 (42%), Gaps = 64/328 (19%)

Query: 68  LDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCE-DP 122
           LDTGSD+ W+QC APC RC E   P++ P    S   V C   +C  L + G   C+   
Sbjct: 3   LDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGG---CDLRR 58

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGIL 181
             C Y++ Y DG  + G  V +   F    G R+  R+ALGCG+ N+    +   L G+ 
Sbjct: 59  GACMYQVAYGDGSVTAGDFVTETLTF--AGGARV-ARVALGCGHDNEGLFVAAAGLLGLG 115

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-------FLFFGDDLYDSSRVVWTSMS 234
             G    + +S+ + +     +V    SG G          + FG     +S   +T M 
Sbjct: 116 RGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMV 175

Query: 235 SD---YTKYY------------SPGVAELFFGGETTGLKNLP------VVFDSGSSYTYL 273
            +    T YY             PGVAE       + L+  P      V+ DSG+S T L
Sbjct: 176 RNPRMETFYYVQLVGISVGGARVPGVAE-------SDLRLDPSTGRGGVIVDSGTSVTRL 228

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETL-PLCWK--GRRPFKNVHDVKKCFRTLALSF 330
            R +Y  L    +   +A  L+ +P   +L   C+   GRR  K          T+++ F
Sbjct: 229 ARASYSALRDAFRAA-AAGGLRLSPGGFSLFDTCYDLGGRRVVK--------VPTVSMHF 279

Query: 331 TDGKTRTLFELTPEAYLI-ISNKGNVCL 357
             G       L PE YLI + ++G  C 
Sbjct: 280 AGGAEAA---LPPENYLIPVDSRGTFCF 304


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/348 (26%), Positives = 138/348 (39%), Gaps = 56/348 (16%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCED 105
           ++ +T+ IG P +P  L LDTGSDL W QC     R      PLY P+        PC+ 
Sbjct: 88  HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTR-QHREKPLYDPAKSSSFAAAPCDG 146

Query: 106 PICASLHAPGHHNCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            +C +    G  N ++ +  +C Y   Y    ++ G L  + F F     +R++  L  G
Sbjct: 147 RLCET----GSFNTKNCSRNKCIYTYNYGS-ATTKGELASETFTFG--EHRRVSVSLDFG 199

Query: 164 CGY---NQVPGASYHPLDGILGLGKGKSSIVSQLHSQK--------LIRNVVGHCLSGGG 212
           CG      +PGAS     GILG+   + S+VSQL   +        L RN   H   G  
Sbjct: 200 CGKLTSGSLPGAS-----GILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSHIFFGAM 254

Query: 213 GGF-LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--NLPV------- 262
                +       ++ +V     S+Y  YY P +      G + G K  N+PV       
Sbjct: 255 ADLSKYRTTGPIQTTSLVTNPDGSNY-YYYVPLI------GISVGTKRLNVPVSSFAIGR 307

Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
                   DSG +   L  V  + L   M + +    +          LC++  R     
Sbjct: 308 DGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGA 367

Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
            +       L   F DG    L  L  ++Y++  + G +CL I +GA 
Sbjct: 368 VETAVQVPPLVYHF-DGGAAML--LRRDSYMVEVSAGRMCLVISSGAR 412


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 133/309 (43%), Gaps = 32/309 (10%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
           S S S++L  H+ S     +   + P +G + ++++IG P        DTGSDLTW QC 
Sbjct: 60  SFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQC- 118

Query: 81  APCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
            PC  C     P++ P    S   V C    C SL +  +H   D   C Y   Y D   
Sbjct: 119 LPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLES--YHCGPDLQSCSYGYSYGDRSF 176

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD-GILGLGKGKSSIVSQLH 195
           + G L  D      T G    P+  +GCG+    G ++  +  GI+GLG G  S+VSQ+ 
Sbjct: 177 TYGDLASD----QITIGSFKLPKTVIGCGHQN--GGTFGGVTSGIIGLGGGSLSLVSQMR 230

Query: 196 SQKLIRNVVGHCL-----SGGGGGFLFFGDDLYDSSR-VVWTSM--SSDYTKYY------ 241
           +   ++    +CL     +    G + FG     S R VV T +   S  T Y+      
Sbjct: 231 TIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAI 290

Query: 242 SPGVAELFFGGETTGLKNL-PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
           S G          + + N   ++ DSG++ T L R  Y  + S + + + AK + +    
Sbjct: 291 SVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDD--PS 348

Query: 301 ETLPLCWKG 309
             L LC+  
Sbjct: 349 GILELCYSA 357


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 71/160 (44%), Gaps = 15/160 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDP 106
           Y + + IG P  P+    DTGSDLTW QC  PC  C     P+Y      S   VPC   
Sbjct: 95  YLMELAIGTPPVPFVALADTGSDLTWTQCK-PCKLCFPQDTPIYDTAASASFSPVPCASA 153

Query: 107 ICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-----R 159
            C  +      NC     + C Y   Y DG  S GVL  +   F  ++     P      
Sbjct: 154 TCLPIWR-SSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGG 212

Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           +A GCG +   G SY+   G +GLG+G  S+V+QL   K 
Sbjct: 213 VAFGCGVDN-GGLSYNS-TGTVGLGRGSLSLVAQLGVGKF 250


>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
 gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
          Length = 541

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/261 (28%), Positives = 110/261 (42%), Gaps = 41/261 (15%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVP------- 102
           YY V + +G P   + + LDTGSDL W+ CD  C +C    +   +P+  L P       
Sbjct: 111 YYAV-VEVGTPNATFLVALDTGSDLFWVPCD--CKQCASIANVTGQPATALRPYSPRESS 167

Query: 103 ------CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSL-GVLVKDAFAFNYTN--- 152
                 C++ +C     P   +      C YE++Y    +S  GVLV+D           
Sbjct: 168 TSKQVTCDNALC---DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPGA 224

Query: 153 ----GQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNV 203
               G+ L   +  GCG  Q    + GA++   DG++GLG+   S+ S L S  L+  + 
Sbjct: 225 AAEAGEALQAPVVFGCGQVQTGTFLDGAAF---DGLMGLGRENVSVPSVLASSGLVASDS 281

Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-KNLPV 262
              C    G G + FGD    SS    T  +   T Y    V+      ET  +      
Sbjct: 282 FSMCFGDDGVGRINFGDS--GSSGQGETPFTGRRTLY---NVSFTAVNVETKSVAAEFAA 336

Query: 263 VFDSGSSYTYLNRVTYQTLTS 283
           V DSG+S+TYL    Y  L +
Sbjct: 337 VIDSGTSFTYLADPEYTELAT 357


>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 530

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 94/353 (26%), Positives = 153/353 (43%), Gaps = 55/353 (15%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRPS--- 97
           Y NV+  +G PA  + + LDTGS+L WL C+  + C+R ++        P  LY P+   
Sbjct: 104 YANVS--VGTPATWFLVALDTGSNLFWLPCNCGSTCIRDLKDIGLSQSRPLNLYSPNTSS 161

Query: 98  -NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
            +  + C D  C              + C Y+++Y    + + G L +D      T    
Sbjct: 162 TSSSIRCNDDRCFGSSQCSSPA----SSCPYQIQYLSKDTFTTGTLFEDVLHL-VTEDVD 216

Query: 156 LNP---RLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
           L P    + LGCG NQ     S   ++G+LGLG    S+ S L   K+  N    C    
Sbjct: 217 LKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILAKAKITANSFSMCFGNI 276

Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
               G + FGD  Y + ++    + ++ +  Y+  V E+  GG+  G++ L  +FD+G+S
Sbjct: 277 IDVIGRISFGDKGY-TDQMETPLLPTEPSPTYAVNVTEVSVGGDVVGVQ-LLALFDTGTS 334

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFR 324
           +T+L    Y  +T      ++ K     PE            PF+  +D+        F 
Sbjct: 335 FTHLLEPEYGLITKAFDDHVTDKRRPIDPE-----------IPFEFCYDLSPNSTTILFP 383

Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNV---CLGILNGAEVGLQDLNVIG 374
            +A++F  G    +F   P    I+ N+ N    CLGIL   +     +N+IG
Sbjct: 384 RVAMTFEGGS--LMFLRNP--LFIVWNEDNTAMYCLGILKSVDF---KINIIG 429


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/347 (23%), Positives = 129/347 (37%), Gaps = 73/347 (21%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP-------------HPLYRPS 97
           Y +T+ IG P +   + +DTGSDLTW+ C      C++                PL+  S
Sbjct: 11  YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70

Query: 98  NDLVPCEDPICASLHAPGHHNCEDP---AQC---------------DYELEYADGGSSLG 139
           +    C    CA +H+    N  DP   A C                +   Y +GG   G
Sbjct: 71  SFRASCASSFCAEIHS--SDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSG 128

Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
           +L +D          R  PR + GC       ++YH   GI G G+G  S+ SQL     
Sbjct: 129 ILTRDILKAR----TRDVPRFSFGCV-----TSTYHEPIGIAGFGRGLLSLPSQL---GF 176

Query: 200 IRNVVGHCL-------SGGGGGFLFFGD-----DLYDSSRVVWTSMSSDYTKYYSPGVAE 247
           +     HC        +      L  G      +L DS +      +  Y   Y  G+  
Sbjct: 177 LEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLES 236

Query: 248 LFFGGETTGLK------------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
           +  G   T  +            N  ++ DSG++YT+L    Y  L +I++  ++     
Sbjct: 237 ITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRAT 296

Query: 296 EAPEDETLPLCWKGRRPFKNV----HDVKKCFRTLALSFTDGKTRTL 338
           E        LC+K   P  N+    +DV   F ++  +F +  T  L
Sbjct: 297 ETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLL 343


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 61/131 (46%), Gaps = 12/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G P R   + LDTGSD+TW+QC+ PC  C +   P+Y P    
Sbjct: 135 VSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCE-PCSDCYQQSDPIYNPALSS 193

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           S  LV C+  +C  L   G   C     C Y++ Y DG  + G    +         Q  
Sbjct: 194 SYKLVGCQANLCQQLDVSG---CSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQ-- 248

Query: 157 NPRLALGCGYN 167
              +A+GCG++
Sbjct: 249 --NVAIGCGHD 257


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 58/125 (46%), Gaps = 8/125 (6%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G Y + + +G P        DTGSDL W QC  PC  C E   P++ P+      ++ CE
Sbjct: 93  GEYLMNISLGTPPVSMHGIADTGSDLLWRQC-KPCDSCYEQIEPIFDPAKSKTYQILSCE 151

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C++L   G   C D   C Y   Y DG  + G L  D      T G+ ++ P++  G
Sbjct: 152 GKSCSNLG--GQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFG 209

Query: 164 CGYNQ 168
           CG+N 
Sbjct: 210 CGHNN 214


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 119/302 (39%), Gaps = 69/302 (22%)

Query: 37  LLFQVHGNVYPTG-----------YYNVTM----YIGQPARPYFLDLDTGSDLTWLQCDA 81
           LLF++     P G           ++NV++     +G P +   + LDTGS+L+WL C  
Sbjct: 37  LLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAP 96

Query: 82  PCVRCVEAPHPL-YRPSNDL----VPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGG 135
                      L +RP   L    VPC+   C S   P    C+  + QC   L YADG 
Sbjct: 97  GGGGGGGGRSALSFRPRASLTFASVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGS 156

Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVS 192
           SS G L  + F    T GQ    R A GC    ++  P        G+LG+ +G  S VS
Sbjct: 157 SSDGALATEVF----TVGQGPPLRAAFGCMATAFDTSPDGVAT--AGLLGMNRGALSFVS 210

Query: 193 QLHSQKLIRNVVGHCLSG-GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
           Q  +++       +C+S     G L  G  DL          +  +YT  Y P +   +F
Sbjct: 211 QASTRRF-----SYCISDRDDAGVLLLGHSDL--------PFLPLNYTPLYQPAMPLPYF 257

Query: 251 G---------GETTGLKNLPV---------------VFDSGSSYTYLNRVTYQTLTSIMK 286
                     G   G K LP+               + DSG+ +T+L    Y  L +   
Sbjct: 258 DRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFS 317

Query: 287 KE 288
           ++
Sbjct: 318 RQ 319


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 114/268 (42%), Gaps = 31/268 (11%)

Query: 68  LDTGSDLTWLQC----DAPCVRCVEAPH-PLYRPSNDLVPCEDPICASLHAPGHHNCEDP 122
           LD+ SD+ W+QC      PC   V++ + P   P++    C  P C +L  P  + C + 
Sbjct: 33  LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTAL-GPYANGCAN- 90

Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILG 182
            QC Y + Y DG S+ G  + D    +  N          GC + +  G+      GI+ 
Sbjct: 91  NQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVS---GFKFGCSHAE-QGSFDARAAGIMA 146

Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS--SDYT 238
           LG G  S++SQ  S+    N   +C+  +    GF   G     SSR V T M       
Sbjct: 147 LGGGPESLLSQTASR--YGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAA 204

Query: 239 KYYSPGVAELFFGGETTGLKNLPVVFDSGS---SYTYLNRV---TYQTLTSIMKKELSAK 292
            +Y   +  +  GG+  G+   P VF +GS   S T + R+    YQ L +  +  ++  
Sbjct: 205 TFYGVLLRTITVGGQRLGVA--PAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTM- 261

Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVK 320
             + AP    L  C+     F  V +++
Sbjct: 262 -YRSAPPKGYLDTCYD----FTGVVNIR 284


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/162 (34%), Positives = 74/162 (45%), Gaps = 19/162 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VPCED 105
           Y VT+ IG P     L  DTGSDLTW QC+ PC+  C     P + PS+      V C  
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192

Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
           P+C +  +    NC       Y + Y DG  ++G L K+ F    TN   L+  +  GCG
Sbjct: 193 PMCGNPESCSASNCL------YGIGYGDGSVTVGFLAKEKFTL--TNSDVLDD-IYFGCG 243

Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
            N      +    GILGLG GK S    L +     N+  +C
Sbjct: 244 ENN--KGVFIGSAGILGLGPGKFSF--PLQTTTTYNNIFSYC 281


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 67/139 (48%), Gaps = 7/139 (5%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHA-PG 115
           IG P  P  L +DTGSDLTW+ C  PC +C     P + PS           ++ HA P 
Sbjct: 84  IGNPPVPQLLLIDTGSDLTWIHC-LPC-KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQ 141

Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASY 174
               E    C Y L Y D  ++ G+L ++   F  ++   ++ + +  GCG +    + +
Sbjct: 142 IFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN---SGF 198

Query: 175 HPLDGILGLGKGKSSIVSQ 193
               G+LGLG G  SIV++
Sbjct: 199 TKYSGVLGLGPGTFSIVTR 217


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 89/329 (27%), Positives = 130/329 (39%), Gaps = 46/329 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPIC-- 108
           Y + + +  P        DTGS L WL+C  P      A H     S   +PC+   C  
Sbjct: 76  YLMALDVSTPPVRMLALADTGSSLVWLKCKLP------AAHTPASSSYARLPCDAFACKA 129

Query: 109 ----ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
               AS  A G  N      C Y   +ADG  + G +  DAF F+         RL  GC
Sbjct: 130 LGDAASCRATGSGN----NICVYRYAFADGSCTAGPVTVDAFTFST--------RLDFGC 177

Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
              +  G S  P DG++GL  G  S+VSQL ++    +   +CL     S      L FG
Sbjct: 178 A-TRTEGLSV-PDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSLNFG 235

Query: 220 DDLYDSSR--VVWTSMSSDYTK-YYSPGVAELFFGGETTGLK--NLPVVFDSGSSYTYLN 274
                SS      T + +   K +Y+  +  +   G+   L+     ++ DSG+  TYL 
Sbjct: 236 SHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTTKLIVDSGTMLTYLP 295

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETL-PLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
           +     L + +    +A  L      ETL  +C+  RR  +   DV K    + L    G
Sbjct: 296 KAVLDPLVAALT---AAIKLPRVKSPETLYAVCYDVRR--RAPEDVGKSIPDVTLVLGGG 350

Query: 334 KTRTLFELTPEAYLIISNKG-NVCLGILN 361
                  L      ++ NKG  VCL ++ 
Sbjct: 351 GE---VRLPWGNTFVVENKGTTVCLALVE 376


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/150 (34%), Positives = 75/150 (50%), Gaps = 14/150 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
           Y VT  +G P     +++DTGSDL+W+QC  PC     C     PL+ P+       VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+CA L       C   AQC Y + Y DG ++ GV   D    + ++  +       G
Sbjct: 199 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 254

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
           CG+ Q     ++ +DG+LGLG+ + S+V Q
Sbjct: 255 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ 282


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 62/123 (50%), Gaps = 13/123 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G + + + IG+P+  Y   LDTGSDLTW QC  PC  C + P P+Y PS       V C+
Sbjct: 19  GEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-IPCSDCYKQPTPIYDPSLSSTYGTVSCK 77

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L A    +    A C+Y   Y D  S+ G+L  + F  +     +  P +A GC
Sbjct: 78  SSLCLALPASACIS----ATCEYLYTYGDYSSTQGILSYETFTLS----SQSIPHIAFGC 129

Query: 165 GYN 167
           G +
Sbjct: 130 GQD 132


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 62/123 (50%), Gaps = 13/123 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G + + + IG+P+  Y   LDTGSDLTW QC  PC  C + P P+Y PS       V C+
Sbjct: 19  GEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-MPCSDCYKQPTPIYDPSLSSTYGTVSCK 77

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
             +C +L A    +    A C+Y   Y D  S+ G+L  + F  +     +  P +A GC
Sbjct: 78  SSLCLALPASACIS----ATCEYLYTYGDYSSTQGILSYETFTLS----SQSIPHIAFGC 129

Query: 165 GYN 167
           G +
Sbjct: 130 GQD 132


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 72/241 (29%), Positives = 103/241 (42%), Gaps = 25/241 (10%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y++T  IG P +      DTGSDL W +C A C RCV    P Y P    S   +PC 
Sbjct: 80  GAYDMTFSIGTPPQELSALADTGSDLIWAKCGA-CTRCVPQGSPSYYPNKSSSFSKLPCS 138

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGS----SLGVLVKDAFAFNYTNGQRLNPRL 160
             +C+ L  P        A+CDY+  Y         + G L  + F    T G    P +
Sbjct: 139 GSLCSDL--PSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETF----TLGSDAVPGI 192

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF--LFF 218
             GC    +    Y    G++GLG+G  S+VSQL+          +CL+        L F
Sbjct: 193 GFGC--TTMSEGGYGSGSGLVGLGRGPLSLVSQLN-----VGAFSYCLTSDAAKTSPLLF 245

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT-GLKNLPVVFDSGSSYTYLNRVT 277
           G      + V  T +    T YY+  +  +  G  TT G  +  ++FDSG++  +L    
Sbjct: 246 GSGALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPA 305

Query: 278 Y 278
           Y
Sbjct: 306 Y 306


>gi|156099262|ref|XP_001615633.1| aspartic protease PM5 [Plasmodium vivax Sal-1]
 gi|148804507|gb|EDL45906.1| aspartic protease PM5 [Plasmodium vivax]
          Length = 536

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/412 (22%), Positives = 157/412 (38%), Gaps = 84/412 (20%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
           LC  +V+  S S+   S  L         ++++G++    YY + + IG P +   L LD
Sbjct: 27  LCALSVQGRSESTEGHSKDLLYK------YKLYGDIDEYAYYFLDIDIGTPEQRISLILD 80

Query: 70  TGSDLTWLQCDAPCVRC---VEAPHPLYR-PSNDLVPCEDPICASLHAPGHHNCEDPAQC 125
           TGS      C A C  C   +E P  L    ++ ++ CE+  C     P   NC    +C
Sbjct: 81  TGSSSLSFPC-AGCKNCGVHMENPFNLNNSKTSSILYCENEEC-----PFKLNCVK-GKC 133

Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG- 184
           +Y   Y +G    G    D  +    N +R+  R  +GC  ++     Y    G+LG+  
Sbjct: 134 EYMQSYCEGSQISGFYFSDVVSVVSYNNERVTFRKLMGCHMHEESLFLYQQATGVLGMSL 193

Query: 185 ---KGKSSIVSQLHSQK-LIRNVVGHCLSGGGGGFLFFGDD------------------- 221
              +G  + V+ L      ++ V   C+S  GG  +  G D                   
Sbjct: 194 SKPQGIPTFVNLLFDNAPQLKQVFTICISENGGELIAGGYDPAYIVRRGGSKSVSGQGSG 253

Query: 222 ----------------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
                           L ++ +VVW +++  Y  Y      ++F     +  K L ++ D
Sbjct: 254 PVSESLSESGEDPQVALREAEKVVWENVTRKYYYYIKVRGLDMFGTNMMSSSKGLEMLVD 313

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SGS++T++    Y  L                       LC +      N +DV K  + 
Sbjct: 314 SGSTFTHIPEDLYNKLNYFFD-----------------ILCIQD---MNNAYDVNKRLKM 353

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-----GLQDLNV 372
              SF +   +  F+   ++   I  K N+C+ I++G +      GL DL V
Sbjct: 354 TNESFNNPLVQ--FDDFRKSLKSIIAKENMCVKIVDGVQCWKYLEGLPDLFV 403


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 54/356 (15%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
           +G Y + + +G PA   ++ LDTGSD+ WLQC +PC  C      ++ P        VPC
Sbjct: 135 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQC-SPCKACYNQSDVIFDPKKSKTFATVPC 193

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
              +C  L             C Y++ Y DG  + G    +   F   +G R++  + LG
Sbjct: 194 GSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF---HGARVD-HVPLG 249

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--------SGGGGGF 215
           CG++      +    G+LGLG+G  S  SQ  S+        +CL        S      
Sbjct: 250 CGHDN--EGLFVGAAGLLGLGRGGLSFPSQTKSR--YNGKFSYCLVDRTSSGSSSKPPST 305

Query: 216 LFFGDDLYDSSRVVWTSMSSDY--TKYY------------SPGVAELFFGGETTGLKNLP 261
           + FG+D    + V    +++    T YY             PGV+E  F  + TG  N  
Sbjct: 306 IVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATG--NGG 363

Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
           V+ DSG+S T L +  Y  L    +  L A  LK AP       C+        +  VK 
Sbjct: 364 VIIDSGTSVTRLTQSAYVALRDAFR--LGATKLKRAPSYSLFDTCFD----LSGMTTVK- 416

Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
              T+   F  G+      L    YLI ++ +G  C          +  L++IG I
Sbjct: 417 -VPTVVFHFGGGEV----SLPASNYLIPVNTEGRFCFAFAG----TMGSLSIIGNI 463


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 141/350 (40%), Gaps = 49/350 (14%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQC--DAPCVRCVEAPH-PLYRPSNDLVPCEDPICA 109
           V + IG P +   + LDTGS L+W+QC   AP      A   P    +   +PC  P+C 
Sbjct: 99  VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158

Query: 110 SLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
               P      +C+    C Y   YADG  + G LV++ F F+ +      P L LGC  
Sbjct: 159 P-RIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS---LFTPPLILGCAT 214

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYD 224
                 S  P  GILG+ +G+ S  SQ    K    V       G    G  + G +  +
Sbjct: 215 E-----STDP-RGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNP-N 267

Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NL-PVVF------------D 265
           S+   +  M +       P +  L +     G++      N+ P VF            D
Sbjct: 268 SNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLD 327

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SGS +TYL    Y  + + + + +  +  K         +C+ G     N  ++ +    
Sbjct: 328 SGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDG-----NAIEIGRLIGD 382

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIG 374
           +   F  G    +  + P+  ++ + +G V C+GI N  ++G    N+IG
Sbjct: 383 MVFEFEKG----VQIVVPKERVLATVEGGVHCIGIANSDKLGAAS-NIIG 427


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 142/378 (37%), Gaps = 88/378 (23%)

Query: 45  VYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWL---------QCDAPCVRCVEAPHPL 93
           +YP  Y  Y  T  +G P +P  + LDTGS LTW+          C +P    V   HP 
Sbjct: 59  LYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPK 118

Query: 94  YRPSNDLVPCEDPICASLH--------------APGHHNCEDPAQ--C-DYELEYADGGS 136
              S+ LV C +P C  +H              +PG  NC   A   C  Y + Y   GS
Sbjct: 119 NSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGS-GS 177

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           + G+L+ D          R  P   LGC    V    + P  G+ G G+G  S+ +QL  
Sbjct: 178 TAGLLIADTL----RAPGRAVPGFVLGCSLVSV----HQPPSGLAGFGRGAPSVPAQLGL 229

Query: 197 QKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAE--------- 247
            K       +CL         F D+   S  +V           Y P V           
Sbjct: 230 PKF-----SYCLLS-----RRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYG 279

Query: 248 ----LFFGGETTGLK--NLP-------------VVFDSGSSYTYLNRVTYQTLTSIMKKE 288
               L   G T G K   LP              + DSG+++TYL+   +Q +   +   
Sbjct: 280 VYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAA 339

Query: 289 LSA--KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
           +    K  K+A ++  L  C+   +  +++         L+  F  G    + +L  E Y
Sbjct: 340 VGGRYKRSKDAEDELGLHPCFALPQGARSM-----ALPELSFHFEGG---AVMQLPVENY 391

Query: 347 LIISNKGNV---CLGILN 361
            +++ +G V   CL ++ 
Sbjct: 392 FVVAGRGAVEAICLAVVT 409


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 125/281 (44%), Gaps = 28/281 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
           G Y + + +G P    +  +DTGSDL W QC  PC  C     P++ P    +   +PCE
Sbjct: 80  GDYLMKLTLGSPPVDIYGLVDTGSDLVWAQC-TPCGGCYRQKSPMFEPLRSKTYSPIPCE 138

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALG 163
              C+       ++C     C Y   YAD   + GVL ++A  F+ T+G  +    +  G
Sbjct: 139 SEQCSFFG----YSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFG 194

Query: 164 CGYNQVPGASYHPLDGILGLGKGKS-SIVSQL----HSQKLIRNVVGHCLSGGGGGFLFF 218
           CG++     +++  D  +    G   S+VSQ+     S++  + +V         G + F
Sbjct: 195 CGHSN--SGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINF 252

Query: 219 GDDLYDSSR-VVWTSMSSD--YTKY------YSPGVAELFFGGETTGLKNLPVVFDSGSS 269
           G++   S   VV T ++S+   T Y       S G   + F    T L    ++ DSG+ 
Sbjct: 253 GEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSET-LSKGNIMIDSGTP 311

Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
            TY+ +  Y+ L   +K + S   +++ P D    LC++  
Sbjct: 312 ATYIPQEFYERLVEELKVQSSLLPIEDDP-DLGTQLCYRSE 351


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 139/389 (35%), Gaps = 65/389 (16%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPT-----GYYNVTMYIGQPARPY 64
           LCF +V   S S  ++   L   V ++          P      G Y     IG P +P 
Sbjct: 11  LCFISVTACSLSEQATRGRLLAGVDATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPV 70

Query: 65  FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE 120
              +D   +L W QC  PC  C E   PL+ P+       +PC   +C S+     +   
Sbjct: 71  SAVVDLTGELVWTQCT-PCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTS 129

Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC------GYNQVPGASY 174
           D   C YE      G + G    D FA            L  GC          + G S 
Sbjct: 130 D--VCIYEAP-TKAGDTGGKAGTDTFAIGAA-----KETLGFGCVVMTDKRLKTIGGPS- 180

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD--------SS 226
               GI+GLG+   S+V+Q++          +CL+G   G LF G             + 
Sbjct: 181 ----GIVGLGRTPWSLVTQMNVTAF-----SYCLAGKSSGALFLGATAKQLAGGKNSSTP 231

Query: 227 RVVWTSMSSD---YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
            V+ TS  S       YY   +A +  GG   +        V+ D+ S  +YL    Y+ 
Sbjct: 232 FVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKA 291

Query: 281 LTSIMKKELSAKSLKEAPE--DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
           L   +   +  + +   P+  D   P    G  P             L  +F  G   T 
Sbjct: 292 LKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAP------------ELVFTFDGGAALT- 338

Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGL 367
             + P  YL+ S  G VCL I + A + L
Sbjct: 339 --VPPANYLLASGNGTVCLTIGSSASLNL 365


>gi|168002493|ref|XP_001753948.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694924|gb|EDQ81270.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 602

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 88/421 (20%), Positives = 147/421 (34%), Gaps = 104/421 (24%)

Query: 50  YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPC--EDP 106
           +  V + +G+  + Y++ +DTGS ++W+ C        E PH L++P  D  V C  ++ 
Sbjct: 155 FVKVPIGLGKERQEYYMHIDTGSGISWVNCKGRGPITTEGPHGLFKPKADSYVNCKKQEE 214

Query: 107 ICASLHAPGHHNCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
            C        H C+     +C ++ +Y DG    G +V     F+ ++G      +A GC
Sbjct: 215 FCKGFQDGEEHRCDKKHHFRCIFDTQYGDGLIIEGYIVMIDLIFDLSDGSESQADVAFGC 274

Query: 165 GYN----QVPGASYH------------------------------PLDGILGLGKGKSSI 190
                  QV   + H                                DG++GLG    S 
Sbjct: 275 ASTCPKFQVVKNTPHLSVKIASSFSIMCADKVNDEETKKLGQNTALTDGLIGLGPHPGSW 334

Query: 191 VSQLHSQKLIRN-VVGHCLSGGGG---------------GFLFFGDDL-YDSSRVVWTSM 233
           + QL+    I   V+  C     G               GFL FG+     +   +WT+ 
Sbjct: 335 LHQLNMLGYISEYVIAICFEPDLGKSRHAAIGPELPEPAGFLSFGNPYSAQAESTIWTAN 394

Query: 234 SSDYTKYYSPGVAE----------LFFGGETTGLKNLPVV-------------------- 263
                +Y +P   E            + G    ++   +V                    
Sbjct: 395 IPSPEEYANPHPHEANSTNLQYYDAMYTGRLVSIRYRDIVIQLRGNEKKRKRDHPEGVQM 454

Query: 264 -FDSGSSYTYLNRVTYQTLTSIMKKELS------AKSLKEAPEDETLPLCWK----GRRP 312
            FD+GS  TYL R T+    +I+ +E         +   E  +DE    CW+    G  P
Sbjct: 455 GFDTGSDLTYLTRKTFDAFVTILDEEAKHLGYEITRDADEFVKDEQRK-CWRKKSGGEEP 513

Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKG---NVCLGILNGAEVGLQD 369
             +V D        A +F +  T++   + P+ Y+     G     C  +L   E    +
Sbjct: 514 --SVEDFGDMILEFA-TFAEDDTKSELVINPKYYITSEGSGRQHRTCFNMLKETEFDFGN 570

Query: 370 L 370
           L
Sbjct: 571 L 571


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 139/389 (35%), Gaps = 65/389 (16%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPT-----GYYNVTMYIGQPARPY 64
           LCF +V   S S  ++   L   V ++          P      G Y     IG P +P 
Sbjct: 11  LCFISVTACSLSEQATRGRLLAGVDATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPV 70

Query: 65  FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE 120
              +D   +L W QC  PC  C E   PL+ P+       +PC   +C S+     +   
Sbjct: 71  SAVVDLTGELVWTQC-TPCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTS 129

Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC------GYNQVPGASY 174
           D   C YE      G + G    D FA            L  GC          + G S 
Sbjct: 130 D--VCIYEAP-TKAGDTGGKAGTDTFAIGAA-----KETLGFGCVVMTDKRLKTIGGPS- 180

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD--------SS 226
               GI+GLG+   S+V+Q++          +CL+G   G LF G             + 
Sbjct: 181 ----GIVGLGRTPWSLVTQMN-----VTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTP 231

Query: 227 RVVWTSMSSD---YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
            V+ TS  S       YY   +A +  GG   +        V+ D+ S  +YL    Y+ 
Sbjct: 232 FVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKA 291

Query: 281 LTSIMKKELSAKSLKEAPE--DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
           L   +   +  + +   P+  D   P    G  P             L  +F  G   T 
Sbjct: 292 LKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAP------------ELVFTFDGGAALT- 338

Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGL 367
             + P  YL+ S  G VCL I + A + L
Sbjct: 339 --VPPANYLLASGNGTVCLTIGSSASLNL 365


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR  ++ LDTGSD+TW+QC  PC  C +   P++ PS       V C
Sbjct: 164 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVAC 222

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           ++P C  L A    N      C YE+ Y DG  ++G    +      +        +A+G
Sbjct: 223 DNPRCHDLDAAACRNST--GACLYEVAYGDGSYTVGDFATETLTLGDSAPVS---SVAIG 277

Query: 164 CGYN 167
           CG++
Sbjct: 278 CGHD 281


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y   + +G PAR  ++ LDTGSD+TW+QC  PC  C +   P++ PS       V C
Sbjct: 160 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVAC 218

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           ++P C  L A    N      C YE+ Y DG  ++G    +      +        +A+G
Sbjct: 219 DNPRCHDLDAAACRNST--GACLYEVAYGDGSYTVGDFATETLTLGDSAPVS---SVAIG 273

Query: 164 CGYN 167
           CG++
Sbjct: 274 CGHD 277


>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
          Length = 335

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/275 (26%), Positives = 120/275 (43%), Gaps = 48/275 (17%)

Query: 68  LDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SNDLVPCEDPICASLHAP 114
           LDTGSDL W+ CD  C +C          E    +Y P    +N  V C + +CA     
Sbjct: 4   LDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQ---- 57

Query: 115 GHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQRLNPRLALGCGYNQVP 170
             + C    + C Y + Y    +S  G+L++D         N +R+   +  GCG  QV 
Sbjct: 58  -RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCG--QVQ 114

Query: 171 GASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
             S+  +   +G+ GLG  K S+ S L  + L+ +    C    G G + FGD       
Sbjct: 115 SGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQE 174

Query: 228 VVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYTYLNRVTYQTLTSIMK 286
               +++  +   Y+  V  +  G  TT + +    +FD+G+S+TYL    Y T++    
Sbjct: 175 ETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFTYLVDPMYTTVSE--- 228

Query: 287 KELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
              SA+  + +P+          R PF+  +D+++
Sbjct: 229 ---SAQDKRHSPD---------SRIPFEYCYDMRE 251


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 66/222 (29%), Positives = 95/222 (42%), Gaps = 23/222 (10%)

Query: 5    HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
            H  ++LCF     S     ++   L +   + L F  H NV  T    V++ +G P +  
Sbjct: 960  HLFKSLCFSATPTSMVLPLNTQMGLISQPSNKLSF--HHNVTLT----VSLTVGSPPQQV 1013

Query: 65   FLDLDTGSDLTWLQC-DAPCVRCVEAPHPLYRPSNDLVPCEDPIC--ASLHAPGHHNCED 121
             + LDTGS+L+WL C  +P +  V   +PL   S   +PC  PIC   +   P    C+ 
Sbjct: 1014 TMVLDTGSELSWLHCKKSPNLTSVF--NPLSSSSYSPIPCSSPICRTRTRDLPNPVTCDP 1071

Query: 122  PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYH--PLDG 179
               C   + YAD  S  G L  D    N+  G    P    GC  +     S       G
Sbjct: 1072 KKLCHAIVSYADASSLEGNLASD----NFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTG 1127

Query: 180  ILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGGFLFFGD 220
            ++G+ +G  S V+QL   K       +C+SG    G L FGD
Sbjct: 1128 LMGMNRGSLSFVTQLGLPKF-----SYCISGRDSSGVLLFGD 1164


>gi|46488413|gb|AAS99528.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488415|gb|AAS99529.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488417|gb|AAS99530.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488419|gb|AAS99531.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488421|gb|AAS99532.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488423|gb|AAS99533.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488425|gb|AAS99534.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488427|gb|AAS99535.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488429|gb|AAS99536.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488431|gb|AAS99537.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488433|gb|AAS99538.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488435|gb|AAS99539.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488437|gb|AAS99540.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488439|gb|AAS99541.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488441|gb|AAS99542.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488443|gb|AAS99543.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488445|gb|AAS99544.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488447|gb|AAS99545.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488449|gb|AAS99546.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488455|gb|AAS99549.1| aspartic protease PM5 [Plasmodium vivax]
          Length = 536

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 92/412 (22%), Positives = 157/412 (38%), Gaps = 84/412 (20%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
           LC  +V+  S S+   S  L         ++++G++    YY + + IG P +   L LD
Sbjct: 27  LCALSVQGRSESTEGHSKDLLYK------YKLYGDIDEYAYYFLDIDIGTPEQRISLILD 80

Query: 70  TGSDLTWLQCDAPCVRC---VEAPHPLYR-PSNDLVPCEDPICASLHAPGHHNCEDPAQC 125
           TGS      C A C  C   +E P  L    ++ ++ CE+  C     P   NC    +C
Sbjct: 81  TGSSSLSFPC-AGCKNCGVHMENPFNLNNSKTSSILYCENEEC-----PFKLNCVK-GKC 133

Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG- 184
           +Y   Y +G    G    D  +    N +R+  R  +GC  ++     Y    G+LG+  
Sbjct: 134 EYMQSYCEGSQISGFYFSDVVSVVSYNNERVTFRKLMGCHMHEESLFLYQQATGVLGMSL 193

Query: 185 ---KGKSSIVSQLHSQK-LIRNVVGHCLSGGGGGFLFFGDD------------------- 221
              +G  + V+ L      ++ V   C+S  GG  +  G D                   
Sbjct: 194 SKPQGIPTFVNLLFDNAPQLKQVFTICISENGGELIAGGYDPAYIVRRGGSKSVSGQGSG 253

Query: 222 ----------------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
                           L ++ ++VW +++  Y  Y      ++F     +  K L ++ D
Sbjct: 254 PVSESLSESGEDPQVALREAEKIVWENVTRKYYYYIKVRGLDMFGTNMMSSSKGLEMLVD 313

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SGS++T++    Y  L                       LC +      N +DV K  + 
Sbjct: 314 SGSTFTHIPEDLYNKLNYFFD-----------------ILCIQD---MNNAYDVNKRLKM 353

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-----GLQDLNV 372
              SF +   +  F+   ++   I  K N+C+ I++G +      GL DL V
Sbjct: 354 TNESFNNPLVQ--FDDFRKSLKSIIAKENMCVKIVDGVQCWKYLEGLPDLFV 403


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 13/131 (9%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y   + +G P R  ++ LDTGSD+ W+QC+ PC +C     P++ P    
Sbjct: 187 VSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCE-PCSKCYSQVDPIFNPSLSA 245

Query: 97  SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           S   + C   +C+ L A   +NC     C Y++ Y DG  ++G    +   F  T+ +  
Sbjct: 246 SFSTLGCNSAVCSYLDA---YNCHG-GGCLYKVSYGDGSYTIGSFATEMLTFGTTSVR-- 299

Query: 157 NPRLALGCGYN 167
              +A+GCG++
Sbjct: 300 --NVAIGCGHD 308


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 37/244 (15%)

Query: 45  VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCE 104
           V+    Y + + +G P       +DTGS++TW QC  PCV C +   P++ PS      E
Sbjct: 374 VFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQC-LPCVHCYKQNAPIFDPSKSSTFKE 432

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
                         C D + C YE++Y D   + G L  D    + T+G+  +     +G
Sbjct: 433 ------------KRCHDHS-CPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIG 479

Query: 164 CGYNQVPGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD- 221
           CG N    + + P  +G +GL  G  S+++Q+  +     ++ +C +G G   + FG + 
Sbjct: 480 CGRNN---SWFRPSFEGFVGLNWGPLSLITQMGGE--YPGLMSYCFAGNGTSKINFGTNA 534

Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------VVFDSGSS 269
           +     VV T+M   +     PG   L     + G   +             +V DSG++
Sbjct: 535 IVGGGGVVSTTM---FVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTT 591

Query: 270 YTYL 273
            TY 
Sbjct: 592 LTYF 595



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 83/352 (23%), Positives = 141/352 (40%), Gaps = 52/352 (14%)

Query: 14  TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
           T+ +    S++SSS + N    S        V+ T  Y + + IG P       LDTGS+
Sbjct: 31  TIDLIHRRSNASSSRVSNTQAGS---PYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSE 87

Query: 74  LTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYAD 133
           L W QC  PC+ C +   P++ PS      E       + P H        C Y+L Y D
Sbjct: 88  LIWTQC-LPCLHCYDQKAPIFDPSKSSTFKE----TRCNTPDH-------SCPYKLVYDD 135

Query: 134 GGSSLGVLVKDAFAFNYTNG-QRLNPRLALGCGYNQVPGASYHP-LDGILGLGKGKSSIV 191
              + G L  +    + T+G   + P   +GC  N   G+ + P   GI+GL +G  S++
Sbjct: 136 KSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNN-SGSGFRPSSSGIVGLSRGSLSLI 194

Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY---TKYYSPGVAEL 248
           SQ+                 GG +   GD +  ++    T+    Y       S G   +
Sbjct: 195 SQM-----------------GGAYP--GDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRI 235

Query: 249 FFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
              G      N  +V DSG+  TY        +   +++ ++A  + +   ++   LC+ 
Sbjct: 236 ETVGTPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDM--LCY- 292

Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
               + N  ++   F  + + F+ G    L +     Y+ ++  G  CL I+
Sbjct: 293 ----YSNTIEI---FPVITVHFSGGADLVLDKY--NMYMELNRGGVFCLAII 335


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 149/371 (40%), Gaps = 52/371 (14%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           SSS+S L N+   ++  ++ G     G Y++   IG P +      DTGSDL W +CDA 
Sbjct: 75  SSSASQLSNNDTDTVPLRMDGG---GGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAG 131

Query: 83  CVRCVEAP---HPLYRPSNDLVPCEDPICASLHAPGHHNCED-PAQCDYELEYADGGS-- 136
                      HP    +   +PC D +CA+L +     C    A+CDY+  Y  G    
Sbjct: 132 GGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPD 191

Query: 137 -SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
            + G L  + F    T G    P +  GC         Y    G++GLG+G  S+VSQL 
Sbjct: 192 FTQGFLGSETF----TLGGDAVPGVGFGC--TTALEGDYGEGAGLVGLGRGPLSLVSQLD 245

Query: 196 SQKLIRNVVGHCLSGGGGGF--LFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG 251
           +   +     +CL+        L FG    +  +   V ++     T +Y+  +  +  G
Sbjct: 246 AGTFM-----YCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIG 300

Query: 252 GETTG--LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
             TT        VVFDSG++ TYL    Y    +    + ++ +  E            G
Sbjct: 301 SATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVE------------G 348

Query: 310 RRPFKNVH---DVKKCFRTLALSFTDGKTRTLFELTPEA-YLIISNKGNVCLGILNGAEV 365
           R  F+  +   D  +    + L F  G    L    P A Y++  + G VC  +      
Sbjct: 349 RYGFEACYEKPDSARLIPAMVLHFDGGADMAL----PVANYVVEVDDGVVCWVVQRSPS- 403

Query: 366 GLQDLNVIGGI 376
               L++IG I
Sbjct: 404 ----LSIIGNI 410


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 97/349 (27%), Positives = 142/349 (40%), Gaps = 54/349 (15%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V G    +G Y  ++ +G P  P  L LDTGSD+ WLQC APC +C      ++ P    
Sbjct: 132 VSGLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQC-APCRQCYAQSGRVFDPRRSR 190

Query: 97  SNDLVPCEDPIC-ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           S   V C  P C       G         C Y++ Y DG  + G L  +   F    G R
Sbjct: 191 SYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF--ARGAR 248

Query: 156 LNPRLALGCGYNQVPGASYHPLDGIL---GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           + PR+A+GCG++          +G+        G       L +Q   R   G   S   
Sbjct: 249 V-PRVAVGCGHDN---------EGLFVAAAGLLGLGRGRLSLPTQTARR--YGRRFS--- 293

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYT 271
             + F G DL    R +  ++          GV E     + +TG     V+ DSG+S T
Sbjct: 294 --YCFQGSDL--DHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGG--VILDSGTSVT 347

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETL-PLCW--KGRRPFKNVHDVKKCFRTLAL 328
            L R  Y  +    +   +A  L+ AP   +L   C+  +GRR  K          T+++
Sbjct: 348 RLARPVYVAVREAFRA--AAGGLRLAPGGFSLFDTCYDLRGRRVVK--------VPTVSV 397

Query: 329 SFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
               G       L PE YLI +  +G  CL  L G + G   ++++G I
Sbjct: 398 HLAGGAE---VALPPENYLIPVDTRGTFCLA-LAGTDGG---VSIVGNI 439


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 137/331 (41%), Gaps = 47/331 (14%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
           Y VT+ +G   R   + +DTGSDL+W+QC  PC RC     P++ PS       V C   
Sbjct: 66  YIVTVELG--GRKMTVIVDTGSDLSWVQCQ-PCNRCYNQQDPVFNPSKSPSYRTVLCNSL 122

Query: 107 ICASLH-APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            C SL  A G+      +P  C+Y + Y DG  + G +  +        G         G
Sbjct: 123 TCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL----GNTTVNNFIFG 178

Query: 164 CGY-NQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLF 217
           CG  NQ    GAS     G++GLG+   S++SQ+    +   V  +CL        G L 
Sbjct: 179 CGRKNQGLFGGAS-----GLVGLGRTDLSLISQIS--PMFGGVFSYCLPTTEAEASGSLV 231

Query: 218 FGDD---LYDSSRVVWTSMSSD-YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSY 270
            G +     +++ + +T M  +    +Y   +  +  GG   +        ++ DSG+  
Sbjct: 232 MGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVI 291

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLAL 328
           + L    YQ L +   K+ S      AP    L  C+   G +  K + D+K  F     
Sbjct: 292 SRLPPSIYQALKAEFVKQFSG--YPSAPSFMILDSCFNLSGYQEVK-IPDIKMYF----- 343

Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
              +G      ++T   Y + ++   VCL I
Sbjct: 344 ---EGSAELNVDVTGVFYSVKTDASQVCLAI 371


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 89/391 (22%), Positives = 153/391 (39%), Gaps = 76/391 (19%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
           +SSS + +       S+ +F+   + +  G Y+  +  G P +   L  DTGS L W  C
Sbjct: 50  ASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPC 109

Query: 80  DAPCVRCVEAPHPLYRP------------SNDLVPCEDPICASLHA-----------PGH 116
            +  + C E   P   P            S+ LV C++P C+ +             P  
Sbjct: 110 TSRYL-CSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKT 168

Query: 117 HNCED--PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY 174
            NC    PA   Y ++Y   GS+ G+L+ +   F      +  P   +GC +      S 
Sbjct: 169 ENCTQTCPA---YVVQYGS-GSTAGLLLSETLDF----PDKXIPNFVVGCSF-----LSI 215

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG------GGFLFFGDDLYDSSRV 228
           H   GI G G+G  S+ SQ+  +K       +CL+          G L        SS +
Sbjct: 216 HQPSGIAGFGRGSESLPSQMGLKKF-----AYCLASRKFDDSPHSGQLILDSTGVKSSGL 270

Query: 229 VWTSMSSD-------YTKYYSPGVAELFFGGETTGLK----------NLPVVFDSGSSYT 271
            +T    +       Y +YY   + ++  G +   +           N   + DSGS++T
Sbjct: 271 TYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFT 330

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSF 330
           ++++   + +    +K+L+  +   A + ETL     G RP  ++   K   F  L   F
Sbjct: 331 FMDKPVLEVVAREFEKQLA--NWTRATDVETL----TGLRPCFDISKEKSVKFPELIFQF 384

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
             G    L       + ++S+ G  CL ++ 
Sbjct: 385 KGGAKWAL--PLNNYFALVSSSGVACLTVVT 413


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/150 (34%), Positives = 75/150 (50%), Gaps = 14/150 (9%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
           Y VT  +G P     +++DTGSDL+W+QC  PC     C     PL+ P+       VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             P+CA L       C   AQC Y + Y DG ++ GV   D    + ++  +       G
Sbjct: 199 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 254

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
           CG+ Q     ++ +DG+LGLG+ + S+V Q
Sbjct: 255 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ 282


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 116/277 (41%), Gaps = 26/277 (9%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV- 101
           G+   T  Y +T+ IG PA    + +DTGSD++W++C++           L+ PS     
Sbjct: 121 GSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS------TDGLTLFDPSKSTTY 174

Query: 102 ---PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
               C    CA L   G   C + + C Y ++Y DG ++ G    D  A + ++      
Sbjct: 175 APFSCSSAACAQLGNNG-DGCSN-SGCQYRVQYGDGSNTTGTYSSDTLALSASD---TVT 229

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GC +++        +DG++GLG    S+VSQ  +         +CL  +    GFL
Sbjct: 230 DFHFGCSHHE-EDFDGEKIDGLMGLGGDAQSLVSQ--TAATYGKSFSYCLPPTNRTSGFL 286

Query: 217 FFGDDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKNLPV----VFDSGSSY 270
            FG     S   V T M         Y   + ++  GG   G++   +    V DSG+  
Sbjct: 287 TFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSVMDSGTVI 346

Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
           T+L R  Y  L+S  +  ++    + A     L  C+
Sbjct: 347 TWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCY 383


>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
          Length = 551

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 139/342 (40%), Gaps = 62/342 (18%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---------PLYRP-------SNDL 100
           +G P   + + LDTGSDL W+ CD  C +C    +         P  R        ++  
Sbjct: 111 VGTPNTTFLVALDTGSDLFWVPCD--CKQCAPLGNLTAVDGGGGPELRQYSPSKSSTSKT 168

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTN------- 152
           V C   +C   +A         + C Y + YA    SS G LV+D               
Sbjct: 169 VTCASNLCDQPNACATAT----SSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAA 224

Query: 153 GQRLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCL 208
           G  +   +  GCG  QV   S+      DG++GLG  K S+ S L S  +++ N    C 
Sbjct: 225 GAAVRTPVVFGCG--QVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCF 282

Query: 209 SGGGGGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF 264
           S  G G + FGD    D  ++  +V ++ S     YY+  +  +     + G KNLP+ F
Sbjct: 283 SKDGLGRINFGDTGSADQSETPFIVKSTHS-----YYNISITSM-----SVGDKNLPLGF 332

Query: 265 ----DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
               DSG+S+TYLN   Y   T+    ++S +    +    + P       PF+  + + 
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPF------PFEYCYSLS 386

Query: 321 KCFRTLALSFTDGKTR--TLFELTPEAYLIISNKGNVCLGIL 360
               T+ L      T    +F +T   Y I +   N  + I+
Sbjct: 387 PDQTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRII 428


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 117/298 (39%), Gaps = 57/298 (19%)

Query: 44  NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----------------------DA 81
           N    G Y V++  G PA PY L LDT +DLTW+ C                      D 
Sbjct: 133 NTAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKTMSVGGDD 192

Query: 82  PCVRCV---EAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ---CDYELEY 131
             V  +   EA    YRP+       + C +  CA  H P ++ C+ P++   C Y  + 
Sbjct: 193 DVVAALAKKEARKNWYRPAKSSSWRRIRCSEQQCA--HLP-YNTCQSPSKLESCSYYQKT 249

Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
            DG  ++G+   +      ++G+    P L LGC   +  GAS    DG+L LG G  S 
Sbjct: 250 QDGTVTIGIYGNEKATVTVSDGRMAKLPGLVLGCSVLEA-GASVDAHDGVLSLGNGHMSF 308

Query: 191 VSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDD---LYDSSRVVWTSMSSDYTKYYS 242
              +H+          CL     S     +L FG +   +   +       + D    Y 
Sbjct: 309 A--IHAVLRFGGRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYG 366

Query: 243 PGVAELFFGGETTGL--------KNL--PVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
           P V  +  GGE   +        K L   V+ D+ +S T L    Y+ L + + + L+
Sbjct: 367 PRVTAVLVGGERLDIPDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLA 424


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 15/126 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPS----NDLV 101
           G Y   + +GQP + YF   DTGSD++WLQC  PC     C +   P++ P        +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPL 240

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
            C+   C   H      C D   C YE+EY DG  ++G L  + F+F ++N     P L 
Sbjct: 241 SCDSEQC---HLLDEAAC-DANSCIYEVEYGDGSFTVGELATETFSFRHSNSI---PNLP 293

Query: 162 LGCGYN 167
           +GCG++
Sbjct: 294 IGCGHD 299


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 118/302 (39%), Gaps = 69/302 (22%)

Query: 37  LLFQVHGNVYPTG-----------YYNVTM----YIGQPARPYFLDLDTGSDLTWLQCDA 81
           LLF++     P G           ++NV++     +G P +   + LDTGS+L+WL C  
Sbjct: 36  LLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAP 95

Query: 82  PCVRCVEAPHPL-YRPSNDL----VPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGG 135
                      L +RP   L    VPC    C S   P    C+  + QC   L YADG 
Sbjct: 96  GGGGGGGGRSALSFRPRASLTFASVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGS 155

Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVS 192
           SS G L  + F    T GQ    R A GC    ++  P        G+LG+ +G  S VS
Sbjct: 156 SSDGALATEVF----TVGQGPPLRAAFGCMATAFDTSPDGVAT--AGLLGMNRGALSFVS 209

Query: 193 QLHSQKLIRNVVGHCLSG-GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
           Q  +++       +C+S     G L  G  DL          +  +YT  Y P +   +F
Sbjct: 210 QASTRRF-----SYCISDRDDAGVLLLGHSDL--------PFLPLNYTPLYQPAMPLPYF 256

Query: 251 G---------GETTGLKNLPV---------------VFDSGSSYTYLNRVTYQTLTSIMK 286
                     G   G K LP+               + DSG+ +T+L    Y  L +   
Sbjct: 257 DRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFS 316

Query: 287 KE 288
           ++
Sbjct: 317 RQ 318


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 89/391 (22%), Positives = 153/391 (39%), Gaps = 76/391 (19%)

Query: 20  SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
           +SSS + +       S+ +F+   + +  G Y+  +  G P +   L  DTGS L W  C
Sbjct: 50  ASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPC 109

Query: 80  DAPCVRCVEAPHPLYRP------------SNDLVPCEDPICASLHA-----------PGH 116
            +  + C E   P   P            S+ LV C++P C+ +             P  
Sbjct: 110 TSRYL-CSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKT 168

Query: 117 HNCED--PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY 174
            NC    PA   Y ++Y   GS+ G+L+ +   F      +  P   +GC +      S 
Sbjct: 169 ENCTQTCPA---YVVQYGS-GSTAGLLLSETLDF----PDKKIPNFVVGCSF-----LSI 215

Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG------GGFLFFGDDLYDSSRV 228
           H   GI G G+G  S+ SQ+  +K       +CL+          G L        SS +
Sbjct: 216 HQPSGIAGFGRGSESLPSQMGLKKF-----AYCLASRKFDDSPHSGQLILDSTGVKSSGL 270

Query: 229 VWTSMSSD-------YTKYYSPGVAELFFGGETTGLK----------NLPVVFDSGSSYT 271
            +T    +       Y +YY   + ++  G +   +           N   + DSGS++T
Sbjct: 271 TYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFT 330

Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSF 330
           ++++   + +    +K+L+  +   A + ETL     G RP  ++   K   F  L   F
Sbjct: 331 FMDKPVLEVVAREFEKQLA--NWTRATDVETL----TGLRPCFDISKEKSVKFPELIFQF 384

Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
             G    L       + ++S+ G  CL ++ 
Sbjct: 385 KGGAKWAL--PLNNYFALVSSSGVACLTVVT 413


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 15/126 (11%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPS----NDLV 101
           G Y   + +GQP + YF   DTGSD++WLQC  PC     C +   P++ P        +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPL 240

Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
            C+   C   H      C D   C YE+EY DG  ++G L  + F+F ++N     P L 
Sbjct: 241 SCDSEQC---HLLDEAAC-DANSCIYEVEYGDGSFTVGELATETFSFRHSNSI---PNLP 293

Query: 162 LGCGYN 167
           +GCG++
Sbjct: 294 IGCGHD 299


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 7/124 (5%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           G Y ++  +G P       +DTGSD+ WLQC  PC  C     P++ PS       +PC 
Sbjct: 92  GEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQ-PCEDCYNQTTPIFDPSQSKTYKTLPCS 150

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
             IC S+ +    +  +  +C+Y + Y D   S G L  +      T+G  +  P+  +G
Sbjct: 151 SNICQSVQSAASCSSNND-ECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIG 209

Query: 164 CGYN 167
           CG+N
Sbjct: 210 CGHN 213


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 106/265 (40%), Gaps = 34/265 (12%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL- 100
           G +  +  Y V + +G P R   L  DTGS LTW QC+ PC   C +   P++ PS    
Sbjct: 132 GRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCE-PCAGSCYKQQDPIFDPSKSSS 190

Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
              + C   +C    + G  +  D A C Y+++Y D   S G L ++      T+   + 
Sbjct: 191 YTNIKCTSSLCTQFRSAGCSSSTD-ASCIYDVKYGDNSISRGFLSQERLTITATD---IV 246

Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGF 215
                GCG  Q     +    G++GL +   S V Q  S  +   +  +CL  +    G 
Sbjct: 247 HDFLFGCG--QDNEGLFRGTAGLMGLSRHPISFVQQTSS--IYNKIFSYCLPSTPSSLGH 302

Query: 216 LFFGDDLYDSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLPVV---------- 263
           L FG     ++ + +T  S  S    +Y   +  +  GG       LP V          
Sbjct: 303 LTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGG-----TKLPAVSSSTFSAGGS 357

Query: 264 -FDSGSSYTYLNRVTYQTLTSIMKK 287
             DSG+  T L    Y  L S  ++
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQ 382


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 81/322 (25%), Positives = 126/322 (39%), Gaps = 32/322 (9%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y   M +G PA  Y + +DTGS LTWLQC    V C     P++ P +      V C 
Sbjct: 120 GNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCS 179

Query: 105 DPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              C+ L +   +   C     C Y+  Y D   S+G L KD  +F  T+     P    
Sbjct: 180 AQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----LPNFYY 235

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFF 218
           GCG +      +    G++GL + K S++ QL     +     +CL    S G      +
Sbjct: 236 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSY 291

Query: 219 GDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
               Y  + +V +S+     + K     VA       ++   +LP + DSG+  T L   
Sbjct: 292 NPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTS 351

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR 336
            Y  L+  +   +  K    A     L  C+KG+    +   V        +SF  G   
Sbjct: 352 VYSALSKAVAAAM--KGTSRASAYSILDTCFKGQASRVSAPAVT-------MSFAGGAA- 401

Query: 337 TLFELTPEAYLIISNKGNVCLG 358
              +L+ +  L+  +    CL 
Sbjct: 402 --LKLSAQNLLVDVDDSTTCLA 421


>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
           Japonica Group]
 gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
          Length = 551

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 139/342 (40%), Gaps = 62/342 (18%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---------PLYRP-------SNDL 100
           +G P   + + LDTGSDL W+ CD  C +C    +         P  R        ++  
Sbjct: 111 VGTPNTTFLVALDTGSDLFWVPCD--CKQCAPLGNLTAVDGGGGPELRQYSPSKSSTSKT 168

Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTN------- 152
           V C   +C   +A         + C Y + YA    SS G LV+D               
Sbjct: 169 VTCASNLCDQPNACATAT----SSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAA 224

Query: 153 GQRLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCL 208
           G  +   +  GCG  QV   S+      DG++GLG  K S+ S L S  +++ N    C 
Sbjct: 225 GAAVRTPVVFGCG--QVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCF 282

Query: 209 SGGGGGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF 264
           S  G G + FGD    D  ++  +V ++ S     YY+  +  +     + G KNLP+ F
Sbjct: 283 SKDGLGRINFGDTGSADQSETPFIVKSTHS-----YYNISITSM-----SVGDKNLPLGF 332

Query: 265 ----DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
               DSG+S+TYLN   Y   T+    ++S +    +    + P       PF+  + + 
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPF------PFEYCYSLS 386

Query: 321 KCFRTLALSFTDGKTR--TLFELTPEAYLIISNKGNVCLGIL 360
               T+ L      T    +F +T   Y I +   N  + I+
Sbjct: 387 PDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRII 428


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 136/344 (39%), Gaps = 43/344 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y + + +G P + + L LDTGSDL WLQC  PC  C       Y P        + C
Sbjct: 157 SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNGMFYDPKTSASFKNITC 215

Query: 104 EDPICASLHAPGHH-NCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-- 159
            DP C+ + +P     CE D   C Y   Y D  ++ G    + F  N T  +  +    
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275

Query: 160 ---LALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
              +  GCG +N+   +    L G+       SS +  L+       +V    +      
Sbjct: 276 VGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSK 335

Query: 216 LFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
           L FG+  DL + + + +TS      +    +Y   +  +  GG+   +            
Sbjct: 336 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGD 395

Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
              + DSG++ +Y     Y+    I+K + + K  +  P     P+      P  NV  +
Sbjct: 396 GGTIIDSGTTLSYFAEPAYE----IIKNKFAEKMKENYPIFRDFPVL----DPCFNVSGI 447

Query: 320 KKC---FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
           ++       L ++F DG   T++    E   I  ++  VCL IL
Sbjct: 448 EENNIHLPELGIAFVDG---TVWNFPAENSFIWLSEDLVCLAIL 488


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 60/135 (44%), Gaps = 10/135 (7%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
           G Y + + +G P  P     DTGSDL W QC  PC  C E   PL+ P        + C+
Sbjct: 92  GAYLMNISLGTPPVPMLGIADTGSDLIWRQC-LPCPNCYEQVEPLFDPKESETYKTLDCD 150

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
           +  C  L   G  +C+D   C Y   Y D   + G L  D      T G   + P +A G
Sbjct: 151 NEFCQDLGQQG--SCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFG 208

Query: 164 CGYNQVPGASYHPLD 178
           CG++   G +++  D
Sbjct: 209 CGHDN--GGTFNEKD 221


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 113/273 (41%), Gaps = 48/273 (17%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDP 106
           Y ++  IG P    +  +DTG+D  W QC  PC  C+    P++ PS       +PC  P
Sbjct: 90  YVMSYSIGTPPFQLYSLIDTGNDNIWFQCK-PCKPCLNQTSPMFHPSKSSTYKTIPCTSP 148

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCG 165
           IC   +A GH+                    LGV   D    N  NG  ++ + + +GCG
Sbjct: 149 ICK--NADGHY--------------------LGV---DTLTLNSNNGTPISFKNIVIGCG 183

Query: 166 Y-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
           + NQ P   Y  + G +GL +G  S +SQL+S   I     +CL            L FG
Sbjct: 184 HRNQGPLEGY--VSGNIGLARGPLSFISQLNSS--IGGKFSYCLVPLFSKENVSSKLHFG 239

Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNR 275
           D    S     ++   +   Y+   +     G     L+N       + DSG++ T L +
Sbjct: 240 DKSTVSGLGTVSTPIKEENGYFV-SLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTILPK 298

Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
             Y  L S++   +  K +K+    +   LC++
Sbjct: 299 DVYSRLESVVLDMVKLKRVKDP--SQQFNLCYQ 329


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 143/389 (36%), Gaps = 65/389 (16%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHV-------GSSLLFQVHGNVYPTGYYNVTMYIGQPAR 62
           LCF +V   S S  ++   L   V       G ++   ++  +   G Y     IG P +
Sbjct: 11  LCFISVTACSLSEQATRGRLLAGVDATPPAAGGAVAVPIY--LSSQGLYVANFTIGTPPQ 68

Query: 63  PYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHN 118
           P    +D   +L W QC  PC  C E   PL+ P+       +PC   +C S+     + 
Sbjct: 69  PVSAVVDLTGELVWTQC-TPCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNC 127

Query: 119 CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC------GYNQVPGA 172
             D   C YE      G + G+   D FA            L  GC          + G 
Sbjct: 128 TSD--VCIYEAP-TKAGDTGGMAGTDTFAIGAA-----KETLGFGCVVMTDKRLKTIGGP 179

Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD-------- 224
           S     GI+GLG+   S+V+Q++          +CL+G   G LF G             
Sbjct: 180 S-----GIVGLGRTPWSLVTQMN-----VTAFSYCLAGKSSGALFLGATAKQLAGGKNSS 229

Query: 225 SSRVVWTSMSSD---YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTY 278
           +  V+ TS  S       YY   +A +  GG   +        V+ D+ S  +YL    Y
Sbjct: 230 TPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYLADGAY 289

Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
           + L   +   +  + +   P+     LC+           V      L  +F  G   T 
Sbjct: 290 KALKKALTAAVGVQPVASPPKPYD--LCFS--------KAVAGDAPELVFTFDGGAALT- 338

Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGL 367
             + P  YL+ S  G VCL I + A + L
Sbjct: 339 --VPPANYLLASGNGTVCLTIGSSASLNL 365


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 70/159 (44%), Gaps = 14/159 (8%)

Query: 51  YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
           Y + + IG P RP    L LDTGSDL W QC   C  C + P P++R S       VPC 
Sbjct: 94  YLIHLGIGTP-RPQRVVLHLDTGSDLVWTQC--ACTVCFDQPVPVFRASVSHTFSRVPCS 150

Query: 105 DPICA-SLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRL 160
           DP+C  +++ P          C Y   Y D   + G + +D F F   +  +     P +
Sbjct: 151 DPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNI 210

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
             GCG     G       GI G G G  S+ SQL  ++ 
Sbjct: 211 RFGCGMMNY-GLFTPNQSGIAGFGTGPLSLPSQLKVRRF 248


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G PA    + LDTGSD+ WLQC APC  C      ++ P    S   V C
Sbjct: 125 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDC 183

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             PIC  L + G     +   C Y++ Y DG  + G    +   F    G R+  R+A+G
Sbjct: 184 VAPICRRLDSAGCDRRRN--SCLYQVAYGDGSVTAGDFASETLTF--ARGARVQ-RVAIG 238

Query: 164 CGYN 167
           CG++
Sbjct: 239 CGHD 242


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G PA    + LDTGSD+ WLQC APC  C      ++ P    S   V C
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDC 177

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             PIC  L + G     +   C Y++ Y DG  + G    +   F    G R+  R+A+G
Sbjct: 178 VAPICRRLDSAGCDRRRN--SCLYQVAYGDGSVTAGDFASETLTF--ARGARVQ-RVAIG 232

Query: 164 CGYN 167
           CG++
Sbjct: 233 CGHD 236


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
           +G Y   + +G PA    + LDTGSD+ WLQC APC  C      ++ P    S   V C
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDC 177

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
             PIC  L + G     +   C Y++ Y DG  + G    +   F    G R+  R+A+G
Sbjct: 178 VAPICRRLDSAGCDRRRN--SCLYQVAYGDGSVTAGDFASETLTF--ARGARVQ-RVAIG 232

Query: 164 CGYN 167
           CG++
Sbjct: 233 CGHD 236


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 142/343 (41%), Gaps = 48/343 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVPCE 104
           Y +T+ +G P R      DTGSDL W++C           AP   + PS       V C+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR----- 159
              C +L   G   C+D + C Y   Y DG ++ GVL  + F F+     R +PR     
Sbjct: 161 TDACEAL---GRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGR-SPRQVRIG 216

Query: 160 -LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
            +  GC       A   P DG++GLG G  S+V+QL     +     +CL          
Sbjct: 217 GVKFGC---STATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL---------V 264

Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT--GLKNLPVVFDSGSSYTYLNRV 276
              +  SS + + ++ +D T+   PG A     G  T     +  ++ DSG++ T+L+  
Sbjct: 265 PHSVNASSALNFGAL-ADVTE---PGAASTPLVGNKTVASAASSRIIVDSGTTLTFLDPS 320

Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSFTDGK 334
               +   + + ++   ++    D  L LC+   GR       +  +    L L F  G 
Sbjct: 321 LLGPIVDELSRRITLPPVQS--PDGLLQLCYNVAGRE-----VEAGESIPDLTLEFGGGA 373

Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
                 L PE   +   +G +CL I+   E   Q ++++G + 
Sbjct: 374 A---VALKPENAFVAVQEGTLCLAIVATTE--QQPVSILGNLA 411


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 141/378 (37%), Gaps = 88/378 (23%)

Query: 45  VYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWL---------QCDAPCVRCVEAPHPL 93
           +YP  Y  Y  T  +G P +P  + LDTGS LTW+          C +P    V   HP 
Sbjct: 91  LYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPK 150

Query: 94  YRPSNDLVPCEDPICASLH--------------APGHHNCEDPAQ--C-DYELEYADGGS 136
              S+ LV C +P C  +H              +PG  NC   A   C  Y + Y   GS
Sbjct: 151 NSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGS-GS 209

Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
           + G+L+ D          R  P   LGC    V    + P  G+ G G+G  S+ +QL  
Sbjct: 210 TAGLLIADTL----RAPGRAVPGFVLGCSLVSV----HQPPSGLAGFGRGAPSVPAQLGL 261

Query: 197 QKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAE--------- 247
            K       +CL         F D+   S  +V           Y P V           
Sbjct: 262 PKF-----SYCLLS-----RRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYG 311

Query: 248 ----LFFGGETTGLK--NLPV-------------VFDSGSSYTYLNRVTYQTLTSIMKKE 288
               L   G T G K   LP              + DSG+++TYL+   +Q +   +   
Sbjct: 312 VYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAA 371

Query: 289 LSA--KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
           +    K  K+A +   L  C+   +  +++         L+  F  G    + +L  E Y
Sbjct: 372 VGGRYKRSKDAEDGLGLHPCFALPQGARSM-----ALPELSFHFEGG---AVMQLPVENY 423

Query: 347 LIISNKGNV---CLGILN 361
            +++ +G V   CL ++ 
Sbjct: 424 FVVAGRGAVEAICLAVVT 441


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 145/376 (38%), Gaps = 63/376 (16%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           S +S++  +   S+ L    G    +G Y VT+ IG P     L  DTGSDLTW QC+ P
Sbjct: 104 SKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-P 162

Query: 83  CV-RCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSS 137
           C+  C     P + PS+      V C  P+C    +    NC       Y + Y D   +
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCV------YSIGYGDKSFT 216

Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ---- 193
            G L K+ F    TN   L   +  GCG N           G+     G   +       
Sbjct: 217 QGFLAKEKFTL--TNSDVLE-DVYFGCGENN---------QGLFDGVAGLLGLGPGKLSL 264

Query: 194 -LHSQKLIRNVVGHCL---SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
              +     N+  +CL   +    G L FG     S  V +T +SS +   ++ G+  + 
Sbjct: 265 PAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI-SESVKFTPISS-FPSAFNYGIDII- 321

Query: 250 FGGETTGLKNLPV----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
             G + G K L +          + DSG+ +T L    Y  L S+ K+++S  S K    
Sbjct: 322 --GISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMS--SYKSTSG 377

Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
                 C+     F  +  V   + T+A SF  G   T+ EL      +      VCL  
Sbjct: 378 YGLFDTCYD----FTGLDTVT--YPTIAFSFAGG---TVVELDGSGISLPIKISQVCL-- 426

Query: 360 LNGAEVGLQDLNVIGG 375
              A  G  DL  I G
Sbjct: 427 ---AFAGNDDLPAIFG 439


>gi|449533387|ref|XP_004173657.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 254

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 58/197 (29%), Positives = 85/197 (43%), Gaps = 32/197 (16%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN---------DLVPC 103
           V++ IG P +P  L LDTGS L+W+QC    V+    P P  + +           L+PC
Sbjct: 69  VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPC 128

Query: 104 EDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
             PIC     P      +C+    C Y   YADG  + G LV++ F F   +     P +
Sbjct: 129 NHPICKP-RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTF---SNSLSTPPV 184

Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG----GFL 216
            LGC              GILG+  G+ S +SQ    K       +C+    G    G  
Sbjct: 185 ILGCAQGSTENR------GILGMNHGRLSFISQAKISKF-----SYCVPSRTGPNPTGLF 233

Query: 217 FFGDDLYDSSRVVWTSM 233
           + GD+  +SS+  + +M
Sbjct: 234 YLGDNP-NSSKFKYVTM 249


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 73/276 (26%), Positives = 120/276 (43%), Gaps = 38/276 (13%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
           V + IGQP+ P  + +DTGSD+ W+ C+ PC  C      L+ PS  +     P+C +  
Sbjct: 103 VNLSIGQPSIPQLVVMDTGSDILWIMCN-PCTNCDNHLGLLFDPS--MSSTFSPLCKT-- 157

Query: 113 APGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYNQVP 170
             G   C+ DP    + + Y D  S+ G   +D   F  T+ G      + +GCG+N   
Sbjct: 158 PCGFKGCKCDPIP--FTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGHNI-- 213

Query: 171 GASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF-------GDDL 222
           G +  P  +GILGL  G +S+ +Q+  +        +C+      +  +       G DL
Sbjct: 214 GFNSDPGYNGILGLNNGPNSLATQIGRK------FSYCIGNLADPYYNYNQLRLGEGADL 267

Query: 223 --YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---VVFDSGSSYTYL---- 273
             Y +   V+        +  S G   L    ET  +K      V+ DSG++ TYL    
Sbjct: 268 EGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSA 327

Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
           +++ Y  + +++K        + AP      LC+ G
Sbjct: 328 HKLLYNEVRNLLKWSFRQVIFENAP----WKLCYYG 359


>gi|46488451|gb|AAS99547.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488453|gb|AAS99548.1| aspartic protease PM5 [Plasmodium vivax]
          Length = 536

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/412 (22%), Positives = 156/412 (37%), Gaps = 84/412 (20%)

Query: 10  LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
           LC  +V+  S S+   S  L         ++++G++    YY + + IG P +   L LD
Sbjct: 27  LCALSVQGRSESTEGHSKDLLYK------YKLYGDIDEYAYYFLDIDIGTPEQRISLILD 80

Query: 70  TGSDLTWLQCDAPCVRC---VEAPHPLYR-PSNDLVPCEDPICASLHAPGHHNCEDPAQC 125
           TGS      C A C  C   +E P  L    ++ ++ CE+  C     P   NC    +C
Sbjct: 81  TGSSSLSFPC-AGCKNCGVHMENPFNLNNSKTSSILYCENEEC-----PFKLNCVK-GKC 133

Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG- 184
           +Y   Y +G    G    D  +    N +R+  R  +GC  ++     Y    G+LG+  
Sbjct: 134 EYMQSYCEGSQISGFYFSDVVSVVSYNNERVTFRKLMGCHMHEESLFLYQQATGVLGMSL 193

Query: 185 ---KGKSSIVSQLHSQK-LIRNVVGHCLSGGGGGFLFFGDD------------------- 221
              +G  + V+ L      ++ V   C+S  GG  +  G D                   
Sbjct: 194 SKPQGIPTFVNLLFDNAPQLKQVFTICISENGGELIAGGYDPAYIVRRRGSKSVSGQGSG 253

Query: 222 ----------------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
                           L ++ ++VW +++  Y  Y      ++F     +  K L ++ D
Sbjct: 254 PVSESLSESGEDPQVALREAEKIVWENVTRKYYYYIKVRGLDMFGTNMMSSSKGLEMLVD 313

Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
           SGS++T++    Y  L                       LC +      N +D  K  + 
Sbjct: 314 SGSTFTHIPEDLYNKLNYFFD-----------------ILCIQD---MNNAYDANKRLKM 353

Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-----GLQDLNV 372
              SF +   +  F+   ++   I  K N+C+ I++G +      GL DL V
Sbjct: 354 TNESFNNPLVQ--FDDFRKSLKSIIAKENMCVKIVDGVQCWKYLEGLPDLFV 403


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 70/151 (46%), Gaps = 16/151 (10%)

Query: 22  SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
           +SS S    N  GS +   V G    +G Y V + +G P R  ++ +D+GSD+ W+QC  
Sbjct: 106 ASSDSRYEVNDFGSDV---VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ- 161

Query: 82  PCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSS 137
           PC  C +   P++ P+       V C   +C  +   G H+      C YE+ Y DG  +
Sbjct: 162 PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHS----GGCRYEVMYGDGSYT 217

Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
            G L  +   F  T    +   +A+GCG+  
Sbjct: 218 KGTLALETLTFAKT----VVRNVAMGCGHRN 244


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 92/359 (25%), Positives = 135/359 (37%), Gaps = 60/359 (16%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---PLYRPSND----LVPC 103
           +++T+ I QP +   L +DTGSDL W QC         A H   P+Y P        +PC
Sbjct: 16  HSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPC 72

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
            D +C         NC    +C YE  Y    +++GVL  + F F       L  RL  G
Sbjct: 73  SDRLCQEGQF-SFKNCTSKNRCVYEDVYGS-AAAVGVLASETFTFGARRAVSL--RLGFG 128

Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG- 219
           CG   +   S     GILGL     S+++QL  Q+       +CL+         L FG 
Sbjct: 129 CG--ALSAGSLIGATGILGLSPESLSLITQLKIQRF-----SYCLTPFADKKTSPLLFGA 181

Query: 220 -DDL--YDSSRVVWT----SMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------- 262
             DL  + ++R + T    S   +   YY P V      G + G K L V          
Sbjct: 182 MADLSRHKTTRPIQTTAIVSNPVETVYYYVPLV------GISLGHKRLAVPAASLAMRPD 235

Query: 263 -----VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
                + DSGS+  YL    ++ +   +   +         ED  L      R     + 
Sbjct: 236 GGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAME 295

Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
            V+     L L F  G    L     + Y      G +CL +  G       +++IG +
Sbjct: 296 AVQ--VPPLVLHFDGGAAMVLPR---DNYFQEPRAGLMCLAV--GKTTDGSGVSIIGNV 347


>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 553

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 145/378 (38%), Gaps = 79/378 (20%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-------------VEAPHPLYRP--- 96
            T+ +G P   + + LDTGSDL W+ CD  C RC              +    +Y P   
Sbjct: 103 TTIELGTPGVKFMVALDTGSDLFWVPCD--CTRCSATRSSAFASALASDFDLSVYNPNGS 160

Query: 97  -SNDLVPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY-- 150
            ++  V C + +C       H N C    + C Y + Y    +S  G+LV+D        
Sbjct: 161 STSKKVTCNNSLCT------HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPD 214

Query: 151 TNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
            N   +   +  GCG  QV   S+  +   +G+ GLG  K S+ S L  +    +    C
Sbjct: 215 DNHDLVEANVIFGCG--QVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMC 272

Query: 208 LSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKN 259
               G G + FGD           S+  D T +        Y+  + ++  G     ++ 
Sbjct: 273 FGRDGIGRISFGDK---------GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVE- 322

Query: 260 LPVVFDSGSSYTYLNRVTYQTLT-SIMKK----------------ELSAKSLKEAPEDET 302
              +FDSG+S+TYL   TY  L+ S+  K                E+         ED  
Sbjct: 323 FTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDRR 382

Query: 303 LPLCWKGRRPFKNVHDVKKCFRTL---ALSFTDGKTRTLFELTPEAYLIISNKGNV--CL 357
            P     R PF   +D+     T    ++S T G         P   +IIS +  +  CL
Sbjct: 383 RPP--DSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDP--IIIISTQSELVYCL 438

Query: 358 GILNGAEVGLQDLNVIGG 375
            ++  AE+ +   N + G
Sbjct: 439 AVVKSAELNIIGQNFMTG 456


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 69/150 (46%), Gaps = 16/150 (10%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           SS S    N  GS +   V G    +G Y V + +G P R  ++ +D+GSD+ W+QC  P
Sbjct: 106 SSDSRYEVNDFGSDI---VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ-P 161

Query: 83  CVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSL 138
           C  C +   P++ P+       V C   +C  +   G H+      C YE+ Y DG  + 
Sbjct: 162 CKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHS----GGCRYEVMYGDGSYTK 217

Query: 139 GVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
           G L  +   F  T    +   +A+GCG+  
Sbjct: 218 GTLALETLTFAKT----VVRNVAMGCGHRN 243


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 68/124 (54%), Gaps = 12/124 (9%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
           +G Y + + IG+P++ +++ +DTGSD+ WLQC  PC  C +   P++ P++      + C
Sbjct: 157 SGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQC-KPCDDCYQQVDPIFDPASSSSFSRLGC 215

Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
           + P C +L      N      C Y++ Y DG  ++G    +  +F   N   ++ ++A+G
Sbjct: 216 QTPQCRNLDVFACRN----DSCLYQVSYGDGSYTVGDFATETVSFG--NSGSVD-KVAIG 268

Query: 164 CGYN 167
           CG++
Sbjct: 269 CGHD 272


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 122/280 (43%), Gaps = 46/280 (16%)

Query: 17  MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTW 76
           ++S  +  +S +L NH  ++ LF   GN      + V +  G P + + L LDTGS +TW
Sbjct: 99  INSKCNQYTSGNLKNHAHNNNLFDEDGN------FLVDVAFGTPPQKFKLILDTGSSITW 152

Query: 77  LQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGG 135
            QC A CV C++  H  +          D + +S ++ G  +C        Y + Y D  
Sbjct: 153 TQCKA-CVHCLKDSHRHF----------DSLASSTYSFG--SCIPSTVGNTYNMTYGDKS 199

Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
           +S+G    D      ++   +  +   GCG N   G      DG+LGLG+G+ S VSQ  
Sbjct: 200 TSVGNYGCDTMTLEPSD---VFQKFQFGCGRNN-EGDFGSGADGMLGLGQGQLSTVSQTA 255

Query: 196 SQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSS-------DYTKYYSPGVA 246
           S+   + V  +CL      G  LF       SS + +TS+ +       + + YY   + 
Sbjct: 256 SK--FKKVFSYCLPEENSIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLL 313

Query: 247 ELFFGGETTGLKNLP--------VVFDSGSSYTYLNRVTY 278
           ++  G +     N+P         + DSG+  T L +  Y
Sbjct: 314 DISVGNKRL---NIPSSVFASPGTIIDSGTVITRLPQRAY 350


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 87/358 (24%), Positives = 140/358 (39%), Gaps = 69/358 (19%)

Query: 45  VYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWL---------QCDAPCVRCVEAPHPL 93
           +YP  Y  Y  T  +G P +P  + LDTGS LTW+          C +P    V   HP 
Sbjct: 95  LYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPK 154

Query: 94  YRPSNDLVPCEDPICASLHAPGH-HNCEDP----AQCD--------YELEYADGGSSLGV 140
              S+ LV C +P C  +H+  H   C  P    A C         Y + Y   GS+ G+
Sbjct: 155 NSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGS-GSTAGL 213

Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
           L+ D          R      LGC    V    + P  G+ G G+G  S+ +QL   K  
Sbjct: 214 LIADTL----RAPGRAVSGFVLGCSLVSV----HQPPSGLAGFGRGAPSVPAQLGLSKF- 264

Query: 201 RNVVGHCL--------SGGGGGFLFFGDDLYDSSRVVWTSMSSD---YTKYYSPGVAELF 249
                +CL        +   G  +  GD+       +  S + D   Y  YY   ++ + 
Sbjct: 265 ----SYCLLSRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVT 320

Query: 250 FGGETTGLK----------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
            GG+   L           +   + DSG+++TYL+   +Q +   +   +  +  +    
Sbjct: 321 VGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDV 380

Query: 300 DETLPL--CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV 355
           +E L L  C+   +  K++         L+L F  G    + +L  E Y +++ +  V
Sbjct: 381 EEGLGLHPCFALPQGAKSM-----ALPELSLHFKGG---AVMQLPLENYFVVAGRAPV 430


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 11/154 (7%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
           G Y +++ +G P        DTGSDL W QC  PC +C +   PL+ P +      + C+
Sbjct: 91  GEYLMSLSLGTPPFEILAIADTGSDLIWTQC-TPCDKCYKQIAPLFDPKSSKTYRDLSCD 149

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
              C +L      +C     C Y   Y D   + G L  D      TNG  +  P+  +G
Sbjct: 150 TRQCQNLGE--SSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIG 207

Query: 164 CGYNQVPGASYHPLD-GILGLGKGKSSIVSQLHS 196
           CG       ++   D GI+GLG G  S++SQ+ S
Sbjct: 208 CGRRN--NGTFDKKDSGIIGLGGGPMSLISQMGS 239


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 128/322 (39%), Gaps = 36/322 (11%)

Query: 68  LDTGSDLTWLQC-DAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDP 122
           +DT SD+ W+QC   P  +C     PLY P+       +PC  P C  L +   + C   
Sbjct: 173 VDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPT 232

Query: 123 A-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGIL 181
             +C Y + Y DG ++ G  V D    + T   +       GC +  V G+  +   GIL
Sbjct: 233 TDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVK---DFRFGCSH-AVRGSFSNQNAGIL 288

Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGGFLFFGDDLYDSSRVVWTSM--SSDYT 238
            LG G+ S++ Q  +     N   +C+      GFL  G  +  S +  +T +  +    
Sbjct: 289 ALGGGRGSLLEQ--TADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNKHAP 346

Query: 239 KYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
            +Y   +  +   G+   +         V DSG+  T L    Y  L +  +  ++A   
Sbjct: 347 TFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGP 406

Query: 295 KEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN 354
             AP    L  C+     F    DVK     ++L F  G T    +L P + ++     +
Sbjct: 407 LAAPV-RNLDTCYD----FTRFPDVK--VPKVSLVFAGGAT---LDLEPASIIL-----D 451

Query: 355 VCLGILNGAEVGLQDLNVIGGI 376
            CL     A  G + +  IG +
Sbjct: 452 GCLAF--AATPGEESVGFIGNV 471


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 9/124 (7%)

Query: 49  GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDL--VPCE 104
           G Y + + +G P    +  +DTGSDL W QC  PC  C     P++ P  SN    +PC+
Sbjct: 48  GDYLMKLTLGTPPVDVYGLVDTGSDLVWAQC-TPCQGCYRQKSPMFEPLRSNTYTPIPCD 106

Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALG 163
              C SL     H+C     C Y   YAD   + GVL ++   F+ T+G+ +    +  G
Sbjct: 107 SEECNSLFG---HSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFG 163

Query: 164 CGYN 167
           CG++
Sbjct: 164 CGHS 167


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 118/292 (40%), Gaps = 47/292 (16%)

Query: 23  SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
           S +S++  +   S+ L    G    +G Y VT+ IG P     L  DTGSDLTW QC+ P
Sbjct: 104 SKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-P 162

Query: 83  CV-RCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSS 137
           C+  C     P + PS+      V C  P+C    +    NC       Y + Y D   +
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCV------YSIVYGDKSFT 216

Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ---- 193
            G L K+ F    TN   L   +  GCG N           G+     G   +       
Sbjct: 217 QGFLAKEKFTL--TNSDVLE-DVYFGCGENN---------QGLFDGVAGLLGLGPGKLSL 264

Query: 194 -LHSQKLIRNVVGHCL---SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
              +     N+  +CL   +    G L FG     S  V +T +SS +   ++ G+  + 
Sbjct: 265 PAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI-SESVKFTPISS-FPSAFNYGIDII- 321

Query: 250 FGGETTGLKNLPV----------VFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
             G + G K L +          + DSG+ +T L    Y  L S+ K+++S+
Sbjct: 322 --GISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 32/273 (11%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
           + V +  G PA+   + LDTGSDL+W+QC      C     P + P+       VPC  P
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196

Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
           +CA+  A G  N      C Y ++Y DG S+ GVL +D   FN ++          GCG 
Sbjct: 197 VCAA--AGGMCNG---TTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFT---GFTFGCGE 248

Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYD 224
             +    +  +DG+LGLG+GK S+ SQ  +      V  +CL       G+L  G     
Sbjct: 249 KNI--GDFGEVDGLLGLGRGKLSLPSQ--AAPSFGGVFSYCLPSYNTTPGYLNIGATKPT 304

Query: 225 SS-RVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTYLN 274
           S+  V +T+M     Y  +Y   +  +  GG    L   P VF       DSG+  TYL 
Sbjct: 305 STVPVQYTAMIKKPQYPSFYFIELVSINIGGYI--LPVPPSVFTKTGTLLDSGTILTYLP 362

Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
              Y +L    K  +     K AP  E L  C+
Sbjct: 363 PPAYTSLRDRFKFTMQGN--KPAPPYEPLDTCY 393


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 66/143 (46%), Gaps = 13/143 (9%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
           V++ IG P +   + LDTGS L+W+QC  P      A  PL   S  ++PC   +C    
Sbjct: 80  VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKP-R 138

Query: 113 APGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV 169
            P +    +C+    C Y   YADG  + G LV++ F F   +  +  P L LGC  +  
Sbjct: 139 VPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF---SSSQTTPPLILGCATDS- 194

Query: 170 PGASYHPLDGILGLGKGKSSIVS 192
                    GILG+  G+ S  S
Sbjct: 195 -----SDTQGILGMNLGRLSFSS 212


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 77/172 (44%), Gaps = 18/172 (10%)

Query: 57  IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDPICASLH 112
           +G P+   +   DTGS+L WLQC  PC  C     P++ P+     + V  + PIC ++ 
Sbjct: 63  LGVPSTLVYGIADTGSELIWLQC-LPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVR 121

Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPG 171
                  E    C Y+  Y DG ++ G L  D FAF       +    L  GC ++    
Sbjct: 122 RISCR--EGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKAR 179

Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
              H   G++GL +  +S+VSQL  +K       +C+      G G  ++FG
Sbjct: 180 LKGHQA-GVVGLNRHPNSLVSQLKVKKF-----SYCMVIPDDHGSGSRMYFG 225


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/335 (24%), Positives = 137/335 (40%), Gaps = 50/335 (14%)

Query: 53  VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
           V + IG P     L +DT SDL W+QC  PC+ C     P++ PS       +    S +
Sbjct: 87  VNISIGSPPITQLLHMDTASDLLWIQC-LPCINCYAQSLPIFDPSRSYTHRNETCRTSQY 145

Query: 113 A-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL---ALGCGYNQ 168
           + P      +   C+Y + Y D   S G+L ++   FN    +  +  L     GCG++ 
Sbjct: 146 SMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDN 205

Query: 169 VPGASYHPL--DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDD 221
                  PL   GILGLG G+ S+V +   +        +C             L  GDD
Sbjct: 206 YG----EPLVGTGILGLGYGEFSLVHRFGKK------FSYCFGSLDDPSYPHNVLVLGDD 255

Query: 222 ----LYDSS---------RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
               L D++          V   ++S D      P    +F     TGL     + D+G+
Sbjct: 256 GANILGDTTPLEIHNGFYYVTIEAISVD--GIILPIDPRVFNRNHQTGLGG--TIIDTGN 311

Query: 269 SYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPL-CWKGRRPFKNVHDVKKCFRTL 326
           S T L    Y+ L + ++     + +  +  +D+ + + C+ G   F+    V+  F  +
Sbjct: 312 SLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGN--FER-DLVESGFPIV 368

Query: 327 ALSFTDGK-----TRTLF-ELTPEAYLIISNKGNV 355
              F++G       ++LF +L+P  + +    GN+
Sbjct: 369 TFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNL 403


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 106/262 (40%), Gaps = 29/262 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
           G    T  Y +T+ IG PA    + +DTGSD++W+QC  PC +C      L+ P    + 
Sbjct: 114 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSSSSTY 172

Query: 99  DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
               C    CA L      N    +QC Y + Y D  S+ G    D      T G     
Sbjct: 173 SPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTL----TLGSSAMT 228

Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
               GC  ++  G +    DG++GLG G  S+ SQ  +         +CL  + G  GFL
Sbjct: 229 DFQFGCSQSESGGFNDQ-TDGLMGLGGGAQSLASQ--TAGTFGTAFSYCLPPTSGSSGFL 285

Query: 217 FFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPV-------VFDSG 267
             G     SS  V T M  S+    YY   +  +  G +     NLP        + DSG
Sbjct: 286 TLGT---GSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQL---NLPTSVFSAGSLMDSG 339

Query: 268 SSYTYLNRVTYQTLTSIMKKEL 289
           +  T L    Y  L+S  K  +
Sbjct: 340 TIITRLPPTAYSALSSAFKAGM 361


>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 547

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 79/185 (42%), Gaps = 13/185 (7%)

Query: 41  VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
           V+GNV   GYY   + IG P +     LDTGS L    C   C RC  +   +++P    
Sbjct: 71  VYGNVPELGYYYTYLTIGTPGQTVSGILDTGSTLPAFPCSG-CTRCGPSKTGMFKPELSS 129

Query: 97  SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
           ++    C D  C      G ++C  +  QC Y + Y +G S+ G L +D  A     G  
Sbjct: 130 TSSTFGCSDARCFC----GANSCSCNNEQCGYSIRYLEGSSTSGFLAEDMLAVG-DGGPA 184

Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
            N     GC  ++         DG+ G+G+  +S+  QL  Q +I +    C      G 
Sbjct: 185 AN--FVFGCAQSESGLLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREGV 242

Query: 216 LFFGD 220
           L  G+
Sbjct: 243 LLLGN 247


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/257 (28%), Positives = 107/257 (41%), Gaps = 31/257 (12%)

Query: 48  TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
           +G Y V + IG P     L  DTGSD+ W+QC +PC  C     PL+ P+N      VPC
Sbjct: 120 SGEYLVRVGIGSPPLEQHLVADTGSDVIWVQC-SPCSDCYAQGDPLFDPANSASFSPVPC 178

Query: 104 EDPIC-ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
              +C A+             +C+Y++ Y D   + GVL  +       +G      +A+
Sbjct: 179 NSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL---DGGTEVQGVAM 235

Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS------GGGGGFL 216
           GCG+       +    G+LGLG G  S+V QL           +CL+      G G G L
Sbjct: 236 GCGHENR--GLFAEAAGLLGLGWGPMSLVGQLGGAAG--GAFSYCLAGYYSGEGSGSGSL 291

Query: 217 FFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----------NLPVVF 264
             G +    +  VW  +  + D   +Y  GV  L   GE   L+             VV 
Sbjct: 292 VLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVM 351

Query: 265 DSGSSYTYLNRVTYQTL 281
           D+G++ T L    Y  L
Sbjct: 352 DTGTAVTRLPAEAYAAL 368


>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
          Length = 585

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/254 (26%), Positives = 110/254 (43%), Gaps = 36/254 (14%)

Query: 54  TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------- 100
           T+ +G P + + + LDTGSDL W+ CD  C RC       Y    +L             
Sbjct: 106 TVSLGTPGKKFLVALDTGSDLFWVPCD--CSRCAPTEGTTYASDFELSIYNPKGSSTSRK 163

Query: 101 VPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR-- 155
           V C + +CA      H N C    + C Y + Y    +S  G+LV+D       + ++  
Sbjct: 164 VTCNNSLCA------HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEF 217

Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
           +   +  GCG  QV   S+  +   +G+ GLG  K S+ S L  +    +    C    G
Sbjct: 218 VEAYVTFGCG--QVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDG 275

Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
            G + FGD           ++++ +   Y+  V ++  G     L +   +FDSG+S+TY
Sbjct: 276 IGRISFGDKGGPDQEETPFNLNALHPT-YNITVTQVRVGTTLIDL-DFTALFDSGTSFTY 333

Query: 273 LNRVTYQTLTSIMK 286
           L    Y   T+++K
Sbjct: 334 LVDPIY---TNVLK 344


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 105/264 (39%), Gaps = 30/264 (11%)

Query: 43  GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDL 100
           G+   T  Y +T+ IG PA    + +DTGSD++W+ C A   R        + P  S+  
Sbjct: 117 GSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHA---RAGAGSSLFFDPGKSSTY 173

Query: 101 VP--CEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
            P  C    C  L   G  N C   + C Y + Y DG ++ G    D  A N T      
Sbjct: 174 TPFSCSSAACTRLE--GRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEKVE-- 229

Query: 158 PRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GG 213
                GC     PG        DG++GLG G  S+VSQ  +     +   +CL       
Sbjct: 230 -NFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQ--TAATYGSAFSYCLPATTRSS 286

Query: 214 GFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF------D 265
           GFL  G     +S  V T M  S     +Y   +  +  GG+   +   P VF      D
Sbjct: 287 GFLTLGAST-GTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAIS--PTVFAAGSIMD 343

Query: 266 SGSSYTYLNRVTYQTLTSIMKKEL 289
           SG+  T L    Y  L++  +  +
Sbjct: 344 SGTIITRLPPRAYSALSAAFRAGM 367


>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 83/193 (43%), Gaps = 26/193 (13%)

Query: 51  YNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP----S 97
           Y   + +G P   + + LDTGSDL WL C+    C+R +E        P  LY P    +
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161

Query: 98  NDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
           +  + C D  C      G   C  P+  C Y++ Y++   + G L++D      T  + L
Sbjct: 162 SSSIRCSDKRCF-----GSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHL-ATEDENL 215

Query: 157 NP---RLALGCGYNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-- 210
            P    + LGCG  Q       + ++G+LGLG    S+ S L    +  N    C     
Sbjct: 216 TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVI 275

Query: 211 GGGGFLFFGDDLY 223
           G  G + FGD  Y
Sbjct: 276 GNVGRISFGDRGY 288


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 124/317 (39%), Gaps = 32/317 (10%)

Query: 55  MYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICAS 110
           M +G PA  Y + +DTGS LTWLQC    V C     P++ P +      V C    C+ 
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60

Query: 111 LHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
           L +   +   C     C Y+  Y D   S+G L KD  +F  T+     P    GCG + 
Sbjct: 61  LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----LPNFYYGCGQDN 116

Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLYD 224
                +    G++GL + K S++ QL     +     +CL    S G      +    Y 
Sbjct: 117 --EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSYNPGQYS 172

Query: 225 SSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
            + +V +S+     + K     VA       ++   +LP + DSG+  T L    Y  L+
Sbjct: 173 YTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALS 232

Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELT 342
             +   +  K    A     L  C+KG+    +   V        +SF  G      +L+
Sbjct: 233 KAVAAAM--KGTSRASAYSILDTCFKGQASRVSAPAVT-------MSFAGGAA---LKLS 280

Query: 343 PEAYLIISNKGNVCLGI 359
            +  L+  +    CL  
Sbjct: 281 AQNLLVDVDDSTTCLAF 297


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 87/321 (27%), Positives = 125/321 (38%), Gaps = 71/321 (22%)

Query: 39  FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
            + H NV       V++ +G P +   + LDTGS+L+WL C     R   A    +RP  
Sbjct: 53  LRFHHNVS----LTVSLAVGTPPQNVTMVLDTGSELSWLLCAT--GRAAAAAADSFRPRA 106

Query: 99  D----LVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
                 VPC    C+S   P   +C+  + +C   L YADG +S G L  D FA     G
Sbjct: 107 SATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV----G 162

Query: 154 QRLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
                R A GC    Y+  P A      G+LG+ +G  S V+Q  +++       +C+S 
Sbjct: 163 DAPPLRSAFGCMSAAYDSSPDAVAT--AGLLGMNRGALSFVTQASTRRF-----SYCISD 215

Query: 211 -GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG---------GETTGLKN 259
               G L  G  DL          +  +YT  Y P     +F          G   G K 
Sbjct: 216 RDDAGVLLLGHSDL--------PFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKP 267

Query: 260 LPV---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
           LP+               + DSG+ +T+L    Y  + +   K+   K L  A ED +  
Sbjct: 268 LPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQ--TKPLLPALEDPSF- 324

Query: 305 LCWKGRRPFKNVHDVKKCFRT 325
                   F+   D   CFR 
Sbjct: 325 -------AFQEAFDT--CFRV 336


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.137    0.426 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,704,231,719
Number of Sequences: 23463169
Number of extensions: 311035630
Number of successful extensions: 782344
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 577
Number of HSP's successfully gapped in prelim test: 1262
Number of HSP's that attempted gapping in prelim test: 778031
Number of HSP's gapped (non-prelim): 2095
length of query: 380
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 236
effective length of database: 8,980,499,031
effective search space: 2119397771316
effective search space used: 2119397771316
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)