BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 015211
         (411 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/401 (76%), Positives = 349/401 (87%), Gaps = 2/401 (0%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNAT-SWPAKKSFEYYQVLL 66
           +++ V   L     AE V FS++LIHRFS+EVKAL VS+  + + SWP KKS +YYQ+L+
Sbjct: 18  LFILVMASLLIDKSAE-VTFSSRLIHRFSDEVKALRVSRKDSLSYSWPEKKSMDYYQILV 76

Query: 67  SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDL 126
           +SD Q+QKMK GPQ+Q LFPSQGSKTMSLG+DFGWLHYTWIDIGTP+VSFLVALDAGSDL
Sbjct: 77  NSDFQRQKMKLGPQYQFLFPSQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALDAGSDL 136

Query: 127 LWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 186
           LW+PCDC++CAPLSASYY+SLDRDLNEYSPS SSTSKHLSCSH+LC+LG +C +PKQPCP
Sbjct: 137 LWVPCDCLQCAPLSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCP 196

Query: 187 YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 246
           Y+MDYYTENTSSSGLLVEDILHL S GDNAL  SV+A V+IGCGMKQSGGYLDGVAPDGL
Sbjct: 197 YSMDYYTENTSSSGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGL 256

Query: 247 IGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYIT 306
           +GLGL EISVPS LAKAGLIRNSFSMCFD+DDSGRIFFGDQGP TQQST FL  +G Y T
Sbjct: 257 MGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTT 316

Query: 307 YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
           Y++GVE  C+GSSCLKQTSF+A+VD+G+SFTFLP  VYE I  EFDRQVN TI+SF GYP
Sbjct: 317 YVVGVEGFCVGSSCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATISSFNGYP 376

Query: 367 WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           WK CYKSSS  L K+PSVKL+FP NNSFV++NPVF+IYG Q
Sbjct: 377 WKYCYKSSSNHLTKVPSVKLIFPLNNSFVIHNPVFMIYGIQ 417


>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
 gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
          Length = 492

 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 288/388 (74%), Positives = 341/388 (87%), Gaps = 2/388 (0%)

Query: 22  AETVMFSTKLIHRFSEEVKALGVSK--NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGP 79
            E   FS++LIHRFS+E K + VS+  + N T WP KKS EYYQ+L+SSD+++QK+K GP
Sbjct: 15  VELATFSSRLIHRFSKEYKEVSVSRGGDVNGTWWPEKKSKEYYQILVSSDLKRQKLKLGP 74

Query: 80  QFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPL 139
            +Q+LFPSQGSKTMSLGNDFGWLHYTWIDIGTP+VSF+VALD+GSDL W+PCDCV+CAPL
Sbjct: 75  HYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALDSGSDLFWVPCDCVQCAPL 134

Query: 140 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 199
           SAS+Y+SLDRDL+EYSPS SSTSK LSCSHRLCD+G +C+NPKQ CPY+++YYTE+TSSS
Sbjct: 135 SASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSS 194

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           GLLVEDI+HL SGGD+ L  SV+A VIIGCGMKQSGGYLDGVAPDGL+GLGL EISVPS 
Sbjct: 195 GLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDGVAPDGLLGLGLQEISVPSF 254

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           LAKAGLI+NSFSMCF++DDSGRIFFGDQGPATQQS  FL  NG Y TYI+GVE CC+G+S
Sbjct: 255 LAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVEVCCVGTS 314

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
           CLKQ+SF A+VDSG+SFTFLP +V+E IA EFD QVN + +SFEGY WK CYK+SSQ LP
Sbjct: 315 CLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLP 374

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           K+PS++L+FPQNNSF+V NPVF+IYG Q
Sbjct: 375 KIPSLRLIFPQNNSFMVQNPVFMIYGIQ 402


>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
          Length = 530

 Score =  589 bits (1519), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 279/400 (69%), Positives = 337/400 (84%), Gaps = 2/400 (0%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLS 67
           + ++V  LL ES  A   MFS +LIHRFS+EVKA   +++  + SWP  ++ EYY++L+ 
Sbjct: 7   VAMSVVVLLIESCMA--AMFSARLIHRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVR 64

Query: 68  SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLL 127
           SD ++QK+  G ++Q LFPS+GSKTMS GND+GWLHYTWIDIGTPN+SFLVALDAGSDLL
Sbjct: 65  SDWERQKVMLGSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLL 124

Query: 128 WIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPY 187
           WIPCDC++CAPLSASYY SLDRDLN+YSPS SSTSKHLSCSH+LC+   +C +PKQ CPY
Sbjct: 125 WIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPY 184

Query: 188 TMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 247
           T++YY+ENTSSSGLL+EDILHL SG D+A  +SV+A VIIGCGM+Q+GGYLDGVAPDGL+
Sbjct: 185 TINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLM 244

Query: 248 GLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITY 307
           GLGLGEISVPS L+KAGL++NSFS+CF+ DDSGRIFFGDQG ATQQ+T FL S+GKY TY
Sbjct: 245 GLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETY 304

Query: 308 IIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
           I+GVE CCIGSSC+KQTSF+A+VDSG+SFTFLP E Y  +  EFD+QVN T  SFEGYPW
Sbjct: 305 IVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPW 364

Query: 368 KCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           + CYKSSS+ L K PSV L F  NNSFVV+NPVFV++G Q
Sbjct: 365 EYCYKSSSKELLKNPSVILKFALNNSFVVHNPVFVVHGYQ 404


>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 511

 Score =  585 bits (1509), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 273/384 (71%), Positives = 328/384 (85%)

Query: 24  TVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQM 83
             MFS +LIHRFS+EVKA   +++  + SWP  ++ EYY++L+ SD ++QK+  G ++Q 
Sbjct: 2   AAMFSARLIHRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVRSDWERQKVMLGSKYQF 61

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           LFPS+GSKTMS GND+GWLHYTWIDIGTPN+SFLVALDAGSDLLWIPCDC++CAPLSASY
Sbjct: 62  LFPSEGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASY 121

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
           Y SLDRDLN+YSPS SSTSKHLSCSH+LC+   +C +PKQ CPYT++YY+ENTSSSGLL+
Sbjct: 122 YGSLDRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLI 181

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           EDILHL SG D+A  +SV+A VIIGCGM+Q+GGYLDGVAPDGL+GLGLGEISVPS L+KA
Sbjct: 182 EDILHLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKA 241

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL++NSFS+CF+ DDSGRIFFGDQG ATQQ+T FL S+GKY TYI+GVE CCIGSSC+KQ
Sbjct: 242 GLVKNSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQ 301

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
           TSF+A+VDSG+SFTFLP E Y  +  EFD+QVN T  SFEGYPW+ CYKSSS+ L K PS
Sbjct: 302 TSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNPS 361

Query: 384 VKLMFPQNNSFVVNNPVFVIYGTQ 407
           V L F  NNSFVV+NPVFV++G Q
Sbjct: 362 VILKFALNNSFVVHNPVFVVHGYQ 385


>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 532

 Score =  563 bits (1450), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 266/388 (68%), Positives = 329/388 (84%), Gaps = 4/388 (1%)

Query: 24  TVMFSTKLIHRFSEEVKALGVSKNRNAT---SWPAKKSFEYYQVLLSSDVQKQKMKTGPQ 80
           ++ F+++++HRFSEE+KAL  S + N +   SWP K S EYYQ L+S D ++QKMK G +
Sbjct: 21  SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKLGSR 80

Query: 81  FQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           FQ+LFPS+GSKT++LGNDFGWLHYTWIDIGTP+VSFLVALDAGSDLLW+PC+C++CAPLS
Sbjct: 81  FQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLS 140

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 200
           ASYY SLD+DLNEY PS+SSTSKH+SCSH LCD G SCQ+PKQ CPY +DY TENTSSSG
Sbjct: 141 ASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSG 200

Query: 201 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
           LL++D+LHL SG +N+   ++QA VI+GCGMKQSGGYL GVAPDGL GLGLGEISV S L
Sbjct: 201 LLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSL 260

Query: 261 AKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
           AK  L++NSFS+CF++D SGRIFFGD+GPA+QQ+TSF+  +GKY TYI+GVE CCI +SC
Sbjct: 261 AKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSC 320

Query: 321 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKSSSQRLP 379
           LKQTSFKA++DSG+SFT+LP+E YE I  EFD+++N T   SF+GYPWK CYK S+  +P
Sbjct: 321 LKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMP 380

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           K+PSV L+FP NNSFVV++PVF IYG Q
Sbjct: 381 KVPSVTLLFPLNNSFVVHDPVFPIYGDQ 408


>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like, partial [Cucumis sativus]
          Length = 408

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 265/388 (68%), Positives = 328/388 (84%), Gaps = 4/388 (1%)

Query: 24  TVMFSTKLIHRFSEEVKALGVSKNRNAT---SWPAKKSFEYYQVLLSSDVQKQKMKTGPQ 80
           ++ F+++++HRFSEE+KAL  S + N +   SWP K S EYYQ L+S D ++QKMK G +
Sbjct: 21  SITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGDFRRQKMKLGSR 80

Query: 81  FQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           FQ+LFPS+GS T++LGNDFGWLHYTWIDIGTP+VSFLVALDAGSDLLW+PC+C++CAPLS
Sbjct: 81  FQLLFPSEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLS 140

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 200
           ASYY SLD+DLNEY PS+SSTSKH+SCSH LCD G SCQ+PKQ CPY +DY TENTSSSG
Sbjct: 141 ASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSG 200

Query: 201 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
           LL++D+LHL SG +N+   ++QA VI+GCGMKQSGGYL GVAPDGL GLGLGEISV S L
Sbjct: 201 LLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSL 260

Query: 261 AKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
           AK  L++NSFS+CF++D SGRIFFGD+GPA+QQ+TSF+  +GKY TYI+GVE CCI +SC
Sbjct: 261 AKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSC 320

Query: 321 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKSSSQRLP 379
           LKQTSFKA++DSG+SFT+LP+E YE I  EFD+++N T   SF+GYPWK CYK S+  +P
Sbjct: 321 LKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMP 380

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           K+PSV L+FP NNSFVV++PVF IYG Q
Sbjct: 381 KVPSVTLLFPLNNSFVVHDPVFPIYGDQ 408


>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 521

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 266/386 (68%), Positives = 318/386 (82%), Gaps = 10/386 (2%)

Query: 25  VMFSTKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQ 82
           + FS +L+HRF++E+K +     R  T  WP ++S  YYQ+LL+ D+ ++K+K G  ++Q
Sbjct: 22  ITFSARLVHRFADEMKPV-----RPPTGYWPDQRSMRYYQMLLTGDILRRKIKVGGTRYQ 76

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
           +LFPS GSKTMSLGNDFGWLHYTWIDIGTP+ SFLVALDAGSDLLWIPCDCV+CAPLS+S
Sbjct: 77  LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSS 136

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
           YY++LDRDLNEYSPS S +SKHLSCSHRLCD G++C++ +Q CPY + Y +ENTSSSGLL
Sbjct: 137 YYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLL 196

Query: 203 VEDILHLISGGDNALKNS-VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           VEDILHL SGG   L NS VQA V++GCGMKQSGGYLDGVAPDGL+GLG GE SVPS LA
Sbjct: 197 VEDILHLQSGG--TLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLA 254

Query: 262 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 321
           K+GLI  SFS+CF++DDSGR+FFGDQGP +QQSTSFL  +G Y TYIIGVE+CCIG+SCL
Sbjct: 255 KSGLIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCL 314

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
           K TSFKA VDSG+SFTFLP  VY  I  EFD+QVN + +SFEG PW+ CY  SSQ LPK+
Sbjct: 315 KMTSFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWEYCYVPSSQDLPKV 374

Query: 382 PSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           PS  LMF +NNSFVV +PVFV YG +
Sbjct: 375 PSFTLMFQRNNSFVVYDPVFVFYGNE 400


>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 520

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 264/386 (68%), Positives = 318/386 (82%), Gaps = 10/386 (2%)

Query: 25  VMFSTKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQ 82
           + FS +L+HRF++E+K +     R  T  WP + S  YY++LL+ D+ ++K+K G  ++Q
Sbjct: 21  ITFSARLVHRFADEMKPV-----RPPTGYWPDRWSMGYYRMLLTGDILRRKIKVGGARYQ 75

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
           +LFPS GSKTMSLGNDFGWLHYTWIDIGTP+ SFLVALDAGSDLLWIPCDCV+CAPLS+S
Sbjct: 76  LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSS 135

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
           YY++LDRDLNEYSPS S +SKHLSCSH+LCD G++C++ +Q CPY + Y +ENTSSSGLL
Sbjct: 136 YYSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLL 195

Query: 203 VEDILHLISGGDNALKNS-VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           VEDILHL SGG  +L NS VQA V++GCGMKQSGGYLDGVAPDGL+GLG GE SVPS LA
Sbjct: 196 VEDILHLQSGG--SLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLA 253

Query: 262 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 321
           K+GLI +SFS+CF++DDSGRIFFGDQGP  QQSTSFL  +G Y TYIIGVE+CC+G+SCL
Sbjct: 254 KSGLIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCL 313

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
           K TSFK  VDSG+SFTFLP  VY  IA EFD+QVN + +SFEG PW+ CY  SSQ LPK+
Sbjct: 314 KMTSFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPSSQELPKV 373

Query: 382 PSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           PS+ L F QNNSFVV +PVFV YG +
Sbjct: 374 PSLTLTFQQNNSFVVYDPVFVFYGNE 399


>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 529

 Score =  528 bits (1361), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 253/383 (66%), Positives = 305/383 (79%), Gaps = 6/383 (1%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTG-PQFQMLF 85
           FS KL HRFSEE+K + V        WP +++  Y++ LL +D  + K+  G  + ++LF
Sbjct: 27  FSVKLFHRFSEEMKPVQVQTG----DWPDRRTLHYHEKLLRNDFLRHKINLGGARHKLLF 82

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
           PSQGSKTMS GNDFGWLHYTWIDIGTP+ SFLVALDAGSDLLW+PCDC+ CAPLSAS+Y+
Sbjct: 83  PSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCAPLSASFYS 142

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQP-CPYTMDYYTENTSSSGLLVE 204
           +LDRDLNEYSPS S +SKHLSCSHRLCD+G++C+  KQ  CPYT++Y ++NTSSSGLLVE
Sbjct: 143 NLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVE 202

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           DI HL SG  +   +SVQA V++GCGMKQSGGYLDG APDGLIGLG GE SVPS LAK+G
Sbjct: 203 DIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSG 262

Query: 265 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 324
           LIR+SFS+CF++DDSGR+FFGDQG   QQST FL  +G + TYI+GVETCCIG+SC K T
Sbjct: 263 LIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVT 322

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           SF A  DSG+SFTFLP   Y  IA EFD+QVN T ++F+G PW+ CY  SSQ+LPK+P++
Sbjct: 323 SFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSPWEYCYVPSSQQLPKIPTL 382

Query: 385 KLMFPQNNSFVVNNPVFVIYGTQ 407
            LMF QNNSFVV NPVFV Y  Q
Sbjct: 383 TLMFQQNNSFVVYNPVFVSYNEQ 405


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 254/398 (63%), Positives = 318/398 (79%), Gaps = 4/398 (1%)

Query: 5   SLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRN--ATSWPAKKSFEYY 62
           SL   L  + L+ +++ A  V FS+KLIHRFS+E KA  VS+N N  A SWP K+SF+YY
Sbjct: 5   SLIPLLMAYLLVVDAAIA--VTFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYY 62

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
           ++LLSSD+++QK+K G ++Q+LFPS+GS  + LGN+FGWLHYTWIDIGTPNVSFLVALDA
Sbjct: 63  RLLLSSDLKRQKLKLGAEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDA 122

Query: 123 GSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 182
           GSDLLW+PCDC++CAPLSASYY+ L RDLNEYSPS SSTSK LSC+ +LC+LG+ C++ K
Sbjct: 123 GSDLLWVPCDCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSK 182

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 242
            PCPY   YY+ENTSSSGLL+ED LHL    ++A ++SV ASVIIGCG KQSG + DG A
Sbjct: 183 DPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAA 242

Query: 243 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 302
           PDGL+GLG G++SVPSLLAKAGL+RN+FS+CFD + SG I FGDQG  TQ+STSF+   G
Sbjct: 243 PDGLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEG 302

Query: 303 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 362
           K++TY+I VE   +GSS LK   F+A+VDSG+SFTFLP E+YE I  EFD+QVN T +SF
Sbjct: 303 KFVTYLIEVEGYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSF 362

Query: 363 EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPV 400
           +G PWK CY SSSQ L  +P+V L+F  N SF+V+NPV
Sbjct: 363 KGSPWKYCYNSSSQELLNIPTVTLVFAMNQSFIVHNPV 400


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  526 bits (1355), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 250/382 (65%), Positives = 309/382 (80%), Gaps = 2/382 (0%)

Query: 21  GAETVMFSTKLIHRFSEEVKALGVSKNRN--ATSWPAKKSFEYYQVLLSSDVQKQKMKTG 78
            A  V FS+KLIHRFS+E KA  VS+N N  A SWP K+SF+YY++LLSSD+++QK+K G
Sbjct: 9   AAIAVTFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLG 68

Query: 79  PQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAP 138
            ++Q+LFPS+GS  + LGN+FGWLHYTWIDIGTPNVSFLVALDAGSDLLW+PCDC++CAP
Sbjct: 69  AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAP 128

Query: 139 LSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSS 198
           LSASYY+ L RDLNEYSPS SSTSK LSC+ +LC+LG+ C++ K PCPY   YY+ENTSS
Sbjct: 129 LSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSS 188

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
           SGLL+ED LHL    ++A ++SV ASVIIGCG KQSG + DG APDGL+GLG G++SVPS
Sbjct: 189 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 248

Query: 259 LLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
           LLAKAGL+RN+FS+CFD + SG I FGDQG  TQ+STSF+   GK++TY+I VE   +GS
Sbjct: 249 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS 308

Query: 319 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 378
           S LK   F+A+VDSG+SFTFLP E+YE I  EFD+QVN T +SF+G PWK CY SSSQ L
Sbjct: 309 SSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQEL 368

Query: 379 PKLPSVKLMFPQNNSFVVNNPV 400
             +P+V L+F  N SF+V+NPV
Sbjct: 369 LNIPTVTLVFAMNQSFIVHNPV 390


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 237/384 (61%), Positives = 307/384 (79%), Gaps = 5/384 (1%)

Query: 25  VMFSTKLIHRFSEEVKALGVSKNRNATS--WPAKKSFEYYQVLLSSDVQKQKMKTGPQ-F 81
           + FS+KLIHRFS+E K++ +S+  NA+   WP + SFEY+Q+LL +D+++Q+MK G Q  
Sbjct: 26  LTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDLKRQRMKLGSQKN 85

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
           Q+LFPSQGS+ +  GN+  WLHYTWIDIGTPNVSFLVALDAGSDLLW+PCDC++CAPLSA
Sbjct: 86  QLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSA 145

Query: 142 SYYN-SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT-ENTSSS 199
           SYYN SLDRDL+EYSPS SSTS+HLSC H+LC+ G++C+NPK PCPY  +Y   ENT+S+
Sbjct: 146 SYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFENTTSA 205

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G LVED LHL S GD+  +  +QASV++GCG KQ G + DG APDG++GLG G+ISVPSL
Sbjct: 206 GFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSL 265

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           LAKAGLI+N FS+CFD++DSGRI FGD+G A+QQST FL   G Y+ Y +GVE+ C+G+S
Sbjct: 266 LAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNS 325

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
           CLK++ FKA+VDSGSSFT+LP EVY  + +EFD+QVN    SF+   W  CY +SSQ L 
Sbjct: 326 CLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYNASSQELH 385

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVI 403
            +P+++L FP+N +FVV+NP + I
Sbjct: 386 DIPAIQLKFPRNQNFVVHNPTYSI 409


>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
 gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
 gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
 gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
          Length = 528

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 242/405 (59%), Positives = 313/405 (77%), Gaps = 10/405 (2%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLS 67
           +   V +L TE + A   +FS++LIHRFS+E +A  +    ++ S P K+S EYY++L  
Sbjct: 8   LLFCVLFLATEETLAS--LFSSRLIHRFSDEGRA-SIKTPSSSDSLPNKQSLEYYRLLAE 64

Query: 68  SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLL 127
           SD ++Q+M  G + Q L PS+GSKT+S GNDFGWLHYTWIDIGTP+VSFLVALD GS+LL
Sbjct: 65  SDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLL 124

Query: 128 WIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 186
           WIPC+CV+CAPL+++YY+SL  +DLNEY+PS+SSTSK   CSH+LCD  + C++PK+ CP
Sbjct: 125 WIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCP 184

Query: 187 YTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQASVIIGCGMKQSGGYLDGVAP 243
           YT++Y + NTSSSGLLVEDILHL    +N L N   SV+A V+IGCG KQSG YLDGVAP
Sbjct: 185 YTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAP 244

Query: 244 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA-SNG 302
           DGL+GLG  EISVPS L+KAGL+RNSFS+CFD++DSGRI+FGD GP+ QQST FL   N 
Sbjct: 245 DGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNN 304

Query: 303 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 362
           KY  YI+GVE CCIG+SCLKQTSF   +DSG SFT+LP+E+Y  +A E DR +N T  +F
Sbjct: 305 KYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNF 364

Query: 363 EGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           EG  W+ CY+SS++  PK+P++KL F  NN+FV++ P+FV   +Q
Sbjct: 365 EGVSWEYCYESSAE--PKVPAIKLKFSHNNTFVIHKPLFVFQQSQ 407


>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
          Length = 506

 Score =  500 bits (1288), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 244/411 (59%), Positives = 313/411 (76%), Gaps = 10/411 (2%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
           M   S+ I   V +L TE + A   +FS+++IHRFS+E +A  +    ++ S P K+S E
Sbjct: 1   MASRSVFILFCVLFLATEETLAS--VFSSRMIHRFSDEGRA-SIRTPSSSESLPEKQSLE 57

Query: 61  YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVAL 120
           YY++L  SD ++Q+M  G +FQ L PS+GSKT+S GNDFGWLHYTWIDIGTP+VSFLVAL
Sbjct: 58  YYRLLAKSDFRRQRMNLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVAL 117

Query: 121 DAGSDLLWIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 179
           D GSDLLWIPC+CV+CAPL+++YY+SL  +DLNEY+PS+SSTSK   CSH+LCD  + C+
Sbjct: 118 DTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCE 177

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQASVIIGCGMKQSGG 236
           +PK+ CPYT++Y + NTSSSGLLVEDILHL    +N L N   SV+A V+IGCG KQSG 
Sbjct: 178 SPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGD 237

Query: 237 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 296
           YLDGVAPDGL+GLG  EISVPS L+KAGL+RNSFS+CFD++DSGRI+FGD GP+ QQST 
Sbjct: 238 YLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTP 297

Query: 297 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           FL        YI+GVE CCIG+SCLKQTSF   +DSG SFT+LP+E+Y  +A E DR +N
Sbjct: 298 FLQLENNS-GYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHIN 356

Query: 357 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
            T  SFEG  W+ CY+SS +  PK+P++KL F  NN+FV++ P+FV   +Q
Sbjct: 357 ATSKSFEGVSWEYCYESSVE--PKVPAIKLKFSHNNTFVIHKPLFVFQQSQ 405


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score =  497 bits (1279), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 235/408 (57%), Positives = 316/408 (77%), Gaps = 11/408 (2%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
           M   S  I L +  L++E S A   +FS++LIHRFS+E    G +  ++  S+P K+SFE
Sbjct: 1   MASRSAFILLFILSLVSEKSLAS--LFSSRLIHRFSDE----GRASIKSPGSFPEKRSFE 54

Query: 61  YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVAL 120
           YY++L S D ++QKM  G +FQ L PS+GSKT+S GN FGWLHYTWIDIGTP+VSFLVAL
Sbjct: 55  YYRLLTSIDSRRQKMNLGAKFQSLVPSEGSKTISPGNYFGWLHYTWIDIGTPSVSFLVAL 114

Query: 121 DAGSDLLWIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 179
           D+GSDLLWIPC+CV+CAPLS++YY+SL  +DLNE+ PSAS+TSK   CSH+LC+   +C+
Sbjct: 115 DSGSDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACE 174

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           +PK+ CPYT+ Y +ENTSSSGLLVED+LHL    + +  +SV+A V++GCG KQSG +L 
Sbjct: 175 SPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANAS--SSVKARVVVGCGEKQSGEFLK 232

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 299
           G+APDG++GLG GEISVPS LAKAGL+RNSFSMCFD++DSGRI+FGD GP+TQQST FL 
Sbjct: 233 GIAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLP 292

Query: 300 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
              +++ Y +GVE CC+G+SCLKQ+SF  ++DSG SFTFLP+E+Y  +A E D  +N T+
Sbjct: 293 YKNEFVAYFVGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATV 352

Query: 360 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
              EG PW+ CY++S +  PK+P++KL F  NN+FV++ P+FV+  ++
Sbjct: 353 KKIEGGPWEYCYETSFE--PKVPAIKLKFSSNNTFVIHKPLFVLQRSE 398


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 241/404 (59%), Positives = 301/404 (74%), Gaps = 6/404 (1%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATS--WPAKKSFEYYQVL 65
           +++  F  L+  S   T  FS+KLIHRFSEE K+L +S N N +S  WP K SF+Y Q+L
Sbjct: 7   LFVICFCFLSNHSIGLT--FSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLL 64

Query: 66  LSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSD 125
           L +D+++QKMK G Q Q+LFPS GS T   GND  WLHYTWIDIGTPNVSFLVALDAGSD
Sbjct: 65  LDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSD 124

Query: 126 LLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPC 185
           L W+PCDC++CAPLSAS Y  LDRDL+EY PS S+TS+HLSC+H+LC+LG+ C+N K PC
Sbjct: 125 LSWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPC 184

Query: 186 PYTMDYYTENTSSSGLLVEDILHLISGGD--NALKNSVQASVIIGCGMKQSGGYLDGVAP 243
           PY  DY   NTSSSG LVEDILHL S  D  N+ +  VQASVI+GCG KQ+GGYLDG AP
Sbjct: 185 PYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAP 244

Query: 244 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGK 303
           DG++GLG G ISVPSLLAKAGLIR SFS+CFD + SG I FGDQG  +Q+ST  L + G 
Sbjct: 245 DGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGN 304

Query: 304 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 363
           Y  Y+I VE+ C+G+SCLKQ+ FKA+VDSG+SFT+LP +VY  I  EFD+QVN    S +
Sbjct: 305 YDAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQ 364

Query: 364 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
           G PW  CY +SS++L  +P+++L F  N S +++N  + +   Q
Sbjct: 365 GGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQ 408


>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 880

 Score =  484 bits (1245), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 223/391 (57%), Positives = 292/391 (74%), Gaps = 5/391 (1%)

Query: 21  GAETVMFSTKLIHRFSEEVKALGVSKNRNAT----SWPAKKSFEYYQVLLSSDVQKQKMK 76
           GA  V FS++LIHRFSEE KA   S+  + +    +WP + S EY+++LL SDV +Q+M+
Sbjct: 19  GAVGVTFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSDVTRQRMR 78

Query: 77  TGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRC 136
            G Q++ML+P +G +T   GN   WLHYTWIDIGTPNVSFLVALDAGSD+LW+PCDC+ C
Sbjct: 79  LGSQYEMLYPFEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIEC 138

Query: 137 APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENT 196
           A LSA  YN LDRDLN+Y PS S+TS+HL C H+LCD+ + C+  K PCPY + Y + NT
Sbjct: 139 ASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANT 198

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
           SSSG + ED LHL S G +A +NSVQAS+I+GCG KQ+G YL G  PDG++GLG G ISV
Sbjct: 199 SSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISV 258

Query: 257 PSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 316
           PSLLAKAGLI+NSFS+CF++++SGRI FGDQG  TQ ST FL  +GK+  YI+GVE+ C+
Sbjct: 259 PSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCV 318

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
           GS CLK+T F+A++DSGSSFTFLP EVY+ +  EFD+QVN T    +   W+ CY +SSQ
Sbjct: 319 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSIVLQN-SWEYCYNASSQ 377

Query: 377 RLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
            L  +P + L F +N ++++ NP+F+   +Q
Sbjct: 378 ELISIPPLNLAFSRNQTYLIQNPIFIDPASQ 408


>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 529

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 239/411 (58%), Positives = 305/411 (74%), Gaps = 10/411 (2%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
           M   S  I   V +L TE  G    +FS++LIHRFS+E +A  +    ++ S P K+S  
Sbjct: 1   MASRSAFILFCVLFLATE--GTLASVFSSRLIHRFSDEGRA-SIKTPSSSESLPEKQSLA 57

Query: 61  YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVAL 120
           YY++L  SD ++Q+M  G +FQ L PS+GSKT+S GNDFGWLHYTWIDIGTP+VSFLVAL
Sbjct: 58  YYRLLAKSDFRRQRMNLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVAL 117

Query: 121 DAGSDLLWIPCDCVRCAPLSASYYNSL-DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 179
           D GSDLLWIPC+CV+CAPL+++YY+SL  +DLNEY+PS+SS+SK   CSH+LC   + C 
Sbjct: 118 DTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCD 177

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SVQASVIIGCGMKQSGG 236
           +PK+ C YT+ Y + NTSSSGLLVEDILHL    +N L N   SV+A V++GCG KQSG 
Sbjct: 178 SPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGD 237

Query: 237 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 296
           YLDGVAPDGL+GLG  EISVPS L+KAGL+RNSFS+CFD++DSGRI+FGD GP+ QQS  
Sbjct: 238 YLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAP 297

Query: 297 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           FL        YI+GVE CCIG+SCLKQTSF   +DSG SFT+LP+E+Y  +A E DR +N
Sbjct: 298 FLQLENNS-GYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHIN 356

Query: 357 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
            T  SFEG  W+ CY+SS +  PK+P++KL F  NN+FV++ P+FV   +Q
Sbjct: 357 ATSKSFEGVSWEYCYESSVE--PKVPAIKLKFSHNNTFVIHKPLFVFQQSQ 405


>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 525

 Score =  470 bits (1209), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 220/385 (57%), Positives = 285/385 (74%), Gaps = 9/385 (2%)

Query: 21  GAETVMFSTKLIHRFSEEVKALGVSKNRNAT----SWPAKKSFEYYQVLLSSDVQKQKMK 76
           GA    FS++LIHRFSEE KA   S+   ++    +WP + S EY+++LL SDV +Q+M+
Sbjct: 19  GAVGATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSDVARQRMR 78

Query: 77  TGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRC 136
            G Q++ L+PS+G +T   GN   WLHYTWIDIGTPNVSFLVALDAGSD+LW+PCDC+ C
Sbjct: 79  LGSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIEC 138

Query: 137 APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENT 196
           A LSA  YN LDRDLN+Y PS S+TS+HL C H+LCD+ + C+  K PCPY + Y + NT
Sbjct: 139 ASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPYEVQYASANT 198

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
           SSSG + ED LHL S G +A +NSVQAS+I+GCG KQ+G YL G  PDG++GLG G ISV
Sbjct: 199 SSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNISV 258

Query: 257 PSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 316
           PSLLAKAGLI+NSFS+C D+++SGRI FGDQG  TQ ST FL      I Y++GVE+ C+
Sbjct: 259 PSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFL----PIIAYMVGVESFCV 314

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
           GS CLK+T F+A++DSGSSFTFLP EVY+ +  EFD+QVN +    +   W+ CY +SSQ
Sbjct: 315 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQS-SWEYCYNASSQ 373

Query: 377 RLPKLPSVKLMFPQNNSFVVNNPVF 401
            L  +P +KL F +N +F++ NP+F
Sbjct: 374 ELVNIPPLKLAFSRNQTFLIQNPIF 398


>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 530

 Score =  457 bits (1176), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 226/400 (56%), Positives = 300/400 (75%), Gaps = 8/400 (2%)

Query: 6   LTIYLAVFWLLTESSGAETVM---FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEY 61
           + + + ++ LL +    ETV+   FS+++IHRFS+E K  L  +   N  SWP + S EY
Sbjct: 1   MAVGVLLWLLLAKGFVLETVIAVTFSSRIIHRFSDEAKVHLRNNGGENVQSWPKRGSSEY 60

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           +++LL+SD+ +QKMK G Q Q  +PS+GSKT+S GNDF WLHYTWIDIGTPNVSFLVALD
Sbjct: 61  FRLLLNSDLTRQKMKLGSQDQSFYPSEGSKTLSFGNDFVWLHYTWIDIGTPNVSFLVALD 120

Query: 122 AGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 181
            GSD+ W+PCDC+ CAPLSA++YN+LDRDLN+YSPS SS+S+HL C H+LC+  ++C+  
Sbjct: 121 TGSDMFWVPCDCIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGF 180

Query: 182 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 241
           K  CPY  +Y ++NTSSSG L+ED LHL S  +NA KNS+QASVI+GCG KQSG +L+G 
Sbjct: 181 KDRCPYIKEYTSDNTSSSGFLIEDKLHLAS--NNATKNSIQASVILGCGRKQSGYFLEGA 238

Query: 242 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ-QSTSFLAS 300
           AP+G++GLG G ISVP+LLAKAGLIRNS S+C ++  SGRI FGDQG ATQ +ST FL  
Sbjct: 239 APNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSGRILFGDQGHATQRRSTPFLLD 298

Query: 301 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-I 359
           +G+ + Y +GVE  C+GS C K+T FKA +D+G+SFT+LPK VYET+ AEF++QV+ T I
Sbjct: 299 DGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRI 358

Query: 360 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 399
           TS     + CCY +SS+     P +K  F +N SF++ NP
Sbjct: 359 TSQIQSDFNCCYNASSRESNNFPPMKFTFSKNQSFIIQNP 398


>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 523

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 208/388 (53%), Positives = 285/388 (73%), Gaps = 6/388 (1%)

Query: 25  VMFSTKLIHRFSEEVKALGVSK---NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
           +  S  L+HRFS+E K+L  S+   N +A  WP   S +Y+Q+L+  D++++++  G ++
Sbjct: 22  LTLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLKYFQMLMDYDLKRRRLNIGSKY 81

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
            +LFPS+GS+ +  GN+F WLHYTWID+GTP+V FLVALD GSDLLW+PCDC++CAPLSA
Sbjct: 82  DVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSA 141

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 201
           +YY+ LDRDL+EY+P+ SSTSKHL C H+LC   T+C++   PC Y  DYY++NTS+SG 
Sbjct: 142 NYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGF 201

Query: 202 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           ++ED L L S   +   + +QASV+ GCG KQSG YLDG APDG++GLG G ISVP+LLA
Sbjct: 202 MIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLA 261

Query: 262 KAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 321
           + GL+RN+FS+CFD + SGRI FGD GPATQQ+T FL   G++  Y IGVE+ C+GSSCL
Sbjct: 262 QEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFIGVESFCVGSSCL 321

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQ--VNDTITSFEGYPWKCCYKSSSQRLP 379
           +++ F+A+VDSGSSFT+LP EVY+ I  EFD+Q  VN T       PW  CY  S+    
Sbjct: 322 QRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWNYCYNISTLVSF 381

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
            +PS++L+FP N  F +++PV+V+   Q
Sbjct: 382 NIPSMQLVFPLNQIF-IHDPVYVLPANQ 408


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 205/375 (54%), Positives = 266/375 (70%), Gaps = 7/375 (1%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           FS++++HR S+E +   +        WP + S  YY+ LL SD+Q+QK +   + Q+L  
Sbjct: 27  FSSRMVHRLSDEAR---LEAGPRMGLWPQRGSGGYYRALLRSDLQRQKRRLAGKNQLLSL 83

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           S+G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS SY  +
Sbjct: 84  SKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLS-SYRGN 142

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
           LDRDL  Y P+ S+TS+HL CSH LC  G+ C NPKQPC Y +DY++ENT+SSGLL+ED 
Sbjct: 143 LDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSENTTSSGLLIEDS 202

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           LHL S   +A    V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL+
Sbjct: 203 LHLNSREGHA---PVNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLV 259

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
           RNSFSMCF +D SGRIFFGDQG ++QQST F+   GK  TY + V+  CIG  CL+ +SF
Sbjct: 260 RNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSF 319

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
           +A+VDSG+SFT LP +VY+    EFD+Q+N +   +E   WK CY +S   +P +P++ L
Sbjct: 320 QALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIIL 379

Query: 387 MFPQNNSFVVNNPVF 401
            F  N SF   NP+ 
Sbjct: 380 AFAANKSFQAVNPIL 394


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 202/374 (54%), Positives = 268/374 (71%), Gaps = 6/374 (1%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S +++HR S+E +    ++         + S +Y++ L+ SD+Q+QK + G ++Q+L  S
Sbjct: 29  SARMVHRLSDEARLAAGARGGRRWP--RRGSGDYFRALVRSDLQRQKRRVGGKYQLLSLS 86

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           QG      GND GWL+YTW+D+GTPN SFLVALD GSDL W+PCDC++CAPLS SY+ SL
Sbjct: 87  QGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLS-SYHGSL 145

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y PS S+TS+HL CSH LC   + C NPKQPCPY +DY++ENT+SSGLL+ED+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           HL S   +A    V ASVIIGCG KQSG YL+G+APDGL+GLG+ +ISVPS LA+AGL+R
Sbjct: 206 HLDSREGHA---PVNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           NSFSMCF KDDSGRIFFGDQG  TQQST F+  NGK  TY + V+  CIG  C +   F+
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           A+VD+G+SFT LP + Y++I  EFD+Q+N +  S + Y ++ CY +    +P +P++ L 
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382

Query: 388 FPQNNSFVVNNPVF 401
           F +N SF   NP+ 
Sbjct: 383 FAENKSFQAVNPIL 396


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 202/374 (54%), Positives = 268/374 (71%), Gaps = 6/374 (1%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S +++HR S+E +    ++         + S +Y++ L+ SD+Q+QK + G ++Q+L  S
Sbjct: 29  SARMVHRLSDEARLAAGARGGRRWP--RRGSGDYFRALVRSDLQRQKRRVGGKYQLLSLS 86

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           QG      GND GWL+YTW+D+GTPN SFLVALD GSDL W+PCDC++CAPLS SY+ SL
Sbjct: 87  QGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLS-SYHGSL 145

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y PS S+TS+HL CSH LC   + C NPKQPCPY +DY++ENT+SSGLL+ED+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           HL S   +A    V ASVIIGCG KQSG YL+G+APDGL+GLG+ +ISVPS LA+AGL+R
Sbjct: 206 HLDSREGHA---PVNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           NSFSMCF KDDSGRIFFGDQG  TQQST F+  NGK  TY + V+  CIG  C +   F+
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           A+VD+G+SFT LP + Y++I  EFD+Q+N +  S + Y ++ CY +    +P +P++ L 
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382

Query: 388 FPQNNSFVVNNPVF 401
           F +N SF   NP+ 
Sbjct: 383 FAENKSFQAVNPIL 396


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 206/380 (54%), Positives = 268/380 (70%), Gaps = 12/380 (3%)

Query: 24  TVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQM 83
           +   ST+++HR S+E +   ++   +   WP   S  YY+ L+ SD+Q+QK K     Q+
Sbjct: 71  SATLSTRMVHRLSDEAR---LAAGPHGARWPRHGSGGYYRALVRSDLQRQKRK----HQL 123

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           L  S+     S GNDFGWL+YTW+D+GTPN SF+VALD GSDL W+PCDC+ CAPL A Y
Sbjct: 124 LSVSEAGGIFSPGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPL-AGY 182

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
             +LDRDL  Y P+ S+TS+HL CSH LC  G+ C +PKQPCPY+ DY  ENT+SSGLL+
Sbjct: 183 RETLDRDLGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLI 242

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           EDILHL S   +A    V+ASV+IGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+A
Sbjct: 243 EDILHLDSRESHA---PVKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARA 299

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL+RNSFSMCF K+DSGRIFFGDQG + QQST F+   GKY TY + V+  C+G  C + 
Sbjct: 300 GLVRNSFSMCF-KEDSGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEA 358

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
           TSF+A+VDSG+SFT LP  VY+ +A EFD+QV+    + E   ++ CY +S  ++P +P+
Sbjct: 359 TSFEALVDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPT 418

Query: 384 VKLMFPQNNSFVVNNPVFVI 403
           V L F  N SF   NP  V+
Sbjct: 419 VTLTFAANKSFQAVNPTIVL 438


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 210/385 (54%), Positives = 272/385 (70%), Gaps = 9/385 (2%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGP-QFQMLFP 86
           ST++++R S+E +   ++       WP + S +YY+ L+ SD+Q+QK + G  + Q+L  
Sbjct: 135 STRMVYRLSDEAR---MAAGTRGARWPRRGSGDYYRSLVRSDLQRQKRRLGGGKHQLLSF 191

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           S+    +  GNDFGWL+YTW+D+GTPN SF+VALD GSDL WIPCDC+ CAPLS  Y+ S
Sbjct: 192 SKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCDCIECAPLSG-YHGS 250

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
           LDRDL  Y P+ S+TS+HL CSH LC LG+ C N KQPCPY   Y  ENT+SSGLLVEDI
Sbjct: 251 LDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDI 310

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           LHL S   +A    V+ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL+
Sbjct: 311 LHLDSRESHA---PVKASVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLV 367

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
           RNSFSMCF KD SGRIFFGDQG +TQQST F+   GK  TY + V+  C+G  C + TSF
Sbjct: 368 RNSFSMCFTKD-SGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSF 426

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
           +AIVDSG+SFT LP ++Y+ +A EFD+QVN +    E   +  CY +S   +P +P+V L
Sbjct: 427 QAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPTVTL 486

Query: 387 MFPQNNSFVVNNPVFVIYGTQVGVS 411
            F  N SF   NP F+++  +  V+
Sbjct: 487 TFAGNKSFQPVNPTFLLHDEEGAVA 511


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 201/374 (53%), Positives = 259/374 (69%), Gaps = 11/374 (2%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S++++HR S+E +   +        WP + S EYY+ L+ SD+Q+QK +      +L  S
Sbjct: 28  SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           +G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS  Y  +L
Sbjct: 81  KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y P+ S+TS+HL CSH LC     C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           HL    D+     V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++
Sbjct: 200 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           NSFSMCF +D SGRIFFGDQG  +QQST F+   GK  TY + V+  CIG  CL+ TSFK
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           A+VDSG+SFT LP +VY+    EFD+Q+N T   +E   WK CY +S   +P +P++ L 
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376

Query: 388 FPQNNSFVVNNPVF 401
           F  + S    NP+ 
Sbjct: 377 FAADKSLQAVNPIL 390


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score =  420 bits (1080), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 201/374 (53%), Positives = 259/374 (69%), Gaps = 11/374 (2%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S++++HR S+E +   +        WP + S EYY+ L+ SD+Q+QK +      +L  S
Sbjct: 28  SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           +G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS  Y  +L
Sbjct: 81  KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y P+ S+TS+HL CSH LC     C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           HL    D+     V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++
Sbjct: 200 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           NSFSMCF +D SGRIFFGDQG  +QQST F+   GK  TY + V+  CIG  CL+ TSFK
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           A+VDSG+SFT LP +VY+    EFD+Q+N T   +E   WK CY +S   +P +P++ L 
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376

Query: 388 FPQNNSFVVNNPVF 401
           F  + S    NP+ 
Sbjct: 377 FAADKSLQAVNPIL 390


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/374 (53%), Positives = 258/374 (68%), Gaps = 11/374 (2%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S++++HR S+E +   +        WP + S EYY+ L+ SD+Q+QK +      +L  S
Sbjct: 28  SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           +G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS  Y  +L
Sbjct: 81  KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y P+ S+TS+HL CSH LC     C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           HL    D+     V ASVIIGCG KQSG YLDG+APDGL+ LG+ +ISVPS LA+AGL++
Sbjct: 200 HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQ 256

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           NSFSMCF +D SGRIFFGDQG  +QQST F+   GK  TY + V+  CIG  CL+ TSFK
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           A+VDSG+SFT LP +VY+    EFD+Q+N T   +E   WK CY +S   +P +P++ L 
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376

Query: 388 FPQNNSFVVNNPVF 401
           F  + S    NP+ 
Sbjct: 377 FAADKSLQAVNPIL 390


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/371 (53%), Positives = 256/371 (69%), Gaps = 11/371 (2%)

Query: 31  LIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS 90
           ++HR S+E +   +        WP + S EYY+ L+ SD+Q+QK +      +L  S+G 
Sbjct: 1   MVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLSKGG 53

Query: 91  KTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRD 150
            T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS  Y  +LDRD
Sbjct: 54  STFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNLDRD 112

Query: 151 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
           L  Y P+ S+TS+HL CSH LC     C NPKQPCPY +DY++ENT+SSGLL+ED LHL 
Sbjct: 113 LRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLN 172

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
              D+     V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++NSF
Sbjct: 173 YREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSF 229

Query: 271 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 330
           SMCF +D SGRIFFGDQG  +QQST F+   GK  TY + V+  CIG  CL+ TSFKA+V
Sbjct: 230 SMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALV 289

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 390
           DSG+SFT LP +VY+    EFD+Q+N T   +E   WK CY +S   +P +P++ L F  
Sbjct: 290 DSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTFAA 349

Query: 391 NNSFVVNNPVF 401
           + S    NP+ 
Sbjct: 350 DKSLQAVNPIL 360


>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
          Length = 378

 Score =  303 bits (777), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 144/254 (56%), Positives = 180/254 (70%), Gaps = 3/254 (1%)

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y P+ S+TS+HL CSH LC     C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 3   DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           HL    D+     V ASVIIGCG KQSG YLDG+APDGL+GLG+ +ISVPS LA+AGL++
Sbjct: 63  HLNYREDHV---PVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 119

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           NSFSMCF +D SGRIFFGDQG  +QQST F+   GK  TY + V+  CIG  CL+ TSFK
Sbjct: 120 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 179

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           A+VDSG+SFT LP +VY+    EFD+Q+N T   +E   WK CY +S   +P +P++ L 
Sbjct: 180 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 239

Query: 388 FPQNNSFVVNNPVF 401
           F  + S    NP+ 
Sbjct: 240 FAADKSLQAVNPIL 253


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 157/404 (38%), Positives = 225/404 (55%), Gaps = 10/404 (2%)

Query: 4   ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVK--ALGVSKNRNATSWPAKKSFEY 61
            S ++++ +   +         +FS ++ HRFSE VK  + G      A +WPAK SFEY
Sbjct: 3   FSWSVFIVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEY 62

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           Y  L   D   +  +      +L  S G+ T  + +  G+LHYT + +GTP   FLVALD
Sbjct: 63  YAELAHRDRALRGRRLSDIDGLLTFSDGNSTFRISS-LGFLHYTTVSLGTPGKKFLVALD 121

Query: 122 AGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 181
            GSDL W+PCDC RCAP   + Y S D +L+ Y+P  SSTS+ ++C + LC     C   
Sbjct: 122 TGSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGT 180

Query: 182 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 241
              CPY + Y +  TS+SG+LVED+LHL +  ++  +  V+A V  GCG  Q+G +LD  
Sbjct: 181 FSNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIA 238

Query: 242 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASN 301
           AP+GL GLGL +ISVPS+L+K G   +SFSMCF  D  GRI FGD+G   Q+ T F   N
Sbjct: 239 APNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPF-NLN 297

Query: 302 GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 361
             + TY I V    +G++ L    F A+ DSG+SFT+L   +Y  +   F  Q  D+   
Sbjct: 298 ALHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRP 356

Query: 362 FEG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
            +   P++ CY  S  +    +PS+ L     + F V +P+ +I
Sbjct: 357 PDSRIPFEFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIII 400


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 155/403 (38%), Positives = 233/403 (57%), Gaps = 13/403 (3%)

Query: 7   TIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLL 66
           T++L    +L         +F+ ++ HRFS+EVK    S  R A  +P K SFEY+  L+
Sbjct: 9   TLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFNALV 67

Query: 67  SSD--VQKQKMKTGPQFQM--LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
             D  ++ +++          L  S G+ T  + +  G+LHYT + +GTP + F+VALD 
Sbjct: 68  LRDWLIRGRRLSESESESESSLTFSDGNSTSRISS-LGFLHYTTVKLGTPGMRFMVALDT 126

Query: 123 GSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 182
           GSDL W+PCDC +CAP   + Y S + +L+ Y+P  S+T+K ++C++ LC     C    
Sbjct: 127 GSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTF 185

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 242
             CPY + Y +  TS+SG+L+ED++HL +   N  +  V+A V  GCG  QSG +LD  A
Sbjct: 186 STCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAA 243

Query: 243 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 302
           P+GL GLG+ +ISVPS+LA+ GL+ +SFSMCF  D  GRI FGD+G + Q+ T F   N 
Sbjct: 244 PNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNP 302

Query: 303 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF 362
            +  Y I V    +G++ L    F A+ D+G+SFT+L   +Y T++  F  Q  D   S 
Sbjct: 303 SHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSP 361

Query: 363 EG-YPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVI 403
           +   P++ CY  S+     L PS+ L    N+ F +N+P+ VI
Sbjct: 362 DSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVI 404


>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 530

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 152/379 (40%), Positives = 211/379 (55%), Gaps = 17/379 (4%)

Query: 33  HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQM---LFPSQG 89
           HR+S  V+ L      +  + P   + EYY  L   D++++ +           L  + G
Sbjct: 31  HRYSAAVRGLA----GHLRAPPPAGTAEYYAALAGHDLRRRSLAAAAGGGGAGNLAFADG 86

Query: 90  SKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDR 149
           + T  L NDFG+LHY  + +GTPNV+FLVALD GSDL W+PCDC++CAPL++  Y  L  
Sbjct: 87  NDTYRL-NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGDLKF 145

Query: 150 DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
           D+  YSP  SSTS+ + CS  LCD    C      CPY++ Y +ENTSS G+LVED+L+L
Sbjct: 146 DM--YSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYL 203

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
            +  ++      QA +  GCG  QSG +L   AP+GL+GLG+   SVPSLLA  G+  NS
Sbjct: 204 TT--ESGQSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANS 261

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
           FSMCF +D  GRI FGD G + Q  T   +     Y  Y I +    +G      T F A
Sbjct: 262 FSMCFGEDGHGRINFGDTGSSDQLETPLNIYKQNPY--YNISITGAMVGGKSF-DTKFSA 318

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKLPSVKLM 387
           +VDSG+SFT L   +Y  I + F+ QV ++    +   P++ CY  S+Q     P++ L 
Sbjct: 319 VVDSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPPNISLT 378

Query: 388 FPQNNSFVVNNPVFVIYGT 406
               + F VN P+  I  T
Sbjct: 379 AKGGSIFPVNGPIITITDT 397


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 155/380 (40%), Positives = 219/380 (57%), Gaps = 10/380 (2%)

Query: 26  MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           +F+ K+ HRFS+ +K L  S +  + ++P+K SFEYY  L   D   +  K       L 
Sbjct: 27  IFTFKMHHRFSDMLKDL--SDSTTSRNFPSKGSFEYYAELAHRDQMLRGRKLYNVEAPLA 84

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
            S G+ T  + +  G+LHYT +++GTP + F+VALD GSDL W+PCDC +CAP     Y 
Sbjct: 85  FSDGNSTFRISS-LGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYA 143

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
           S D +L+ Y P  SSTSK ++C++ LC     C      CPY + Y +  TS+SG+LVED
Sbjct: 144 S-DFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVED 202

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
           +LHL S   N  + S++A V  GCG  QSG +L+  AP+GL GLG+ +ISVPS+L++ GL
Sbjct: 203 VLHLTSEDSN--QESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGL 260

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
             +SFSMCF  D  GRI FGD+G   Q+ T F  SN  + +Y I V    +G++ L    
Sbjct: 261 TADSFSMCFGHDGVGRISFGDKGSPDQEETPF-NSNPSHPSYNISVTQVRVGTT-LVDVD 318

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKL-PS 383
           F A+ DSG+SFT+L   +Y  ++  F  Q  D     +   P++ CY  S      L PS
Sbjct: 319 FTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPS 378

Query: 384 VKLMFPQNNSFVVNNPVFVI 403
           + L       F V +P+ VI
Sbjct: 379 MSLTMKGRGHFTVFDPIIVI 398


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 154/359 (42%), Positives = 212/359 (59%), Gaps = 22/359 (6%)

Query: 54  PAKKSFEYYQVLLSSDVQKQK------MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWI 107
           P   + EYY  L   D  +++         G +F     + G+ T  L NDFG+LHY  +
Sbjct: 48  PPHGTAEYYAALAGHDGLRRRSLGVGGGGGGAEFAF---ADGNDTYRL-NDFGFLHYAVV 103

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
            +GTPNV+FLVALD GSDL W+PCDC++CAPL +  Y SL  D+  YSP+ S+TS+ + C
Sbjct: 104 ALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLQSPNYGSLKFDV--YSPAQSTTSRKVPC 161

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           S  LCDL  +C++    CPY++ Y ++NTSSSG+LVED+L+L S  D+A    V A ++ 
Sbjct: 162 SSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMF 219

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 287
           GCG  Q+G +L   AP+GL+GLG+   SVPSLLA  GL  NSFSMCF  D  GRI FGD 
Sbjct: 220 GCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDT 279

Query: 288 GPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 345
           G + Q+ T  +    N  Y   I G+    +GS  +  T F AIVDSG+SFT L   +Y 
Sbjct: 280 GSSDQKETPLNVYKQNPYYNITITGIT---VGSKSIS-TEFSAIVDSGTSFTALSDPMYT 335

Query: 346 TIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
            I + FD Q+  +    +   P++ CY  S+  +   P+V L     + F VN+P+  I
Sbjct: 336 QITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITI 393


>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
 gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 154/405 (38%), Positives = 227/405 (56%), Gaps = 16/405 (3%)

Query: 2   NRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKAL-GVSKNRNATSWPAKKSFE 60
           ++++    L   W+ +++      +F+ K+ HRFS+  K   G+++N     WP K SFE
Sbjct: 3   SKLTFFFLLITIWVFSKTCKGR--VFTFKMHHRFSDSFKNWSGLTRN-----WPEKGSFE 55

Query: 61  YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVAL 120
           YY  L   D   +  +       L  S G+ T  + +  G+LHYT +++GTP V F+VAL
Sbjct: 56  YYAALAHRDQMLRGRRLSDADASLAFSDGNSTFRISS-LGFLHYTTVELGTPGVKFMVAL 114

Query: 121 DAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQN 180
           D GSDL W+PCDC RCAP   + Y S D +L+ Y+P  SSTSK ++C++ +C     C  
Sbjct: 115 DTGSDLFWVPCDCSRCAPTHGASYAS-DFELSIYNPRESSTSKKVTCNNDMCAQRNRCLG 173

Query: 181 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 240
               CPY + Y +  TS+SG+LV+D+LHL +  ++  +  V+A V  GCG  QSG +LD 
Sbjct: 174 TFSSCPYIVSYVSAQTSTSGILVKDVLHLTT--EDGGREFVEAYVTFGCGQVQSGSFLDI 231

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS 300
            AP+GL GLG+ +ISVPS+L++ GLI +SFSMCF  D  GRI FGD+G   Q+ T F   
Sbjct: 232 AAPNGLFGLGMEKISVPSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPFNV- 290

Query: 301 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
           N  + TY + V    +G + L    F A+ DSG+SFT++    Y  ++ +F     D   
Sbjct: 291 NPAHPTYNVTVTQARVG-TMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRR 349

Query: 361 SFE-GYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVI 403
             +   P++ CY  S      L PS+ L       F V +P+ VI
Sbjct: 350 PPDPRIPFEYCYDMSPDANASLVPSMSLTMKGGRHFTVYDPIIVI 394


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 151/382 (39%), Positives = 226/382 (59%), Gaps = 11/382 (2%)

Query: 26  MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
           +F+ ++ HRFS+EVK    S  R    +P K SFEY+  L+  D  ++ +++        
Sbjct: 28  IFTFEMHHRFSDEVKQWSDSTGR-FVKFPPKGSFEYFNALVLRDWLIRGRRLSDSESESS 86

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           L  S G+ T  + +  G+LHYT + +GTP + F+VALD GSDL W+PCDC +CAP   + 
Sbjct: 87  LTFSDGNSTSRISS-LGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGAT 145

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
           Y S + +L+ Y+P  S+T+K ++C++ LC     C      CPY + Y +  TS+SG+L+
Sbjct: 146 YAS-EFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGILM 204

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED++HL +   N  +  V+A V  GCG  QSG +LD  AP+GL GLG+ +ISVPS+LA+ 
Sbjct: 205 EDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLARE 262

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL+ +SFSMCF  D  GRI FGD+G + Q+ T F   N  +  Y I V    +G++ L  
Sbjct: 263 GLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-LID 320

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKL- 381
             F A+ D+G+SFT+L   +Y T++  F  Q  D   S +   P++ CY  S+     L 
Sbjct: 321 DEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANASLI 380

Query: 382 PSVKLMFPQNNSFVVNNPVFVI 403
           PS+ L    N+ F +N+P+ VI
Sbjct: 381 PSLSLTMKGNSHFTINDPIIVI 402


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 153/357 (42%), Positives = 211/357 (59%), Gaps = 18/357 (5%)

Query: 54  PAKKSFEYYQVLLSSDVQKQK----MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDI 109
           P   + EYY  L   D  +++       G   +  F + G+ T  L NDFG+LHY  + +
Sbjct: 48  PPHGTAEYYAALAGHDGLRRRSLGVGGGGGGAEFAF-ADGNDTYRL-NDFGFLHYAVVAL 105

Query: 110 GTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           GTPNV+FLVALD GSDL W+PCDC++CAP  +  Y SL  D+  YSP+ S+TS+ + CS 
Sbjct: 106 GTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSPAQSTTSRKVPCSS 163

Query: 170 RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 229
            LCDL  +C++    CPY++ Y ++NTSSSG+LVED+L+L S  D+A    V A ++ GC
Sbjct: 164 NLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSAQSKIVTAPIMFGC 221

Query: 230 GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGP 289
           G  Q+G +L   AP+GL+GLG+   SVPSLLA  GL  NSFSMCF  D  GRI FGD G 
Sbjct: 222 GQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRINFGDTGS 281

Query: 290 ATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI 347
           + Q+ T  +    N  Y   I G+    +GS  +  T F AIVDSG+SFT L   +Y  I
Sbjct: 282 SDQKETPLNVYKQNPYYNITITGI---TVGSKSI-STEFSAIVDSGTSFTALSDPMYTQI 337

Query: 348 AAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
            + FD Q+  +    +   P++ CY  S+  +   P+V L     + F VN+P+  I
Sbjct: 338 TSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSIFPVNDPIITI 393


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score =  264 bits (674), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/320 (45%), Positives = 198/320 (61%), Gaps = 13/320 (4%)

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           + G+ T  L NDFG+LHY  + +GTPNV+FLVALD GSDL W+PCDC++CAP  +  Y S
Sbjct: 20  ADGNDTYRL-NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS 78

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
           L  D+  YSP+ S+TS+ + CS  LCDL  +C++    CPY++ Y ++NTSSSG+LVED+
Sbjct: 79  LKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDV 136

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           L+L S  D+A    V A ++ GCG  Q+G +L   AP+GL+GLG+   SVPSLLA  GL 
Sbjct: 137 LYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLA 194

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQT 324
            NSFSMCF  D  GRI FGD G + Q+ T  +    N  Y   I G+    +GS  +  T
Sbjct: 195 ANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGI---TVGSKSI-ST 250

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLPS 383
            F AIVDSG+SFT L   +Y  I + FD Q+  +    +   P++ CY  S+  +   P+
Sbjct: 251 EFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PN 309

Query: 384 VKLMFPQNNSFVVNNPVFVI 403
           V L     + F VN+P+  I
Sbjct: 310 VSLTAKGGSIFPVNDPIITI 329


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 144/321 (44%), Positives = 197/321 (61%), Gaps = 15/321 (4%)

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
           P  G+  +   NDFG+LHY  + +GTPNV+FLVALD GSDL W+PCDC++CAP  +  Y 
Sbjct: 48  PPHGTADL---NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYG 104

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
           SL  D+  YSP+ S+TS+ + CS  LCDL  +C++    CPY++ Y ++NTSSSG+LVED
Sbjct: 105 SLKFDV--YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVED 162

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
           +L+L S  D+A    V A ++ GCG  Q+G +L   AP+GL+GLG+   SVPSLLA  GL
Sbjct: 163 VLYLTS--DSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGL 220

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQ 323
             NSFSMCF  D  GRI FGD G + Q+ T  +    N  Y   I G+    +GS  +  
Sbjct: 221 AANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGIT---VGSKSIS- 276

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLP 382
           T F AIVDSG+SFT L   +Y  I + FD Q+  +    +   P++ CY  S+  +   P
Sbjct: 277 TEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-P 335

Query: 383 SVKLMFPQNNSFVVNNPVFVI 403
           +V L     + F VN+P+  I
Sbjct: 336 NVSLTAKGGSIFPVNDPIITI 356


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 142/310 (45%), Positives = 193/310 (62%), Gaps = 12/310 (3%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 156
           NDFG+LHY  + +GTPNV+FLVALD GSDL W+PCDC++CAP  +  Y SL  D+  YSP
Sbjct: 70  NDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDV--YSP 127

Query: 157 SASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           + S+TS+ + CS  LCDL  +C++    CPY++ Y ++NTSSSG+LVED+L+L S  D+A
Sbjct: 128 AQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTS--DSA 185

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
               V A ++ GCG  Q+G +L   AP+GL+GLG+   SVPSLLA  GL  NSFSMCF  
Sbjct: 186 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 245

Query: 277 DDSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGS 334
           D  GRI FGD G + Q+ T  +    N  Y   I G+    +GS  +  T F AIVDSG+
Sbjct: 246 DGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGIT---VGSKSI-STEFSAIVDSGT 301

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNS 393
           SFT L   +Y  I + FD Q+  +    +   P++ CY  S+  +   P+V L     + 
Sbjct: 302 SFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGSI 360

Query: 394 FVVNNPVFVI 403
           F VN+P+  I
Sbjct: 361 FPVNDPIITI 370


>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Brachypodium distachyon]
          Length = 509

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 158/388 (40%), Positives = 214/388 (55%), Gaps = 23/388 (5%)

Query: 31  LIHRFSEEVKALGVSKNRNATS--WPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQ 88
           L HRFS  VK    S+ R A +  WP + S EYY  L + D  ++ +  G    +L  + 
Sbjct: 13  LHHRFSPVVKRWAESRGRPAAAAWWP-EGSPEYYSALSAHDRARRVLAGGKGESLLSFAD 71

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD 148
           G+ T       G LHY  + +GTPN +F+VALD GSDL W+PCDC RCAP++     +  
Sbjct: 72  GNSTT---RHAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPIA-----NTS 123

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
             L  YSP  SSTSK ++CSH LCD   +C N    CPYT+ Y + NTSSSG+LVED+L+
Sbjct: 124 ELLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLY 183

Query: 209 LI-------SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           +        SG    +  +V A V+ GCG +Q+G +LDG A +GL+GLG+  +SVPSLLA
Sbjct: 184 MTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDRVSVPSLLA 243

Query: 262 KAGLI-RNSFSMCFDKDDSGRIFFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSS 319
            AGL+  +SFSMCF  D +GRI FG+   A  Q  T F+ S  +  TY I V    +   
Sbjct: 244 AAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTR-PTYNISVTAVNVKGK 302

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQR 377
                 F A+VDSG+SFT+L    Y  +A  F+ QV +   +     P++ CY  S  Q 
Sbjct: 303 GAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCYALSRGQT 362

Query: 378 LPKLPSVKLMFPQNNSFVVNNPVFVIYG 405
              +P V L       F V  P  ++ G
Sbjct: 363 EVLMPEVSLTTRGGAVFPVTRPFVIVAG 390


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score =  261 bits (666), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 155/380 (40%), Positives = 213/380 (56%), Gaps = 8/380 (2%)

Query: 26  MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           +++  + HR SE V+    S      + P K + EYY  L   D   +  K       L 
Sbjct: 20  VYTFTMHHRHSEPVRKWSHSTASGIPAPPEKGTVEYYAELADRDRLLRGRKLSQIDDGLA 79

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
            S G+ T  + +  G+LHYT + IGTP V F+VALD GSDL W+PCDC RCA   +S + 
Sbjct: 80  FSDGNSTFRISS-LGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRCAATDSSAFA 138

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
           S D DLN Y+P+ SSTSK ++C++ LC   + C      CPY + Y +  TS+SG+LVED
Sbjct: 139 S-DFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAETSTSGILVED 197

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
           +LHL    ++   + V+A+VI GCG  QSG +LD  AP+GL GLG+ +ISVPS+L++ G 
Sbjct: 198 VLHLTQ--EDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGF 255

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
             +SFSMCF +D  GRI FGD+G   Q  T F   N  + TY I V    +G++ L    
Sbjct: 256 TADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRVGTT-LIDVE 313

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI-TSFEGYPWKCCYKSSSQRLPKL-PS 383
           F A+ DSG+SFT+L    Y  +   F  QV D    S    P++ CY  S      L PS
Sbjct: 314 FTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPS 373

Query: 384 VKLMFPQNNSFVVNNPVFVI 403
           V L     + F V +P+ +I
Sbjct: 374 VSLTMGGGSHFAVYDPIIII 393


>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 537

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 154/386 (39%), Positives = 210/386 (54%), Gaps = 16/386 (4%)

Query: 31  LIHRFSEEVKALGVSKNRNATSW--PAKKSFEYYQVLLSSD---VQKQKMKTGPQFQMLF 85
           L HR S  V+    ++     +W   A+ + EYY  L   D   + ++ +  G    +L 
Sbjct: 33  LHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLARRGLAEGDGEGLLT 92

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
            + G+ T  L    G LHY  + +GTPN +FLVALD GSDL W+PCDC +CAP++ +   
Sbjct: 93  FASGNLTFRL---EGSLHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDL 149

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ---NPKQPCPYTMDYYTENTSSSGLL 202
               DL  YSP  SSTSK ++C H LC+   +C    N    CPYT+ Y + NTSSSG+L
Sbjct: 150 RGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTSSSGVL 209

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           VED+LHL          +V A V++GCG  Q+G +LDG A DGL+GLG+ ++SVPS+L  
Sbjct: 210 VEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVPSVLHA 269

Query: 263 AGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 321
           AGL+  +SFSMCF  D  GRI FGD G   Q  T F   N  + TY I V    +    +
Sbjct: 270 AGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRN-THPTYNISVTAMSVSGKEV 328

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLP 379
               F AIVDSG+SFT+L    Y  +A  F+ +V +   +     P++ CY+    Q   
Sbjct: 329 A-AEFAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYELGRGQTEL 387

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVIYG 405
            +P V L       F V  P+ VIYG
Sbjct: 388 FVPEVSLTTRGGAVFPVTRPIVVIYG 413


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 154/410 (37%), Positives = 228/410 (55%), Gaps = 35/410 (8%)

Query: 4   ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRN----ATSWPAKKSF 59
           + + ++  V W+L  +      M    L H+FS++  A+   ++RN    A  WP + + 
Sbjct: 10  VLVMVHCCVLWMLATTFANALRM---DLFHKFSKQ--AIEAMRSRNGMDYAQDWPTEGTI 64

Query: 60  EYYQVLLSSDVQK-----QKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNV 114
           E+  +L   DV +     +++            QG+ T  L    G LHY++IDIGTPNV
Sbjct: 65  EFQTMLRDHDVARHTRTARRILAASSMDQYVLIQGNATEQLFG--GGLHYSYIDIGTPNV 122

Query: 115 SFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL 174
            FLV LD GSDLLWIPC+C  CAPLSA   +     LN Y+PS SST+K + CS  LC++
Sbjct: 123 QFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVLCSDPLCEM 182

Query: 175 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI--SGGDNALKNSVQASVIIGCGMK 232
            ++C  P   CPY ++Y + NTS+SG L ED ++ +  SGG     N V+  V +GCG  
Sbjct: 183 SSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGG-----NPVKLPVYLGCGKV 237

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQ 292
           Q+G  L G AP+GL+GLG  +ISVP+ LA  G + +SFS+C     SG + FGD+GPA Q
Sbjct: 238 QTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEGPAAQ 297

Query: 293 QSTSFLASNGKYI-TYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF 351
           ++T  +  +   + TYI+ +++  +G++ L   S  A+ D+G+SFT+L K VY      +
Sbjct: 298 RTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMAS-HALFDTGTSFTYLSKTVYPQFVQAY 356

Query: 352 DRQV-----NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
           D Q+     ND   S     W  CY++S+    ++P V L     NS  V
Sbjct: 357 DAQMSLPKWNDPRFS----KWDLCYQTSNTNF-QVPVVSLALSGGNSLDV 401


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 152/380 (40%), Positives = 213/380 (56%), Gaps = 8/380 (2%)

Query: 26  MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           +++  + HR SE V+    S      + P + + EYY  L   D   +  K       L 
Sbjct: 24  VYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYAELADRDRLLRGRKLSQIDAGLA 83

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
            S G+ T  + +  G+LHYT + IGTP V F+VALD GSDL W+PCDC RCA   ++ + 
Sbjct: 84  FSDGNSTFRISS-LGFLHYTTVQIGTPGVKFMVALDTGSDLFWVPCDCTRCAASDSTAFA 142

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
           S D DLN Y+P+ SSTSK ++C++ LC   + C      CPY + Y +  TS+SG+LVED
Sbjct: 143 S-DFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAETSTSGILVED 201

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
           +LHL    ++   + V+A+VI GCG  QSG +LD  AP+GL GLG+ +ISVPS+L++ G 
Sbjct: 202 VLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGF 259

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
             +SFSMCF +D  GRI FGD+G   Q  T F   N  + TY I V    +G++ +    
Sbjct: 260 TADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRVGTTVI-DVE 317

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI-TSFEGYPWKCCYKSSSQRLPKL-PS 383
           F A+ DSG+SFT+L    Y  +   F  QV D    S    P++ CY  S      L PS
Sbjct: 318 FTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPS 377

Query: 384 VKLMFPQNNSFVVNNPVFVI 403
           V L     + F V +P+ +I
Sbjct: 378 VSLTMGGGSHFAVYDPIIII 397


>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
 gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
          Length = 523

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 157/385 (40%), Positives = 220/385 (57%), Gaps = 22/385 (5%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF----- 81
            S  + HR+S  V+             P   + EYY  L   D++++ +  GP       
Sbjct: 29  LSLDVHHRYSATVREWAGHHRA-----PPAGTAEYYAALARHDLRRRSLAAGPAAGGGGG 83

Query: 82  -QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
            ++ F + G+ T  L N+ G+LHY  + +GTPNV+FLVALD GSDL W+PCDC+ CAPL 
Sbjct: 84  GEVAF-ADGNDTYRL-NELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLV 141

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSG 200
           +  Y  L  D   YSP  SSTS+ + CS  LCDL ++C++    CPY+++Y ++NTSS+G
Sbjct: 142 SPNYRDLKFD--TYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTG 199

Query: 201 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
           +LVED+L+LI+  +      V A +  GCG  Q+G +L   AP+GL+GLG+  ISVPSLL
Sbjct: 200 VLVEDVLYLIT--EYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLL 257

Query: 261 AKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSS 319
           A  G+  NSFSMCF  D  GRI FGD G + QQ T   +     Y  Y I +    +GS 
Sbjct: 258 ASEGVAANSFSMCFGDDGRGRINFGDTGSSDQQETPLNIYKQNPY--YNISITGAMVGSK 315

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRL 378
               T+F AIVDSG+SFT L   +Y  I + F+ QV D  T  +   P++ CY  S +  
Sbjct: 316 SF-NTNFNAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISPKGS 374

Query: 379 PKLPSVKLMFPQNNSFVVNNPVFVI 403
              P++ LM    + F VN+P+  I
Sbjct: 375 VNPPNISLMAKGGSIFPVNDPIITI 399


>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
 gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 543

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 161/432 (37%), Positives = 232/432 (53%), Gaps = 36/432 (8%)

Query: 2   NRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR--NATSWPAKKSF 59
            R  L + +AV  + +  + A+   F   L HRFS  V+    ++     A  WPA+ + 
Sbjct: 9   RRTGLLLAMAVVVVASLIAAADASSFGFDLHHRFSPVVRRWAEARGGPLAADQWPARGTP 68

Query: 60  EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDF---GWLHYTWIDIGTPNVSF 116
           EYY  L   D  ++ +  G    +L       T + GND    G L+Y  +++GTPN +F
Sbjct: 69  EYYSALSRHDRARRALAGGADDGLL-------TFAAGNDTYQSGTLYYAEVELGTPNATF 121

Query: 117 LVALDAGSDLLWIPCDCVRCAPLSASYYNSLDR-DLNEYSPSASSTSKHLSCSHRLCDLG 175
           LVALD GSDL W+PCDC +CA + ++     D   L  YSP  SSTSK ++C + LC   
Sbjct: 122 LVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQR 181

Query: 176 TSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLI--SGGDNALKNSVQASVIIGCGMK 232
             C       CPY + Y + NTSSSG+LV+D+LHL     G  A   ++QA V+ GCG  
Sbjct: 182 NGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQV 241

Query: 233 QSGGYLD--GVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGP 289
           Q+G +LD  G A DGL+GLG+G++SVPS LA +GL+  +SFSMCF  D  GR+ FGD G 
Sbjct: 242 QTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGS 301

Query: 290 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAA 349
             Q  T F   +    TY +   +  +GS  +    F A++DSG+SFT+L    Y  +A 
Sbjct: 302 RGQAETPFTVRS-LNPTYNVSFTSIGVGSESVA-AEFAAVMDSGTSFTYLSDPEYTQLAT 359

Query: 350 EFDRQVNDTITSF-----EGYPWKCCYK-SSSQRLPKLPSVKL------MFPQNNSFVVN 397
           +F+ QV++   +F     + +P++ CY+ S +Q    +P V L      +FP    F+  
Sbjct: 360 KFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFI-- 417

Query: 398 NPVFVIYGTQVG 409
            PV    G  VG
Sbjct: 418 -PVGDTTGRAVG 428


>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 529

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 149/381 (39%), Positives = 209/381 (54%), Gaps = 25/381 (6%)

Query: 33  HRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQK------MKTGPQFQMLF 85
           HR+S  V+   G+ +       P+  + EYY  L   D  +++                 
Sbjct: 38  HRYSATVRGWAGLRRG------PSPGTAEYYAALAGHDDLRRRSLSLAAAPAPGAGGPFA 91

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
              G+ T  L N FG+LHY  + +GTPNV+FLVALD GSDL W+PCDC++CAPLS+  Y 
Sbjct: 92  FVDGNDTYRL-NQFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLSSPDYG 150

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
           +L  D+  YSP  SSTS+ + CS  +CDL T C      CPY ++Y ++NTSS G+LVED
Sbjct: 151 NLKFDV--YSPRKSSTSRKVPCSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVED 208

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
           +++L +  ++      QA +  GCG  Q+G +L   AP+GL+GLG+   SVPSLLA  G+
Sbjct: 209 VMYLAT--ESGHSKITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGV 266

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQ 323
             NSFSMCF +D  GRI FGD G A Q  T  +    N  Y   I+G     +       
Sbjct: 267 AANSFSMCFGEDGHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGA----MAGGKTFS 322

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKLP 382
           T F A+VDSG+SFT L   +Y  I + FD+QV +     +   P++ CY  SS+     P
Sbjct: 323 TKFSAVVDSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTISSKGAVSPP 382

Query: 383 SVKLMFPQNNSFVVNNPVFVI 403
           ++ L     + F V +P+  I
Sbjct: 383 NISLTAKGGSVFPVKDPIITI 403


>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
          Length = 585

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 201/345 (58%), Gaps = 8/345 (2%)

Query: 5   SLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVK--ALGVSKNRNATSWPAKKSFEYY 62
           S ++++ +   +         +FS ++ HRFSE VK  + G      A +WPAK SFEYY
Sbjct: 4   SWSVFIVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYY 63

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
             L   D   +  +      +L  S G+ T  + +  G+LHYT + +GTP   FLVALD 
Sbjct: 64  AELAHRDRALRGRRLSDIDGLLTFSDGNSTFRI-SSLGFLHYTTVSLGTPGKKFLVALDT 122

Query: 123 GSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 182
           GSDL W+PCDC RCAP   + Y S D +L+ Y+P  SSTS+ ++C++ LC     C    
Sbjct: 123 GSDLFWVPCDCSRCAPTEGTTYAS-DFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTF 181

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 242
             CPY + Y +  TS+SG+LVED+LHL +  ++  +  V+A V  GCG  Q+G +LD  A
Sbjct: 182 SNCPYMVSYVSAETSTSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAA 239

Query: 243 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 302
           P+GL GLGL +ISVPS+L+K G   +SFSMCF  D  GRI FGD+G   Q+ T F   N 
Sbjct: 240 PNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPF-NLNA 298

Query: 303 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI 347
            + TY I V    +G++ L    F A+ DSG+SFT+L   +Y  +
Sbjct: 299 LHPTYNITVTQVRVGTT-LIDLDFTALFDSGTSFTYLVDPIYTNV 342


>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
          Length = 313

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 118/190 (62%), Positives = 150/190 (78%), Gaps = 3/190 (1%)

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
           +SV+A V+IGCG KQSG YLDGVAPDGL+GLG  EISVPS L+KAGL+RNSFS+CFD++D
Sbjct: 5   SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED 64

Query: 279 SGRIFFGDQGPATQQSTSFLA-SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           SGRI+FGD GP+ QQST FL   N KY  YI+GVE CCIG+SCLKQTSF   +DSG SFT
Sbjct: 65  SGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 124

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
           +LP+E+Y  +A E DR +N T  +FEG  W+ CY+SS++  PK+P++KL F  NN+FV++
Sbjct: 125 YLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAE--PKVPAIKLKFSHNNTFVIH 182

Query: 398 NPVFVIYGTQ 407
            P+FV   +Q
Sbjct: 183 KPLFVFQQSQ 192


>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 553

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 155/409 (37%), Positives = 215/409 (52%), Gaps = 36/409 (8%)

Query: 26  MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           +F+  + HR+SE VK    S    +  WP K S EYY  L   D   +  +       L 
Sbjct: 25  IFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRFLRGRRLSQFDAGLA 84

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAP---LSAS 142
            S G+ T  + +  G+LHYT I++GTP V F+VALD GSDL W+PCDC RC+     + +
Sbjct: 85  FSDGNSTFRISS-LGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRCSATRSSAFA 143

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
              + D DL+ Y+P+ SSTSK ++C++ LC     C      CPY + Y +  TS+SG+L
Sbjct: 144 SALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGIL 203

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           VED+LHL    DN   + V+A+VI GCG  QSG +LD  AP+GL GLG+ +ISVPS+L++
Sbjct: 204 VEDVLHLTQPDDN--HDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSR 261

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            G   +SFSMCF +D  GRI FGD+G   Q  T F   N  + TY I +    +G++ L 
Sbjct: 262 EGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNV-NPSHPTYNITINQVRVGTT-LI 319

Query: 323 QTSFKAIVDSGSSFTFLPKEVY--------------------------ETIAAEFDRQVN 356
              F A+ DSG+SFT+L    Y                          E    +F  QV 
Sbjct: 320 DVEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVE 379

Query: 357 DTITSFEG-YPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVI 403
           D     +   P+  CY  S      L PS+ L     + FVV +P+ +I
Sbjct: 380 DRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII 428


>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 545

 Score =  247 bits (631), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 154/401 (38%), Positives = 218/401 (54%), Gaps = 28/401 (6%)

Query: 27  FSTKLIHRFSEEVKALGVSKNR--NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQML 84
           F   L HRFS  V+    ++     A  WPA+ + EYY  L   D  ++ +  G    +L
Sbjct: 36  FGFDLHHRFSPVVRRWAEARGGPLAADRWPARGTPEYYSALSRHDRARRALAGGADDGLL 95

Query: 85  FPSQGSKTMSLGNDF---GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
                  T + GND    G L+Y  +++GTPN +FLVALD GSDL W+PCDC +CA + +
Sbjct: 96  -------TFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPS 148

Query: 142 SYYNSLDR-DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ-PCPYTMDYYTENTSSS 199
           +     D   L  YSP  SSTS+ ++C + LC     C       CPY + Y + NTSSS
Sbjct: 149 ANATGPDAPPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSSS 208

Query: 200 GLLVEDILHLI--SGGDNALKNSVQASVIIGCGMKQSGGYLD--GVAPDGLIGLGLGEIS 255
           G+LV+D+LHL     G  A   ++QA V+ GCG  Q+G +LD  G A DGL+GLG+G++S
Sbjct: 209 GVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVS 268

Query: 256 VPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           VPS LA +GL+  +SFSMCF  D  GR+ FGD G   Q  T F   +    TY +   + 
Sbjct: 269 VPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRS-LNPTYNVSFTSI 327

Query: 315 CIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF-----EGYPWKC 369
            IGS  +    F A++DSG+SFT+L    Y  +A +F+ QV++   +F     + +P++ 
Sbjct: 328 GIGSESVA-AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEY 386

Query: 370 CYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVG 409
           CY+ S +Q    +P V L       F V  P F+  G   G
Sbjct: 387 CYRLSPNQTEVAMPDVSLTAKGGALFPVTQP-FIPVGDTTG 426


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/377 (40%), Positives = 219/377 (58%), Gaps = 19/377 (5%)

Query: 33  HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQMLFPSQGS 90
           HRFS++V  +GV         P + S +YY+V+   D  ++ +++    Q  + F S G+
Sbjct: 39  HRFSDQV--VGVLP---GDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SDGN 92

Query: 91  KTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCA-PLSASYYNSLDR 149
           +T+ + +  G+LHY  + +GTP+  F+VALD GSDL W+PCDC  C   L A   +SLD 
Sbjct: 93  ETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLD- 150

Query: 150 DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
            LN YSP+ASSTS  + C+  LC  G  C +P+  CPY + Y +  TSS+G+LVED+LHL
Sbjct: 151 -LNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHL 209

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
           +S  ++    ++ A V  GCG  Q+G + DG AP+GL GLGL +ISVPS+LAK G+  NS
Sbjct: 210 VS--NDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 267

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI 329
           FSMCF  D +GRI FGD+G   Q+ T  L     + TY I V    +G +      F A+
Sbjct: 268 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAV 325

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLP-KLPSVKL 386
            DSG+SFT+L    Y  I+  F+    D    T+    P++ CY  S  +   + P+V L
Sbjct: 326 FDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNL 385

Query: 387 MFPQNNSFVVNNPVFVI 403
                +S+ V +P+ VI
Sbjct: 386 TMKGGSSYPVYHPLVVI 402


>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
 gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
          Length = 455

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 142/363 (39%), Positives = 213/363 (58%), Gaps = 13/363 (3%)

Query: 7   TIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLL 66
           T++L    +L         +F+ ++ HRFS+EVK    S  R A  +P K SFEY+  L+
Sbjct: 9   TLFLIPILMLLSFGSCNGRIFTFEMHHRFSDEVKQWSDSTGRFA-KFPPKGSFEYFNALV 67

Query: 67  SSD--VQKQKMKTGPQFQM--LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
             D  ++ +++          L  S G+ T  + +  G+LHYT + +GTP + F+VALD 
Sbjct: 68  LRDWLIRGRRLSESESESESSLTFSDGNSTSRISS-LGFLHYTTVKLGTPGMRFMVALDT 126

Query: 123 GSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPK 182
           GSDL W+PCDC +CAP   + Y S + +L+ Y+P  S+T+K ++C++ LC     C    
Sbjct: 127 GSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTF 185

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 242
             CPY + Y +  TS+SG+L+ED++HL +   N  +  V+A V  GCG  QSG +LD  A
Sbjct: 186 STCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAA 243

Query: 243 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNG 302
           P+GL GLG+ +ISVPS+LA+ GL+ +SFSMCF  D  GRI FGD+G + Q+ T F   N 
Sbjct: 244 PNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNP 302

Query: 303 KYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI--AAEFDRQVNDTIT 360
            +  Y I V    +G++ L    F A+ D+G+SFT+L   +Y T+  +A+  R   D+  
Sbjct: 303 SHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESAQDKRHSPDSRI 361

Query: 361 SFE 363
            FE
Sbjct: 362 PFE 364


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 152/377 (40%), Positives = 219/377 (58%), Gaps = 19/377 (5%)

Query: 33  HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQMLFPSQGS 90
           HRFS++V  +GV         P + S +YY+V+   D  ++ +++    Q  + F S G+
Sbjct: 39  HRFSDQV--VGVLP---GDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SDGN 92

Query: 91  KTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCA-PLSASYYNSLDR 149
           +T+ + +  G+LHY  + +GTP+  FLVALD GSDL W+PCDC  C   L A   +SLD 
Sbjct: 93  ETIRV-DALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLD- 150

Query: 150 DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
            LN YSP+ASSTS  + C+  LC  G  C +P+  CPY + Y +  TSS+G+LVED+LHL
Sbjct: 151 -LNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHL 209

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
           +S  ++    ++ A V +GCG  Q+G + DG AP+GL GLGL +ISVPS+LAK G+  NS
Sbjct: 210 VS--NDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 267

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI 329
           FSMCF  D +GRI FGD+G   Q+ T  L     + TY I V    +  +      F A+
Sbjct: 268 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVEGNT-GDLEFDAV 325

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLP-KLPSVKL 386
            DSG+SFT+L    Y  I+  F+    D    T+    P++ CY  S  +   + P+V L
Sbjct: 326 FDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNL 385

Query: 387 MFPQNNSFVVNNPVFVI 403
                +S+ V +P+ VI
Sbjct: 386 TMKGGSSYPVYHPLVVI 402


>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score =  243 bits (621), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 143/382 (37%), Positives = 213/382 (55%), Gaps = 15/382 (3%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDV--QKQKMKTGPQFQML 84
           F   L HR+S+ VK +      +    P K S  YY  +   D+    +K+ +      L
Sbjct: 41  FGFDLHHRYSDPVKGM-----LSVDDLPEKGSLHYYASMAHRDILIHGRKLVSDNTSTPL 95

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
               G++T    +  G+LHY  + IGTP++S+LVALD GSDL W+PCDC     +    +
Sbjct: 96  TFFSGNETYRF-SSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQF 154

Query: 145 NSLDR-DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
            S ++ D N Y P+ASSTS+ + C++ LC   + C + +  CPY + Y +  TSS+G+LV
Sbjct: 155 PSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGVLV 214

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED+LHL +  D+A   ++ A +I GCG  Q+G +LDG AP+GL GLG+  ISVPS LA+ 
Sbjct: 215 EDLLHLTT--DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLARE 272

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           G   NSFSMCF +D  GRI FGD G + Q  T F      + TY + +    +G      
Sbjct: 273 GYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQ-LHPTYNVSITKINVGGRD-AD 330

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYK-SSSQRLPKL 381
             F AI DSG+SFT+L    Y  I+  F+    +   +S    P++ CY+ SS+Q   ++
Sbjct: 331 LEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEI 390

Query: 382 PSVKLMFPQNNSFVVNNPVFVI 403
           P+V L+    + F V +P+ ++
Sbjct: 391 PTVNLVMQGGSQFNVTDPIVIV 412


>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 525

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 154/391 (39%), Positives = 212/391 (54%), Gaps = 21/391 (5%)

Query: 26  MFSTKLIHRFSEEVKAL-GVS-KNRNATSWPAKKSFEYYQVLLSSDV----QKQKMKTGP 79
           +FS K+ HRFS+++K   GVS K     SWP K + EYY  L   D     Q+     GP
Sbjct: 27  IFSFKMHHRFSDQLKNWSGVSGKFTLPDSWPVKGTIEYYAQLAFRDRFFRGQRLSEFDGP 86

Query: 80  QFQMLFPSQGS--KTMSLGNDFGWLH---YTWIDIGTPNVSFLVALDAGSDLLWIPCDCV 134
              + F    S  +  SLG     +    YT + +GTP   F+VALD GSDL W+PCDC 
Sbjct: 87  ---LAFSDGNSSFRISSLGFALFDVFFFFYTTVQLGTPGTKFMVALDTGSDLFWVPCDCS 143

Query: 135 RCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTE 194
           RCAP   S Y S D +L+ YSP  SSTSK + C++ LC     C      CPY + Y + 
Sbjct: 144 RCAPTEGSPYAS-DFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSA 202

Query: 195 NTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEI 254
            TS++G+L+ED+LHL +  ++     +QA +  GCG  QSG +LD  AP+GL GLG+ +I
Sbjct: 203 ETSTTGILIEDLLHLKT--EHKHSEPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQI 260

Query: 255 SVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           SVPS+L++ GL+ NSFSMCF  D  GRI FGD+G   Q+ T F   N  +  Y I V + 
Sbjct: 261 SVPSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSI 319

Query: 315 CIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYKS 373
            +G++ L      A+ DSG+SF++    +Y  ++A F  Q  D         P++ CY  
Sbjct: 320 RVGTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNM 378

Query: 374 SSQRLPKL-PSVKLMFPQNNSFVVNNPVFVI 403
           S      L P + L       F V +P+ VI
Sbjct: 379 SPDANASLTPGISLTMKGGGPFPVYDPIIVI 409


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 154/406 (37%), Positives = 230/406 (56%), Gaps = 21/406 (5%)

Query: 4   ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
           + L + L   W+L    G     F  +  HRFS++V  +GV         P + S +YY+
Sbjct: 12  MGLILMLVSSWVLDRCEGLGE--FGFEFHHRFSDQV--VGVLP---GDGLPNRDSSKYYR 64

Query: 64  VLLSSD--VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           V+   D  ++ +++ +  Q  + F + G++T+ + N  G+LHY  + +GTP+  FLVALD
Sbjct: 65  VMAHRDRLIRGRRLASEDQSLVTF-ADGNETIRV-NALGFLHYANVTVGTPSDWFLVALD 122

Query: 122 AGSDLLWIPCDC-VRCA-PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 179
            GSDL W+PCDC   C   L A   +SLD  LN YSP+ASSTS  + C+  LC     C 
Sbjct: 123 TGSDLFWLPCDCSTNCVRELKAPGGSSLD--LNIYSPNASSTSSKVPCNSTLCTRVDRCA 180

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           +P   CPY + Y +  TSS+G+LVED+LHL+S   N+    ++A + +GCG+ Q+G + D
Sbjct: 181 SPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS--KPIRARITLGCGLVQTGVFHD 238

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLA 299
           G AP+GL GLGL +ISVPS+LAK G+  NSFSMCF  D +GRI FGD+G   Q+ T  L 
Sbjct: 239 GAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETP-LN 297

Query: 300 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
               + TY + V    +G +      F A+ D+G+SFT+L    Y  I+  F+    D  
Sbjct: 298 IRQPHPTYNVTVTQISVGGNT-GDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKR 356

Query: 360 TSFEG-YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
              +   P++ CY  S +++  + P V L     +S+ V +P+ V+
Sbjct: 357 YQTDSELPFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVV 402


>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
 gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
          Length = 541

 Score =  240 bits (613), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 154/420 (36%), Positives = 220/420 (52%), Gaps = 39/420 (9%)

Query: 6   LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR--NATSWPAKKSFEYYQ 63
           + + +     L  +  A +V F   L HRFS  V+    ++     A  WPA+ S EYY 
Sbjct: 15  VAVAIVAVSFLVAAGDASSVGF--DLHHRFSPVVRQWAEARGHPFAAQDWPARGSPEYYS 72

Query: 64  VLLSSD---VQKQKMKTGPQFQMLFPSQGSKTMSLGND----FGWLHYTWIDIGTPNVSF 116
            L   D   + ++ +  G        + G  T + GND     G L+Y  +++GTPN +F
Sbjct: 73  ALSRHDRAVLSRRALADG--------ADGLVTFAAGNDTLQYIGSLYYAVVEVGTPNATF 124

Query: 117 LVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT 176
           LVALD GSDL W+PCDC +CA + A+        L  YSP  SSTSK ++C + LCD   
Sbjct: 125 LVALDTGSDLFWVPCDCKQCASI-ANVTGQPATALRPYSPRESSTSKQVTCDNALCDRPN 183

Query: 177 SCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLIS---GGDNALKNSVQASVIIGCGMK 232
            C       CPY + Y + NTS+SG+LV+D+LHL     G       ++QA V+ GCG  
Sbjct: 184 GCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPGAAAEAGEALQAPVVFGCGQV 243

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPAT 291
           Q+G +LDG A DGL+GLG   +SVPS+LA +GL+  +SFSMCF  D  GRI FGD G + 
Sbjct: 244 QTGTFLDGAAFDGLMGLGRENVSVPSVLASSGLVASDSFSMCFGDDGVGRINFGDSGSSG 303

Query: 292 QQSTSFLASNGKY-ITYI-IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAA 349
           Q  T F      Y +++  + VET  + +       F A++DSG+SFT+L    Y  +A 
Sbjct: 304 QGETPFTGRRTLYNVSFTAVNVETKSVAA------EFAAVIDSGTSFTYLADPEYTELAT 357

Query: 350 EFDRQVNDTITSF-----EGYPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
            F+  V +  T+F     + +P++ CY    +Q    +P V L       F V  PV  +
Sbjct: 358 NFNSLVRERRTNFSSGSADPFPFEYCYALGPNQTEALIPDVSLTTKGGARFPVTQPVIGV 417


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 143/395 (36%), Positives = 213/395 (53%), Gaps = 23/395 (5%)

Query: 27  FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTG---PQ 80
           F   + HRFS+ VK  LG+       + P K S EYY  +   D   + +++  G    Q
Sbjct: 39  FGFDIHHRFSDPVKGILGID------NIPDKGSREYYVAMAHRDRVFRGRRLADGGDVDQ 92

Query: 81  FQMLF-PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPL 139
             + F P   +  +SL   FG+LH+  + +GTP  S+LVALD GSDL W+PC+C +C   
Sbjct: 93  KLLTFSPDNTTYQISL---FGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTKCVH- 148

Query: 140 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQ-PCPYTMDYYTENTSS 198
                       N Y    SSTSK+++C+  LC+  T C +     CPY ++Y +ENTS+
Sbjct: 149 GIQLSTGQKIAFNIYDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTST 208

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
           +G LVED+LHLI+  D+  +++    +  GCG  Q+G +LDG AP+GL GLG+ ++SVPS
Sbjct: 209 TGFLVEDVLHLITDNDDQTQHA-NPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPS 267

Query: 259 LLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
           +LAK GL  NSFSMCF  D  GRI FGD   +  Q  +       + TY I V    +G 
Sbjct: 268 ILAKQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGG 327

Query: 319 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF---EGYPWKCCYKSSS 375
           +      F AI D+G+SFT+L    Y+ I   FD ++     SF   +  P++ CY   +
Sbjct: 328 NS-ADLEFNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRT 386

Query: 376 QRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVGV 410
            +  ++P++ L     +++ V +P+    G   GV
Sbjct: 387 NQTIEVPNINLTMKGGDNYFVMDPIITSGGGNNGV 421


>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
 gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
          Length = 499

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 148/383 (38%), Positives = 204/383 (53%), Gaps = 18/383 (4%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           S +  HRFS  ++    ++ R     WPA  S  Y   L   D  +     G       P
Sbjct: 30  SLEFHHRFSAPLRRWAEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGGGSGTPP 89

Query: 87  ---SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
              ++G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C P + + 
Sbjct: 90  LTFAEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAA 148

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
             S       Y P  SSTSK + C+   CDL   C    Q CPY M Y +  TSSSG LV
Sbjct: 149 SGSA----TFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLV 203

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED+L+L +  +NA    ++A +++GCG  Q+G +LD  AP+GL GLG+ E+SVPS+LA+ 
Sbjct: 204 EDVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 261

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL  NSFSMCF +D  GRI FGDQG + Q+ T  L  N ++ TY I +    IG+     
Sbjct: 262 GLTSNSFSMCFGRDGIGRISFGDQGSSDQEETP-LNINQQHPTYAITISGITIGNKP-TD 319

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCY--KSSSQRLPK 380
             F  I D+G+SFT+L    Y  I   F  QV     + +   P++ CY   SS  R P 
Sbjct: 320 LDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP- 378

Query: 381 LPSVKLMFPQNNSFVVNNPVFVI 403
           +P + L     + F V +P  VI
Sbjct: 379 IPDIILRTVSGSLFPVIDPGQVI 401


>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 498

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 145/381 (38%), Positives = 202/381 (53%), Gaps = 16/381 (4%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           S +  HRFS  ++    ++ R     WPA  S  Y   L   D  +     G       P
Sbjct: 31  SLEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGSSSDAPP 90

Query: 87  ---SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
              ++G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C P + + 
Sbjct: 91  LTFAEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAA 149

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
             S       Y P  SSTSK + C+   CDL   C    Q CPY M Y +  TSSSG LV
Sbjct: 150 SGSA----TFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLV 204

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED+L+L +  +NA    ++A +++GCG  Q+G +LD  AP+GL GLG+ E+SVPS+LA+ 
Sbjct: 205 EDVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL  NSFSMCF +D  GRI FGDQ  + Q+ T  L  N ++ TY I +    +G+     
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTD 320

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYKSSSQRLPKLP 382
             F  I D+G+SFT+L    Y  I   F  QV     + +   P++ CY  S  R P +P
Sbjct: 321 MDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSEARFP-IP 379

Query: 383 SVKLMFPQNNSFVVNNPVFVI 403
            + L     + F V +P  VI
Sbjct: 380 DIILRTVTGSMFPVIDPGQVI 400


>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 520

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 147/390 (37%), Positives = 205/390 (52%), Gaps = 16/390 (4%)

Query: 21  GAETVMFSTKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGP 79
           G  +   S +  HRFS  ++    ++ R     WPA  S  Y   L   D  +     G 
Sbjct: 24  GDASTAPSLEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGG 83

Query: 80  QFQMLFP---SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRC 136
                 P   ++G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C
Sbjct: 84  SSSDAPPLTFAEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGC 142

Query: 137 APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENT 196
            P + +   S       Y P  SSTSK + C+   CDL   C    Q CPY M Y +  T
Sbjct: 143 TPPATAASGSFQATF--YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGT 199

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
           SSSG LVED+L+L +  +NA    ++A +++GCG  Q+G +LD  AP+GL GLG+ E+SV
Sbjct: 200 SSSGFLVEDVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSV 257

Query: 257 PSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 316
           PS+LA+ GL  NSFSMCF +D  GRI FGDQ  + Q+ T  L  N ++ TY I +    +
Sbjct: 258 PSILAQKGLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITV 316

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCY--KS 373
           G+       F  I D+G+SFT+L    Y  I   F  QV     + +   P++ CY   S
Sbjct: 317 GNK-PTDMDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSS 375

Query: 374 SSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
           S  R P +P + L     + F V +P  VI
Sbjct: 376 SEARFP-IPDIILRTVTGSMFPVIDPGQVI 404


>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
           Japonica Group]
 gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
          Length = 551

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 155/393 (39%), Positives = 211/393 (53%), Gaps = 27/393 (6%)

Query: 31  LIHRFSEEVKALGVSKNRNATSWPAKK----SFEYYQVLLSSD---VQKQKMKTGPQFQM 83
           L HR+S  V+     +     SWPA      S EYY  L   D     ++ +  G     
Sbjct: 31  LHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGLVT 90

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS--A 141
              + G+ T+ L    G LHY  + +GTPN +FLVALD GSDL W+PCDC +CAPL    
Sbjct: 91  F--ADGNITLRLD---GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLT 145

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 201
           +       +L +YSPS SSTSK ++C+  LCD   +C      CPY + Y   NTSSSG 
Sbjct: 146 AVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGE 205

Query: 202 LVEDILHLI---SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
           LVED+L+L         A   +V+  V+ GCG  Q+G +LDG A DGL+GLG+ ++SVPS
Sbjct: 206 LVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPS 265

Query: 259 LLAKAGLIR-NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
           +LA  G+++ NSFSMCF KD  GRI FGD G A Q  T F+  +  +  Y I + +  +G
Sbjct: 266 ILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKS-THSYYNISITSMSVG 324

Query: 318 SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCY 371
              L    F AI DSG+SFT+L    Y      F+ Q+++   +F G      +P++ CY
Sbjct: 325 DKNLP-LGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCY 383

Query: 372 K-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
             S  Q   +LP V L       F V +PV+ I
Sbjct: 384 SLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPI 416


>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
          Length = 551

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 155/393 (39%), Positives = 211/393 (53%), Gaps = 27/393 (6%)

Query: 31  LIHRFSEEVKALGVSKNRNATSWPAKK----SFEYYQVLLSSD---VQKQKMKTGPQFQM 83
           L HR+S  V+     +     SWPA      S EYY  L   D     ++ +  G     
Sbjct: 31  LHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDGLVT 90

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS--A 141
              + G+ T+ L    G LHY  + +GTPN +FLVALD GSDL W+PCDC +CAPL    
Sbjct: 91  F--ADGNITLRLD---GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLT 145

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 201
           +       +L +YSPS SSTSK ++C+  LCD   +C      CPY + Y   NTSSSG 
Sbjct: 146 AVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGE 205

Query: 202 LVEDILHLI---SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
           LVED+L+L         A   +V+  V+ GCG  Q+G +LDG A DGL+GLG+ ++SVPS
Sbjct: 206 LVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPS 265

Query: 259 LLAKAGLIR-NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
           +LA  G+++ NSFSMCF KD  GRI FGD G A Q  T F+  +  +  Y I + +  +G
Sbjct: 266 ILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKS-THSYYNISITSMSVG 324

Query: 318 SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCY 371
              L    F AI DSG+SFT+L    Y      F+ Q+++   +F G      +P++ CY
Sbjct: 325 DKNLP-LGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCY 383

Query: 372 K-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
             S  Q   +LP V L       F V +PV+ I
Sbjct: 384 SLSPDQTTVELPIVSLTTNGGAVFPVTSPVYPI 416


>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
 gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 500

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 146/383 (38%), Positives = 203/383 (53%), Gaps = 18/383 (4%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           S +  HRFS  ++    ++ R     WPA  S  Y   L   D  +     G       P
Sbjct: 31  SLEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRHRAVSAAGGSSSDAPP 90

Query: 87  ---SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
              ++G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C P + + 
Sbjct: 91  LTFAEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAA 149

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
             S       Y P  SSTSK + C+   CDL   C    Q CPY M Y +  TSSSG LV
Sbjct: 150 SGSA----TFYIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLV 204

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED+L+L +  +NA    ++A +++GCG  Q+G +LD  AP+GL GLG+ E+SVPS+LA+ 
Sbjct: 205 EDVLYLST--ENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL  NSFSMCF +D  GRI FGDQ  + Q+ T  L  N ++ TY I +    +G+     
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTD 320

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCY--KSSSQRLPK 380
             F  I D+G+SFT+L    Y  I   F  QV     + +   P++ CY   SS  R P 
Sbjct: 321 MDFITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP- 379

Query: 381 LPSVKLMFPQNNSFVVNNPVFVI 403
           +P + L     + F V +P  VI
Sbjct: 380 IPDIILRTVTGSMFPVIDPGQVI 402


>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
          Length = 829

 Score =  234 bits (597), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 144/390 (36%), Positives = 204/390 (52%), Gaps = 20/390 (5%)

Query: 27  FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
           F   + HRFS+ VK  LGV         P K +  YY V+   D   + +++        
Sbjct: 30  FGFDIHHRFSDPVKEILGVHD------LPDKGTRLYYVVMAHRDRIFRGRRLAAAVHHSP 83

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           L     ++T  +G  FG+LH+  + +GTP +SFLVALD GSDL W+PC+C +C  +    
Sbjct: 84  LTFVPANETYQIG-AFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKC--VRGVE 140

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
            N      N Y    SSTS+ + C+  LC+L   C +    CPY ++Y +  TS++G LV
Sbjct: 141 SNGEKIAFNIYDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFLV 200

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED+LHLI+  D          +  GCG  Q+G +LDG AP+GL GLG+G  SVPS+LAK 
Sbjct: 201 EDVLHLITDDDET--KDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKE 258

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GL  NSFSMCF  D  GRI FGD     Q  T F      + TY I V    +G +    
Sbjct: 259 GLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIVGGNA-AD 316

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV---NDTITSFEGYPWKCCYKSSSQRLPK 380
             F AI DSG+SFT L    Y+ I   F+  +     + +S +  P++ CY  SS +  +
Sbjct: 317 LEFHAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKTVE 376

Query: 381 LPSVKLMFPQNNSFVVNNPVFVIYGTQVGV 410
           LP + L     ++++V +P+  I G  V +
Sbjct: 377 LP-INLTMKGGDNYLVTDPIVTISGEGVNL 405


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score =  234 bits (596), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 138/381 (36%), Positives = 206/381 (54%), Gaps = 16/381 (4%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQML 84
           F+  + H +S  V+ +         S+P + + +YY  ++ +D  V  +++      + L
Sbjct: 35  FTFNIHHLYSPAVRQI-----LPFHSFPDEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPL 89

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
               G++T+ + +  G+L+Y  + +GTP V +LVALD GSDL W+PCDCV C  ++    
Sbjct: 90  TFLSGNETLRI-SPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNT 146

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
                + N YSP+ SSTSK + CS  LC     C +P   CPY + Y ++NTSS+G LVE
Sbjct: 147 TQGPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVE 206

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           DILHL +  ++     V A + +GCG  QSG +L   AP+GL GLG+  +SVPS+LA AG
Sbjct: 207 DILHLTT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAG 264

Query: 265 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 324
           LI NSFS+CF     GRI FGD+G   Q  T F     ++ TY + +    +G   +   
Sbjct: 265 LISNSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGR-RHPTYNVSITQIGVGGH-ISDL 322

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYK-SSSQRLPKLP 382
               I DSG+SFT+L    Y   A +F   V +   T     P++ CY+ S +Q     P
Sbjct: 323 DVAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYP 382

Query: 383 SVKLMFPQNNSFVVNNPVFVI 403
            + L       FV+N+P+ +I
Sbjct: 383 LMNLTMKGGGHFVINHPIVLI 403


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 138/381 (36%), Positives = 206/381 (54%), Gaps = 16/381 (4%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQML 84
           F+  + H +S  V+ +         S+P + + +YY  ++ +D  V  +++      + L
Sbjct: 58  FTFNIHHLYSPAVRQI-----LPFHSFPDEGTLDYYAAMVRTDHFVHSRRLGQVQDHRPL 112

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
               G++T+ + +  G+L+Y  + +GTP V +LVALD GSDL W+PCDCV C  ++    
Sbjct: 113 TFLSGNETLRI-SPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNC--ITGLNT 169

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
                + N YSP+ SSTSK + CS  LC     C +P   CPY + Y ++NTSS+G LVE
Sbjct: 170 TQGPVNFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVE 229

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           DILHL +  ++     V A + +GCG  QSG +L   AP+GL GLG+  +SVPS+LA AG
Sbjct: 230 DILHLTT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAG 287

Query: 265 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 324
           LI NSFS+CF     GRI FGD+G   Q  T F     ++ TY + +    +G   +   
Sbjct: 288 LISNSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGR-RHPTYNVSITQIGVGGH-ISDL 345

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYK-SSSQRLPKLP 382
               I DSG+SFT+L    Y   A +F   V +   T     P++ CY+ S +Q     P
Sbjct: 346 DVAVIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYP 405

Query: 383 SVKLMFPQNNSFVVNNPVFVI 403
            + L       FV+N+P+ +I
Sbjct: 406 LMNLTMKGGGHFVINHPIVLI 426


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 144/374 (38%), Positives = 206/374 (55%), Gaps = 24/374 (6%)

Query: 6   LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
           + + L+VF L       +   F   + HRFS+ +K +  S+       P K +  YY  +
Sbjct: 11  MLLVLSVFILAGSLRSGDAASFKFDIHHRFSDSIKGIFHSEG-----LPEKHTPGYYATM 65

Query: 66  LSSD--VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAG 123
           +  D  V+ +++        L  + G+ T  +  D G+L+Y  + +GTP++ FLVALD G
Sbjct: 66  VHRDRLVRGRRLAASDVDTQLTFAYGNDTAFI-PDLGFLYYANVSVGTPSLDFLVALDTG 124

Query: 124 SDLLWIPCDCVRCAPLSASYYNSLDRD---LNEYSPSASSTSKHLSCSHRLCDLGTSCQN 180
           SDL W+PC+C  C     +Y N+ +     LN YSP+ S+TS  + C+  LC+  TS QN
Sbjct: 125 SDLFWLPCECSSCF----TYLNTSNGGKFMLNHYSPNDSTTSSTVPCTSSLCNRCTSNQN 180

Query: 181 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 240
               CPY M Y + NTSS G LVED+LHL +  D++L   V+A +  GCG  Q+G +   
Sbjct: 181 V---CPYEMRYLSANTSSIGYLVEDVLHLAT--DDSLLKPVEAKITFGCGTVQTGIFATT 235

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS 300
            AP+GLIGLG+ +ISVPS LA  GL  NSFSMCF  D  GRI FGD GPA Q+ T F  +
Sbjct: 236 AAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDTGPADQKQTPF-NT 294

Query: 301 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
             +Y +Y +      +G        F AI DSG+SFT+L +  Y TI  + D  +     
Sbjct: 295 MLEYQSYNVTFNVINVGGEP-NDVPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRY 353

Query: 361 SFEG--YPWKCCYK 372
           S  G  +P++ CY+
Sbjct: 354 SLFGPNFPFEYCYE 367


>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 417

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 128/304 (42%), Positives = 175/304 (57%), Gaps = 7/304 (2%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           LHYT + +GTP   F+VALD GSDL W+PCDC RCAP   S Y S D +L+ YSP  SST
Sbjct: 3   LHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYAS-DFELSVYSPKKSST 61

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           SK + C++ LC     C      CPY + Y +  TS++G+L+ED+LHL +  +N     +
Sbjct: 62  SKTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKT--ENKHSEPI 119

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
           QA +  GCG  QSG +LD  AP+GL GLG+ +ISVPS+L++ GL+ NSFSMCF  D  GR
Sbjct: 120 QAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGR 179

Query: 282 IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 341
           I FGD+G   Q+ T F   N  +  Y I V +  +G++ L      A+ DSG+SF++   
Sbjct: 180 INFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRVGTT-LIDADITALFDSGTSFSYFTD 237

Query: 342 EVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNP 399
            +Y  ++A F  Q  D         P++ CY  S      L P + L       F V +P
Sbjct: 238 PIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVYDP 297

Query: 400 VFVI 403
           + VI
Sbjct: 298 IIVI 301


>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
          Length = 335

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 137/324 (42%), Positives = 189/324 (58%), Gaps = 18/324 (5%)

Query: 33  HRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKT 92
           HR+S  V+     +       P   + EYY  L   D++++ +  G +      + G+ T
Sbjct: 28  HRYSATVREWAGHRA------PPAGTAEYYAALAGHDLRRRSLAGGGEVAF---ADGNDT 78

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
             L N+ G+LHY  + +GTPNV+FLVALD GSDL W+PCDC+ CAPL +  Y  L  D  
Sbjct: 79  YRL-NELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSPNYRDLKFD-- 135

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
            YSP  SSTS+ + CS  LCD  ++C++    CPY++ Y ++NTSS+G+LVED+L+L++ 
Sbjct: 136 TYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVTE 195

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL-IRNSFS 271
                K  V A +  GCG  Q+G +L   AP+GL+GLG+  ISVPSLLA  G+   NSFS
Sbjct: 196 YGRQPK-IVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVAAANSFS 254

Query: 272 MCFDKDDSGRIFFGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 330
           MCF +D  GRI FGD G + QQ T   +     Y  Y I +    +GS  +  T F AIV
Sbjct: 255 MCFAQDGHGRINFGDTGSSDQQETPLNMYKQNPY--YNISITGATVGSKSI-HTKFNAIV 311

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQ 354
           DSG+SFT L   +Y  I +    Q
Sbjct: 312 DSGTSFTALSDPMYTQITSSVSVQ 335


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score =  230 bits (587), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 143/397 (36%), Positives = 209/397 (52%), Gaps = 22/397 (5%)

Query: 6   LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
           + + L+VF+L           F   + HRFS+ +K +  S+       P K +  YY  +
Sbjct: 11  MLLVLSVFFLAGGLRSGHAASFKFTIHHRFSDSIKEIFGSEG-----LPEKHTPGYYAAM 65

Query: 66  LSSD--VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAG 123
           +  D  +  + + T      L  S G++T  L +  G L+Y  + IGTP + FLVALD G
Sbjct: 66  VHRDRLLHGRNLATTNGDTPLMFSYGNETYEL-SGLGNLYYANVSIGTPGLYFLVALDTG 124

Query: 124 SDLLWIPCDCVRCAPLSASYYNSLDRD---LNEYSPSASSTSKHLSCSHRLCDLGTSCQN 180
           SDL W+PC+C +C     +Y    D     LN YS +ASSTS  + CS  LC+L   C +
Sbjct: 125 SDLFWLPCECTKCP----TYLTKRDNGKFWLNHYSSNASSTSIRVPCSSSLCELANQCSS 180

Query: 181 PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 240
            K  CPY   Y +EN+SS+G LV+DILH+ +  D++    V   V +GCG  Q+G + + 
Sbjct: 181 NKSSCPYQTHYLSENSSSAGYLVQDILHMAT--DDSQLKPVDVKVTLGCGKVQTGKFSNV 238

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS 300
            AP+GLIGLG+G++SVPS LA  GL  +SFSMCF     GRI FGD GP  Q+ T F  +
Sbjct: 239 TAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGYGRIDFGDIGPVGQRETPFNPA 298

Query: 301 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTI 359
           +  Y   I+ +    I ++        AI+DSG+SFT+L    Y  I    D  +  + I
Sbjct: 299 SLSYNVTILQI----IVTNRPTNVHLTAIIDSGASFTYLTDPFYSIITENMDAAMELERI 354

Query: 360 TSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
            S   +P++ CY+ S   + + P++         F V
Sbjct: 355 KSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGRKFDV 391


>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
 gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
          Length = 530

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 141/373 (37%), Positives = 205/373 (54%), Gaps = 28/373 (7%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQ-----------KM 75
           S +  HRFS  V+    ++       WP   S +Y   L   D ++              
Sbjct: 34  SLEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGD 93

Query: 76  KTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR 135
           K  P    L  S+G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  
Sbjct: 94  KPPP----LTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDG 148

Query: 136 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTEN 195
           C P +++   S       Y PS SSTS+ + C+ + C+L   C    Q CPY M Y + +
Sbjct: 149 CTPPASAASGSASF----YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSAD 203

Query: 196 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 255
           TSSSG LVED+L+L +  ++A+   ++A ++ GCG  Q+G +LD  AP+GL GLG+  IS
Sbjct: 204 TSSSGFLVEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMIS 261

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
           +PS+LA+ GL  NSF+MCF +D  GRI FGDQG + Q+ T  L  N ++ TY I +    
Sbjct: 262 IPSILAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEIT 320

Query: 316 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-S 373
           +G+S L    F  I D+G+SFT+L    Y  I   F  QV+    + +   P++ CY  S
Sbjct: 321 VGNS-LTDLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLS 379

Query: 374 SSQRLPKLPSVKL 386
           SS+   + PS+ L
Sbjct: 380 SSEDRIQTPSISL 392


>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
          Length = 530

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 141/373 (37%), Positives = 205/373 (54%), Gaps = 28/373 (7%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQ-----------KM 75
           S +  HRFS  V+    ++       WP   S +Y   L   D ++              
Sbjct: 34  SLEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGD 93

Query: 76  KTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR 135
           K  P    L  S+G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  
Sbjct: 94  KPPP----LTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDG 148

Query: 136 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTEN 195
           C P +++   S       Y PS SSTS+ + C+ + C+L   C    Q CPY M Y + +
Sbjct: 149 CTPPASAASGSASF----YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSAD 203

Query: 196 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 255
           TSSSG LVED+L+L +  ++A+   ++A ++ GCG  Q+G +LD  AP+GL GLG+  IS
Sbjct: 204 TSSSGFLVEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMIS 261

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
           +PS+LA+ GL  NSF+MCF +D  GRI FGDQG + Q+ T  L  N ++ TY I +    
Sbjct: 262 IPSILAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEIT 320

Query: 316 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-S 373
           +G+S L    F  I D+G+SFT+L    Y  I   F  QV+    + +   P++ CY  S
Sbjct: 321 VGNS-LTDLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLS 379

Query: 374 SSQRLPKLPSVKL 386
           SS+   + PS+ L
Sbjct: 380 SSEDRIQTPSISL 392


>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
          Length = 530

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 141/373 (37%), Positives = 205/373 (54%), Gaps = 28/373 (7%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQ-----------KM 75
           S +  HRFS  V+    ++       WP   S +Y   L   D ++              
Sbjct: 34  SLEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGD 93

Query: 76  KTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR 135
           K  P    L  S+G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  
Sbjct: 94  KPPP----LTFSEGNATLKVSN-LGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDG 148

Query: 136 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTEN 195
           C P +++   S       Y PS SSTS+ + C+ + C+L   C    Q CPY M Y + +
Sbjct: 149 CTPPASAASGSASF----YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSAD 203

Query: 196 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 255
           TSSSG LVED+L+L +  ++A+   ++A ++ GCG  Q+G +LD  AP+GL GLG+  IS
Sbjct: 204 TSSSGFLVEDVLYLST--EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMIS 261

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
           +PS+LA+ GL  NSF+MCF +D  GRI FGDQG + Q+ T  L  N ++ TY I +    
Sbjct: 262 IPSILAQKGLTSNSFAMCFSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEMT 320

Query: 316 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-S 373
           +G+S L    F  I D+G+SFT+L    Y  I   F  QV+    + +   P++ CY  S
Sbjct: 321 VGNS-LTDLEFSTIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLS 379

Query: 374 SSQRLPKLPSVKL 386
           SS+   + PS+ L
Sbjct: 380 SSEDRIQTPSISL 392


>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 544

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 137/381 (35%), Positives = 205/381 (53%), Gaps = 18/381 (4%)

Query: 27  FSTKLIHRFSEEV-KALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
           F   + HRFS+ V + LG+    N    P K + +YY  ++  D     +++       +
Sbjct: 39  FGLDIHHRFSDPVTEILGIG---NDELLPHKGTPQYYAAMVHRDRVFHGRRLADDRDTPI 95

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
            F + G++T  +   FG+LH+  + +GTP + FLVALD GSDL W+PC+C  C       
Sbjct: 96  TF-AAGNETHQIAA-FGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSCV-RGLKT 152

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
            N    DLN Y    SST K++ C+  +C   T C +    C Y ++Y + +TSSSG LV
Sbjct: 153 QNGKVIDLNIYELDKSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSSGFLV 211

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           ED+LHLI+  DN     +   + IGCG  Q+G +L+G AP+GL GLG+  +SVPS+LA+ 
Sbjct: 212 EDVLHLIT--DNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILAQK 269

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           GLI +SFSMCF  D SGRI FGD G + Q  T F      + TY + +    +G      
Sbjct: 270 GLISDSFSMCFGSDGSGRITFGDTGSSDQGKTPFNLRE-SHPTYNVTITQIIVGGYAADH 328

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV----NDTITSFEGYPWKCCYKSSSQRLP 379
             F AI DSG+SFT+L    Y  I+ +F+  V    +  ++     P++ CY  S  +  
Sbjct: 329 -EFHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQTI 387

Query: 380 KLPSVKLMFPQNNSFVVNNPV 400
           ++P + L     + + V +P+
Sbjct: 388 EVPFLNLTMKGGDDYYVTDPI 408


>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 142/379 (37%), Positives = 203/379 (53%), Gaps = 13/379 (3%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           S +  HRFS  V+    S+       WP+   F Y   L   D  +     G +  + F 
Sbjct: 24  SLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRHRALSAAGGRPPLTF- 82

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           S+G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C   +    ++
Sbjct: 83  SEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPPPSSA 138

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
                + Y PS SSTS+ + C+   C L   C      CPY M Y + +TSSSG LVED+
Sbjct: 139 ASAPASFYIPSLSSTSQAVPCNSDFCGLRKECSKTSS-CPYKMVYVSADTSSSGFLVEDV 197

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           L+L +  ++     ++A ++ GCG  Q+G +LD  AP+GL GLG+  ISVPS+LA+ GL 
Sbjct: 198 LYLST--EDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLT 255

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
            NSFSMCF +D  GRI FGDQG + Q+ T  L  N K+ TY I +    +G++ L     
Sbjct: 256 SNSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEV 313

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSV 384
             I D+G+SFT+L    Y  I   F  QV     + +   P++ CY  SSS+   + PS+
Sbjct: 314 STIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSI 373

Query: 385 KLMFPQNNSFVVNNPVFVI 403
            L     + F   +P  VI
Sbjct: 374 SLRTVGGSLFPAIDPGQVI 392


>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 142/379 (37%), Positives = 203/379 (53%), Gaps = 13/379 (3%)

Query: 28  STKLIHRFSEEVKALGVSKNRN-ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           S +  HRFS  V+    S+       WP+   F Y   L   D  +     G +  + F 
Sbjct: 24  SLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRHRALSAAGGRPPLTF- 82

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           S+G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C   +    ++
Sbjct: 83  SEGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGC---TPPPSSA 138

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
                + Y PS SSTS+ + C+   C L   C      CPY M Y + +TSSSG LVED+
Sbjct: 139 ASAPASFYIPSLSSTSQAVPCNSDFCGLRKECSKTSS-CPYKMVYVSADTSSSGFLVEDV 197

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           L+L +  ++     ++A ++ GCG  Q+G +LD  AP+GL GLG+  ISVPS+LA+ GL 
Sbjct: 198 LYLST--EDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLT 255

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
            NSFSMCF +D  GRI FGDQG + Q+ T  L  N K+ TY I +    +G++ L     
Sbjct: 256 SNSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEV 313

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSV 384
             I D+G+SFT+L    Y  I   F  QV     + +   P++ CY  SSS+   + PS+
Sbjct: 314 STIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSI 373

Query: 385 KLMFPQNNSFVVNNPVFVI 403
            L     + F   +P  VI
Sbjct: 374 SLRTVGGSLFPAIDPGQVI 392


>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
 gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
          Length = 575

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 148/417 (35%), Positives = 217/417 (52%), Gaps = 39/417 (9%)

Query: 17  TESSGAETVMFSTKLIHRFSEEVK-----ALGVSKNRNATSW------PAKKSFEYYQVL 65
           TE+SG         L HRFS  V+     A G       +SW      PA  S EYY  L
Sbjct: 24  TEASGG----IGFNLHHRFSPVVRQWMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSAL 79

Query: 66  LSSD----VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           L  D     +++ + +    Q    +      +  + + +LHY  +++GTP+  FLVALD
Sbjct: 80  LRHDRALFTRRRGLASAADGQSTTLTFADGNATRLDTYEYLHYAEVEVGTPSSKFLVALD 139

Query: 122 AGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 181
            GSDL W+PC+C  CA   ++ Y          SPS SSTSK + C H LC+   +C   
Sbjct: 140 TGSDLFWLPCECKLCAKNGSTMY----------SPSLSSTSKTVPCGHPLCERPDACATA 189

Query: 182 KQP---CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 238
            +    CPY + Y + NT SSG+LVED+LHL+ GG      +VQA ++ GCG  Q+G +L
Sbjct: 190 GKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAFL 249

Query: 239 DGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQGPATQQSTSF 297
            G A  GL+GLGL ++SVPS LA +GL+  +SFSMCF +D  GRI FGD G   Q  T  
Sbjct: 250 RGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETPL 309

Query: 298 LASNGKYITYI-IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           +A+     +Y  I V    + S  +    F A+VDSG+SFT+L    Y  +   F+ +V+
Sbjct: 310 IAAGSLQPSYYNISVGAITVDSKAMA-VEFTAVVDSGTSFTYLDDPAYTFLTTNFNSRVS 368

Query: 357 DTITSF-EGYP-WKCCYKSSSQR--LPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVG 409
           +   ++  GY  ++ CY+ S  +  + +LP++ L       F +  P+  +  +  G
Sbjct: 369 EASETYGSGYEKFEFCYRLSPGQTSMKRLPAMSLTTKGGAVFPITWPIIPVLASTNG 425


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 135/318 (42%), Positives = 183/318 (57%), Gaps = 25/318 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCA-PLSASYYNSLDRDLNEYSPSASS 160
           LHY  + +GTP+  F+VALD GSDL W+PCDC  C   L A   +SLD  LN YSP+ASS
Sbjct: 54  LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLD--LNIYSPNASS 111

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           TS  + C+  LC  G  C +P+  CPY + Y +  TSS+G+LVED+LHL+S  ++    +
Sbjct: 112 TSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVS--NDKSSKA 169

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
           + A V  GCG  Q+G + DG AP+GL GLGL +ISVPS+LAK G+  NSFSMCF  D +G
Sbjct: 170 IPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAG 229

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 340
           RI FGD+G   Q+ T  L     + TY I V    +G +      F A+ DSG+SFT+L 
Sbjct: 230 RISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAVFDSGTSFTYLT 287

Query: 341 KEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLP-------------KLPSVK 385
              Y  I+  F+    D    T+    P++ CY   + RLP             + P+V 
Sbjct: 288 DAAYTLISESFNSLALDKRYQTTDSELPFEYCY---ALRLPLYSGHHHPNKDSFQYPAVN 344

Query: 386 LMFPQNNSFVVNNPVFVI 403
           L     +S+ V +P+ VI
Sbjct: 345 LTMKGGSSYPVYHPLVVI 362


>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 140/391 (35%), Positives = 205/391 (52%), Gaps = 22/391 (5%)

Query: 27  FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
           F   + HRFS+ VK  LGV         P K + +YY  +   D   + +++  G    +
Sbjct: 30  FGFDIHHRFSDPVKEILGVHD------LPDKGTRQYYVAMAHRDRIFRGRRLAAGYHSPL 83

Query: 84  LF-PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
            F PS  +  +     FG+LH+  + +GTP +SFLVALD GSDL W+PC+C +C      
Sbjct: 84  TFIPSNETYQIEA---FGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVH-GIG 139

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
             N      N Y    SSTS+ + C+  LC+L   C +    CPY ++Y +  TS++G L
Sbjct: 140 LSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGTSTTGFL 199

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           VED+LHLI+  D       +  +  GCG  Q+G +LDG AP+GL GLG+   SVPS+LAK
Sbjct: 200 VEDVLHLITDDDKTKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAK 257

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            GL  NSFSMCF  D  GRI FGD     Q  T F      + TY I V    +G   + 
Sbjct: 258 EGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIVGEK-VD 315

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV---NDTITSFEGYPWKCCYKSSSQRLP 379
              F AI DSG+SFT+L    Y+ I   F+ ++     + +S    P++ CY+ S  +  
Sbjct: 316 DLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYELSPNQTV 375

Query: 380 KLPSVKLMFPQNNSFVVNNPVFVIYGTQVGV 410
           +L S+ L     ++++V +P+  + G  + +
Sbjct: 376 EL-SINLTMKGGDNYLVTDPIVTVSGEGINL 405


>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 133/391 (34%), Positives = 199/391 (50%), Gaps = 19/391 (4%)

Query: 27  FSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
           F   + HRFS+++K  LG+         P K + +YY V+   D   + +++        
Sbjct: 33  FGFDIHHRFSDQIKGMLGIDD------VPQKGTPQYYAVMAHRDRVFRGRRLAGADHHSP 86

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           L  + G+ T  + +  G+LH+  + +GTP + FLVALD GSDL W+PCDC+ C       
Sbjct: 87  LTFAAGNDTHQIASS-GFLHFANVSVGTPPLWFLVALDTGSDLFWLPCDCISCVHGGLRT 145

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHR-LCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
                   N Y    SSTS  +SC++   C     C +    C Y +DY + +TSS G +
Sbjct: 146 RTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFV 205

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           VED+LHLI+  D+         +  GCG  Q+G +L+G AP+GL GLG+  ISVPS+LA+
Sbjct: 206 VEDVLHLIT--DDDQTKDADTRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILAR 263

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            GLI NSFSMCF  D +GRI FGD G   Q+ T F      + TY I +    +  S + 
Sbjct: 264 EGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPFNVRK-LHPTYNITITKIIVEDS-VA 321

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE----GYPWKCCYKSSSQRL 378
              F AI DSG+SFT++    Y  I   ++ +V     S +      P+  CY  S  + 
Sbjct: 322 DLEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQT 381

Query: 379 PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVG 409
            ++P + L     + + V +P+  +   + G
Sbjct: 382 IEVPFLNLTMKGGDDYYVMDPIIQVSSEEEG 412


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 129/323 (39%), Positives = 177/323 (54%), Gaps = 23/323 (7%)

Query: 99  FGW-LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPS 157
           FG+ LHY  + +GTP+VSFLVALD GS+LLW+PCDC  C     S   ++D  LN YSP+
Sbjct: 57  FGYILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVD--LNIYSPN 114

Query: 158 ASSTSKHLSCSHRLCDLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
            SSTS+ + C+  LC       C + +  CPY + Y +  TS++G +V+D+LHLIS  D+
Sbjct: 115 TSSTSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLIS--DD 172

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           +   +V A +  GCG  Q+G +L G AP+GL GLG+  ISVPS LA  G    SFSMCF 
Sbjct: 173 SQSKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFS 232

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 335
            +  GRI FGD+G   Q  TSF     +   Y I +    IG        + AI DSG+S
Sbjct: 233 PNGIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQA-SDLVYSAIFDSGTS 291

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS---------------QRLPK 380
           FT+L    Y  IA  F++ V +T  S    P+  CY   S               Q  P 
Sbjct: 292 FTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILPFSCAYANQTEPT 351

Query: 381 LPSVKLMFPQNNSFVVNNPVFVI 403
           +P+V L+    + F V +P+ ++
Sbjct: 352 IPAVTLVMSGGDYFNVTDPIVLV 374


>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 217

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 105/201 (52%), Positives = 136/201 (67%), Gaps = 11/201 (5%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S++++HR S+E +   +        WP + S EYY+ L+ SD+Q+QK +      +L  S
Sbjct: 28  SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           +G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS  Y  +L
Sbjct: 81  KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG-YRGNL 139

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           DRDL  Y P+ S+TS+HL CSH LC     C NPKQPCPY +DY++ENT+SSGLL+ED L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 208 HLISGGDNALKNSVQASVIIG 228
           HL    D+     V ASVIIG
Sbjct: 200 HLNYREDHV---PVNASVIIG 217


>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 510

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 191/361 (52%), Gaps = 13/361 (3%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S +  HRFS  ++    ++               Y   L+   + + +       + F S
Sbjct: 29  SLEFHHRFSARLRGWADARGHELPGGWPPPGGAAYVAALAGHDRHRALAAADHPPLTF-S 87

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
           +G+ T+ + N  G+LHY  + +GTP  +F+VALD GSDL W+PC C  C P ++    S 
Sbjct: 88  EGNATLKVSN-LGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGASGSA 146

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
               + Y PS SSTS+ + C+   CD    C      CPY M Y + +TSSSG LVED+L
Sbjct: 147 ----SFYIPSMSSTSQAVPCNSDFCDHRKDCSTTSS-CPYKMVYVSADTSSSGFLVEDVL 201

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           +L S  DN     ++A ++ GCG  Q+G +LD  AP+GL GLG+  ISVPS+LA  GL  
Sbjct: 202 YL-STEDNH-PQILKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTS 259

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           +SFSMCF +D  GRI FGDQG + Q+ T  L  N K+ TY I +    +G+  +    F 
Sbjct: 260 DSFSMCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGITVGTEPM-DLEFS 317

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKCCYK-SSSQRLPKLPSVK 385
            I D+G++FT+L    Y  I   F  QV     + +   P++ CY  SSS+   + P V 
Sbjct: 318 TIFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSSSEARIQTPGVS 377

Query: 386 L 386
            
Sbjct: 378 F 378


>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
 gi|219887047|gb|ACL53898.1| unknown [Zea mays]
 gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 416

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 127/305 (41%), Positives = 172/305 (56%), Gaps = 13/305 (4%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           LHY  + +GTP  +F+VALD GSDL W+PC C  C P + +   S       Y P  SST
Sbjct: 6   LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSA----TFYIPGMSST 61

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           SK + C+   CDL   C    Q CPY M Y +  TSSSG LVED+L+L +  +NA    +
Sbjct: 62  SKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLST--ENAHPQIL 118

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
           +A +++GCG  Q+G +LD  AP+GL GLG+ E+SVPS+LA+ GL  NSFSMCF +D  GR
Sbjct: 119 KAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGR 178

Query: 282 IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 341
           I FGDQ  + Q+ T  L  N ++ TY I +    +G+       F  I D+G+SFT+L  
Sbjct: 179 ISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDMDFITIFDTGTSFTYLAD 236

Query: 342 EVYETIAAEFDRQVNDTITSFEG-YPWKCCYK--SSSQRLPKLPSVKLMFPQNNSFVVNN 398
             Y  I   F  QV     + +   P++ CY   SS  R P +P + L     + F V +
Sbjct: 237 PAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFP-IPDIILRTVTGSMFPVID 295

Query: 399 PVFVI 403
           P  VI
Sbjct: 296 PGQVI 300


>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
 gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 143/395 (36%), Positives = 213/395 (53%), Gaps = 22/395 (5%)

Query: 18  ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQK 74
           E+SG     FS ++ H FS+ VK +LG+         P K S EY++VL   D  ++ + 
Sbjct: 24  EASGK----FSFEVHHMFSDRVKQSLGLDD-----LVPEKGSLEYFKVLAQRDRLIRGRG 74

Query: 75  MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC- 133
           + +  +   +   +G++T+S+ +  G+LHY  + +GTP   FLVALD GSDL W+PC+C 
Sbjct: 75  LASNNEETPITFMRGNRTISI-DLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCG 133

Query: 134 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT 193
             C         S  R LN YSP+ SSTS  + CS   C   + C +P   CPY + Y +
Sbjct: 134 STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLS 193

Query: 194 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 253
           ++T ++G L ED+LHL++  D  L+  V+A++ +GCG  Q+G      A +GL+GLGL +
Sbjct: 194 KDTFTTGTLFEDVLHLVT-EDEGLE-PVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKD 251

Query: 254 ISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV 311
            SVPS+LAKA +  NSFSMCF    D  GRI FGD+G   Q  T  L +     TY + V
Sbjct: 252 YSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS-PTYAVSV 310

Query: 312 ETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCC 370
               +G   +      A+ D+G+SFT L +  Y  I   FD  V D     +   P++ C
Sbjct: 311 TEVSVGGDAVG-VQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFC 369

Query: 371 YKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIY 404
           Y  S  +   L P V + F   +   + NP+F+++
Sbjct: 370 YDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVW 404


>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
          Length = 335

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 107/249 (42%), Positives = 155/249 (62%), Gaps = 7/249 (2%)

Query: 117 LVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT 176
           +VALD GSDL W+PCDC +CAP   + Y S + +L+ Y+P  S+T+K ++C++ LC    
Sbjct: 1   MVALDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRN 59

Query: 177 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 236
            C      CPY + Y +  TS+SG+L+ED++HL +   N  +  V+A V  GCG  QSG 
Sbjct: 60  QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGS 117

Query: 237 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTS 296
           +LD  AP+GL GLG+ +ISVPS+LA+ GL+ +SFSMCF  D  GRI FGD+G + Q+ T 
Sbjct: 118 FLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETP 177

Query: 297 FLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI--AAEFDRQ 354
           F   N  +  Y I V    +G++ L    F A+ D+G+SFT+L   +Y T+  +A+  R 
Sbjct: 178 F-NLNPSHPNYNITVTRVRVGTT-LIDDEFTALFDTGTSFTYLVDPMYTTVSESAQDKRH 235

Query: 355 VNDTITSFE 363
             D+   FE
Sbjct: 236 SPDSRIPFE 244


>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 530

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 138/384 (35%), Positives = 204/384 (53%), Gaps = 24/384 (6%)

Query: 4   ISLTIYLAVFWLLT--ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFE 60
           + L++ + +FW L   E+SG     FS ++ H FS+ VK  LG          P   S E
Sbjct: 9   VLLSMLVLIFWGLERCEASGK----FSFEVHHMFSDVVKQTLGFDD-----LVPENGSLE 59

Query: 61  YYQVLLSSD--VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLV 118
           Y++VL   D  ++ + + +  +   L     + T++L N  G+LHY  + +GTP   FLV
Sbjct: 60  YFKVLAHRDRFIRGRGLASNNEETPLTSIGSNLTLAL-NFLGFLHYANVSLGTPATWFLV 118

Query: 119 ALDAGSDLLWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS 177
           ALD GSDL W+PC+C   C         S    LN Y+P+AS+TS  + CS + C     
Sbjct: 119 ALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGK 178

Query: 178 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 237
           C +P+  CPY +   + NT ++G L++D+LHL++  D  LK  V A+V +GCG  Q+G +
Sbjct: 179 CSSPESICPYQI-ALSSNTVTTGTLLQDVLHLVTE-DEDLK-PVNANVTLGCGQNQTGAF 235

Query: 238 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQST 295
              +A +G++GL + E SVPSLLAKA +  NSFSMCF +  S  GRI FGD+G   Q+ T
Sbjct: 236 QTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEET 295

Query: 296 SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
             L S      Y + V    +G   +    F A+ D+GSSFT L +  Y      FD  +
Sbjct: 296 P-LVSLETSTAYGVNVTGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLM 353

Query: 356 NDTITSFE-GYPWKCCYKSSSQRL 378
            D     +  +P++ CY    + L
Sbjct: 354 EDKRRPVDPDFPFEFCYDLREEHL 377


>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
          Length = 519

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 142/393 (36%), Positives = 210/393 (53%), Gaps = 28/393 (7%)

Query: 18  ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQK 74
           E+SG     FS ++ H FS+ VK +LG+         P K S EY++VL   D  ++ + 
Sbjct: 24  EASGK----FSFEVHHMFSDRVKQSLGLDD-----LVPEKGSLEYFKVLAQRDRLIRGRG 74

Query: 75  MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC- 133
           + +  +   +   +G++T+S+ +  G+LHY  + +GTP   FLVALD GSDL W+PC+C 
Sbjct: 75  LASNNEETPITFMRGNRTISI-DLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCG 133

Query: 134 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT 193
             C         S  R LN YSP+ SSTS  + CS   C   + C +P   CPY + Y +
Sbjct: 134 STCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLS 193

Query: 194 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 253
           ++T ++G L ED+LHL++  D  L+  V+A++ +GCG  Q+G      A +GL+GLGL +
Sbjct: 194 KDTFTTGTLFEDVLHLVT-EDEGLE-PVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKD 251

Query: 254 ISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV 311
            SVPS+LAKA +  NSFSMCF    D  GRI FGD+G   Q  T  L +        +G 
Sbjct: 252 YSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVGG 311

Query: 312 ETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCC 370
           +   +G   L      A+ D+G+SFT L +  Y  I   FD  V D     +   P++ C
Sbjct: 312 DA--VGVQLL------ALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFC 363

Query: 371 YKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFV 402
           Y  S  +   L P V + F   +   + NP+F+
Sbjct: 364 YDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFI 396


>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
          Length = 518

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 137/380 (36%), Positives = 201/380 (52%), Gaps = 24/380 (6%)

Query: 8   IYLAVFWLLT--ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQV 64
           + + +FW L   E+SG     FS ++ H FS+ VK  LG          P   S EY++V
Sbjct: 1   MLVLIFWGLERCEASGK----FSFEVHHMFSDVVKQTLGFDD-----LVPENGSLEYFKV 51

Query: 65  LLSSD--VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
           L   D  ++ + + +  +   L     + T++L N  G+LHY  + +GTP   FLVALD 
Sbjct: 52  LAHRDRFIRGRGLASNNEETPLTSIGSNLTLAL-NFLGFLHYANVSLGTPATWFLVALDT 110

Query: 123 GSDLLWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 181
           GSDL W+PC+C   C         S    LN Y+P+AS+TS  + CS + C     C +P
Sbjct: 111 GSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSP 170

Query: 182 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV 241
           +  CPY +   + NT ++G L++D+LHL++  D  LK  V A+V +GCG  Q+G +   +
Sbjct: 171 ESICPYQI-ALSSNTVTTGTLLQDVLHLVTE-DEDLK-PVNANVTLGCGQNQTGAFQTDI 227

Query: 242 APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLA 299
           A +G++GL + E SVPSLLAKA +  NSFSMCF +  S  GRI FGD+G   Q+ T  L 
Sbjct: 228 AVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETP-LV 286

Query: 300 SNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
           S      Y + V    +G   +    F A+ D+GSSFT L +  Y      FD  + D  
Sbjct: 287 SLETSTAYGVNVTGVSVGGVPVDVPLF-ALFDTGSSFTLLLESAYGVFTKAFDDLMEDKR 345

Query: 360 TSFE-GYPWKCCYKSSSQRL 378
              +  +P++ CY    + L
Sbjct: 346 RPVDPDFPFEFCYDLREEHL 365


>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 531

 Score =  191 bits (485), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 132/383 (34%), Positives = 203/383 (53%), Gaps = 19/383 (4%)

Query: 27  FSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM 83
           F  ++ H FS+ VK +LG+         P + S EY++VL   D  ++ + + +  +   
Sbjct: 29  FGFEVHHIFSDAVKQSLGLDD-----LVPEQGSLEYFKVLAHRDRLIRGRGLASNNEDTP 83

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC-VRCAPLSAS 142
           +    G+ T+S+    G L+Y  + +GTP  SFLVALD GSDL W+PC+C   C      
Sbjct: 84  VTFDGGNLTVSI-KLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLED 142

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
                   LN Y+P+AS+TS  + CS + C     C +PK  CPY + Y + +T ++G L
Sbjct: 143 IGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISY-SNSTGTTGTL 201

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           ++D+LHL +  +N     V+ +V +GCG KQ+G +    + +G++GLG+   SVPSLLAK
Sbjct: 202 LQDVLHLATEDENL--TPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAK 259

Query: 263 AGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
           A +  +SFSMCF +   + GRI FGD+G   Q+ T F+ S      Y + V    +G   
Sbjct: 260 ANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFI-SVAPSTAYGLNVTGVSVGGDP 318

Query: 321 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLP 379
           +    F A  D+GSSFT L +  Y  +   FD  V D     +   P++ CY  S     
Sbjct: 319 VGTRLF-AKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATS 377

Query: 380 -KLPSVKLMFPQNNSFVVNNPVF 401
            + P V++ F   +  ++NNP F
Sbjct: 378 IEFPFVEMTFVGGSKIILNNPFF 400


>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 530

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 146/403 (36%), Positives = 214/403 (53%), Gaps = 26/403 (6%)

Query: 13  FWLLT--ESSGAETVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD 69
           FW L   E+SG     FS ++ H FS+ VK  LG+         P K S EY++VL   D
Sbjct: 18  FWGLERCEASGK----FSFEVHHMFSDRVKQTLGLDD-----LVPEKGSLEYFKVLAQRD 68

Query: 70  --VQKQKMKTGPQFQMLFPSQGSKTMSLGNDF-GWLHYTWIDIGTPNVSFLVALDAGSDL 126
             ++ + + +  +   +   +G++T+S+  DF G+LHY  + +GTP   FLVALD GS+L
Sbjct: 69  RLIRGRGLASNNEETPITFMRGNRTVSI--DFLGFLHYANVSVGTPATWFLVALDTGSNL 126

Query: 127 LWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPC 185
            W+PC+C   C         S  R LN YSP+ SSTS  + C+   C   + C +P   C
Sbjct: 127 FWLPCNCGSTCIRDLKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSC 186

Query: 186 PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDG 245
           PY + Y +++T ++G L ED+LHL++  D  LK  V+A++ +GCG  Q+G      A +G
Sbjct: 187 PYQIQYLSKDTFTTGTLFEDVLHLVT-EDVDLK-PVKANITLGCGRNQTGFLQSSAAING 244

Query: 246 LIGLGLGEISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGK 303
           L+GLG+ + SVPS+LAKA +  NSFSMCF    D  GRI FGD+G   Q  T  L +   
Sbjct: 245 LLGLGMKDYSVPSILAKAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPTEPS 304

Query: 304 YITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 363
             TY + V T       +      A+ D+G+SFT L +  Y  I   FD  V D     +
Sbjct: 305 -PTYAVNV-TEVSVGGDVVGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPID 362

Query: 364 -GYPWKCCYKSSSQRLPKL-PSVKLMFPQNNSFVVNNPVFVIY 404
              P++ CY  S      L P V + F   +   + NP+F+++
Sbjct: 363 PEIPFEFCYDLSPNSTTILFPRVAMTFEGGSLMFLRNPLFIVW 405


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 132/386 (34%), Positives = 202/386 (52%), Gaps = 19/386 (4%)

Query: 24  TVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQ 80
           T  F  ++ H FS+ VK +LG+         P + S EY++VL   D  ++ + + +   
Sbjct: 26  TGKFGFEVHHIFSDSVKQSLGLGD-----LVPEQGSLEYFKVLAHRDRLIRGRGLASNND 80

Query: 81  FQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC-VRCAPL 139
              +    G+ T+S+    G L+Y  + +GTP  SFLVALD GSDL W+PC+C   C   
Sbjct: 81  ETPITFDGGNLTVSV-KLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRD 139

Query: 140 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 199
                      LN Y+P+AS+TS  + CS + C     C +P   CPY + Y + +T + 
Sbjct: 140 LEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTK 198

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G L++D+LHL +  +N     V+A+V +GCG KQ+G +    + +G++GLG+   SVPSL
Sbjct: 199 GTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSL 256

Query: 260 LAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
           LAKA +  NSFSMCF +   + GRI FGD+G   Q+ T F+ S      Y + +    + 
Sbjct: 257 LAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFI-SVAPSTAYGVNISGVSVA 315

Query: 318 SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYK-SSS 375
              +    F A  D+GSSFT L +  Y  +   FD  V D     +   P++ CY  S +
Sbjct: 316 GDPVDIRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPN 374

Query: 376 QRLPKLPSVKLMFPQNNSFVVNNPVF 401
               + P V++ F   +  ++NNP F
Sbjct: 375 ATTIQFPLVEMTFIGGSKIILNNPFF 400


>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  187 bits (475), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 129/397 (32%), Positives = 204/397 (51%), Gaps = 38/397 (9%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
            S ++ HRFSE+VK +           P   S +YY+ L+  D  ++      Q  + F 
Sbjct: 32  LSFEIHHRFSEQVKTV-----LGGHGLPEMGSLDYYKALVHRDRGRRLTSNNNQTTISF- 85

Query: 87  SQGSKT--MSLGND-------FGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC---- 133
           +QG+ T  +SL +        F +LHY  + IGTP   FLVALD GSDL W+PC+C    
Sbjct: 86  AQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTC 145

Query: 134 VRCAPLS--ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY 191
           VR        ++ N+    LN Y+PS S++S  ++C+  LC L   C +P   CPY + Y
Sbjct: 146 VRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRIRY 205

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 251
            +  + S+G+LVED++H+ +    A      A +  GC   Q G + + VA +G++GL +
Sbjct: 206 LSPGSKSTGVLVEDVIHMSTEEGEAR----DARITFGCSETQLGLFQE-VAVNGIMGLAM 260

Query: 252 GEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV 311
            +I+VP++L KAG+  +SFSMCF  +  G I FGD+G + Q  T  L      + Y + +
Sbjct: 261 ADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQHETP-LGGTISPLFYDVSI 319

Query: 312 ETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF-----DRQVNDTITSFEGYP 366
               +G   + +T F AI DSG++ T+L    Y  +   F     DR++   + S     
Sbjct: 320 TKFKVGKVTV-ETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDS----T 374

Query: 367 WKCCY-KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 402
           ++ CY  +S+    KLPS+        ++ V +P+ V
Sbjct: 375 FEFCYIITSTSDEEKLPSISFEMKGGAAYDVFSPILV 411


>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 124/382 (32%), Positives = 195/382 (51%), Gaps = 26/382 (6%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
            S ++ HRFSE+VK +           P   S +YY+ L+  D  +Q          +  
Sbjct: 22  LSFEIHHRFSEQVKTV-----LGGHGLPEMGSLDYYKALVHRDRGRQLTSNNNNQTTISF 76

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           +QG+ T     +  +LHY  + IGTP   FLVALD GSDL W+PC+C      S      
Sbjct: 77  AQGNST----EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQG 132

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
               LN Y+PS S +S  ++C+  LC L   C +P   CPY + Y +  + S+G+LVED+
Sbjct: 133 ERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDV 192

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +H+ +    A      A +  GC   Q G + + VA +G++GL + +I+VP++L KAG+ 
Sbjct: 193 IHMSTEEGEAR----DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVA 247

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
            +SFSMCF  +  G I FGD+G + Q  T  L+     + Y + +    +G   +  T F
Sbjct: 248 SDSFSMCFGPNGKGTISFGDKGSSDQLETP-LSGTISPMFYDVSITKFKVGKVTV-DTEF 305

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEF-----DRQVNDTITSFEGYPWKCCY-KSSSQRLPK 380
            A  DSG++ T+L +  Y  +   F     DR+++ ++ S    P++ CY  +S+    K
Sbjct: 306 TATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDS----PFEFCYIITSTSDEDK 361

Query: 381 LPSVKLMFPQNNSFVVNNPVFV 402
           LPSV        ++ V +P+ V
Sbjct: 362 LPSVSFEMKGGAAYDVFSPILV 383


>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 430

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 100/224 (44%), Positives = 132/224 (58%), Gaps = 9/224 (4%)

Query: 151 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
           LN YSP+ S+TS  + C+  LC+  TS QN    CPY M Y + NTSS G LVED+LHL 
Sbjct: 3   LNHYSPNDSTTSSTVPCTSSLCNRCTSNQNV---CPYEMRYLSANTSSIGYLVEDVLHLA 59

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
           +  D++L   V+A +  GCG  Q+G +    AP+GLIGLG+ +ISVPS LA  GL  NSF
Sbjct: 60  T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 117

Query: 271 SMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 330
           SMCF  D  GRI FGD GPA Q+ T F  +  +Y +Y +      +G        F AI 
Sbjct: 118 SMCFGADGYGRIDFGDTGPADQKQTPF-NTMLEYQSYNVTFNVINVGGEP-NDVPFTAIF 175

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG--YPWKCCYK 372
           DSG+SFT+L +  Y TI  + D  +     S  G  +P++ CY+
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYE 219


>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 105/282 (37%), Positives = 159/282 (56%), Gaps = 15/282 (5%)

Query: 24  TVMFSTKLIHRFSEEVK-ALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQ 80
           T  F  ++ H FS+ VK +LG+         P + S EY++VL   D  ++ + + +   
Sbjct: 26  TGKFGFEVHHIFSDSVKQSLGL-----GDLVPEQGSLEYFKVLAHRDRLIRGRGLASNND 80

Query: 81  FQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC-VRCAPL 139
              +    G+ T+S+    G L+Y  + +GTP  SFLVALD GSDL W+PC+C   C   
Sbjct: 81  ETPITFDGGNLTVSV-KLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRD 139

Query: 140 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 199
                      LN Y+P+AS+TS  + CS + C     C +P   CPY + Y + +T + 
Sbjct: 140 LEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTK 198

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G L++D+LHL +  +N     V+A+V +GCG KQ+G +    + +G++GLG+   SVPSL
Sbjct: 199 GTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSL 256

Query: 260 LAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLA 299
           LAKA +  NSFSMCF +   + GRI FGD+G   Q+ T F++
Sbjct: 257 LAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFIS 298


>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 414

 Score =  150 bits (380), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 121/397 (30%), Positives = 182/397 (45%), Gaps = 77/397 (19%)

Query: 18  ESSGAETVMFSTKLIHRFSEEVKA-LGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQK 74
           ES+G     FS ++ H FS+ VK  LG          P K S EY+++L   D  ++ + 
Sbjct: 24  ESAGK----FSFEVHHMFSDTVKQNLGF-----GDLVPEKGSLEYFKLLAQRDRLIRGRG 74

Query: 75  MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCV 134
           + +  +       +   T  LGN             T ++ FL     GSDL W+PC+C 
Sbjct: 75  LSSNNE-------EAPVTFILGNR------------TVSIDFL-----GSDLFWLPCNC- 109

Query: 135 RCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS---CQNPKQPCPYTMDY 191
                                          +C   L D+G S   C +P   CPY + Y
Sbjct: 110 -----------------------------GTTCIRDLEDIGLSQGGCSSPASVCPYQIPY 140

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 251
               TS+ G L ED+LHL++  D  L+  V+A++ +GCG  Q+G Y   +A +GL+GLG+
Sbjct: 141 LFNTTSTRGTLFEDVLHLVT-EDEGLE-PVKANITLGCGQNQTGLYRKSLAVNGLLGLGM 198

Query: 252 GEISVPSLLAKAGLIRNSFSMCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYII 309
            + SVPS+LAK  +  NSFSMCF    D  GRI FGD+G   Q  T  +       TY +
Sbjct: 199 KDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTDQLQTPLVPIEPN-PTYAV 257

Query: 310 GVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWK 368
            V    +G   L +    A+ D+G+SFT L +  Y  +   FD  V D     +   P++
Sbjct: 258 NVTEVTVGGDIL-EIQMLALFDTGTSFTHLLEPAYGLLTKAFDDHVTDKRRPIDPEIPFE 316

Query: 369 CCYKSSSQ-RLPKLPSVKLMFPQNNSFVVNNPVFVIY 404
            CY +S   +  K P V + F   +   + +P+F ++
Sbjct: 317 FCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFTVW 353


>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
 gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
          Length = 149

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 60/121 (49%), Positives = 82/121 (67%), Gaps = 4/121 (3%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           FS++++HR S+E +   +        WP + S  YY+ LL SD+Q+QK +   + Q+L  
Sbjct: 27  FSSRMVHRLSDEAR---LEAGPRMGLWPQRGSGGYYRALLRSDLQRQKRRLAGKNQLLSL 83

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           S+G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS SY  +
Sbjct: 84  SKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLS-SYRGN 142

Query: 147 L 147
           L
Sbjct: 143 L 143


>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
          Length = 263

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 78/196 (39%), Positives = 110/196 (56%), Gaps = 4/196 (2%)

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
           +  +   V+A ++ GCG  Q+G +LD  AP+GL GLG+ ++SVPS+LA  G   NSFSMC
Sbjct: 4   EETIPKVVKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSFSMC 63

Query: 274 FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSG 333
           F  D  GRI+FGD G + Q  T F   N  + TY I +    +G+S +   S  AIVDSG
Sbjct: 64  FGSDGMGRIYFGDTGSSDQGETPFDV-NHSHPTYNISLIGMEVGNSSIDVNS-SAIVDSG 121

Query: 334 SSFTFLPKEVYETIAAEFDRQVNDTI-TSFEGYPWKCCYK-SSSQRLPKLPSVKLMFPQN 391
           +SFT L   +Y  ++  F  QV +    S  G P++ CY  S +Q    LP + L     
Sbjct: 122 TSFTCLADPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKGG 181

Query: 392 NSFVVNNPVFVIYGTQ 407
           + F +N+P+ VI   Q
Sbjct: 182 SQFPINDPIIVISSEQ 197


>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
           vinifera]
          Length = 294

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/177 (40%), Positives = 107/177 (60%), Gaps = 4/177 (2%)

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 288
           CG  Q+G +L+G AP+GL GLG+G ISVPS+LAK GL+ +SFSMCF  D +GRI FGD+G
Sbjct: 1   CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60

Query: 289 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 348
            + Q+ T F  S  + + Y I +    +G +     +F AI DSG+SFT+L    Y +I+
Sbjct: 61  SSGQEETPFNPSKSQLL-YNISITQISVGGTS-ADLNFDAIFDSGTSFTYLNDPAYTSIS 118

Query: 349 AEFDRQVNDTITSFEG-YPWKCCYKSSSQRLP-KLPSVKLMFPQNNSFVVNNPVFVI 403
             F+ +  D  +S +   P++ CY  S Q+   + P V L     ++F V +P+ ++
Sbjct: 119 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIV 175


>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
          Length = 306

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/177 (40%), Positives = 107/177 (60%), Gaps = 4/177 (2%)

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQG 288
           CG  Q+G +L+G AP+GL GLG+G ISVPS+LAK GL+ +SFSMCF  D +GRI FGD+G
Sbjct: 13  CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72

Query: 289 PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 348
            + Q+ T F  S  + + Y I +    +G +     +F AI DSG+SFT+L    Y +I+
Sbjct: 73  SSGQEETPFNPSKSQLL-YNISITQISVGGTS-ADLNFDAIFDSGTSFTYLNDPAYTSIS 130

Query: 349 AEFDRQVNDTITSFEG-YPWKCCYKSSSQRLP-KLPSVKLMFPQNNSFVVNNPVFVI 403
             F+ +  D  +S +   P++ CY  S Q+   + P V L     ++F V +P+ ++
Sbjct: 131 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIV 187


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 156/317 (49%), Gaps = 43/317 (13%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP---S 157
           L++  I +GTP+  F V +D GSD+LW+ C  C+RC   S         DL E +P    
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKS---------DLVELTPYDVD 134

Query: 158 ASSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
           ASST+K +SCS   C   +  + C +    C Y +  Y + +S++G LV+D++HL     
Sbjct: 135 ASSTAKSVSCSDNFCSYVNQRSECHS-GSTCQYVI-MYGDGSSTNGYLVKDVVHLDLVTG 192

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
           N    S   ++I GCG KQSG   +   A DG++G G    S  S LA  G ++ SF+ C
Sbjct: 193 NRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHC 252

Query: 274 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK----- 327
            D ++ G IF  G+      ++T  L+ +  Y   +  +E   +G+S L+ +S       
Sbjct: 253 LDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIE---VGNSVLELSSNAFDSGD 309

Query: 328 ---AIVDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
               I+DSG++  +LP  VY     E +A+  +  ++    SF  + +       + +L 
Sbjct: 310 DKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHY-------TDKLD 362

Query: 380 KLPSVKLMFPQNNSFVV 396
           + P+V   F ++ S  V
Sbjct: 363 RFPTVTFQFDKSVSLAV 379


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 94/314 (29%), Positives = 153/314 (48%), Gaps = 37/314 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I +GTP+  F V +D GSD+LW+ C  C+RC P  +        +L  Y   ASS
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-PRKSDLV-----ELTPYDADASS 137

Query: 161 TSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           T+K +SCS   C   +  + C +    C Y +  Y + +S++G LV D++HL     N  
Sbjct: 138 TAKSVSCSDNFCSYVNQRSECHSGS-TCQYVI-LYGDGSSTNGYLVRDVVHLDLVTGNRQ 195

Query: 218 KNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
             S   ++I GCG KQSG   +   A DG++G G    S  S LA  G ++ SF+ C D 
Sbjct: 196 TGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDN 255

Query: 277 DDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-------- 327
           ++ G IF  G+      ++T  L+ +  Y   +  +E   +G+S L+ +S          
Sbjct: 256 NNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIE---VGNSVLQLSSDAFDSGDDKG 312

Query: 328 AIVDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            I+DSG++  +LP  VY     + +A+  +  ++    SF  + +         RL + P
Sbjct: 313 VIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFTCFHYI-------DRLDRFP 365

Query: 383 SVKLMFPQNNSFVV 396
           +V   F ++ S  V
Sbjct: 366 TVTFQFDKSVSLAV 379


>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 146

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 55/114 (48%), Positives = 76/114 (66%), Gaps = 7/114 (6%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           S++++HR S+E +   +        WP + S EYY+ L+ SD+Q+QK +      +L  S
Sbjct: 28  SSRMVHRLSDEAR---LEVGPRVGWWPQRGSGEYYRALVRSDIQRQKRR----LAVLSLS 80

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
           +G  T S GND GWL+Y W+D+GTP  SFLVALD GSDL W+PCDC++CAPLS 
Sbjct: 81  KGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSG 134


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 150/308 (48%), Gaps = 32/308 (10%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +YT I+IGTP   F V +D GSD+LW+ C  C +C   S      L  DL  Y P  SS+
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSG-----LGIDLALYDPKGSSS 141

Query: 162 SKHLSCSHRLC--DLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
              +SC ++ C    G+  + P     +PC Y  + Y + +S++G  V D L       N
Sbjct: 142 GSAVSCDNKFCAATYGSGEKLPGCTAGKPCEYRAE-YGDGSSTAGSFVSDSLQYNQLSGN 200

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
           A     +A+VI GCG +Q GG L+    A DG+IG G    S  S LA AG ++  FS C
Sbjct: 201 AQTRHAKANVIFGCGAQQ-GGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHC 259

Query: 274 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSF 326
            D    G IF  G+      +ST  L +      Y + +++  +  + L+      +TS 
Sbjct: 260 LDTIKGGGIFAIGEVVQPKVKSTPLLPNMSH---YNVNLQSIDVAGNALQLPPHIFETSE 316

Query: 327 K--AIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
           K   I+DSG++ T+LP+ VY+ I AA F +  + T  + +G+    C++ S       P 
Sbjct: 317 KRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGF---LCFEYSESVDDGFPK 373

Query: 384 VKLMFPQN 391
           +   F  +
Sbjct: 374 ITFHFEDD 381


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 98/306 (32%), Positives = 144/306 (47%), Gaps = 28/306 (9%)

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDL 151
           + L  D G L+YT I+IGTP   + V +D GSD+LW+ C  C +C   S      L  DL
Sbjct: 74  LGLPTDTG-LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKS-----DLGIDL 127

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDIL 207
             Y P  SS+   +SC  + C      + P      PC Y++  Y + +S++G  V D L
Sbjct: 128 RLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSV-MYGDGSSTTGYFVSDSL 186

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGL 265
                  +       ASVI GCG +Q GG L     A DG+IG G    S+ S LA AG 
Sbjct: 187 QYNQVSGDGQTRHANASVIFGCGAQQ-GGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGE 245

Query: 266 IRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 324
           ++  FS C D    G IF  GD      +ST  +        Y + +E+  +G + L+  
Sbjct: 246 VKKIFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPD---MPHYNVNLESINVGGTTLQLP 302

Query: 325 SF--------KAIVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
           S           I+DSG++ T+LP+ VY + +AA F +  + T  S + +     ++S  
Sbjct: 303 SHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQDFLCIQYFQSVD 362

Query: 376 QRLPKL 381
              PK+
Sbjct: 363 DGFPKI 368


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 169/372 (45%), Gaps = 25/372 (6%)

Query: 52  SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS-KTMSLGNDFGWLHYTWIDIG 110
           ++P     E  Q+    +++ ++M       + F  QG+     +G     L+YT + +G
Sbjct: 31  AFPTNHGVELSQLRARDELRHRRMLQSSSGVVDFSVQGTFDPFQVG-----LYYTKVQLG 85

Query: 111 TPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 170
           TP V F V +D GSD+LW+ C+     P ++     L   LN + P +SSTS  ++CS +
Sbjct: 86  TPPVEFNVQIDTGSDVLWVSCNSCNGCPQTS----GLQIQLNFFDPGSSSTSSMIACSDQ 141

Query: 171 LCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C+ G      +C +    C YT   Y + + +SG  V D++HL +  + ++  +  A V
Sbjct: 142 RCNNGKQSSDATCSSQNNQCSYTFQ-YGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPV 200

Query: 226 IIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--I 282
           + GC  +Q+G       A DG+ G G  E+SV S L+  G+    FS C   D SG   +
Sbjct: 201 VFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGIL 260

Query: 283 FFGDQGPATQQSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFT 337
             G+        TS + +   Y   +    +  +T  I SS    ++ +  IVDSG++  
Sbjct: 261 VLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLA 320

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
           +L +E Y+   +     +  ++ +      + CY  +S      P V L F    S ++ 
Sbjct: 321 YLAEEAYDPFVSAITAAIPQSVRTVVSRGNQ-CYLITSSVTDVFPQVSLNFAGGASMILR 379

Query: 398 NPVFVIYGTQVG 409
              ++I    +G
Sbjct: 380 PQDYLIQQNSIG 391


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 170/377 (45%), Gaps = 35/377 (9%)

Query: 52  SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS-KTMSLGNDFGWLHYTWIDIG 110
           ++P   + E  Q+     ++ ++M       + F  QG+     +G     L+YT + +G
Sbjct: 28  AFPTNHTVELSQLRARDALRHRRMLQSSNGVVDFSVQGTFDPFQVG-----LYYTKVQLG 82

Query: 111 TPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           TP V F V +D GSD+LW+ C+ C  C   S      L   LN + P +SSTS  ++CS 
Sbjct: 83  TPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGSSSTSSMIACSD 137

Query: 170 RLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           + C+ G      +C +    C YT   Y + + +SG  V D++HL +  + ++  +  A 
Sbjct: 138 QRCNNGIQSSDATCSSQNNQCSYTFQ-YGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAP 196

Query: 225 VIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR-- 281
           V+ GC  +Q+G       A DG+ G G  E+SV S L+  G+    FS C   D SG   
Sbjct: 197 VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGI 256

Query: 282 IFFGDQGPATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSF 336
           +  G+        TS + +   Y     +  +  +T  I SS    ++ +  IVDSG++ 
Sbjct: 257 LVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTL 316

Query: 337 TFLPKEVYE----TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 392
            +L +E Y+     I A   + V+  ++         CY  +S      P V L F    
Sbjct: 317 AYLAEEAYDPFVSAITASIPQSVHTVVSR-----GNQCYLITSSVTEVFPQVSLNFAGGA 371

Query: 393 SFVVNNPVFVIYGTQVG 409
           S ++    ++I    +G
Sbjct: 372 SMILRPQDYLIQQNSIG 388


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 151/326 (46%), Gaps = 29/326 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT + +GTP V F V +D GSD+LW+ C+ C  C   S      L   LN + P +SS
Sbjct: 24  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSG-----LQIQLNFFDPGSSS 78

Query: 161 TSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           TS  ++CS + C+ G      +C +    C YT   Y + + +SG  V D++HL +  + 
Sbjct: 79  TSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQ-YGDGSGTSGYYVSDMMHLNTIFEG 137

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
           ++  +  A V+ GC  +Q+G       A DG+ G G  E+SV S L+  G+    FS C 
Sbjct: 138 SVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL 197

Query: 275 DKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA 328
             D SG   +  G+        TS + +   Y     +  +  +T  I SS    ++ + 
Sbjct: 198 KGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRG 257

Query: 329 -IVDSGSSFTFLPKEVYE----TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
            IVDSG++  +L +E Y+     I A   + V+  ++         CY  +S      P 
Sbjct: 258 TIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSR-----GNQCYLITSSVTEVFPQ 312

Query: 384 VKLMFPQNNSFVVNNPVFVIYGTQVG 409
           V L F    S ++    ++I    +G
Sbjct: 313 VSLNFAGGASMILRPQDYLIQQNSIG 338


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 146/312 (46%), Gaps = 28/312 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +G+P+  + V +D GSD+LW+ C +C RC   S      +   L  Y P  S 
Sbjct: 68  LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKS-----DIGIGLTLYDPKRSK 122

Query: 161 TSKHLSCSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           TS+ +SC H  C        LG   +N   PCPY++  Y + ++++G  V+D L      
Sbjct: 123 TSEFVSCEHNFCSSTYEGRILGCKAEN---PCPYSIS-YGDGSATTGYYVQDYLTFNRVN 178

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
            N    +  +S+I GCG  QSG +      A DG+IG G    SV S LA +G ++  FS
Sbjct: 179 GNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFS 238

Query: 272 MCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSF 326
            C D +  G IF  G+      ++T  + +   Y   +  +E       + S      + 
Sbjct: 239 HCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENG 298

Query: 327 KA-IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           K  ++DSG++  +LP+ VY+ + ++   +Q    +   E      C++ +       P V
Sbjct: 299 KGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVE--EQYSCFQYTGNVDSGFPIV 356

Query: 385 KLMFPQNNSFVV 396
           KL F  + S  V
Sbjct: 357 KLHFEDSLSLTV 368


>gi|388505672|gb|AFK40902.1| unknown [Lotus japonicus]
          Length = 207

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 55/84 (65%), Positives = 64/84 (76%)

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
           TSFKA VDSG+SFTFLP   Y  I  EFD+QVN + +SFEG PW+ CY SSS++LPK+PS
Sbjct: 2   TSFKAQVDSGTSFTFLPGHAYGAITEEFDKQVNASRSSFEGSPWEYCYPSSSEQLPKVPS 61

Query: 384 VKLMFPQNNSFVVNNPVFVIYGTQ 407
           + LMF QNNSFVV NPVF  Y  Q
Sbjct: 62  LTLMFQQNNSFVVYNPVFTFYDNQ 85


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 92/311 (29%), Positives = 144/311 (46%), Gaps = 26/311 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T I IGTP+  + V +D GSD+LW+  +C+ C   S    + L  DL  Y P+AS++
Sbjct: 88  LYFTQIGIGTPSKGYYVQVDTGSDILWV--NCISCD--SCPRKSGLGIDLTLYDPTASAS 143

Query: 162 SKHLSCSHRLCDLGTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           SK ++C    C   T+   P       PC Y++  Y + +S++G  V D L       + 
Sbjct: 144 SKTVTCGQEFCATATNGGVPPSCAANSPCQYSIT-YGDGSSTTGFFVADFLQYDQVSGDG 202

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             N   ASV  GCG K  G      VA DG++G G    S+ S L  AG +   FS C D
Sbjct: 203 QTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLD 262

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK---------QTSF 326
             + G IF        +  T+ L     +  Y + ++T  +G S L+           S 
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPGMPH--YNVVLKTIDVGGSTLQLPTNIFDIGGGSR 320

Query: 327 KAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
             I+DSG++  +LP+ VY+ + +A F    + T+ + + +    C++ S       P V 
Sbjct: 321 GTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDF---LCFQYSGSVDNGFPEVT 377

Query: 386 LMFPQNNSFVV 396
             F  +   VV
Sbjct: 378 FHFDGDLPLVV 388


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 152/344 (44%), Gaps = 39/344 (11%)

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDL 151
           + L  D G L+YT I +GTP   + V +D GSD+LW+ C  C +C      + + L  DL
Sbjct: 77  LGLPTDTG-LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCP-----HKSGLGLDL 130

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDIL 207
             Y P ASST   + C    C      + PK     PC Y++  Y + +S+ G  V D L
Sbjct: 131 TLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVT-YGDGSSTIGSFVTDAL 189

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
                  +       ASVI GCG +Q G       A DG++G G    S+ S L  AG +
Sbjct: 190 QFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKV 249

Query: 267 RNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
           +  F+ C D    G IF  GD      ++T  +A       Y + ++T  +G + L+  +
Sbjct: 250 KKIFAHCLDTIKGGGIFSIGDVVQPKVKTTPLVADKPH---YNVNLKTIDVGGTTLQLPA 306

Query: 326 F--------KAIVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
                      I+DSG++ T+LP+ V+ E + A F++  + T    +G+    C++    
Sbjct: 307 HIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDVQGF---LCFQYPGS 363

Query: 377 RLPKLPSVKLMF---------PQNNSFVVNNPVFVIYGTQVGVS 411
                P++   F         P    F   N V+ + G Q G S
Sbjct: 364 VDDGFPTITFHFEDDLALHVYPHEYFFANGNDVYCV-GFQNGAS 406


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 101/332 (30%), Positives = 154/332 (46%), Gaps = 29/332 (8%)

Query: 85  FPSQGS-KTMSLGNDFG---WLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           FP QG+     +G  FG    L+YT + +G+P   F V +D GSD+LW+ C      P+S
Sbjct: 68  FPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVS 127

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTEN 195
           +     L   LN + P +S T+  +SCS + C LG     + C      C YT   Y + 
Sbjct: 128 S----GLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQ-YGDG 182

Query: 196 TSSSGLLVEDILHL--ISGGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLG 252
           + +SG  V D+LH   I GG + +KNS  A ++ GC   Q+G       A DG+ G G  
Sbjct: 183 SGTSGYYVSDLLHFDTILGG-SVMKNS-SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQ 240

Query: 253 EISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY-----I 305
           ++SV S LA  G+    FS C   DDSG   +  G+        T  + S   Y      
Sbjct: 241 DMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEPNIVYTPLVPSQPHYNLNLQS 300

Query: 306 TYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
            Y+ G +T  I  S    +S +  I+DSG++  +L +  Y+   +     V+ +++ +  
Sbjct: 301 IYVNG-QTLAIDPSVFATSSNQGTIIDSGTTLAYLTEAAYDPFISAITSTVSPSVSPYLS 359

Query: 365 YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
              + CY +SS      P V L F    S ++
Sbjct: 360 KGNQ-CYLTSSSINDVFPQVSLNFAGGTSMIL 390


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 146/315 (46%), Gaps = 29/315 (9%)

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDL 151
           + L  D G L+YT + +GTP   F V +D GSD+LW+ C  C +C      + + L  DL
Sbjct: 79  LGLPTDTG-LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCP-----HKSGLGLDL 132

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDIL 207
             Y P ASST   + C    C      + PK     PC Y++  Y + +S+ G  V D L
Sbjct: 133 TLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSANVPCEYSV-TYGDGSSTVGSFVNDAL 191

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
                  +       ASVI GCG +Q G       A DG++G G    S+ S LA AG +
Sbjct: 192 QFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKV 251

Query: 267 RNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
           +  F+ C D    G IF  GD      ++T  +A       Y + ++T  +G + L+  +
Sbjct: 252 KKIFAHCLDTIKGGGIFAIGDVVQPKVKTTPLVADKPH---YNVNLKTIDVGGTTLELPA 308

Query: 326 --FK------AIVDSGSSFTFLPKEVYETIA-AEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
             FK       I+DSG++ T+LP+ V++ +  A F++  + T    + +    C++ S  
Sbjct: 309 DIFKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDF---LCFEYSGS 365

Query: 377 RLPKLPSVKLMFPQN 391
                P++   F  +
Sbjct: 366 VDDGFPTLTFHFEDD 380


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 139/294 (47%), Gaps = 25/294 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP V + V LD GS   W+    C +C      + + + R L  Y P +S 
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SK + C   +C     C N    CPY   Y  +   + G+L  D+LH      N     
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQP 194

Query: 221 VQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
              SV  GCG++QSG   +  VA DG+IG G    +  S LA AG  +  FS C D  + 
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254

Query: 280 GRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IV 330
           G IF  G+      ++T  + +N  Y  +++ +++  +  + L+        T  K   +
Sbjct: 255 GGIFAIGEVVEPKVKTTPIVKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312

Query: 331 DSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCY--KSSSQRLPKL 381
           DSGS+  +LP+ +Y E I A F +  + T+ +   Y ++C +   S   + PK+
Sbjct: 313 DSGSTLVYLPEIIYSELILAVFAKHPDITMGAM--YNFQCFHFLGSVDDKFPKI 364


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 139/294 (47%), Gaps = 25/294 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP V + V LD GS   W+    C +C      + + + R L  Y P +S 
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SK + C   +C     C N    CPY   Y  +   + G+L  D+LH      N     
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQP 194

Query: 221 VQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
              SV  GCG++QSG   +  VA DG+IG G    +  S LA AG  +  FS C D  + 
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254

Query: 280 GRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IV 330
           G IF  G+      ++T  + +N  Y  +++ +++  +  + L+        T  K   +
Sbjct: 255 GGIFAIGEVVEPKVKTTPIVKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312

Query: 331 DSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCY--KSSSQRLPKL 381
           DSGS+  +LP+ +Y E I A F +  + T+ +   Y ++C +   S   + PK+
Sbjct: 313 DSGSTLVYLPEIIYSELILAVFAKHPDITMGAM--YNFQCFHFLGSVDDKFPKI 364


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 90/294 (30%), Positives = 138/294 (46%), Gaps = 25/294 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP V + V LD GS   W+    C +C   S      + R L  Y P +S 
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHES-----DILRKLTFYDPRSSV 112

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SK + C   +C     C N    CPY   Y  +   + G+L  D+LH      N     
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQP 170

Query: 221 VQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
              SV  GCG++QSG   +  VA DG+IG G    +  S LA AG  +  FS C D  + 
Sbjct: 171 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 230

Query: 280 GRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IV 330
           G IF  G+      ++T  + +N  Y  +++ +++  +  + L+        T  K   +
Sbjct: 231 GGIFAIGEVVEPKVKTTPIVKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 288

Query: 331 DSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCY--KSSSQRLPKL 381
           DSGS+  +LP+ +Y E I A F +  + T+ +   Y ++C +   S   + PK+
Sbjct: 289 DSGSTLVYLPEIIYSELILAVFAKHPDITMGAM--YNFQCFHFLGSVDDKFPKI 340


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 139/294 (47%), Gaps = 25/294 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP V + V LD GS   W+    C +C      + + + R L  Y P +S 
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 136

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SK + C   +C     C N    CPY   Y  +   + G+L  D+LH      N     
Sbjct: 137 SSKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQP 194

Query: 221 VQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
              SV  GCG++QSG   +  VA DG+IG G    +  S LA AG  +  FS C D  + 
Sbjct: 195 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 254

Query: 280 GRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IV 330
           G IF  G+      ++T  + +N  Y  +++ +++  +  + L+        T  K   +
Sbjct: 255 GGIFAIGEVVEPKVKTTPIVKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 312

Query: 331 DSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCY--KSSSQRLPKL 381
           DSGS+  +LP+ +Y E I A F +  + T+ +   Y ++C +   S   + PK+
Sbjct: 313 DSGSTLVYLPEIIYSELILAVFAKHPDITMGAM--YNFQCFHFLGSVDDKFPKI 364


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 139/294 (47%), Gaps = 25/294 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWI-PCDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP V + V LD GS   W+    C +C      + + + R L  Y P +S 
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SK + C   +C     C N    CPY   Y  +   + G+L  D+LH      N     
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQP 170

Query: 221 VQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
              SV  GCG++QSG   +  VA DG+IG G    +  S LA AG  +  FS C D  + 
Sbjct: 171 TSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNG 230

Query: 280 GRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-------QTSFKA-IV 330
           G IF  G+      ++T  + +N  Y  +++ +++  +  + L+        T  K   +
Sbjct: 231 GGIFAIGEVVEPKVKTTPIVKNNEVY--HLVNLKSINVAGTTLQLPANIFGTTKTKGTFI 288

Query: 331 DSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCY--KSSSQRLPKL 381
           DSGS+  +LP+ +Y E I A F +  + T+ +   Y ++C +   S   + PK+
Sbjct: 289 DSGSTLVYLPEIIYSELILAVFAKHPDITMGAM--YNFQCFHFLGSVDDKFPKI 340


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 137/305 (44%), Gaps = 26/305 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP   + V +D GSD+LW+ C  C RC   S      L  +L  Y P  SS
Sbjct: 3   LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 57

Query: 161 TSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           T   +SC    C        P      PC Y++  Y + +S++G  V D+L       + 
Sbjct: 58  TGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVT-YGDGSSTTGYFVSDLLQFDQVSGDG 116

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                 ++V  GCG +Q G       A DG+IG G    S+ S L+ AG ++  F+ C D
Sbjct: 117 QTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLD 176

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-------- 327
             + G IF        +  T+ L  N  +  Y + +++  +G + LK  S          
Sbjct: 177 TINGGGIFAIGNVVQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFDTGEKKG 234

Query: 328 AIVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
            I+DSG++ T+LP+ VY E + A F +  + T  + + +    C++   +     P +  
Sbjct: 235 TIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF---LCFQYVGRVDDDFPKITF 291

Query: 387 MFPQN 391
            F  +
Sbjct: 292 HFEND 296


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/271 (31%), Positives = 131/271 (48%), Gaps = 34/271 (12%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR--CAPLSASYYNSLDRDLNEY 154
            D+G+  Y  + +GTP   F V +D GS + ++PC      C P      N  D     +
Sbjct: 73  KDYGYF-YATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGP------NHQD---AAF 122

Query: 155 SPSASSTSKHLSCSHRLCDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
            P ASST+  +SC+   C  G+  C    Q C YT  Y  E +SSSG+L+ED+L L  G 
Sbjct: 123 DPEASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSY-AEQSSSSGILLEDVLALHDGL 181

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
             A        +I GC  +++G      A DGL GLG  + SV + L KAG+I + FS+C
Sbjct: 182 PGA-------PIIFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLC 233

Query: 274 FDK-DDSGRIFFGDQ---GPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLK 322
           F   +  G +  GD    G  + Q T  L S       N K ++  +  +   +  S   
Sbjct: 234 FGMVEGDGALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFD 293

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 353
           Q  +  ++DSG++FT++P  V++  A   ++
Sbjct: 294 Q-GYGTVLDSGTTFTYMPSPVFKAFAGAVEK 323


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/314 (28%), Positives = 141/314 (44%), Gaps = 27/314 (8%)

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDL 151
           + L  D G L+YT I IGTP   + V +D GSD+LW+ C  C RC   S      L  +L
Sbjct: 80  LGLPTDTG-LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLEL 133

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDIL 207
             Y P  SST   +SC    C        P      PC Y++  Y + +S++G  V D+L
Sbjct: 134 TLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVT-YGDGSSTTGYFVSDLL 192

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
                  +       ++V  GCG +Q G       A DG+IG G    S+ S L+ AG +
Sbjct: 193 QFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKV 252

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
           +  F+ C D  + G IF        +  T+ L  N  +  Y + +++  +G + LK  S 
Sbjct: 253 KKIFAHCLDTINGGGIFAIGNVVQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSH 310

Query: 327 K--------AIVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR 377
                     I+DSG++ T+LP+ VY E + A F +  + T  + + +    C++   + 
Sbjct: 311 MFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF---LCFQYVGRV 367

Query: 378 LPKLPSVKLMFPQN 391
               P +   F  +
Sbjct: 368 DDDFPKITFHFEND 381


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 163/367 (44%), Gaps = 29/367 (7%)

Query: 62  YQVLLSSDVQKQKMKTGPQFQ------MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVS 115
           Y++ LS   ++ +++ G   Q      + FP QG+    L      L+YT + +GTP   
Sbjct: 9   YKLKLSKLKERDRVRHGRMLQSSGVGVVDFPVQGTFDPFLVG----LYYTRLQLGTPPRD 64

Query: 116 FLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 175
           F V +D GSD+LW+ C      P+++     L   LN + P +S T+  +SCS + C LG
Sbjct: 65  FYVQIDTGSDVLWVSCGSCNGCPVNS----GLHIPLNFFDPGSSPTASLISCSDQRCSLG 120

Query: 176 -----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCG 230
                + C      C Y    Y + + +SG  V D+LH  +    ++ N+  A ++ GC 
Sbjct: 121 LQSSDSVCSAQNNLCGYNFQ-YGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCS 179

Query: 231 MKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQ 287
             Q+G       A DG+ G G  ++SV S LA  G+   +FS C   DDSG   +  G+ 
Sbjct: 180 ALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEI 239

Query: 288 GPATQQSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKE 342
                  T  + S   Y   +    +  +T  I  S    +S +  I+DSG++  +L + 
Sbjct: 240 VEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEA 299

Query: 343 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 402
            Y+   +     V+ ++  +       CY  SS      P V L F    S ++    ++
Sbjct: 300 AYDPFISAITSIVSPSVRPYLS-KGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYL 358

Query: 403 IYGTQVG 409
           I  + +G
Sbjct: 359 IQQSSIG 365


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 89/295 (30%), Positives = 137/295 (46%), Gaps = 39/295 (13%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IG+P   F V +D GSD+LW+ C  C  C   S      +  DL  Y+P +SS
Sbjct: 72  LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSS 126

Query: 161 TSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  ++C    C    D       P   C Y +  Y + ++++G  V D + L     N 
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKPDLLCQYKV-IYGDGSATAGYFVNDYIQLQRAVGNH 185

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             +    S++ GCG KQSG       A DG++G G    S+ S LA  G ++  F+ C D
Sbjct: 186 KTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLD 245

Query: 276 KDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFK- 327
               G IF  G+      ++T  + +   Y   + GV+   +G + L       +TS+K 
Sbjct: 246 SISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYKR 302

Query: 328 -AIVDSGSSFTFLPKEVY-----ETIAAEFD---RQVNDTITSF-------EGYP 366
            AI+DSG++  +LP  +Y     + + A+ D   R V+D  T F       +G+P
Sbjct: 303 GAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQFTCFVFDKNVDDGFP 357


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 133/284 (46%), Gaps = 32/284 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IG+P   F V +D GSD+LW+ C  C  C   S      +  DL  Y+P +SS
Sbjct: 72  LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKS-----DIGVDLQLYNPKSSS 126

Query: 161 TSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  ++C    C    D       P   C Y +  Y + ++++G  V D + L     N 
Sbjct: 127 TSTLITCDQPFCSATYDAPIPGCKPDLLCQYKV-IYGDGSATAGYFVNDYIQLQRAVGNH 185

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             +    S++ GCG KQSG       A DG++G G    S+ S LA  G ++  F+ C D
Sbjct: 186 KTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLD 245

Query: 276 KDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFK- 327
               G IF  G+       +T  + +   Y   + GV+   +G + L       +TS+K 
Sbjct: 246 SISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVK---VGDTALDLPLGLFETSYKR 302

Query: 328 -AIVDSGSSFTFLPKEVY-----ETIAAEFD---RQVNDTITSF 362
            AI+DSG++  +LP+ +Y     + + A+ D   R V+D  T F
Sbjct: 303 GAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQFTCF 346


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 86/304 (28%), Positives = 146/304 (48%), Gaps = 30/304 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT I+IG+P+  + V +D GSD+LW+  +C+RC     +  + L  +L +Y P+ S T
Sbjct: 84  LYYTQIEIGSPSKGYYVQVDTGSDILWV--NCIRCDGCPTT--SGLGIELTQYDPAGSGT 139

Query: 162 SKHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           +  + C    C       L  +C +   PC + +  Y + +S++G  V D +       N
Sbjct: 140 T--VGCDQEFCVANSPNGLPPACPSTSSPCQFRI-AYGDGSSTTGFYVSDSVQYNQVSGN 196

Query: 216 ALKNSVQASVIIGCGMKQSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                  AS+  GCG  Q GG L     A DG++G G  + S+ S LA A  +R  F+ C
Sbjct: 197 GQTTPSNASITFGCG-AQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHC 255

Query: 274 FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--- 328
            D    G IF        +  T+ L  N  +  Y + ++   +G + L+   ++F +   
Sbjct: 256 LDTVHGGGIFAIGNVVQPKVKTTPLVQNVTH--YNVNLQGISVGGATLQLPSSTFDSGDS 313

Query: 329 ---IVDSGSSFTFLPKEVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  +LP+EVY T + A FD+  +  + +++ +    C++ S       P V
Sbjct: 314 KGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQDF---VCFQFSGSIDDGFPVV 370

Query: 385 KLMF 388
              F
Sbjct: 371 TFSF 374


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 81/260 (31%), Positives = 121/260 (46%), Gaps = 22/260 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP   + V +D GSD+LW+ C  C RC   S      L  +L  Y P  SS
Sbjct: 32  LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG-----LGLELTLYDPKDSS 86

Query: 161 TSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           T   +SC    C        P      PC Y++  Y + +S++G  V D+L       + 
Sbjct: 87  TGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVT-YGDGSSTTGYFVSDLLQFDQVSGDG 145

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                 ++V  GCG +Q G       A DG+IG G    S+ S L+ AG ++  F+ C D
Sbjct: 146 QTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLD 205

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF--------K 327
             + G IF        +  T+ L  N  +  Y + +++  +G + LK  S          
Sbjct: 206 TINGGGIFAIGNVVQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFDTGEKKG 263

Query: 328 AIVDSGSSFTFLPKEVYETI 347
            I+DSG++ T+LP+ VY+ I
Sbjct: 264 TIIDSGTTLTYLPEIVYKEI 283


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 91/318 (28%), Positives = 150/318 (47%), Gaps = 38/318 (11%)

Query: 58  SFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSK-TMSLGNDFGWLHYTWIDIGTPNVSF 116
           S EYY+ L   D Q++  +  P+  + FP  G   T + G     L+YT I +GTP   F
Sbjct: 9   SSEYYRTLREHD-QRRLRRILPEV-VAFPISGDDDTFTTG-----LYYTRIYLGTPPQQF 61

Query: 117 LVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG 175
            V +D GSD+ W+ C  C  C   S     ++   ++ + P  S++   +SC+   C L 
Sbjct: 62  YVHVDTGSDVAWVNCVPCTNCKRAS-----NVALPISIFDPEKSTSKTSISCTDEECYLA 116

Query: 176 TS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNALKNSVQASVIIGCGM 231
           ++  C      CPY+   Y + +S++G L+ D+L    +  G N+   S  A +  GCG 
Sbjct: 117 SNSKCSFNSMSCPYST-LYGDGSSTAGYLINDVLSFNQVPSG-NSTATSGTARLTFGCGS 174

Query: 232 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFGDQGP 289
            Q+G +L     DGL+G G  E+S+PS L+K  +  N F+ C   D+  SG +  G    
Sbjct: 175 NQTGTWLT----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIRE 230

Query: 290 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGSSFTFLPKEV 343
                T  +     Y   ++ +     G++    T+F        I+DSG++ T+L +  
Sbjct: 231 PGLVYTPIVPKQSHYNVELLNIGVS--GTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPA 288

Query: 344 YETIAAEFDRQVNDTITS 361
           Y+    +F  +V D + S
Sbjct: 289 YD----QFQAKVRDCMRS 302


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 86/313 (27%), Positives = 145/313 (46%), Gaps = 30/313 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I IGTP   + V +D GSD+LW+ C  C  C   S     +L  +L  Y P  S 
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKS-----NLGIELTMYDPRGSQ 143

Query: 161 TSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           + + ++C  + C      +  SC +   PC Y++  Y + +S++G  V D L       +
Sbjct: 144 SGELVTCDQQFCVANYGGVLPSCTS-TSPCEYSIS-YGDGSSTAGFFVTDFLQYNQVSGD 201

Query: 216 ALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
                  ASV  GCG K  G      +A DG++G G    S+ S LA AG +R  F+ C 
Sbjct: 202 GQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL 261

Query: 275 DKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTS 325
           D  + G IF  G+      ++T  ++    Y   + G++   +G + L           S
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGID---VGGTALGLPTNIFDSGNS 318

Query: 326 FKAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  ++P+ VY+ + A  FD+  + ++ + + +    C++ S       P V
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---CFQYSGSVDDGFPEV 375

Query: 385 KLMFPQNNSFVVN 397
              F  + S +V+
Sbjct: 376 TFHFEGDVSLIVS 388


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 82/273 (30%), Positives = 128/273 (46%), Gaps = 19/273 (6%)

Query: 98  DFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 156
           D   L+Y  I IGTP   + V +D GSD++W+ C  C  C   S     SL  DL  Y+ 
Sbjct: 73  DILGLYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTS-----SLGIDLTLYNI 127

Query: 157 SASSTSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           + S T K + C    C      Q P       CPY ++ Y + +S++G  V+D++     
Sbjct: 128 NESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPY-LEIYGDGSSTAGYFVKDVVQYARV 186

Query: 213 GDNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
             +    +   SVI GCG +QSG  G  +  A DG++G G    S+ S LA  G ++  F
Sbjct: 187 SGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIF 246

Query: 271 SMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKY---ITYI-IGVETCCIGSSCLKQTS 325
           + C D  + G IF  G         T  + +   Y   +T + +G E   + +   +   
Sbjct: 247 AHCLDGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGD 306

Query: 326 FK-AIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
            K AI+DSG++  +LP+ VY+ + ++   Q  D
Sbjct: 307 RKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPD 339


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 84/304 (27%), Positives = 146/304 (48%), Gaps = 30/304 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT I+IG+P   + V +D GSD+LW+  +C+RC        + L  +L +Y P+ S T
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWV--NCIRCDGCPTR--SGLGIELTQYDPAGSGT 138

Query: 162 SKHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           +  + C    C       +  +C +   PC + +  Y + ++++G  V D +       N
Sbjct: 139 T--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRIT-YGDGSTTTGFYVTDFVQYNQVSGN 195

Query: 216 ALKNSVQASVIIGCGMKQSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
               +  AS+  GCG  Q GG L     A DG++G G  + S+ S LA A  +R  F+ C
Sbjct: 196 GQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254

Query: 274 FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--- 328
            D    G IF        +  T+ L  N  +  Y + ++   +G + L+   ++F +   
Sbjct: 255 LDTVRGGGIFAIGNVVQPKVKTTPLVPNVTH--YNVNLQGISVGGATLQLPTSTFDSGDS 312

Query: 329 ---IVDSGSSFTFLPKEVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  +LP+EVY T +AA FD+  +  + +++ +    C++ S       P +
Sbjct: 313 KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDF---VCFQFSGSIDDGFPVI 369

Query: 385 KLMF 388
              F
Sbjct: 370 TFSF 373


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 84/304 (27%), Positives = 146/304 (48%), Gaps = 30/304 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT I+IG+P   + V +D GSD+LW+  +C+RC        + L  +L +Y P+ S T
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWV--NCIRCDGCPTR--SGLGIELTQYDPAGSGT 138

Query: 162 SKHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           +  + C    C       +  +C +   PC + +  Y + ++++G  V D +       N
Sbjct: 139 T--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRIT-YGDGSTTTGFYVTDFVQYNQVSGN 195

Query: 216 ALKNSVQASVIIGCGMKQSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
               +  AS+  GCG  Q GG L     A DG++G G  + S+ S LA A  +R  F+ C
Sbjct: 196 GQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254

Query: 274 FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--- 328
            D    G IF        +  T+ L  N  +  Y + ++   +G + L+   ++F +   
Sbjct: 255 LDTVRGGGIFAIGNVVQPKVKTTPLVPNVTH--YNVNLQGISVGGATLQLPTSTFDSGDS 312

Query: 329 ---IVDSGSSFTFLPKEVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  +LP+EVY T +AA FD+  +  + +++ +    C++ S       P +
Sbjct: 313 KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDF---VCFQFSGSIDDGFPVI 369

Query: 385 KLMF 388
              F
Sbjct: 370 TFSF 373


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 145/312 (46%), Gaps = 28/312 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T I IGTP   + V +D GSD+LW+  +CV C        ++L  +L  Y P  S +
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWV--NCVSCD--GCPRKSNLGIELTMYDPRGSQS 144

Query: 162 SKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            + ++C  + C      +  SC +   PC Y++  Y + +S++G  V D L       + 
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTS-TSPCEYSIS-YGDGSSTAGFFVTDFLQYNQVSGDG 202

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                 ASV  GCG K  G      +A DG++G G    S+ S LA AG +R  F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 276 KDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTSF 326
             + G IF  G+      ++T  +     Y   + G++   +G + L           S 
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSGNSK 319

Query: 327 KAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
             I+DSG++  ++P+ VY+ + A  FD+  + ++ + + +    C++ S       P V 
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---CFQYSGSVDDGFPEVT 376

Query: 386 LMFPQNNSFVVN 397
             F  + S +V+
Sbjct: 377 FHFEGDVSLIVS 388


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 93/309 (30%), Positives = 135/309 (43%), Gaps = 22/309 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +G+P   + V +D GSD+LW+ C  C RC   S      L  DL  Y P  S 
Sbjct: 69  LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKS-----DLGIDLTLYDPKGSE 123

Query: 161 TSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS+ +SC    C        P    + PCPY++ Y  + ++++G  V+D L      DN 
Sbjct: 124 TSELISCDQEFCSATYDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNHVNDNL 182

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
                 +S+I GCG  QSG        A DG+IG G    SV S LA +G ++  FS C 
Sbjct: 183 RTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL 242

Query: 275 DKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFKA- 328
           D    G IF  G+       +T  +     Y   +  +E       + S      + K  
Sbjct: 243 DNIRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKGT 302

Query: 329 IVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           I+DSG++  +LP  VY E I     RQ    +   E      C++ +       P VKL 
Sbjct: 303 IIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVE--QQFSCFQYTGNVDRGFPVVKLH 360

Query: 388 FPQNNSFVV 396
           F  + S  V
Sbjct: 361 FEDSLSLTV 369


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 86/313 (27%), Positives = 144/313 (46%), Gaps = 30/313 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I IGTP   + V +D GSD+LW+ C  C  C   S     +L  +L  Y P  S 
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKS-----NLGIELTMYDPRGSQ 143

Query: 161 TSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           + + ++C  + C      +  SC +   PC Y++  Y + +S++G  V D L       +
Sbjct: 144 SGELVTCDQQFCVANYGGVLPSCTS-TSPCEYSIS-YGDGSSTAGFFVTDFLQYNQVSGD 201

Query: 216 ALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
                  ASV  GCG K  G      +A DG++G G    S+ S LA AG +R  F+ C 
Sbjct: 202 GQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL 261

Query: 275 DKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTS 325
           D  + G IF  G+      ++T  +     Y   + G++   +G + L           S
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGLPTNIFDSGNS 318

Query: 326 FKAIVDSGSSFTFLPKEVYETI-AAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  ++P+ VY+ + A  FD+  + ++ + + +    C++ S       P V
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---CFQYSGSVDDGFPEV 375

Query: 385 KLMFPQNNSFVVN 397
              F  + S +V+
Sbjct: 376 TFHFEGDVSLIVS 388


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 144/314 (45%), Gaps = 30/314 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IGTP+  + + +D G+D++W+ C  C  C   S     +L  DL  Y+   SS
Sbjct: 72  LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRS-----NLGMDLTLYNIKESS 126

Query: 161 TSKHLSCSHRLCD-----LGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
           + K + C   LC      L T C +     CPY ++ Y + +S++G  V+D++       
Sbjct: 127 SGKLVPCDQELCKEINGGLLTGCTSKTNDSCPY-LEIYGDGSSTAGYFVKDVVLFDQVSG 185

Query: 215 NALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           +    S   SVI GCG +QSG   Y +  A DG++G G    S+ S L+ +G ++  F+ 
Sbjct: 186 DLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAH 245

Query: 273 CFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQ 323
           C +  + G IF  G     T  +T  L     Y   +  ++   +G + L        ++
Sbjct: 246 CLNGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQ---VGHTFLNLSTDASEQR 302

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            S   I+DSG++  +LP  +Y+ +  +   +Q N  + +   +    C++ S       P
Sbjct: 303 DSKGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTL--HDEYTCFQYSGSVDDGFP 360

Query: 383 SVKLMFPQNNSFVV 396
           +V   F    S  V
Sbjct: 361 NVTFYFENGLSLKV 374


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 150/317 (47%), Gaps = 25/317 (7%)

Query: 98  DFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 156
           D   L+Y  I IGTP  S+ V +D GSD++W+ C  C +C   S     +L  +L  Y+ 
Sbjct: 75  DIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNI 129

Query: 157 SASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
             S + K +SC    C        + C+     CPY ++ Y + +S++G  V+D++   S
Sbjct: 130 DESDSGKLVSCDDDFCYQISGGPLSGCK-ANMSCPY-LEIYGDGSSTAGYFVKDVVQYDS 187

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGV---APDGLIGLGLGEISVPSLLAKAGLIRN 268
              +    +   SVI GCG +QSG  LD     A DG++G G    S+ S LA +G ++ 
Sbjct: 188 VAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKK 246

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY----ITYI-IGVETCCIGSSCLKQ 323
            F+ C D  + G IF   +    + + + L  N  +    +T + +G E   I +   + 
Sbjct: 247 IFAHCLDGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQP 306

Query: 324 TSFK-AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
              K AI+DSG++  +LP+ +YE +  +   Q            +K C++ S +     P
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFP 365

Query: 383 SVKLMFPQNNSFVVNNP 399
           +V   F +N+ F+   P
Sbjct: 366 NVTFHF-ENSVFLRVYP 381


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 141/312 (45%), Gaps = 23/312 (7%)

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDL 151
           + L  D G L++T I +GTP   + V +D GSD+LW+ C  C +C   S      L  DL
Sbjct: 75  LGLPTDTG-LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSG-----LGLDL 128

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDIL 207
             Y P ASS+   +SC    C      + P      PC Y++  Y + +S++G  V D L
Sbjct: 129 TFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSV-MYGDGSSTTGFFVTDAL 187

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
                  +       A+V  GCG +Q G       A DG++G G    S+ S LA AG +
Sbjct: 188 QFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKV 247

Query: 267 RNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYI----IGVETCCIGSSCL 321
           +  F+ C D    G IF  G+      ++T  +A    Y   +    +G  T  + +   
Sbjct: 248 KKIFAHCLDTIKGGGIFAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVF 307

Query: 322 KQTSFKA-IVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
           +    K  I+DSG++ T+LP+ V+ E +AA F++  +    + + +    C++       
Sbjct: 308 ETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQDF---MCFQYPGSVDD 364

Query: 380 KLPSVKLMFPQN 391
             P++   F  +
Sbjct: 365 GFPTITFHFEDD 376


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 94/310 (30%), Positives = 138/310 (44%), Gaps = 24/310 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +G+P   + V +D GSD+LW+ C +C RC   S      L  DL  Y P  S 
Sbjct: 69  LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKS-----DLGIDLTLYDPKGSE 123

Query: 161 TSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  +SC    C        P    + PCPY++ Y  + ++++G  V+D L       N 
Sbjct: 124 TSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNRINGNL 182

Query: 217 LKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
             +   +S+I GCG  QSG  G     A DG+IG G    SV S LA +G ++  FS C 
Sbjct: 183 RTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL 242

Query: 275 DKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFKA- 328
           D    G IF  G+       +T  +     Y   +  +E       + S      + K  
Sbjct: 243 DNVRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGT 302

Query: 329 IVDSGSSFTFLPKEVY-ETIAAEFDRQVNDTITSFEGYPWKC-CYKSSSQRLPKLPSVKL 386
           ++DSG++  +LP  VY E I     RQ    +   E   ++C  Y  +  R    P VKL
Sbjct: 303 VIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVE-QQFRCFLYTGNVDR--GFPVVKL 359

Query: 387 MFPQNNSFVV 396
            F  + S  V
Sbjct: 360 HFKDSLSLTV 369


>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
          Length = 426

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 71/231 (30%), Positives = 120/231 (51%), Gaps = 19/231 (8%)

Query: 178 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 237
           C +P   CPY + Y +  + S+G+LVED++H+ +    A      A +  G   +   G 
Sbjct: 128 CISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEAR----DARITFG---ESQLGL 180

Query: 238 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF 297
              VA +G++GL + +I+VP++L KAG+  +SFSMCF  +  G I FGD+G + Q  T  
Sbjct: 181 FKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLETP- 239

Query: 298 LASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEF-----D 352
           L+     + Y + +    +G   +  T F A  DSG++ T+L +  Y  +   F     D
Sbjct: 240 LSGTISPMFYDVSITKFKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPD 298

Query: 353 RQVNDTITSFEGYPWKCCY-KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 402
           R+++ ++ S    P++ CY  +S+    KLPSV        ++ V +P+ V
Sbjct: 299 RRLSKSVDS----PFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILV 345


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 141/312 (45%), Gaps = 30/312 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I IGTP+  + V +D GSD+LW+ C  C RC   S      L  DL  Y   AS+
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 208

Query: 161 TSKHLSCSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  + C    C L       C+ P   C Y++  Y + +S++G  V+D +       N 
Sbjct: 209 TSDAVGCDDNFCSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNF 266

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                  +V+ GCG KQSG       A DG++G G    S+ S LA +G ++  FS C D
Sbjct: 267 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 326

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA-I 329
             D G IF   +    + + + L  N  +   +     +G +   + S   +    K  I
Sbjct: 327 NVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386

Query: 330 VDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           +DSG++  + P+EVY     + ++ + D +++    +F       C+  +       P+V
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF------TCFDYTGNVDDGFPTV 440

Query: 385 KLMFPQNNSFVV 396
            L F ++ S  V
Sbjct: 441 TLHFDKSISLTV 452


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 141/312 (45%), Gaps = 30/312 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I IGTP+  + V +D GSD+LW+ C  C RC   S      L  DL  Y   AS+
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 208

Query: 161 TSKHLSCSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  + C    C L       C+ P   C Y++  Y + +S++G  V+D +       N 
Sbjct: 209 TSDAVGCDDNFCSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNF 266

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                  +V+ GCG KQSG       A DG++G G    S+ S LA +G ++  FS C D
Sbjct: 267 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 326

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA-I 329
             D G IF   +    + + + L  N  +   +     +G +   + S   +    K  I
Sbjct: 327 NVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386

Query: 330 VDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           +DSG++  + P+EVY     + ++ + D +++    +F       C+  +       P+V
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF------TCFDYTGNVDDGFPTV 440

Query: 385 KLMFPQNNSFVV 396
            L F ++ S  V
Sbjct: 441 TLHFDKSISLTV 452


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 141/312 (45%), Gaps = 30/312 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I IGTP+  + V +D GSD+LW+ C  C RC   S      L  DL  Y   AS+
Sbjct: 73  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 127

Query: 161 TSKHLSCSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  + C    C L       C+ P   C Y++  Y + +S++G  V+D +       N 
Sbjct: 128 TSDAVGCDDNFCSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNF 185

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                  +V+ GCG KQSG       A DG++G G    S+ S LA +G ++  FS C D
Sbjct: 186 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 245

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA-I 329
             D G IF   +    + + + L  N  +   +     +G +   + S   +    K  I
Sbjct: 246 NVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 305

Query: 330 VDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           +DSG++  + P+EVY     + ++ + D +++    +F       C+  +       P+V
Sbjct: 306 IDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF------TCFDYTGNVDDGFPTV 359

Query: 385 KLMFPQNNSFVV 396
            L F ++ S  V
Sbjct: 360 TLHFDKSISLTV 371


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 150/317 (47%), Gaps = 25/317 (7%)

Query: 98  DFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 156
           D   L+Y  I IGTP  S+ V +D GSD++W+ C  C +C   S     +L  +L  Y+ 
Sbjct: 75  DIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNI 129

Query: 157 SASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
             S + K +SC    C        + C+     CPY ++ Y + +S++G  V+D++   S
Sbjct: 130 DESDSGKLVSCDDDFCYQISGGPLSGCK-ANMSCPY-LEIYGDGSSTAGYFVKDVVQYDS 187

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGV---APDGLIGLGLGEISVPSLLAKAGLIRN 268
              +    +   SVI GCG +QSG  LD     A DG++G G    S+ S LA +G ++ 
Sbjct: 188 VAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKK 246

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY----ITYI-IGVETCCIGSSCLKQ 323
            F+ C D  + G IF   +    + + + L  N  +    +T + +G E   I +   + 
Sbjct: 247 IFAHCLDGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQP 306

Query: 324 TSFK-AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
              K AI+DSG++  +LP+ +YE +  +   Q            +K C++ S +     P
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFP 365

Query: 383 SVKLMFPQNNSFVVNNP 399
           +V   F +N+ F+   P
Sbjct: 366 NVTFHF-ENSVFLRVYP 381


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 90/322 (27%), Positives = 140/322 (43%), Gaps = 42/322 (13%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I IGTP+  + V +D GSD+LW+ C  C RC   S      L  DL  Y   AS+
Sbjct: 77  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKS-----DLGVDLTLYDMKAST 131

Query: 161 TSKHLSCSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  + C    C L       C+ P   C Y++  Y + +S++G  V+D +       N 
Sbjct: 132 TSDAVGCDDNFCSLYDGPLPGCK-PGLQCLYSV-LYGDGSSTTGYFVQDFVQYNRISGNF 189

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                  +V+ GCG KQSG       A DG++G G    S+ S LA +G ++  FS C D
Sbjct: 190 QTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLD 249

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYI---------------IGVETCCIGSSC 320
             D G IF    G   +    FL  N   I  +               +G +   + S  
Sbjct: 250 NVDGGGIFA--IGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDA 307

Query: 321 LKQTSFKA-IVDSGSSFTFLPKEVY-----ETIAAEFDRQVNDTITSFEGYPWKCCYKSS 374
            +    K  I+DSG++  + P+EVY     + ++ + D +++    +F       C+  +
Sbjct: 308 FESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT------CFDYT 361

Query: 375 SQRLPKLPSVKLMFPQNNSFVV 396
                  P+V L F ++ S  V
Sbjct: 362 GNVDDGFPTVTLHFDKSISLTV 383


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/358 (25%), Positives = 154/358 (43%), Gaps = 22/358 (6%)

Query: 52  SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGT 111
           ++P+    E  ++     ++ ++M     + + FP +G+   S       L+YT + +GT
Sbjct: 30  AFPSNDGVELSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVG----LYYTKVKLGT 85

Query: 112 PNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 171
           P     V +D GSD+LW+ C      P ++     L   LN + P +SSTS  +SC  R 
Sbjct: 86  PPRELYVQIDTGSDVLWVSCGSCNGCPQTSG----LQIQLNYFDPGSSSTSSLISCLDRR 141

Query: 172 CDLGT-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C  G      SC      C YT  Y  + + +SG  V D++H  S  +  L  +  ASV+
Sbjct: 142 CRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVV 200

Query: 227 IGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IF 283
            GC + Q+G       A DG+ G G   +SV S L+  G+    FS C   D+SG   + 
Sbjct: 201 FGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLV 260

Query: 284 FGDQGPATQQSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA-IVDSGSSFTF 338
            G+        +  + S   Y   +    +  +   I  S    ++ +  IVDSG++  +
Sbjct: 261 LGEIVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNNRGTIVDSGTTLAY 320

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
           L +E Y          +  ++ S      +C   ++S  +   P V L F    S V+
Sbjct: 321 LAEEAYNPFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVL 378


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 90/324 (27%), Positives = 145/324 (44%), Gaps = 25/324 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT + +G P   F V +D GSD+LW+ C+     P ++     L   LN + P +S+T
Sbjct: 82  LYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATS----GLQIPLNFFDPGSSTT 137

Query: 162 SKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +  +SCS ++C LG     ++C      C Y    Y + + +SG  V D++HL    D++
Sbjct: 138 ASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQ-YGDGSGTSGYYVMDMIHLDVVIDSS 196

Query: 217 LKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           + ++  ASV+ GC   Q+G       A DG+ G G  ++SV S L+  G+    FS C  
Sbjct: 197 VTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLK 256

Query: 276 KDDSGR--IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTS 325
            DDSG   +  G+        T  + S      Y + +++  +    L          +S
Sbjct: 257 GDDSGGGILVLGEIVEPNVVYTPLVPSQPH---YNLNLQSISVNGQVLPISPAVFATSSS 313

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
              I+DSG++  +L +E Y          V+ +  S        CY +SS      P V 
Sbjct: 314 QGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVV-LKGNRCYVTSSSVSDIFPQVS 372

Query: 386 LMFPQNNSFVVNNPVFVIYGTQVG 409
           L F    S V+    ++I    VG
Sbjct: 373 LNFAGGASLVLGAQDYLIQQNSVG 396


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 85/307 (27%), Positives = 142/307 (46%), Gaps = 34/307 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWI-PCDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I+IG+P   + V +D GSD+LW+    C  C   S      L  +L +Y P+ S 
Sbjct: 84  LYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSG-----LGIELTQYDPAGSG 138

Query: 161 TSKHLSCSHRLC-------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           T+  + C    C        +  +C +   PC + +  Y + +S++G  V D +      
Sbjct: 139 TT--VGCEQEFCVANSAASGVPPACPSAASPCQFRIT-YGDGSSTTGFYVTDFVQYNQVS 195

Query: 214 DNALKNSVQASVIIGCGMKQSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
            N        S+  GCG  Q GG L     A DG++G G  + S+ S LA A  +R  F+
Sbjct: 196 GNGQTTPSNVSITFGCG-AQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFA 254

Query: 272 MCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA 328
            C D    G IF  G+        T+ L  N  +  Y + ++   +G + L+   ++F +
Sbjct: 255 HCLDTVRGGGIFAIGNVVQPPIVKTTPLVPNATH--YNVNLQGISVGGATLQLPTSTFDS 312

Query: 329 ------IVDSGSSFTFLPKEVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
                 I+DSG++  +LP+EVY T + A FD+  +  + ++E +    C++ S     + 
Sbjct: 313 GDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYEDF---ICFQFSGSLDEEF 369

Query: 382 PSVKLMF 388
           P +   F
Sbjct: 370 PVITFSF 376


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 129/285 (45%), Gaps = 34/285 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  ++IG P   + + +D+GSDL W+ CD  C  C        N +   L  Y P+  
Sbjct: 63  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT-- 110

Query: 160 STSKHLSCSHRLCDL--------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHL 209
             SK + C HRLC             C++P + C Y + Y  +  SS+G+LV D   L L
Sbjct: 111 -KSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKY-ADQGSSTGVLVNDSFALRL 168

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRN 268
            +G      +  + SV  GCG  Q     D  +P DG++GLG G +S+ S L + G+ +N
Sbjct: 169 TNG------SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKN 222

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
               C      G +FFGD     Q++T + +A +     Y  G  +   G   L     K
Sbjct: 223 VVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAK 282

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
            + DSGSSFT+   + Y+ +       ++ T+          C+K
Sbjct: 283 VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWK 327


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 165/379 (43%), Gaps = 38/379 (10%)

Query: 52  SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGT 111
           ++P+    E  ++     ++ ++M     + + FP +G+   S     G L+YT + +GT
Sbjct: 30  AFPSNDGVELSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPS---QVG-LYYTKVKLGT 85

Query: 112 PNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 171
           P   F V +D GSD+LW+ C      P ++     L   LN + P +SSTS  +SCS R 
Sbjct: 86  PPREFYVQIDTGSDVLWVSCGSCNGCPQTS----GLQIQLNYFDPRSSSTSSLISCSDRR 141

Query: 172 CDLGT-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C  G      SC +    C YT   Y + + +SG  V D++H     +  L  +  ASV+
Sbjct: 142 CRSGVQTSDASCSSQNNQCTYTFQ-YGDGSGTSGYYVSDLMHFAGIFEGTLTTNSSASVV 200

Query: 227 IGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IF 283
            GC + Q+G       A DG+ G G   +SV S L+  G+    FS C   D+SG   + 
Sbjct: 201 FGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSGGGVLV 260

Query: 284 FGD-------QGPATQQSTSF------LASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 330
            G+         P  Q    +      ++ NG+    I+ +      +S  + T    IV
Sbjct: 261 LGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQ----IVPIAPAVFATSNNRGT----IV 312

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQ 390
           DSG++  +L +E Y          V  ++ S      +C   ++S  +   P V L F  
Sbjct: 313 DSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAG 372

Query: 391 NNSFVVNNPVFVIYGTQVG 409
             S V+    +++    +G
Sbjct: 373 GASLVLRPQDYLMQQNYIG 391


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 128/284 (45%), Gaps = 33/284 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  ++IG P   + + +D+GSDL W+ CD  C  C        N +   L  Y P+  
Sbjct: 65  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT-- 112

Query: 160 STSKHLSCSHRLCDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLI 210
             SK + C HRLC            C +P + C Y + Y  +  SS+G+L+ D   L L 
Sbjct: 113 -KSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKY-ADQGSSTGVLINDSFALRLT 170

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNS 269
           +G      +  + SV  GCG  Q     D  +P DG++GLG G +S+ S L + G+ +N 
Sbjct: 171 NG------SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNV 224

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
              C      G +FFGD     Q++T + +A +     Y  G  +   G   L     K 
Sbjct: 225 VGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 284

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           + DSGSSFT+   + Y+ +       ++ T+          C+K
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWK 328


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 83/270 (30%), Positives = 124/270 (45%), Gaps = 22/270 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T I IGTP  S+ V +D GSD+LW+  +CV C   +    + L  +L  Y PS SS+
Sbjct: 80  LYFTQIGIGTPAKSYYVQVDTGSDILWV--NCVFCD--TCPRKSGLGIELTLYDPSGSSS 135

Query: 162 SKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
              ++C    C      +  SC  P  PC Y++  Y + +S++G  V D L       N+
Sbjct: 136 GTGVTCGQDFCVATHGGVIPSCV-PAAPCQYSIS-YGDGSSTTGFFVTDFLQYNQVSGNS 193

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                  S+  GCG K  G       A DG++G G    S+ S LA AG +R  F+ C D
Sbjct: 194 QTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLD 253

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------QTSFK 327
             + G IF        + ST+ L     +  Y + +E   +G   L+          S  
Sbjct: 254 TINGGGIFAIGDVVQPKVSTTPLVPGMPH--YNVNLEAIDVGGVKLQLPTNIFDIGESKG 311

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
            I+DSG++  +LP  VY  I ++   Q  D
Sbjct: 312 TIIDSGTTLAYLPGVVYNAIMSKVFAQYGD 341


>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 508

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 123/277 (44%), Gaps = 32/277 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I+IG P   + + +D GS L WI CD  C  C       Y     ++    P   S
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENI---VPPRDS 185

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
             + L  +   CD   +C+     C Y +  Y + +SS+G+L  D + LI+  D   +N 
Sbjct: 186 HCQELQGNQNYCD---TCKQ----CDYEI-AYADRSSSAGVLARDNMELITA-DGEREN- 235

Query: 221 VQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
               ++ GC   Q G  L   A  DG++GL  G +S+P+ LAK G+I N F  C   D S
Sbjct: 236 --MDLVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPS 293

Query: 280 GR--IFFGDQGPATQQSTSFLASNGK---YITYIIGVETCCIGSSCLKQTS--FKAIVDS 332
           G   +F GD        T     NG    Y T +  V   C   +  +Q     + I DS
Sbjct: 294 GSAYMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDS 353

Query: 333 GSSFTFLPKEVY-------ETIAAEFDRQVNDTITSF 362
           GSS+T+ P E+Y       E ++  F R  +D    F
Sbjct: 354 GSSYTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPF 390


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 90/318 (28%), Positives = 152/318 (47%), Gaps = 31/318 (9%)

Query: 98  DFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 156
           D   L+Y  I IGTP  S+ V +D GSD++W+ C  C +C   S     +L  +L  Y+ 
Sbjct: 75  DIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRS-----TLGIELTLYNI 129

Query: 157 SASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
             S + K +SC    C        + C+     CPY ++ Y + +S++G  V+D++   S
Sbjct: 130 DESDSGKLVSCDDDFCYQISGGPLSGCK-ANMSCPY-LEIYGDGSSTAGYFVKDVVQYDS 187

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGV---APDGLIGLGLGEISVPSLLAKAGLIRN 268
              +    +   SVI GCG +QSG  LD     A DG++G G    S+ S LA +G ++ 
Sbjct: 188 VAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKK 246

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY----ITYI-IGVETCCIGSSCLKQ 323
            F+ C D  + G IF   +    + + + L  N  +    +T + +G E   I +   + 
Sbjct: 247 IFAHCLDGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQP 306

Query: 324 TSFK-AIVDSGSSFTFLPKEVYET-IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
              K AI+DSG++  +LP+ +YE  +  E   +V+     ++      C++ S +     
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKEPALKVHIVDKDYK------CFQYSGRVDEGF 360

Query: 382 PSVKLMFPQNNSFVVNNP 399
           P+V   F +N+ F+   P
Sbjct: 361 PNVTFHF-ENSVFLRVYP 377


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 83/281 (29%), Positives = 124/281 (44%), Gaps = 27/281 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSA- 158
           L+Y  + IG P   + + +D GSDL W+ CD  CV C  +    Y       N+  P   
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLYRPTK---NKIVPCVD 113

Query: 159 ---SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
              SS    LS  H+       C +PKQ C Y + Y  +  SS G+L+ D   +      
Sbjct: 114 QLCSSLHGGLSGKHK-------CDSPKQQCDYEIKY-ADQGSSLGVLLTDSFAV------ 159

Query: 216 ALKNS--VQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
            L NS  V+ S+  GCG  Q  G    VAP DG++GLG G IS+ S L + G+ +N    
Sbjct: 160 RLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGH 219

Query: 273 CFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 331
           C      G +FFGD   P ++ +   +  +     Y  G  +   G   L     + ++D
Sbjct: 220 CLSIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLD 279

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           SGSSFT+   + Y+ +       ++ T+          C+K
Sbjct: 280 SGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPLCWK 320


>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
 gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
          Length = 307

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 64/166 (38%), Positives = 91/166 (54%), Gaps = 10/166 (6%)

Query: 246 LIGLGLGEISVPSLLAKAGLIR-NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKY 304
           L+GLG+ ++SVPS+LA  G+++ NSFSMCF KD  GRI FGD G A Q  T F+  +  +
Sbjct: 9   LMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKS-TH 67

Query: 305 ITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
             Y I + +  +G   L    F AI DSG+SFT+L    Y      F+ Q+++   +F G
Sbjct: 68  SYYNISITSMSVGDKNLP-LGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSG 126

Query: 365 ------YPWKCCYK-SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
                 +P++ CY  S  Q   +LP V L       F V +PV+ I
Sbjct: 127 STRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPI 172


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 128/284 (45%), Gaps = 33/284 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  ++IG P   + + +D+GSDL W+ CD  C  C        N +   L  Y P+  
Sbjct: 56  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT-- 103

Query: 160 STSKHLSCSHRLCDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLI 210
             SK + C HRLC            C +P + C Y + Y  +  SS+G+L+ D   L L 
Sbjct: 104 -KSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKY-ADQGSSTGVLINDSFALRLT 161

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNS 269
           +G      +  + SV  GCG  Q     D  +P DG++GLG G +S+ S L + G+ +N 
Sbjct: 162 NG------SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNV 215

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
              C      G +FFGD     Q++T + +A +     Y  G  +   G   L     K 
Sbjct: 216 VGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 275

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           + DSGSSFT+   + Y+ +       ++ T+          C+K
Sbjct: 276 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWK 319


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 128/284 (45%), Gaps = 33/284 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  ++IG P   + + +D+GSDL W+ CD  C  C        N +   L  Y P+  
Sbjct: 65  LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC--------NEVPHPL--YRPT-- 112

Query: 160 STSKHLSCSHRLCDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLI 210
             SK + C HRLC            C +P + C Y + Y  +  SS+G+L+ D   L L 
Sbjct: 113 -KSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKY-ADQGSSTGVLINDSFALRLT 170

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNS 269
           +G      +  + SV  GCG  Q     D  +P DG++GLG G +S+ S L + G+ +N 
Sbjct: 171 NG------SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNV 224

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
              C      G +FFGD     Q++T + +A +     Y  G  +   G   L     K 
Sbjct: 225 VGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 284

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           + DSGSSFT+   + Y+ +       ++ T+          C+K
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWK 328


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 136/311 (43%), Gaps = 26/311 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IGTP  ++ + +D GSD++W+ C  C  C   S     SL  DL  Y    SS
Sbjct: 82  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRS-----SLGMDLTLYDIKESS 136

Query: 161 TSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           + K + C    C      L T C      CPY ++ Y + +S++G  V+DI+       +
Sbjct: 137 SGKLVPCDQEFCKEINGGLLTGC-TANISCPY-LEIYGDGSSTAGYFVKDIVLYDQVSGD 194

Query: 216 ALKNSVQASVIIGCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
              +S   S++ GCG +QSG     +  A DG++G G    S+ S LA +G ++  F+ C
Sbjct: 195 LKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHC 254

Query: 274 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA---- 328
            +  + G IF  G         T  L     Y   +  V+      S    TS +     
Sbjct: 255 LNGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKG 314

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVK 385
            I+DSG++  +LP+ +YE +  +   Q  D    T  + Y    C++ S       P+V 
Sbjct: 315 TIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYT---CFQYSESVDDGFPAVT 371

Query: 386 LMFPQNNSFVV 396
             F    S  V
Sbjct: 372 FFFENGLSLKV 382


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 142/312 (45%), Gaps = 27/312 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I +G P   + V +D GSD+LW+ C +C +C   S      L   L  Y P +S+
Sbjct: 81  LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKS-----DLGVKLTLYDPQSST 135

Query: 161 TSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           ++  + C    C      +   C     PC Y++  Y + +S++G  V+D L       N
Sbjct: 136 SATRIYCDDDFCAATYNGVLQGC-TKDLPCQYSV-VYGDGSSTAGFFVKDNLQFDRVTGN 193

Query: 216 ALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
              +S   SVI GCG KQSG       A DG++G G    S+ S LA AG ++  F+ C 
Sbjct: 194 LQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL 253

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--------F 326
           D    G IF   +  + + +T+ +  N  +  Y + ++   +G + L+  +         
Sbjct: 254 DNVKGGGIFAIGEVVSPKVNTTPMVPNQPH--YNVVMKEIEVGGNVLELPTDIFDTGDRR 311

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
             I+DSG++  +LP+ VYE++  +    Q    + + E      C++ +       P VK
Sbjct: 312 GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVE--EQFTCFQYTGNVNEGFPVVK 369

Query: 386 LMFPQNNSFVVN 397
             F  + S  VN
Sbjct: 370 FHFNGSLSLTVN 381


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 79/253 (31%), Positives = 114/253 (45%), Gaps = 25/253 (9%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I +G P   + + +D GSDL WI CD  C  CA      Y      +    P   S
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRDS 247

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L      C+   +C+     C Y ++Y  + +SS G+L +D +HLI+  GG   L 
Sbjct: 248 LCQELQGDQNYCE---TCKQ----CDYEIEY-ADRSSSMGVLAKDDMHLIATNGGREKL- 298

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS+PS LA  G+I N F  C  ++
Sbjct: 299 -----DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRE 353

Query: 278 DS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-TSFKAIVDSGS 334
            +  G +F GD        T      G    Y    +    G   L    S + I DSGS
Sbjct: 354 TNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGS 413

Query: 335 SFTFLPKEVYETI 347
           S+T+LP+E+Y+ +
Sbjct: 414 SYTYLPEEMYKNL 426


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 152/343 (44%), Gaps = 28/343 (8%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           FP QG+    L      L++T + +G+P   F V +D GSD+LW+ C      P+++   
Sbjct: 70  FPVQGTFNPFLVG----LYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTS--- 122

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSS 199
             L   L  + P +S+T+  +SCS + C  G       C +    C YT   Y + + +S
Sbjct: 123 -GLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQ-YGDGSGTS 180

Query: 200 GLLVEDILH----LISGGD-NALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGE 253
           G  V D++H    L+S G+ + +  +  +SV   C   Q+G       A DG+ G G  E
Sbjct: 181 GYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQE 240

Query: 254 ISVPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYI--- 308
           +SV S LA  G+    FS C   DDS  G +  G+        T  + S   Y  Y+   
Sbjct: 241 MSVISQLASQGITPRVFSHCLKGDDSGGGVLVLGEIVEPNIVYTPLVPSQPHYNLYLQSI 300

Query: 309 -IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
            +  +T  I  S    +S +  IVDSG++  +L +  Y+   +     V+    ++    
Sbjct: 301 SVAGQTLAIDPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKG 360

Query: 367 WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQVG 409
            + CY  +S      P V L F    S ++N   +++    VG
Sbjct: 361 NQ-CYLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVG 402


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 79/259 (30%), Positives = 114/259 (44%), Gaps = 29/259 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I IG P   + + +D GSDL WI CD  C  CA      Y      +    P    
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRDL 243

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L  +   C+   +C+     C Y ++Y  + +SS G+L  D +H+I+  GG   L 
Sbjct: 244 LCQELQGNQNYCE---TCKQ----CDYEIEY-ADQSSSMGVLARDDMHMIATNGGREKL- 294

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS PS LA  G+I N F  C  ++
Sbjct: 295 -----DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITRE 349

Query: 278 D--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIV 330
               G +F GD        T     +G    Y         G   L++     ++ + I 
Sbjct: 350 QGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIF 409

Query: 331 DSGSSFTFLPKEVYETIAA 349
           DSGSS+T+LP E+YE + A
Sbjct: 410 DSGSSYTYLPNEIYENLVA 428


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 80/279 (28%), Positives = 127/279 (45%), Gaps = 29/279 (10%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C  +   +Y       N+  P A+S
Sbjct: 73  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTK---NKIVPCAAS 129

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
               L+ + +       C  P+Q C Y + Y T+  SS G+L+ D   L      +L+NS
Sbjct: 130 LCTSLTPNKK-------CAVPQQ-CDYQIKY-TDKASSLGVLIADNFTL------SLRNS 174

Query: 221 --VQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
             V+A++  GCG  Q  G    V  A DGL+GLG G +S+ S L + G+ +N    CF  
Sbjct: 175 STVRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFST 234

Query: 277 DDSGRIFFGDQGPATQQST---SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSG 333
           +  G +FFGD    T + T       ++G Y  Y  G  T       L     + + DSG
Sbjct: 235 NGGGFLFFGDDIVPTSRVTWVPMARTTSGNY--YSPGSGTLYFDRRSLGMKPMEVVFDSG 292

Query: 334 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           S++ +   E Y+   +     ++ ++          C+K
Sbjct: 293 STYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPLCWK 331


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 126/265 (47%), Gaps = 38/265 (14%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVR-CAPLSASYYNSLDRDLNEY 154
            D+G+  Y  + +GTP   F V +D GS + ++PC  C R C P               +
Sbjct: 57  KDYGYF-YATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKD---------AAF 106

Query: 155 SPSASSTSKHLSCSHRLCDLGT---SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
            P++SS+S  + C    C  G     C   K+ C Y   Y  E +SS+GLLV D L L  
Sbjct: 107 DPASSSSSAVIGCDSDKCICGRPPCGCSE-KRECTYQRTY-AEQSSSAGLLVSDQLQLRD 164

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
           G            V+ GC  K++G   +  A DG++GLG  E+S+ + LA +G+I + F+
Sbjct: 165 GA---------VEVVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFA 214

Query: 272 MCFDK-DDSGRIFFGDQGPA----TQQSTSFLASNGKYITYIIGVETCCIGSSCLK---- 322
           +CF   +  G +  GD   A      Q T+ L+S      Y + +E   +G   L     
Sbjct: 215 LCFGSVEGDGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPE 274

Query: 323 --QTSFKAIVDSGSSFTFLPKEVYE 345
             +  +  ++DSG++FT+LP E ++
Sbjct: 275 RYEEGYGTVLDSGTTFTYLPSEAFQ 299


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 141/303 (46%), Gaps = 26/303 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+Y  I IGTP+  + V +D GSD++W+ C   R  P ++    SL  +L  Y    S+T
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTS----SLGMELTPYDLEESTT 141

Query: 162 SKHLSCSHRLC---DLG-TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
            K +SC  + C   + G  S       CPY +  Y + +S++G  V+D +       +  
Sbjct: 142 GKLVSCDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLE 200

Query: 218 KNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             +   S+  GCG +QSG  G     A DG++G G    S+ S LA    ++  F+ C D
Sbjct: 201 TTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLD 260

Query: 276 KDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA---- 328
             + G IF  G         T  + +   Y   + GV+   +G   L  ++  F+A    
Sbjct: 261 GTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQ---VGHIILNISADVFEAGDRK 317

Query: 329 --IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
             I+DSG++  +LP+ +YE + A+   +Q N  + +  G  +K C++ S +     P V 
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHG-EYK-CFQYSERVDDGFPPVI 375

Query: 386 LMF 388
             F
Sbjct: 376 FHF 378


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 86/304 (28%), Positives = 142/304 (46%), Gaps = 28/304 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+Y  I IGTP+  + V +D GSD++W+ C   R  P ++    SL  +L  Y    S+T
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTS----SLGMELTPYDLEESTT 141

Query: 162 SKHLSCSHRLC---DLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            K +SC  + C   + G  + C      CPY +  Y + +S++G  V+D +       + 
Sbjct: 142 GKLVSCDEQFCLEVNGGPLSGCTT-NMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDL 199

Query: 217 LKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
              +   S+  GCG +QSG  G     A DG++G G    S+ S LA    ++  F+ C 
Sbjct: 200 ETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL 259

Query: 275 DKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--FKA--- 328
           D  + G IF  G         T  + +   Y   + GV+   +G   L  ++  F+A   
Sbjct: 260 DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQ---VGHIILNISADVFEAGDR 316

Query: 329 ---IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  +LP+ +YE + A+   +Q N  + +  G  +K C++ S +     P V
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHG-EYK-CFQYSERVDDGFPPV 374

Query: 385 KLMF 388
              F
Sbjct: 375 IFHF 378


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/331 (29%), Positives = 154/331 (46%), Gaps = 39/331 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +G+P   + V +D GSD+LWI C  C +C   +     +L+  L+ +  +ASS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKT-----NLNFRLSLFDMNASS 127

Query: 161 TSKHLSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           TSK + C    C       SCQ P   C Y + Y  E+T S G  + D+L L     +  
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQ-PALGCSYHIVYADEST-SDGKFIRDMLTLEQVTGDLK 185

Query: 218 KNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
              +   V+ GCG  QSG   +G  A DG++G G    SV S LA  G  +  FS C D 
Sbjct: 186 TGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 245

Query: 277 DDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC-LKQTSFK---AIVD 331
              G IF  G       ++T  + +   Y   ++G++    G+S  L ++  +    IVD
Sbjct: 246 VKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV--DGTSLDLPRSIVRNGGTIVD 303

Query: 332 SGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP----- 382
           SG++  + PK +Y    ETI A    +++    +F+      C+  S+      P     
Sbjct: 304 SGTTLAYFPKVLYDSLIETILARQPVKLHIVEETFQ------CFSFSTNVDEAFPPVSFE 357

Query: 383 ---SVKL-MFPQNNSFVVNNPVFVIYGTQVG 409
              SVKL ++P +  F +   ++  +G Q G
Sbjct: 358 FEDSVKLTVYPHDYLFTLEEELYC-FGWQAG 387


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/313 (30%), Positives = 139/313 (44%), Gaps = 49/313 (15%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I +GTP V + V +D GSD+ W+ C  C  C  ++ +   S+   L  Y PS SS
Sbjct: 36  LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSC--VTETQLPSIK--LTTYDPSRSS 91

Query: 161 TSKHLSCSHRLCD--LGT---SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           T   LSC    C   LG+   SC +    C Y+  Y  + +S+ G  ++D++      +N
Sbjct: 92  TDGALSCRDSNCGAALGSNEVSCTSAGY-CAYSTTY-GDGSSTQGYFIQDVMTFQEIHNN 149

Query: 216 ALKNSVQASVIIGCGMKQSGGYL-DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
              N   ASV  GCG  QSG  L    A DGLIG G   +S+PS LA  G + N F+ C 
Sbjct: 150 TQVNGT-ASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL 208

Query: 275 DKDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI-GSSCLKQTSFK---- 327
             D+   G I  G         T  ++ N     Y +G++   + G +     SF     
Sbjct: 209 QGDNQGGGTIVIGSVSEPNISYTPIVSRN----HYAVGMQNIAVNGRNVTTPASFDTTST 264

Query: 328 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL----- 378
                I+DSG++  +L    Y         Q  + +++FE       + S SQ L     
Sbjct: 265 SAGGVIMDSGTTLAYLVDPAYT--------QFVNAVSTFE----SSMFSSHSQCLQLAWC 312

Query: 379 ---PKLPSVKLMF 388
                 P+VKL F
Sbjct: 313 SLQADFPTVKLFF 325


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 79/259 (30%), Positives = 113/259 (43%), Gaps = 29/259 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I +G P   + + +D GSDL WI CD  C  CA      Y      +    P    
Sbjct: 187 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKI---VPPRDL 243

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L  +   C+   +C+     C Y ++Y  + +SS G+L  D +HLI+  GG   L 
Sbjct: 244 LCQELQGNQNYCE---TCKQ----CDYEIEY-ADQSSSMGVLARDDMHLIATNGGREKL- 294

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS+PS LA  G+I N F  C  ++
Sbjct: 295 -----DFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITRE 349

Query: 278 D--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIV 330
               G +F GD        T     +G    Y         G   L+       + + I 
Sbjct: 350 QGGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIF 409

Query: 331 DSGSSFTFLPKEVYETIAA 349
           DSGSS+T+LP E+YE + A
Sbjct: 410 DSGSSYTYLPDEIYENLVA 428


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/331 (29%), Positives = 154/331 (46%), Gaps = 39/331 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +G+P   + V +D GSD+LWI C  C +C   +     +L+  L+ +  +ASS
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKT-----NLNFRLSLFDMNASS 127

Query: 161 TSKHLSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           TSK + C    C       SCQ P   C Y + Y  E+T S G  + D+L L     +  
Sbjct: 128 TSKKVGCDDDFCSFISQSDSCQ-PALGCSYHIVYADEST-SDGKFIRDMLTLEQVTGDLK 185

Query: 218 KNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
              +   V+ GCG  QSG   +G  A DG++G G    SV S LA  G  +  FS C D 
Sbjct: 186 TGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 245

Query: 277 DDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC-LKQTSFK---AIVD 331
              G IF  G       ++T  + +   Y   ++G++    G+S  L ++  +    IVD
Sbjct: 246 VKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV--DGTSLDLPRSIVRNGGTIVD 303

Query: 332 SGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP----- 382
           SG++  + PK +Y    ETI A    +++    +F+      C+  S+      P     
Sbjct: 304 SGTTLAYFPKVLYDSLIETILARQPVKLHIVEETFQ------CFSFSTNVDEAFPPVSFE 357

Query: 383 ---SVKL-MFPQNNSFVVNNPVFVIYGTQVG 409
              SVKL ++P +  F +   ++  +G Q G
Sbjct: 358 FEDSVKLTVYPHDYLFTLEEELYC-FGWQAG 387


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 89/304 (29%), Positives = 135/304 (44%), Gaps = 24/304 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I +G  +  + V +D GSD LW+ C  C  C   S      L  DL  Y P+ S 
Sbjct: 75  LYYTKIGLGPKD--YYVQVDTGSDTLWVNCVGCTACPKKSG-----LGMDLTLYDPNLSK 127

Query: 161 TSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDN 215
           TSK + C    C    D   S       CPY++ Y   +T+S   + +D+    + G   
Sbjct: 128 TSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLR 187

Query: 216 ALKNSVQASVIIGCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
            + ++   SVI GCG KQSG        + DG+IG G    SV S LA AG ++  FS C
Sbjct: 188 TVPDN--TSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHC 245

Query: 274 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCC----IGSSCLKQTSFKA 328
            D    G IF  G+      ++T  L     Y   +  +E       + S  L  +S + 
Sbjct: 246 LDSISGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRG 305

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKL 386
            I+DSG++  +LP  +Y+ +  +   Q +          + C + S  + +  L P+VK 
Sbjct: 306 TIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFTCFHYSDEESVDDLFPTVKF 365

Query: 387 MFPQ 390
            F +
Sbjct: 366 TFEE 369


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 159/382 (41%), Gaps = 66/382 (17%)

Query: 34  RFSEEVKALGVSKNRNATSWPAKK--SFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSK 91
           + SE ++AL V+K+     W A +  S  +  +  ++DV+            L P  G  
Sbjct: 7   KRSEAIRAL-VAKSHARVRWMAARANSSSWSSMAGTTDVESP----------LHPDGGGY 55

Query: 92  TMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRD 150
            M             I +GTP   F    D GSDL+W+  + C  C+  +          
Sbjct: 56  VMD------------ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGTI--------- 94

Query: 151 LNEYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
              + P  SST + + CS +LC +L  SC+     C Y+ +Y +  T   G    D + L
Sbjct: 95  ---FDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYSYEYGSGET--EGEFARDTISL 149

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
            +  D + K     S  +GCGM  SG   DGV  DGL+GLG G +S+ S L+ A  I + 
Sbjct: 150 GTTSDGSQKF---PSFAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSK 200

Query: 270 FSMCF----DKDDSGRIFFGDQGP---ATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL 321
           FS C      + +S  + FG          QST     +  Y T Y++ V    +    +
Sbjct: 201 FSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTM 260

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
                  I+DSG++ T++P  VY  + +  +  V              CY  SS R  K 
Sbjct: 261 GSPG-TTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKF 319

Query: 382 PSVKLMF-------PQNNSFVV 396
           P++ +         P +N F+V
Sbjct: 320 PALTIRLAGATMTPPSSNYFLV 341


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 120/277 (43%), Gaps = 19/277 (6%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + IG P   + + +D GSDL W+ CD  CV C+ +    Y       N+  P   
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVD 113

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
                L   H        C +PKQ C Y + Y  +  SS G+LV D   L       L N
Sbjct: 114 QMCAAL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLAN 163

Query: 220 S--VQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
           S  V+  +  GCG  +Q G   +  A DG++GLG G +S+ S L + G+ +N    C   
Sbjct: 164 SSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLST 223

Query: 277 DDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 335
              G +FFGD   P ++ + + +A +     Y  G      G   L     + + DSGSS
Sbjct: 224 RGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSS 283

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           FT+   + Y+ +       ++  +     +    C+K
Sbjct: 284 FTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWK 320


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 120/277 (43%), Gaps = 19/277 (6%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + IG P   + + +D GSDL W+ CD  CV C+ +    Y       N+  P   
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVD 113

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
                L   H        C +PKQ C Y + Y  +  SS G+LV D   L       L N
Sbjct: 114 QMCAAL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLAN 163

Query: 220 S--VQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
           S  V+  +  GCG  +Q G   +  A DG++GLG G +S+ S L + G+ +N    C   
Sbjct: 164 SSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLST 223

Query: 277 DDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 335
              G +FFGD   P ++ + + +A +     Y  G      G   L     + + DSGSS
Sbjct: 224 RGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSS 283

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           FT+   + Y+ +       ++  +     +    C+K
Sbjct: 284 FTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWK 320


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 77/284 (27%), Positives = 123/284 (43%), Gaps = 33/284 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + IG P   + + +D GSDL W+ CD  CV C+ +    Y               
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRP------------- 103

Query: 160 STSKHLSCSHRLCDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           + +K + C  ++C            C +PKQ C Y + Y  +  SS G+LV D   L   
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL--- 159

Query: 213 GDNALKNS--VQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
               L NS  V+  +  GCG  +Q G   +  A DG++GLG G +S+ S L + G+ +N 
Sbjct: 160 ---RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNV 216

Query: 270 FSMCFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
              C      G +FFGD   P ++ + + +A +     Y  G      G   L     + 
Sbjct: 217 VGHCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEV 276

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           + DSGSSFT+   + Y+ +       ++  +     +    C+K
Sbjct: 277 VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWK 320


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 120/277 (43%), Gaps = 19/277 (6%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + IG P   + + +D GSDL W+ CD  CV C+ +    Y       N+  P   
Sbjct: 57  LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTK---NKLVPCVD 113

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
                L   H        C +PKQ C Y + Y  +  SS G+LV D   L       L N
Sbjct: 114 QMCAAL---HGGLTGRHKCDSPKQQCDYEIKY-ADQGSSLGVLVTDSFAL------RLAN 163

Query: 220 S--VQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
           S  V+  +  GCG  +Q G   +  A DG++GLG G +S+ S L + G+ +N    C   
Sbjct: 164 SSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLST 223

Query: 277 DDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSS 335
              G +FFGD   P ++ + + +A +     Y  G      G   L     + + DSGSS
Sbjct: 224 RGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSS 283

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           FT+   + Y+ +       ++  +     +    C+K
Sbjct: 284 FTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWK 320


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 78/283 (27%), Positives = 125/283 (44%), Gaps = 35/283 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+A+S
Sbjct: 54  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTANS 103

Query: 161 TSKHLSCSHRLCDL-------GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
               + C++ LC            C +PKQ C Y + Y T++ SS G+L+ D   L    
Sbjct: 104 L---VPCANALCTALHSGHGSNNKCPSPKQ-CDYQIKY-TDSASSQGVLINDNFSLPMRS 158

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
            N     ++  +  GCG  Q  G    V  A DG++GLG G +S+ S L + G+ +N   
Sbjct: 159 SN-----IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLG 213

Query: 272 MCFDKDDSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI 329
            C   +  G +FFGD    T + T       +G Y  Y  G  T       L     + +
Sbjct: 214 HCLSTNGGGFLFFGDDIVPTSRVTWVPMAKISGNY--YSPGSGTLYFDRRSLGVKPMEVV 271

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
            DSGS++T+   + Y+ + +     ++ ++          C+K
Sbjct: 272 FDSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPLCWK 314


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 83/285 (29%), Positives = 129/285 (45%), Gaps = 37/285 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+ + 
Sbjct: 52  YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSC--------NKVPHPL--YKPTKN- 100

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPC--PYTMDY---YTENTSSSGLLVEDILHLISGGDN 215
             K + C+  +C    S Q+P + C  P   DY   YT++ SS G+LV D   L      
Sbjct: 101 --KLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTL------ 152

Query: 216 ALKNS--VQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNSF 270
            L+NS  V+ S   GCG  Q  G  +GV     DGL+GLG G +S+ S L   G+ +N  
Sbjct: 153 PLRNSSSVRPSFTFGCGYDQQVGK-NGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVL 211

Query: 271 SMCFDKDDSGRIFFGDQGPATQQST---SFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
             C   +  G +FFGD    T ++T      +++G Y  Y  G  T       L     +
Sbjct: 212 GHCLSTNGGGFLFFGDNVVPTSRATWVPMVRSTSGNY--YSPGSGTLYFDRRSLGVKPME 269

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
            + DSGS++T+   + Y+   +     ++ ++          C+K
Sbjct: 270 VVFDSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWK 314


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 83/268 (30%), Positives = 128/268 (47%), Gaps = 39/268 (14%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + IG P   + + +D GSDL W+ CD  C  CA      Y+              
Sbjct: 22  LYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDP------------- 68

Query: 160 STSKHLSCSHRLCDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
             ++ + C   LC L       +C  P + C Y ++Y  + +S+ G+L+ED + L+    
Sbjct: 69  KKARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEY-ADGSSTMGVLMEDTITLL---- 123

Query: 215 NALKNSVQA--SVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFS 271
             L N  ++  + IIGCG  Q G      A  DG++GL   +IS+PS LAK G++RN   
Sbjct: 124 --LTNGTRSKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIG 181

Query: 272 MCF--DKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
            C     +  G +FFGD   PA   + + +   GK IT  IG ++   G +  K      
Sbjct: 182 HCLAGGSNGGGYLFFGDSLVPALGMTWTPIM--GKSITGNIGGKS---GDADDKTGDIGG 236

Query: 329 IV-DSGSSFTFLPKEVYETIAAEFDRQV 355
           ++ DSG+SFT+L  E Y  + +  + QV
Sbjct: 237 VMFDSGTSFTYLVPEAYNAVLSAMEMQV 264


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 137/317 (43%), Gaps = 35/317 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IGTP   + V +D GSD++W+ C  C  C   S     SL  +L  Y    S 
Sbjct: 97  LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESL 151

Query: 161 TSKHLSCSHRLCDLGTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           T K +SC    C    +   P        C YT + Y + +SS G  V DI+       +
Sbjct: 152 TGKLVSCDQDFC-YAINGGPPSYCIANMSCSYT-EIYADGSSSFGYFVRDIVQYDQVSGD 209

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
               S   SVI GC   QSG      A DG++G G    S+ S LA +G +R  F+ C D
Sbjct: 210 LETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLD 269

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--------FK 327
             + G IF        + +T+ L  N  +  Y + ++   +G   L   +          
Sbjct: 270 GLNGGGIFAIGHIVQPKVNTTPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKG 327

Query: 328 AIVDSGSSFTFLPKEVYETIAAEF-----DRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            I+DSG++  +LP+ VY+ + ++      D +V+     F       C++ S       P
Sbjct: 328 TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT------CFQYSESLDDGFP 381

Query: 383 SVKLMFPQNNSFVVNNP 399
           +V   F +N+ ++  +P
Sbjct: 382 AVTFHF-ENSLYLKVHP 397


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 138/304 (45%), Gaps = 32/304 (10%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +G+P   + V +D GSD+LW+ C  C +C P+       L   L+ Y   ASS
Sbjct: 76  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKTD----LGIPLSLYDSKASS 130

Query: 161 TSKHLSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
           TSK++ C    C      +    K+PC Y +  Y + ++S G  V+D + L     N   
Sbjct: 131 TSKNVGCEDAFCSFIMQSETCGAKKPCSYHV-VYGDGSTSDGDFVKDNITLDQVTGNLRT 189

Query: 219 NSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
             +   V+ GCG  QSG  G  +  A DG++G G    SV S LA  G ++  FS C D 
Sbjct: 190 APLAQEVVFGCGKNQSGQLGQTES-AVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDN 248

Query: 277 DDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA------- 328
            + G IF  G+      ++T  + +   Y   + G++    G       S  +       
Sbjct: 249 MNGGGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDV--DGEPIDLPPSLASTNGDGGT 306

Query: 329 IVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           I+DSG++  +LP+ +Y    E I A+   +++    +F       C+  +S      P V
Sbjct: 307 IIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF------ACFSFTSNTDKAFPVV 360

Query: 385 KLMF 388
            L F
Sbjct: 361 NLHF 364


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 78/259 (30%), Positives = 114/259 (44%), Gaps = 29/259 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I +G P   + + +D GSDL WI CD  C  CA      Y      +    P    
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPKDL 259

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L  +   C+   +C+     C Y ++Y  + +SS G+L  D +H+I+  GG   L 
Sbjct: 260 LCQELQGNQNYCE---TCKQ----CDYEIEY-ADRSSSMGVLARDDMHIITTNGGREKL- 310

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS+PS LA  G+I N F  C  +D
Sbjct: 311 -----DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRD 365

Query: 278 DS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIV 330
            +  G +F GD        TS    +     +    +    G   L        S + I 
Sbjct: 366 PNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIF 425

Query: 331 DSGSSFTFLPKEVYETIAA 349
           DSGSS+T+LP E+Y+ + A
Sbjct: 426 DSGSSYTYLPDEIYKNLIA 444


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 78/259 (30%), Positives = 114/259 (44%), Gaps = 29/259 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I +G P   + + +D GSDL WI CD  C  CA      Y      +    P    
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPKDL 260

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L  +   C+   +C+     C Y ++Y  + +SS G+L  D +H+I+  GG   L 
Sbjct: 261 LCQELQGNQNYCE---TCKQ----CDYEIEY-ADRSSSMGVLARDDMHIITTNGGREKL- 311

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS+PS LA  G+I N F  C  +D
Sbjct: 312 -----DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRD 366

Query: 278 DS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIV 330
            +  G +F GD        TS    +     +    +    G   L        S + I 
Sbjct: 367 PNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIF 426

Query: 331 DSGSSFTFLPKEVYETIAA 349
           DSGSS+T+LP E+Y+ + A
Sbjct: 427 DSGSSYTYLPDEIYKNLIA 445


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 90/302 (29%), Positives = 128/302 (42%), Gaps = 24/302 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT + +G+P   F V +D GSD+LW+ C  C  C   S      L  DL  Y P+ S 
Sbjct: 71  LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSG-----LGMDLTLYDPNGSK 125

Query: 161 TSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           TS  + C    C    S     C+     CPY++  Y + +++SG  V D L       N
Sbjct: 126 TSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSIT-YGDGSTTSGSFVNDSLTFDEVSGN 183

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                  +SVI GCG KQSG        A DG+IG G    SV S LA +G ++  FS C
Sbjct: 184 LHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHC 243

Query: 274 FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQTSFKA 328
            D    G IF   Q    + +T+ L     +   I     +  E   +        S + 
Sbjct: 244 LDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRG 303

Query: 329 -IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
            I+DSG++  +LP  +Y  +  +   RQ    +   E      C+  S +     P VK 
Sbjct: 304 TIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVE--DQFTCFHYSDKLDEGFPVVKF 361

Query: 387 MF 388
            F
Sbjct: 362 HF 363


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 140/312 (44%), Gaps = 23/312 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+Y  + IGTP+  + V +D GSD++W+ C   R  P ++    SL  +L  Y+   S +
Sbjct: 85  LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTS----SLGMELTLYNIKDSVS 140

Query: 162 SKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
            K + C    C        S       CPY ++ Y + +S++G  V+D++       +  
Sbjct: 141 GKLVPCDEEFCYEVNGGPLSGCTANMSCPY-LEIYGDGSSTAGYFVKDVVQYDRVSGDLQ 199

Query: 218 KNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             S   SVI GCG +QSG  G     A DG++G G    S+ S LA    ++  F+ C D
Sbjct: 200 TTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLD 259

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-------- 327
             + G IF        + + + L  N  +  Y + +    +G   L   + +        
Sbjct: 260 GINGGGIFAIGHVVQPKVNMTPLIPNQPH--YNVNMTAVQVGEDFLHLPTEEFEAGDRKG 317

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           AI+DSG++  +LP+ VYE + ++   Q  D         +  C++ S       P+V   
Sbjct: 318 AIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYT-CFQYSGSVDDGFPNVTFH 376

Query: 388 FPQNNSFVVNNP 399
           F +N+ F+  +P
Sbjct: 377 F-ENSVFLKVHP 387


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 77/280 (27%), Positives = 126/280 (45%), Gaps = 28/280 (10%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+A+ 
Sbjct: 53  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTAN- 101

Query: 161 TSKHLSCSHRLCDLGTSCQ--NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDN 215
             + + C++ LC    S Q  N K P P   DY   YT++ SS G+L+ D   L     N
Sbjct: 102 --RLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN 159

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                ++  +  GCG  Q  G    V  A DG++GLG G +S+ S L + G+ +N    C
Sbjct: 160 -----IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHC 214

Query: 274 FDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDS 332
              +  G +FFGD   P+++ +   +A       Y  G  T       L     + + DS
Sbjct: 215 LSTNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDS 274

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           GS++T+   + Y+ + +     ++ ++          C+K
Sbjct: 275 GSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWK 314


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 86/311 (27%), Positives = 135/311 (43%), Gaps = 26/311 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IGTP  ++ + +D GSD++W+ C  C  C   S     +L  DL  Y    SS
Sbjct: 84  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRS-----NLGMDLTLYDIKESS 138

Query: 161 TSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           + K + C    C      L T C      CPY ++ Y + +S++G  V+DI+       +
Sbjct: 139 SGKFVPCDQEFCKEINGGLLTGC-TANISCPY-LEIYGDGSSTAGYFVKDIVLYDQVSGD 196

Query: 216 ALKNSVQASVIIGCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
              +S   S++ GCG +QSG     +  A  G++G G    S+ S LA +G ++  F+ C
Sbjct: 197 LKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHC 256

Query: 274 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA---- 328
            +  + G IF  G         T  L     Y   +  V+      S    TS +     
Sbjct: 257 LNGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKG 316

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVK 385
            I+DSG++  +LP+ +YE +  +   Q  D    T  + Y    C++ S       P+V 
Sbjct: 317 TIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYT---CFQYSESVDDGFPAVT 373

Query: 386 LMFPQNNSFVV 396
             F    S  V
Sbjct: 374 FYFENGLSLKV 384


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 113/257 (43%), Gaps = 29/257 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I +G P   + + +D GSDL WI CD  C  CA      Y      +    P    
Sbjct: 194 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRDL 250

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L      C    +C+     C Y ++Y  + +SS G+L +D +H+I+  GG   L 
Sbjct: 251 LCQELQGDQNYC---ATCKQ----CDYEIEY-ADRSSSMGVLAKDDMHMIATNGGREKL- 301

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS+PS LA  G+I N F  C  K+
Sbjct: 302 -----DFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKE 356

Query: 278 DS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIV 330
            +  G +F GD        T      G    Y    +    G   L+      +S + I 
Sbjct: 357 PNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIF 416

Query: 331 DSGSSFTFLPKEVYETI 347
           DSGSS+T+LP E+Y+ +
Sbjct: 417 DSGSSYTYLPDEIYKKL 433


>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1336

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 82/271 (30%), Positives = 129/271 (47%), Gaps = 40/271 (14%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L++T + +G P  S+ + +D GSDL W+ CD  C  C   +            +Y P+ S
Sbjct: 193 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHV----------QYKPTRS 242

Query: 160 STSKHLSCSHRLC-DLGTSCQNPKQP-----CPYTMDYYTENTSSSGLLVEDILHLISGG 213
           +    +S    LC D+  + +N         C Y +  Y +++SS G+LV D LHL++  
Sbjct: 243 NV---VSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQ-YADHSSSLGVLVRDELHLVTTN 298

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
            +  K     +V+ GCG  Q G  L+ +A  DG++GL   ++S+P  LA  GLI+N    
Sbjct: 299 GSKTK----LNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGH 354

Query: 273 CFDKDDS--GRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK---Q 323
           C   D +  G +F GD             ++  +   Y T I+G+     G+  LK   Q
Sbjct: 355 CLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGIN---YGNRQLKFDGQ 411

Query: 324 TSF-KAIVDSGSSFTFLPKEVYETIAAEFDR 353
           +   K   DSGSS+T+ PKE Y  + A  + 
Sbjct: 412 SKVGKVFFDSGSSYTYFPKEAYLDLVASLNE 442


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 77/280 (27%), Positives = 126/280 (45%), Gaps = 28/280 (10%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+A+ 
Sbjct: 53  YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTAN- 101

Query: 161 TSKHLSCSHRLCDLGTSCQ--NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDN 215
             + + C++ LC    S Q  N K P P   DY   YT++ SS G+L+ D   L     N
Sbjct: 102 --RLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN 159

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                ++  +  GCG  Q  G    V  A DG++GLG G +S+ S L + G+ +N    C
Sbjct: 160 -----IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHC 214

Query: 274 FDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDS 332
              +  G +FFGD   P+++ +   +A       Y  G  T       L     + + DS
Sbjct: 215 LSTNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDS 274

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           GS++T+   + Y+ + +     ++ ++          C+K
Sbjct: 275 GSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWK 314


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 137/317 (43%), Gaps = 35/317 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+Y  I IGTP   + V +D GSD++W+ C  C  C   S     SL  +L  Y    S 
Sbjct: 97  LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKS-----SLGMELTLYDIKESL 151

Query: 161 TSKHLSCSHRLCDLGTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           T K +SC    C    +   P        C YT + Y + +SS G  V DI+       +
Sbjct: 152 TGKLVSCDQDFC-YAINGGPPSYCIANMSCSYT-EIYADGSSSFGYFVRDIVQYDQVSGD 209

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
               S   SVI GC   QSG      A DG++G G    S+ S LA +G +R  F+ C D
Sbjct: 210 LETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLD 269

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--------FK 327
             + G IF        + +T+ L  N  +  Y + ++   +G   L   +          
Sbjct: 270 GLNGGGIFAIGHIVQPKVNTTPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKG 327

Query: 328 AIVDSGSSFTFLPKEVYETIAAEF-----DRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            I+DSG++  +LP+ VY+ + ++      D +V+     F       C++ S       P
Sbjct: 328 TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT------CFQYSESLDDGFP 381

Query: 383 SVKLMFPQNNSFVVNNP 399
           +V   F +N+ ++  +P
Sbjct: 382 AVTFHF-ENSLYLKVHP 397


>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1388

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 130/271 (47%), Gaps = 40/271 (14%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L++T + +G P  S+ + +D GSDL W+ CD  C+ C   +   Y           P+ S
Sbjct: 191 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYK----------PTRS 240

Query: 160 STSKHLSCSHRLC-DLGTSCQNPKQ-----PCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           +    +S    LC D+  + +N         C Y +  Y +++SS G+LV D LHL++  
Sbjct: 241 NV---VSSVDALCLDVQKNQKNGHHDESLLQCDYEIQ-YADHSSSLGVLVRDELHLVTTN 296

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
            +  K     +V+ GCG  Q+G  L+ +   DG++GL   ++S+P  LA  GLI+N    
Sbjct: 297 GSKTK----LNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGH 352

Query: 273 CFDKDDS--GRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK---Q 323
           C   D +  G +F GD             ++  +   Y T I+G+     G+  L+   Q
Sbjct: 353 CLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGIN---YGNRQLRFDGQ 409

Query: 324 TSF-KAIVDSGSSFTFLPKEVYETIAAEFDR 353
           +   K + DSGSS+T+ PKE Y  + A  + 
Sbjct: 410 SKVGKMVFDSGSSYTYFPKEAYLDLVASLNE 440


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 78/259 (30%), Positives = 113/259 (43%), Gaps = 29/259 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I IG P   + + +D GSDL WI CD  C   A      Y      +    P    
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKI---VPPRDL 243

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNALK 218
             + L  +   C+   +C+     C Y ++Y  + +SS G+L  D +H+I+  GG   L 
Sbjct: 244 LCQELQGNQNYCE---TCKQ----CDYEIEY-ADQSSSMGVLARDDMHMIATNGGREKL- 294

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  + GC   Q G  L   A  DG++GL    IS PS LA  G+I N F  C  ++
Sbjct: 295 -----DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITRE 349

Query: 278 D--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIV 330
               G +F GD        T     +G    Y         G   L++     ++ + I 
Sbjct: 350 QGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIF 409

Query: 331 DSGSSFTFLPKEVYETIAA 349
           DSGSS+T+LP E+YE + A
Sbjct: 410 DSGSSYTYLPNEIYENLVA 428


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 103/384 (26%), Positives = 157/384 (40%), Gaps = 70/384 (18%)

Query: 34  RFSEEVKALGVSKNRNATSWPAKK--SFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSK 91
           + SE ++ L V+K+     W A +  S  +  +  ++DV+            L P  G  
Sbjct: 7   KRSEAIRGL-VAKSHARVRWMAARANSSSWSSMAGTTDVESP----------LHPDGGGY 55

Query: 92  TMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRD 150
            M             I +GTP   F    D GSDL+W+  + C  C+  +          
Sbjct: 56  VMD------------ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGTI--------- 94

Query: 151 LNEYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
              + P  SST + + CS +LC +L  SC+     C Y+ +Y +  T   G    D + L
Sbjct: 95  ---FDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYSYEYGSGET--EGEFARDTISL 149

Query: 210 --ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
              SGG          S  +GCGM  SG   DGV  DGL+GLG G +S+ S L+ A  I 
Sbjct: 150 GTTSGGSQKFP-----SFAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--ID 198

Query: 268 NSFSMCF----DKDDSGRIFFGDQGP---ATQQSTSFLASNGKYIT-YIIGVETCCIGSS 319
           + FS C      + +S  + FG          QST     +  Y T Y++ V    +   
Sbjct: 199 SKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQ 258

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
            +       I+DSG++ T++P  VY  + +  +  V              CY  SS R  
Sbjct: 259 TMGSPG-TTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNY 317

Query: 380 KLPSVKLMF-------PQNNSFVV 396
           K P++ +         P +N F+V
Sbjct: 318 KFPALTIRLAGATMTPPSSNYFLV 341


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 133/301 (44%), Gaps = 37/301 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +GTP  ++ + +D GSDLLW+ C  C+ C   S      L   +  Y   AS+
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASA 89

Query: 161 TSKHLSCSHRLCDLGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +S  + CS   C L T       N +  C Y+   Y + + + G LVED+LH +      
Sbjct: 90  SSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQ-YGDGSGTLGYLVEDVLHYMV----- 143

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
              +  A+VI GCG KQSG       A DG+IG G  ++S  S LAK G   N F+ C D
Sbjct: 144 ---NATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLD 200

Query: 276 KDDSGR--IFFGDQGPATQQSTSFLASNGKYITYIIGVET----CCIGSSCLKQTSFKA- 328
             + G   +  G+      Q T  +     Y   +  +        I          +  
Sbjct: 201 GGERGGGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGT 260

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVKLM 387
           I DSG++  +LP E Y+     F + V+  +      P+  C    S+ + KL P+V L 
Sbjct: 261 IFDSGTTLAYLPDEAYQA----FTQAVSLVVA-----PFLLCDTRLSRFIYKLFPNVVLY 311

Query: 388 F 388
           F
Sbjct: 312 F 312


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/304 (28%), Positives = 136/304 (44%), Gaps = 24/304 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I +G PN  + V +D GSD LW+ C  C  C   S      L  +L  Y P++S 
Sbjct: 76  LYYTKIGLG-PN-DYYVQVDTGSDTLWVNCVGCTTCPKKSG-----LGMELTLYDPNSSK 128

Query: 161 TSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDN 215
           TSK + C    C    D   S       CPY++ Y   +T+S   + +D+    + G   
Sbjct: 129 TSKVVPCDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLR 188

Query: 216 ALKNSVQASVIIGCGMKQSGGY--LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
            + ++   SVI GCG KQSG        + DG+IG G    SV S LA AG ++  FS C
Sbjct: 189 TVPDN--TSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHC 246

Query: 274 FDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCC----IGSSCLKQTSFKA 328
            D  + G IF  G+      ++T  +     Y   +  +E       + +     TS + 
Sbjct: 247 LDTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGRG 306

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK-LPSVKL 386
            I+DSG++  +LP  +Y+ +  +   Q +          + C + S  + L    P+VK 
Sbjct: 307 TIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKF 366

Query: 387 MFPQ 390
            F +
Sbjct: 367 TFEE 370


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/380 (25%), Positives = 162/380 (42%), Gaps = 46/380 (12%)

Query: 52  SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQML-FPSQGSKTMSLGNDFGWLHYTWIDIG 110
           + P  +SFE  Q+     ++  ++  G    ++ F  QGS    L      L++T + +G
Sbjct: 33  ALPLNQSFELAQLRARDHLRHARLLQGFVGGVVDFSVQGSSDPYLVG----LYFTRVKLG 88

Query: 111 TPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           TP   F V +D GSD+LW+ C  C  C   S      L   LN +  ++SST++ + CSH
Sbjct: 89  TPPREFNVQIDTGSDVLWVTCSSCSNCPQTSG-----LGIQLNYFDTTSSSTARLVPCSH 143

Query: 170 RLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            +C        T C      C Y   Y  + + +SG  V D  +  +    +L  +  A+
Sbjct: 144 PICTSQIQTTATQCPPQSNQCSYAFQY-GDGSGTSGYYVSDTFYFDAVLGESLIANSSAA 202

Query: 225 VIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR-- 281
           ++ GC   QSG       A DG+ G G GE+SV S L+  G+    FS C   +DSG   
Sbjct: 203 IVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGGGI 262

Query: 282 IFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
           +  G+               P        +A +G+    ++ ++     +S  + T    
Sbjct: 263 LVLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQ----LLPIDPAAFATSSNRGT---- 314

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
           I+D+G++  +L +E Y+   +     V+   T T  +G     CY  S+      P V  
Sbjct: 315 IIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKG---NQCYLVSNSVSEVFPPVSF 371

Query: 387 MFPQNNSFVVNNPVFVIYGT 406
            F    + ++    +++Y T
Sbjct: 372 NFAGGATMLLKPEEYLMYLT 391


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 158/357 (44%), Gaps = 53/357 (14%)

Query: 3   RISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYY 62
           R  L I +AVF ++ E +      F  K+ H+F+         K +    + +  +  + 
Sbjct: 4   RRKLCIVVAVFVIVNEFASGN---FVFKVQHKFA--------GKEKKLEHFKSHDTRRHS 52

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQG-SKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           ++L S D+               P  G S+  S+G     L++T I +G+P   + V +D
Sbjct: 53  RMLASIDL---------------PLGGDSRVDSVG-----LYFTKIKLGSPPKEYHVQVD 92

Query: 122 AGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTS 177
            GSD+LW+ C  C  C   +     +L+  L+ +  +ASSTSK + C    C       S
Sbjct: 93  TGSDILWVNCKPCPECPSKT-----NLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDS 147

Query: 178 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-- 235
           CQ P   C Y + Y  E+T S G  + D L L     +     +   V+ GCG  QSG  
Sbjct: 148 CQ-PAVGCSYHIVYADEST-SEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQL 205

Query: 236 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQS 294
           G  D  A DG++G G    SV S LA  G  +  FS C D    G IF  G       ++
Sbjct: 206 GKSDS-AVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKT 264

Query: 295 TSFLASNGKYITYIIGVET----CCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI 347
           T  + +   Y   ++G++       +  S ++      IVDSG++  + PK +Y+++
Sbjct: 265 TPMVPNQMHYNVMLMGMDVDGTALDLPPSIMRNGG--TIVDSGTTLAYFPKVLYDSL 319


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/305 (28%), Positives = 137/305 (44%), Gaps = 45/305 (14%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +GTP  ++ + +D GSDLLW+ C  C+ C   S      L   +  Y   AS+
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFS-----DLKIPIVPYDVKASA 89

Query: 161 TSKHLSCSHRLCDLGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +S  + CS   C L T       N +  C Y+   Y + + + G LVED+LH +      
Sbjct: 90  SSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQ-YGDGSGTLGYLVEDVLHYMV----- 143

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
              +  A+VI GCG KQSG       A DG+IG G  ++S  S LAK G   N F+ C D
Sbjct: 144 ---NATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLD 200

Query: 276 KDDSGR--IFFGDQGPATQQSTSFLASNGKYITYI---------IGVETCCIGSSCLKQT 324
             + G   +  G+      Q T  +     Y   +         + ++     +  ++ T
Sbjct: 201 GGERGGGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGT 260

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PS 383
            F    DSG++  +LP E Y+     F + V+  +      P+  C    S+ + KL P+
Sbjct: 261 IF----DSGTTLAYLPDEAYQA----FTQAVSLVVA-----PFLLCDTRLSRFIYKLFPN 307

Query: 384 VKLMF 388
           V L F
Sbjct: 308 VVLYF 312


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 37/285 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+ + 
Sbjct: 57  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC--------NKVPHPL--YRPTKN- 105

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTM--DY---YTENTSSSGLLVEDILHLISGGDN 215
             K + C++ +C    S  +P + C      DY   YT+  SS G+LV D   L      
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSL------ 157

Query: 216 ALKN--SVQASVIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSF 270
            L+N  +V+ S+  GCG  Q  G  +G AP   DGL+GLG G +S+ S L + G+ +N  
Sbjct: 158 PLRNKSNVRPSLSFGCGYDQQVGK-NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVL 216

Query: 271 SMCFDKDDSGRIFFGDQGPATQQST---SFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
             C      G +FFGD    T + T      +++G Y  Y  G  T       L     +
Sbjct: 217 GHCLSTSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNY--YSPGSATLYFDRRSLSTKPME 274

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
            + DSGS++T+   + Y+   +     ++ ++          C+K
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWK 319


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 144/313 (46%), Gaps = 37/313 (11%)

Query: 58  SFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTM-SLGNDFGWLHYTWIDIGTPNVSF 116
           S ++Y  L   D Q++  +  P+  + FP  G   + ++G     L+YT I +GTP   F
Sbjct: 2   SLDHYHTLRKHD-QRRLRRMLPEV-VSFPISGDNDIFAMG-----LYYTRISLGTPPQQF 54

Query: 117 LVALDAGSDLLWIPCDCVRCAPLSA-SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL- 174
            V +D GS++ W     V+CAP +   +   +   ++ + P  S+T   +SC+   C + 
Sbjct: 55  YVDVDTGSNVAW-----VKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVL 109

Query: 175 --GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQASVIIGCGM 231
                C   +  CPY++  Y + +S++G  + D+        DN+   S  A ++ GCG 
Sbjct: 110 NKKLQCSPERLSCPYSL-LYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGG 168

Query: 232 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGP 289
            Q+G +    + DGL+G G   +S+P+ LA+  +  N F+ C   D SGR  +  G    
Sbjct: 169 TQTGSW----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIRE 224

Query: 290 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGSSFTFLPKEV 343
                T  +     Y   ++ +     G +     SF        I+DSG++ T+L +  
Sbjct: 225 PDLVYTPMVFGEDHYNVQLLNIGIS--GRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPA 282

Query: 344 YETIAAEFDRQVN 356
           Y+    EF R V+
Sbjct: 283 YD----EFRRGVS 291


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 136/304 (44%), Gaps = 32/304 (10%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +G+P   + V +D GSD+LW+ C  C +C P+       L   L+ Y    SS
Sbjct: 77  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKT----DLGIPLSLYDSKTSS 131

Query: 161 TSKHLSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
           TSK++ C    C      +    K+PC Y +  Y + ++S G  ++D + L     N   
Sbjct: 132 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHV-VYGDGSTSDGDFIKDNITLEQVTGNLRT 190

Query: 219 NSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
             +   V+ GCG  QSG  G  D  A DG++G G    S+ S LA  G  +  FS C D 
Sbjct: 191 APLAQEVVFGCGKNQSGQLGQTDS-AVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 249

Query: 277 DDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA------- 328
            + G IF  G+      ++T  + +   Y   + G++    G       S  +       
Sbjct: 250 MNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDV--DGDPIDLPPSLASTNGDGGT 307

Query: 329 IVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           I+DSG++  +LP+ +Y    E I A+   +++    +F       C+  +S      P V
Sbjct: 308 IIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF------ACFSFTSNTDKAFPVV 361

Query: 385 KLMF 388
            L F
Sbjct: 362 NLHF 365


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 37/285 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+ + 
Sbjct: 57  YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC--------NKVPHPL--YRPTKN- 105

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTM--DY---YTENTSSSGLLVEDILHLISGGDN 215
             K + C++ +C    S  +P + C      DY   YT+  SS G+LV D   L      
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSL------ 157

Query: 216 ALKN--SVQASVIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSF 270
            L+N  +V+ S+  GCG  Q  G  +G AP   DGL+GLG G +S+ S L + G+ +N  
Sbjct: 158 PLRNKSNVRPSLSFGCGYDQQVGK-NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVL 216

Query: 271 SMCFDKDDSGRIFFGDQGPATQQST---SFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
             C      G +FFGD    T + T      +++G Y  Y  G  T       L     +
Sbjct: 217 GHCLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNY--YSPGSATLYFDRRSLSTKPME 274

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
            + DSGS++T+   + Y+   +     ++ ++          C+K
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWK 319


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 143/335 (42%), Gaps = 26/335 (7%)

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
           V ++++  G    + FP +GS    +      L++T + +G P   F V +D GSD+LW+
Sbjct: 62  VSRRRLLGGVAGVVDFPVEGSANPYMVG----LYFTRVKLGNPAKEFFVQIDTGSDILWV 117

Query: 130 PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-----TSCQNPK-- 182
            C      P S+     L+  L  ++P +SST+  ++CS   C  G       CQ     
Sbjct: 118 TCSPCTGCPTSS----GLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQ 173

Query: 183 -QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG- 240
             PC YT   Y + + +SG  V D +   +   N    +  AS++ GC   QSG      
Sbjct: 174 SSPCGYTFT-YGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKAD 232

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFL 298
            A DG+ G G  ++SV S L   G+    FS C    D+G   +  G+        T  +
Sbjct: 233 RAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLV 292

Query: 299 ASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDR 353
            S   Y     +  +  +   I SS    ++ +  IVDSG++  +L    Y+   +    
Sbjct: 293 PSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAA 352

Query: 354 QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            V+ ++ S      +C   SSS      P+V L F
Sbjct: 353 AVSPSVRSLVSKGSQCFITSSSVD-SSFPTVTLYF 386


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 129/291 (44%), Gaps = 36/291 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+ T + +GTP   F V +D GSD+LWI C+     P S+     L  +LN +    SST
Sbjct: 83  LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSS----GLGIELNFFDTVGSST 138

Query: 162 SKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH--LISGGD 214
           +  + CS  +C          C      C YT   Y + + +SG+ V D ++  +I G  
Sbjct: 139 AALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQ-YEDGSGTSGVYVSDAMYFDMILGQS 197

Query: 215 NALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                +  A+++ GC   QSG       A DG++G G GE+SV S L+  G+    FS C
Sbjct: 198 TPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHC 257

Query: 274 F--DKDDSGRIFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
              D +  G +  G+               P    +   +A NG+    ++ +      +
Sbjct: 258 LKGDGNGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQ----VLSINPAVFAT 313

Query: 319 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
           S  + T    I+DSG++ ++L +E Y+ +    D  V+   TSF     +C
Sbjct: 314 SDKRGT----IIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQC 360


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 143/335 (42%), Gaps = 26/335 (7%)

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
           V ++++  G    + FP +GS    +      L++T + +G P   F V +D GSD+LW+
Sbjct: 60  VSRRRLLGGVAGVVDFPVEGSANPYMVG----LYFTRVKLGNPAKEFFVQIDTGSDILWV 115

Query: 130 PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-----TSCQNPK-- 182
            C      P S+     L+  L  ++P +SST+  ++CS   C  G       CQ     
Sbjct: 116 TCSPCTGCPTSS----GLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQ 171

Query: 183 -QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG- 240
             PC YT   Y + + +SG  V D +   +   N    +  AS++ GC   QSG      
Sbjct: 172 SSPCGYTFT-YGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKAD 230

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFL 298
            A DG+ G G  ++SV S L   G+    FS C    D+G   +  G+        T  +
Sbjct: 231 RAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLV 290

Query: 299 ASNGKY----ITYIIGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDR 353
            S   Y     +  +  +   I SS    ++ +  IVDSG++  +L    Y+   +    
Sbjct: 291 PSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAA 350

Query: 354 QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            V+ ++ S      +C   SSS      P+V L F
Sbjct: 351 AVSPSVRSLVSKGSQCFITSSSVD-SSFPTVTLYF 384


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 136/304 (44%), Gaps = 32/304 (10%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +G+P   + V +D GSD+LW+ C  C +C P+       L   L+ Y    SS
Sbjct: 73  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVKT----DLGIPLSLYDSKTSS 127

Query: 161 TSKHLSCSHRLCDLGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
           TSK++ C    C      +    K+PC Y +  Y + ++S G  ++D + L     N   
Sbjct: 128 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHV-VYGDGSTSDGDFIKDNITLEQVTGNLRT 186

Query: 219 NSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
             +   V+ GCG  QSG  G  D  A DG++G G    S+ S LA  G  +  FS C D 
Sbjct: 187 APLAQEVVFGCGKNQSGQLGQTDS-AVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 245

Query: 277 DDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA------- 328
            + G IF  G+      ++T  + +   Y   + G++    G       S  +       
Sbjct: 246 MNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDV--DGDPIDLPPSLASTNGDGGT 303

Query: 329 IVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           I+DSG++  +LP+ +Y    E I A+   +++    +F       C+  +S      P V
Sbjct: 304 IIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF------ACFSFTSNTDKAFPVV 357

Query: 385 KLMF 388
            L F
Sbjct: 358 NLHF 361


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 122/255 (47%), Gaps = 29/255 (11%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           YT + +GTP  +F V +D GS + +IPC DC  C   +A +++          P  S+T+
Sbjct: 14  YTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFD----------PDKSTTA 63

Query: 163 KHLSCSHRLCDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           K L+C   LC+ GT SC      C Y+   Y E +SS G ++ED        D+ ++   
Sbjct: 64  KKLACGDPLCNCGTPSCTCNNDRCYYSRT-YAERSSSEGWMIEDTFGF-PDSDSPVR--- 118

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
              ++ GC   ++G     +A DG++G+G    +  S L +  +I + FS+CF     G 
Sbjct: 119 ---LVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGI 174

Query: 282 IFFGDQGPATQQSTSF--LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSG 333
           +  GD       +T +  L ++     Y + ++   +    L          +  ++DSG
Sbjct: 175 LLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSG 234

Query: 334 SSFTFLPKEVYETIA 348
           ++FT+LP + ++ +A
Sbjct: 235 TTFTYLPTDAFKAMA 249


>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
          Length = 290

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 110/235 (46%), Gaps = 15/235 (6%)

Query: 52  SWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGT 111
           ++P+    E  ++     ++ ++M     + + FP +G+   S       L+YT + +GT
Sbjct: 30  AFPSNDGVELSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVG----LYYTKVKLGT 85

Query: 112 PNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL 171
           P     V +D GSD+LW+ C      P ++     L   LN + P +SSTS  +SC  R 
Sbjct: 86  PPRELYVQIDTGSDVLWVSCGSCNGCPQTS----GLQIQLNYFDPGSSSTSSLISCLDRR 141

Query: 172 CDLGT-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C  G      SC      C YT   Y + + +SG  V D++H  S  +  L  +  ASV+
Sbjct: 142 CRSGVQTSDASCSGRNNQCTYTFQ-YGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVV 200

Query: 227 IGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
            GC + Q+G       A DG+ G G   +SV S L+  G+    FS C   D+SG
Sbjct: 201 FGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSG 255


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 109/255 (42%), Gaps = 27/255 (10%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           ++IG P   + + +D GSDL W+ CD  C RC+      Y    R  N++ P        
Sbjct: 81  LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLY----RPSNDFVP-------- 128

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSV 221
             C H LC       N     P+  DY   Y ++ SS G+L+ D+  L         N V
Sbjct: 129 --CRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL------NFTNGV 180

Query: 222 QASV--IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q  V   +GCG  Q          DG++GLG G+ S+ S L   GL+RN    C      
Sbjct: 181 QLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGG 240

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 339
           G IFFGD   +++ + + ++S         G      G       S  A+ D+GSS+T+ 
Sbjct: 241 GYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSYTYF 300

Query: 340 PKEVYETIAAEFDRQ 354
               Y+ + +   ++
Sbjct: 301 NPYAYQALISWLGKE 315


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 137/310 (44%), Gaps = 44/310 (14%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +G+P   F V +D GSD+LW+ C+ C  C   S      L   LN +  S+SS
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSG-----LGIQLNFFDSSSSS 119

Query: 161 TSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           T+  + CS  +C        T C +    C YT   Y + + +SG  V D L+  +    
Sbjct: 120 TAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQ-YGDGSGTSGYYVSDTLYFDAILGQ 178

Query: 216 ALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
           +L ++  A ++ GC   QSG       A DG+ G G GE+SV S L+  G+    FS C 
Sbjct: 179 SLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL 238

Query: 275 DKDDSGR--IFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
             D SG   +  G+               P    +   +A NG+    ++ ++     +S
Sbjct: 239 KGDGSGGGILVLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQ----LLPIDPAAFATS 294

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             + T    IVDSG++  +L  E Y+   +  +  V+ ++T       +C   S+     
Sbjct: 295 NSQGT----IVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQCYLVST----- 345

Query: 380 KLPSVKLMFP 389
              SV  MFP
Sbjct: 346 ---SVSQMFP 352


>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 395

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 110/257 (42%), Gaps = 25/257 (9%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT I+IG P   + + +D GSD  WI CD  C  C       Y   +  +         
Sbjct: 16  YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVH---PRDP 72

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
             + L  +   C+   +C+     C Y +  Y + +SS G+L  D + L +  D  +KN 
Sbjct: 73  LCEELQGNQNYCE---TCKQ----CDYEIT-YADRSSSKGVLARDNMQLTT-ADGEMKN- 122

Query: 221 VQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
                + GC   Q G  LD   + DG++GL  G IS+ + LA +G+I N F  C   D S
Sbjct: 123 --VDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPS 180

Query: 280 --GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS-----FKAIVDS 332
             G +F GD        T     NG    Y   V     G+  L          + I DS
Sbjct: 181 SGGYMFLGDDYVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDS 240

Query: 333 GSSFTFLPKEVYETIAA 349
           GSS+T+ P E+Y  + A
Sbjct: 241 GSSYTYFPHEIYTNLIA 257


>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
          Length = 538

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 108/260 (41%), Gaps = 31/260 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT + IG P   + + +D GSDL WI CD  C  CA      Y     ++    P   S
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNV---VPPRDS 215

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNAL 217
             + L  +    D    C           DY   Y + +SS G+L  D + LI+  D   
Sbjct: 216 YCQELQGNQNYGDTSKQC-----------DYEITYADRSSSMGILARDNMQLITA-DGER 263

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
           +N      + GCG  Q G  L   A  DG++GL    IS+P+ LA  G+I N F  C   
Sbjct: 264 EN---LDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAA 320

Query: 277 DDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS-----FKAI 329
           D S  G +F GD        T     NG    Y   V+    G   L          + I
Sbjct: 321 DPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVI 380

Query: 330 VDSGSSFTFLPKEVYETIAA 349
            DSGSS+T+LP + Y  + A
Sbjct: 381 FDSGSSYTYLPHDDYTNLIA 400


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 136/321 (42%), Gaps = 19/321 (5%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT I +G+P   F V +D GSD+LW+ C      P ++     L   LN + P +S T
Sbjct: 80  LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTS----GLQIQLNFFDPGSSVT 135

Query: 162 SKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +  +SCS + C  G     + C      C YT   Y + + +SG  V D+L       ++
Sbjct: 136 ATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQ-YGDGSGTSGFYVSDVLQFDMIVGSS 194

Query: 217 LKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           L  +  A V+ GC   Q+G  +    A DG+ G G   +SV S LA  GL    FS C  
Sbjct: 195 LVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLK 254

Query: 276 KDDSGR--IFFGDQGPATQQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA- 328
            ++ G   +  G+        T  + S   Y   ++ +    +   I  S    ++ +  
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGT 314

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           I+D+G++  +L +  Y          V+ ++        + CY  ++      P V L F
Sbjct: 315 IIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-CYVIATSVADIFPPVSLNF 373

Query: 389 PQNNSFVVNNPVFVIYGTQVG 409
               S  +N   ++I    VG
Sbjct: 374 AGGASMFLNPQDYLIQQNNVG 394


>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
 gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 538

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 108/260 (41%), Gaps = 31/260 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +YT + IG P   + + +D GSDL WI CD  C  CA      Y     ++    P   S
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNV---VPPRDS 215

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNAL 217
             + L  +    D    C           DY   Y + +SS G+L  D + LI+  D   
Sbjct: 216 YCQELQGNQNYGDTSKQC-----------DYEITYADRSSSMGILARDNMQLITA-DGER 263

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
           +N      + GCG  Q G  L   A  DG++GL    IS+P+ LA  G+I N F  C   
Sbjct: 264 EN---LDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAA 320

Query: 277 DDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS-----FKAI 329
           D S  G +F GD        T     NG    Y   V+    G   L          + I
Sbjct: 321 DPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVI 380

Query: 330 VDSGSSFTFLPKEVYETIAA 349
            DSGSS+T+LP + Y  + A
Sbjct: 381 FDSGSSYTYLPHDDYTNLIA 400


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/276 (26%), Positives = 118/276 (42%), Gaps = 18/276 (6%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  ++IG P   + + +D GSDL W+ CD  C  C  +    Y       N+  P   
Sbjct: 65  LYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTK---NKLVPCVD 121

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNAL 217
                L   H   +    C +P + C Y + Y  +  SS+G+LV D   L L +G     
Sbjct: 122 QLCASL---HNGLNRKHKCDSPYEQCDYVIKY-ADQGSSTGVLVNDSFALRLANG----- 172

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
            + V+ S+  GCG  Q     +    DG++GLG G +S+ S   + G+ +N    C    
Sbjct: 173 -SVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCLSLR 231

Query: 278 DSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSF 336
             G +FFGD     Q+ T + +  +     Y  G  +   G   L+    + + DSGSSF
Sbjct: 232 GGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDSGSSF 291

Query: 337 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
           T+   + Y+ +       ++ T+          C+K
Sbjct: 292 TYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWK 327


>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/254 (29%), Positives = 116/254 (45%), Gaps = 27/254 (10%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS--TSKH 164
           +G P   + +  D GSDL W+ CD  C +C       Y    +  N+  P       S H
Sbjct: 63  VGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLY----QPSNDLVPCKDPLCMSLH 118

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQA 223
            S  HR       C+NP Q C Y ++Y  +  SS G+LV D+  L ++ GD      ++ 
Sbjct: 119 SSMDHR-------CENPDQ-CDYEVEY-ADGGSSLGVLVRDVFPLNLTNGD-----PIRP 164

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
            + +GCG  Q  G       DG++GLG G +S+ S L   G++RN    CF+    G +F
Sbjct: 165 RLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLF 224

Query: 284 FGD--QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 341
           FGD    P     T       K+ +   G E    G S   +  F  + DSGSS+T+   
Sbjct: 225 FGDGIYDPYRLVWTPMSRDYPKHYSPGFG-ELIFNGRSTGLRNLF-VVFDSGSSYTYFNA 282

Query: 342 EVYETIAAEFDRQV 355
           + Y+ + +  +R++
Sbjct: 283 QAYQVLTSLLNREL 296


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 132/304 (43%), Gaps = 24/304 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +G P   F V +D GSD+LW+ C  C  C P S+     L+  L  ++P +SS
Sbjct: 4   LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC-PTSS----GLNIQLESFNPDSSS 58

Query: 161 TSKHLSCSHRLCDLG-----TSCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           T+  ++CS   C  G       CQ       PC YT   Y + + +SG  V D +   + 
Sbjct: 59  TASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFT-YGDGSGTSGYYVSDTMFFETV 117

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
             N    +  AS++ GC   QSG       A DG+ G G  ++SV S L   G+    FS
Sbjct: 118 MGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 177

Query: 272 MCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTS 325
            C    D+G   +  G+        T  + S   Y     +  +  +   I SS    ++
Sbjct: 178 HCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSN 237

Query: 326 FKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            +  IVDSG++  +L    Y+   +     V+ ++ S      +C   SSS      P+V
Sbjct: 238 TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD-SSFPTV 296

Query: 385 KLMF 388
            L F
Sbjct: 297 TLYF 300


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 137/319 (42%), Gaps = 38/319 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T I +GTP   + V +D GSD+LW+ C  C +C   S      L  DL  Y P ASS
Sbjct: 86  LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSG-----LGLDLTFYDPKASS 140

Query: 161 TSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +   +SC    C      + P      PC Y++  Y + +S++G  + D L       + 
Sbjct: 141 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSV-MYGDGSSTTGFFITDALQFDQVTGDG 199

Query: 217 LKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                 A++  GCG +Q G   +   A DG++G G    S+ S LA AG  +  F+ C D
Sbjct: 200 QTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLD 259

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNG--------------KYITYIIGVETCCIGSSCL 321
               G IF        +    F  ++G                  Y + +++  +G + L
Sbjct: 260 TIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTL 319

Query: 322 K------QTSFK--AIVDSGSSFTFLPKEVYETIA-AEFDRQVNDTITSFEGYPWKCCYK 372
           +      +T  K   I+DSG++ T+LP+ V++ +    F +  +    + + +    C++
Sbjct: 320 QLPAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF---LCFQ 376

Query: 373 SSSQRLPKLPSVKLMFPQN 391
            S       P++   F  +
Sbjct: 377 YSGSVDDGFPTITFHFEDD 395


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 141/315 (44%), Gaps = 23/315 (7%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 156
           N F  L++T + +G P   F V +D GSD+LW+ C      P S+     L  +LN +  
Sbjct: 78  NPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSG----LGIELNLFDT 133

Query: 157 SASSTSKHLSCSHRLCDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-IS 211
           + SS+++ L C+  +C   ++    C      C Y+  +Y + + +SG  V D +H  I 
Sbjct: 134 TKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSF-HYRDRSGTSGFYVTDSMHFDIL 192

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSF 270
            G++ + NS  A+++ GC + Q G       A DG+ G G GE SV S L+  G+    F
Sbjct: 193 LGESTIANS-SATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVF 251

Query: 271 SMCFD--KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-- 326
           S C    ++  G +  G+    +   +  + S   Y   +  +     G      T F  
Sbjct: 252 SHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALS--GQLFPNPTMFPI 309

Query: 327 ----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
               + I+DSG++  +L +EVY+ I +     V+ + T       + C++ S       P
Sbjct: 310 SNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIFP 368

Query: 383 SVKLMFPQNNSFVVN 397
            ++  F    S VV 
Sbjct: 369 VLRFNFEGIASMVVT 383


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/274 (27%), Positives = 122/274 (44%), Gaps = 28/274 (10%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           IG P   + + +D GSDL W+ CD  C  C        N +   L  Y P+A+   + + 
Sbjct: 1   IGNPAKPYFLDVDTGSDLTWLQCDAPCRSC--------NKVPHPL--YRPTAN---RLVP 47

Query: 167 CSHRLCDLGTSCQ--NPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C++ LC    S Q  N K P P   DY   YT++ SS G+L+ D   L     N     +
Sbjct: 48  CANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN-----I 102

Query: 222 QASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           +  +  GCG  Q  G    V  A DG++GLG G +S+ S L + G+ +N    C   +  
Sbjct: 103 RPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGG 162

Query: 280 GRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTF 338
           G +FFGD   P+++ +   +A       Y  G  T       L     + + DSGS++T+
Sbjct: 163 GFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTY 222

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
              + Y+ + +     ++ ++          C+K
Sbjct: 223 FTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWK 256


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 141/315 (44%), Gaps = 23/315 (7%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 156
           N F  L++T + +G P   F V +D GSD+LW+ C      P S+     L  +LN +  
Sbjct: 78  NPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSG----LGIELNLFDT 133

Query: 157 SASSTSKHLSCSHRLCDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-IS 211
           + SS+++ L C+  +C   ++    C      C Y+  +Y + + +SG  V D +H  I 
Sbjct: 134 TKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSF-HYRDRSGTSGFYVTDSMHFDIL 192

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSF 270
            G++ + NS  A+++ GC + Q G       A DG+ G G GE SV S L+  G+    F
Sbjct: 193 LGESTIANS-SATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVF 251

Query: 271 SMCFD--KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-- 326
           S C    ++  G +  G+    +   +  + S   Y   +  +     G      T F  
Sbjct: 252 SHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALS--GQLFPNPTMFPI 309

Query: 327 ----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
               + I+DSG++  +L +EVY+ I +     V+ + T       + C++ S       P
Sbjct: 310 SNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADIFP 368

Query: 383 SVKLMFPQNNSFVVN 397
            ++  F    S VV 
Sbjct: 369 VLRFNFEGIASMVVT 383


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 152/370 (41%), Gaps = 25/370 (6%)

Query: 54  PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS-KTMSLGNDFGWLHYTWIDIGTP 112
           PA    E  Q+    + +  ++       + FP  G+     +G     L+YT + +GTP
Sbjct: 36  PANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVG-----LYYTKLRLGTP 90

Query: 113 NVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
              F V +D GSD+LW+ C      P ++     L   LN + P +S T+  +SCS + C
Sbjct: 91  PRDFYVQVDTGSDVLWVSCASCNGCPQTS----GLQIQLNFFDPGSSVTASPISCSDQRC 146

Query: 173 DLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
             G     + C      C YT   Y + + +SG  V D+L       ++L  +  A V+ 
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQ-YGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVF 205

Query: 228 GCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFF 284
           GC   Q+G  +    A DG+ G G   +SV S LA  G+    FS C   ++ G   +  
Sbjct: 206 GCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVL 265

Query: 285 GDQGPATQQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFL 339
           G+        T  + S   Y   ++ +    +   I  S    ++ +  I+D+G++  +L
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325

Query: 340 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 399
            +  Y          V+ ++        + CY  ++      P V L F    S  +N  
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKGNQ-CYVITTSVGDIFPPVSLNFAGGASMFLNPQ 384

Query: 400 VFVIYGTQVG 409
            ++I    VG
Sbjct: 385 DYLIQQNNVG 394


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 92/320 (28%), Positives = 141/320 (44%), Gaps = 28/320 (8%)

Query: 85  FPSQGSKTMSLGNDFG-WLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSAS 142
           FP +GS      N F   L++T + +G+P   + V +D GSD+LW+ C  C  C   S  
Sbjct: 77  FPVEGS-----ANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSG- 130

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS---CQ-NPKQPCPYTMDYYTENT 196
               L+  L  ++P  SSTS  + CS   C   L TS   CQ +   PC YT   Y + +
Sbjct: 131 ----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFT-YGDGS 185

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEIS 255
            +SG  V D ++  S   N    +  AS++ GC   QSG       A DG+ G G  ++S
Sbjct: 186 GTSGYYVSDTMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLS 245

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYII 309
           V S L   G+    FS C    D+G   +  G+        T  + S   Y     + ++
Sbjct: 246 VVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVV 305

Query: 310 GVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 368
             +   I SS    ++ +  IVDSG++  +L    Y+         V+ ++ S      +
Sbjct: 306 NGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ 365

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            C+ +SS      P+V L F
Sbjct: 366 -CFVTSSSVDSSFPTVSLYF 384


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 152/370 (41%), Gaps = 25/370 (6%)

Query: 54  PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS-KTMSLGNDFGWLHYTWIDIGTP 112
           PA    E  Q+    + +  ++       + FP  G+     +G     L+YT + +GTP
Sbjct: 36  PANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVG-----LYYTKLRLGTP 90

Query: 113 NVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
              F V +D GSD+LW+ C      P ++     L   LN + P +S T+  +SCS + C
Sbjct: 91  PRDFYVQVDTGSDVLWVSCASCNGCPQTS----GLQIQLNFFDPGSSVTASPISCSDQRC 146

Query: 173 DLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
             G     + C      C YT   Y + + +SG  V D+L       ++L  +  A V+ 
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQ-YGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVF 205

Query: 228 GCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFF 284
           GC   Q+G  +    A DG+ G G   +SV S LA  G+    FS C   ++ G   +  
Sbjct: 206 GCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVL 265

Query: 285 GDQGPATQQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFL 339
           G+        T  + S   Y   ++ +    +   I  S    ++ +  I+D+G++  +L
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325

Query: 340 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 399
            +  Y          V+ ++        + CY  ++      P V L F    S  +N  
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKGNQ-CYVITTSVGDIFPPVSLNFAGGASMFLNPQ 384

Query: 400 VFVIYGTQVG 409
            ++I    VG
Sbjct: 385 DYLIQQNNVG 394


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 90/342 (26%), Positives = 149/342 (43%), Gaps = 22/342 (6%)

Query: 85  FPSQGSKTMSL-GNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           FP QGS    L G+    L++T + +G+P   F V +D GSD+LW+ C      P S+  
Sbjct: 86  FPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSS-- 143

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSS 198
              L  DL+ +    S T+  ++CS  +C          C    Q C Y+   Y + + +
Sbjct: 144 --GLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQ-CGYSFR-YGDGSGT 199

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVP 257
           SG  + D  +  +    +L  +  A ++ GC   QSG       A DG+ G G G++SV 
Sbjct: 200 SGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVV 259

Query: 258 SLLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQQSTSFLASNGKYITYIIGV---- 311
           S L+  G+    FS C   D SG   F  G+        +  + S   Y   ++ +    
Sbjct: 260 SQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNG 319

Query: 312 ETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC 370
           +   + ++  + ++ +  IVD+G++ T+L KE Y+         V+  +T       + C
Sbjct: 320 QMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-C 378

Query: 371 YKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI-YGTQVGVS 411
           Y  S+      PSV L F    S ++    ++  YG   G S
Sbjct: 379 YLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGAS 420


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/320 (28%), Positives = 141/320 (44%), Gaps = 28/320 (8%)

Query: 85  FPSQGSKTMSLGNDFG-WLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSAS 142
           FP +GS      N F   L++T + +G+P   + V +D GSD+LW+ C  C  C   S  
Sbjct: 77  FPVEGS-----ANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSG- 130

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS---CQ-NPKQPCPYTMDYYTENT 196
               L+  L  ++P  SSTS  + CS   C   L TS   CQ +   PC YT   Y + +
Sbjct: 131 ----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFT-YGDGS 185

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEIS 255
            +SG  V D ++  +   N    +  AS++ GC   QSG       A DG+ G G  ++S
Sbjct: 186 GTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLS 245

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYII 309
           V S L   G+    FS C    D+G   +  G+        T  + S   Y     + ++
Sbjct: 246 VVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVV 305

Query: 310 GVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 368
             +   I SS    ++ +  IVDSG++  +L    Y+         V+ ++ S      +
Sbjct: 306 NGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ 365

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            C+ +SS      P+V L F
Sbjct: 366 -CFVTSSSVDSSFPTVSLYF 384


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 83/301 (27%), Positives = 126/301 (41%), Gaps = 22/301 (7%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT I++GTP   F V +D GSD+LW+ C      PL++     L   LN + P  SST
Sbjct: 40  LYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTS----GLGVALNFFDPRGSST 95

Query: 162 SKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           +  LSC    C     +  S     + C Y+ + Y + + + G  V D        +  +
Sbjct: 96  ASPLSCIDSKCVSSNQISESVCTTDRYCGYSFE-YGDGSGTLGYYVSDEFDYNQYVNQYV 154

Query: 218 KNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
            N+  A +  GC   QSG       A DG+ G G  ++SV S L   GL    FS C + 
Sbjct: 155 TNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEG 214

Query: 277 DDSGR--IFFGDQGPATQQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-I 329
            D G   +  G+        T  + S   Y   + G+    +   I       T+ +  I
Sbjct: 215 ADPGGGILVLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNTRGTI 274

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF--EGYPWKCCYKSSSQRLPKLPSVKLM 387
           +D G++  +L +E YE         V+ +   F  +G P   C+ +        PSV L 
Sbjct: 275 IDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNP---CFLTVHSIDEIFPSVTLY 331

Query: 388 F 388
           F
Sbjct: 332 F 332


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 70/260 (26%), Positives = 122/260 (46%), Gaps = 20/260 (7%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP+  F + +D+GS + ++PC  C +C    +   N ++     + P  SST  
Sbjct: 94  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            + C     ++  +C N +  C Y   Y  E +SSSG+L EDI+    G ++ LK     
Sbjct: 154 PVKC-----NVDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSF--GKESELK---PQ 202

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
             + GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   D G   
Sbjct: 203 RAVFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261

Query: 284 FGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSF 336
               G        F  SN  +   Y I ++   +    L+       +    ++DSG+++
Sbjct: 262 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTY 321

Query: 337 TFLPKEVYETIAAEFDRQVN 356
            +LP++ +         +VN
Sbjct: 322 AYLPEQAFVAFKDAVTNKVN 341


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 70/260 (26%), Positives = 122/260 (46%), Gaps = 20/260 (7%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP+  F + +D+GS + ++PC  C +C    +   N ++     + P  SST  
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            + C     ++  +C N +  C Y   Y  E +SSSG+L EDI+    G ++ LK     
Sbjct: 153 PVKC-----NVDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSF--GKESELK---PQ 201

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
             + GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   D G   
Sbjct: 202 RAVFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260

Query: 284 FGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSF 336
               G        F  SN  +   Y I ++   +    L+       +    ++DSG+++
Sbjct: 261 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTY 320

Query: 337 TFLPKEVYETIAAEFDRQVN 356
            +LP++ +         +VN
Sbjct: 321 AYLPEQAFVAFKDAVTNKVN 340


>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 440

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 70/252 (27%), Positives = 110/252 (43%), Gaps = 21/252 (8%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           ++IG P   + + +D GSDL W+ CD  C RC+      Y    R  N+  P      +H
Sbjct: 83  LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLY----RPSNDLVPC-----RH 133

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
             C+         C+ P Q C Y + Y  ++ SS G+L+ D+  L         N VQ  
Sbjct: 134 ALCASLHLSDNYDCEVPHQ-CDYEVQY-ADHYSSLGVLLHDVYTL------NFTNGVQLK 185

Query: 225 V--IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 282
           V   +GCG  Q          DG++GLG G+ S+ S L   GL+RN    C      G I
Sbjct: 186 VRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYI 245

Query: 283 FFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 342
           FFGD   + + + + ++S       + G      G       +  A+ D+GSS+T+    
Sbjct: 246 FFGDVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYFNSY 305

Query: 343 VYETIAAEFDRQ 354
            Y+ + +   ++
Sbjct: 306 AYQVLISWLKKE 317


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 75/257 (29%), Positives = 118/257 (45%), Gaps = 26/257 (10%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSK- 163
           I+IG P   + + LD GSDL W+ CD  CV C  L A +   L +  N+  P      K 
Sbjct: 61  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVHC--LEAPH--PLYQPSNDLIPCNDPLCKA 116

Query: 164 -HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN-SV 221
            H + +HR       C+ P+Q C Y ++Y  +  SS G+LV D+  L     N  K   +
Sbjct: 117 LHFNGNHR-------CETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSL-----NYTKGLRL 162

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
              + +GCG  Q  G       DG++GLG G++S+ S L   G ++N    C      G 
Sbjct: 163 TPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGI 222

Query: 282 IFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 339
           +FFG+    + +   T     N K+ +  +G E    G       +   + DSGSS+T+ 
Sbjct: 223 LFFGNDLYDSSRVSWTPMARENSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYTYF 281

Query: 340 PKEVYETIAAEFDRQVN 356
             + Y+ +     R+++
Sbjct: 282 NSKAYQAVTYLLKRELS 298


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 139/309 (44%), Gaps = 38/309 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T + +G+P   F V +D GSD+LW+ C      P S+     L   LN + P +SST
Sbjct: 67  LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSS----GLHIPLNFFDPGSSST 122

Query: 162 SKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +  +SCS + C LG       C +    C YT   Y + + +SG  V D+L+  +   ++
Sbjct: 123 ASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQ-YGDGSGTSGYYVSDLLNFDAIVGSS 181

Query: 217 LKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-- 273
           + NS  AS++ GC + Q+G       A DG+ G G  ++SV S ++  G+    FS C  
Sbjct: 182 VTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLK 240

Query: 274 --------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
                          ++D         Q P    +   ++ NGK     + ++     +S
Sbjct: 241 GDGGGGGILVLGEIVEEDIVYSPLVPSQ-PHYNLNLQSISVNGKS----LAIDPEVFATS 295

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             + T    IVDSG++  +L +E Y+   +     V+ ++        +C   +SS +  
Sbjct: 296 TNRGT----IVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSVK-G 350

Query: 380 KLPSVKLMF 388
             P+V L F
Sbjct: 351 IFPTVSLNF 359


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 139/309 (44%), Gaps = 38/309 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T + +G+P   F V +D GSD+LW+ C      P S+     L   LN + P +SST
Sbjct: 82  LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSS----GLHIPLNFFDPGSSST 137

Query: 162 SKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +  +SCS + C LG       C +    C YT   Y + + +SG  V D+L+  +   ++
Sbjct: 138 ASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQ-YGDGSGTSGYYVSDLLNFDAIVGSS 196

Query: 217 LKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-- 273
           + NS  AS++ GC + Q+G       A DG+ G G  ++SV S ++  G+    FS C  
Sbjct: 197 VTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLK 255

Query: 274 --------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
                          ++D         Q P    +   ++ NGK     + ++     +S
Sbjct: 256 GDGGGGGILVLGEIVEEDIVYSPLVPSQ-PHYNLNLQSISVNGKS----LAIDPEVFATS 310

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             + T    IVDSG++  +L +E Y+   +     V+ ++        +C   +SS +  
Sbjct: 311 TNRGT----IVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSVK-G 365

Query: 380 KLPSVKLMF 388
             P+V L F
Sbjct: 366 IFPTVSLNF 374


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 151/369 (40%), Gaps = 25/369 (6%)

Query: 54  PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGS-KTMSLGNDFGWLHYTWIDIGTP 112
           PA    E  Q+    + +  ++       + FP  G+     +G     L+YT + +GTP
Sbjct: 36  PANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVG-----LYYTKLRLGTP 90

Query: 113 NVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
              F V +D GSD+LW+ C      P ++     L   LN + P +S T+  +SCS + C
Sbjct: 91  PRDFYVQVDTGSDVLWVSCASCNGCPQTS----GLQIQLNFFDPGSSVTASPISCSDQRC 146

Query: 173 DLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
             G     + C      C YT   Y + + +SG  V D+L       ++L  +  A V+ 
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQ-YGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVF 205

Query: 228 GCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFF 284
           GC   Q+G  +    A DG+ G G   +SV S LA  G+    FS C   ++ G   +  
Sbjct: 206 GCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVL 265

Query: 285 GDQGPATQQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA-IVDSGSSFTFL 339
           G+        T  + S   Y   ++ +    +   I  S    ++ +  I+D+G++  +L
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325

Query: 340 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 399
            +  Y          V+ ++        + CY  ++      P V L F    S  +N  
Sbjct: 326 SEAAYVPFVEAITNAVSQSVRPVVSKGNQ-CYVITTSVGDIFPPVSLNFAGGASMFLNPQ 384

Query: 400 VFVIYGTQV 408
            ++I    V
Sbjct: 385 DYLIQQNNV 393


>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
 gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 405

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/264 (31%), Positives = 120/264 (45%), Gaps = 35/264 (13%)

Query: 96  GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEY 154
           GN F   +Y+  + IG+P  +F   +D GSDL W+ CD    AP S     +L  +L +Y
Sbjct: 41  GNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCD----APCSGC---TLPPNL-QY 92

Query: 155 SPSASSTSKHLSCSHRLCDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--IL 207
            P  +     + CS+ +C          C NP++ C Y + Y  +  SS G LV D   L
Sbjct: 93  KPKGNI----IPCSNPICTALHWPNKPHCPNPQEQCDYEVKY-ADQGSSMGALVTDQFPL 147

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKAG 264
            L++G      + +Q  V  GCG  QS  Y     P    G++GLG G+I + + L  AG
Sbjct: 148 KLVNG------SFMQPPVAFGCGYDQS--YPSAHPPPATAGVLGLGRGKIGLLTQLVSAG 199

Query: 265 LIRNSFSMCFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           L RN    C      G +FFGD   P+   + + L S   +  Y  G             
Sbjct: 200 LTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQDNH--YTTGPADLLFNGKPTGL 257

Query: 324 TSFKAIVDSGSSFTFLPKEVYETI 347
              K I D+GSS+T+   + Y+TI
Sbjct: 258 KGLKLIFDTGSSYTYFNSKAYQTI 281


>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
          Length = 165

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 3/119 (2%)

Query: 27  FSTKLIHRFSEEVKALGVSKN-RNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           +S ++ H+FS EVK     ++  +   WP + S EYY+ L   D  +   K      + F
Sbjct: 28  YSLQMYHKFSNEVKEWMTWRHGLDTDGWPVEGSNEYYKALYHHDSARHGRKLADHPSLTF 87

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
             +G++T+ +    G+L Y+ + +GTPNV+  VALD GSD+ W+PCDC  CAP SA+ Y
Sbjct: 88  -LEGNETVEIPQ-LGFLFYSMVQVGTPNVTLFVALDTGSDVFWVPCDCQACAPTSAASY 144


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 160/380 (42%), Gaps = 53/380 (13%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATS-----WPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
            S +++HR    ++ L   K  NA S        +   +     LSS    Q+       
Sbjct: 63  LSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEK------ 116

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
           Q   P Q   ++  G+     +   + +GTP   F +  D GSDL W      +C P + 
Sbjct: 117 QATLPVQSGASIGSGD-----YAVTVGLGTPKKEFTLIFDTGSDLTW-----TQCEPCAK 166

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-----GTSCQNPKQPCPYTMDYYTENT 196
           + Y   +  L+   P+ S++ K++SCS   C L     G SC +P   C Y +  Y + +
Sbjct: 167 TCYKQKEPRLD---PTKSTSYKNISCSSAFCKLLDTEGGESCSSPT--CLYQVQ-YGDGS 220

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G    + L L S   N  KN      + GCG +Q+ G   G A  GL+GLG  ++S+
Sbjct: 221 YSIGFFATETLTLSS--SNVFKN-----FLFGCG-QQNSGLFRGAA--GLLGLGRTKLSL 270

Query: 257 PSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           PS  A+    +  FS C     S  G + FG Q   T + T           Y + +   
Sbjct: 271 PSQTAQK--YKKLFSYCLPASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITEL 328

Query: 315 CIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WK 368
            +G + L       ++   ++DSG+  T LP   Y  +++ F + + D   S +GY  + 
Sbjct: 329 SVGGNKLSIDASIFSTSGTVIDSGTVITRLPSTAYSALSSAFQKLMTD-YPSTDGYSIFD 387

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            CY  S     K+P V + F
Sbjct: 388 TCYDFSKNETIKIPKVGVSF 407


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 81/261 (31%), Positives = 118/261 (45%), Gaps = 30/261 (11%)

Query: 96  GNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCD--CVRCA-PLSASYYNSLDRDL 151
           GN +   HY+ I +IG P  +F + +D GSDL W+ CD  C  C  PL   Y     +  
Sbjct: 60  GNVYPTGHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLY-----KPK 114

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
           N   P ASS  + +           +C  P + C Y ++Y  +  SS G+L+ D   L  
Sbjct: 115 NNRVPCASSLCQAIQ--------NNNCDIPTEQCDYEVEY-ADLGSSLGVLLSDYFPLRL 165

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKAGLIRN 268
              + L    Q  +  GCG  Q   YL   +P    G++GLG G+ S+ S L   G+ +N
Sbjct: 166 NNGSLL----QPRIAFGCGYDQK--YLGPHSPPDTAGILGLGRGKASILSQLRTLGITQN 219

Query: 269 SFSMCFDKDDSGRIFFGDQ--GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
               CF +   G +FFGD    P+    T  L S+   + Y  G      G         
Sbjct: 220 VVGHCFSRVTGGFLFFGDHLLPPSGITWTPMLRSSSDTL-YSSGPAELLFGGKPTGIKGL 278

Query: 327 KAIVDSGSSFTFLPKEVYETI 347
           + I DSGSS+T+   +VY++I
Sbjct: 279 QLIFDSGSSYTYFNAQVYQSI 299


>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 410

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 82/295 (27%), Positives = 128/295 (43%), Gaps = 41/295 (13%)

Query: 96  GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN F   +Y+  + IG P  +F   +D GSD+ W+ CD  C  C         +L   L 
Sbjct: 46  GNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKL- 95

Query: 153 EYSPSASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI- 206
           +Y P  ++    + CS  +C          C NPK+ C Y ++Y  + +S   L+++   
Sbjct: 96  QYKPKGNT----VPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFP 151

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKA 263
             L++G      +++Q  +  GCG  QS  Y     P    G++GLG G+I + + L  A
Sbjct: 152 FKLLNG------SAMQPRLAFGCGYDQS--YPSAHPPPATAGVLGLGRGKIGLLTQLVSA 203

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQG-PATQQS-TSFLASNGKYITYIIGVETCCIGSSCL 321
           GL RN    C      G +FFGD   P+   + T  L  +  Y T   G           
Sbjct: 204 GLTRNVVGHCLSSKGGGYLFFGDTLIPSLGVAWTPLLPPDNHYTT---GPAELLFNGKPT 260

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEF--DRQVNDTITSFEGYPWKCCYKSS 374
                K I D+GSS+T+   + Y+TI      D +V+    + E      C+K +
Sbjct: 261 GLKGLKLIFDTGSSYTYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGA 315


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 133/301 (44%), Gaps = 22/301 (7%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T + +G+P   + V +D GSD+LW+ C  C  C   S      L+  L  ++P  SST
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSG-----LNIQLEFFNPDTSST 171

Query: 162 SKHLSCSHRLCD--LGTS---CQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           S  + CS   C   L TS   CQ +   PC YT   Y + + +SG  V D ++  +   N
Sbjct: 172 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFT-YGDGSGTSGYYVSDTMYFDTVMGN 230

Query: 216 ALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
               +  AS++ GC   QSG       A DG+ G G  ++SV S L   G+    FS C 
Sbjct: 231 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 290

Query: 275 DKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYIIGVETCCIGSSCLKQTSFKA 328
              D+G   +  G+        T  + S   Y     + ++  +   I SS    ++ + 
Sbjct: 291 KGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQG 350

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
            IVDSG++  +L    Y+         V+ ++ S      + C+ +SS      P+V L 
Sbjct: 351 TIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ-CFVTSSSVDSSFPTVSLY 409

Query: 388 F 388
           F
Sbjct: 410 F 410


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 125/282 (44%), Gaps = 47/282 (16%)

Query: 79  PQFQMLFPSQGSKTMSLGNDF-GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCA 137
           PQ   LFP   +     GN F   L+YT I +G+P   + + +D GS   W+ CD   CA
Sbjct: 140 PQNSTLFPHSLA-----GNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCA 194

Query: 138 PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTS 197
             +   +         Y P  + T+  L  S  LC+ G   +NP Q C Y +  Y + +S
Sbjct: 195 SCAKGAHPL-------YRP--ARTADALPASDPLCE-GAQHENPNQ-CDYEIS-YADGSS 242

Query: 198 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISV 256
           S G+ V D +  + G D   +N   A ++ GCG  Q G  L+ +   DG++GL    +S+
Sbjct: 243 SMGVYVRDSMQFV-GEDGEREN---ADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSL 298

Query: 257 PSLLAKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQSTSFLASNGKYITYIIGVET 313
           P+ LA  G+I N+F  C   D SG    +F GD          ++   G     I     
Sbjct: 299 PTQLASRGIISNAFGHCMSTDPSGAGGYLFLGD---------DYIPRWGMTWVPIRDGPA 349

Query: 314 CCIGSSCLKQTSF------------KAIVDSGSSFTFLPKEV 343
             +  + +KQ +             + + D+GS++T+ P E 
Sbjct: 350 DDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEA 391


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 128/305 (41%), Gaps = 26/305 (8%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G L++T + +G P   ++V +D GSD+LW+ C  C  C   SA     L+  L  Y P  
Sbjct: 26  GGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRE 80

Query: 159 SSTSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           SST+  +SCS  LC  G       C      C Y    Y + ++S G  V D +      
Sbjct: 81  SSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFS-YGDGSTSEGYYVRDAMQYNVIS 139

Query: 214 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
            N L N+  + V+ GC ++Q+G       A DG+IG G  E+SVP+ LA    I   FS 
Sbjct: 140 SNGLANTT-SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSH 198

Query: 273 CFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQT 324
           C + +  G       G A +   ++       + Y + +    + S+ L           
Sbjct: 199 CLEGEKRGGGILVIGGIA-EPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTN 257

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PS 383
               I+DSG++  + P   Y           + T    +G   +C   S   RL  L P+
Sbjct: 258 DTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSG--RLSDLFPN 315

Query: 384 VKLMF 388
           V L F
Sbjct: 316 VTLNF 320


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 72/260 (27%), Positives = 114/260 (43%), Gaps = 32/260 (12%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG P   + + LD GSDL W+ CD  CVRC              L    P    +S  
Sbjct: 64  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 109

Query: 165 LSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           + C+  LC     +    C+ P+Q C Y ++Y  +  SS G+LV D+  +     N  K 
Sbjct: 110 IPCNDPLCKALHLNSNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM-----NYTKG 162

Query: 220 -SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
             +   + +GCG  Q  G       DG++GLG G++S+ S L   G ++N    C     
Sbjct: 163 LRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLG 222

Query: 279 SGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSF 336
            G +FFGD    + +   T       K+ +  +G E    G       +   + DSGSS+
Sbjct: 223 GGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSY 281

Query: 337 TFLPKEVYETIAAEFDRQVN 356
           T+   + Y+ +     R+++
Sbjct: 282 TYFNSKAYQAVTYLLKRELS 301


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 87/312 (27%), Positives = 140/312 (44%), Gaps = 29/312 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I +GTP   + V +D GSD+LW+ C  C  C   S      L  +L+ YSPS+SS
Sbjct: 73  LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKS-----DLGIELSLYSPSSSS 127

Query: 161 TSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           TS  ++C+   C    D       P+  C Y +  Y + +S++G  V D + L     N 
Sbjct: 128 TSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRV-AYGDGSSTAGYFVRDHVVLDRVTGNF 186

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
              S   S++ GCG +QSG       A DG++G G    S+ S LA +G ++  F+ C D
Sbjct: 187 QTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLD 246

Query: 276 KDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVE---------TCCIGSSCLKQTS 325
             + G IF  G+      ++T  +     Y  ++  +E         T    +   K T 
Sbjct: 247 NINGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGT- 305

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              I+DSG++  + P  +YE + ++ F RQ    + + E      C++         P+V
Sbjct: 306 ---IIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVE--EQFTCFEYDGNVDDGFPTV 360

Query: 385 KLMFPQNNSFVV 396
              F  + S  V
Sbjct: 361 TFHFEDSLSLTV 372


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 123/277 (44%), Gaps = 25/277 (9%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           FP +GS    +      L++T + +G P   + V +D GSD+LW+ C      P S+   
Sbjct: 75  FPVEGSANPYMVG----LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSS--- 127

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-----CQ---NPKQPCPYTMDYYTENT 196
             L+  L  ++P +SSTS  + CS   C          CQ   +P  PC YT   Y + +
Sbjct: 128 -GLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFT-YGDGS 185

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEIS 255
            +SG  V D ++  +   N    +  ASV+ GC   QSG  +    A DG+ G G  ++S
Sbjct: 186 GTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLS 245

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYII 309
           V S L   G+   +FS C    D+G   +  G+        T  + S   Y     +  +
Sbjct: 246 VVSQLYSLGVSPKTFSHCLKGSDNGGGILVLGEIVEPGLVFTPLVPSQPHYNLNLESIAV 305

Query: 310 GVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYE 345
             +   I SS    ++ +  IVDSG++  +L    Y+
Sbjct: 306 SGQKLPIDSSLFATSNTQGTIVDSGTTLVYLVDGAYD 342


>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 440

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 107/246 (43%), Gaps = 28/246 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG P   + + +D GSDL W+ CD  C RC+      Y    R  N+  P        
Sbjct: 89  INIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLY----RPSNDLVP-------- 136

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSV 221
             C H LC       N +    +  DY   Y ++ SS G+LV D+  L         N V
Sbjct: 137 --CRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDVYVL------NFTNGV 188

Query: 222 QASV--IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q  V   +GCG  Q          DG++GLG G+ S+ S L   GL+RN    C      
Sbjct: 189 QLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGG 248

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFL 339
           G IFFGD   +++ + + ++S   Y  Y  G     +G       +  A+ D+GSS+T+ 
Sbjct: 249 GYIFFGDVYDSSRLAWTPMSSR-DYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTYF 307

Query: 340 PKEVYE 345
               Y+
Sbjct: 308 NSNAYQ 313


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 125/270 (46%), Gaps = 33/270 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + +G+P   + + +D GSDL W  CD  C  CA      YN            A 
Sbjct: 39  LYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNP---------KKAK 89

Query: 160 STSKHLSCSHRLCDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
               HL    ++   G+  C +  + C Y ++Y  + +S+ G+LVED L +       L 
Sbjct: 90  VVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEY-ADGSSTMGVLVEDTLTV------RLT 142

Query: 219 NS--VQASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 274
           N   +Q   IIGCG  Q G      A  DG+IGL   ++++P+ LA+ G+I+N    C  
Sbjct: 143 NGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLA 202

Query: 275 -DKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSC--------LKQT 324
              +  G +FFGD+  P+   + + +    + + Y   +++   G           L ++
Sbjct: 203 DGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLNNDEDLTRS 262

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQ 354
           +   + DSG+SFT+L  + Y ++ +   +Q
Sbjct: 263 TSSVMFDSGTSFTYLVPQAYASVLSAVTKQ 292


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 89/337 (26%), Positives = 136/337 (40%), Gaps = 57/337 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP     + LD GSDL WI CD C  C   + S+Y           P  SST +++SC
Sbjct: 177 VGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHY----------YPKDSSTYRNISC 226

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
               C L +S      C+   Q CPY  DY   + ++     E     ++  +   K   
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 276
              V+ GCG   + G+  G +  GL+GLG G IS PS +    +  +SFS C      + 
Sbjct: 287 VVDVMFGCG-HWNKGFFYGAS--GLLGLGRGPISFPSQIQ--SIYGHSFSYCLTDLFSNT 341

Query: 277 DDSGRIFFGDQGPATQQS----TSFLASNG--KYITYIIGVETCCIGSSCL---KQT--- 324
             S ++ FG+            T+ LA         Y + +++  +G   L   +QT   
Sbjct: 342 SVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHW 401

Query: 325 ---------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
                        I+DSGS+ TF P   Y+ I   F++++     + + +    CY  S 
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSG 461

Query: 376 QRLP-KLPSVKLM--------FPQNNSFVVNNPVFVI 403
             +  +LP   +         FP  N F    P  VI
Sbjct: 462 AMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVI 498


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 115/265 (43%), Gaps = 37/265 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           ++   +IG P   + +  D GSDL W+ CD  C++C P     Y                
Sbjct: 67  YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQP-------------- 112

Query: 161 TSKHLSCSHRLCDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGD 214
           T+  + C   +C         C +P Q C Y ++Y  +  SS G+LV D+  ++L SG  
Sbjct: 113 TNDLVVCKDPICASLHPDNYRCDDPDQ-CDYEVEY-ADGGSSIGVLVNDLFPVNLTSG-- 168

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
                  +  + IGCG  Q    L G+A    DG++GLG G  S+ + L+  GL+RN   
Sbjct: 169 ----MRARPRLTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVG 220

Query: 272 MCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 331
            CF +   G +FFGD    + +      S      Y  G     +        +   + D
Sbjct: 221 HCFSRRGGGYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFD 280

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVN 356
           SGSS+T+   + Y+T+ +   + ++
Sbjct: 281 SGSSYTYFNTQTYQTLLSFIKKDLH 305


>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 30/259 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG P   + + LD GSDL W+ CD  CVRC              L    P    +S  
Sbjct: 64  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 109

Query: 165 LSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           + C+  LC     +    C+ P+Q C Y ++Y  +  SS G+LV D+  +    +     
Sbjct: 110 IPCNDPLCKALHLNSNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGL 163

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
            +   + +GCG  Q  G       DG++GLG G++S+ S L   G ++N    C      
Sbjct: 164 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 223

Query: 280 GRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           G +FFGD    + +   T       K+ +  +G E    G       +   + DSGSS+T
Sbjct: 224 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYT 282

Query: 338 FLPKEVYETIAAEFDRQVN 356
           +   + Y+ +     R+++
Sbjct: 283 YFNSKAYQAVTYLLKRELS 301


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 84/303 (27%), Positives = 127/303 (41%), Gaps = 26/303 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +G P   ++V +D GSD+LW+ C  C  C   SA     L+  L  Y P  SS
Sbjct: 1   LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSA-----LNIPLTMYDPRESS 55

Query: 161 TSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           T+  +SCS  LC  G       C      C Y    Y + ++S G  V D +       N
Sbjct: 56  TTSLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFS-YGDGSTSEGYYVRDAMQYNVISSN 114

Query: 216 ALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            L N+  + V+ GC ++Q+G       A DG+IG G  E+SVP+ LA    I   FS C 
Sbjct: 115 GLANTT-SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 173

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------KQTSF 326
           + +  G       G A +   ++       + Y + +    + S+ L             
Sbjct: 174 EGEKRGGGILVIGGIA-EPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDT 232

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL-PSVK 385
             I+DSG++  + P   Y           + T    +G   +C   S   RL  L P+V 
Sbjct: 233 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSG--RLSDLFPNVT 290

Query: 386 LMF 388
           L F
Sbjct: 291 LNF 293


>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
 gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
          Length = 420

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 30/259 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG P   + + LD GSDL W+ CD  CVRC              L    P    +S  
Sbjct: 42  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 87

Query: 165 LSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           + C+  LC     +    C+ P+Q C Y ++Y  +  SS G+LV D+  +    +     
Sbjct: 88  IPCNDPLCKALHLNSNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGL 141

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
            +   + +GCG  Q  G       DG++GLG G++S+ S L   G ++N    C      
Sbjct: 142 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 201

Query: 280 GRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           G +FFGD    + +   T       K+ +  +G E    G       +   + DSGSS+T
Sbjct: 202 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYT 260

Query: 338 FLPKEVYETIAAEFDRQVN 356
           +   + Y+ +     R+++
Sbjct: 261 YFNSKAYQAVTYLLKRELS 279


>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 406

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 125/282 (44%), Gaps = 47/282 (16%)

Query: 79  PQFQMLFPSQGSKTMSLGNDFG-WLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCA 137
           PQ   LFP   +     GN F   L+YT I +G+P   + + +D GS   W+ CD   CA
Sbjct: 140 PQNSTLFPHSLA-----GNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCA 194

Query: 138 PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTS 197
             +   +         Y P  + T+  L  S  LC+ G   +NP Q C Y +  Y + +S
Sbjct: 195 SCAKGAHPL-------YRP--ARTADALPASDPLCE-GAQHENPNQ-CDYEIS-YADGSS 242

Query: 198 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISV 256
           S G+ V D +  + G D   +N   A ++ GCG  Q G  L+ +   DG++GL    +S+
Sbjct: 243 SMGVYVRDSMQFV-GEDGEREN---ADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSL 298

Query: 257 PSLLAKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQSTSFLASNGKYITYIIGVET 313
           P+ LA  G+I N+F  C   D SG    +F GD          ++   G     I     
Sbjct: 299 PTQLASRGIISNAFGHCMSTDPSGAGGYLFLGD---------DYIPRWGMTWVPIRDGPA 349

Query: 314 CCIGSSCLKQTSF------------KAIVDSGSSFTFLPKEV 343
             +  + +KQ +             + + D+GS++T+ P E 
Sbjct: 350 DDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEA 391


>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 401

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 30/259 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG P   + + LD GSDL W+ CD  CVRC              L    P    +S  
Sbjct: 61  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 106

Query: 165 LSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           + C+  LC     +    C+ P+Q C Y ++Y  +  SS G+LV D+  +    +     
Sbjct: 107 IPCNDPLCKALHLNSNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGL 160

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
            +   + +GCG  Q  G       DG++GLG G++S+ S L   G ++N    C      
Sbjct: 161 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 220

Query: 280 GRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           G +FFGD    + +   T       K+ +  +G E    G       +   + DSGSS+T
Sbjct: 221 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGE-LLFGGRTTGLKNLLTVFDSGSSYT 279

Query: 338 FLPKEVYETIAAEFDRQVN 356
           +   + Y+ +     R+++
Sbjct: 280 YFNSKAYQAVTYLLKRELS 298


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 30/259 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG P   + + LD GSDL W+ CD  CVRC              L    P    +S  
Sbjct: 52  INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC--------------LEAPHPLYQPSSDL 97

Query: 165 LSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           + C+  LC     +    C+ P+Q C Y ++Y  +  SS G+LV D+  +    +     
Sbjct: 98  IPCNDPLCKALHLNSNQRCETPEQ-CDYEVEY-ADGGSSLGVLVRDVFSM----NYTQGL 151

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
            +   + +GCG  Q  G       DG++GLG G++S+ S L   G ++N    C      
Sbjct: 152 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 211

Query: 280 GRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           G +FFGD    + +   T       K+ +  +G E    G       +   + DSGSS+T
Sbjct: 212 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL-LFGGRTTGLKNLLTVFDSGSSYT 270

Query: 338 FLPKEVYETIAAEFDRQVN 356
           +   + Y+ +     R+++
Sbjct: 271 YFNSKAYQAVTYLLKRELS 289


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 112/249 (44%), Gaps = 23/249 (9%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG  + +F   +D+GSDL W+ CD  C  C       Y   +  LN + P    TS H
Sbjct: 59  INIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLC--TSLH 116

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
              +H        C++    C Y ++Y  ++ SS G+LV D + L       L N   A+
Sbjct: 117 PITNHH-------CKSADDQCQYEIEY-ADHGSSLGVLVNDHVPL------KLTNGSLAA 162

Query: 225 --VIIGCGMKQSGGYLDGVAPD-GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
             +  GCG        D   P  G++GLG GE+S  S L+  G++RN    C   D+ G 
Sbjct: 163 PRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL-SDEGGF 221

Query: 282 IFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 340
           +FFGD+  P++  + + ++       Y  G      G           + DSGSS+T+  
Sbjct: 222 LFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFN 281

Query: 341 KEVYETIAA 349
            + Y +I A
Sbjct: 282 SQAYNSILA 290


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 147/341 (43%), Gaps = 25/341 (7%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           FP QGS    L      L++T + +G+P   F V +D GSD+LW+ C      P S+   
Sbjct: 86  FPVQGSSDPYLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSS--- 138

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSS 199
             L  DL+ +    S T+  ++CS  +C          C    Q C Y+   Y + + +S
Sbjct: 139 -GLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQ-CGYSFR-YGDGSGTS 195

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPS 258
           G  + D  +  +    +L  +  A ++ GC   QSG       A DG+ G G G++SV S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255

Query: 259 LLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQQSTSFLASNGKYITYIIGV----E 312
            L+  G+    FS C   D SG   F  G+        +  + S   Y   ++ +    +
Sbjct: 256 QLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQ 315

Query: 313 TCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
              + ++  + ++ +  IVD+G++ T+L KE Y+         V+  +T       + CY
Sbjct: 316 MLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CY 374

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI-YGTQVGVS 411
             S+      PSV L F    S ++    ++  YG   G S
Sbjct: 375 LVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGAS 415


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 78/264 (29%), Positives = 118/264 (44%), Gaps = 26/264 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L++T+I +G P   + + +D  SDL WI CD  C  CA  + + Y    R  N  +P  S
Sbjct: 207 LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKP--RRDNIVTPKDS 264

Query: 160 -STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
                H +     C+   +CQ     C Y ++Y  +++SS G+L  D LHL      A  
Sbjct: 265 LCVELHRNQKAGYCE---TCQQ----CDYEIEY-ADHSSSMGVLARDELHLTM----ANG 312

Query: 219 NSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
           +S       GC   Q G  L+  V  DG++GL   ++S+PS LA  G+I N    C   D
Sbjct: 313 SSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLAND 372

Query: 278 --DSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAI 329
               G +F GD   P    S   +  +    +Y   +     GS  L     ++   + +
Sbjct: 373 VVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIV 432

Query: 330 VDSGSSFTFLPKEVYETIAAEFDR 353
            DSGSS+T+  KE Y  + A   +
Sbjct: 433 FDSGSSYTYFTKEAYSELVASLKQ 456


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 79/269 (29%), Positives = 125/269 (46%), Gaps = 30/269 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L++T I +G+P   + + +D GSDL WI CD  C  CA      Y     +L     S  
Sbjct: 313 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSL- 371

Query: 160 STSKHLSCSHRLCDLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
                  C     +L T  C+  +Q C Y ++ Y +++SS G+L  D LHL+    +  K
Sbjct: 372 -------CVEVQRNLKTGYCETCEQ-CDYEIE-YADHSSSMGVLASDDLHLMLANGSLTK 422

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                 ++ GC   Q G  L+ +A  DG++GL   ++S+PS LA   +I N    C   D
Sbjct: 423 ----LGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSD 478

Query: 278 DS--GRIFFGDQ-----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA 328
            +  G +F GD      G A     +  + N  Y + I+ +       S  +Q   + + 
Sbjct: 479 ATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN--YHSQIMKISHGSRQLSLGRQDGRTERV 536

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVND 357
           + D+GSS+T+ PKE Y  + A   + V+D
Sbjct: 537 VFDTGSSYTYFPKEAYYALVASL-KDVSD 564


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 86/326 (26%), Positives = 140/326 (42%), Gaps = 24/326 (7%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           FP QGS    L      L++T + +G+P   F V +D GSD+LW+ C      P S+   
Sbjct: 86  FPVQGSSDPYLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSS--- 138

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSS 199
             L  DL+ +    S T+  ++CS  +C          C    Q C Y+   Y + + +S
Sbjct: 139 -GLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQCSENNQ-CGYSFR-YGDGSGTS 195

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPS 258
           G  + D  +  +    +L  +  A ++ GC   QSG       A DG+ G G G++SV S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255

Query: 259 LLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQQSTSFLASNGKYITYIIGV----E 312
            L+  G+    FS C   D SG   F  G+        +  L S   Y   ++ +    +
Sbjct: 256 QLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQ 315

Query: 313 TCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
              I ++  + ++ +  IVD+G++ T+L KE Y+         V+  +T       + CY
Sbjct: 316 ILPIDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQ-CY 374

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVVN 397
             S+      P V L F    S ++ 
Sbjct: 375 LVSTSISDMFPPVSLNFAGGASMMLR 400


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 147/341 (43%), Gaps = 25/341 (7%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           FP QGS    L      L++T + +G+P   F V +D GSD+LW+ C      P S+   
Sbjct: 86  FPVQGSSDPYLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSS--- 138

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSS 199
             L  DL+ +    S T+  ++CS  +C          C    Q C Y+   Y + + +S
Sbjct: 139 -GLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENNQ-CGYSFR-YGDGSGTS 195

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPS 258
           G  + D  +  +    +L  +  A ++ GC   QSG       A DG+ G G G++SV S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255

Query: 259 LLAKAGLIRNSFSMCFDKDDSGRIFF--GDQGPATQQSTSFLASNGKYITYIIGV----E 312
            L+  G+    FS C   D SG   F  G+        +  + S   Y   ++ +    +
Sbjct: 256 QLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQ 315

Query: 313 TCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
              + ++  + ++ +  IVD+G++ T+L KE Y+         V+  +T       + CY
Sbjct: 316 MLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-CY 374

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI-YGTQVGVS 411
             S+      PSV L F    S ++    ++  YG   G S
Sbjct: 375 LVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGAS 415


>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 467

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 116/265 (43%), Gaps = 32/265 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   F + +D GSDL W+ CD  C  C    A           +Y P+ ++
Sbjct: 68  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT 117

Query: 161 TSKHLSCSHRLC---DLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
               L CSH LC   DL  +  C +P+  C Y +  Y+++ SS G LV D   L      
Sbjct: 118 ----LPCSHLLCSGLDLTQNRPCDDPEDQCDYEIG-YSDHASSIGALVTDEFPL------ 166

Query: 216 ALKNS--VQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
            L N   +   +  GCG  +Q+ G        G++GLG G++ + + L   G+ +N    
Sbjct: 167 KLANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVH 226

Query: 273 CFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 331
           C      G +  GD+  P++  + + LA+N     Y+ G                  + D
Sbjct: 227 CLSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGINVVFD 286

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVN 356
           SGSS+T+   E Y+ I     + +N
Sbjct: 287 SGSSYTYFNAEAYQAILDLIRKDLN 311


>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
 gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
          Length = 410

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 118/268 (44%), Gaps = 30/268 (11%)

Query: 99  FGWLHYTWIDIGTPNVS--FLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEY 154
            G L+YT I +G P     + + +D GS+L WI CD  C  CA  +   Y     +L   
Sbjct: 26  MGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNL--- 82

Query: 155 SPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
                 +S+      +   L   C+N  Q C Y ++Y  +++ S G+L +D  HL     
Sbjct: 83  ----VRSSEAFCVEVQRNQLTEHCENCHQ-CDYEIEY-ADHSYSMGVLTKDKFHL----- 131

Query: 215 NALKNS--VQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
             L N    ++ ++ GCG  Q G  L+ +   DG++GL   +IS+PS LA  G+I N   
Sbjct: 132 -KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVG 190

Query: 272 MCF--DKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-- 326
            C   D +  G IF G D  P+   +   +  + +   Y + V     G   L       
Sbjct: 191 HCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENG 250

Query: 327 ---KAIVDSGSSFTFLPKEVYETIAAEF 351
              K + D+GSS+T+ P + Y  +    
Sbjct: 251 RVGKVLFDTGSSYTYFPNQAYSQLVTSL 278


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 77/267 (28%), Positives = 118/267 (44%), Gaps = 26/267 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L++T I +G+P   + + +D GSDL WI CD  C  CA      Y     +L     S  
Sbjct: 100 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSL- 158

Query: 160 STSKHLSCSHRLCDLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
                  C     +L T  C+  +Q C Y ++ Y +++SS G+L  D LHL+    +  K
Sbjct: 159 -------CVEVQRNLKTGYCETCEQ-CDYEIE-YADHSSSMGVLASDDLHLMLANGSLTK 209

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                 ++ GC   Q G  L+ +A  DG++GL   ++S+PS LA   +I N    C   D
Sbjct: 210 ----LGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSD 265

Query: 278 DS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-----KQTSFKAIV 330
            +  G +F GD              N     Y   +     GS  L        + + + 
Sbjct: 266 ATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVF 325

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQVND 357
           D+GSS+T+ PKE Y  + A   + V+D
Sbjct: 326 DTGSSYTYFPKEAYYALVASL-KDVSD 351


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 87/306 (28%), Positives = 134/306 (43%), Gaps = 27/306 (8%)

Query: 85  FPSQGSKTMSLGNDFG-WLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSAS 142
           FP +GS      N F   L++T + +G+P   + V +D GSD+LW+ C  C  C   S  
Sbjct: 77  FPVEGS-----ANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSG- 130

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTS---CQ-NPKQPCPYTMDYYTENT 196
               L+  L  ++P  SSTS  + CS   C   L TS   CQ +   PC YT   Y + +
Sbjct: 131 ----LNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFT-YGDGS 185

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEIS 255
            +SG  V D ++  +   N    +  AS++ GC   QSG       A DG+ G G  ++S
Sbjct: 186 GTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLS 245

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQQSTSFLASNGKY----ITYII 309
           V S L   G+    FS C    D+G   +  G+        T  + S   Y     + ++
Sbjct: 246 VVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVV 305

Query: 310 GVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 368
             +   I SS    ++ +  IVDSG++  +L    Y+         V+ ++ S      +
Sbjct: 306 NGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ 365

Query: 369 CCYKSS 374
           C   SS
Sbjct: 366 CFVTSS 371


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 119/261 (45%), Gaps = 27/261 (10%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  + IG P   + +  D GSDL W+ CD  CVRC       Y   +  +    P  +S
Sbjct: 67  YYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPMCAS 126

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
                     L   G  C++P+Q C Y ++Y  +  SS G+LV+D+  L     N L+  
Sbjct: 127 ----------LHPPGYKCEHPEQ-CDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGLR-- 170

Query: 221 VQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           +   + +GCG  Q  G      P DG++GLG G+ S+ S L   G+IRN    C      
Sbjct: 171 LAPRLALGCGYDQIPG--QSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGG 228

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV---DSGSSF 336
           G +FFGD    + +         ++  Y  G     +G    K T FK ++   DSGSS+
Sbjct: 229 GFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGG---KTTVFKNLLVTFDSGSSY 285

Query: 337 TFLPKEVYETIAAEFDRQVND 357
           T+L    Y+ +     +++++
Sbjct: 286 TYLNSLAYQALVHLVRKELSE 306


>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 425

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 113/265 (42%), Gaps = 39/265 (14%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+   I+IG P   + + +D GSDL W+ CD     P +     ++ +D   Y P+    
Sbjct: 61  LYTVSINIGNPPKPYELDIDTGSDLTWVQCD----GPDAPCKGCTMPKD-KLYKPNGKQV 115

Query: 162 SKHLSCSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
            K   CS  +C        LG  C     PC Y + Y  ++ S+ G+LV D +H I    
Sbjct: 116 VK---CSDPICVATQSTHVLGQICSKQSPPCVYNVQY-ADHASTLGVLVRDYMH-IGSPS 170

Query: 215 NALKNSVQASVIIGCGMKQ--SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           ++ K+ +   V  GCG +Q  SG       P G++GLG G+ S+ S L   G I N    
Sbjct: 171 SSTKDPL---VAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGH 227

Query: 273 CFDKDDSGRIFFGDQ---------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           C   +  G +F GD+          P  Q S     + G    +  G  T   G      
Sbjct: 228 CLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFFNGKPTPAKG------ 281

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIA 348
              + I DSGSS+T+    VY  +A
Sbjct: 282 --LQIIFDSGSSYTYFSSPVYTIVA 304


>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
 gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
          Length = 297

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 92/188 (48%), Gaps = 12/188 (6%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T I IGTP   + V +D GSD+LW+  +CV C        ++L  +L  Y P  S +
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWV--NCVSCD--GCPRKSNLGIELTMYDPRGSQS 144

Query: 162 SKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            + ++C  + C      +  SC +   PC Y++  Y + +S++G  V D L       + 
Sbjct: 145 GELVTCDQQFCVANYGGVLPSCTS-TSPCEYSIS-YGDGSSTAGFFVTDFLQYNQVSGDG 202

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                 ASV  GCG K  G      +A DG++G G    S+ S LA AG +R  F+ C D
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262

Query: 276 KDDSGRIF 283
             + G IF
Sbjct: 263 TVNGGGIF 270


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 134/309 (43%), Gaps = 42/309 (13%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T + +G+P   F V +D GSD+LW+ C+     P ++     L   LN +  S+SST
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTS----GLGIQLNFFDSSSSST 120

Query: 162 SKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           +  + CS  +C        T C      C YT   Y + + +SG  V D L+  +    +
Sbjct: 121 AGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQ-YEDGSGTSGYYVSDTLYFDAILGES 179

Query: 217 LKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           L  +  A ++ GC   QSG   +   A DG+ G G GE+SV S L+  G+    FS C  
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239

Query: 276 KD-------------DSGRIF--FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
            +             + G ++       P    +   +A NGK    ++ ++     +S 
Sbjct: 240 GEGIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGK----LLPIDPSVFATS- 294

Query: 321 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
               S   IVDSG++  +L  E Y+   +  +  V+ ++T       +C   S+      
Sbjct: 295 ---NSQGTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQCYLVST------ 345

Query: 381 LPSVKLMFP 389
             SV  MFP
Sbjct: 346 --SVSQMFP 352


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 86/324 (26%), Positives = 139/324 (42%), Gaps = 58/324 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I IGTP +     LD GSDL+W  CD  C RC P  A            Y+P+ S+T  +
Sbjct: 96  IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL----------YAPARSATYAN 145

Query: 165 LSCSHRLCDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SC   +C    S    C  P   C Y    Y + TS+ G+L  +   L  G D A++  
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFS-YGDGTSTDGVLATETFTL--GSDTAVRG- 201

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKD 277
               V  GCG +  G   +     GL+G+G G +   SL+++ G+ R  FS C   F+  
Sbjct: 202 ----VAFGCGTENLGSTDNS---SGLVGMGRGPL---SLVSQLGVTR--FSYCFTPFNAT 249

Query: 278 DSGRIFFGDQG--PATQQSTSFLAS-----NGKYITYIIGVETCCIGSSCL--KQTSFK- 327
            +  +F G      +  ++T F+ S       +   Y + +E   +G + L      F+ 
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309

Query: 328 -------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
                   I+DSG++FT L +  +  +A     +V   + S        C+ ++S    +
Sbjct: 310 TPMGDGGVIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVE 369

Query: 381 LPSVKLMFP------QNNSFVVNN 398
           +P + L F       +  S+VV +
Sbjct: 370 VPRLVLHFDGADMELRRESYVVED 393


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 147/333 (44%), Gaps = 46/333 (13%)

Query: 99  FGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 158
           F  L++T + +G+P   F V +D GSD+LWI  +C+ C+  +  + + L  +L+ +  + 
Sbjct: 79  FVGLYFTKVKLGSPAKEFYVQIDTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAG 134

Query: 159 SSTSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--IS 211
           SST+  +SC   +C        + C +    C YT   Y + + ++G  V D ++   + 
Sbjct: 135 SSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQ-YGDGSGTTGYYVSDTMYFDTVL 193

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
            G + + NS  +++I GC   QSG       A DG+ G G G +SV S L+  G+    F
Sbjct: 194 LGQSVVANS-SSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVF 252

Query: 271 SMCFD--KDDSGRIFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCC 315
           S C    ++  G +  G+               P    +   +A NG+ +          
Sbjct: 253 SHCLKGGENGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLP--------- 303

Query: 316 IGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG---YPWKCCY 371
           I S+    T+ +  IVDSG++  +L +E Y      F + +   ++ F          CY
Sbjct: 304 IDSNVFATTNNQGTIVDSGTTLAYLVQEAYN----PFVKAITAAVSQFSKPIISKGNQCY 359

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 404
             S+      P V L F    S V+N   ++++
Sbjct: 360 LVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMH 392


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 86/324 (26%), Positives = 139/324 (42%), Gaps = 58/324 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I IGTP +     LD GSDL+W  CD  C RC P  A            Y+P+ S+T  +
Sbjct: 96  IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL----------YAPARSATYAN 145

Query: 165 LSCSHRLCDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SC   +C    S    C  P   C Y    Y + TS+ G+L  +   L  G D A++  
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFS-YGDGTSTDGVLATETFTL--GSDTAVRG- 201

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKD 277
               V  GCG +  G   +     GL+G+G G +   SL+++ G+ R  FS C   F+  
Sbjct: 202 ----VAFGCGTENLGSTDNS---SGLVGMGRGPL---SLVSQLGVTR--FSYCFTPFNAT 249

Query: 278 DSGRIFFGDQG--PATQQSTSFLAS-----NGKYITYIIGVETCCIGSSCL--KQTSFK- 327
            +  +F G      +  ++T F+ S       +   Y + +E   +G + L      F+ 
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309

Query: 328 -------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
                   I+DSG++FT L +  +  +A     +V   + S        C+ ++S    +
Sbjct: 310 TPMGDGGVIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVE 369

Query: 381 LPSVKLMFP------QNNSFVVNN 398
           +P + L F       +  S+VV +
Sbjct: 370 VPRLVLHFDGADMELRRESYVVED 393


>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 418

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 78/261 (29%), Positives = 110/261 (42%), Gaps = 42/261 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I+IG P   + + +D GSDL W+ CD     P +     +L +D   Y P+ +   K   
Sbjct: 66  INIGNPPNPYELDIDTGSDLTWVQCD----GPDAPCKGCTLPKD-KLYKPNGNQLVK--- 117

Query: 167 CSHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNAL 217
           CS  +C          G  C  P  PC Y ++Y  +N  S+G L  D +H+ S  G N  
Sbjct: 118 CSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEY-ADNAESTGALARDYMHIGSPSGSNV- 175

Query: 218 KNSVQASVIIGCGMKQS-GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                  V+ GCG +Q   G     +  G++GLG G+IS+ S L   G I N    C   
Sbjct: 176 -----PLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSA 230

Query: 277 DDSGRIFFGDQ---------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
           +  G +F GD+          P  Q S     S G    +  G  T   G         +
Sbjct: 231 EGGGYLFLGDKFIPSSGIFWTPIIQSSLEKHYSTGPVDLFFNGKPTPAKG--------LQ 282

Query: 328 AIVDSGSSFTFLPKEVYETIA 348
            I DSGSS+T+    VY  +A
Sbjct: 283 IIFDSGSSYTYFSPRVYTIVA 303


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 87/314 (27%), Positives = 129/314 (41%), Gaps = 53/314 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +   + IGTP     + LD GSDL+W  C  CV C           D+ L  +  S SST
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSC----------FDQPLPYFDTSRSST 84

Query: 162 SKHLSCSHRLCDLG---TSCQNPKQPCPYTMDYYT---ENTSSSGLLVEDILHLISGGDN 215
           +  L C    C L    T C    Q    T  YYT   +N+ + GLL  D    ++G   
Sbjct: 85  NALLPCESTQCKLDPTVTVCVKLNQTV-QTCAYYTSYGDNSVTIGLLAADKFTFVAG--- 140

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
               +    V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF 
Sbjct: 141 ----TSLPGVTFGCGLNNTGVFNSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFT 189

Query: 276 K-----------DDSGRIFFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSSCLK- 322
                       D    +F   QG   T     +  +      Y + ++   +GS+ L  
Sbjct: 190 TITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPV 249

Query: 323 -QTSFK-------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS 374
            +++F         I+DSG+S T LP +VY+ +  EF  Q+   +          C+ + 
Sbjct: 250 PESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAP 309

Query: 375 SQRLPKLPSVKLMF 388
           SQ  P +P + L F
Sbjct: 310 SQAKPDVPKLVLHF 323


>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 432

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 117/265 (44%), Gaps = 32/265 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   F + +D GSDL W+ CD  C  C    A           +Y P+ ++
Sbjct: 67  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT 116

Query: 161 TSKHLSCSHRLC---DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGG 213
               L CSH LC   DL     C +P+  C Y +  Y+++ SS G LV D   L L +G 
Sbjct: 117 ----LPCSHILCSGLDLPQDRPCADPEDQCDYEIG-YSDHASSIGALVTDEVPLKLANGS 171

Query: 214 DNALKNSVQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
              L+      +  GCG  +Q+ G        G++GLG G++ + + L   G+ +N    
Sbjct: 172 IMNLR------LTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVH 225

Query: 273 CFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 331
           C      G +  GD+  P++  + + LA+N     Y+ G                  + D
Sbjct: 226 CLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFD 285

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVN 356
           SGSS+T+   E Y+ I     + +N
Sbjct: 286 SGSSYTYFNAEAYQAILDLIRKDLN 310


>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
 gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
 gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
 gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
 gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
          Length = 583

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 117/263 (44%), Gaps = 30/263 (11%)

Query: 102 LHYTWIDIGTPNVS--FLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPS 157
           L+YT I +G P     + + +D GS+L WI CD  C  CA  +   Y     +L      
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNL------ 255

Query: 158 ASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
              +S+      +   L   C+N  Q C Y ++Y  +++ S G+L +D  HL       L
Sbjct: 256 -VRSSEAFCVEVQRNQLTEHCENCHQ-CDYEIEY-ADHSYSMGVLTKDKFHL------KL 306

Query: 218 KNS--VQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            N    ++ ++ GCG  Q G  L+ +   DG++GL   +IS+PS LA  G+I N    C 
Sbjct: 307 HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCL 366

Query: 275 --DKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----- 326
             D +  G IF G D  P+   +   +  + +   Y + V     G   L          
Sbjct: 367 ASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVG 426

Query: 327 KAIVDSGSSFTFLPKEVYETIAA 349
           K + D+GSS+T+ P + Y  +  
Sbjct: 427 KVLFDTGSSYTYFPNQAYSQLVT 449


>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
 gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 466

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 117/265 (44%), Gaps = 32/265 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  ++IG P   F + +D GSDL W+ CD  C  C    A           +Y P+ ++
Sbjct: 67  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK----------QYKPNHNT 116

Query: 161 TSKHLSCSHRLC---DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGG 213
               L CSH LC   DL     C +P+  C Y +  Y+++ SS G LV D   L L +G 
Sbjct: 117 ----LPCSHILCSGLDLPQDRPCADPEDQCDYEIG-YSDHASSIGALVTDEVPLKLANGS 171

Query: 214 DNALKNSVQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
              L+      +  GCG  +Q+ G        G++GLG G++ + + L   G+ +N    
Sbjct: 172 IMNLR------LTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVH 225

Query: 273 CFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 331
           C      G +  GD+  P++  + + LA+N     Y+ G                  + D
Sbjct: 226 CLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFD 285

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVN 356
           SGSS+T+   E Y+ I     + +N
Sbjct: 286 SGSSYTYFNAEAYQAILDLIRKDLN 310


>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 547

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 87/186 (46%), Gaps = 21/186 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +YT++ IGTP  +    LD GS L   PC  C RC P     +           P  SST
Sbjct: 81  YYTYLTIGTPGQTVSGILDTGSTLPAFPCSGCTRCGPSKTGMFK----------PELSST 130

Query: 162 SKHLSCSHRLCDLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           S    CS   C  G  SC    + C Y++ Y  E +S+SG L ED+L +  GG       
Sbjct: 131 SSTFGCSDARCFCGANSCSCNNEQCGYSIRYL-EGSSTSGFLAEDMLAVGDGGP------ 183

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
             A+ + GC   +SG     +A DG+ G+G    S+   L + G+I ++FSMCF     G
Sbjct: 184 -AANFVFGCAQSESGLLYSQIA-DGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREG 241

Query: 281 RIFFGD 286
            +  G+
Sbjct: 242 VLLLGN 247


>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
          Length = 320

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 93/190 (48%), Gaps = 16/190 (8%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT I+IG+P   + V +D GSD+LW+  +C+RC        + L  +L +Y P+ S T
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWV--NCIRCD--GCPTRSGLGIELTQYDPAGSGT 138

Query: 162 SKHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           +  + C    C       +  +C +   PC + +  Y + ++++G  V D +       N
Sbjct: 139 T--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRIT-YGDGSTTTGFYVTDFVQYNQVSGN 195

Query: 216 ALKNSVQASVIIGCGMKQSGGYL--DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
               +  AS+  GCG  Q GG L     A DG++G G  + S+ S LA A  +R  F+ C
Sbjct: 196 GQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254

Query: 274 FDKDDSGRIF 283
            D    G IF
Sbjct: 255 LDTVRGGGIF 264


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 122/266 (45%), Gaps = 36/266 (13%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T + +G+P   F V +D GSD+LW+ C+     P S+     L  DLN +  ++SST
Sbjct: 70  LYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSS----GLGIDLNYFDTASSST 125

Query: 162 SKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDN 215
           +  +SCS  +C        + C +    C YT   Y + + +SG  V D ++  +  G +
Sbjct: 126 AALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQ-YGDGSGTSGYYVYDAMYFDVIMGQS 184

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
              NS  ++V+ GC   QSG       A DG+ G G G +SV S ++  G+    FS C 
Sbjct: 185 VFSNS-SSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCL 243

Query: 275 DKDDSGR--IFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
               SG   +  G+               P    +   +A NG+    I+ ++     + 
Sbjct: 244 KGQGSGGGILVLGEILEPNIVYTPLVPLQPHYNLNLQSIAVNGQ----ILPIDQDVFATG 299

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYE 345
             + T    IVDSG++  +L +E Y+
Sbjct: 300 NNRGT----IVDSGTTLAYLVQEAYD 321


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 72/249 (28%), Positives = 111/249 (44%), Gaps = 23/249 (9%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I+IG  + +F   +D+GSDL W+ CD  C  C       Y   +  LN + P    TS H
Sbjct: 59  INIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLC--TSLH 116

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
              +H        C++    C Y ++Y  ++ SS G+LV D + L       L N   A+
Sbjct: 117 PITNHH-------CKSADDQCQYEIEY-ADHGSSLGVLVNDHVPL------KLTNGSLAA 162

Query: 225 --VIIGCGMKQSGGYLDGVAPD-GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
             +  GCG        D   P  G++GLG GE+S  S L+  G++RN    C   D+ G 
Sbjct: 163 PRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL-SDEGGF 221

Query: 282 IFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 340
           +FFGD+  P++  + + ++       Y  G                  + DSGSS+T+  
Sbjct: 222 LFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFN 281

Query: 341 KEVYETIAA 349
            + Y +I A
Sbjct: 282 SQAYNSILA 290


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 134/313 (42%), Gaps = 54/313 (17%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T I IGTP  +F + +D GS L ++PC  C +C            +D N + P  SST +
Sbjct: 94  TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGK---------HQDPN-FQPDWSSTYQ 143

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            L CS     +  +C +    C Y   Y  E +SSSG+L EDI+    G  + LK     
Sbjct: 144 PLKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSF--GKQSELKPQ--- 192

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--- 280
             + GC   ++G      A DG++GLG G++S+   L + G+I NSFS+C+   D G   
Sbjct: 193 RTVFGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGA 251

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGS 334
            +  G   PA    T    +   Y  Y I ++   I    L          +  I+DSG+
Sbjct: 252 MVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGT 309

Query: 335 SFTFLPKEVY----ETIAAEF---------DRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
           ++ +LP+  +    + I  E          DR  ND   S  G          SQ     
Sbjct: 310 TYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG-------SDVSQLSKTF 362

Query: 382 PSVKLMFPQNNSF 394
           P+V L+F   N  
Sbjct: 363 PAVDLVFSNGNRL 375


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/330 (27%), Positives = 135/330 (40%), Gaps = 59/330 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST-SKH 164
           I++G+P   F   +D GSDL+WI C  C +C   S   Y+          PSASST +K 
Sbjct: 8   IELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYD----------PSASSTFAKT 57

Query: 165 LSCSHRLCDLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
              +     L  S C +  + C Y   Y   +++     +E +    SGG +    + Q 
Sbjct: 58  SCSTSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQ- 116

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSG 280
               GCG   SG +  G A  G++GLG G+IS+ + L  A  I N FS C   FD D S 
Sbjct: 117 ---FGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSK 168

Query: 281 R--IFFGDQGPATQQ--STSFLASNGKYITYIIGVETCCIGSS----------------- 319
              + FG          ST  + ++G+   Y +G+E   +G                   
Sbjct: 169 TSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSK 228

Query: 320 ------CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS 373
                  L+  S   I DSG++ T L   VY  + + F   V+          +  CY  
Sbjct: 229 KKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDV 288

Query: 374 SSQRLPKLPSVKLMF-------PQNNSFVV 396
           S  +  K P++ L F       PQ N FV+
Sbjct: 289 SKSKNFKFPALTLAFKGTKFSPPQKNYFVI 318


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 82/276 (29%), Positives = 122/276 (44%), Gaps = 40/276 (14%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP V +L   D GSDL W  C  C++C       Y  L    N   P  S++  H+ C
Sbjct: 86  IGTPPVDYLGIADTGSDLTWAQCLPCLKC-------YQQLRPIFN---PLKSTSFSHVPC 135

Query: 168 SHRLCDLGTSCQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           + + C          Q  C Y+  Y     S   L  E I    + G +++K+      +
Sbjct: 136 NTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKI----TIGSSSVKS------V 185

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIF 283
           IGCG   SGG+  G A  G+IGLG G++S+ S +++   I   FS C        +G+I 
Sbjct: 186 IGCGHASSGGF--GFA-SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKIN 242

Query: 284 FGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KAIVDSGSS 335
           FG      GP    +   L S      Y I +E   IG+   +  +F      I+DSG++
Sbjct: 243 FGQNAVVSGPGVVSTP--LISKNTVTYYYITLEAISIGNE--RHMAFAKQGNVIIDSGTT 298

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
            +FLPKE+Y+ + +   + V        G  W  C+
Sbjct: 299 LSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF 334


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 134/313 (42%), Gaps = 54/313 (17%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T I IGTP  +F + +D GS L ++PC  C +C            +D N + P  SST +
Sbjct: 94  TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGK---------HQDPN-FQPDWSSTYQ 143

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            L CS     +  +C +    C Y   Y  E +SSSG+L EDI+    G  + LK     
Sbjct: 144 PLKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSF--GKQSELKPQ--- 192

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--- 280
             + GC   ++G      A DG++GLG G++S+   L + G+I NSFS+C+   D G   
Sbjct: 193 RTVFGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGA 251

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGS 334
            +  G   PA    T    +   Y  Y I ++   I    L          +  I+DSG+
Sbjct: 252 MVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGT 309

Query: 335 SFTFLPKEVY----ETIAAEF---------DRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
           ++ +LP+  +    + I  E          DR  ND   S  G          SQ     
Sbjct: 310 TYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG-------SDVSQLSKTF 362

Query: 382 PSVKLMFPQNNSF 394
           P+V L+F   N  
Sbjct: 363 PAVDLVFSNGNRL 375


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 76/271 (28%), Positives = 123/271 (45%), Gaps = 31/271 (11%)

Query: 102 LHYTWIDIGTPNVS--FLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPS 157
           L+YT I +G P     + + +D GSDL WI CD  C  CA  +   Y     +L      
Sbjct: 197 LYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNL------ 250

Query: 158 ASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
              +S+      +   L   C++  Q C Y ++Y  +++ S G+L +D  HL       L
Sbjct: 251 -VRSSEPFCVEVQRNQLTEHCESCHQ-CDYEIEY-ADHSYSMGVLTKDKFHL------KL 301

Query: 218 KNS--VQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            N    ++ ++ GCG  Q G  L+ +   DG++GL   +IS+PS LA  G+I N    C 
Sbjct: 302 HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCL 361

Query: 275 --DKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----- 326
             D +  G IF G D  P+   +   +  +     Y + V     G++ L          
Sbjct: 362 ASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVG 421

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
           K + D+GSS+T+ P + Y  +     ++V+D
Sbjct: 422 KVLFDTGSSYTYFPNQAYSQLVTSL-QEVSD 451


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 114/268 (42%), Gaps = 28/268 (10%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+Y  + IG P   + + +D GSDL W+ CD  C  CA      Y+     + +      
Sbjct: 30  LYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARVVD------ 83

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
              +  +C+        +C    + C Y +D Y + +S+ G+LVED + L+      L N
Sbjct: 84  --CRRPTCAQVQRGGQFTCSGDVRQCDYEVD-YVDGSSTMGILVEDTITLV------LTN 134

Query: 220 SV--QASVIIGCGMKQSGGYLDGVA-PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
               Q   +IGCG  Q G      A  DG+IGL   +IS+PS LA  G+  N    C   
Sbjct: 135 GTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAG 194

Query: 275 DKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK-----A 328
             +  G +FFGD   PA   + + +        Y   + +   G   L+          A
Sbjct: 195 GSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGA 254

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           + DSG+SFT+L    Y  + +   RQ  
Sbjct: 255 MFDSGTSFTYLVPNAYTAVLSAVVRQAQ 282


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 145/333 (43%), Gaps = 49/333 (14%)

Query: 83  MLFPSQGSKTMSLGNDF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPL 139
           ++ P   +  M L +D      + T + IG+P   F + +D GS + ++PC +CV+C   
Sbjct: 67  LVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCG-- 124

Query: 140 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS 199
                N  D     + P  SST + + C     +   +C      C Y   Y  E ++SS
Sbjct: 125 -----NHQDP---RFQPELSSTYQPVKC-----NADCNCDENGVQCTYERRY-AEMSTSS 170

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G+L ED++    G ++ L   V    + GC   +SG      A DG++GLG G +SV   
Sbjct: 171 GVLAEDVMSF--GKESEL---VPQRAVFGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQ 224

Query: 260 LAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 316
           L   G++ NSFS+C+   D G    +  G   P     +    S   Y  Y I ++   +
Sbjct: 225 LVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHV 282

Query: 317 GSSCLK------QTSFKAIVDSGSSFTFLPKEVYETI------AAEFDRQVNDTITSFEG 364
               LK         + AI+DSG+++ + P++ Y            F +Q++    +F+ 
Sbjct: 283 AGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK- 341

Query: 365 YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
                C+  + + + +LP V   FP+ +    N
Sbjct: 342 ---DICFSGAGRDVTELPKV---FPEVDMVFAN 368


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 88/324 (27%), Positives = 140/324 (43%), Gaps = 53/324 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T + +GTP    LV LD GSD  WI C  C  C           ++    + PS SST
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDC----------YEQHEALFDPSKSST 183

Query: 162 SKHLSCSHRLC-DLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
              ++CS R C +LG+S    C + K+ CPY +  Y +++ + G L  D L L       
Sbjct: 184 YSDITCSSRECQELGSSHKHNCSSDKK-CPYEIT-YADDSYTVGNLARDTLTLS------ 235

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                    + GCG   +G + +    DGL+GLG G+ S+ S +  A      FS C   
Sbjct: 236 -PTDAVPGFVFGCGHNNAGSFGE---IDGLLGLGRGKASLSSQV--AARYGAGFSYCLPS 289

Query: 277 DDSGRIFFGDQG-----PATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK------QT 324
             S   +    G     P   Q T  +A  G++ + Y + +    +    +K       T
Sbjct: 290 SPSATGYLSFSGAAAAAPTNAQFTEMVA--GQHPSFYYLNLTGITVAGRAIKVPPSVFAT 347

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPK 380
           +   I+DSG++F+ LP   Y    A     V   +  ++  P    +  CY  +     +
Sbjct: 348 AAGTIIDSGTAFSCLPPSAY----AALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVR 403

Query: 381 LPSVKLMFPQNNSFVVNNPVFVIY 404
           +PSV L+F  + + V  +P  V+Y
Sbjct: 404 IPSVALVF-ADGATVHLHPSGVLY 426


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 71/260 (27%), Positives = 120/260 (46%), Gaps = 30/260 (11%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP+  F + +D+GS + ++PC  C +C        N  D     + P  SST  
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCG-------NHQD---PRFQPDLSSTYS 142

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            + C     ++  +C N +  C Y   Y  E +SSSG+L EDI+    G ++ LK     
Sbjct: 143 PVKC-----NVDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSF--GKESELK---PQ 191

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
             + GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   D G   
Sbjct: 192 RAVFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 250

Query: 284 FGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSF 336
               G        F  SN  +   Y I ++   +    L+       +    ++DSG+++
Sbjct: 251 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTY 310

Query: 337 TFLPKEVYETIAAEFDRQVN 356
            +LP++ +         +VN
Sbjct: 311 AYLPEQAFVAFKDAVTNKVN 330


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 136/326 (41%), Gaps = 42/326 (12%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP     + LD GSDL+W      +CAP      +  D+DL    P+ASST   L 
Sbjct: 88  LAVGTPRRPVALTLDTGSDLVW-----TQCAPCR----DCFDQDLPVLDPAASSTYAALP 138

Query: 167 CSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           C    C        G       + C Y   Y  ++ +   +  +      SGG     ++
Sbjct: 139 CGAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHT 198

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KD 277
            +  +  GCG    G +       G+ G G G  S+PS L        SFS CF    + 
Sbjct: 199 RR--LTFGCGHLNKGVFQSN--ETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFES 249

Query: 278 DSGRIFFGDQGPA--------TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK 327
            S  +  G    A          ++T  L +  +   Y + ++   +G + L   +T F+
Sbjct: 250 KSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR 309

Query: 328 A-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---SSSQRLPKLPS 383
           + I+DSG+S T LP+EVYE + AEF  QV    +  EG     C+    ++  R P +PS
Sbjct: 310 STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPS 369

Query: 384 VKLMFPQNN-SFVVNNPVFVIYGTQV 408
           + L     +     +N VF   G +V
Sbjct: 370 LTLHLEGADWELPRSNYVFEDLGARV 395


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 145/334 (43%), Gaps = 49/334 (14%)

Query: 82  QMLFPSQGSKTMSLGNDF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAP 138
            ++ P   +  M L +D      + T + IG+P   F + +D GS + ++PC +CV+C  
Sbjct: 66  NLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCG- 124

Query: 139 LSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSS 198
                 N  D     + P  SST + + C     +   +C      C Y   Y  E ++S
Sbjct: 125 ------NHQDP---RFQPELSSTYQPVKC-----NADCNCDENGVQCTYERRY-AEMSTS 169

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
           SG+L ED++    G ++ L   V    + GC   +SG      A DG++GLG G +SV  
Sbjct: 170 SGVLAEDVMSF--GKESEL---VPQRAVFGCETMESGDLYTQRA-DGIMGLGRGTLSVMD 223

Query: 259 LLAKAGLIRNSFSMCFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
            L   G++ NSFS+C+   D G    +  G   P     +    S   Y  Y I ++   
Sbjct: 224 QLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIH 281

Query: 316 IGSSCLK------QTSFKAIVDSGSSFTFLPKEVYETI------AAEFDRQVNDTITSFE 363
           +    LK         + AI+DSG+++ + P++ Y            F +Q++    +F+
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341

Query: 364 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
                 C+  + + + +LP V   FP+ +    N
Sbjct: 342 ----DICFSGAGRDVTELPKV---FPEVDMVFAN 368


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 114/254 (44%), Gaps = 21/254 (8%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           ++IG P   + + +D GS+L W+ CD  C +C+      Y    +  N++ P        
Sbjct: 78  LNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHPLY----KPSNDFIPCKDPLCAS 133

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           L  +        +C++P Q C Y + Y  +  S+ G+L+ D+  L         N VQ  
Sbjct: 134 LQPTDDY-----TCEDPNQ-CDYEIKY-ADQYSTLGVLLNDVYLL------NFTNGVQLK 180

Query: 225 V--IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 282
           V   +GCG  Q          DG++GLG G+ S+ S L   GL+RN    C      G I
Sbjct: 181 VRMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYI 240

Query: 283 FFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKE 342
           FFG+   +++ S + ++S      Y  G      G       S   I D+GSS+T+   +
Sbjct: 241 FFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQ 300

Query: 343 VYETIAAEFDRQVN 356
            Y+ + +  +++++
Sbjct: 301 AYQAMISLLNKELH 314


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 71/262 (27%), Positives = 118/262 (45%), Gaps = 33/262 (12%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           Y  ++IG P   + + +D GS+L WI C      P      N +   L  Y P      K
Sbjct: 41  YVTMNIGEPAKPYFLDIDTGSNLTWIKC---HATPGPCKTCNKVPHPL--YRPK-----K 90

Query: 164 HLSCSHRLCD-----LGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            + C+  LCD     LGT+  C+     C Y ++Y  + T+S G+L+ D   L +G    
Sbjct: 91  LVPCADPLCDALHKDLGTTKDCREEPDQCHYQINY-ADGTTSLGVLLLDKFSLPTGS--- 146

Query: 217 LKNSVQASVIIGCGMKQSGG----YLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFS 271
                  ++  GCG  Q  G      + V  DG++GLG G + + S L  +G + +N   
Sbjct: 147 -----ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIG 201

Query: 272 MCFDKDDSGRIFFGDQG-PATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFKAI 329
            C      G +F G++  P++     ++    +    Y  G  T  +G + +    FKAI
Sbjct: 202 HCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKAI 261

Query: 330 VDSGSSFTFLPKEVYETIAAEF 351
            DSGS++T+LP+ ++  + +  
Sbjct: 262 FDSGSTYTYLPENLHAQLVSAL 283


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 147/318 (46%), Gaps = 43/318 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D+GS + ++PC DC +C                ++ P  SST + + C
Sbjct: 99  IGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDP----------KFQPEMSSTYQPVKC 148

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                ++  +C + ++ C Y  +Y  E++SS G+L ED   LIS G+ +     +A  + 
Sbjct: 149 -----NMDCNCDDDREQCVYEREY-AEHSSSKGVLGED---LISFGNESQLTPQRA--VF 197

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFF 284
           GC   ++G      A DG+IGLG G++S+   L   GLI NSF +C+   D G    I  
Sbjct: 198 GCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILG 256

Query: 285 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTF 338
           G   P+    T        Y  Y I +    +    L   S        A++DSG+++ +
Sbjct: 257 GFDYPSDMVFTDSDPDRSPY--YNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAY 314

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQ 390
           LP   +        R+V+ T+   +G    +   C   ++S  + +L    PSV+++F  
Sbjct: 315 LPDAAFAAFEEAVMREVS-TLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKS 373

Query: 391 NNSFVVNNPVFVIYGTQV 408
             S++++   ++   ++V
Sbjct: 374 GQSWLLSPENYMFRHSKV 391


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 53/314 (16%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++  I +G P+  + V +D GSD+LW+ C  C +C   S      L   L  Y P++S 
Sbjct: 26  LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKS-----DLGIKLTLYDPASSV 80

Query: 161 TSKHLSCSHRLCDLGTSCQN-------PKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           ++  +SC    C   TS  N        + PC Y +  Y + +S++G  V D +      
Sbjct: 81  SATRVSCDDDFC---TSTYNGLLPDCKKELPCQYNV-VYGDGSSTAGYFVSDAVQFERVT 136

Query: 214 DNALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
            N        +V  GCG +QSGG    G A DG++G                    +F+ 
Sbjct: 137 GNLQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG--------------------AFAH 176

Query: 273 CFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------ 325
           C D  + G IF  G+       +T  + +   Y  Y+  +E   +G + L+  +      
Sbjct: 177 CLDNVNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIE---VGGTVLELPTDVFDSG 233

Query: 326 --FKAIVDSGSSFTFLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
                I+DSG++  +LP+ VY+++  E   +Q   ++ + E      C+K S       P
Sbjct: 234 DRRGTIIDSGTTLAYLPEVVYDSMMNEIRSQQPGLSLHTVE--EQFICFKYSGNVDDGFP 291

Query: 383 SVKLMFPQNNSFVV 396
            +K  F  + +  V
Sbjct: 292 DIKFHFKDSLTLTV 305


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 149/359 (41%), Gaps = 45/359 (12%)

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           YQ L  ++V++++ +     +  F +   +   + +D G        +G P V  LV +D
Sbjct: 55  YQSLDRNNVERRRTR-----RAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGID 109

Query: 122 AGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ- 179
            GSDLLW+ C  C  C   S   ++          PS SST   LS    +C      + 
Sbjct: 110 TGSDLLWVQCRPCADCFRQSTPIFD----------PSKSSTYVDLSYDSPICPNSPQKKY 159

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           N    C Y   Y   +TSS  L  EDI+   S           +SV+ GCG    G + D
Sbjct: 160 NHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV----TVSSVVFGCGHSNRGRF-D 214

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKD-DSGRIFFGDQGPATQQS 294
           G    G++GL  G+ S+ S L       + FS C    FD      ++  GD       S
Sbjct: 215 G-QQSGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSS 267

Query: 295 TSFLASNGKYITYIIGV---ETCC-IGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETI 347
           T F   NG Y   + G+   ET   I     ++T       ++DSG++ TFL K+ ++ +
Sbjct: 268 TPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPL 327

Query: 348 AAEFDRQVNDTITS--FEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN-NPVFV 402
           + E  R V        +   P   CYK   ++ L   P +   F +    V++ N +FV
Sbjct: 328 SNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 386


>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
          Length = 427

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 117/263 (44%), Gaps = 33/263 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +Y  ++IG P   F + +D GSDL W+ CD    AP +            +Y P+ ++  
Sbjct: 67  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD----APCNGC---------TKYKPNHNT-- 111

Query: 163 KHLSCSHRLC---DL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDN 215
             L CSH LC   DL     C +P+  C Y +  Y+++ SS G LV D   L L +G   
Sbjct: 112 --LPCSHILCSGLDLPQDRPCADPEDQCDYEIG-YSDHASSIGALVTDEVPLKLANGSIM 168

Query: 216 ALKNSVQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            L+      +  GCG  +Q+ G        G++GLG G++ + + L   G+ +N    C 
Sbjct: 169 NLR------LTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCL 222

Query: 275 DKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSG 333
                G +  GD+  P++  + + LA+N     Y+ G                  + DSG
Sbjct: 223 SHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSG 282

Query: 334 SSFTFLPKEVYETIAAEFDRQVN 356
           SS+T+   E Y+ I     + +N
Sbjct: 283 SSYTYFNAEAYQAILDLIRKDLN 305


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 149/359 (41%), Gaps = 45/359 (12%)

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           YQ L  ++V++++ +     +  F +   +   + +D G        +G P V  LV +D
Sbjct: 23  YQSLDRNNVERRRTR-----RAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGID 77

Query: 122 AGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ- 179
            GSDLLW+ C  C  C   S   ++          PS SST   LS    +C      + 
Sbjct: 78  TGSDLLWVQCRPCADCFRQSTPIFD----------PSKSSTYVDLSYDSPICPNSPQKKY 127

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           N    C Y   Y   +TSS  L  EDI+   S           +SV+ GCG    G + D
Sbjct: 128 NHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV----TVSSVVFGCGHSNRGRF-D 182

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKD-DSGRIFFGDQGPATQQS 294
           G    G++GL  G+ S+ S L       + FS C    FD      ++  GD       S
Sbjct: 183 G-QQSGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSS 235

Query: 295 TSFLASNGKYITYIIGV---ETCC-IGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETI 347
           T F   NG Y   + G+   ET   I     ++T       ++DSG++ TFL K+ ++ +
Sbjct: 236 TPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPL 295

Query: 348 AAEFDRQVNDTITS--FEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN-NPVFV 402
           + E  R V        +   P   CYK   ++ L   P +   F +    V++ N +FV
Sbjct: 296 SNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 354


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 88/302 (29%), Positives = 131/302 (43%), Gaps = 46/302 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP     + LD GSDL+W      +CAP      +  D+ +    P+ASST   L 
Sbjct: 90  LAVGTPPRPVALTLDTGSDLVW-----TQCAPCR----DCFDQGIPLLDPAASSTYAALP 140

Query: 167 CSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN---SV 221
           C    C     TSC    + C Y   +Y + + + G +  D       GDN  +N   S+
Sbjct: 141 CGAPRCRALPFTSCGG--RSCVYVY-HYGDKSVTVGKIATDRFTF---GDNGRRNGDGSL 194

Query: 222 QAS--VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK--D 277
            A+  +  GCG    G +       G+ G G G  S+PS L        SFS CF    D
Sbjct: 195 PATRRLTFGCGHFNKGVFQSN--ETGIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFD 247

Query: 278 DSGRIFFGDQGPAT---------QQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF 326
               I      PA           ++T    +  +   Y + ++   +G + L   +T F
Sbjct: 248 SKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKF 307

Query: 327 KA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---SSSQRLPKLP 382
           ++ I+DSG+S T LP+EVYE + AEF  QV    +  EG     C+    S+  R P +P
Sbjct: 308 RSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPAVP 367

Query: 383 SV 384
           S+
Sbjct: 368 SL 369


>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 117/265 (44%), Gaps = 35/265 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           +Y  + IG P   + +    GSDL W+ CD  CVRC       Y                
Sbjct: 67  YYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHXLYRP-------------- 112

Query: 161 TSKHLSCSHRLCDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            +  + C   +C      G  C++P+Q C Y ++Y  +  SS G+LV+D+  L     N 
Sbjct: 113 NNNLVICKDPMCAXLHPPGYKCEHPEQ-CDYEVEY-ADGGSSLGVLVKDVFPL--NFTNG 168

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           L+  +   + +GCG  Q  G      P DG++GLG G+ S+ S L   G+IRN    C  
Sbjct: 169 LR--LAPRLALGCGYDQIPG--XSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVS 224

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV---DS 332
               G +FFGD    + +         ++  Y  G     +G    K T FK ++   DS
Sbjct: 225 SHGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGG---KTTVFKNLLVTFDS 281

Query: 333 GSSFTFLPKEVYETIAAEFDRQVND 357
           GSS+T+L    Y+ +     +++++
Sbjct: 282 GSSYTYLNSLAYQALVHLVRKELSE 306


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 112/250 (44%), Gaps = 24/250 (9%)

Query: 115 SFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
           ++ + +D GS   ++PC  C RC   +  YY+  DR +          S    C      
Sbjct: 50  TYDLIVDTGSARTYVPCKGCARCGEHAHGYYD-YDRSMEFERLDCGEASDATLCEE---T 105

Query: 174 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 233
           +  +CQ+  + C Y + Y  E +SS G +V D + L  G       ++ A +  GC   +
Sbjct: 106 MKGTCQSDGR-CSYVVSY-AEGSSSRGYVVRDRVRLGEG-------TLSAMLAFGCEEAE 156

Query: 234 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-------GRIFFGD 286
           +    +  A DGL G G G  +V + LA AGLI N FS C +   +       GR  FG 
Sbjct: 157 TNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFGA 215

Query: 287 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-TSFKAIVDSGSSFTFLPKEVYE 345
             PA  + T  +A       + +   +  +G S ++   S+   +DSG++FTF+P+ V+ 
Sbjct: 216 DAPALAR-TPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWV 274

Query: 346 TIAAEFDRQV 355
           +     D Q 
Sbjct: 275 SFKTRLDTQA 284


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 88/328 (26%), Positives = 141/328 (42%), Gaps = 42/328 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT + +GTP   F V +D GSD+LW+ C+     P S+     L  +LN +    SST
Sbjct: 77  LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSS----QLGIELNFFDTVGSST 132

Query: 162 SKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH--LISGGD 214
           +  + CS  +C          C      C YT   Y + + +SG  V D ++  LI G  
Sbjct: 133 AALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQ-YGDGSGTSGYYVSDAMYFSLIMGQP 191

Query: 215 NALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
            A+ +S  A+++ GC + QSG       A DG+ G G G +SV S L+  G+    FS C
Sbjct: 192 PAVNSS--ATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHC 249

Query: 274 FDKDDSG------------RIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGS 318
              D  G             I +    P+      +   +A NG+ +     V +     
Sbjct: 250 LKGDGDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFS----- 304

Query: 319 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQ 376
             +       IVD G++  +L +E Y+ +    +  V+ +   T+ +G     CY  S+ 
Sbjct: 305 --ISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTS 359

Query: 377 RLPKLPSVKLMFPQNNSFVVNNPVFVIY 404
                PSV L F    S V+    ++++
Sbjct: 360 IGDIFPSVSLNFEGGASMVLKPEQYLMH 387


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 147/333 (44%), Gaps = 46/333 (13%)

Query: 99  FGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 158
           F  L++T + +G+P   F V +D GSD+LWI  +C+ C+  +  + + L  +L+ +  + 
Sbjct: 79  FVGLYFTKVKLGSPAKDFYVQIDTGSDILWI--NCITCS--NCPHSSGLGIELDFFDTAG 134

Query: 159 SSTSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--IS 211
           SST+  +SC+  +C        + C +    C YT   Y + + ++G  V D ++   + 
Sbjct: 135 SSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQ-YGDGSGTTGYYVSDTMYFDTVL 193

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
            G + + NS  ++++ GC   QSG       A DG+ G G G +SV S L+  G+    F
Sbjct: 194 LGQSMVANS-SSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVF 252

Query: 271 SMCFD--KDDSGRIFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCC 315
           S C    ++  G +  G+               P    +   +A NG+ +          
Sbjct: 253 SHCLKGGENGGGVLVLGEILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLP--------- 303

Query: 316 IGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG---YPWKCCY 371
           I S+    T+ +  IVDSG++  +L +E Y      F   +   ++ F          CY
Sbjct: 304 IDSNVFATTNNQGTIVDSGTTLAYLVQEAYN----PFVDAITAAVSQFSKPIISKGNQCY 359

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY 404
             S+      P V L F    S V+N   ++++
Sbjct: 360 LVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMH 392


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 90/321 (28%), Positives = 146/321 (45%), Gaps = 29/321 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T + +GTP + F V +D GSD+LW+ C+     P S+     L   LN +  S+SS+
Sbjct: 78  LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSS----GLGIQLNFFDASSSSS 133

Query: 162 SKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDN 215
           S  +SCS  +C+       T C      C YT   Y + + +SG  V + ++  +  G +
Sbjct: 134 SSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQ-YGDGSGTSGYYVSESMYFDMVMGQS 192

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            + NS  ASV+ GC   QSG       A DG+ G G G++SV S L+  G+    FS C 
Sbjct: 193 MIANS-SASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL 251

Query: 275 --DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKA 328
             + +  G +  G+        +  + S   Y  Y+  +    +T  I  S    +  + 
Sbjct: 252 KGEGNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRG 311

Query: 329 -IVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
            I+DSG++  +L +E Y      I A   + V  TI+         CY  S+      P 
Sbjct: 312 TIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISK-----GNQCYLVSTSVGEIFPL 366

Query: 384 VKLMFPQNNSFVVNNPVFVIY 404
           V L F  + S V+    ++++
Sbjct: 367 VSLNFAGSASMVLKPEEYLMH 387


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 148/359 (41%), Gaps = 45/359 (12%)

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           YQ L  ++V++++ +     +  F     +   + +D G        +G P V  LV +D
Sbjct: 23  YQSLDRNNVERRRTR-----RAAFIXDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGID 77

Query: 122 AGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ- 179
            GSDLLW+ C  C  C   S   ++          PS SST   LS    +C      + 
Sbjct: 78  TGSDLLWVQCRPCADCFRQSTPIFD----------PSKSSTYVDLSYDSPICPNSPQKKY 127

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           N    C Y   Y   +TSS  L  EDI+   S           +SV+ GCG    G + D
Sbjct: 128 NHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV----TVSSVVFGCGHSNRGRF-D 182

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKD-DSGRIFFGDQGPATQQS 294
           G    G++GL  G+ S+ S L       + FS C    FD      ++  GD       S
Sbjct: 183 G-QQSGILGLSAGDQSIVSRLG------SRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSS 235

Query: 295 TSFLASNGKYITYIIGV---ETCC-IGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETI 347
           T F   NG Y   + G+   ET   I     ++T       ++DSG++ TFL K+ ++ +
Sbjct: 236 TPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPL 295

Query: 348 AAEFDRQVNDTITS--FEGYPWKCCYKSS-SQRLPKLPSVKLMFPQNNSFVVN-NPVFV 402
           + E  R V        +   P   CYK   ++ L   P +   F +    V++ N +FV
Sbjct: 296 SNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 354


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/316 (27%), Positives = 135/316 (42%), Gaps = 50/316 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP   F   +D GSDL W+ C  C RC           ++    + P ASS+  + 
Sbjct: 12  ISLGTPPQQFSAIVDTGSDLCWVQCAPCARC----------FEQPDPLFIPLASSSYSNA 61

Query: 166 SCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           SC+  LCD L     + +  C Y+  Y   + +      E +          L  S  A 
Sbjct: 62  SCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETV---------TLNGSTLAR 112

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR-- 281
           +  GCG  Q G +      DGLIGLG G +S+PS L  +    + FS C  D+  +G   
Sbjct: 113 IGFGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFS 167

Query: 282 -IFFGDQGPATQQS-TSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFK--------AI 329
            I FG+    ++ S T  L +      Y +GVE+  +G+  +    ++F+         I
Sbjct: 168 PITFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVI 227

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY-----KSSSQRLP----K 380
           +DSG++ T+     +  I AE  RQ++        Y    CY      +SS  LP     
Sbjct: 228 LDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVH 287

Query: 381 LPSVKLMFPQNNSFVV 396
           L +V    P +N +V+
Sbjct: 288 LTNVDFEIPVSNLWVL 303


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/300 (27%), Positives = 126/300 (42%), Gaps = 35/300 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +  GTP  ++ V  D GSD+ WI     +C P S   Y   D     + P+ S+T   + 
Sbjct: 139 VGFGTPAQTYTVIFDTGSDVSWI-----QCLPCSGHCYKQHDP---IFDPTKSATYSVVP 190

Query: 167 CSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C H  C    G+ C N    C Y ++Y  + +SS+G+L  + L L S             
Sbjct: 191 CGHPQCAAADGSKCSNGT--CLYKVEY-GDGSSSAGVLSHETLSLTS-------TRALPG 240

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRI 282
              GCG    G + D    DGLIGLG G++S+ S  A +     +FS C   D++  G +
Sbjct: 241 FAFGCGQTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYL 295

Query: 283 FFGDQGPATQQSTSFLASNGK--YIT-YIIGVETCCIGSSCLKQ-----TSFKAIVDSGS 334
             G   PA+     + A   K  Y + Y + + +  IG   L       T     +DSG+
Sbjct: 296 TIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGT 355

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 394
             T+LP E Y  +   F   +     +    P+  CY  + Q    +P+V   F   + F
Sbjct: 356 ILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVF 415


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 145/363 (39%), Gaps = 66/363 (18%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
           + +   + L E   A    FS  LIHR S        SK R     +A    A +   + 
Sbjct: 13  VVVGFLFHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGRFR 72

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
           Q  ++SD  + +         L PS G   M+L             IGTP V  +  +D 
Sbjct: 73  QSAMTSDGIQSR---------LVPSAGEYIMNL------------SIGTPPVPVIAIVDT 111

Query: 123 GSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT--SC 178
           GSDL W  C  C  C      +++          P  SST +  SC    C  LG   SC
Sbjct: 112 GSDLTWTQCRPCTHCYKQVVPFFD----------PKNSSTYRDSSCGTSFCLALGNDRSC 161

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 238
           +N K+ C + M  Y + + + G L  + L + S    A K         GC + +SGG  
Sbjct: 162 RNGKK-CTF-MYSYADGSFTGGNLAVETLTVAS---TAGKPVSFPGFAFGC-VHRSGGIF 215

Query: 239 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQG---PA 290
           D  +  G++GLG+ E+S+ S L     I   FS C      D   S RI FG  G    A
Sbjct: 216 DEHS-SGIVGLGVAELSMISQLKST--INGRFSYCLLPVFTDSSMSSRINFGRSGIVSGA 272

Query: 291 TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA---------IVDSGSSFTFLPK 341
              ST  +        Y+I +E   +G   L    F           IVDSG+++T+LP 
Sbjct: 273 GTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPL 332

Query: 342 EVY 344
           E Y
Sbjct: 333 EFY 335


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 146/318 (45%), Gaps = 43/318 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D+GS + ++PC DC +C                ++ P  SST + + C
Sbjct: 100 IGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDP----------KFQPELSSTYQPVKC 149

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                ++  +C + K+ C Y  +Y  E++SS G+L ED   LIS G+ +     +A  + 
Sbjct: 150 -----NMDCNCDDDKEQCVYEREY-AEHSSSKGVLGED---LISFGNESQLTPQRA--VF 198

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFF 284
           GC   ++G      A DG+IGLG G++S+   L   GLI NSF +C+   D G    I  
Sbjct: 199 GCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILG 257

Query: 285 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTF 338
           G   P+    T        Y  Y I +    +    L   S        A++DSG+++ +
Sbjct: 258 GFDYPSDMIFTDSDPDRSPY--YNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAY 315

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEG----YPWKCCYKSSSQRLPKL----PSVKLMFPQ 390
           LP   +        R+V+  +   +G    +   C   ++S  + +L    PSV+++F  
Sbjct: 316 LPDAAFAAFEEAVMREVS-PLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKS 374

Query: 391 NNSFVVNNPVFVIYGTQV 408
             S++++   ++   ++V
Sbjct: 375 GQSWLLSPENYMFRHSKV 392


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/283 (31%), Positives = 121/283 (42%), Gaps = 52/283 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V F+   D GSDL W  C  C  C P          +D   Y PSASST   +
Sbjct: 70  LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPV 119

Query: 166 SCSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
            CS   C L T    +C NP  PC Y    Y++   S G+L  + L +   G +    +V
Sbjct: 120 PCSSATC-LPTWRSRNCSNPSSPCRYIYS-YSDGAYSVGILGTETLTI---GSSVPGQTV 174

Query: 222 Q-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDD 278
              SV  GCG    G  L+     G +GLG G +   SLLA+ G+ + S+ +   F+   
Sbjct: 175 SVGSVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGVGKFSYCLTDFFNSTM 228

Query: 279 SGRIFFGD-----QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKA- 328
               F G       GP T QST  L S      Y + ++   +G   L   +     +A 
Sbjct: 229 DSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRAD 288

Query: 329 -----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
                +VDSG++FT L K  +        R+V D +    G P
Sbjct: 289 GNGGMMVDSGTTFTILAKSGF--------REVVDRVAQLLGQP 323


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 117/260 (45%), Gaps = 25/260 (9%)

Query: 96  GNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   +Y+ I +IG P  +F   +D GSDL W+ CD  C  C       Y    +  N
Sbjct: 46  GNVYPTGYYSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLY----KPKN 101

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-IS 211
              P ++S  + +S           C  P   C Y ++Y  +  SS G+L+ D   L +S
Sbjct: 102 NLVPCSNSLCQAVSTGENY-----HCDAPDDQCDYEIEY-ADLGSSIGVLLSDSFPLRLS 155

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD---GLIGLGLGEISVPSLLAKAGLIRN 268
            G       +Q  +  GCG  Q   +L    P    G++GLG G++S+ S L   G+ +N
Sbjct: 156 NG-----TLLQPKMAFGCGYDQK--HLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQN 208

Query: 269 SFSMCFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
               CF +   G +FFGD   P+++ + + +  +     Y  G      G         +
Sbjct: 209 VVGHCFSRARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQ 268

Query: 328 AIVDSGSSFTFLPKEVYETI 347
            I DSGSS+T+   +VY++I
Sbjct: 269 LIFDSGSSYTYFNAQVYQSI 288


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/325 (27%), Positives = 143/325 (44%), Gaps = 39/325 (12%)

Query: 94  SLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE 153
           +L  +    ++  + +GTP ++F   +D GSDL W      +CAP + + +    +    
Sbjct: 87  ALAENGAGAYHMILSVGTPPLAFPAIIDTGSDLTW-----TQCAPCTTACFA---QPTPL 138

Query: 154 YSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           Y P+ SST   L C+  LC    S            DY      ++G L  D L +  G 
Sbjct: 139 YDPARSSTFSKLPCASPLCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGD 198

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
            +   +S  A V  GC    +GG +DG +  G++GLG   +   SLL++ G+ R  FS C
Sbjct: 199 GDGDASSSFAGVAFGCS-TANGGDMDGAS--GIVGLGRSAL---SLLSQIGVGR--FSYC 250

Query: 274 FDKD-DSGR--IFFGDQGPATQ---QSTSFL----ASNGKYITYIIGVETCCIGSSCLKQ 323
              D D+G   I FG     T    QST+ L    A+  +   Y + +    +GS+ L  
Sbjct: 251 LRSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPV 310

Query: 324 TS----FKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG--YPWKCCY 371
           TS    F A      IVDSG++FT+L +  Y  +   F  Q    +T   G  + +  C+
Sbjct: 311 TSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCF 370

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVV 396
           ++ +   P +P +   F     + V
Sbjct: 371 EAGAADTP-VPRLVFRFAGGAEYAV 394


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/302 (27%), Positives = 125/302 (41%), Gaps = 41/302 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP     + LD GSDL+W  C  C  C   S  YY++          S SST    
Sbjct: 95  LAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDA----------SRSSTFALP 144

Query: 166 SCSHRLCDLG---TSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           SC    C L    T C N   Q C ++  Y  + +++ G L  + +  ++G         
Sbjct: 145 SCDSTQCKLDPSVTMCVNQTVQTCAFSYSY-GDKSATIGFLDVETVSFVAGAS------- 196

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
              V+ GCG+  +G +       G+ G G G +S+PS L K G   + F+    +  S  
Sbjct: 197 VPGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVSGRKPSTV 253

Query: 282 IF-----FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFK 327
           +F         G  T Q+T  + +      Y + ++   +GS+          LK  +  
Sbjct: 254 LFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGG 313

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
            I+DSG++FT LP  VY  +  EF   V    + S E  P  C       + P +P + L
Sbjct: 314 TIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVL 373

Query: 387 MF 388
            F
Sbjct: 374 HF 375


>gi|388513215|gb|AFK44669.1| unknown [Lotus japonicus]
          Length = 101

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/83 (43%), Positives = 56/83 (67%), Gaps = 2/83 (2%)

Query: 21  GAETVMFSTKLIHRFSEEVKALGVSKNRNAT--SWPAKKSFEYYQVLLSSDVQKQKMKTG 78
           G   V FS++L+HRFSEE K    S+   A   SWP K + EY+++LL+SD+ +Q+MK G
Sbjct: 19  GEAAVTFSSRLVHRFSEEAKVHLASRGNGAALQSWPNKSTSEYFRLLLNSDLTRQRMKLG 78

Query: 79  PQFQMLFPSQGSKTMSLGNDFGW 101
            Q++ ++PS+G +T   GN++ W
Sbjct: 79  SQYESMYPSKGGQTFFFGNEWNW 101


>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Glycine max]
          Length = 454

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 117/273 (42%), Gaps = 49/273 (17%)

Query: 96  GNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   HYT  ++IG P   + + +D+GSDL W+ CD  C  C         +  RD  
Sbjct: 56  GNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGC---------TKPRD-Q 105

Query: 153 EYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
            Y P+ +     + C  +LC      +  +C +P  PC Y ++Y  ++ SS G+LV D +
Sbjct: 106 LYKPNHNL----VQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEY-ADHGSSLGVLVRDYI 160

Query: 208 HL-ISGGDNALKNSVQASVIIGCGMKQS-GGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
               + G     + V+  V  GCG  Q   G     A  G++GLG G  S+ S L   GL
Sbjct: 161 PFQFTNG-----SVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGL 215

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYII----------GVETCC 315
           IRN    C      G +FFGD          F+ S+G   T ++          G     
Sbjct: 216 IRNVVGHCLSAQGGGFLFFGDD---------FIPSSGIVWTSMLSSSSEKHYSSGPAELV 266

Query: 316 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 348
                      + I DSGSS+T+   + Y+ + 
Sbjct: 267 FNGKATAVKGLELIFDSGSSYTYFNSQAYQAVV 299


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/314 (27%), Positives = 125/314 (39%), Gaps = 46/314 (14%)

Query: 100 GWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 157
           G L Y   + +GTP       LD GSDL+W  C  C  C P               +SP 
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPI----------FSPG 149

Query: 158 ASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           ASS+ + + C+  LC+  L  SCQ P   C Y    Y + T++ G+   +     S    
Sbjct: 150 ASSSYEPMRCAGELCNDILHHSCQRPDT-CTYRYS-YGDGTTTRGVYATERFTFSSSSSG 207

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                + A +  GCG    G   +G    G++G G   +S+ S LA    IR  FS C  
Sbjct: 208 GETTKLSAPLGFGCGTMNKGSLNNG---SGIVGFGRAPLSLVSQLA----IRR-FSYCLT 259

Query: 276 KDDSGR---IFFG-------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--- 322
              SGR   + FG       D   AT Q+T  L S      Y +      +G+  L+   
Sbjct: 260 PYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPI 319

Query: 323 -------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS--FEGYPWKCCYKS 373
                    S  AIVDSG++ T  P  V   +   F  Q+     +    G     C+ +
Sbjct: 320 SAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAA 379

Query: 374 SSQRLPKLPSVKLM 387
           ++ R+P+   V  M
Sbjct: 380 AASRVPRPAVVPRM 393


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 119/262 (45%), Gaps = 40/262 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V +L   D GSDL W  C  C++C       Y  L    N   P  S++  H+
Sbjct: 96  VSIGTPPVDYLGIADTGSDLTWAQCLPCLKC-------YQQLRPIFN---PLKSTSFSHV 145

Query: 166 SCSHRLCDLGTSCQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            C+ + C          Q  C Y+  Y     S   L  E     I+ G +++K+     
Sbjct: 146 PCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEK----ITIGSSSVKS----- 196

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGR 281
            +IGCG   SGG+  G A  G+IGLG G++S+ S +++   I   FS C        +G+
Sbjct: 197 -VIGCGHASSGGF--GFA-SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK 252

Query: 282 IFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KAIVDSG 333
           I FG+     GP    +   L S      Y I +E   IG+   +  +F      I+DSG
Sbjct: 253 INFGENAVVSGPGVVSTP--LISKNTVTYYYITLEAISIGNE--RHMAFAKQGNVIIDSG 308

Query: 334 SSFTFLPKEVYETIAAEFDRQV 355
           ++ T LPKE+Y+ + +   + V
Sbjct: 309 TTLTILPKELYDGVVSSLLKVV 330


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/265 (28%), Positives = 121/265 (45%), Gaps = 28/265 (10%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSAS 159
           L+YT+I +G P   + + +D GSDL W+ CD  C  C    +  Y     ++  +  S  
Sbjct: 198 LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFKDSLC 257

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALK 218
              +     +   D   +CQ     C Y +  Y + +SS G+LV+D   L  S G     
Sbjct: 258 MEVQR----NYDGDQCAACQQ----CNYEVQ-YADQSSSLGVLVKDEFTLRFSNG----- 303

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
           +  + + I GC   Q G  L+ ++  DG++GL   ++S+PS LA  G+I N    C   D
Sbjct: 304 SLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTGD 363

Query: 278 DS--GRIFFGDQGPATQQSTSFLA-----SNGKYITYIIGVETCCIGSSCLKQTSFK--A 328
            +  G +F GD     Q   +++A     S   Y T ++ ++   I  S     S +   
Sbjct: 364 PAGGGYLFLGDDF-VPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQV 422

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDR 353
           + DSGSS+T+  KE Y  + A  + 
Sbjct: 423 VFDSGSSYTYFTKEAYYQLVANLEE 447


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 76/279 (27%), Positives = 123/279 (44%), Gaps = 36/279 (12%)

Query: 90  SKTMSLGNDF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNS 146
           S  M L +D      + T + IGTP   F + +D+GS + ++PC  C +C        N 
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCG-------NH 125

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
            D     + P  SST   + C     ++  +C + K  C Y   Y  E +SSSG+L EDI
Sbjct: 126 QD---PRFQPDLSSTYSPVKC-----NVDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDI 176

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +    G ++ LK       + GC   ++G      A DG++GLG G++S+   L   G+I
Sbjct: 177 VSF--GTESELKPQ---RAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVI 230

Query: 267 RNSFSMCFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK- 322
            +SFSMC+   D G    +      P     T   A    Y  Y I ++   +    L+ 
Sbjct: 231 GDSFSMCYGGMDIGGGAMVLGAMPAPPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRV 288

Query: 323 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
                      ++DSG+++ +LP++ +         QV+
Sbjct: 289 DPRIFDGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVH 327


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 80/301 (26%), Positives = 131/301 (43%), Gaps = 42/301 (13%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP   F + +D GS + ++PC  C +C                 + P +SST K
Sbjct: 90  TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCG----------KHQDPRFQPESSSTYK 139

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            + C     +   +C +  + C Y   Y  E +SSSGLL ED+L    G ++ L      
Sbjct: 140 PMQC-----NPSCNCDDEGKQCTYERRY-AEMSSSSGLLAEDVLSF--GNESEL---TPQ 188

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGR 281
             I GC   ++G      A DG++GLG G +SV   L    ++ NSFS+C+   D   G 
Sbjct: 189 RAIFGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGA 247

Query: 282 IFFGDQGPATQQSTSFLASNGKYIT--YIIGVETCCIGSSCLK------QTSFKAIVDSG 333
           +  G+  P         A +  Y +  Y I ++   +    LK            ++DSG
Sbjct: 248 MVLGNIPPPPDM---VFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSG 304

Query: 334 SSFTFLPKEVY----ETIAAE--FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           +++ +LP+E +    + I  E  F +Q++    S+    +    +  SQ     P V ++
Sbjct: 305 TTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMV 364

Query: 388 F 388
           F
Sbjct: 365 F 365


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 147/340 (43%), Gaps = 33/340 (9%)

Query: 90  SKTMSLGNDFGWLHY--TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY--Y 144
           S  M+L +D     Y  + + IGTP   F + +D GS + ++PC  C  C    AS+  +
Sbjct: 25  SARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTH 84

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
               RD   + P  SS+ + + C    C  G  C +    C Y    Y E ++S G+L +
Sbjct: 85  RLFCRD-PRFKPENSSSYQKIGCRSSDCITGL-CDSNSHQCKYER-MYAEMSTSKGVLGK 141

Query: 205 DILHLISGGDNALKNSVQASVI-IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           D+L      D    + +Q+ ++  GC   +SG     VA DG++GLG G +S+   L   
Sbjct: 142 DLL------DFGPASRLQSQLLSFGCETAESGDLYLQVA-DGIMGLGRGPLSIVDQLVGN 194

Query: 264 GLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-IGVETCCIGSSCLK 322
           G I +SFS+C+   D G                F  S+ +   Y  + +    +  + LK
Sbjct: 195 GAIEDSFSLCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLK 254

Query: 323 QTS------FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG----YPWKCCYK 372
             S      F  I+DSG+++ +LP   +E        Q+  ++ + +G    YP   CY 
Sbjct: 255 LDSNVFNGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLG-SLQAVDGPDPNYP-DICYA 312

Query: 373 SSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIYGTQV 408
            +     +L    P V  +F +N    +    ++   T+V
Sbjct: 313 GAGTDTKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKV 352


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 130/303 (42%), Gaps = 46/303 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP       +D GSD +W  C    C P        L++    ++PS SST K++ CS
Sbjct: 96  IGTPPFQLYGVVDTGSDGIWFQCK--PCKPC-------LNQTSPIFNPSKSSTYKNIRCS 146

Query: 169 HRLCDLG--TSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
             +C  G  T C  N K+ C Y + Y  + + S G + +D L L S   + +       +
Sbjct: 147 SPICKRGEKTRCSSNRKRKCEYEITYL-DRSGSQGDISKDTLTLNSNDGSPIS---FPKI 202

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDD-SG 280
           +IGCG K S    +G+A  G+IG G G  S+ S L  +  I   FS C    F K + S 
Sbjct: 203 VIGCGHKNSLT-TEGLA-SGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISS 258

Query: 281 RIFFGDQGPATQQST-------SFLASNGKYITYIIGVETCCIG--------SSCLKQTS 325
           +++FGD    +           SF   N     Y   +E   +G        SS +    
Sbjct: 259 KLYFGDMAVVSGHGVVSTPLIQSFYVGN-----YFTNLEAFSVGDHIIKLKDSSLIPDNE 313

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
             A++DSGS+ T LP +VY  +       V              CYK++ ++  ++P + 
Sbjct: 314 GNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKY-EVPIIT 372

Query: 386 LMF 388
             F
Sbjct: 373 AHF 375


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 132/316 (41%), Gaps = 43/316 (13%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
            P+Q   ++  GN     +   + +GTP   + V  D GSDL W+ C  C  C       
Sbjct: 136 LPAQRGISLGTGN-----YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADC------- 183

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
           Y   D     + PS SST   ++C    C +L  S  +    C Y +  Y + + + G L
Sbjct: 184 YEQQD---PLFDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQ-YGDQSQTDGNL 239

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           V D L L +       +      + GCG  Q+ G    V  DGL GLG  ++S+PS  A 
Sbjct: 240 VRDTLTLSA-------SDTLPGFVFGCG-DQNAGLFGQV--DGLFGLGREKVSLPSQGAP 289

Query: 263 AGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQSTSFL--ASNGKYITYIIGVETCCIGS 318
           +      F+ C     SGR +   G   PA  Q T+    A+   Y   ++G++   +G 
Sbjct: 290 S--YGPGFTYCLPSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIK---VGG 344

Query: 319 SCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
             ++        +   ++DSG+  T LP   Y  + A F R +     +        CY 
Sbjct: 345 RAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYD 404

Query: 373 SSSQRLPKLPSVKLMF 388
            +  R  ++P+V+L F
Sbjct: 405 FTGHRTAQIPTVELAF 420


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 76/279 (27%), Positives = 123/279 (44%), Gaps = 36/279 (12%)

Query: 90  SKTMSLGNDF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNS 146
           S  M L +D      + T + IGTP   F + +D+GS + ++PC  C +C        N 
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCG-------NH 125

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
            D     + P  SST   + C     ++  +C + K  C Y   Y  E +SSSG+L EDI
Sbjct: 126 QD---PRFQPDLSSTYSPVKC-----NVDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDI 176

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +    G ++ LK       + GC   ++G      A DG++GLG G++S+   L   G+I
Sbjct: 177 VSF--GTESELKPQ---RAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVI 230

Query: 267 RNSFSMCFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK- 322
            +SFSMC+   D G    +      P     T   A    Y  Y I ++   +    L+ 
Sbjct: 231 GDSFSMCYGGMDIGGGAMVLGAMPAPPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRV 288

Query: 323 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
                      ++DSG+++ +LP++ +         QV+
Sbjct: 289 DPRIFDGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVH 327


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/313 (27%), Positives = 131/313 (41%), Gaps = 50/313 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +   + IGTP     + LD GSDL+W      +C P  A +    D+ L  + PS SST 
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTL 85

Query: 163 KHLSCSHRLCD--LGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
              SC   LC      SC +PK    Q C YT  Y  + + ++G L  D    +  G + 
Sbjct: 86  SLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV 144

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                   V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF  
Sbjct: 145 ------PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTT 191

Query: 277 -----------DDSGRIFFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
                      D    +F   QG   T     +  +      Y + ++   +GS+ L   
Sbjct: 192 ITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVP 251

Query: 323 QTSFK-------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
           +++F         I+DSG+S T LP +VY+ +  EF  Q+   +          C+ + S
Sbjct: 252 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPS 311

Query: 376 QRLPKLPSVKLMF 388
           Q  P +P + L F
Sbjct: 312 QAKPDVPKLVLHF 324


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/299 (28%), Positives = 127/299 (42%), Gaps = 44/299 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRD-LNEYSPSASSTSKHL 165
           + +GTP     + LD GSDL+W      +CAP      N  D+  +    P+ASST   +
Sbjct: 98  LSVGTPPRPVALTLDTGSDLVW-----TQCAPC----LNCFDQGAIPVLDPAASSTHAAV 148

Query: 166 SCSHRLCDL--GTSCQNP-----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            C   +C     TSC        ++ C Y   +Y + + + G L  D       GDNA  
Sbjct: 149 RCDAPVCRALPFTSCGRGGSSWGERSCVYVY-HYGDKSITVGKLASDRF-TFGPGDNADG 206

Query: 219 NSV-QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-- 275
             V +  +  GCG    G +       G+ G G G  S+PS L        SFS CF   
Sbjct: 207 GGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFSYCFTSM 259

Query: 276 -KDDSGRIFFGDQGPAT------QQSTSFLASNGKYITYIIGVETCCIGSSCL------- 321
            +  S  +  G   PA        QST  L    +   Y + ++   +G++ +       
Sbjct: 260 FESTSSLVTLG-VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQ 318

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
           +     AI+DSG+S T LP++VYE + AEF  QV   +++ EG     C+   S   PK
Sbjct: 319 RLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPK 377


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 139/319 (43%), Gaps = 60/319 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +++G  N+S +V  D GSDL W     V+C P  + Y    ++    Y PS SS+ K + 
Sbjct: 142 VELGGKNMSLIV--DTGSDLTW-----VQCQPCRSCY----NQQGPLYDPSVSSSYKTVF 190

Query: 167 CSHRLC-DLGTSCQNP----------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           C+   C DL  +  N           K  C Y + Y   + +   L  E I+     GD 
Sbjct: 191 CNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVL----GDT 246

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 274
            L+N     ++ GCG + + G   G +  GL+GLG   +S+ S   K       FS C  
Sbjct: 247 KLEN-----LVFGCG-RNNKGLFGGAS--GLMGLGRSSVSLVSQTLKT--FNGVFSYCLP 296

Query: 275 --DKDDSGRIFFGDQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSFK 327
             +   SG + FG+     + STS     L  N +  + YI+ +    IG   LK  SF 
Sbjct: 297 SLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFG 356

Query: 328 A--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRL 378
              ++DSG+  T LP  +Y+ +  EF +Q       F G+P          C+  +S   
Sbjct: 357 RGILIDSGTVITRLPPSIYKAVKTEFLKQ-------FSGFPSAPGYSILDTCFNLTSYED 409

Query: 379 PKLPSVKLMFPQNNSFVVN 397
             +P++K++F  N    V+
Sbjct: 410 ISIPTIKMIFEGNAELEVD 428


>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 320

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 107/220 (48%), Gaps = 25/220 (11%)

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG-VAPDGLIGLG 250
           Y + +S++G LV+D++HL     N    S   ++I GCG KQSG   +   A DG++G G
Sbjct: 2   YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61

Query: 251 LGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF-FGDQGPATQQSTSFLASNGKYITYII 309
               S  S LA  G ++ SF+ C D ++ G IF  G+      ++T  L+ +  Y   + 
Sbjct: 62  QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLN 121

Query: 310 GVETCCIGSSCLKQTSFK--------AIVDSGSSFTFLPKEVY-----ETIAAEFDRQVN 356
            +E   +G+S L+ +S           I+DSG++  +LP  VY     E +A+  +  ++
Sbjct: 122 AIE---VGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLH 178

Query: 357 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
               SF  + +       + +L + P+V   F ++ S  V
Sbjct: 179 TVQESFTCFHY-------TDKLDRFPTVTFQFDKSVSLAV 211


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 144/339 (42%), Gaps = 44/339 (12%)

Query: 89  GSKTMSLGNDFGWLHY--TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYN 145
           GS  M L +D     Y  + + IGTP   F + +D GS + ++PC  C  C        N
Sbjct: 19  GSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCG-------N 71

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
             D     +SP+ SS+ K L C    C  G  C   ++        Y E ++SSG+L +D
Sbjct: 72  HQD---PRFSPALSSSYKPLECGSE-CSTGF-CDGSRK----YQRQYAEKSTSSGVLGKD 122

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
           ++   +  D   +      ++ GC   ++G   D  A DG+IGLG G +S+   L +   
Sbjct: 123 VIGFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNA 176

Query: 266 IRNSFSMCFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 322
           + + FS+C+   D G    I  G Q P     T+       Y  Y + ++   +G S L+
Sbjct: 177 MEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTASDPHRSPY--YNLMLKGIRVGGSPLR 234

Query: 323 ------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---CCYKS 373
                    +  ++DSG+++ + P   ++   +    QV  ++    G   K    CY  
Sbjct: 235 LKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVG-SLKEVPGPDEKFKDICYAG 293

Query: 374 SSQRLPKL----PSVKLMFPQNNSFVVNNPVFVIYGTQV 408
           +   +  L    PSV  +F    S  ++   ++   T++
Sbjct: 294 AGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKI 332


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 132/316 (41%), Gaps = 43/316 (13%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
            P+Q   ++  GN     +   + +GTP   + V  D GSDL W+ C  C  C       
Sbjct: 136 LPAQRGISLGTGN-----YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADC------- 183

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
           Y   D     + PS SST   ++C    C +L  S  +    C Y +  Y + + + G L
Sbjct: 184 YEQQD---PLFDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQ-YGDQSQTDGNL 239

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           V D L L +       +      + GCG  Q+ G    V  DGL GLG  ++S+PS  A 
Sbjct: 240 VRDTLTLSA-------SDTLPGFVFGCG-DQNAGLFGQV--DGLFGLGREKVSLPSQGAP 289

Query: 263 AGLIRNSFSMCFDKDDSGRIF--FGDQGPATQQSTSFL--ASNGKYITYIIGVETCCIGS 318
           +      F+ C     SGR +   G   PA  Q T+    A+   Y   ++G++   +G 
Sbjct: 290 S--YGPGFTYCLPSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIK---VGG 344

Query: 319 SCLK------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
             ++        +   ++DSG+  T LP   Y  + A F R +     +        CY 
Sbjct: 345 RAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYD 404

Query: 373 SSSQRLPKLPSVKLMF 388
            +  R  ++P+V+L F
Sbjct: 405 FTGHRTAQIPTVELAF 420


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 161/385 (41%), Gaps = 64/385 (16%)

Query: 30  KLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQG 89
           KL HRFSE   +   S  R  +        E+++ L+     + +        ML  S  
Sbjct: 31  KLKHRFSELEGSSKQSGKRGMSE-------EHFRQLMDHTRARSRRFLLEVDLMLNGSST 83

Query: 90  SKTMSLGNDFGWLHYTWIDIGTPNVSFLVAL-DAGSDLLWIPCD-CVRCAPLSASYYNS- 146
           S            +Y  I +G P V FL A+ D GSD+LW  C  C  C+        S 
Sbjct: 84  SDAT---------YYAQIGVGHP-VQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSS 133

Query: 147 --LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
             +   +  Y P  S T+   +CS  LC  G SC+     C Y +  Y + +SS+G+   
Sbjct: 134 IIMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNNSCAYDIS-YEDTSSSTGIYFR 192

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           D++HL        K S+  ++ +GC    SG +      DG++G G  ++SVP+ LA   
Sbjct: 193 DVVHL------GHKASLNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQA 242

Query: 265 LIRNSFSMCF--DKDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
              N F  C   +K+  G +  G  D+ P     T  LA++   I Y + + +  + S  
Sbjct: 243 GSYNIFYHCLSGEKEGGGILVLGKNDEFPEMVY-TPMLAND---IVYNVKLVSLSVNSKA 298

Query: 321 L--KQTSFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
           L  + + F+          I+DSG+S    P +      A F + V+   T+    P + 
Sbjct: 299 LPIEASEFEYNATVGNGGTIIDSGTSSATFPSKAL----ALFVKAVSKFTTAIPTAPLES 354

Query: 370 ----CYKSSSQRLPKLPSVKLMFPQ 390
               C+ S S R     SV++ FP 
Sbjct: 355 SGSPCFISISDR----NSVEVDFPN 375


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 83/318 (26%), Positives = 144/318 (45%), Gaps = 42/318 (13%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T I IGTP  +F + +D GS + ++PC  C +C                ++ P  SST +
Sbjct: 92  TRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFEPELSSTYQ 141

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            +SC     ++  +C N ++ C Y   Y  E +SSSG+L EDI   IS G+ +    V  
Sbjct: 142 PVSC-----NIDCTCDNERKQCVYERQY-AEMSSSSGVLGEDI---ISFGNQS--ELVPQ 190

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGR 281
             I GC  +++G      A DG++GLG G++S+   L + G+I +SFS+C+   D   G 
Sbjct: 191 RAIFGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGA 249

Query: 282 IFFGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGS 334
           +  G   P +     F  S+  +   Y I ++   +    L             ++DSG+
Sbjct: 250 MILGGISPPS--GMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGT 307

Query: 335 SFTFLPKEVY----ETIAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           ++ +LP+  +    + +  E    +Q++    ++    +       SQ     P+V+++F
Sbjct: 308 TYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVF 367

Query: 389 P--QNNSFVVNNPVFVIY 404
              Q  S    N +F  Y
Sbjct: 368 SNGQKLSLSPENYLFQYY 385


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 82/302 (27%), Positives = 124/302 (41%), Gaps = 41/302 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP     + LD GS L+W  C  C  C   S  YY++          S SST    
Sbjct: 39  LAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDA----------SRSSTFALP 88

Query: 166 SCSHRLCDLG---TSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           SC    C L    T C N   Q C Y+  Y  + +++ G L  + +  ++G         
Sbjct: 89  SCDSTQCKLDPSVTMCVNQTVQTCAYSYSY-GDKSATIGFLDVETVSFVAGAS------- 140

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
              V+ GCG+  +G +       G+ G G G +S+PS L K G   + F+    +  S  
Sbjct: 141 VPGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVSGRKPSTV 197

Query: 282 IF-----FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFK 327
           +F         G  T Q+T  + +      Y + ++   +GS+          LK  +  
Sbjct: 198 LFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGG 257

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
            I+DSG++FT LP  VY  +  EF   V    + S E  P  C       + P +P + L
Sbjct: 258 TIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVL 317

Query: 387 MF 388
            F
Sbjct: 318 HF 319


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/315 (27%), Positives = 135/315 (42%), Gaps = 35/315 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           H   I IGTP +     +D GSDL+WI     +CAP    Y     +    + P  SST 
Sbjct: 68  HLMEIYIGTPPIKITGLVDTGSDLIWI-----QCAPCLGCY----KQIKPMFDPLKSSTY 118

Query: 163 KHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
            ++SC   LC  L T   +P++ C YT   Y +N+ + G+L +D     S   N  K   
Sbjct: 119 NNISCDSPLCHKLDTGVCSPEKRCNYTYG-YGDNSLTKGVLAQDTATFTS---NTGKPVS 174

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI--RNSFSMCF----- 274
            +  + GCG   +GG+ D     GLIGLG G     SL+++ G +     FS C      
Sbjct: 175 LSRFLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKFSQCLVPFLT 229

Query: 275 DKDDSGRIFFGD--QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKA 328
           D   S R+ FG   Q       T+ L    K  +Y + +    +  +     S       
Sbjct: 230 DIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKANM 289

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
           +VDSG+    LP+++Y+ + AE   +V    IT       + CY++ +    K P++   
Sbjct: 290 LVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNL--KGPTLTFH 347

Query: 388 FPQNNSFVVNNPVFV 402
           F   N  +     F+
Sbjct: 348 FVGANVLLTPIQTFI 362


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 82/302 (27%), Positives = 124/302 (41%), Gaps = 41/302 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP     + LD GS L+W  C  C  C   S  YY++          S SST    
Sbjct: 95  LAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDA----------SRSSTFALP 144

Query: 166 SCSHRLCDLG---TSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           SC    C L    T C N   Q C Y+  Y  + +++ G L  + +  ++G         
Sbjct: 145 SCDSTQCKLDPSVTMCVNQTVQTCAYSYSY-GDKSATIGFLDVETVSFVAGAS------- 196

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
              V+ GCG+  +G +       G+ G G G +S+PS L K G   + F+    +  S  
Sbjct: 197 VPGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KVGNFSHCFTAVSGRKPSTV 253

Query: 282 IF-----FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLKQTSFK 327
           +F         G  T Q+T  + +      Y + ++   +GS+          LK  +  
Sbjct: 254 LFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGG 313

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
            I+DSG++FT LP  VY  +  EF   V    + S E  P  C       + P +P + L
Sbjct: 314 TIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVL 373

Query: 387 MF 388
            F
Sbjct: 374 HF 375


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 121/266 (45%), Gaps = 34/266 (12%)

Query: 90  SKTMSLGNDF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNS 146
           S  M L +D      + T + IGTP   F + +D+GS + ++PC  C +C        N 
Sbjct: 70  SARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG-------NH 122

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
            D     + P  SST   + CS        +C + K  C Y   Y  E +SSSG+L EDI
Sbjct: 123 QD---PRFQPDLSSTYSPVKCS-----ADCTCDSDKSQCTYERQY-AEMSSSSGVLGEDI 173

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +    G ++ LK       + GC   ++G      A DG++GLG G++S+   L   G+I
Sbjct: 174 VSF--GTESELKPQ---RAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVI 227

Query: 267 RNSFSMCFDKDD--SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
            +SFSMC+   D   G +  G   PA        +   +   Y I ++   +    L+  
Sbjct: 228 GDSFSMCYGGMDIGGGAMVLGAM-PAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLD 286

Query: 323 ----QTSFKAIVDSGSSFTFLPKEVY 344
                +    ++DSG+++ +LP++ +
Sbjct: 287 PRIFDSKHGTVLDSGTTYAYLPEQAF 312


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 135/313 (43%), Gaps = 31/313 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT + +GTP   F V +D GSD+LW+ C      P ++     L   L+ + P  SS+
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTS----ELQIQLSFFDPGVSSS 138

Query: 162 SKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           +  +SCS R C       + C +P   C Y+   Y + + +SG  + D +   +   + L
Sbjct: 139 ASLVSCSDRRCYSNFQTESGC-SPNNLCSYSFK-YGDGSGTSGYYISDFMSFDTVITSTL 196

Query: 218 KNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
             +  A  + GC   QSG       A DG+ GLG G +SV S LA  GL    FS C   
Sbjct: 197 AINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKG 256

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------QTSF 326
           DK   G +  G         T  + S      Y + +++  +    L          T  
Sbjct: 257 DKSGGGIMVLGQIKRPDTVYTPLVPSQPH---YNVNLQSIAVNGQILPIDPSVFTIATGD 313

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE---GYPWKCCYKSSSQRLPKLPS 383
             I+D+G++  +LP E Y    + F + V + ++ +     Y    C++ ++  +   P 
Sbjct: 314 GTIIDTGTTLAYLPDEAY----SPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQ 369

Query: 384 VKLMFPQNNSFVV 396
           V L F    S V+
Sbjct: 370 VSLSFAGGASMVL 382


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 74/280 (26%), Positives = 120/280 (42%), Gaps = 31/280 (11%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD 148
           G ++  +   F +L Y  +++GTP    L   D GSDL+W+ C        S+S     D
Sbjct: 91  GVESKIITRSFEYLMY--VNVGTPPTQLLAIADTGSDLVWVNC--------SSSGGGLAD 140

Query: 149 RDLNE---YSPSASSTSKHLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
            D      + P+ SST   LSC    C  L  +  +    C Y    Y + + + G+L  
Sbjct: 141 ADAGGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYS-YGDGSRTIGVLST 199

Query: 205 DILHLISGGDNALKNSVQA-SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           +    + GG    K  V+   V  GC    +G +      DGL+GLG G  S+ S L   
Sbjct: 200 ETFSFVDGGG---KGQVRVPRVNFGCSTASAGTFRS----DGLVGLGAGAFSLVSQLGAT 252

Query: 264 GLIRNSFSMC----FDKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCI 316
             I    S C    +D + S  + FG +   ++    ST  + S+     Y + +E+  +
Sbjct: 253 THIDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSY-YTVALESVAV 311

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           G   +     + IVDSG++ TFL   +   +  E +R++ 
Sbjct: 312 GGQEVATHDSRIIVDSGTTLTFLDPALLGPLVTELERRIK 351


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 132/313 (42%), Gaps = 29/313 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT + +GTP   F V +D GSD+LW+ C      P ++     L   L+ + P  SS+
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTS----ELQIQLSFFDPGVSSS 138

Query: 162 SKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           +  +SCS R C       + C +P   C Y+   Y + + +SG  + D +   +   + L
Sbjct: 139 ASLVSCSDRRCYSNFQTESGC-SPNNLCSYSFK-YGDGSGTSGFYISDFMSFDTVITSTL 196

Query: 218 KNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
             +  A  + GC   Q+G       A DG+ GLG G +SV S LA  GL    FS C   
Sbjct: 197 AINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKG 256

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------QTSF 326
           DK   G +  G         T  + S      Y + +++  +    L          T  
Sbjct: 257 DKSGGGIMVLGQIKRPDTVYTPLVPSQPH---YNVNLQSIAVNGQILPIDPSVFTIATGD 313

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFEGYPWKCCYKSSSQRLPKLPSV 384
             I+D+G++  +LP E Y          V+      ++E Y    C++ ++  +   P V
Sbjct: 314 GTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ---CFEITAGDVDVFPEV 370

Query: 385 KLMFPQNNSFVVN 397
            L F    S V+ 
Sbjct: 371 SLSFAGGASMVLR 383


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 152/355 (42%), Gaps = 72/355 (20%)

Query: 27  FSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
           FS +LIHR S +      ++N+     NA      ++   ++  LS+  +      G ++
Sbjct: 28  FSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKDSLSNTPESTVYVNGGEY 87

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLS 140
            M +                       +GTP  +    +D GSD++W+ C  C +C   +
Sbjct: 88  LMTY----------------------SVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQT 125

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSS 198
              +N          PS SS+ K++ CS  LC     TSC N +  C YT+++  ++ S 
Sbjct: 126 TPIFN----------PSKSSSYKNIPCSSNLCQSVRYTSC-NKQNSCEYTINFSDQSYSQ 174

Query: 199 SGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
             L VE +       D+   +SV     +IGCG    G +    +  G++GLG+G +S+ 
Sbjct: 175 GELSVETLTL-----DSTTGHSVSFPKTVIGCGHNNRGMFQGETS--GIVGLGIGPVSLT 227

Query: 258 SLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYII 309
           + L  +  I   FS C      D + + ++ FGD    +     ST F+  + +   Y+ 
Sbjct: 228 TQLKSS--IGGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLT 285

Query: 310 GVETCCIGSSCLKQTSFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQV 355
            +E   +G+   K+  F+          I+DSG++ T LP  VY  + +   + V
Sbjct: 286 -LEAFSVGN---KRIEFEVLDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLV 336


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 147/317 (46%), Gaps = 37/317 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L++T + +G+P   F V +D GSD+LW+ C+     P ++     L  +L+ + PS+SST
Sbjct: 85  LYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTS----GLGIELSFFDPSSSST 140

Query: 162 SKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG-GDN 215
           +  +SCSH +C          C      C Y+  +Y + + ++G  V D+L+  +  GD+
Sbjct: 141 TSLVSCSHPICTSLVQTTAAECSPQSNQCSYSF-HYGDGSGTTGYYVSDMLYFDTVLGDS 199

Query: 216 ALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            + NS  AS++ GC   QSG       A DG+ G G  ++SV S L+  G+    FS C 
Sbjct: 200 LIANS-SASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL 258

Query: 275 --DKDDSGRIFFGD-------QGPATQQSTSF------LASNGKYITYIIGVETCCIGSS 319
             + D  G++  G+         P     + +      ++ NG+    ++ ++     +S
Sbjct: 259 KGEGDGGGKLVLGEILEPNIIYSPLVPSQSHYNLNLQSISVNGQ----LLPIDPAVFATS 314

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             + T    IVDSG++ T+L +  Y+   +     V+ + T       + CY  S+    
Sbjct: 315 NNQGT----IVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQ-CYLVSTSVDE 369

Query: 380 KLPSVKLMFPQNNSFVV 396
             P V L F    S V+
Sbjct: 370 IFPPVSLNFAGGASMVL 386


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 136/318 (42%), Gaps = 41/318 (12%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDL 151
           +LG     L Y   + IG+P V+  +++D GSD+ W+ C  C +C     S  +SL    
Sbjct: 121 TLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQC----HSEVDSL---- 172

Query: 152 NEYSPSASSTSKHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
             + PSASST    SCS   C        G  C + +  C Y +  Y + +S++G    D
Sbjct: 173 --FDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQ--CQYIVS-YVDGSSTTGTYSSD 227

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            L L   G NA+K         GC   +SGG+ D    DGL+GLG    S+ S    AG 
Sbjct: 228 TLTL---GSNAIKG-----FQFGCSQSESGGFSDQT--DGLMGLGGDAQSLVS--QTAGT 275

Query: 266 IRNSFSMCF--DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK- 322
              +FS C       SG +  G    +    T  L S      Y + +E   +G   L  
Sbjct: 276 FGKAFSYCLPPTPGSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNI 335

Query: 323 -QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             + F A  ++DSG+  T LP   Y  +++ F   +     +        C+  S Q   
Sbjct: 336 PTSVFSAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSV 395

Query: 380 KLPSVKLMFPQNNSFVVN 397
            +PSV L+F  +   VVN
Sbjct: 396 SIPSVALVF--SGGAVVN 411


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/298 (26%), Positives = 135/298 (45%), Gaps = 36/298 (12%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP   F + +D+GS + ++PC  C +C        N  D     + P  SS   
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSS--- 137

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
             S S   C++  +C + K+ C Y   Y  E +SSSG+L EDI+    G ++ LK     
Sbjct: 138 --SYSPVKCNVDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSF--GRESELK---AQ 189

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
             + GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   D G   
Sbjct: 190 RAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGA 248

Query: 284 FGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSF 336
               G  T     F  S+  +   Y I ++   +    L+       +    ++DSG+++
Sbjct: 249 MVLGGVPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTTY 308

Query: 337 TFLPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMF 388
            +LP++ +         +V+    I   +      C+  + + + KL    P V ++F
Sbjct: 309 AYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVF 366


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 135/317 (42%), Gaps = 52/317 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP    +V +D GSDL WI  + C  C           ++    + PS SST   +
Sbjct: 29  IYLGTPPQKAVVIIDTGSDLTWIQSEPCRAC----------FEQADPIFDPSKSSTYNKI 78

Query: 166 SCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           +CS   C   LGT   +    C Y   Y   + +      E I    + G+         
Sbjct: 79  ACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEE-------- 130

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDD 278
            V  G  +  +G + D    +G++GLG G +S+PS L    ++ N FS C         +
Sbjct: 131 -VKFGASVYNTGTFGD-TGGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSE 186

Query: 279 SGRIFFGDQG-PATQQSTSFLASNGKYITYI-IGVETCCIGSSCLK--QTSFK------- 327
           +  ++FGD   P+ +   + +  N  + TY  I V+   +G S L   Q+ ++       
Sbjct: 187 TSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSG 246

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPS-- 383
             I+DSG++ T+L +EV+  + A +  QV   T TS  G     C+ +     P  P+  
Sbjct: 247 GTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG--LDLCFNTRGTGSPVFPAMT 304

Query: 384 -----VKLMFPQNNSFV 395
                V L  P  N+F+
Sbjct: 305 IHLDGVHLELPTANTFI 321


>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
          Length = 393

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/270 (28%), Positives = 119/270 (44%), Gaps = 57/270 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASST--S 162
           ++IG P+  + + +D GSDL W+ CD  CV+C      YY    R  N   P       S
Sbjct: 38  LNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYY----RPRNNLVPCMDPICQS 93

Query: 163 KHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            H +  HR       C+NP Q C Y ++Y  +  SS G+LV D  +L     N       
Sbjct: 94  LHSNGDHR-------CENPGQ-CDYEVEY-ADGGSSFGVLVTDTFNL-----NFTSEKRH 139

Query: 223 ASVI-IGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--- 277
           + ++ +GCG  Q  GG    +  DG++GLG G+ S+ S L+  GL+RN    C       
Sbjct: 140 SPLLALGCGYDQFPGGSHHPI--DGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGG 197

Query: 278 ---------DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
                    DS R+ +    P  +  +  LA            E    G    K T FK 
Sbjct: 198 FLFFGDDLYDSSRVAWTPMSPDAKHYSPGLA------------ELTFDG----KTTGFKN 241

Query: 329 IV---DSGSSFTFLPKEVYETIAAEFDRQV 355
           ++   DSG+S+T+L  + Y+ + +   +++
Sbjct: 242 LLTTFDSGASYTYLNSQAYQGLISLLKKEL 271


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 85/307 (27%), Positives = 137/307 (44%), Gaps = 54/307 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V++   +D GSDL+W  C  CV C           ++    + PS+SST   L
Sbjct: 106 MSIGTPAVAYAAIIDTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYAAL 155

Query: 166 SCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            CS  LC DL +S C + K  C YT   Y +++S+ G+L  +           L  +   
Sbjct: 156 PCSSTLCSDLPSSKCTSAK--CGYTYT-YGDSSSTQGVLAAETF--------TLAKTKLP 204

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGR 281
            V  GCG    G G+  G    GL+GLG G +   SL+++ GL  N FS C    DD+ +
Sbjct: 205 DVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--NKFSYCLTSLDDTSK 256

Query: 282 ----------IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK-- 327
                     I       ++ Q+T  + +  +   Y + ++   +GS+   L  ++F   
Sbjct: 257 SPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQ 316

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
                  IVDSG+S T+L  + Y  +   F  Q+        G     C+++ +  + ++
Sbjct: 317 DDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQV 376

Query: 382 PSVKLMF 388
              KL+F
Sbjct: 377 EVPKLVF 383


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 118/250 (47%), Gaps = 34/250 (13%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP   F + +D+GS + ++PC  C +C        N  D     + P  SS   
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCG-------NHQD---PRFQPDLSS--- 137

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
             S S   C++  +C + K+ C Y   Y  E +SSSG+L EDI+    G ++ LK     
Sbjct: 138 --SYSPVKCNVDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSF--GRESELK---PQ 189

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--- 280
             + GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   D G   
Sbjct: 190 RAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGA 248

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGS 334
            +  G   P+    +        Y  Y I ++   +    L+       +    ++DSG+
Sbjct: 249 MVLGGVPAPSDMVFSHSDPLRSPY--YNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGT 306

Query: 335 SFTFLPKEVY 344
           ++ +LP++ +
Sbjct: 307 TYAYLPEQAF 316


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/300 (26%), Positives = 135/300 (45%), Gaps = 40/300 (13%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP   F + +D+GS + ++PC  C +C        N  D     + P  SS   
Sbjct: 90  TRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCG-------NHQD---PRFQPDLSS--- 136

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
             S S   C++  +C + K+ C Y   Y  E +SSSG+L EDI+    G ++ LK     
Sbjct: 137 --SYSPVKCNVDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSF--GRESELK---PQ 188

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--- 280
             I GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   D G   
Sbjct: 189 HAIFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGA 247

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGS 334
            +  G   P     ++       Y  Y I ++   +    L+       +    ++DSG+
Sbjct: 248 MVLGGMLAPPDMIFSNSDPLRSPY--YNIELKEIHVAGKALRVESRIFNSKHGTVLDSGT 305

Query: 335 SFTFLPKEVYETIAAEFDRQVN--DTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMF 388
           ++ +LP++ +         +V+    I   +      C+  + + + KL    P V ++F
Sbjct: 306 TYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVF 365


>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 91/204 (44%), Gaps = 36/204 (17%)

Query: 96  GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN F   +Y+  + IGTP  +F   +D GSDL W+ CD  C  C              + 
Sbjct: 46  GNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCT----------LPPIR 95

Query: 153 EYSPSASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI- 206
           +Y P  ++    + C   +C          C NPK+ C Y ++Y  + +S   L+++   
Sbjct: 96  QYKPKGNT----VPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFP 151

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAK 262
           L L++G      +++Q  +  GCG  Q    L    P     G++GLG G+I V   L  
Sbjct: 152 LKLLNG------SAMQPRLAFGCGYDQ---ILPKAHPPPATAGVLGLGRGKIGVLPQLVA 202

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGD 286
           AGL RN    C      G +FFGD
Sbjct: 203 AGLTRNVVGHCLSSKGGGYLFFGD 226


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 145/339 (42%), Gaps = 41/339 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS--LDRDLNEYS----P 156
           ++    +GTP   F++  D GSDL W+ C   R +   AS   S  + R  N  S    P
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169

Query: 157 SASSTSK-HLSCSHRLCDLGTSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGD 214
            +S T K ++  S   C  GT+   P  PC Y  DY Y + +S+ G++  D   +   G 
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTT---PPAPCGY--DYRYKDKSSARGVVGTDAATIALSGS 224

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            + + +    V++GC     G      + DG++ LG   IS  S    A      FS C 
Sbjct: 225 GSDRKAKLQEVVLGCTTSYDGQSFQ--SSDGVLSLGNSNISFASR--AAARFGGRFSYCL 280

Query: 275 -----DKDDSGRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK------ 322
                 ++ +  + FG  G A   S + L  + +    Y + V+   +    L       
Sbjct: 281 VDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVW 340

Query: 323 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLP 379
             + +  AI+DSG+S T L    Y+ + A   +Q+   +      P++ CY  ++++R P
Sbjct: 341 DVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLA-RVPRVTMDPFEYCYNWTATRRPP 399

Query: 380 KLPSVKLMF-------PQNNSFVVN-NPVFVIYGTQVGV 410
            +P +++ F       P   S+V++  P     G Q GV
Sbjct: 400 AVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGV 438


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/305 (27%), Positives = 125/305 (40%), Gaps = 43/305 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V F+   D GSDL W  C  C  C P          +D   Y PSASST   +
Sbjct: 81  LAIGTPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPV 130

Query: 166 SCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            CS   C       +C  P   C Y    Y++   S+G+L  + L L   G +    +V 
Sbjct: 131 PCSSATCLPVLRSRNCSTPSSLCRYGYS-YSDGAYSAGILGTETLTL---GSSVPGQAVS 186

Query: 223 AS-VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDS 279
            S V  GCG    G   D +   G +GLG G +   SLLA+ G+ + S+ +   F+    
Sbjct: 187 VSDVAFGCGTDNGG---DSLNSTGTVGLGRGTL---SLLAQLGVGKFSYCLTDFFNSTLD 240

Query: 280 GRIFFGD-----QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQT 324
                G       GP   QST  L S      Y++ ++   +G   L            +
Sbjct: 241 SPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANS 300

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR-LPKLPS 383
           +   +VDSG++F+ LP+  +  +     + +     +       C    + +R LP +P 
Sbjct: 301 TGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPFMPD 360

Query: 384 VKLMF 388
           + L F
Sbjct: 361 LVLHF 365


>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
 gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
          Length = 603

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/254 (30%), Positives = 111/254 (43%), Gaps = 26/254 (10%)

Query: 112 PNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           P   + +  D GSDL WI CD  C  CA  + ++Y    R  N   P      K L C  
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYKP--RRGNIVPP------KDLLCME 250

Query: 170 -RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 228
            +       C+   Q C Y ++Y  +++SS G+L  D L L+    +  K     + I G
Sbjct: 251 VQRNQKAGYCETCDQ-CDYEIEY-ADHSSSMGVLATDKLLLMVANGSLTK----LNFIFG 304

Query: 229 CGMKQSGGYLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--DSGRIFFG 285
           C   Q G  L   V  DG++GL   ++S+PS LA  G+I N    C   D    G +F G
Sbjct: 305 CAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLG 364

Query: 286 DQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----QTSFKAIV-DSGSSFTFL 339
           D   P    +   +  +     Y   V     GSS L     ++  K I+ DSGSS+T+ 
Sbjct: 365 DDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYF 424

Query: 340 PKEVYETIAAEFDR 353
           PKE Y  + A  + 
Sbjct: 425 PKEAYSELVASLNE 438


>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Cucumis sativus]
          Length = 418

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/254 (29%), Positives = 115/254 (45%), Gaps = 27/254 (10%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS--TSKH 164
           +G P   + +  D GSDL W+ CD  C +C       Y    +  N+  P       S H
Sbjct: 63  VGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLY----QPSNDLVPCKDPLCMSLH 118

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNSVQA 223
            S  HR       C+NP Q C Y ++Y  +  SS G+LV D+  L ++ GD      ++ 
Sbjct: 119 SSMDHR-------CENPDQ-CDYEVEY-ADGGSSLGVLVRDVFPLNLTNGD-----PIRP 164

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
            + +GCG  Q  G       DG++GLG G +S+ S L   G++RN    CF+    G  F
Sbjct: 165 RLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYXF 224

Query: 284 FGDQ--GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 341
           FGD    P     T       K+ +   G E    G S   +  F  + DSGSS+T+   
Sbjct: 225 FGDGIYDPYRLVWTPMSRDYPKHYSPGFG-ELIFNGRSTGLRNLF-VVFDSGSSYTYFNA 282

Query: 342 EVYETIAAEFDRQV 355
           + Y+ + +  +R++
Sbjct: 283 QAYQVLTSLLNREL 296


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 72/258 (27%), Positives = 110/258 (42%), Gaps = 30/258 (11%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCA----PLSASYYNSLDRDLNEYS 155
           L+Y  + +G P+  + + +D+GS+L WI CD  C+ CA    PL      SL    +   
Sbjct: 78  LYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDPLC 137

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
            +  + S H   +H+            Q C Y + Y  ++  S G LV D +  +     
Sbjct: 138 AAVQAGSGHYH-NHK---------EASQRCDYDVAY-ADHGYSEGFLVRDSVRALLTN-- 184

Query: 216 ALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
             K  + A+ + GCG  Q     +     DG++GLG G  S+PS  AK GLI+N    C 
Sbjct: 185 --KTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCI 242

Query: 275 --DKDDSGRIFFGDQGPATQQSTSF-LASNGKYITYIIGVETCCIGSSCLKQTSFKA--- 328
                D G +FFGD   +T   T   +        Y +G      G+  L +        
Sbjct: 243 FGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKLG 302

Query: 329 --IVDSGSSFTFLPKEVY 344
             I DSGS++T+   + Y
Sbjct: 303 GIIFDSGSTYTYFTNQAY 320


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 124/266 (46%), Gaps = 40/266 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P +SST K + C
Sbjct: 89  IGTPPQQFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFDPESSSTYKPIKC 138

Query: 168 SHR-LCDL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           +   +CD  G  C   +Q        Y E ++SSG+L ED+   IS G+ +    +    
Sbjct: 139 NIDCICDSDGVQCVYERQ--------YAEMSTSSGVLGEDV---ISFGNQS--ELIPQRA 185

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIF 283
           + GC   ++G      A DG++GLG G++S+   L + G I +SFS+C+   D   G + 
Sbjct: 186 VFGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMV 244

Query: 284 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFT 337
            G   P +    ++ +   +   Y + ++   +    L  +S      + A++DSG+++ 
Sbjct: 245 LGGISPPSDMIFTY-SDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYA 303

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFE 363
           +LP E +    + F   + D I S +
Sbjct: 304 YLPAEAF----SAFKDAIMDEIHSLK 325


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 124/266 (46%), Gaps = 40/266 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P +SST K + C
Sbjct: 89  IGTPPQQFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFDPESSSTYKPIKC 138

Query: 168 SHR-LCDL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           +   +CD  G  C   +Q        Y E ++SSG+L ED+   IS G+ +    +    
Sbjct: 139 NIDCICDSDGVQCVYERQ--------YAEMSTSSGVLGEDV---ISFGNQS--ELIPQRA 185

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIF 283
           + GC   ++G      A DG++GLG G++S+   L + G I +SFS+C+   D   G + 
Sbjct: 186 VFGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMV 244

Query: 284 FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFT 337
            G   P +    ++ +   +   Y + ++   +    L  +S      + A++DSG+++ 
Sbjct: 245 LGGISPPSDMIFTY-SDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYA 303

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFE 363
           +LP E +    + F   + D I S +
Sbjct: 304 YLPAEAF----SAFKDAIMDEIHSLK 325


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/348 (25%), Positives = 149/348 (42%), Gaps = 86/348 (24%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLD-RDLNEYSPSAS 159
           T +  GTP  +  +  D GS L+W PC     C  C+      +  +D   +  + P  S
Sbjct: 83  TPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRFVPKLS 136

Query: 160 STSKHLSCSHRLC------DLGTSCQ--NPK-----QPCP-YTMDYYTENTSSSGLLVED 205
           S+SK + C +  C      D+ + C+  NPK     Q CP Y + Y   + S++GLL+ +
Sbjct: 137 SSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY--GSGSTAGLLLSE 194

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            L      D  + N      ++GC       +L    P G+ G G G  S+PS   + GL
Sbjct: 195 TLDF---PDKXIPN-----FVVGCS------FLSIHQPSGIAGFGRGSESLPS---QMGL 237

Query: 266 IRNSFSMCFDKDD----SGRIFFGDQG---------PATQQSTSFLASNGKYITYIIGVE 312
            + ++ +   K D    SG++     G         P  Q  +  +++N     Y + + 
Sbjct: 238 KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPS--VSNNAYKEYYYLNIR 295

Query: 313 TCCIGSSCLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND---- 357
              +G+  +K   +K           +I+DSGS+FTF+ K V E +A EF++Q+ +    
Sbjct: 296 KIIVGNQAVK-VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRA 354

Query: 358 -TITSFEGYPWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVV 396
             + +  G   + C+  S ++  K P +   F        P NN F +
Sbjct: 355 TDVETLTG--LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFAL 400


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 136/327 (41%), Gaps = 70/327 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +  GTP  +  + +D GSDL+W PC     C  C+      +++ +   N + P +SS+S
Sbjct: 94  LSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFIPKSSSSS 147

Query: 163 KHLSCSHRLC-------------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
           K L C +  C             D   +  N  Q CP  + +Y    +  G+++ + L L
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITG-GIMLSETLDL 206

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
              G          + I+GC +      L    P G+ G G G  S+PS L   GL + S
Sbjct: 207 PGKG--------VPNFIVGCSV------LSTSQPAGISGFGRGPPSLPSQL---GLKKFS 249

Query: 270 F--------------SMCFDKD-DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           +              S+  D + DSG    G       Q+      +   + Y +G+   
Sbjct: 250 YCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHI 309

Query: 315 CIGSSCLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSF 362
            +G   +K   +K            I+DSG++FT++  E++E +AAEF++QV +   T  
Sbjct: 310 TVGGKHVK-IPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEV 368

Query: 363 EGYP-WKCCYKSSSQRLPKLPSVKLMF 388
           EG    + C+  S    P  P + L F
Sbjct: 369 EGITGLRPCFNISGLNTPSFPELTLKF 395


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/348 (25%), Positives = 149/348 (42%), Gaps = 86/348 (24%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLD-RDLNEYSPSAS 159
           T +  GTP  +  +  D GS L+W PC     C  C+      +  +D   +  + P  S
Sbjct: 83  TPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS------FPKIDPTGIPRFVPKLS 136

Query: 160 STSKHLSCSHRLC------DLGTSCQ--NPK-----QPCP-YTMDYYTENTSSSGLLVED 205
           S+SK + C +  C      D+ + C+  NPK     Q CP Y + Y   + S++GLL+ +
Sbjct: 137 SSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY--GSGSTAGLLLSE 194

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            L      D  + N      ++GC       +L    P G+ G G G  S+PS   + GL
Sbjct: 195 TLDF---PDKKIPN-----FVVGCS------FLSIHQPSGIAGFGRGSESLPS---QMGL 237

Query: 266 IRNSFSMCFDKDD----SGRIFFGDQG---------PATQQSTSFLASNGKYITYIIGVE 312
            + ++ +   K D    SG++     G         P  Q  +  +++N     Y + + 
Sbjct: 238 KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPS--VSNNAYKEYYYLNIR 295

Query: 313 TCCIGSSCLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND---- 357
              +G+  +K   +K           +I+DSGS+FTF+ K V E +A EF++Q+ +    
Sbjct: 296 KIIVGNQAVK-VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRA 354

Query: 358 -TITSFEGYPWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFVV 396
             + +  G   + C+  S ++  K P +   F        P NN F +
Sbjct: 355 TDVETLTG--LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFAL 400


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 170/410 (41%), Gaps = 64/410 (15%)

Query: 4   ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-NATSWPAKKSFEYY 62
           + + I+L    +++ ++G +   F+ +LIHR S +       +N  +  +   ++S  + 
Sbjct: 8   VIVIIFLISTAVVSAATGPD-YGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHN 66

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
             L+++ V+            ++ ++G   M L             +GTP    +   D 
Sbjct: 67  TGLVTNTVEAP----------IYNNRGEYLMKL------------SVGTPPFPIIAVADT 104

Query: 123 GSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSC 178
           GSD++W  C+ C  C            +DL  ++PS S+T + +SCS  +C       SC
Sbjct: 105 GSDIIWTQCEPCTNC----------YQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSC 154

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 238
              K  C Y++  Y +N+ S G    D L +   G  + +        IGCG   +G + 
Sbjct: 155 SF-KPDCTYSIS-YGDNSHSQGDFAVDTLTM---GSTSGRVVAFPRTAIGCGHDNAGSFD 209

Query: 239 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ 293
             V+  G++GLGLG  S+   +  A  +   FS C      D   S ++ FG     +  
Sbjct: 210 ANVS--GIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGS 265

Query: 294 ---STSFLASNGKYITYIIGVETCCIG--------SSCLKQTSFKAIVDSGSSFTFLPKE 342
              ST    S+     Y + ++   +G        ++ +       I+DSG++ T LP +
Sbjct: 266 GAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVD 325

Query: 343 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 392
           +Y   A      +N   T       + C+++++    K+P + + F   N
Sbjct: 326 LYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDY-KVPFIAMHFEGAN 374


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/327 (26%), Positives = 139/327 (42%), Gaps = 62/327 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP +++   +D GSDL+W  C  CV C           ++    + PS+SST   L
Sbjct: 122 MSIGTPALAYAAIVDTGSDLVWTQCKPCVEC----------FNQSTPVFDPSSSSTYSTL 171

Query: 166 SCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            CS  LC DL TS C +  + C YT   Y + +S+ G+L  +           L  +   
Sbjct: 172 PCSSSLCSDLPTSTCTSAAKDCGYTYT-YGDASSTQGVLAAETF--------TLAKTKLP 222

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DDSGR 281
            V  GCG    G G+  G    GL+GLG G +   SL+++ GL    FS C    DD+ +
Sbjct: 223 GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--GKFSYCLTSLDDTSK 274

Query: 282 --IFFGD--------QGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK-- 327
             +  G            A  Q+T  + +  +   Y + ++   +GS+   L  ++F   
Sbjct: 275 SPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQ 334

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ----- 376
                  IVDSG+S T+L  + Y  +   F  Q+   +          C+K+ +      
Sbjct: 335 DDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDV 394

Query: 377 RLPKL-----PSVKLMFPQNNSFVVNN 398
            +PKL         L  P  N  V+++
Sbjct: 395 EVPKLVLHFDGGADLDLPAENYMVLDS 421


>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
          Length = 656

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 59/196 (30%), Positives = 95/196 (48%), Gaps = 21/196 (10%)

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSAS 142
           LF S  ++ + L    G  HY WI +GTP     + +D GS +   PC  C +C   +  
Sbjct: 77  LFTSDQNEVVPLNLGMG-THYAWIYVGTPPQRVSIIIDTGSGMTAFPCSGCDQCGNHTDI 135

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
            +N+          + SS+ + +SC+HR       C NP +PC      Y E +S S  +
Sbjct: 136 PFNT----------NLSSSIQPISCNHRTYFSCAYCTNPTEPCR----TYMEGSSWSAKV 181

Query: 203 VEDILHL---ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGL-GLGEISVPS 258
           +EDI++L    S  D  L +S     + GC  K++G ++  VA DG++G+   G   V  
Sbjct: 182 MEDIVYLGDVASAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMGIHNNGNDIVTK 240

Query: 259 LLAKAGLIRNSFSMCF 274
           L  +  +  N+F++CF
Sbjct: 241 LFREKKIPSNTFTLCF 256


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 86/327 (26%), Positives = 135/327 (41%), Gaps = 52/327 (15%)

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSA 141
           ML  S G +T     D  +L    + IGTP+ SF   +D GSDL+W  C+ C +C     
Sbjct: 78  MLQSSSGIETPVYAGDGEYLMN--VAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPT 135

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSS 199
             +N          P  SS+   L C  + C DL + +C N +  C YT  Y   +T+  
Sbjct: 136 PIFN----------PQDSSSFSTLPCESQYCQDLPSETCNNNE--CQYTYGYGDGSTTQG 183

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPS 258
            +  E             + S   ++  GCG    G G  +G    GLIG+G G +S+PS
Sbjct: 184 YMATETF---------TFETSSVPNIAFGCGEDNQGFGQGNGA---GLIGMGWGPLSLPS 231

Query: 259 LLAKAGLIRNSFSMC---FDKDDSGRIFFGDQG---PATQQSTSFLASNGKYITYIIGVE 312
            L         FS C   +       +  G      P    ST+ + S+     Y I ++
Sbjct: 232 QLGVG-----QFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQ 286

Query: 313 TCCIGSSCL--KQTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITS 361
              +G   L    ++F+         I+DSG++ T+LP++ Y  +A  F  Q+N  T+  
Sbjct: 287 GITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDE 346

Query: 362 FEGYPWKCCYKSSSQRLPKLPSVKLMF 388
                  C  + S     ++P + + F
Sbjct: 347 SSSGLSTCFQQPSDGSTVQVPEISMQF 373


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 112/386 (29%), Positives = 153/386 (39%), Gaps = 57/386 (14%)

Query: 22  AETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
           AE+  FS  +I R   +     ++  + A     + SF   +   SS V K +  +  Q 
Sbjct: 25  AESRGFSGTMIRRGRTDTTTAAINFTQAALESHRRLSFLASR---SSQVDKPQSSSASQL 81

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
                +  + T+ L  D G   Y     IGTP        D GSDL+W  CD    A   
Sbjct: 82  S----NNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWG 137

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTEN 195
            S         + Y P+ASST   L CS RLC    S     C      C Y   Y   +
Sbjct: 138 GS---------SSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGD 188

Query: 196 TS--SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 253
               + G L  +   L  GGD          V  GC     G Y +G    GL+GLG G 
Sbjct: 189 DPDFTQGFLGSETFTL--GGDAV------PGVGFGCTTALEGDYGEGA---GLVGLGRGP 237

Query: 254 ISVPSLLAKAGLIRNSFSMCFDKDDSGR--IFFGDQGPATQ-----QSTSFLASNGKYIT 306
           +S+ S L  AG    +F  C   D S    + FG     T      QST  LAS      
Sbjct: 238 LSLVSQL-DAG----TFMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLAST---TF 289

Query: 307 YIIGVETCCIGSS--CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
           Y + + +  IGS+           + DSG++ T+L +  Y    A F  Q   ++T  EG
Sbjct: 290 YAVNLRSITIGSATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTT-SLTPVEG 348

Query: 365 -YPWKCCY-KSSSQRLPKLPSVKLMF 388
            Y ++ CY K  S RL  +P++ L F
Sbjct: 349 RYGFEACYEKPDSARL--IPAMVLHF 372


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 92/409 (22%), Positives = 170/409 (41%), Gaps = 62/409 (15%)

Query: 4   ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-NATSWPAKKSFEYY 62
           + + I+L    +++ ++G +   F+ +LIHR S +       +N  +  +   ++S  + 
Sbjct: 8   VIVIIFLISTAVVSAATGPD-YGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHN 66

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
             L+++ V+            ++ ++G   M L             +GTP    +   D 
Sbjct: 67  TGLVTNTVEAP----------IYNNRGEYLMKL------------SVGTPPFPIIAVADT 104

Query: 123 GSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQ 179
           GSD++W    CV C        N   +DL  ++PS S+T + +SCS  +C       SC 
Sbjct: 105 GSDIIWT--QCVPCT-------NCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCS 155

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
             K  C Y++  Y +N+ S G    D L +   G  + +        IGCG   +G +  
Sbjct: 156 F-KPDCTYSIS-YGDNSHSQGDFAVDTLTM---GSTSGRVVAFPRTAIGCGHDNAGSFDA 210

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ- 293
            V+  G++GLGLG  S+   +  A  +   FS C      D   S ++ FG     +   
Sbjct: 211 NVS--GIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSG 266

Query: 294 --STSFLASNGKYITYIIGVETCCIG--------SSCLKQTSFKAIVDSGSSFTFLPKEV 343
             ST    S+     Y + ++   +G        ++ +       I+DSG++ T LP ++
Sbjct: 267 AVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDL 326

Query: 344 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 392
           Y   A      +N   T       + C+++++    K+P + + F   N
Sbjct: 327 YHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDY-KVPFIAMHFEGAN 374


>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
 gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
          Length = 649

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 123/282 (43%), Gaps = 45/282 (15%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPN-VSFLVALDAGSDLLWIPC-DCVRCAPLSAS 142
           FP  GS       + G+ +Y  I +G P+  +F V +D GS L ++PC  C +C   +  
Sbjct: 100 FPLHGSV-----KEHGY-YYANIALGDPSPRTFQVIVDTGSTLTYVPCATCAKCGTHTGG 153

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS---CQNPK----QPCPYTMDYYTEN 195
                      + P    T K L+C  + C        C   +      C Y+   Y E 
Sbjct: 154 ---------TRFDP----TGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYSR-TYAEG 199

Query: 196 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEI- 254
           +  SG LV D +H   GGD A   +    V+ GC   +SG   D  A DGLIGLG  +  
Sbjct: 200 SGVSGDLVRDKMHF--GGDIAPATNGTLDVVFGCTNAESGTIHDQEA-DGLIGLGNNQFA 256

Query: 255 SVPSLLAKAGLIRNSFSMCFDK-DDSGRIFFGDQGPATQQS-----TSFLASNGKYITYI 308
           S+P+ LA    +   FS+CF   +  G + FG + PAT  +     T    +      Y+
Sbjct: 257 SIPNQLADTHGLPRVFSLCFGSFEGGGALSFG-RLPATPHTPPLVYTDMRVNEAHPAYYV 315

Query: 309 IGVETCCIGSSCLKQTS-----FKAIVDSGSSFTFLPKEVYE 345
           +      IG   +   S     +  ++DSG++FT++P +V+ 
Sbjct: 316 VSTAAMKIGDVAVATPSDLAVGYGTVMDSGTTFTYVPTKVFH 357


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 141/315 (44%), Gaps = 40/315 (12%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP +      D GSDL+W  C  C +C       ++          P +SS+  ++
Sbjct: 64  LSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFD----------PRSSSSYTNI 113

Query: 166 SCSHRLCD-LGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           +C    C+ L +S C   ++ C YT   Y +N+ + G+L ++ L L S     +      
Sbjct: 114 TCGTESCNKLDSSLCSTDQKTCNYTYS-YADNSITQGVLAQETLTLTSTTGEPV---AFQ 169

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA-GLIRNSFSMC---FDKDDS 279
            +I GCG   S G+ D     GLIGLG G +S+ S +  + G   N FS C   F+ D S
Sbjct: 170 GIIFGCGHNNS-GFNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPS 226

Query: 280 --GRIFFGDQGPATQQ---STSFLASNGK-YITYIIGVETCCI------GSSCLKQTSFK 327
              ++ FG           ST  ++ +G  Y   ++G+    I      GSS    T   
Sbjct: 227 ITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITKGN 286

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
            ++DSG++ T+LP+E Y  +  +   +V       +GY  + CY++ +      P++ + 
Sbjct: 287 ILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDGY--ELCYQTPTNL--NGPTLTIH 342

Query: 388 FPQNNSFVVNNPVFV 402
           F   +  +    +F+
Sbjct: 343 FEGGDVLLTPAQMFI 357


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 126/302 (41%), Gaps = 57/302 (18%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +IGTP    LVALD  +D  W+PC  CV CA  S+  ++          PS SS+S++L 
Sbjct: 96  NIGTPAQPMLVALDTSNDAAWVPCSGCVGCA--SSVLFD----------PSKSSSSRNLQ 143

Query: 167 CSHRLCDLGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           C    C       NP     + C + M Y      +S  L +D L         L N V 
Sbjct: 144 CDAPQCK---QAPNPTCTAGKSCGFNMTYGGSTIEAS--LTQDTL--------TLANDVI 190

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD---- 278
            S   GC  K +G  L      GL+GLG G +S+ S      L  ++FS C         
Sbjct: 191 KSYTFGCISKATGTSLPA---QGLMGLGRGPLSLIS--QTQNLYMSTFSYCLPNSKSSNF 245

Query: 279 SGRIFFGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCL---KQTSFK 327
           SG +  G +    +  T+ L  N +     Y+  +   +G +   I +S L     T   
Sbjct: 246 SGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAG 305

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
            I DSG+ FT L +  Y  +  EF R++ N   TS  G+    CY  S       PSV  
Sbjct: 306 TIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLGGF--DTCYSGSV----VYPSVTF 359

Query: 387 MF 388
           MF
Sbjct: 360 MF 361


>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 452

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 115/265 (43%), Gaps = 33/265 (12%)

Query: 96  GNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   HYT  ++IG P   + + +D+GSDL W+ CD  C  C         +  RD  
Sbjct: 56  GNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGC---------TKPRD-Q 105

Query: 153 EYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
            Y P+ +     + C  +LC      +  +C +P   C Y ++Y  ++ SS G+LV D +
Sbjct: 106 LYKPNHNL----VQCVDQLCSEVQLSMEYTCASPDDQCDYEVEY-ADHGSSLGVLVRDYI 160

Query: 208 HL-ISGGDNALKNSVQASVIIGCGMKQS-GGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
               + G     + V+  V  GCG  Q   G     A  G++GLG G  S+ S L   GL
Sbjct: 161 PFQFTNG-----SVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGL 215

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQ 323
           I N    C      G +FFGD    +     TS L S+ +   Y  G             
Sbjct: 216 IHNVVGHCLSARGGGFLFFGDDFIPSSGIVWTSMLPSSSEK-HYSSGPAELVFNGKATVV 274

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIA 348
              + I DSGSS+T+   + Y+ + 
Sbjct: 275 KGLELIFDSGSSYTYFNSQAYQAVV 299


>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 120/287 (41%), Gaps = 41/287 (14%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCD-----CVRCAPLSAS-YYNSLDRDLNEYSPS 157
           Y  ++IG P   + + +D GS+L W+ C      C  C P     YY   D +L      
Sbjct: 39  YATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPADGNLK----- 93

Query: 158 ASSTSKHLSCSHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
                  + C   LC         +    +N    C Y + Y T    S G L  DI+  
Sbjct: 94  -------VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIIS- 143

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIR- 267
           ++G D       +  +  GCG KQ        +P DG++GLG+G+  + + L    +I+ 
Sbjct: 144 VNGRD-------KKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKE 196

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSF 326
           N    C      G ++ GD  P T+  T +         Y  G+    I    ++   +F
Sbjct: 197 NVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTF 255

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYK 372
           +A+ DSGS++T +P ++Y  I ++    +++ ++   +G     C+K
Sbjct: 256 EAVFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALPLCWK 302


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 151/351 (43%), Gaps = 48/351 (13%)

Query: 61  YYQVLLSSDVQKQKMKTG--PQFQMLFPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFL 117
           Y +   S DV+K     G   Q  +  P+      +LG     L Y   + +G+P  +  
Sbjct: 92  YIKRKFSGDVKKDGQGAGGVEQSHVTVPT------TLGTSLNTLEYLITVRLGSPAKTQT 145

Query: 118 VALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-- 174
           V +D+GSD+ W+ C  C++C       ++ +D     + PS SST    SCS   C    
Sbjct: 146 VLIDSGSDVSWVQCKPCLQC-------HSQVD---PLFDPSLSSTYSPFSCSSAACAQLG 195

Query: 175 --GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 232
             G  C +  Q C Y +  Y + +S++G    D L L   G N + N        GC   
Sbjct: 196 QDGNGCSSSSQ-CQYIV-RYADGSSTTGTYSSDTLAL---GSNTISN-----FQFGCSHV 245

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMCFDKDDSGRIFFG-DQGPA 290
           +S G+ D    DGL+GLG G    PSL ++ AG    +FS C     S   F     G +
Sbjct: 246 ES-GFND--LTDGLMGLGGG---APSLASQTAGTFGTAFSYCLPPTPSSSGFLTLGAGTS 299

Query: 291 TQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEVYET 346
               T  L S+     Y + +E   +G + L    + F A  ++DSG+  T LP+  Y  
Sbjct: 300 GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPRTAYSA 359

Query: 347 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
           +++ F   +     +        C+  S Q   +LPSV L+F  +   VVN
Sbjct: 360 LSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVF--SGGAVVN 408


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 89/173 (51%), Gaps = 23/173 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D+GS + ++PC DC +C                ++ P  SST + + C
Sbjct: 99  IGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDP----------KFQPEMSSTYQPVKC 148

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                ++  +C + ++ C Y  +Y  E++SS G+L ED   LIS G+ +     +A  + 
Sbjct: 149 -----NMDCNCDDDREQCVYEREY-AEHSSSKGVLGED---LISFGNESQLTPQRA--VF 197

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
           GC   ++G      A DG+IGLG G++S+   L   GLI NSF +C+   D G
Sbjct: 198 GCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 128/306 (41%), Gaps = 57/306 (18%)

Query: 118 VALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---- 172
           V +D GSDL W+ C  C RC       YN  D   N   PS S + + + CS   C    
Sbjct: 148 VIVDTGSDLSWVQCQPCKRC-------YNQQDPVFN---PSTSPSYRTVLCSSPTCQSLQ 197

Query: 173 ----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 228
               +LG    NP   C Y ++Y   + +   L  E   HL  G   A+ N      I G
Sbjct: 198 SATGNLGVCGSNPPS-CNYVVNYGDGSYTRGELGTE---HLDLGNSTAVNN-----FIFG 248

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG 285
           CG + + G   G +  GL+GLG   +S+ S    + +    FS C    + + SG +  G
Sbjct: 249 CG-RNNQGLFGGAS--GLVGLGRSSLSLIS--QTSAMFGGVFSYCLPITETEASGSLVMG 303

Query: 286 DQGPATQQSTSF----LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSFTF 338
                 + +T      +  N +   Y + +    +GS  ++  SF     ++DSG+  T 
Sbjct: 304 GNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITR 363

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQN 391
           LP  +Y+ +  EF +Q       F G+P          C+  S  +  ++P++K+ F  N
Sbjct: 364 LPPSIYQALKDEFVKQ-------FSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGN 416

Query: 392 NSFVVN 397
               V+
Sbjct: 417 AELNVD 422


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/394 (24%), Positives = 159/394 (40%), Gaps = 63/394 (15%)

Query: 30  KLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQK------QKMKTGPQFQM 83
           +L HR      +   ++ + A     ++  EY Q  +S    +      Q++ TG +   
Sbjct: 76  RLAHRCGPSTASASFAEVQRAD----EQRVEYIQRRVSGGGARGAKGALQQLATGSRSAT 131

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           +       TM +G    + +   + +GTP VS  V +D GSD+ W     V+C P SA  
Sbjct: 132 V-----PTTMGVGT---FQYVVTVSLGTPGVSQTVEVDTGSDVSW-----VQCKPCSAPA 178

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD----LGTSCQNPKQPCPYTMDYYTENTSSS 199
            NS  RD   + P+ SST   + C    C         C   +  C Y +  Y + ++++
Sbjct: 179 CNS-QRD-QLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ--CGYVVS-YGDGSNTT 233

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G+   D L L  G      N+V  + + GCG  Q+ G   G+  DGL+ LG   +S+ S 
Sbjct: 234 GVYGSDTLALAPG------NTV-GTFLFGCGHAQA-GMFAGI--DGLLALGRQSMSLKS- 282

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCI 316
              AG     FS C     S   +    GP++     +T  L +      Y++ +    +
Sbjct: 283 -QAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISV 341

Query: 317 GSS--CLKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP------ 366
           G     +  ++F    +VD+G+  T LP   Y  + + F   +        GYP      
Sbjct: 342 GGQQVAVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRGAIAPC-----GYPSAPANG 396

Query: 367 -WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 399
               CY  S   +  LP+V L F    +  +  P
Sbjct: 397 ILDTCYDFSRYGVVTLPTVALTFSGGATLALEAP 430


>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 119/287 (41%), Gaps = 41/287 (14%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCD-----CVRCAPLSAS-YYNSLDRDLNEYSPS 157
           Y  ++IG P   + + +D GS+L W+ C      C  C P     YY   D +L      
Sbjct: 39  YATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPADGNLK----- 93

Query: 158 ASSTSKHLSCSHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
                  + C   LC         +    +N    C Y + Y T    S G L  DI+  
Sbjct: 94  -------VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIIS- 143

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIR- 267
           ++G D       +  +  GCG KQ        +P DG++GLG+G+    + L    +I+ 
Sbjct: 144 VNGRD-------KKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKE 196

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSF 326
           N    C      G ++ GD  P T+  T +         Y  G+    I    ++   +F
Sbjct: 197 NVIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTF 255

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYK 372
           +A+ DSGS++T +P ++Y  I ++    +++ ++   +G     C+K
Sbjct: 256 EAVFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWK 302


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 137/320 (42%), Gaps = 45/320 (14%)

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
           +  P+Q   ++  GN     +   + +GTP     V  D GSDL W     V+C P S  
Sbjct: 131 VTLPAQRGISLGTGN-----YVVSMGLGTPARDMTVVFDTGSDLSW-----VQCTPCSDC 180

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSG 200
           Y    ++    + P+ SST   + C+   C      SC   K+ C Y +  Y + + + G
Sbjct: 181 Y----EQKDPLFDPARSSTYSAVPCASPECQGLDSRSCSRDKK-CRYEV-VYGDQSQTDG 234

Query: 201 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
            L  D L L        ++ V    + GCG + +G  L G A DGL+GLG  ++S+ S  
Sbjct: 235 ALARDTLTLT-------QSDVLPGFVFGCGEQDTG--LFGRA-DGLVGLGREKVSLSSQA 284

Query: 261 A-KAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGK---YITYIIGVETC 314
           A K G     FS C     S  G +  G   PA  + T+    +     Y   ++GV+  
Sbjct: 285 ASKYG---AGFSYCLPSSPSAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVA 341

Query: 315 --CIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WK 368
              +  S +  ++   ++DSG+  T LP  VY  + + F R +      ++  P      
Sbjct: 342 GRTVRVSPIVFSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGR--YGYKRAPALSILD 399

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            CY  +     ++PSV L+F
Sbjct: 400 TCYDFTGHTTVRIPSVALVF 419


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 134/320 (41%), Gaps = 42/320 (13%)

Query: 110 GTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           G     F V +D GSD+LW+ C+     P S+     L  +LN +    SST+  + CS 
Sbjct: 75  GXXXXXFNVQIDTGSDILWVNCNTCSNCPQSSQ----LGIELNFFDTVGSSTAALIPCSD 130

Query: 170 RLCDLGT-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILH--LISGGDNALKNSVQ 222
            +C  G       C      C YT  Y  + + +SG  V D ++  LI G   A+ ++  
Sbjct: 131 LICTSGVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFNLIMGQPPAVNST-- 187

Query: 223 ASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--DKDDS 279
           A+++ GC + QSG       A DG+ G G G +SV S L+  G+    FS C   D +  
Sbjct: 188 ATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGG 247

Query: 280 GRIFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
           G +  G+               P    +   +A NG+ +     V +       +     
Sbjct: 248 GILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFS-------ISNNRG 300

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPWKCCYKSSSQRLPKLPSV 384
             IVD G++  +L +E Y+ +    +  V+ +   T+ +G     CY  S+      P V
Sbjct: 301 GTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDIFPLV 357

Query: 385 KLMFPQNNSFVVNNPVFVIY 404
            L F    S V+    ++++
Sbjct: 358 SLNFEGGASMVLKPEQYLMH 377


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 161/373 (43%), Gaps = 65/373 (17%)

Query: 1   MNRIS-LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSF 59
           MN +S LT+ L     +   S A +  FS +LIHR S +      ++N+           
Sbjct: 1   MNTLSFLTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENK----------- 49

Query: 60  EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVA 119
             YQ  +  D  ++ +     F     +   ++  + +  G+L      +GTP       
Sbjct: 50  --YQHFV--DAARRSINRANHFFKDSDTSTPESTVIPDRGGYL--MTYSVGTPPTKIYGI 103

Query: 120 LDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGT 176
            D GSD++W+ C+ C +C   +   +N          PS SS+ K++ CS +LC     T
Sbjct: 104 ADTGSDIVWLQCEPCEQCYNQTTPIFN----------PSKSSSYKNIPCSSKLCHSVRDT 153

Query: 177 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 236
           SC + +  C Y +  Y +++ S G L  D L L S   + +       ++IGCG   +G 
Sbjct: 154 SCSD-QNSCQYKIS-YGDSSHSQGDLSVDTLSLESTSGSPVS---FPKIVIGCGTDNAGT 208

Query: 237 YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF------DKDDSGRIFFGDQGPA 290
           +  G A  G++GLG G +S+ + L  +  I   FS C       + + S  + FGD    
Sbjct: 209 F--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVV 264

Query: 291 TQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-----------IVDSGSSF 336
           +     ST  +  +  +  Y + ++   +G+   K+  F             I+DSG++ 
Sbjct: 265 SGDGVVSTPLIKKDPVF--YFLTLQAFSVGN---KRVEFGGSSEGGDDEGNIIIDSGTTL 319

Query: 337 TFLPKEVYETIAA 349
           T +P +VY  + +
Sbjct: 320 TLIPSDVYTNLES 332


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 135/333 (40%), Gaps = 62/333 (18%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSL 147
           G + + + N     +   + +GTP     + LD  +D  W+PC  C  C+  +       
Sbjct: 36  GQQVLKIAN-----YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT------- 83

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
                 + P+AS+T   L CS   C    G SC             Y  ++S +  LV+D
Sbjct: 84  ------FLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQD 137

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            +         L N V      GC    SGG    + P GL+GLG G I   SL+++AG 
Sbjct: 138 AI--------TLANDVIPGFTFGCINAVSGG---SIPPQGLLGLGRGPI---SLISQAGA 183

Query: 266 IRNS-FSMCFDKDD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           + +  FS C         SG +  G  G P + ++T  L +  +   Y + +    +G  
Sbjct: 184 MYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRI 243

Query: 320 CL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
            +            T    I+DSG+  T   + VY  I  EF +QVN  I+S   +    
Sbjct: 244 KVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLGAF--DT 301

Query: 370 CYKSSSQRLPKLPSVKLMF-------PQNNSFV 395
           C+ ++++   + P+V L F       P  NS +
Sbjct: 302 CFAATNEA--EAPAVTLHFEGLNLVLPMENSLI 332


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/261 (25%), Positives = 115/261 (44%), Gaps = 34/261 (13%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           T + IGTP   F + +D GS + ++PC  C +C                 + P  SST +
Sbjct: 79  TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCG----------KHQDPRFQPDLSSTYR 128

Query: 164 HLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            + C     +   +C +  + C Y   Y  E +SSSG++ ED++    G ++ LK     
Sbjct: 129 PVKC-----NPSCNCDDEGKQCTYERRY-AEMSSSSGVIAEDVVSF--GNESELK---PQ 177

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGR 281
             + GC   ++G      A DG++GLG G +SV   L   G+I +SFS+C+   D   G 
Sbjct: 178 RAVFGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGA 236

Query: 282 IFFGDQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGS 334
           +  G   P    +  F  SN  +   Y I ++   +    LK            ++DSG+
Sbjct: 237 MVLGQISPPP--NMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGT 294

Query: 335 SFTFLPKEVYETIAAEFDRQV 355
           ++ + P+  +  +     +++
Sbjct: 295 TYAYFPEAAFHALKDAIMKEI 315


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 125/300 (41%), Gaps = 53/300 (17%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +IGTP    LVALD  +D  WIPC  CV C   S+S           + PS SS+S+ L 
Sbjct: 93  NIGTPAQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140

Query: 167 CSHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C    C      SC   K  C + M Y    ++    L +D L L S         V  +
Sbjct: 141 CEAPQCKQAPNPSCTVSKS-CGFNMTY--GGSTIEAYLTQDTLTLAS--------DVIPN 189

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SG 280
              GC  K SG  L      GL+GLG G +S+ S      L +++FS C         SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244

Query: 281 RIFFGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAI 329
            +  G +    +  T+ L  N +     Y+  +   +G +   I +S L     T    I
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            DSG+ +T L +  Y  +  EF R+V N   TS  G+    CY  S       PSV  MF
Sbjct: 305 FDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGF--DTCYSGSV----VFPSVTFMF 358


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 135/306 (44%), Gaps = 51/306 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP +++   +D GSDL+W  C  CV C   S   ++          PS+SST   +
Sbjct: 109 VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATV 158

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            CS   C DL TS       C YT   Y +++S+ G+L  +           L  S    
Sbjct: 159 PCSSASCSDLPTSKCTSASKCGYTYT-YGDSSSTQGVLATETF--------TLAKSKLPG 209

Query: 225 VIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSG 280
           V+ GCG    G G+  G    GL+GLG G +   SL+++ GL  + FS C    D  ++ 
Sbjct: 210 VVFGCGDTNEGDGFSQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNS 261

Query: 281 RIFFGDQG--------PATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK--- 327
            +  G            ++ Q+T  + +  +   Y + ++   +GS+   L  ++F    
Sbjct: 262 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQD 321

Query: 328 -----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
                 IVDSG+S T+L  + Y  +   F  Q+        G     C+++ ++ + ++ 
Sbjct: 322 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 381

Query: 383 SVKLMF 388
             +L+F
Sbjct: 382 VPRLVF 387


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/394 (24%), Positives = 158/394 (40%), Gaps = 63/394 (15%)

Query: 30  KLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQK------QKMKTGPQFQM 83
           +L HR      +   ++ + A     ++  EY Q  +S    +      Q++ TG +   
Sbjct: 76  RLAHRCGPSTASASFAEVQRAD----EQRVEYIQRRVSGGGARGAKGALQQLATGSRSAT 131

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           +       TM +G    + +   + +GTP VS  V +D GSD+ W     V+C P SA  
Sbjct: 132 V-----PTTMGVGT---FQYVVTVSLGTPGVSQTVEVDTGSDVSW-----VQCKPCSAPA 178

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD----LGTSCQNPKQPCPYTMDYYTENTSSS 199
            NS  RD   + P+ SST   + C    C         C   +  C Y +  Y + ++++
Sbjct: 179 CNS-QRD-QLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ--CGYVVS-YGDGSNTT 233

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G+   D L L  G      N+V  + + GCG  Q+ G   G+  DGL+ LG   +S+ S 
Sbjct: 234 GVYGSDTLALAPG------NTV-GTFLFGCGHAQA-GMFAGI--DGLLALGRQSMSLKS- 282

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCI 316
              AG     FS C     S   +    GP +     +T  L +      Y++ +    +
Sbjct: 283 -QAAGAYGGVFSYCLPSKQSAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISV 341

Query: 317 GSS--CLKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP------ 366
           G     +  ++F    +VD+G+  T LP   Y  + + F   +        GYP      
Sbjct: 342 GGQQVAVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRGAIAP-----YGYPSAPANG 396

Query: 367 -WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNP 399
               CY  S   +  LP+V L F    +  +  P
Sbjct: 397 ILDTCYDFSRYGVVTLPTVALTFSGGATLALEAP 430


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 125/300 (41%), Gaps = 53/300 (17%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +IGTP    LVALD  +D  WIPC  CV C   S+S           + PS SS+S+ L 
Sbjct: 93  NIGTPAQPMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140

Query: 167 CSHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C    C      SC   K  C + M Y    ++    L +D L L S         V  +
Sbjct: 141 CEAPQCKQAPNPSCTVSKS-CGFNMTY--GGSTIEAYLTQDTLTLAS--------DVIPN 189

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SG 280
              GC  K SG  L      GL+GLG G +S+ S      L +++FS C         SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244

Query: 281 RIFFGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAI 329
            +  G +    +  T+ L  N +     Y+  +   +G +   I +S L     T    I
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            DSG+ +T L +  Y  +  EF R+V N   TS  G+    CY  S       PSV  MF
Sbjct: 305 FDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGF--DTCYSGSV----VFPSVTFMF 358


>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 98/212 (46%), Gaps = 15/212 (7%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYS 155
           N    ++YT + IGTP   F V +D GSD+LW+ C  CV C PL         +++  + 
Sbjct: 76  NPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGC-PL---------QNVTFFD 125

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           P ASS++  L+CS + C      ++   P  Y ++ Y++ + +SG  + D++   +   +
Sbjct: 126 PGASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVE-YSDGSFTSGYYISDLISFETVMSS 184

Query: 216 ALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            L     A  + GC    +G   L   +  G++GLG G + V S L+   L    FS+C 
Sbjct: 185 NLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCL 244

Query: 275 D--KDDSGRIFFGDQGPATQQSTSFLASNGKY 304
              ++  G I  G+        T  + S   Y
Sbjct: 245 SGGQEGGGVIILGENRLPNTVYTPLVRSQTHY 276


>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
 gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
          Length = 376

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/272 (27%), Positives = 119/272 (43%), Gaps = 60/272 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASST--S 162
           ++IG P+  + + +D GSDL W+ CD  CV+C      YY    R  N   P       S
Sbjct: 24  LNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYY----RPRNNLVPCMDPICQS 79

Query: 163 KHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            H +  HR       C+NP Q C Y ++Y  +  SS G+LV D  +L         +  +
Sbjct: 80  LHSNGDHR-------CENPGQ-CDYEVEY-ADGGSSFGVLVRDTFNL------NFTSEKR 124

Query: 223 ASVIIG---CGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD- 277
            S ++    CG  Q  GG    +  DG++GLG G+ S+ S L+  GL+RN    C     
Sbjct: 125 HSPLLALGLCGYDQFPGGSHHPI--DGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHG 182

Query: 278 -----------DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
                      DS R+ +    P  +  +  LA     +T+              K T F
Sbjct: 183 GGFLFFGDDLYDSSRVAWTPMSPDAKHYSPGLAE----LTFDG------------KTTGF 226

Query: 327 KAIV---DSGSSFTFLPKEVYETIAAEFDRQV 355
           K ++   DSG+S+T+L  + Y+ + +   +++
Sbjct: 227 KNLLTTFDSGASYTYLNSQAYQGLISLLKKEL 258


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 91/336 (27%), Positives = 136/336 (40%), Gaps = 54/336 (16%)

Query: 74  KMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC 133
           K+ T   F    P +    + LG      +   +  GTP    L+  D GSDL+W+ C  
Sbjct: 30  KLATITSFWAESPMESGAFLGLGQ-----YLVSMAFGTPPQEVLLIADTGSDLIWLQCST 84

Query: 134 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-------GTSCQNPKQPCP 186
               P          R    +  S S+T   + CS   C L       G SC +P  P P
Sbjct: 85  TAAPPAFCPKKACSRRP--AFVASKSATLSVVPCSAAQCLLVPAPRGHGPSC-SPAAPVP 141

Query: 187 YTMDY-YTENTSSSGLLVEDILHLISG--GDNALKNSVQASVIIGCGMKQSGGYLDGVAP 243
               Y Y + +S++G L  D   + +G  G  A++      V  GCG +  GG   G   
Sbjct: 142 CGYAYDYADGSSTTGFLARDTATISNGTSGGAAVRG-----VAFGCGTRNQGGSFSGTG- 195

Query: 244 DGLIGLGLGEISVPSLLAKAG-LIRNSFSMCFDKDDSGR-------IFFGDQGPATQQST 295
            G+IGLG G++S P   A++G L   +FS C    + GR       +F G        + 
Sbjct: 196 -GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAY 251

Query: 296 SFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK----------AIVDSGSSFTFLPKEVY 344
           + L SN    T Y +GV    +G+  L     +           ++DSGS+ T+L    Y
Sbjct: 252 TPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAY 311

Query: 345 ETIAAEFDRQVN-----DTITSFEGYPWKCCYKSSS 375
             + + F   V+      + T F+G   + CY  SS
Sbjct: 312 LHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSS 345


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 88/351 (25%), Positives = 144/351 (41%), Gaps = 53/351 (15%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPA-----KKSFEYYQVLLSSDVQKQKMKTGPQFQ 82
           S K+++++   +   G  K  N  S        +   + +QV LS +      K   + Q
Sbjct: 70  SLKVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFK---EMQ 126

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CV-RCAPLS 140
              P+    T       G  +   + +GTP   F ++ D GSDL W  C+ C+  C P  
Sbjct: 127 TTIPASIVPT-------GGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFP-- 177

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSS- 199
                   ++  ++ P+ S++ K++SCS   C L      P Q C      Y     S  
Sbjct: 178 --------QNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGY 229

Query: 200 --GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
             G L  + L + S   +  KN      + GC  ++S G  +G    GL+GLG   I++P
Sbjct: 230 TIGFLATETLAIAS--SDVFKN-----FLFGCS-EESRGTFNGTT--GLLGLGRSPIALP 279

Query: 258 SLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
           S        +N FS C     S  G + FG +     +ST         +  + G+ T  
Sbjct: 280 SQTTNK--YKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPI----SPKLKQLYGLNTVG 333

Query: 316 IGSSC----LKQTSFKAIVDSGSSFTFLPKEVYETIAAEF-DRQVNDTITS 361
           I        +  +  + I+DSG++FTFLP   Y  + + F +   N T+T+
Sbjct: 334 ISVRGRELPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTN 384


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 134/333 (40%), Gaps = 62/333 (18%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSL 147
           G + + + N     +   + +GTP     + LD  +D  W+PC  C  C+  +       
Sbjct: 36  GQQVLKIAN-----YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT------- 83

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
                 + P+AS+T   L CS   C    G SC             Y  ++S +  LV+D
Sbjct: 84  ------FLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQD 137

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            +         L N V      GC    SGG    + P GL+GLG G I   SL+++AG 
Sbjct: 138 AI--------TLANDVIPGFTFGCINAVSGG---SIPPQGLLGLGRGPI---SLISQAGA 183

Query: 266 IRNS-FSMCFDKDD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           + +  FS C         SG +  G  G P + ++T  L +  +   Y + +    +G  
Sbjct: 184 MYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRI 243

Query: 320 CL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
            +            T    I+DSG+  T   + VY  I  EF +QVN  I+S   +    
Sbjct: 244 KVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLGAF--DT 301

Query: 370 CYKSSSQRLPKLPSVKLMF-------PQNNSFV 395
           C+  +++   + P+V L F       P  NS +
Sbjct: 302 CFAETNEA--EAPAVTLHFEGLNLVLPMENSLI 332


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 121/286 (42%), Gaps = 47/286 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           I IGTP+V  L   D GSDL W+   PCD  +C   +   Y+ L+       P  S    
Sbjct: 100 IYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCT 159

Query: 164 HLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            L  S  +C D G         C Y    Y +N+ S G L  D + L+      L+    
Sbjct: 160 QLPYSQYVCSDYGD--------CIYAYT-YGDNSYSYGGLSSDSIRLM-----LLQLHYN 205

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDS 279
           + +  GCG +            G++GLG G +S+ S L     I + FS C   F  + +
Sbjct: 206 SKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSN 263

Query: 280 GRIFFGD----QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKAIVDSG 333
            ++ FG+    QG     +   +  +  +  Y + +E   +G+  +K  QT    I+DSG
Sbjct: 264 SKLKFGEAAIVQGNGVVSTPLIIKPDLPF--YYLNLEGITVGAKTVKTGQTDGNIIIDSG 321

Query: 334 SSFTFLPKEVY--------ETIAAEFDRQVNDTITSFEGYPWKCCY 371
           S+ T+L +  Y        ET+A E D+ +         YP+  C+
Sbjct: 322 STLTYLEESFYNEFVSLVKETVAVEEDQYI--------PYPFDFCF 359


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 78/296 (26%), Positives = 124/296 (41%), Gaps = 34/296 (11%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP V      D GSDL+W+ C  C +C P +A  ++          P  SST K + C
Sbjct: 98  IGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFD----------PRKSSTFKTVPC 147

Query: 168 SHRLCDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
             + C L      +C      C Y    Y ++T  SG+L  + ++  S  +NA+K     
Sbjct: 148 DSQPCTLLPPSQRACVGKSGQC-YYQYIYGDHTLVSGILGFESINFGS-KNNAIK---FP 202

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSG 280
            +  GC    +    +     GL+GLG+G +S+ S L     I   FS CF     + + 
Sbjct: 203 KLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTS 260

Query: 281 RIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK----QTSFKAIVDS 332
           ++ FG+     Q     ST  +  +     Y + +E   IG+  +K    QT    ++DS
Sbjct: 261 KMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDS 320

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           G+SFT L +  Y    A                 +  C+++  +R  + P V  +F
Sbjct: 321 GTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKR-KRFPDVVFLF 375


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 135/306 (44%), Gaps = 51/306 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP +++   +D GSDL+W  C  CV C   S   ++          PS+SST   +
Sbjct: 99  VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATV 148

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            CS   C DL TS       C YT   Y +++S+ G+L  +           L  S    
Sbjct: 149 PCSSASCSDLPTSKCTSASKCGYTYT-YGDSSSTQGVLATETF--------TLAKSKLPG 199

Query: 225 VIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSG 280
           V+ GCG    G G+  G    GL+GLG G +   SL+++ GL  + FS C    D  ++ 
Sbjct: 200 VVFGCGDTNEGDGFSQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNS 251

Query: 281 RIFFGDQG--------PATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK--- 327
            +  G            ++ Q+T  + +  +   Y + ++   +GS+   L  ++F    
Sbjct: 252 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQD 311

Query: 328 -----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
                 IVDSG+S T+L  + Y  +   F  Q+        G     C+++ ++ + ++ 
Sbjct: 312 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 371

Query: 383 SVKLMF 388
             +L+F
Sbjct: 372 VPRLVF 377


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 135/306 (44%), Gaps = 51/306 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP +++   +D GSDL+W  C  CV C   S   ++          PS+SST   +
Sbjct: 78  VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATV 127

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            CS   C DL TS       C YT   Y +++S+ G+L  +           L  S    
Sbjct: 128 PCSSASCSDLPTSKCTSASKCGYTYT-YGDSSSTQGVLATETF--------TLAKSKLPG 178

Query: 225 VIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSG 280
           V+ GCG    G G+  G    GL+GLG G +   SL+++ GL  + FS C    D  ++ 
Sbjct: 179 VVFGCGDTNEGDGFSQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNS 230

Query: 281 RIFFGDQG--------PATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK--- 327
            +  G            ++ Q+T  + +  +   Y + ++   +GS+   L  ++F    
Sbjct: 231 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQD 290

Query: 328 -----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
                 IVDSG+S T+L  + Y  +   F  Q+        G     C+++ ++ + ++ 
Sbjct: 291 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 350

Query: 383 SVKLMF 388
             +L+F
Sbjct: 351 VPRLVF 356


>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
 gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
          Length = 424

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 76/251 (30%), Positives = 109/251 (43%), Gaps = 36/251 (14%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD--CVRCA-PLSASYYNSLDRDLNEYSPSASSTSKHL 165
           IG P   F + +D GSDL W+ CD  C  C  PL   Y                  +  L
Sbjct: 73  IGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLY---------------KPRNNLL 117

Query: 166 SCSHRLC----DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALK 218
           SC   LC    + GT  CQ+    C Y + Y  E  SS G+LV D   L L++G      
Sbjct: 118 SCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEG-SSLGVLVTDYFPLRLMNG------ 170

Query: 219 NSVQASVIIGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
           + ++  +  GCG  Q S G +      G++GLG G+ S+ S L   G++ N    C  + 
Sbjct: 171 SFLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRK 230

Query: 278 DSGRIFFGDQGPATQQSTSFLASNGKYIT--YIIGVETCCIGSSCLKQTSFKAIVDSGSS 335
             G +FFG Q P      S+   + K +   Y  G      G       + + I DSGSS
Sbjct: 231 GGGFLFFG-QDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSS 289

Query: 336 FTFLPKEVYET 346
           +T+   +VY++
Sbjct: 290 YTYFNAQVYQS 300


>gi|359496801|ref|XP_003635339.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 151

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 64/113 (56%), Gaps = 9/113 (7%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM- 83
           F   + HRFS+ VK +      +    P K S +YY+ +   D  +  +++ T  + +  
Sbjct: 30  FGFDMHHRFSDPVKGI-----LDVDDLPEKLSLQYYKAMAHRDWVIHGRRLSTSDEVKPP 84

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRC 136
           L  S G++T  L +  G+LHY  + +GTP++ FLVALD GSDL W+PCDC  C
Sbjct: 85  LTFSDGNETYRLSS-LGYLHYANVSLGTPSLWFLVALDTGSDLFWLPCDCTSC 136


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 91/300 (30%), Positives = 125/300 (41%), Gaps = 53/300 (17%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +IGTP  + LVALD  +D  WIPC  CV C   S+S           + PS SS+S+ L 
Sbjct: 93  NIGTPAQAMLVALDTSNDAAWIPCSGCVGC---SSSVL---------FDPSKSSSSRTLQ 140

Query: 167 CSHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C    C      SC   K  C + M Y    ++    L +D L         L   V  +
Sbjct: 141 CEAPQCKQAPNPSCTVSKS-CGFNMTY--GGSAIEAYLTQDTL--------TLATDVIPN 189

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SG 280
              GC  K SG  L      GL+GLG G +S+ S      L +++FS C         SG
Sbjct: 190 YTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFSYCLPNSKSSNFSG 244

Query: 281 RIFFGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAI 329
            +  G +    +  T+ L  N +     Y+  +   +G +   I +S L     T    I
Sbjct: 245 SLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTI 304

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            DSG+ +T L +  Y  +  EF R+V N   TS  G+    CY  S       PSV  MF
Sbjct: 305 FDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGGF--DTCYSGSV----VFPSVTFMF 358


>gi|297739018|emb|CBI28370.3| unnamed protein product [Vitis vinifera]
          Length = 150

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 64/113 (56%), Gaps = 9/113 (7%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD--VQKQKMKTGPQFQM- 83
           F   + HRFS+ VK +      +    P K S +YY+ +   D  +  +++ T  + +  
Sbjct: 30  FGFDMHHRFSDPVKGI-----LDVDDLPEKLSLQYYKAMAHRDWVIHGRRLSTSDEVKPP 84

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRC 136
           L  S G++T  L +  G+LHY  + +GTP++ FLVALD GSDL W+PCDC  C
Sbjct: 85  LTFSDGNETYRLSS-LGYLHYANVSLGTPSLWFLVALDTGSDLFWLPCDCTSC 136


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 126/308 (40%), Gaps = 51/308 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP     + LD GSDL+W      +C P  A +    D+ L  + PS SST    S
Sbjct: 86  LAIGTPPQPVQLTLDTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTLSLTS 136

Query: 167 CSHRLCD--LGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           C   LC      SC +PK    Q C YT   Y + + ++G L  D    +  G +     
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQTCVYTYS-YGDKSVTTGFLEVDKFTFVGAGASV---- 191

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---- 276
               V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF      
Sbjct: 192 --PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGL 242

Query: 277 -------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------C 320
                  D    ++   +G    QST  + +      Y + ++   +GS+          
Sbjct: 243 KPSTVLLDLPADLYKSGRG--AVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFA 300

Query: 321 LKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
           LK  +   I+DSG++ T LP  VY  +   F  QV   + S        C  +  +  P 
Sbjct: 301 LKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPY 360

Query: 381 LPSVKLMF 388
           +P + L F
Sbjct: 361 VPKLVLHF 368


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 114/256 (44%), Gaps = 32/256 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P  SS+ K L C
Sbjct: 86  IGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDP----------KFQPELSSSYKALKC 135

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +        +C +  + C Y    Y E +SSSG+L ED   LIS G+ +     +A  + 
Sbjct: 136 NP-----DCNCDDEGKLCVYER-RYAEMSSSSGVLSED---LISFGNESQLTPQRA--VF 184

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC   ++G      A DG++GLG G++SV   L   G+I + FS+C+   +   G +  G
Sbjct: 185 GCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 243

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFL 339
              P      S  +   +   Y I ++   +    LK            ++DSG+++ + 
Sbjct: 244 KISPPAGMVFSH-SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYF 302

Query: 340 PKEVYETIAAEFDRQV 355
           PKE +  I     +++
Sbjct: 303 PKEAFIAIKDAIIKEI 318


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 130/331 (39%), Gaps = 51/331 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP     + LD GSDL WI CD C  C   +  +YN          P+ SS+ +++SC
Sbjct: 176 VGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYN----------PNESSSYRNISC 225

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
               C L +S      C+   Q CPY  DY   + ++    +E     ++  +   K   
Sbjct: 226 YDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKH 285

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 276
              V+ GCG    G +        L+GLG G +S PS L    +  +SFS C      + 
Sbjct: 286 VVDVMFGCGHWNKGFFHGAGG---LLGLGRGPLSFPSQLQ--SIYGHSFSYCLTDLFSNT 340

Query: 277 DDSGRIFFGDQGPATQQS----TSFLASNG--KYITYIIGVETCCIGSSCLK-------- 322
             S ++ FG+            T  LA         Y + +++  +G   L         
Sbjct: 341 SVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHW 400

Query: 323 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
             +     I+DSGS+ TF P   Y+ I   F++++     + + +    CY  S     +
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVE 460

Query: 381 LPSVKLM--------FPQNNSFVVNNPVFVI 403
           LP   +         FP  N F    P  VI
Sbjct: 461 LPDYGIHFADGAVWNFPAENYFYQYEPDEVI 491


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 135/315 (42%), Gaps = 56/315 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  +  + LD GS+L W+      CAP  A    S       + P ASST   + 
Sbjct: 89  LAVGTPPQNVTMVLDTGSELSWL-----LCAPAGARNKFS----AMSFRPRASSTFAAVP 139

Query: 167 CSHRLC---DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C+   C   DL +  +C      C  ++  Y + +SS G L  D+  + SG        +
Sbjct: 140 CASAQCRSRDLPSPPACDGASSRCSVSLS-YADGSSSDGALATDVFAVGSG------PPL 192

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 280
           +A+   GC         DGVA  GL+G+  G +   S +++A   R  FS C  D+DD+G
Sbjct: 193 RAA--FGCMSSAFDSSPDGVASAGLLGMNRGAL---SFVSQASTRR--FSYCISDRDDAG 245

Query: 281 RIFFG----------DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTS 325
            +  G          +  P  Q +        +A + + +   +G +   I +S L    
Sbjct: 246 VLLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDH 305

Query: 326 FKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYKSSSQ 376
             A   +VDSG+ FTFL  + Y  + AEF RQ    + + +         +  C++    
Sbjct: 306 TGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQG 365

Query: 377 RLP---KLPSVKLMF 388
           R P   +LP V L+F
Sbjct: 366 RSPPTARLPGVTLLF 380


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 131/321 (40%), Gaps = 43/321 (13%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
           + SQ     S GN     +   I +GTP    L   D   DL W+PC  C  C       
Sbjct: 84  YASQSELNFSKGN-----YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCT------ 132

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSS--- 198
                +D   + PS SST    +C    C +  G  CQ   + C Y      +  SS   
Sbjct: 133 -----KDGFTFFPSESSTYTSAACESYQCQITNGAVCQ--TKMCIYLCGPLPQQRSSCTN 185

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            GL+  D +   S    AL  S   +  I CG      +  G    G++GLG G  S+ S
Sbjct: 186 KGLVAMDTISFHSSSGQAL--SYPNTNFI-CGTFIDNWHYIGA---GIVGLGRGLFSMTS 239

Query: 259 LLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVET 313
            +    LI  +FS C   +    S +I FG +G  + +   ++ +A +G+   Y + +E 
Sbjct: 240 QMKH--LINGTFSQCLVPYSSKQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEA 297

Query: 314 CCIGSSCLKQTSFKA-----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPW 367
             +G + +    + A      +D  ++FT LP + YE + AE  + +N T  ++      
Sbjct: 298 MSVGGNRVANNFYSAPKSNIYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKL 357

Query: 368 KCCYKSSSQRLPKLPSVKLMF 388
             CYKS S      P + + F
Sbjct: 358 SLCYKSESDHDFDAPPITMHF 378


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 135/333 (40%), Gaps = 62/333 (18%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSL 147
           G + + + N     +   + +GTP     + LD  +D  W+PC  C  C+  +       
Sbjct: 89  GQQVLKIAN-----YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT------- 136

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
                 + P+AS+T   L CS   C    G SC             Y  ++S +  LV+D
Sbjct: 137 ------FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQD 190

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            +         L N V      GC    SGG    + P GL+GLG G I   SL+++AG 
Sbjct: 191 AI--------TLANDVIPGFTFGCINAVSGG---SIPPQGLLGLGRGPI---SLISQAGA 236

Query: 266 IRNS-FSMCFDKDD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           + +  FS C         SG +  G  G P + ++T  L +  +   Y + +    +G  
Sbjct: 237 MYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRI 296

Query: 320 CL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
            +            T    I+DSG+  T   + VY  I  EF +QVN  I+S   +    
Sbjct: 297 KVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLGAF--DT 354

Query: 370 CYKSSSQRLPKLPSVKLMF-------PQNNSFV 395
           C+ ++++   + P++ L F       P  NS +
Sbjct: 355 CFAATNEA--EAPAITLHFEGLNLVLPMENSLI 385


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 127/312 (40%), Gaps = 51/312 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +   + IGTP     + LD GSDL+W      +C P  A +    D+ L  + PS SST 
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTL 132

Query: 163 KHLSCSHRLCD--LGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
              SC   LC      SC +PK    Q C YT   Y + + ++G L  D    +  G + 
Sbjct: 133 SLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYS-YGDKSVTTGFLEVDKFTFVGAGASV 191

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                   V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF  
Sbjct: 192 ------PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTA 238

Query: 277 -----------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS------ 319
                      D    ++   +G    QST  + +      Y + ++   +GS+      
Sbjct: 239 VNGLKPSTVLLDLPADLYKSGRGAV--QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPE 296

Query: 320 ---CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
               LK  +   I+DSG++ T LP  VY  +   F  QV   + S        C  +  +
Sbjct: 297 SEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLR 356

Query: 377 RLPKLPSVKLMF 388
             P +P + L F
Sbjct: 357 AKPYVPKLVLHF 368


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 72/276 (26%), Positives = 124/276 (44%), Gaps = 36/276 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V ++   D GSDL+W  C  C++C   S   ++          P  S++  H+
Sbjct: 96  VSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFD----------PLKSTSFSHV 145

Query: 166 SCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            C+ + C  +  S    +  C Y+  Y  +  +   L  E I    + G +++K+     
Sbjct: 146 PCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI----TIGSSSVKS----- 196

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGR 281
            +IGCG +  GG+        +IGLG G++S+ S +++   I   FS C        +G+
Sbjct: 197 -VIGCGHESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK 252

Query: 282 IFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA--IVDSGSS 335
           I FG      GP    +   L S      Y + +E   IG+     ++ +   I+DSG++
Sbjct: 253 INFGQNAVVSGPGVVSTP--LISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTT 310

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
            +FLPKE+Y+ + +   + V        G  W  C+
Sbjct: 311 LSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF 346


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 91/331 (27%), Positives = 135/331 (40%), Gaps = 61/331 (18%)

Query: 110 GTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           G+P  +  V +D GSDL W     V+C P SA Y     RD   + P+ S+T   + C+ 
Sbjct: 197 GSPAANLTVIVDTGSDLTW-----VQCKPCSACYAQ---RD-PLFDPAGSATYAAVRCNA 247

Query: 170 RLCDL------GT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
             C        GT  SC    + C Y +  Y + + S G+L  D +        AL  + 
Sbjct: 248 SACAASLKAATGTPGSCGGGNERCYYAL-AYGDGSFSRGVLATDTV--------ALGGAS 298

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCF----DK 276
               + GCG+    G   G A  GL+GLG  E+S+ S  A + G +   FS C       
Sbjct: 299 LDGFVFGCGLSNR-GLFGGTA--GLMGLGRTELSLVSQTALRYGGV---FSYCLPATTSG 352

Query: 277 DDSGRIFFGDQGPATQQS-----TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA--- 328
           D SG +  G    + + +     T  +A   +   Y + V    +G + L      A   
Sbjct: 353 DASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNV 412

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKL 381
           ++DSG+  T L   VY  + AEF RQ      +  GYP          CY  +     K+
Sbjct: 413 LIDSGTVITRLAPSVYRGVRAEFTRQF-----AAAGYPTAPGFSILDTCYDLTGHDEVKV 467

Query: 382 PSVKLMFPQNNSFVVNNP--VFVIY--GTQV 408
           P + L         V+    +FV+   G+QV
Sbjct: 468 PLLTLRLEGGAEVTVDAAGMLFVVRKDGSQV 498


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 88/307 (28%), Positives = 131/307 (42%), Gaps = 55/307 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IG P +  L  +D GS L W+ C  C  C+  S   ++          PS SST  +LSC
Sbjct: 99  IGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFD----------PSKSSTYSNLSC 148

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           S   C+    C      CPY+++ Y  + SS G+   + L L +  ++ +K     S+I 
Sbjct: 149 SE--CN---KCDVVNGECPYSVE-YVGSGSSQGIYAREQLTLETIDESIIK---VPSLIF 199

Query: 228 GCGMK---QSGGY-LDGVAPDGLIGLGLGEIS-VPSLLAK----AGLIRNSFSMCFDKDD 278
           GCG K    S GY   G+  +G+ GLG G  S +PS   K     G +RN+         
Sbjct: 200 GCGRKFSISSNGYPYQGI--NGVFGLGSGRFSLLPSFGKKFSYCIGNLRNT------NYK 251

Query: 279 SGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------- 327
             R+  GD+      ST+    NG    Y + +E   IG   L    T F+         
Sbjct: 252 FNRLVLGDKANMQGDSTTLNVING---LYYVNLEAISIGGRKLDIDPTLFERSITDNNSG 308

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE---GYPWKCCYKS-SSQRLPKLPS 383
            I+DSG+  T+L K  +E ++ E +  +   +   +     P+  CY    SQ L   P 
Sbjct: 309 VIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPL 368

Query: 384 VKLMFPQ 390
           V   F +
Sbjct: 369 VTFHFAE 375


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 80/304 (26%), Positives = 134/304 (44%), Gaps = 51/304 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP +++   +D GSDL+W  C  CV C   S   ++          PS+SST   + C
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATVPC 222

Query: 168 SHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           S   C DL TS       C YT   Y +++S+ G+L  +           L  S    V+
Sbjct: 223 SSASCSDLPTSKCTSASKCGYTY-TYGDSSSTQGVLATETF--------TLAKSKLPGVV 273

Query: 227 IGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRI 282
            GCG    G G+  G    GL+GLG G +   SL+++ GL  + FS C    D  ++  +
Sbjct: 274 FGCGDTNEGDGFSQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDTNNSPL 325

Query: 283 FFGD--------QGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK----- 327
             G            ++ Q+T  + +  +   Y + ++   +GS+   L  ++F      
Sbjct: 326 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 385

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
               IVDSG+S T+L  + Y  +   F  Q+        G     C+++ ++ + ++   
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVP 445

Query: 385 KLMF 388
           +L+F
Sbjct: 446 RLVF 449


>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 77/290 (26%), Positives = 119/290 (41%), Gaps = 36/290 (12%)

Query: 96  GNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   +YT  + IG P   + + +D GSDL W+ CD  C  C         ++ R+  
Sbjct: 56  GNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGC---------TIPRN-R 105

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
            Y P+ +     + C   LC    S     C  P + C Y ++Y  + +S   LL ++I 
Sbjct: 106 LYKPNGNL----VKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIP 161

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
              + G  A     +  +  GCG  Q   G+    +  G++GLG G+ S+ S L   GLI
Sbjct: 162 LKFTNGSLA-----RPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLI 216

Query: 267 RNSFSMCFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
           RN    C  +   G +FFGDQ  P +    + L  +     Y  G               
Sbjct: 217 RNVVGHCLSERGGGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKG 276

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
            + I DSGSS+T+   + ++ +       VN       G P     + SS
Sbjct: 277 LQLIFDSGSSYTYFNSKAHKAL-------VNLVTNDLRGKPLSRATEDSS 319


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 79/287 (27%), Positives = 118/287 (41%), Gaps = 35/287 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP       +D  +D +W  C+ C  C   ++  ++          PS SST K + C
Sbjct: 95  IGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFD----------PSKSSTYKTIPC 144

Query: 168 SHRLCD--LGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           S   C     T C  + K+ C Y+  Y  E   S G L  D L L S  D  +      +
Sbjct: 145 SSPKCKNVENTHCSSDDKKVCEYSFTYGGE-AYSQGDLSIDTLTLNSNNDTPIS---FKN 200

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 279
           ++IGCG +  G  L+G    G IGLG G +S  S L  +  I   FS C      ++  S
Sbjct: 201 IVIGCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGIS 256

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------QTSFKAIVD 331
           G++ FGD+   +   T         I Y   +    +G   +K              I+D
Sbjct: 257 GKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIID 316

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 378
           SG++ T LP+ VY  + +     V           +K CYK++ + L
Sbjct: 317 SGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKNL 363


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 127/312 (40%), Gaps = 51/312 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +   + IGTP     + LD GSDL+W      +C P  A +    D+ L  + PS SST 
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTL 132

Query: 163 KHLSCSHRLCD--LGTSCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
              SC   LC      SC +PK    Q C YT   Y + + ++G L  D    +  G + 
Sbjct: 133 SLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYS-YGDKSVTTGFLEVDKFTFVGAGASV 191

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                   V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF  
Sbjct: 192 ------PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTA 238

Query: 277 -----------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS------ 319
                      D    ++   +G    QST  + +      Y + ++   +GS+      
Sbjct: 239 VNGLKPSTVLLDLPADLYKSGRGAV--QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPE 296

Query: 320 ---CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
               LK  +   I+DSG++ T LP  VY  +   F  QV   + S        C  +  +
Sbjct: 297 SEFTLKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLR 356

Query: 377 RLPKLPSVKLMF 388
             P +P + L F
Sbjct: 357 AKPYVPKLVLHF 368


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 132/323 (40%), Gaps = 44/323 (13%)

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
           ++  PS+   T+  GN     +   + +GTP        D GSDL W      +C P + 
Sbjct: 122 KVTLPSKSGSTIGTGN-----YVVTVGLGTPKRDLTFIFDTGSDLTW-----TQCEPCAR 171

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENT 196
             Y+  +   N   PS S++  ++SCS   CD      G S       C Y +  Y + +
Sbjct: 172 YCYHQQEPIFN---PSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQ-YGDQS 227

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G   +D L L S         V  + + GCG    G ++ GVA  GLIGLG   +S+
Sbjct: 228 YSVGFFAQDKLALTS-------TDVFNNFLFGCGQNNRGLFV-GVA--GLIGLGRNALSL 277

Query: 257 PSLLA-KAGLIRNSFSMCFDKDDS--GRIFFGDQG---PATQQSTSFLASNGKYITYIIG 310
            S  A K G +   FS C     S  G + FG  G    A + + S + S G    Y + 
Sbjct: 278 VSQTAQKYGKL---FSYCLPSTSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSF-YFLN 333

Query: 311 VETCCIGSSCLKQ-----TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
           +    +G   L       ++   I+DSG+  + LP   Y  + A F +Q++    +    
Sbjct: 334 LIAISVGGRKLSTSASVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPAS 393

Query: 366 PWKCCYKSSSQRLPKLPSVKLMF 388
               CY  S      +P + L F
Sbjct: 394 ILDTCYDFSQYDTVDVPKINLYF 416


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 74/299 (24%), Positives = 130/299 (43%), Gaps = 46/299 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC+ C +C        N  D    ++ P  S T   + C
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCG-------NHQDP---KFQPDLSDTYHPVKC 51

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +        +C      C Y   Y  E +SSSG+L ED++    G  + LK       + 
Sbjct: 52  NPD-----CTCDTENDQCTYERQY-AEMSSSSGILGEDLVSF--GNMSELK---PQRAVF 100

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   +   G +  G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFL 339
              P +    S  +   +   Y I +    +    L             I+DSG+++ +L
Sbjct: 160 QISPPSDMVFSH-SDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYL 218

Query: 340 PKEVY----ETIAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMF 388
           P+  +    + I +E    +Q+     ++       C+  +   +P+L    PSV ++F
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNYN----DVCFSGAGSEIPELYKTFPSVDMVF 273


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/299 (24%), Positives = 130/299 (43%), Gaps = 46/299 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC+ C +C        N  D    ++ P  S T   + C
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCG-------NHQDP---KFQPDLSDTYHPVKC 51

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +        +C      C Y   Y  E +SSSG+L ED++    G  + LK       + 
Sbjct: 52  NPD-----CTCDTENDQCTYERQY-AEMSSSSGILGEDLVSF--GNMSELK---PQRAVF 100

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC   ++G      A DG++GLG G++S+   L + G+I +SFS+C+   +   G +  G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFL 339
              P +    S  +   +   Y I +    +    L             I+DSG+++ +L
Sbjct: 160 QISPPSDMVFSH-SDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYL 218

Query: 340 PKEVY----ETIAAEFD--RQVNDTITSFEGYPWKCCYKSSSQRLPKL----PSVKLMF 388
           P+  +    + I +E    +Q+     ++       C+  +   +P+L    PSV ++F
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNYN----DVCFSGAGSEIPELYKTFPSVDMVF 273


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 127/297 (42%), Gaps = 58/297 (19%)

Query: 118 VALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---- 172
           V +D GSDL W+ C  C RC       YN  D   N   PS S + + + C+   C    
Sbjct: 79  VIVDTGSDLSWVQCQPCNRC-------YNQQDPVFN---PSKSPSYRTVLCNSLTCRSLQ 128

Query: 173 ----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 228
               + G    NP   C Y ++Y   + +S  + +E   HL       L N+   + I G
Sbjct: 129 LATGNSGVCGSNPPT-CNYVVNYGDGSYTSGEVGME---HL------NLGNTTVNNFIFG 178

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRIFFG 285
           CG K  G  L G A  GL+GLG  ++S+ S ++   +    FS C    + + SG +  G
Sbjct: 179 CGRKNQG--LFGGA-SGLVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMG 233

Query: 286 DQGPATQQSTSF----LASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTF 338
                 + +T      +  N     Y + +    +G   ++  SF   + I+DSG+  + 
Sbjct: 234 GNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISR 293

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMF 388
           LP  +Y+ + AEF +Q       F GYP          C+  S  +  K+P +K+ F
Sbjct: 294 LPPSIYQALKAEFVKQ-------FSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYF 343


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 134/333 (40%), Gaps = 54/333 (16%)

Query: 74  KMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDC 133
           K+ T   F    P +    + LG      +   +  GTP    L+  D GSDL+W+ C  
Sbjct: 29  KLATTTSFWAESPMESGAFLGLGQ-----YLVSMAFGTPPQEVLLIADTGSDLIWLQCST 83

Query: 134 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-------GTSCQNPKQPCP 186
               P          R    +  S S+T   + CS   C L       G +C +P  P P
Sbjct: 84  TAAPPAFCPKKACSRRP--AFVASKSATLSVVPCSAAQCLLVPAPRGHGPAC-SPAAPVP 140

Query: 187 YTMDY-YTENTSSSGLLVEDILHLISG--GDNALKNSVQASVIIGCGMKQSGGYLDGVAP 243
               Y Y + +S++G L  D   + +G  G  A++      V  GCG +  GG   G   
Sbjct: 141 CGYAYDYADGSSTTGFLARDTATISNGTSGGAAVRG-----VAFGCGTRNQGGSFSGTG- 194

Query: 244 DGLIGLGLGEISVPSLLAKAG-LIRNSFSMCFDKDDSGR-------IFFGDQGPATQQST 295
            G+IGLG G++S P   A++G L   +FS C    + GR       +F G        + 
Sbjct: 195 -GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAY 250

Query: 296 SFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK----------AIVDSGSSFTFLPKEVY 344
           + L SN    T Y +GV    +G+  L     +           ++DSGS+ T+L    Y
Sbjct: 251 TPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAY 310

Query: 345 ETIAAEFDRQVN-----DTITSFEGYPWKCCYK 372
             + + F   V+      + T F+G   + CY 
Sbjct: 311 LHLVSAFAASVHLPRIPSSATFFQG--LELCYN 341


>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 362

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 95/194 (48%), Gaps = 35/194 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAP----LSASY-----------------YNS 146
           IGTP   F + +D+GS + ++PC DC +C      LS+                   Y  
Sbjct: 98  IGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQVMLSSPKDQILCLVSCKVQIFKISYGL 157

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
            D D  ++ P  SST + + C     ++  +C + K+ C Y  +Y  E++SS G+L ED 
Sbjct: 158 FDED-PKFQPELSSTYQPVKC-----NMDCNCDDDKEQCVYEREY-AEHSSSKGVLGED- 209

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
             LIS G+ +     +A  + GC   ++G      A DG+IGLG G++S+   L   GLI
Sbjct: 210 --LISFGNESHLTPQRA--VFGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDKGLI 264

Query: 267 RNSFSMCFDKDDSG 280
            NSF +C+   D G
Sbjct: 265 SNSFGLCYGGLDVG 278


>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
 gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
          Length = 379

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/262 (29%), Positives = 118/262 (45%), Gaps = 40/262 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVR--CAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           ++IG P+  + + +D GSDL W+ CD  R  C      YY   +  +    P   S   H
Sbjct: 24  LNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYYKPSNNLVACKDPICQSL--H 81

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
                R       C+NP Q C Y ++Y  +  SS G+LV+D  +L     N      Q+ 
Sbjct: 82  TGGDQR-------CENPGQ-CDYEVEY-ADGGSSLGVLVKDAFNL-----NFTSEKRQSP 127

Query: 225 VI-IG-CGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR 281
           ++ +G CG  Q  GG    +  DG++GLG G+ S+ S L+  GL+RN    C     SGR
Sbjct: 128 LLALGLCGYDQLPGGTYHPI--DGVLGLGRGKPSIVSQLSGLGLVRNVIGHCL----SGR 181

Query: 282 IFFGDQGPATQQSTSFLA-----SNGKYITYIIGVETCCIGSSCLKQTSFKAIV---DSG 333
                        +S +A      N K+  Y  G           K T FK ++   DSG
Sbjct: 182 GGGFLFFGDDLYDSSRVAWTPMSPNAKH--YSPGFAELTFDG---KTTGFKNLIVAFDSG 236

Query: 334 SSFTFLPKEVYETIAAEFDRQV 355
           +S+T+L  +VY+ + +   R++
Sbjct: 237 ASYTYLNSQVYQGLISLIKREL 258


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 135/332 (40%), Gaps = 60/332 (18%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD 148
           G + + + N     +   + +GTP     + LD  +D  W+PC    C   S++      
Sbjct: 89  GQQVLKIAN-----YVVRVKLGTPGQQMFMVLDTSNDAAWVPCS--GCTGFSST------ 135

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
                + P+AS+T   L CS   C    G SC             Y  ++S +  LV+D 
Sbjct: 136 ----TFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDA 191

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +         L N V      GC    SGG    + P GL+GLG G I   SL+++AG +
Sbjct: 192 I--------TLANDVIPGFTFGCINAVSGG---SIPPQGLLGLGRGPI---SLISQAGAM 237

Query: 267 RNS-FSMCFDKDD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
            +  FS C         SG +  G  G P + ++T  L +  +   Y + +    +G   
Sbjct: 238 YSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIK 297

Query: 321 L----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC 370
           +            T    I+DSG+  T   + VY  I  EF +QVN  I+S   +    C
Sbjct: 298 VPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLGAF--DTC 355

Query: 371 YKSSSQRLPKLPSVKLMF-------PQNNSFV 395
           + ++++   + P++ L F       P  NS +
Sbjct: 356 FAATNEA--EAPAITLHFEGLNLVLPMENSLI 385


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 136/314 (43%), Gaps = 54/314 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  +  + LD GS+L W+      CAP          R    + P AS T   + 
Sbjct: 69  LAVGTPPQNVTMVLDTGSELSWL-----LCAPGGGGGGGG--RSALSFRPRASLTFASVP 121

Query: 167 CSHRLC---DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C    C   DL +  +C    + C  ++  Y + +SS G L  ++  +  G        +
Sbjct: 122 CGSAQCRSRDLPSPPACDGASKQCRVSLS-YADGSSSDGALATEVFTVGQG------PPL 174

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 280
           +A+   GC         DGVA  GL+G+  G +   S +++A   R  FS C  D+DD+G
Sbjct: 175 RAA--FGCMATAFDTSPDGVATAGLLGMNRGAL---SFVSQASTRR--FSYCISDRDDAG 227

Query: 281 RIFFG---------DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSF 326
            +  G         +  P  Q +        +A + + +   +G +   I +S L     
Sbjct: 228 VLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHT 287

Query: 327 KA---IVDSGSSFTFLPKEVYETIAAEFDRQ-------VNDTITSFEGYPWKCCYKSSSQ 376
            A   +VDSG+ FTFL  + Y  + AEF RQ       +ND   +F+   +  C++    
Sbjct: 288 GAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEA-FDTCFRVPQG 346

Query: 377 RLP--KLPSVKLMF 388
           R P  +LP+V L+F
Sbjct: 347 RAPPARLPAVTLLF 360


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/299 (27%), Positives = 127/299 (42%), Gaps = 47/299 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +G P    L  +D GS++LW+ C  C RC   +    +          PS SST   L C
Sbjct: 105 MGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLD----------PSKSSTYASLPC 154

Query: 168 SHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQAS 224
           ++ +C    S   N    C Y + Y T   SS+G+L  +  I H    G NA+      S
Sbjct: 155 TNTMCHYAPSAYCNRLNQCGYNLSYAT-GLSSAGVLATEQLIFHSSDEGVNAVP-----S 208

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----- 279
           V+ GC   ++G Y D     G+ GLG G   + S + + G   + FS C           
Sbjct: 209 VVFGCS-HENGDYKDRRF-TGVFGLGKG---ITSFVTRMG---SKFSYCLGNIADPHYGY 260

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYI----IGVETCCIGSSC--LKQTSFKAIVDSG 333
            ++ FG++      ST     NG Y   +    +G +   I S+   +K     A++DSG
Sbjct: 261 NQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSG 320

Query: 334 SSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---CCYKSS-SQRLPKLPSVKLMF 388
           ++ T+L +  +  +  E  + ++  +  F    W+    CYK + SQ L   P V   F
Sbjct: 321 TALTWLAESAFRALDNEVRQLLDGVLMPF----WRGSFACYKGTVSQDLIGFPVVTFHF 375


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 66/256 (25%), Positives = 116/256 (45%), Gaps = 32/256 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P  S++ + L C
Sbjct: 82  IGTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPELSTSYQALKC 131

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                +   +C +  + C Y    Y E +SSSG+L ED   LIS G+ +  +  +A  + 
Sbjct: 132 -----NPDCNCDDEGKLCVYER-RYAEMSSSSGVLSED---LISFGNESQLSPQRA--VF 180

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC  +++G      A DG++GLG G++SV   L   G+I + FS+C+   +   G +  G
Sbjct: 181 GCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 239

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFL 339
              P      S  +   +   Y I ++   +    LK            ++DSG+++ + 
Sbjct: 240 KISPPPGMVFSH-SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYF 298

Query: 340 PKEVYETIAAEFDRQV 355
           PKE +  I     +++
Sbjct: 299 PKEAFIAIKDAVIKEI 314


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 136/314 (43%), Gaps = 54/314 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  +  + LD GS+L W+      CAP          R    + P AS T   + 
Sbjct: 70  LAVGTPPQNVTMVLDTGSELSWL-----LCAPGGGGGGGG--RSALSFRPRASLTFASVP 122

Query: 167 CSHRLC---DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C    C   DL +  +C    + C  ++  Y + +SS G L  ++  +  G        +
Sbjct: 123 CDSAQCRSRDLPSPPACDGASKQCRVSLS-YADGSSSDGALATEVFTVGQG------PPL 175

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 280
           +A+   GC         DGVA  GL+G+  G +   S +++A   R  FS C  D+DD+G
Sbjct: 176 RAA--FGCMATAFDTSPDGVATAGLLGMNRGAL---SFVSQASTRR--FSYCISDRDDAG 228

Query: 281 RIFFG---------DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSF 326
            +  G         +  P  Q +        +A + + +   +G +   I +S L     
Sbjct: 229 VLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHT 288

Query: 327 KA---IVDSGSSFTFLPKEVYETIAAEFDRQ-------VNDTITSFEGYPWKCCYKSSSQ 376
            A   +VDSG+ FTFL  + Y  + AEF RQ       +ND   +F+   +  C++    
Sbjct: 289 GAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEA-FDTCFRVPQG 347

Query: 377 RLP--KLPSVKLMF 388
           R P  +LP+V L+F
Sbjct: 348 RAPPARLPAVTLLF 361


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 91/333 (27%), Positives = 132/333 (39%), Gaps = 51/333 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + +GTP     + LD GSDL+W  C  C+ C    A+    LD       P+ASST   L
Sbjct: 94  VSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAA--PVLD-------PAASSTHAAL 144

Query: 166 SCSHRLCDL--GTSCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
            C   LC     TSC       + C Y   +Y + + + G L  D      GGD+     
Sbjct: 145 PCDAPLCRALPFTSCGGRSWGDRSCVYVY-HYGDRSLTVGQLATDSFTF--GGDDNAGGL 201

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DK 276
               V  GCG    G +       G+ G G G  S+PS L        SFS CF    D 
Sbjct: 202 AARRVTFGCGHINKGIFQ--ANETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFDT 254

Query: 277 DDSGRIFFGDQGPATQQSTSFLASNGKYIT------------YIIGVETCCIGSS--CLK 322
             S  +  G    A    T   A  G   T            Y + +    +G +   + 
Sbjct: 255 KSSSVVTLG-AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVP 313

Query: 323 QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---SSSQR 377
           ++  ++  I+DSG+S T LP++VYE + AEF  QV     +        C+    ++  R
Sbjct: 314 ESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWR 373

Query: 378 LPKLPSVKLMFPQNNSFVV--NNPVFVIYGTQV 408
            P +P++ L       + +   N VF  Y  +V
Sbjct: 374 RPAVPALTLHLDGGADWELPRGNYVFEDYAARV 406


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 66/256 (25%), Positives = 116/256 (45%), Gaps = 32/256 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P  S++ + L C
Sbjct: 82  IGTPPQEFALIVDTGSTVTYVPCSTCKQCG----------KHQDPKFQPELSTSYQALKC 131

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                +   +C +  + C Y    Y E +SSSG+L ED   LIS G+ +  +  +A  + 
Sbjct: 132 -----NPDCNCDDEGKLCVYER-RYAEMSSSSGVLSED---LISFGNESQLSPQRA--VF 180

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC  +++G      A DG++GLG G++SV   L   G+I + FS+C+   +   G +  G
Sbjct: 181 GCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 239

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFL 339
              P      S  +   +   Y I ++   +    LK            ++DSG+++ + 
Sbjct: 240 KISPPPGMVFSH-SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYF 298

Query: 340 PKEVYETIAAEFDRQV 355
           PKE +  I     +++
Sbjct: 299 PKEAFIAIKDAVIKEI 314


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 161/417 (38%), Gaps = 62/417 (14%)

Query: 5   SLTIYLAVFWLLT-----ESSGAETVMFSTKLIHR------FSEEVKALGVSKNRNATSW 53
           SL +  +  + LT     E S  +     TKLIHR      +      +     R   + 
Sbjct: 10  SLPLIFSTHFALTIANNLEFSSIQPTRLVTKLIHRDSIVSPYYRSNDTVADRTERTMKAS 69

Query: 54  PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPN 113
            A+ S+ Y ++    D+    +   P          S+ + L N           +G P 
Sbjct: 70  LARLSYLYAKIERDFDINDLWLNLHPS--------ASEPLFLVN---------FSMGQPP 112

Query: 114 VSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
           V  L  +D GS LLWI   C  C   S      +      + PS SST   LSC + +C 
Sbjct: 113 VPQLAIMDTGSSLLWI--QCAPCKSCSQQIIGPM------FDPSISSTYDSLSCKNIICR 164

Query: 174 LGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 231
              S  C +  Q C Y   Y  E   S G++  +   LI G  +  +N+V  +V+ GC  
Sbjct: 165 YAPSGECDSSSQ-CVYNQTY-VEGLPSVGVIATE--QLIFGSSDEGRNAVN-NVLFGCSH 219

Query: 232 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQ 287
           + +G Y D     G+ GLG G  SV + +       + FS C     D D S       +
Sbjct: 220 R-NGNYKDRRFT-GVFGLGSGITSVVNQMG------SKFSYCIGNIADPDYSYNQLVLSE 271

Query: 288 GPATQ-QSTSFLASNGKYITYIIGVET----CCIGSSCLKQTS--FKAIVDSGSSFTFLP 340
           G   +  ST     +G Y   + G+        I  S  K+T    + I+DSG++ T+L 
Sbjct: 272 GVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLA 331

Query: 341 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
           +  Y  +  E    ++  +T F    + C      Q L   P+V   F +    VV+
Sbjct: 332 ENEYRALEREVRNLLDRFLTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEGADLVVD 388


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 130/300 (43%), Gaps = 45/300 (15%)

Query: 103 HYTWI---DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSA 158
           HY ++    IGTP V     +D GSDL+W+     +C P +  Y     + LN  + P +
Sbjct: 56  HYDYLMELSIGTPPVKTYAQVDTGSDLIWL-----QCIPCTNCY-----KQLNPMFDPQS 105

Query: 159 SSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGD 214
           SST  +++     C     TSC   +  C YT   Y +++ + G+L ++ L L S  G  
Sbjct: 106 SSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYS-YEDDSITEGVLAQETLTLTSTTGKP 164

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            ALK      VI GCG   +G + D     G+IGLG G +S+ S +  +      FS C 
Sbjct: 165 VALK-----GVIFGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGSS-FGGKMFSQCL 216

Query: 275 -----DKDDSGRIFFGDQGPATQQ---STSFLASNGK---YITYIIGVETCCI------G 317
                +   +  + FG           ST  ++ N     Y   ++G+    I      G
Sbjct: 217 VPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDG 276

Query: 318 SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQ 376
           SS    T    ++DSG+  T LP++ Y  +  E   +V  D I       ++ CY++ + 
Sbjct: 277 SSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTN 336


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 121/312 (38%), Gaps = 49/312 (15%)

Query: 97  NDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYS 155
           + F +  Y   +  GTP     + LD GSD+ W    C RC P SA +    ++ L  + 
Sbjct: 81  DGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITWT--QCKRC-PASACF----NQTLPLFD 133

Query: 156 PSASSTSKHLSCSHRLCDLGTSC----QNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
           PSASS+   L CS   C+    C        +PC Y++  Y + + S G +  ++    S
Sbjct: 134 PSASSSFASLPCSSPACETTPPCGGGNDATSRPCNYSIS-YGDGSVSRGEIGREVFTFAS 192

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
           G       +V   ++ GCG    G +       G+ G G G +S+PS L K G    +FS
Sbjct: 193 GTGEGSSAAVPG-LVFGCGHANRGVFTSNET--GIAGFGRGSLSLPSQL-KVG----NFS 244

Query: 272 MCFDK---DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
            CF       +  +  G  G A   ++      G Y                 +  S   
Sbjct: 245 HCFTTITGSKTSAVLLGLPGVAPPSASPLGRRRGSY-----------------RCRSTPR 287

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKSSSQRLPKLPSVKLM 387
             +SG+S T LP   Y  +  EF  QV   +       P+ C         P +P++ L 
Sbjct: 288 SSNSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALH 347

Query: 388 F-------PQNN 392
           F       PQ N
Sbjct: 348 FEGATMRLPQEN 359


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/315 (27%), Positives = 129/315 (40%), Gaps = 55/315 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP     + LD GSDL+W      +CAP    ++  L        P+ASST   L 
Sbjct: 96  LAVGTPPRPVALTLDTGSDLVW-----TQCAPCRDCFHQGLPL----LDPAASSTYAALP 146

Query: 167 CSHRLCDL--GTSCQ--------NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           C    C     TSC         N  + C Y + +Y + + + G +  D      GGDN 
Sbjct: 147 CGAPRCRALPFTSCGGGGRSSWGNGNRSCAY-IYHYGDKSVTVGEIATD--RFTFGGDNG 203

Query: 217 LKNSVQAS--VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
             +S   +  +  GCG    G +       G+ G G G  S+PS L        +FS CF
Sbjct: 204 DGDSRLPTRRLTFGCGHFNKGVFQSNET--GIAGFGRGRWSLPSQLNV-----TTFSYCF 256

Query: 275 D---KDDSGRIFFGDQGPATQ-------------QSTSFLASNGKYITYIIGVETCCIGS 318
               +  S  +  G   PA               ++T  L +  +   Y + ++   +G 
Sbjct: 257 TSMFESKSSLVTLGG-APAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGK 315

Query: 319 SCLKQTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS-FEGYPWKCCYK-- 372
           + L     K    I+DSG+S T LP+ VYE + AEF  QV    T   EG     C+   
Sbjct: 316 TRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALP 375

Query: 373 -SSSQRLPKLPSVKL 386
            ++  R P +PS+ L
Sbjct: 376 VTALWRRPPVPSLTL 390


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 66/245 (26%), Positives = 107/245 (43%), Gaps = 32/245 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC DC  C                 + P  SST   + C
Sbjct: 94  IGTPPQEFALIVDTGSTVTYVPCSDCEHCG----------KHQDPRFQPDESSTYHPVKC 143

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                ++  +C +    C Y   Y  E +SSSG+L EDI   IS G+ +    V    + 
Sbjct: 144 -----NMDCNCDHDGVNCVYERRY-AEMSSSSGVLGEDI---ISFGNQS--EVVPQRAVF 192

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--KDDSGRIFFG 285
           GC   ++G      A DG++GLG G++S+   L    +I +SFS+C+       G +  G
Sbjct: 193 GCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLG 251

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFL 339
              P      S  +   +   Y I ++   +    LK            ++DSG+++ +L
Sbjct: 252 GIPPPPDMVFS-RSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYL 310

Query: 340 PKEVY 344
           P+E +
Sbjct: 311 PEEAF 315


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 125/311 (40%), Gaps = 29/311 (9%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G L +   +  GTP  ++ +  D GSD+ WI     +C P S   Y   D    
Sbjct: 110 STGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI-----QCLPCSGHCYKQHD---P 161

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
            + P+ S+T   + C H  C       +    C Y +  Y + +S++G+L  + L L S 
Sbjct: 162 IFDPTKSATYSAVPCGHPQCAAAGGKCSSNGTCLYKVQ-YGDGSSTAGVLSHETLSLTSA 220

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
              AL          GCG    G + D    DGLIGLG G++S+ S  A +     S+ +
Sbjct: 221 --RALPG-----FAFGCGETNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCL 270

Query: 273 CFDKDDSGRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLKQ----- 323
                  G +  G   PA+     + T+ +        Y + + +  +G   L       
Sbjct: 271 PSYNTSHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILF 330

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
           T    ++DSG+  T+LP E Y  +   F   +     +    P+  CY  + Q    +P 
Sbjct: 331 TRDGTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPL 390

Query: 384 VKLMFPQNNSF 394
           V   F   +SF
Sbjct: 391 VSFKFSDGSSF 401


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 125/317 (39%), Gaps = 36/317 (11%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD 148
           G ++  +   F +L Y  +++GTP    L   D GSDL+W+ C        S        
Sbjct: 88  GVESKIITRSFEYLMY--VNVGTPPAQMLAIADTGSDLVWVNCS-------SNGGGGGAS 138

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
                + PS S+T   LSC    C  L  +  +    C Y    Y + + + G+L  +  
Sbjct: 139 DGAVVFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQY-AYGDGSRTIGVLSTETF 197

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
              + G           V  GC    +G +      DGL+GLG G +S+ S L  A  I 
Sbjct: 198 SFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAARIA 253

Query: 268 NSFSMCF-----DKDDSGRIFFG------DQGPATQQSTSFLASNGKYITYIIGVETCCI 316
             FS C        + S  + FG      D G A   ST  + S      Y + +E+  +
Sbjct: 254 RRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAA---STPLVPSEVDSY-YTVALESVAV 309

Query: 317 -GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY---- 371
            G       S + IVDSG++ TFL   +   + AE +R++            + CY    
Sbjct: 310 AGQDVASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQG 369

Query: 372 KSSSQRLPKLPSVKLMF 388
           KS ++    +P V L F
Sbjct: 370 KSQAEDF-GIPDVTLRF 385


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 161/385 (41%), Gaps = 76/385 (19%)

Query: 51  TSWPAKKSFEYYQVLLSSDVQKQKMKTGPQ--FQMLFPSQGSKTMSLGNDFGWLHYTWID 108
           T  P+   +EY   L ++ + +      P+  F ++      KT      +G    + + 
Sbjct: 37  TKRPSSDPWEYLNHLATTSISRAHHLKSPKTNFSLI------KTPLFSRSYGGYSMS-LS 89

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP+ +  + +D GS L+W PC   R    S ++ N+    + ++ P  SS+SK + C 
Sbjct: 90  LGTPSQTVKLIMDTGSSLVWFPCTS-RYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCK 148

Query: 169 HRLCD--LGTSCQ------NPK-----QPC-PYTMDYYTENTSSSGLLVEDILHLISGGD 214
           +  C    G+S Q      NP+     Q C PY + Y   +T  +GLL+ + ++      
Sbjct: 149 NPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGST--AGLLLSETIN------ 200

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC- 273
               N   +  + GC +      L    P+G+ G G  + S+P  L   GL + S+ +  
Sbjct: 201 --FPNKTISDFLAGCSL------LSTRQPEGIAGFGRSQESLPLQL---GLKKFSYCLVS 249

Query: 274 --FDKDDSGRIFFGDQGPATQQS-------TSF---LASNGKYI---TYIIGVETCCIGS 318
             FD          D GP+T  S       T F   LAS         Y + +    +G 
Sbjct: 250 RRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGK 309

Query: 319 SCLK-QTSF---------KAIVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFE 363
           + +K   SF           IVDSGS+FTF+   V+E +A EF++Q     V   +    
Sbjct: 310 THVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLT 369

Query: 364 GYPWKCCYKSSSQRLPKLPSVKLMF 388
           G   + C+  S ++   +P +   F
Sbjct: 370 G--LRPCFDISGEKSVVIPDLTFQF 392


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 170/414 (41%), Gaps = 72/414 (17%)

Query: 3   RISLTIYLAVFWLLTESSG-----AETVMFSTKLIHRFSEEVKALGVSKNR-----NATS 52
           R  L+  L++ +L    SG     AE + F+T+LIHR S        S+       NA  
Sbjct: 8   RTLLSFALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVE 67

Query: 53  WPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTP 112
             A +    +  L+S+ +   +          FPS     +    DF       I IG P
Sbjct: 68  RSADR-VNRFNDLISNSITAAE----------FPS-----ILDNGDF----LMKISIGIP 107

Query: 113 NVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
               LV +  GSDL+WIP  C+   P +       + DL  + P  SST K++ C    C
Sbjct: 108 PTELLVNVATGSDLVWIP--CLSFKPCTH------NCDLRFFDPMESSTYKNVPCDSYRC 159

Query: 173 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 232
            +  +       C Y+ D   +++   G L  D L L S      K+ +  +    CG +
Sbjct: 160 QITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNS---TTGKSFMLPNTGFICGNR 216

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGP 289
             G Y  GV   G++GLG G +S+ + ++   LI   FS C   +  + + ++ FGD+  
Sbjct: 217 IGGDY-PGV---GILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSFGDKAV 270

Query: 290 ATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI-------VDSGSSFTFL 339
            +     ST    + G Y +Y +      +G+  +      +        +DSG+ FT+ 
Sbjct: 271 VSGSAMFSTRLDMTGGPY-SYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYF 329

Query: 340 PKEVYETIAAEFDRQVNDTITSFEGYP-----WKCCYKSSSQRLPKLPSVKLMF 388
           P+  Y  +  E+D  V   I     YP      + CY+ S    P  P++ + F
Sbjct: 330 PEYFYSQL--EYD--VRYAIQQEPLYPDPTRRLRLCYRYSPDFSP--PTITMHF 377


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/300 (25%), Positives = 122/300 (40%), Gaps = 40/300 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + +GTP    +   D GSDL+W  C  C RC       Y  +D     + P +S T +  
Sbjct: 99  LSLGTPPFKIMGIADTGSDLIWTQCKPCERC-------YKQVDP---LFDPKSSKTYRDF 148

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-AS 224
           SC  R C L          C Y    Y + + + G +  D + L    D+   + V    
Sbjct: 149 SCDARQCSLLDQSTCSGNICQYQYS-YGDRSYTMGNVASDTITL----DSTTGSPVSFPK 203

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 279
            +IGCG +  G + D     G++GLG G +S+ S +  +  +   FS C         +S
Sbjct: 204 TVIGCGHENDGTFSD--KGSGIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNS 259

Query: 280 GRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGS-------SCLKQTSFKA 328
            ++ FG      GP   QST  L+S      Y + +E   +G+       S L       
Sbjct: 260 SKLNFGSNAVVSGPGV-QSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNI 318

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           I+DSG++ T +P + +  ++     QV              CY ++S    K+P++   F
Sbjct: 319 IIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDL--KVPAITAHF 376


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 150/352 (42%), Gaps = 64/352 (18%)

Query: 27  FSTKLIHRFSEEVKALGVSKNR-NATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           F+ +LIHR S +      S+   +      ++S     V+L SD  +      P F    
Sbjct: 27  FTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLESDTAE-----APIFNN-- 79

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
                         G  +   I +GTP  S +   D GSD++W      +C P S  Y  
Sbjct: 80  --------------GGEYLVEISVGTPPFSIVAVADTGSDVIW-----TQCKPCSNCY-- 118

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQNPKQPCPYTMDYYTENTSSSGLL 202
              ++   + PS S+T K+++CS  +C     G+SC +  + C Y++ Y  ++ S   L 
Sbjct: 119 --QQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSE-CLYSIAYGDDSHSQGNLA 175

Query: 203 VEDI-LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           V+ + +   SG   A   +V     IGCG   +G +   V+  G++GLG G  S+ + L 
Sbjct: 176 VDTVTMQSTSGRPVAFPRTV-----IGCGHDNAGTFNANVS--GIVGLGRGPASLVTQLG 228

Query: 262 KAGLIRNSFSMCF------DKDDSGRIFFGDQGPATQQST--SFLASNGKYIT-YIIGVE 312
            A      FS C         +DS ++ FG     +   T  + + S+ +Y T Y + +E
Sbjct: 229 PA--TGGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLE 286

Query: 313 TCCI---------GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
              +         G+S L   S   I+DSG++ T+LP  +  +  +   + +
Sbjct: 287 AVSVGDTKFNFPEGASKLGGES-NIIIDSGTTLTYLPSALLNSFGSAISQSM 337


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/301 (27%), Positives = 121/301 (40%), Gaps = 41/301 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG P V F+   D GSDL W  C  C  C P          +D   Y PSASST   L
Sbjct: 75  LAIGKPPVPFVALADTGSDLTWTQCQPCKLCFP----------QDTPVYDPSASSTFSPL 124

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            CS   C  + +    P   C Y    Y +   S+G+L  + L L   G ++   SV   
Sbjct: 125 PCSSATCLPIWSRNCTPSSLCRYRY-AYGDGAYSAGILGTETLTL---GPSSAPVSV-GG 179

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRI 282
           V  GCG    G   D +   G +GLG G +   SLLA+ G+ + S+ +   F+       
Sbjct: 180 VAFGCGTDNGG---DSLNSTGTVGLGRGTL---SLLAQLGVGKFSYCLTDFFNSALDSPF 233

Query: 283 FFGD-----QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFK 327
             G       GP+T QST  L S      Y + ++   +G   L             +  
Sbjct: 234 LLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGG 293

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
            IVDSG++FT L +  +  +     R +     +        C+ + +   P +P + L 
Sbjct: 294 MIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CFPAPAGEPPYMPDLVLH 352

Query: 388 F 388
           F
Sbjct: 353 F 353


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 103/394 (26%), Positives = 153/394 (38%), Gaps = 65/394 (16%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
           + +   + L E + A    FS  LIHR S        SK +     +A      +   + 
Sbjct: 13  VVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFR 72

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
              ++SD  + +         + PS G   M+L             IGTP V  +  +D 
Sbjct: 73  PTAMTSDGIQSR---------IVPSAGEYLMNL------------YIGTPPVPVIAIVDT 111

Query: 123 GSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT--SC 178
           GSDL W  C  C  C       Y  +   +  + P  SST +  SC    C  LG   SC
Sbjct: 112 GSDLTWTQCRPCTHC-------YKQV---VPLFDPKNSSTYRDSSCGTSFCLALGKDRSC 161

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 238
              K+ C +    Y + + + G L  + L + S    A K         GCG   SGG  
Sbjct: 162 SKEKK-CTFRYS-YADGSFTGGNLASETLTVDS---TAGKPVSFPGFAFGCG-HSSGGIF 215

Query: 239 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ 293
           D  +  G++GLG GE+S+ S L     I   FS C      D   S RI FG  G  +  
Sbjct: 216 DK-SSSGIVGLGGGELSLISQLKST--INGLFSYCLLPVSTDSSISSRINFGASGRVSGY 272

Query: 294 ST--SFLASNGKYITYIIGVETCCIGSSCL------KQTSFKA---IVDSGSSFTFLPKE 342
            T  + L        Y + +E   +G   L      K+T  +    IVDSG+++TFLP+E
Sbjct: 273 GTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQE 332

Query: 343 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
            Y  +       +           +  CY ++++
Sbjct: 333 FYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 366


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 93/329 (28%), Positives = 140/329 (42%), Gaps = 46/329 (13%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   I +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 172 SSGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVVCYKQQEK--- 223

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P+ SST  ++SC+   C DL T  C      C Y++  Y + + S G    D L L 
Sbjct: 224 LFDPARSSTYANVSCAAPACSDLYTRGCSGGH--CLYSVQ-YGDGSYSIGFFAMDTLTLS 280

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 281 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 327

Query: 270 FSMCFDKDDSGRIF--FGDQGPA---TQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
           F+ C     SG  +  FG   PA    +Q+T  L  NG    Y +G+    +G   L   
Sbjct: 328 FAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIP 386

Query: 323 QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSS 375
           Q+ F     IVDSG+  T LP   Y ++ + F   +      ++  P       CY  + 
Sbjct: 387 QSVFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAM--AARGYKKAPALSLLDTCYDFTG 444

Query: 376 QRLPKLPSVKLMFPQNNSFVVNNPVFVIY 404
                +P V L+F Q  +++  N   ++Y
Sbjct: 445 MSEVAIPKVSLLF-QGGAYLDVNASGIMY 472


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/384 (23%), Positives = 149/384 (38%), Gaps = 75/384 (19%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
           ++  +F  +   S A    F+ +LIHR S +      ++N+     NA      +   +Y
Sbjct: 10  LFFTIFCFIISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFY 69

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
           +  L+S  Q        ++ M +                       IGTP       +D 
Sbjct: 70  KYSLTSTPQSTVNSDKGEYLMSY----------------------SIGTPPFKVFGFVDT 107

Query: 123 GSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP 181
           GSDL+W+ C+ C +C P     ++          PS SS+ +++ C      L  +C + 
Sbjct: 108 GSDLVWLQCEPCKQCYPQITPIFD----------PSLSSSYQNIPC------LSDTCHSM 151

Query: 182 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDG 240
           +          T +    G L  + L L    D+    SV     +IGCG + +G +   
Sbjct: 152 R----------TTSCDVRGYLSVETLTL----DSTTGYSVSFPKTMIGCGYRNTGTFHG- 196

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGD------QGPAT 291
               G++GLG G +S+PS L  +  I   FS C      + + ++ FGD       G  T
Sbjct: 197 -PSSGIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGDAAIVYGDGAMT 253

Query: 292 QQSTSFLASNGKYIT---YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIA 348
                  A +G Y+T   + +G +    G           ++DSG++FTFLP +VY    
Sbjct: 254 TPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFTFLPYDVYYRFE 313

Query: 349 AEFDRQVNDTITSFEGYPWKCCYK 372
           +     +N          +K CY 
Sbjct: 314 SAVAEYINLEHVEDPNGTFKLCYN 337


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/293 (27%), Positives = 136/293 (46%), Gaps = 45/293 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +G P       +D GSD++W+ C  C +C       YN   R    + PS S+T K L  
Sbjct: 92  VGIPPFQLYGIIDTGSDMIWLQCKPCEKC-------YNQTTR---IFDPSKSNTYKILPF 141

Query: 168 SHRLCD--LGTSC-QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           S   C     TSC  + ++ C YT+ YY + + S G L  + L L S   +++K      
Sbjct: 142 SSTTCQSVEDTSCSSDNRKMCEYTI-YYGDGSYSQGDLSVETLTLGSTNGSSVKFR---R 197

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCFD--KDDSGR 281
            +IGCG   +  + +G +  G++GLG G +S +  L  ++  I   FS C     + S +
Sbjct: 198 TVIGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSK 255

Query: 282 IFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCLKQT--SFK------AIVD 331
           + FGD    +   T  + + ++   + Y + +E   +G++ ++ T  SF+       I+D
Sbjct: 256 LNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIID 315

Query: 332 SGSSFTFLPKEVYETIAA------EFDRQVNDTITSFEGYPWKCCYKSSSQRL 378
           SG++ T LP ++Y  + +      E DR V D +          CY+S+   L
Sbjct: 316 SGTTLTLLPNDIYSKLESAVADLVELDR-VKDPLKQLS-----LCYRSTFDEL 362


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/342 (25%), Positives = 128/342 (37%), Gaps = 63/342 (18%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G  + T I +GTP   F V  D GSDL+WI C  C  C       +N  D     + P  
Sbjct: 37  GGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQAC-------FNQKDP---IFDPEG 86

Query: 159 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLIS--GG 213
           SS+   +SC   LCD       P++ C    DY   Y + + + G L  + + L S  G 
Sbjct: 87  SSSYTTMSCGDTLCD-----SLPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGE 141

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
             A KN     +  GCG    G + D     GL+GLG G +S  S L    L  + FS C
Sbjct: 142 KLAAKN-----IAFGCGHLNRGSFNDA---SGLVGLGRGNLSFVSQLGD--LFGHKFSYC 191

Query: 274 F-----DKDDSGRIFFGDQGPATQQS-------TSFLASNGKYITYIIGVETCCIGSSCL 321
                     +  +FFGD+  +           T  + +      Y + ++   I    L
Sbjct: 192 LVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRAL 251

Query: 322 K----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
           +            S   I DSG++ T LP   Y+ +      +V+             CY
Sbjct: 252 RIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCY 311

Query: 372 KSSSQRL---PKLPSVKLMF-------PQNNSFVVNNPVFVI 403
             S  +     K+P++   F       P  N F+  N    I
Sbjct: 312 DVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTI 353


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 82/307 (26%), Positives = 129/307 (42%), Gaps = 53/307 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           I IGTP +     LD GSDL+W  CD  C RC P  A            Y+P+ S+T  +
Sbjct: 96  IAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL----------YAPARSATYAN 145

Query: 165 LSCSHRLCDLGTS----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SC   +C    S    C  P   C Y    Y + TS+ G+L  +   L  G D A++  
Sbjct: 146 VSCRSPMCQALQSPWSRCSPPDTGCAYYFS-YGDGTSTDGVLATETFTL--GSDTAVRG- 201

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
               V  GCG +  G   +     GL+G+G G +   SL+++ G+ R   S C  +  + 
Sbjct: 202 ----VAFGCGTENLGSTDNS---SGLVGMGRGPL---SLVSQLGVTRPRRS-CRARAAAR 250

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFT 337
                        +TS L    + IT  +G     I  +  + T       I+DSG++FT
Sbjct: 251 GG-------GAPTTTSPL----EGIT--VGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFT 297

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP------QN 391
            L +  +  +A     +V   + S        C+ ++S    ++P + L F       + 
Sbjct: 298 ALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRR 357

Query: 392 NSFVVNN 398
            S+VV +
Sbjct: 358 ESYVVED 364


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 130/309 (42%), Gaps = 47/309 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP     + LD GSDL W      +CAP  + +  SL R    ++PS S T   L C 
Sbjct: 117 IGTPPQPVQLILDTGSDLTW-----TQCAPCVSCFRQSLPR----FNPSRSMTFSVLPCD 167

Query: 169 HRLC-DLG-TSCQNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            R+C DL  +SC         C Y    Y +++ ++G L  D     S  D+A+  +   
Sbjct: 168 LRICRDLTWSSCGEQSWGNGICVYAY-AYADHSITTGHLDSDTFSFASA-DHAIGGASVP 225

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSG 280
            +  GCG+  +G ++      G+ G   G +S+P     A L  ++FS CF      +  
Sbjct: 226 DLTFGCGLFNNGIFVSN--ETGIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPS 278

Query: 281 RIFFG----------DQGPATQQSTSFLASNGKYI-TYIIGVETCCIGSSCL-------- 321
            +F G            G    QST+ +  +   +  Y I ++   +G++ L        
Sbjct: 279 PVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFA 338

Query: 322 --KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             +  +   IVDSG+  T LP+ VY  +   F  Q   T+ +      + C+       P
Sbjct: 339 LKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKP 398

Query: 380 KLPSVKLMF 388
            +P++ L F
Sbjct: 399 DVPALVLHF 407


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 130/309 (42%), Gaps = 47/309 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP     + LD GSDL W      +CAP  + +  SL R    ++PS S T   L C 
Sbjct: 91  IGTPPQPVQLILDTGSDLTW-----TQCAPCVSCFRQSLPR----FNPSRSMTFSVLPCD 141

Query: 169 HRLC-DLG-TSCQNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            R+C DL  +SC         C Y    Y +++ ++G L  D     S  D+A+  +   
Sbjct: 142 LRICRDLTWSSCGEQSWGNGICVYAY-AYADHSITTGHLDSDTFSFASA-DHAIGGASVP 199

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSG 280
            +  GCG+  +G ++      G+ G   G +S+P     A L  ++FS CF      +  
Sbjct: 200 DLTFGCGLFNNGIFVSN--ETGIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPS 252

Query: 281 RIFFG----------DQGPATQQSTSFLASNGKYI-TYIIGVETCCIGSSCL-------- 321
            +F G            G    QST+ +  +   +  Y I ++   +G++ L        
Sbjct: 253 PVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFA 312

Query: 322 --KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             +  +   IVDSG+  T LP+ VY  +   F  Q   T+ +      + C+       P
Sbjct: 313 LKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKP 372

Query: 380 KLPSVKLMF 388
            +P++ L F
Sbjct: 373 DVPALVLHF 381


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 129/315 (40%), Gaps = 59/315 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G+P  +  + LD GS+L W+ C   + AP   S ++ L    + YSP   ++    +
Sbjct: 67  LTVGSPPQTVTMVLDTGSELSWLHC---KKAPNLHSVFDPLRS--SSYSPIPCTSP---T 118

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C  R  D        K+   + +  Y + +S  G L  D  H+         NS   + I
Sbjct: 119 CRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI--------GNSAIPATI 170

Query: 227 IGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGR 281
            GC      G+      D    GLIG+  G +S    + + GL    FS C   +D SG 
Sbjct: 171 FGC---MDSGFSSNSDEDSKTTGLIGMNRGSLS---FVTQMGL--QKFSYCISGQDSSGI 222

Query: 282 IFFGDQG----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------- 322
           + FG+            P  Q ST     +   + Y + +E   + +S L+         
Sbjct: 223 LLFGESSFSWLKALKYTPLVQISTPLPYFD--RVAYTVQLEGIKVANSMLQLPKSVYAPD 280

Query: 323 -QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSS 375
              + + +VDSG+ FTFL   VY  +  EF RQ   ++   E   +        CY+   
Sbjct: 281 HTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPL 340

Query: 376 QR--LPKLPSVKLMF 388
            R  LP LP+V LMF
Sbjct: 341 TRRTLPPLPTVTLMF 355


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 120/280 (42%), Gaps = 39/280 (13%)

Query: 91  KTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRD 150
           KT      FG  +   + +GTP   F +  D GSDL W      +C P S   +   D  
Sbjct: 120 KTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDLTW-----TQCEPCSGGCFPQNDE- 173

Query: 151 LNEYSPSASSTSKHLSCSHRLC-----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
             ++ P+ S++ K+LSCS   C     +    C +    C Y + Y T  T   G L  +
Sbjct: 174 --KFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSS-SNSCLYGVKYGTGYT--VGFLATE 228

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            L +         + V  + +IGCG +++GG   G A  GL+GLG   +++PS  +    
Sbjct: 229 TLTIT-------PSDVFENFVIGCG-ERNGGRFSGTA--GLLGLGRSPVALPSQTSST-- 276

Query: 266 IRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL- 321
            +N FS C     S  G + FG       Q+  F     K    Y + V    +G   L 
Sbjct: 277 YKNLFSYCLPASSSSTGHLSFGG---GVSQAAKFTPITSKIPELYGLDVSGISVGGRKLP 333

Query: 322 -KQTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
              + F+    I+DSG++ T+LP   +  +++ F   + +
Sbjct: 334 IDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTN 373


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 127/321 (39%), Gaps = 55/321 (17%)

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYN 145
           + G + +++GN     +   + +GTP     + LD   D  W+PC DC  C+  +     
Sbjct: 88  ASGQQVLNIGN-----YVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSSPT----- 137

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
                   +SP+ SST   L CS   C    G SC        +    Y  ++S S +L 
Sbjct: 138 --------FSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLS 189

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           +D L         L      S   GC    SG  L    P GL+GLG G +   SLL+++
Sbjct: 190 QDSL--------GLAVDTLPSYSFGCVNAVSGSTLP---PQGLLGLGRGPM---SLLSQS 235

Query: 264 G-LIRNSFSMCFDKDD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIG 317
           G L    FS CF        SG +  G  G P   ++T  L +  +   Y + +    +G
Sbjct: 236 GSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVG 295

Query: 318 SSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
              +            T    I+DSG+  T   + VY  I  EF +QV     +   +  
Sbjct: 296 RVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATIGAF-- 353

Query: 368 KCCYKSSSQRLPKLPSVKLMF 388
             C+ ++++ +   P V   F
Sbjct: 354 DTCFAATNEDI--APPVTFHF 372


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 130/309 (42%), Gaps = 47/309 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP     + LD GSDL W      +CAP  + +  SL R    ++PS S T   L C 
Sbjct: 117 IGTPPQPVQLILDTGSDLTW-----TQCAPCVSCFRQSLPR----FNPSRSMTFSVLPCD 167

Query: 169 HRLC-DLG-TSCQNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            R+C DL  +SC         C Y    Y +++ ++G L  D     S  D+A+  +   
Sbjct: 168 LRICRDLTWSSCGEQSWGNGICVYAY-AYADHSITTGHLDSDTFSFASA-DHAIGGASVP 225

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---DDSG 280
            +  GCG+  +G ++      G+ G   G +S+P     A L  ++FS CF      +  
Sbjct: 226 DLTFGCGLFNNGIFVSN--ETGIAGFSRGALSMP-----AQLKVDNFSYCFTAITGSEPS 278

Query: 281 RIFFG----------DQGPATQQSTSFLASNGKYI-TYIIGVETCCIGSSCL-------- 321
            +F G            G    QST+ +  +   +  Y I ++   +G++ L        
Sbjct: 279 PVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFA 338

Query: 322 --KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             +  +   IVDSG+  T LP+ VY  +   F  Q   T+ +      + C+       P
Sbjct: 339 LKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKP 398

Query: 380 KLPSVKLMF 388
            +P++ L F
Sbjct: 399 DVPALVLHF 407


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 116/257 (45%), Gaps = 38/257 (14%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P  SST + + C
Sbjct: 87  IGTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPDLSSTYQPVKC 136

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +     L  +C N +  C Y   Y  E ++SSG+L ED++   +  + A + +V      
Sbjct: 137 T-----LDCNCDNDRMQCVYERQY-AEMSTSSGVLGEDVVSFGNQSELAPQRAV-----F 185

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC   ++G      A DG++GLG G++S+   L    ++ +SFS+C+   D   G +  G
Sbjct: 186 GCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG 244

Query: 286 DQGPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTF 338
              P +     F  S+  +   Y I ++   +    L            +++DSG+++ +
Sbjct: 245 GISPPSDM--VFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAY 302

Query: 339 LPKEVY----ETIAAEF 351
           LP+E +    E I  E 
Sbjct: 303 LPEEAFLAFKEAIVKEL 319


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 129/315 (40%), Gaps = 59/315 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G+P  +  + LD GS+L W+ C   + AP   S ++ L    + YSP   ++    +
Sbjct: 60  LTVGSPPQTVTMVLDTGSELSWLHC---KKAPNLHSVFDPLRS--SSYSPIPCTSP---T 111

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C  R  D        K+   + +  Y + +S  G L  D  H+         NS   + I
Sbjct: 112 CRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI--------GNSAIPATI 163

Query: 227 IGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSGR 281
            GC      G+      D    GLIG+  G +S    + + GL    FS C   +D SG 
Sbjct: 164 FGC---MDSGFSSNSDEDSKTTGLIGMNRGSLS---FVTQMGL--QKFSYCISGQDSSGI 215

Query: 282 IFFGDQG----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------- 322
           + FG+            P  Q ST     +   + Y + +E   + +S L+         
Sbjct: 216 LLFGESSFSWLKALKYTPLVQISTPLPYFD--RVAYTVQLEGIKVANSMLQLPKSVYAPD 273

Query: 323 -QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSS 375
              + + +VDSG+ FTFL   VY  +  EF RQ   ++   E   +        CY+   
Sbjct: 274 HTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPL 333

Query: 376 QR--LPKLPSVKLMF 388
            R  LP LP+V LMF
Sbjct: 334 TRRTLPPLPTVTLMF 348


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 67/244 (27%), Positives = 112/244 (45%), Gaps = 30/244 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P  SST + + C
Sbjct: 19  IGTPPQRFALIVDTGSSVTYVPCSSCEQCG----------RHQDPKFQPDLSSTYQSVKC 68

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
                ++  +C + KQ C Y   Y  E ++SSG+L EDI   IS G+  L        + 
Sbjct: 69  -----NIDCNCDDEKQQCVYERQY-AEMSTSSGVLGEDI---ISFGN--LSALAPQRAVF 117

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 287
           GC   ++G      A DG++G+G G++S+   L   G+I +SFS+C+     G       
Sbjct: 118 GCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLG 176

Query: 288 GPATQQSTSFLASNG-KYITYIIGVETCCIGSS--CLKQTSFKA----IVDSGSSFTFLP 340
           G +   +  F  S+  +   Y I ++   +      L  T F      I+DSG+++ +LP
Sbjct: 177 GISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYLP 236

Query: 341 KEVY 344
           +  +
Sbjct: 237 EAAF 240


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 150/369 (40%), Gaps = 66/369 (17%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           FS KLIH+ S        S    + ++   K   +YQV   S VQK        +  +  
Sbjct: 30  FSFKLIHKNSPN------SPFYKSNNFHKNKLRSFYQVPKKSFVQKSP------YTRVTS 77

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           + G   M L             +G+P V     +D GSDL+W      +C P    Y   
Sbjct: 78  NNGDYLMKL------------TLGSPPVDIYGLVDTGSDLVW-----AQCTPCGGCYRQK 120

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
                  + P  S T   + C    C   G SC +P++ C Y+  Y   + +   L  E 
Sbjct: 121 SPM----FEPLRSKTYSPIPCESEQCSFFGYSC-SPQKMCAYSYSYADSSVTKGVLAREA 175

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAG 264
           I    + GD      V   +I GCG   SG + +           +G    P SL+++ G
Sbjct: 176 ITFSSTDGDPV----VVGDIIFGCGHSNSGTFNENDM------GIIGMGGGPLSLVSQIG 225

Query: 265 LIRNS--FSMCF-----DKDDSGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCC 315
            +  S  FS C      D   SG I FG++   + +   T+ LAS     +Y++ +E   
Sbjct: 226 TLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGIS 285

Query: 316 IGSSCLKQTSFKAI------VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP--- 366
           +G + ++  S + +      +DSG+  T++P+E YE +  E   +V  ++   E  P   
Sbjct: 286 VGDTFVRFNSSETLSKGNIMIDSGTPATYIPQEFYERLVEEL--KVQSSLLPIEDDPDLG 343

Query: 367 WKCCYKSSS 375
            + CY+S +
Sbjct: 344 TQLCYRSET 352


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 78/272 (28%), Positives = 123/272 (45%), Gaps = 38/272 (13%)

Query: 103 HYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASS 160
           HY   + IGTP        D GSDL W    CV C        N+  +  N  + P  S+
Sbjct: 71  HYLMELSIGTPPFKIYGIADTGSDLTWT--SCVPC--------NNCYKQRNPMFDPQKST 120

Query: 161 TSKHLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNAL 217
           T +++SC  +LC  L T   +P++ C YT  Y +    + G+L ++ + L S  G    L
Sbjct: 121 TYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAI-TRGVLAQETITLSSTKGKSVPL 179

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--- 274
           K      ++ GCG   +GG+ D     G+IGLG G +S+ S +  +      FS C    
Sbjct: 180 KG-----IVFGCGHNNTGGFNDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPF 231

Query: 275 --DKDDSGRIFFGDQGPATQQ---STSFLASNGK---YITYI-IGVETCCIGSSCLKQTS 325
             D   S ++ FG     + +   ST  +A   K   ++T + I VE   +  +   Q  
Sbjct: 232 HTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNV 291

Query: 326 FKA--IVDSGSSFTFLPKEVYETIAAEFDRQV 355
            K    +DSG+  T LP ++Y+ + A+   +V
Sbjct: 292 EKGNMFLDSGTPPTILPTQLYDQVVAQVRSEV 323


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 135/331 (40%), Gaps = 53/331 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   + + LD GSDL WI C  C+ C   S  YY+          P  SS+ ++++C
Sbjct: 198 IGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYD----------PKESSSFENITC 247

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
               C L +S      C++  Q CPY   Y   + ++    +E     ++  +   +   
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 276
             +V+ GCG    G +        L+GLG G +S  S L    +  +SFS C      D 
Sbjct: 308 VENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQL--QSIYGHSFSYCLVDRNSDT 362

Query: 277 DDSGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGSSCL--------- 321
             S ++ FG+            TSF+      +   Y +G+++  +    L         
Sbjct: 363 SVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHL 422

Query: 322 -KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKSSSQRLP 379
            K+     I+DSG++ T+  +  YE I   F +++       EG+ P K CY  S     
Sbjct: 423 SKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKG-YELVEGFPPLKPCYNVSGIEKM 481

Query: 380 KLPSVKLM--------FPQNNSFVVNNPVFV 402
           +LP   ++        FP  N F+   P  V
Sbjct: 482 ELPDFGILFSDGAMWDFPVENYFIQIEPDLV 512


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 132/323 (40%), Gaps = 59/323 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +  GTP  +    +D GSD++W PC         +   +S    +  + P  SS+SK L 
Sbjct: 71  LSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLG 130

Query: 167 C----------SHRLCDLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           C          S+  CD      SC N  Q CP  M +Y   T+  G+ + + LHL S  
Sbjct: 131 CKNPKCSWIHHSNINCDQDCSIKSCLN--QTCPPYMIFYGSGTTG-GVALSETLHLHS-- 185

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                   + + ++GC +  S        P G+ G G G  S+PS L          S  
Sbjct: 186 ------LSKPNFLVGCSVFSSH------QPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233

Query: 274 FDKD---DSGRIFFGDQGPATQQSTSFL----ASNGKY-------ITYIIGVETCCIGSS 319
           FD D    S  +   +Q  + +++ + +      N K        + Y +G+    +G  
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293

Query: 320 CLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND--TITSFE-GY 365
            +K   +K            I+DSG++FTF+ +E +E ++ EF RQ+ D   +   E   
Sbjct: 294 HVK-VPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAI 352

Query: 366 PWKCCYKSSSQRLPKLPSVKLMF 388
             + C+  S  +    P ++L F
Sbjct: 353 GLRPCFNVSDAKTVSFPELRLYF 375


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 94/388 (24%), Positives = 150/388 (38%), Gaps = 45/388 (11%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
           M+   LT++  V  +L ++S +  + FS  LI R S     +    N   T     KS  
Sbjct: 1   MHHFVLTLFFLVSTMLVDASKS-LMGFSIDLIPRHS----PISPLYNSQMTQTELVKSAA 55

Query: 61  YYQVLLSSDVQKQKMKTGPQFQML--FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLV 118
              +  S  V      + P   ++   P  G   M               +GTP+V  L 
Sbjct: 56  LRSITRSKRVNFIGQISPPLSPIITPIPDHGEYLMRF------------SLGTPSVERLA 103

Query: 119 ALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--- 174
             D GSDL W+ C  C  C P  A  ++          P+ SST   + C  + C L   
Sbjct: 104 IFDTGSDLSWLQCTPCKTCYPQEAPLFD----------PTQSSTYVDVPCESQPCTLFPQ 153

Query: 175 -GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 233
               C + KQ C Y   Y T+ + + G L  D +   S G      +   SV  GC    
Sbjct: 154 NQRECGSSKQ-CIYLHQYGTD-SFTIGRLGYDTISFSSTGMGQGGATFPKSV-FGCAFYS 210

Query: 234 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFFGDQGPA 290
           +  +      +G +GLG G +S+ S L     I + FS C   F    +G++ FG   P 
Sbjct: 211 NFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGKLKFGSMAPT 268

Query: 291 TQ-QSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKAIVDSGSSFTFLPKEVYETI 347
            +  ST F+ +      Y++ +E   +G   +   Q     I+DS    T L + +Y   
Sbjct: 269 NEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHLEQGIYTDF 328

Query: 348 AAEFDRQVNDTITSFEGYPWKCCYKSSS 375
            +     +N  +      P++ C ++ +
Sbjct: 329 ISSVKEAINVEVAEDAPTPFEYCVRNPT 356


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 87/342 (25%), Positives = 128/342 (37%), Gaps = 63/342 (18%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G  + T I +GTP   F V  D GSDL+WI C  C  C       +N  D     + P  
Sbjct: 37  GGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQAC-------FNQKDP---IFDPEG 86

Query: 159 SSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLIS--GG 213
           SS+   +SC   LCD       P++ C    DY   Y + + + G L  + + L S  G 
Sbjct: 87  SSSYTTMSCGDTLCD-----SLPRKSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGE 141

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
             A KN     +  GCG    G + D     GL+GLG G +S  S L    L  + FS C
Sbjct: 142 KLAAKN-----IAFGCGHLNRGSFNDA---SGLVGLGRGNLSFVSQLGD--LFGHKFSYC 191

Query: 274 F-----DKDDSGRIFFGDQGPATQQS-------TSFLASNGKYITYIIGVETCCIGSSCL 321
                     +  +FFGD+  +           T  + +      Y + ++   I    L
Sbjct: 192 LVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRAL 251

Query: 322 K----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
           +            S   I DSG++ T LP   Y+ +      +++             CY
Sbjct: 252 RIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCY 311

Query: 372 KSSSQRLP---KLPSVKLMF-------PQNNSFVVNNPVFVI 403
             S  +     K+P++   F       P  N F+  N    I
Sbjct: 312 DVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTI 353


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 83/311 (26%), Positives = 125/311 (40%), Gaps = 32/311 (10%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           +  GTP  +  V  D GS++ WI C    V C P     ++          P+ SST ++
Sbjct: 20  VGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFD----------PTLSSTYRN 69

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           +SC+   C   +S       C Y +  Y + +S+ G L  +   L +G  N   N     
Sbjct: 70  ISCTSAACTGLSSRGCSGSTCVYGVT-YGDGSSTVGFLATETFTLAAG--NVFNN----- 121

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
            I GCG     G   G A  GLIGLG    S+ S LA +  + N FS C     S   + 
Sbjct: 122 FIFGCGQNNQ-GLFTGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYL 176

Query: 285 GDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSS--CLKQTSFKA---IVDSGSSFTF 338
               P      + + +N +  T Y I +    +G +   L  T F++   I+DSG+  T 
Sbjct: 177 NIGNPLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITR 236

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNN 398
           LP   Y  +   F   +     +        CY  S       P++KL +   +  +   
Sbjct: 237 LPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGA 296

Query: 399 PVF-VIYGTQV 408
            VF VI  +QV
Sbjct: 297 GVFYVISSSQV 307


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 82/296 (27%), Positives = 127/296 (42%), Gaps = 49/296 (16%)

Query: 96  GNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA-SYYNSLDRDLNE 153
           G D G  +Y     +GTP ++  + +D GSDL W     V+C P +A S Y   D     
Sbjct: 129 GYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSW-----VQCKPCAAPSCYRQKD---PL 180

Query: 154 YSPSASSTSKHLSCSHRLC-DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
           + P+ SS+   + C    C  LG   ++C   +  C Y +  Y + ++++G+   D L L
Sbjct: 181 FDPAQSSSYAAVPCGRSACAGLGIYASACSAAQ--CGYVVS-YGDGSNTTGVYSSDTLTL 237

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRN 268
            +       N+     + GCG  QSGG   G+  DGL+G G  +   PSL+ + AG    
Sbjct: 238 AA-------NATVQGFLFGCGHAQSGGLFTGI--DGLLGFGREQ---PSLVQQTAGAYGG 285

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
            FS C     S   +    GP+       +T  L S      Y++ +    +G   L   
Sbjct: 286 VFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVP 345

Query: 323 QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVN-----------DTITSFEGY 365
            ++F A  +VD+G+  T LP   Y  + + F   +            DT  SF GY
Sbjct: 346 ASAFAAGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGY 401


>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 431

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 69/252 (27%), Positives = 116/252 (46%), Gaps = 20/252 (7%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           ++IG P   + + +D GSDL W+ CD  C  C+       + L R  N++ P        
Sbjct: 75  LNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETP----HPLHRPSNDFVPCRDPLCAS 130

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           L  +        +C++P Q C Y ++ Y +  S+ G+L+ D+  L S     LK      
Sbjct: 131 LQPTEDY-----NCEHPDQ-CDYEIN-YADQYSTYGVLLNDVYLLNSSNGVQLK----VR 179

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
           + +GCG  Q          DGL+GLG G+ S+ S L   GL+RN    C      G IFF
Sbjct: 180 MALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCLSSQGGGYIFF 239

Query: 285 GDQGPATQQSTSFLAS-NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEV 343
           G+   + + + + ++S + K+  Y  G      G       S  A+ D+GSS+T+     
Sbjct: 240 GNAYDSARVTWTPISSVDSKH--YSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHA 297

Query: 344 YETIAAEFDRQV 355
           Y+ + +  ++++
Sbjct: 298 YQALLSWLNKEL 309


>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
          Length = 423

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 112/263 (42%), Gaps = 21/263 (7%)

Query: 96  GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   H+   ++IG P   + + +D GS L W+ CD  C+ C    + +Y  L     
Sbjct: 30  GNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFV 89

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGTSCQN-----PKQPCPYTMDYYTENTSSSGLLVEDI 206
            +          + C+ + C DL    +      PK  C Y + Y     SS G+L+ D 
Sbjct: 90  PHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDS 147

Query: 207 LHL-ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAG 264
             L  S G N        S+  GCG  Q     +   P +G++GLG G++++ S L   G
Sbjct: 148 FSLPASNGTNP------TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQG 201

Query: 265 LI-RNSFSMCFDKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 322
           +I ++    C      G +FFGD + P +  + S +    K+ +   G       S  + 
Sbjct: 202 VITKHVLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPIS 261

Query: 323 QTSFKAIVDSGSSFTFLPKEVYE 345
               + I DSG+++T+   + Y 
Sbjct: 262 AAPMEVIFDSGATYTYFALQPYH 284


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 90/320 (28%), Positives = 134/320 (41%), Gaps = 47/320 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I +GTP   + V  D GSD  W     V+C P     Y   ++    + P+ SST  ++S
Sbjct: 190 IGLGTPAGRYTVVFDTGSDTTW-----VQCEPCVVVCYEQQEK---LFDPARSSTDANIS 241

Query: 167 CSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C+   C DL T  C      C Y +  Y + + S G    D L L S   +A+K      
Sbjct: 242 CAAPACSDLYTKGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLSS--YDAIKG----- 291

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF 283
              GCG +  G + +     GL+GLG G+ S+P     K G +   F+ CF    SG  +
Sbjct: 292 FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAHCFPARSSGTGY 345

Query: 284 FGDQGP------ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDS 332
             D GP      +T+ +T  L  NG    Y +G+    +G   L       T+   IVDS
Sbjct: 346 L-DFGPGSSPAVSTKLTTPMLVDNGLTF-YYVGLTGIRVGGKLLSIPPSVFTTAGTIVDS 403

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMF 388
           G+  T LP   Y ++ + F   +      ++  P       CY  +      +P+V L+F
Sbjct: 404 GTVITRLPPAAYSSLRSAFASAI--AARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLF 461

Query: 389 PQNNSFVVNNPVFVIYGTQV 408
               S  V+    +IY   V
Sbjct: 462 QGGASLDVDA-SGIIYAASV 480


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 96/361 (26%), Positives = 141/361 (39%), Gaps = 96/361 (26%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNSLDRDLNE---YSPSA 158
           ++IGTP  +  V LD GSDL W+PC     DC+ C       Y+  + DL     +SP  
Sbjct: 87  LNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIEC-------YDLKNNDLKSPSVFSPLH 139

Query: 159 SSTSKHLSCSHRLCDLGTSCQNP-------------------KQPCPYTMDYYTENTSSS 199
           SSTS   SC+   C    S  NP                    +PCP     Y E    S
Sbjct: 140 SSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLIS 199

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G+L  DIL          +         GC    +  Y +   P G+ G G G +S+PS 
Sbjct: 200 GILTRDILK--------ARTRDVPRFSFGC---VTSTYRE---PIGIAGFGRGLLSLPSQ 245

Query: 260 LAKAGLIRNSFSMCF-------DKDDSGRIFFGDQGPATQ-----QSTSFLASNGKYITY 307
           L   G +   FS CF       + + S  +  G    +       Q T  L +     +Y
Sbjct: 246 L---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSY 302

Query: 308 IIGVETCCIGSS--------CLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQV 355
            IG+E+  IG++         L+Q   +     +VDSG+++T LP+  Y         Q+
Sbjct: 303 YIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYS--------QL 354

Query: 356 NDTITSFEGYP----------WKCCYK--SSSQRLPKLPS-VKLMFPQNNSFVVNNPVFV 402
             T+ S   YP          +  CYK    +  L  L + V ++FP      +NN   +
Sbjct: 355 LTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLL 414

Query: 403 I 403
           +
Sbjct: 415 L 415


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 89/323 (27%), Positives = 133/323 (41%), Gaps = 60/323 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +   +++G  N+S +V  D GSDL W     V+C P  + Y    ++    Y PS SS+ 
Sbjct: 87  YIVTVELGGKNMSLIV--DTGSDLTW-----VQCQPCRSCY----NQQGPLYDPSVSSSY 135

Query: 163 KHLSCSHRLC-DL--GTSCQNP--------KQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
           K + C+   C DL   TS   P        K PC Y + Y   + +   L  E IL    
Sbjct: 136 KTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL--- 192

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
            GD  L+N      + GCG    G +       GL       +S+ S   K       FS
Sbjct: 193 -GDTKLEN-----FVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFS 241

Query: 272 MCF---DKDDSGRIFFGDQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLKQ 323
            C    +   SG + FG+       STS     L  N +  + YI+ +    IG   LK 
Sbjct: 242 YCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKS 301

Query: 324 TSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSS 374
           +SF    ++DSG+  T LP  +Y+ +  EF +Q       F G+P          C+  +
Sbjct: 302 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQ-------FSGFPTAPGYSILDTCFNLT 354

Query: 375 SQRLPKLPSVKLMFPQNNSFVVN 397
           S     +P +K++F  N    V+
Sbjct: 355 SYEDISIPIIKMIFQGNAELEVD 377


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 130/297 (43%), Gaps = 51/297 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP  + L+A+D  +D  WIPC  CV C   S++ +N++           S+T K + C
Sbjct: 102 IGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK----------STTFKTVGC 148

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
               C    + +     C + M Y + + +++  L +D++ L +       +S+  S   
Sbjct: 149 EAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLAT-------DSI-PSYTF 198

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRIF 283
           GC  + +G     + P GL+GLG G +S+  L     L +++FS C       + SG + 
Sbjct: 199 GCLTEATG---SSIPPQGLLGLGRGPMSL--LSQTQNLYQSTFSYCLPSFRSLNFSGSLR 253

Query: 284 FGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVDS 332
            G  G   +  T+ L  N +     Y+  +   +G     I  S L     T    I DS
Sbjct: 254 LGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDS 313

Query: 333 GSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           G+ FT L    Y  +   F ++V N T+TS  G+    CY S        P++  MF
Sbjct: 314 GTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGGF--DTCYTSPI----VAPTITFMF 364


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 87/327 (26%), Positives = 134/327 (40%), Gaps = 60/327 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP +S+   +D GSDL+W  C  CV C   S   ++          PS+SST   +
Sbjct: 104 VAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFD----------PSSSSTYATV 153

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            CS  LC DL TS       C YT   Y + +S+ G+L  +   L        +      
Sbjct: 154 PCSSALCSDLPTSTCTSASKCGYTYT-YGDASSTQGVLASETFTL------GKEKKKLPG 206

Query: 225 VIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDS 279
           V  GCG    G G+  G    GL+GLG G +   SL+++ GL  + FS C     D D  
Sbjct: 207 VAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--DKFSYCLTSLDDGDGK 258

Query: 280 GRIFFGDQGPATQ--------QSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK-- 327
             +  G    A          Q+T  + +  +   Y + +    +GS+   L  ++F   
Sbjct: 259 SPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQ 318

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ----- 376
                  IVDSG+S T+L  + Y  +   F  Q+              C++  ++     
Sbjct: 319 DDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDEV 378

Query: 377 RLPKL-----PSVKLMFPQNNSFVVNN 398
           ++PKL         L  P  N  V+++
Sbjct: 379 QVPKLVLHFDGGADLDLPAENYMVLDS 405


>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
 gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
 gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
          Length = 410

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 71/274 (25%), Positives = 116/274 (42%), Gaps = 34/274 (12%)

Query: 96  GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   H+   ++IG P  S+ + +D GS L W+ CD  C  C  +    Y    + L 
Sbjct: 30  GNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL- 88

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQ-----PCPYTMDYYTENTSSSGLLVEDI 206
                       ++C+  LC DL T    PK+      C Y + Y   ++SS G+LV D 
Sbjct: 89  ------------VTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYV--DSSSMGVLVIDR 134

Query: 207 LHL-ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAG 264
             L  S G N        ++  GCG  Q     +   P D ++GL  G++++ S L   G
Sbjct: 135 FSLSASNGTNP------TTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQG 188

Query: 265 LI-RNSFSMCFDKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK 322
           +I ++    C      G +FFGD Q P +  + + +    KY +   G       S  + 
Sbjct: 189 VITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAIS 248

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
                 I DSG+++T+   + Y+   +     +N
Sbjct: 249 AAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLN 282


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 132/319 (41%), Gaps = 60/319 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +++G  N+S +V  D GSDL W     V+C P  + Y    ++    Y PS SS+ K + 
Sbjct: 139 VELGGKNMSLIV--DTGSDLTW-----VQCQPCRSCY----NQQGPLYDPSVSSSYKTVF 187

Query: 167 CSHRLC-DL--GTSCQNP--------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           C+   C DL   TS   P        K PC Y + Y   + +   L  E IL     GD 
Sbjct: 188 CNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDT 243

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 274
            L+N      + GCG    G +       GL       +S+ S   K       FS C  
Sbjct: 244 KLEN-----FVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLP 293

Query: 275 --DKDDSGRIFFGDQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSFK 327
             +   SG + FG+       STS     L  N +  + YI+ +    IG   LK +SF 
Sbjct: 294 SLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFG 353

Query: 328 A--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRL 378
              ++DSG+  T LP  +Y+ +  EF +Q       F G+P          C+  +S   
Sbjct: 354 RGILIDSGTVITRLPPSIYKAVKIEFLKQ-------FSGFPTAPGYSILDTCFNLTSYED 406

Query: 379 PKLPSVKLMFPQNNSFVVN 397
             +P +K++F  N    V+
Sbjct: 407 ISIPIIKMIFQGNAELEVD 425


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 87/321 (27%), Positives = 131/321 (40%), Gaps = 46/321 (14%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P   + Y   ++   
Sbjct: 169 SPGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVACYEQREK--- 220

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P++SST  ++SC+   C DL  S C      C Y +  Y + + S G    D L L 
Sbjct: 221 LFDPASSSTYANVSCAAPACSDLDVSGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 277

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P  +   G     F
Sbjct: 278 S--YDAVKG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVF 325

Query: 271 SMCFDKDDSGRIF--FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSF 326
           + C     +G  +  FG   P    +T  L  NG    Y +G+    +G   L    + F
Sbjct: 326 AHCLPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTF-YYVGMTGIRVGGRLLPIAPSVF 384

Query: 327 KA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQ 376
            A   IVDSG+  T LP   Y ++     R       +  GY           CY  +  
Sbjct: 385 AAAGTIVDSGTVITRLPPAAYSSL-----RSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 439

Query: 377 RLPKLPSVKLMFPQNNSFVVN 397
               +P+V L+F    +  V+
Sbjct: 440 SQVAIPTVSLLFQGGAALDVD 460


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 84/380 (22%), Positives = 153/380 (40%), Gaps = 59/380 (15%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
              K + R  + +  +G  +N ++    AK+S +  +V+ ++ + +  M++      +  
Sbjct: 65  MQAKDLFRHEQMITMMGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSALNIAHV-- 122

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR----------- 135
                          ++   + IGTP + + + LD  +DL WI C   R           
Sbjct: 123 --------------GMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQST 168

Query: 136 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNP--KQPCPYTMDY 191
              +S     + +   N Y P+ SS+ + + CS + C +    +CQ+P   + C Y    
Sbjct: 169 GQTMSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQK 227

Query: 192 YTENTSSSGLL-VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLG 250
             + T + G+   E     +S G    + +    +I+GC + ++GG +D  A DG++ LG
Sbjct: 228 TQDGTVTIGIYGKEKATVTVSDG----RMAKLPGLILGCSVLEAGGSVD--AHDGVLSLG 281

Query: 251 LGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ----GPATQQSTSFL--- 298
            G++S     AK       FS C       +D S  + FG      GP T ++       
Sbjct: 282 NGDMSFAVHAAKR--FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVD 339

Query: 299 ---ASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFD 352
              A   +    ++G E   I         F     I+D+ +S T L  E Y  + A  D
Sbjct: 340 VKPAYGAQVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALD 399

Query: 353 RQVNDTITSFEGYPWKCCYK 372
           R ++     +E   ++ CYK
Sbjct: 400 RHLSHLPRVYELEGFEYCYK 419


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 132/319 (41%), Gaps = 60/319 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +++G  N+S +V  D GSDL W     V+C P  + Y    ++    Y PS SS+ K + 
Sbjct: 139 VELGGKNMSLIV--DTGSDLTW-----VQCQPCRSCY----NQQGPLYDPSVSSSYKTVF 187

Query: 167 CSHRLC-DL--GTSCQNP--------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           C+   C DL   TS   P        K PC Y + Y   + +   L  E IL     GD 
Sbjct: 188 CNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDT 243

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 274
            L+N      + GCG    G +       GL       +S+ S   K       FS C  
Sbjct: 244 KLEN-----FVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKT--FNGVFSYCLP 293

Query: 275 --DKDDSGRIFFGDQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSFK 327
             +   SG + FG+       STS     L  N +  + YI+ +    IG   LK +SF 
Sbjct: 294 SLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFG 353

Query: 328 A--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRL 378
              ++DSG+  T LP  +Y+ +  EF +Q       F G+P          C+  +S   
Sbjct: 354 RGILIDSGTVITRLPPSIYKAVKIEFLKQ-------FSGFPTAPGYSILDTCFNLTSYED 406

Query: 379 PKLPSVKLMFPQNNSFVVN 397
             +P +K++F  N    V+
Sbjct: 407 ISIPIIKMIFQGNAELEVD 425


>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 118/289 (40%), Gaps = 39/289 (13%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCD-----CVRCAPLSASYYNSLDRDLNEYSPSA 158
           Y  ++IG P   + + +D GS+L W+ C      C  C P     Y         Y+P+ 
Sbjct: 39  YATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPY---------YTPAD 89

Query: 159 SSTSKHLSCSHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
                 + C   LC         +    +N    C Y + Y T    S G L  DI+  +
Sbjct: 90  GKLK--VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIIS-V 144

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLIR-N 268
           +G D       +  +  GCG KQ        +P +G++GLG+G+    + L    +I+ N
Sbjct: 145 NGRD-------KKRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKEN 197

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-QTSFK 327
               C      G ++ GD  P T+  T +         Y  G+    I    ++   +F+
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFE 256

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITSFEGYPWKCCYKSSS 375
           A+ DSGS++T +P ++Y  I ++     ++ ++   +G     C+K   
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKK 305


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 87/321 (27%), Positives = 131/321 (40%), Gaps = 46/321 (14%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P   + Y   ++   
Sbjct: 173 SPGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVACYEQREK--- 224

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P++SST  ++SC+   C DL  S C      C Y +  Y + + S G    D L L 
Sbjct: 225 LFDPASSSTYANVSCAAPACSDLDVSGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 281

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P  +   G     F
Sbjct: 282 S--YDAVKG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVF 329

Query: 271 SMCFDKDDSGRIF--FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSF 326
           + C     +G  +  FG   P    +T  L  NG    Y +G+    +G   L    + F
Sbjct: 330 AHCLPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTF-YYVGMTGIRVGGRLLPIAPSVF 388

Query: 327 KA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQ 376
            A   IVDSG+  T LP   Y ++     R       +  GY           CY  +  
Sbjct: 389 AAAGTIVDSGTVITRLPPAAYSSL-----RSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 443

Query: 377 RLPKLPSVKLMFPQNNSFVVN 397
               +P+V L+F    +  V+
Sbjct: 444 SQVAIPTVSLLFQGGAALDVD 464


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 87/313 (27%), Positives = 125/313 (39%), Gaps = 51/313 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP V  L+ALD  SDL W+ C  C RC P S   ++          P  S++   +
Sbjct: 138 IAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFD----------PRHSTSYGEM 187

Query: 166 SCSHRLCD-LGTSCQN--PKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKN 219
           +     C  LG S      +  C YT+ Y   +   ++S G LVE+ L    G       
Sbjct: 188 NYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG------- 240

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
             QA + IGCG    G  L G    G++GLG G+IS+P  +A  G    SFS C     S
Sbjct: 241 VRQAYLSIGCGHDNKG--LFGAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDFIS 297

Query: 280 G------RIFFG----DQGPATQQSTSFLASNGK--YITYIIGVETCCIGSSCLKQTSFK 327
           G       + FG    D  P    + + L  N    Y   +IGV    +    + +   +
Sbjct: 298 GPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQ 357

Query: 328 ---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSS 375
                     I+DSG++ T L +  Y      F            G P   +  CY    
Sbjct: 358 LDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGG 417

Query: 376 QRLPKLPSVKLMF 388
           +   K+P+V + F
Sbjct: 418 RAGVKVPAVSMHF 430


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 87/321 (27%), Positives = 131/321 (40%), Gaps = 46/321 (14%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P   + Y   ++   
Sbjct: 170 SPGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVACYEQREK--- 221

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P++SST  ++SC+   C DL  S C      C Y +  Y + + S G    D L L 
Sbjct: 222 LFDPASSSTYANVSCAAPACSDLDVSGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 278

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P  +   G     F
Sbjct: 279 S--YDAVKG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVF 326

Query: 271 SMCFDKDDSGRIF--FGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSF 326
           + C     +G  +  FG   P    +T  L  NG    Y +G+    +G   L    + F
Sbjct: 327 AHCLPPRSTGTGYLDFGAGSPPATTTTPMLTGNGPTF-YYVGMTGIRVGGRLLPIAPSVF 385

Query: 327 KA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQ 376
            A   IVDSG+  T LP   Y ++     R       +  GY           CY  +  
Sbjct: 386 AAAGTIVDSGTVITRLPPAAYSSL-----RSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 440

Query: 377 RLPKLPSVKLMFPQNNSFVVN 397
               +P+V L+F    +  V+
Sbjct: 441 SQVAIPTVSLLFQGGAALDVD 461


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/303 (26%), Positives = 125/303 (41%), Gaps = 51/303 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP  S    +D GSDL+W  C+ C +C       +N          P  SS+   L
Sbjct: 100 VAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFN----------PQDSSSFSTL 149

Query: 166 SCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            C  + C DL + SC N    C YT  Y  + +S+ G +  +            + S   
Sbjct: 150 PCESQYCQDLPSESCYND---CQYTYGY-GDGSSTQGYMATETF--------TFETSSVP 197

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG-- 280
           ++  GCG    G G  +G    GLIG+G G +S+PS L         FS C     S   
Sbjct: 198 NIAFGCGEDNQGFGQGNGA---GLIGMGWGPLSLPSQLGVG-----QFSYCMTSSGSSSP 249

Query: 281 -RIFFGDQG---PATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK------- 327
             +  G      P    ST+ + S+     Y I ++   +G   L    ++F+       
Sbjct: 250 STLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTG 309

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLPKLPSVK 385
             I+DSG++ T+LP++ Y  +A  F  Q+N +           C++  S     ++P + 
Sbjct: 310 GMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEIS 369

Query: 386 LMF 388
           + F
Sbjct: 370 MQF 372


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 115/245 (46%), Gaps = 32/245 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P +SST + + C
Sbjct: 90  IGTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPESSSTYQPVKC 139

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +     +  +C + +  C Y   Y  E ++SSG+L ED   LIS G+ +     +A  + 
Sbjct: 140 T-----IDCNCDSDRMQCVYERQY-AEMSTSSGVLGED---LISFGNQSELAPQRA--VF 188

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC   ++G      A DG++GLG G++S+   L    +I +SFS+C+   D   G +  G
Sbjct: 189 GCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLG 247

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFL 339
              P +  + ++ +   +   Y I ++   +    L   +         ++DSG+++ +L
Sbjct: 248 GISPPSDMAFAY-SDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 306

Query: 340 PKEVY 344
           P+  +
Sbjct: 307 PEAAF 311


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 155/367 (42%), Gaps = 64/367 (17%)

Query: 6   LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
           LT+ L     +   S A +  FS +LIHR S +      ++N+             YQ  
Sbjct: 7   LTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENK-------------YQHF 53

Query: 66  LSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSD 125
           +  D  ++ +     F     +   ++  + +  G+L      +GTP        D GSD
Sbjct: 54  V--DAARRSINRANHFFKDSDTSTPESTVIPDRGGYL--MTYSVGTPPTKIYGIADTGSD 109

Query: 126 LLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPK 182
           ++W+ C+ C +C   +   +N          PS SS+ K++ C  +LC     TSC + +
Sbjct: 110 IVWLQCEPCEQCYNQTTPIFN----------PSKSSSYKNIPCLSKLCHSVRDTSCSD-Q 158

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 242
             C Y +  Y +++ S G L  D L L S   + +        +IGCG   +G +  G A
Sbjct: 159 NSCQYKIS-YGDSSHSQGDLSVDTLSLESTSGSPVS---FPKTVIGCGTDNAGTF--GGA 212

Query: 243 PDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF------DKDDSGRIFFGDQGPATQQ--- 293
             G++GLG G +S+ + L  +  I   FS C       + + S  + FGD    +     
Sbjct: 213 SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVV 270

Query: 294 STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA-----------IVDSGSSFTFLPKE 342
           ST  +  +  +  Y + ++   +G+   K+  F             I+DSG++ T +P +
Sbjct: 271 STPLIKKDPVF--YFLTLQAFSVGN---KRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSD 325

Query: 343 VYETIAA 349
           VY  + +
Sbjct: 326 VYTNLES 332


>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
          Length = 775

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 112/264 (42%), Gaps = 31/264 (11%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +  ++IG P  S+ + +D GS L W+ CD  C  C  +    Y    + L          
Sbjct: 404 FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL---------- 453

Query: 162 SKHLSCSHRLC-DLGTSCQNPKQ-----PCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
              ++C+  LC DL T    PK+      C Y + Y   ++SS G+LV D   L     +
Sbjct: 454 ---VTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYV--DSSSMGVLVIDRFSL-----S 503

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RNSFSMC 273
           A   +   ++  GCG  Q     +   P D ++GL  G++++ S L   G+I ++    C
Sbjct: 504 ASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 563

Query: 274 FDKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDS 332
                 G +FFGD Q P +  + + +    KY +   G       S  +       I DS
Sbjct: 564 ISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDS 623

Query: 333 GSSFTFLPKEVYETIAAEFDRQVN 356
           G+++T+   + Y+   +     +N
Sbjct: 624 GATYTYFAAQPYQATLSVVKSTLN 647


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 117/294 (39%), Gaps = 55/294 (18%)

Query: 96  GNDFGWLHY-TWIDIGTPNVSFLV-ALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE 153
           G+D G   Y   + IGTP    +V  LD GSDL+W  C C  C           D+ +  
Sbjct: 86  GSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTVC----------FDQPVPV 135

Query: 154 YSPSASSTSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
           +  S S T   + CS  LC        + C    + C Y   Y  +++ ++G + ED   
Sbjct: 136 FRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYM-DHSITTGKMAEDTF- 193

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
                D A   +   ++  GCGM   G +    +  G+ G G G +S+PS L     +R 
Sbjct: 194 TFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQS--GIAGFGTGPLSLPSQLK----VRR 247

Query: 269 SFSMCFDKDDSGR---IFFGDQ---------GPATQQSTSFL-----ASNGKYITYIIGV 311
            FS CF   +  R   +  G +         GP   QST F      A  G    Y + +
Sbjct: 248 -FSYCFTAMEESRVSPVILGGEPENIEAHATGPI--QSTPFAPGPAGAPVGSQPFYFLSL 304

Query: 312 ETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
               +G + L             S    +DSG++ TF P+ V+ ++   F  QV
Sbjct: 305 RGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQV 358


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 134/331 (40%), Gaps = 51/331 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + LD GSDL WI C  C  C   +  YY+          P  SS+ K+++C
Sbjct: 201 VGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYD----------PKDSSSFKNITC 250

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
               C L +S      C+   Q CPY   Y   + ++    +E     ++  +   +  +
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD-- 278
             +V+ GCG    G +        L+GLG G +S  + L    L  +SFS C  D++   
Sbjct: 311 VENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL--QSLYGHSFSYCLVDRNSNS 365

Query: 279 --SGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGSSCLK-------- 322
             S ++ FG+            TSF+      +   Y + +++  +G   LK        
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHL 425

Query: 323 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
             Q     I+DSG++ T+  +  YE I   F R++          P K CY  S     +
Sbjct: 426 SAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKME 485

Query: 381 LPSVKLM--------FPQNNSFVVNNPVFVI 403
           LP   ++        FP  N F+   P  V+
Sbjct: 486 LPEFAILFADGAMWDFPVENYFIQIEPEDVV 516


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 113/245 (46%), Gaps = 32/245 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C +C                ++ P +SST + + C
Sbjct: 118 IGTPPQMFALIVDTGSTVTYVPCSTCEQCG----------RHQDPKFQPESSSTYQPVKC 167

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +     +  +C   +  C Y   Y  E ++SSG+L ED++   +  + A + +V      
Sbjct: 168 T-----IDCNCDGDRMQCVYERQY-AEMSTSSGVLGEDVISFGNQSELAPQRAV-----F 216

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD--SGRIFFG 285
           GC   ++G      A DG++GLG G++S+   L    +I +SFS+C+   D   G +  G
Sbjct: 217 GCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLG 275

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFL 339
              P +  + ++ +   +   Y I ++   +    L   +         ++DSG+++ +L
Sbjct: 276 GISPPSDMTFAY-SDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 334

Query: 340 PKEVY 344
           P+  +
Sbjct: 335 PEAAF 339


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 114/453 (25%), Positives = 183/453 (40%), Gaps = 88/453 (19%)

Query: 3   RISLTIYLAVFWLLTESSGAET-VMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEY 61
           + +L ++L   W+  +S+  E+ V  +T+ + R     K +   KN+NA S         
Sbjct: 97  KQTLKLHLKHRWINRDSTHKESFVASTTRDLTRIQTLHKRILEKKNQNALS--------- 147

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQG-----SKTMSLGNDFGWLHYTWID--IGTPNV 114
               L+ +  KQ +         +P+ G       T+  G   G   Y ++D  IGTP  
Sbjct: 148 ---RLNKEEPKQPVVAPAASPESYPANGLSGQLMATLESGVSLGSGEY-FMDVFIGTPPR 203

Query: 115 SFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
            F + LD GSDL WI C  C  C   +  YY+          P  SS+ K++ C    C 
Sbjct: 204 HFSLILDTGSDLNWIQCVPCYDCFVQNGPYYD----------PKESSSFKNIGCHDPRCH 253

Query: 174 LGTS------CQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           L +S      C+   Q CPY   Y  + NT+    L    ++L S    +    V+ +V+
Sbjct: 254 LVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVE-NVM 312

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGR 281
            GCG    G +        L+GLG G +S  S L    L  +SFS C      D + S +
Sbjct: 313 FGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSK 367

Query: 282 IFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGSSCLK----------QTS 325
           + FG+            TS +A     +   Y + +++  +G   LK          + +
Sbjct: 368 LIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGA 427

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRL 378
              IVDSG++ ++  +  YE I   F ++V       +GYP          CY  S    
Sbjct: 428 GGTIVDSGTTLSYFAEPSYEIIKDAFVKKV-------KGYPVIKDFPILDPCYNVSGVEK 480

Query: 379 PKLPSVKLM--------FPQNNSFVVNNPVFVI 403
            +LP  +++        FP  N F+   P  ++
Sbjct: 481 MELPEFRILFEDGAVWNFPVENYFIKLEPEEIV 513


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 130/323 (40%), Gaps = 59/323 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP   F V +D GSDL W     V+C+P    Y     ++ + + P+ S++   L+
Sbjct: 7   VRLGTPERVFSVIVDTGSDLTW-----VQCSPCGTCY----SQNDSLFIPNTSTSFTKLA 57

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C   LC+        +  C Y    Y + + S+G  V D + +   G N  K  V  +  
Sbjct: 58  CGTELCNGLPYPMCNQTTCVYWYS-YGDGSLSTGDFVYDTITM--DGINGQKQQV-PNFA 113

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGR 281
            GCG    G +      DG++GLG G +S PS L    +    FS C          +  
Sbjct: 114 FGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSP 168

Query: 282 IFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--------A 328
           + FGD    T     +  L +N K  T Y + +    +G   L    T+F          
Sbjct: 169 LLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGT 228

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---------CCYKSSSQRLP 379
           I DSG++ T L  EV++ + A  +    D       YP K         C    +  +LP
Sbjct: 229 IFDSGTTVTQLAGEVHQEVLAAMNASTMD-------YPRKSDDSSGLDLCLGGFAEGQLP 281

Query: 380 KLPSVKLMF-------PQNNSFV 395
            +PS+   F       P +N F+
Sbjct: 282 TVPSMTFHFEGGDMELPPSNYFI 304


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 131/313 (41%), Gaps = 41/313 (13%)

Query: 102 LHY--TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 159
           LH+    +D+   N  F V  DAG+ ++ +  +      +  ++       L  +  S S
Sbjct: 123 LHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHALPYFDRSTS 182

Query: 160 STSKHLSCSHRLCD--LGTSCQN----PKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           ST    SC   LC   L  SC N    P Q C YT  YY + + ++GLL  D     +G 
Sbjct: 183 STLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTY-YYNDKSVTTGLLEVDKFTFGAGA 241

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                      V  GCG+  +G +       G+ G G G +S+PS L K G    +FS C
Sbjct: 242 S-------VPGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHC 287

Query: 274 FDKDDSGRI------FFGD---QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
           F   +  +          D    G    QST  + ++     Y + ++   +GS+ L   
Sbjct: 288 FTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVP 347

Query: 323 QTSFK-------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
           +++F         I+DSG+S T LP +VY+ +  EF  Q+   +          C+ + S
Sbjct: 348 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPS 407

Query: 376 QRLPKLPSVKLMF 388
           Q  P +P + L F
Sbjct: 408 QAKPDVPKLVLHF 420



 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 31/60 (51%)

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           I+DSG+S T LP +VY+ +  EF  Q+   +          C+ + SQ  P +P + L F
Sbjct: 66  IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHF 125


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/264 (28%), Positives = 115/264 (43%), Gaps = 41/264 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I IG+P V+ L+ +D  SDLLW+ C  C+ C   S          L  + PS S T ++ 
Sbjct: 89  ISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQS----------LPIFDPSRSYTHRNE 138

Query: 166 SCSHRLCDLGTSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           SC      + +   N K + C Y+M  Y + T S G+L +++L   +  D +   ++   
Sbjct: 139 SCRTSQYSMPSLRFNAKTRSCEYSMR-YMDGTGSKGILAKEMLMFNTIYDESSSAALH-D 196

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----- 279
           V+ GCG    G  L G    G++GLG GE    SL+ + G     FS CF   D      
Sbjct: 197 VVFGCGHDNYGEPLVGT---GILGLGYGEF---SLVHRFG---TKFSYCFGSLDDPSYPH 247

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKA- 328
             +  GD G      T+ L     +  Y + +E   +    L           QT     
Sbjct: 248 NVLVLGDDGANILGDTTPLEIYNGF--YYVTIEAISVDGIILPIDPWVFNRNHQTGLGGT 305

Query: 329 IVDSGSSFTFLPKEVYETIAAEFD 352
           I+D+G+S T L +E Y+ +  + +
Sbjct: 306 IIDTGNSLTSLVEEAYKPLKNKIE 329


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 133/315 (42%), Gaps = 51/315 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHL 165
           + +GTP  +  + LD GS+L W+ C   R      S        + E + P AS+T   +
Sbjct: 67  LAVGTPPQNVTMVLDTGSELSWLLCATGR----QGSAAAGAAAAMGESFRPRASATFAAV 122

Query: 166 SCSHRLC---DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
            C    C   DL    SC    + C  ++  Y + ++S G L  D+  +  G    L+++
Sbjct: 123 PCGSTQCSSRDLPAPPSCDGASRQCHVSLS-YADGSASDGALATDVFAV--GEAPPLRSA 179

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS 279
                  GC         DGVA  GL+G+  G +   S + +A   R  FS C  D+DD+
Sbjct: 180 ------FGCMSTAYDSSPDGVATAGLLGMNRGTL---SFVTQASTRR--FSYCISDRDDA 228

Query: 280 GRIFFG---------DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTS 325
           G +  G         +  P  Q +        +A + + +   +G +   I +S L    
Sbjct: 229 GVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDH 288

Query: 326 FKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK------CCYKSSSQ 376
             A   +VDSG+ FTFL  + Y  + AEF +Q    + + +   +        C++  + 
Sbjct: 289 TGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAG 348

Query: 377 RLP---KLPSVKLMF 388
           R P   +LP V L+F
Sbjct: 349 RPPPSARLPPVTLLF 363


>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
 gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
          Length = 433

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 76/306 (24%), Positives = 129/306 (42%), Gaps = 38/306 (12%)

Query: 60  EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVA 119
            + + +LS ++    M       ++FP  G+   +     G+ + T + IG P   + + 
Sbjct: 34  RWRKAVLSGEITSSMMINRAGSSLVFPLHGNVYPA-----GYYNVT-LSIGQPAKPYFLD 87

Query: 120 LDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--- 174
           +D GSDL W+ CD  C +C              +    P    ++  + C   LC     
Sbjct: 88  VDTGSDLTWLQCDAPCRQC--------------IEAPHPLYRPSNNLVICEDPLCASLQP 133

Query: 175 --GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQASVIIGCG 230
               +CQ+P Q C Y ++Y  +  SS G+LV+D+  L+  +G        +   + +GCG
Sbjct: 134 PGVHNCQDPDQ-CDYEVEY-ADGGSSLGVLVKDVFVLNFTNG------KRLNPLLALGCG 185

Query: 231 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPA 290
             Q  G  +    DG++GLG G  S+PS L+  GL+ N    C      G +FFG+    
Sbjct: 186 YDQLPGRSNHPL-DGILGLGRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGEDIYD 244

Query: 291 TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAE 350
           +   T    S      Y  G              +   + DSGSS+T+L  + Y+ +   
Sbjct: 245 SSGVTWTPMSRDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFS 304

Query: 351 FDRQVN 356
             R+++
Sbjct: 305 LKRELS 310


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/308 (24%), Positives = 126/308 (40%), Gaps = 51/308 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +G+P     + +D+GSD++W+ C  C +C       Y   D     + P+ASS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 218
              +SC   +C   +             DY   Y + + + G L  + L L   G  A++
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQ 236

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---D 275
                 V IGCG + SG +   V   GL+GLG G +S+   L   G     FS C     
Sbjct: 237 G-----VAIGCGHRNSGLF---VGAAGLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286

Query: 276 KDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQ 323
              +G +  G  +  P  ++++SF         Y +G+    +G   L          + 
Sbjct: 287 AGGAGSLVLGRTEAVPRGRRASSF---------YYVGLTGIGVGGERLPLQDSLFQLTED 337

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
            +   ++D+G++ T LP+E Y  +   FD  +     S        CY  S     ++P+
Sbjct: 338 GAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPT 397

Query: 384 VKLMFPQN 391
           V   F Q 
Sbjct: 398 VSFYFDQG 405


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 98/418 (23%), Positives = 165/418 (39%), Gaps = 61/418 (14%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVM------FSTKLIHRFSEEVKALGVSKNRNATSWP 54
           ++R +L        L + + G  TV+      FS    +   EE  AL  +     +S  
Sbjct: 45  LSRRALQGRQRRHHLRSRAVGGATVLELRHHSFSPAPANSREEEADALLSTDAARVSSLQ 104

Query: 55  AKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNV 114
            +   E+Y++  +S   +  +           S+    +S G     L+Y    +G    
Sbjct: 105 GR--IEHYRLTTTSSSAEVAVTA---------SKAQVPVSSGARLRTLNYVAT-VGLGGG 152

Query: 115 SFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD- 173
              V +D  S+L W     V+CAP  + +    D+    + PS+S +   + C    CD 
Sbjct: 153 EATVIVDTASELTW-----VQCAPCESCH----DQQGPLFDPSSSPSYAAVPCDSPSCDA 203

Query: 174 ----LGTSCQNPKQPC----PYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
               L T       PC    P    Y   Y + + S G+L  D L        +L   V 
Sbjct: 204 LQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRL--------SLAGEVI 255

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK--AGLIRNSFSMCFDKDDSG 280
              + GCG    G    G +  GL+GLG  ++S+ S       G+      +  + D SG
Sbjct: 256 DGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASG 313

Query: 281 RIFFGDQGPATQQST----SFLASNGKYIT----YIIGVETCCIGSSCLKQTSF--KAIV 330
            +  GD   A + ST    + + SN   +     Y++ +    +G   ++ T F  +AIV
Sbjct: 314 SLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAIV 373

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           DSG+  T L   VY  + AEF  Q+ +   +        C+  +  +  ++PS+ L+F
Sbjct: 374 DSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVF 431


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 85/316 (26%), Positives = 131/316 (41%), Gaps = 41/316 (12%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           SLG  F  L Y   I IGTP  +F V  D GSDL W     V+C P + S Y   +    
Sbjct: 116 SLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTW-----VQCKPCTDSCYQQQE---P 167

Query: 153 EYSPSASSTSKHLSCSHRLCDLG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
            + PS SST   + C    C +G     +C      C Y++  Y + + + G L ++   
Sbjct: 168 LFDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTT--CEYSVK-YGDQSVTRGNLAQEAFT 224

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYL---DGVAPDGLIGLGLGEISVPSLLAKAGL 265
           L      A      A V+ GC  + S G     + ++  GL+GLG G+ S+ S   + G 
Sbjct: 225 LSPSAPPA------AGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGN 277

Query: 266 IRNSFSMCFDKDDSGRIFFGDQGPATQQS----TSFLASNGK----YITYIIGVETCCIG 317
             + FS C     S   +      A  QS    T  +  N +    Y+  ++G+     G
Sbjct: 278 SGDVFSYCLPPRGSSAGYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVS--G 335

Query: 318 SSC-LKQTSF--KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY--PWKCCYK 372
           ++  +  ++F    ++DSG+  T +P   Y  +  EF R +       EG+      CY 
Sbjct: 336 AALPIDASAFYIGTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYD 395

Query: 373 SSSQRLPKLPSVKLMF 388
            +   +   P V L F
Sbjct: 396 VTGHDVVTAPPVALEF 411


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/366 (24%), Positives = 144/366 (39%), Gaps = 53/366 (14%)

Query: 29  TKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQ 88
           T L HR   +V    +          + K+   +Q LL   +++   +      ML    
Sbjct: 28  TALNHRHEAKVTGFQIMLEH----VDSGKNLTKFQ-LLERAIERGSRRLQRLEAMLNGPS 82

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSL 147
           G +T     D  +L    + IGTP   F   +D GSDL+W  C  C +C   S   +N  
Sbjct: 83  GVETSVYAGDGEYLMN--LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN-- 138

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
                   P  SS+   L CS +LC   +S       C YT  Y  + + + G +  + L
Sbjct: 139 --------PQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGY-GDGSETQGSMGTETL 189

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
                G  ++ N     +  GCG    G G  +G    GL+G+G G +S+PS L      
Sbjct: 190 TF---GSVSIPN-----ITFGCGENNQGFGQGNGA---GLVGMGRGPLSLPSQLDVT--- 235

Query: 267 RNSFSMCFDKDDSG---RIFFG---DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
              FS C     S     +  G   +   A   +T+ + S+     Y I +    +GS+ 
Sbjct: 236 --KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTR 293

Query: 321 L-----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
           L              +   I+DSG++ T+     Y+++  EF  Q+N  + +     +  
Sbjct: 294 LPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDL 353

Query: 370 CYKSSS 375
           C+++ S
Sbjct: 354 CFQTPS 359


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 133/337 (39%), Gaps = 60/337 (17%)

Query: 72  KQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC 131
           K K +  P   +   + G + +S+ N     +     +GTP  + LVA+D  +D  W+PC
Sbjct: 79  KPKNRANPPVPI---APGRQILSIPN-----YIARAGLGTPAQTLLVAIDPSNDAAWVPC 130

Query: 132 D-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMD 190
             C  CA  S S           +SP+ SST + + C    C      Q P   CP  + 
Sbjct: 131 SACAGCAASSPS-----------FSPTQSSTYRTVPCGSPQC-----AQVPSPSCPAGVG 174

Query: 191 YYTENTSSSGL---LVEDILHLISGGDN-ALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 246
                 SS G            + G D+ AL+N+V  S   GC    SG   + V P GL
Sbjct: 175 ------SSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTFGCLRVVSG---NSVPPQGL 225

Query: 247 IGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQG-PATQQSTSFLASN 301
           IG G G +S   L        + FS C       + SG +  G  G P   ++T  L + 
Sbjct: 226 IGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNP 283

Query: 302 GKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEF 351
            +   Y + +    +GS  ++           T    I+D+G+ FT L   VY  +   F
Sbjct: 284 HRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAF 343

Query: 352 DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
             +V   +    G  +  CY  +      +P+V  MF
Sbjct: 344 RGRVRTPVAPPLGG-FDTCYNVTV----SVPTVTFMF 375


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/311 (24%), Positives = 127/311 (40%), Gaps = 54/311 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG+P   F   +D GSDL+W  C  C+ C      Y+           P+ S++   L
Sbjct: 92  VGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFE----------PAKSTSYASL 141

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            CS  +C+   S    +  C Y   +Y ++ SS+G+L  +       G N+ + +V   V
Sbjct: 142 PCSSAMCNALYSPLCFQNACVY-QAFYGDSASSAGVLANETFTF---GTNSTRVAVP-RV 196

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFF 284
             GCG   +G   +G    G++G G G +   SL+++ G  R S+ +  F    + R++F
Sbjct: 197 SFGCGNMNAGTLFNG---SGMVGFGRGAL---SLVSQLGSPRFSYCLTSFMSPATSRLYF 250

Query: 285 G-----------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------- 322
           G             GP   QST F+ +      Y + +    +    L            
Sbjct: 251 GAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINET 308

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYK--SSSQR 377
             +   I+DSG++ TFL +  Y  +   F   V   +      P   +  C+K     +R
Sbjct: 309 DGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATPSDTFDTCFKWPPPPRR 366

Query: 378 LPKLPSVKLMF 388
           +  LP + L F
Sbjct: 367 MVTLPEMVLHF 377


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 142/344 (41%), Gaps = 56/344 (16%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P+     +  GN     +   I +GTP   + V  D GSD  W     V+C P     Y
Sbjct: 148 LPASSGSALGTGN-----YVVTIGLGTPAGRYTVVFDTGSDTTW-----VQCEPCVVVCY 197

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC-DL---GTSCQNPKQPCPYTMDYYTENTSSSG 200
              ++    + P+ SST  ++SC+   C DL   G S  +    C Y +  Y + + S G
Sbjct: 198 KQQEK---LFDPARSSTYANISCAAPACSDLYIKGCSGGH----CLYGVQ-YGDGSYSIG 249

Query: 201 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SL 259
               D L L S   +A+K         GCG +  G Y +     GL+GLG G+ S+P   
Sbjct: 250 FFAMDTLTLSS--YDAIKG-----FRFGCGERNEGLYGEAA---GLLGLGRGKTSLPVQA 299

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPAT------QQSTSFLASNGKYITYIIGVET 313
             K G +   F+ CF    SG  +  D GP +      + +T  L  NG    Y +G+  
Sbjct: 300 YDKYGGV---FAHCFPARSSGTGYL-DFGPGSLPAVSAKLTTPMLVDNGPTF-YYVGLTG 354

Query: 314 CCIGSSCLK--QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-- 366
             +G   L   Q+ F     IVDSG+  T LP   Y ++ + F   + +    ++  P  
Sbjct: 355 IRVGGKLLSIPQSVFTTSGTIVDSGTVITRLPPAAYSSLRSAFASAMAE--RGYKKAPAL 412

Query: 367 --WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 408
                CY  +      +P+V L+F    S  V+    +IY   V
Sbjct: 413 SLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHAS-GIIYAASV 455


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/384 (22%), Positives = 150/384 (39%), Gaps = 93/384 (24%)

Query: 54  PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP-SQGSKTMSLGNDFGWLHYTWIDIGTP 112
           P++   +    L+S+ + +      PQ   +F  S G  ++SL              GTP
Sbjct: 39  PSQDHLQKLNYLVSTSLARAHHLKNPQTTPVFSHSYGGYSISL------------SFGTP 86

Query: 113 NVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
             +    +D GS  +W PC     C  C         S    ++ + P  SS+SK + C 
Sbjct: 87  PQTLSFVMDTGSSFVWFPCTLRYLCNNC---------SFTSRISPFLPKHSSSSKIIGCK 137

Query: 169 HRLC-----------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           +  C           D   + +N  Q CP  +  Y   T+  G+ + + LHL        
Sbjct: 138 NPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTG-GVALSETLHL-------- 188

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--- 274
              +  + ++GC +  S        P G+ G G G  S+PS L   GL +  FS C    
Sbjct: 189 HGLIVPNFLVGCSVFSS------RQPAGIAGFGRGPSSLPSQL---GLTK--FSYCLLSH 237

Query: 275 ---DKDDSGRIFFGDQGPATQQSTSF----LASNGKY-------ITYIIGVETCCIGSSC 320
              D  +S  +    Q  + +++ +     L  N K        + Y + +    IG   
Sbjct: 238 KFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRS 297

Query: 321 LKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQVND-----TITSFEG 364
           +K   +K            I+DSG++FT++  E +E ++ EF  QV +      + +  G
Sbjct: 298 VK-IPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSG 356

Query: 365 YPWKCCYKSSSQRLPKLPSVKLMF 388
              K C+  S  +  +LP ++L F
Sbjct: 357 --LKPCFNVSGAKELELPQLRLHF 378


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/299 (27%), Positives = 132/299 (44%), Gaps = 39/299 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP    L   D GSDL+W      +C P    Y    ++D   + P +SST + +SCS
Sbjct: 98  LGTPAFDILAIADTGSDLIW-----TQCKPCDQCY----EQDAPLFDPKSSSTYRDISCS 148

Query: 169 HRLCDL---GTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            + CDL   G SC     + C Y+   Y + + +SG +  D + L   G  + +  +   
Sbjct: 149 TKQCDLLKEGASCSGEGNKTCHYSYS-YGDRSFTSGNVAADTITL---GSTSGRPVLLPK 204

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 279
            IIGCG    G + +  +  G++GLG G IS+ S L     I   FS C      +  +S
Sbjct: 205 AIIGCGHNNGGSFTEKGS--GIVGLGGGPISLISQLGST--IDGKFSYCLVPLSSNATNS 260

Query: 280 GRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF-----KAI 329
            ++ FG  G  +    QST  ++ +     Y + +E   +GS  +K   +SF       I
Sbjct: 261 SKLNFGSNGIVSGGGVQSTPLISKDPDTF-YFLTLEAVSVGSERIKFPGSSFGTSEGNII 319

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           +DSG++ T  P++ +  +++     V  T           CY   +    K PS+   F
Sbjct: 320 IDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDADL--KFPSITAHF 376


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 78/299 (26%), Positives = 126/299 (42%), Gaps = 52/299 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP     +  D GS L+W  C  C  C P            +  + P+ S++ K L
Sbjct: 136 VGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYP-----------KVPVFDPTKSASFKGL 184

Query: 166 SCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            CS +LC  +   C +PK  C Y +  Y +N+SS+G L  + +       + LK   + +
Sbjct: 185 PCSSKLCQSIRQGCSSPK--CTY-LTAYVDNSSSTGTLATETISF-----SHLKYDFK-N 235

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--DSGRI 282
           ++IGC  + SG   + +   G++GL    IS+ S    A +    FS C       +G +
Sbjct: 236 ILIGCSDQVSG---ESLGESGIMGLNRSPISLAS--QTANIYDKLFSYCIPSTPGSTGHL 290

Query: 283 FFGDQGPATQQ--STSFLASNGKYITYIIGV----ETCCIGSSCLKQTSFKAIVDSGSSF 336
            FG + P   +    S  A +  Y   + G+        I +S  K  S    +DSG+  
Sbjct: 291 TFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAS---TIDSGAVL 347

Query: 337 TFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMF 388
           T LP + Y  + + F   +       +GYP          CY  S+     +PS+ + F
Sbjct: 348 TRLPPKAYSALRSVFREMM-------KGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFF 399


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 124/284 (43%), Gaps = 71/284 (25%)

Query: 120 LDAGSDLLWIPC----DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-- 173
           +D GSDL+W+PC     C+ C   SAS  N +      + P  SS+   ++C+   C   
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSAS--NGV------FLPRMSSSLHLVTCADSNCKTL 52

Query: 174 -------LGTSCQNPKQPC-----PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
                  L  SC    + C     PY + Y     S++GLL+ + L+L       L+N  
Sbjct: 53  YGNNTELLCQSCAGSLKNCSETCPPYGIQY--GRGSTAGLLLTETLNL------PLENGE 104

Query: 222 QASVI----IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---- 273
            A  I    +GC +  S        P G+ G G G +S+PS L +  + ++ F+ C    
Sbjct: 105 GARAITHFAVGCSIVSS------QQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSH 157

Query: 274 -FDKDDSGRIF-FGDQGPATQ---QSTSFLASN-----GKY-ITYIIGVETCCIGSSCLK 322
            FD+++   +   GD+          T FL ++      +Y + Y IG+    IG   LK
Sbjct: 158 RFDEENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLK 217

Query: 323 QTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQV 355
           Q   K            I+DSG++FT    E+++ IAA F  Q+
Sbjct: 218 QLPSKLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQI 261


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 133/337 (39%), Gaps = 60/337 (17%)

Query: 72  KQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC 131
           K K +  P   +   + G + +S+ N     +     +GTP  + LVA+D  +D  W+PC
Sbjct: 60  KPKNRANPPVPI---APGRQILSIPN-----YIARAGLGTPAQTLLVAIDPSNDAAWVPC 111

Query: 132 D-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMD 190
             C  CA  S S           +SP+ SST + + C    C      Q P   CP  + 
Sbjct: 112 SACAGCAASSPS-----------FSPTQSSTYRTVPCGSPQC-----AQVPSPSCPAGVG 155

Query: 191 YYTENTSSSGL---LVEDILHLISGGDN-ALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 246
                 SS G            + G D+ AL+N+V  S   GC    SG   + V P GL
Sbjct: 156 ------SSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTFGCLRVVSG---NSVPPQGL 206

Query: 247 IGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQG-PATQQSTSFLASN 301
           IG G G +S   L        + FS C       + SG +  G  G P   ++T  L + 
Sbjct: 207 IGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNP 264

Query: 302 GKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEF 351
            +   Y + +    +GS  ++           T    I+D+G+ FT L   VY  +   F
Sbjct: 265 HRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAF 324

Query: 352 DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
             +V   +    G  +  CY  +      +P+V  MF
Sbjct: 325 RGRVRTPVAPPLGG-FDTCYNVTV----SVPTVTFMF 356


>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
          Length = 654

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 142/322 (44%), Gaps = 49/322 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           HYTW+  GTP     V  D GS L+  PC  C  C   +   + + +          SST
Sbjct: 65  HYTWVYAGTPPQRASVIADTGSGLMAFPCSGCDGCGSHTDQPFQADN----------SST 114

Query: 162 SKHLSCSHRLCDLG-TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG-----DN 215
             H++CS +        C      C  +  Y  E +S    +VED+++L  GG     D 
Sbjct: 115 LIHVTCSQQQSHFQCKECTEKSDTCAISQSYM-EGSSWKASVVEDVVYL--GGESSFHDE 171

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCF 274
           A+++        GC   ++G ++  VA DG++GL   +  + + L +   I  N FS+CF
Sbjct: 172 AMRDRYGTHFQFGCQSSETGLFVTQVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSLCF 230

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLA--------SNGKYITYIIGVETCCIGSSCL--KQT 324
             ++ G +  G+  P T+     ++        S G +  Y + ++   IG   +  K+ 
Sbjct: 231 -TENGGTMSVGE--PNTKAHRGEISYAKVIKDRSAGHF--YNVNMKDIRIGGKSINAKEE 285

Query: 325 SFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
           ++     IVDSG++ ++LP+     +  EF  QV   +   +      C+  +++ L  L
Sbjct: 286 AYTRGHYIVDSGTTDSYLPR----AMKNEF-LQVFKEVAGRDYQVGTSCHGYTNEDLASL 340

Query: 382 PSVKLMFP----QNNSFVVNNP 399
           P ++L+      +N   +++ P
Sbjct: 341 PKIQLVMEAYGDENGEVIIDIP 362


>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 407

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 108/249 (43%), Gaps = 44/249 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           IG P  ++ + +D GSDL W+ CD  C  C         +L RD  +Y P  +     + 
Sbjct: 54  IGNPPKAYELDIDTGSDLTWVQCDAPCKGC---------TLPRD-RQYKPHGNL----VK 99

Query: 167 CSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNALKNS 220
           C   LC    S     C NP + C Y ++Y  +  SS G+LV DI+ L ++ G   L +S
Sbjct: 100 CVDPLCAAIQSAPNPPCVNPNEQCDYEVEY-ADQGSSLGVLVRDIIPLKLTNG--TLTHS 156

Query: 221 VQASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           + A    GCG  Q+  G+    +  G++GLG G  S+ S L   GLIRN    C      
Sbjct: 157 MLA---FGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVVGHCLSGTGG 213

Query: 280 GRIFFGDQ---------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIV 330
           G +FFGDQ          P  Q S+S L        Y  G                +   
Sbjct: 214 GFLFFGDQLIPQSGVVWTPILQSSSSLLKH------YKTGPADMFFNGKATSVKGLELTF 267

Query: 331 DSGSSFTFL 339
           DSGSS+T+ 
Sbjct: 268 DSGSSYTYF 276


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/311 (24%), Positives = 127/311 (40%), Gaps = 54/311 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG+P   F   +D GSDL+W  C  C+ C      Y+           P+ S++   L
Sbjct: 89  VGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFE----------PAKSTSYASL 138

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            CS  +C+   S    +  C Y   +Y ++ SS+G+L  +       G N+ + +V   V
Sbjct: 139 PCSSAMCNALYSPLCFQNACVY-QAFYGDSASSAGVLANETFTF---GTNSTRVAVP-RV 193

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFF 284
             GCG   +G   +G    G++G G G +   SL+++ G  R S+ +  F    + R++F
Sbjct: 194 SFGCGNMNAGTLFNG---SGMVGFGRGAL---SLVSQLGSPRFSYCLTSFMSPATSRLYF 247

Query: 285 G-----------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------- 322
           G             GP   QST F+ +      Y + +    +    L            
Sbjct: 248 GAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINET 305

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYK--SSSQR 377
             +   I+DSG++ TFL +  Y  +   F   V   +      P   +  C+K     +R
Sbjct: 306 DGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATPSDTFDTCFKWPPPPRR 363

Query: 378 LPKLPSVKLMF 388
           +  LP + L F
Sbjct: 364 MVTLPEMVLHF 374


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/259 (27%), Positives = 111/259 (42%), Gaps = 41/259 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I IG+P ++ L+ +D  SDLLWI C  C+ C   S          L  + PS S T ++ 
Sbjct: 89  ISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQS----------LPIFDPSRSYTHRNE 138

Query: 166 SCSHRLCDLGTSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           +C      + +   N   + C Y+M  Y ++T S G+L  ++L   +  D +   ++   
Sbjct: 139 TCRTSQYSMPSLKFNANTRSCEYSMR-YVDDTGSKGILAREMLLFNTIYDESSSAALH-D 196

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----- 279
           V+ GCG    G  L G    G++GLG GE S+     K       FS CF   D      
Sbjct: 197 VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGK------KFSYCFGSLDDPSYPH 247

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKA- 328
             +  GD G      T+ L  +  +  Y + +E   +    L           QT     
Sbjct: 248 NVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQTGLGGT 305

Query: 329 IVDSGSSFTFLPKEVYETI 347
           I+D+G+S T L +E Y+ +
Sbjct: 306 IIDTGNSLTSLVEEAYKPL 324


>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 413

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 131/295 (44%), Gaps = 39/295 (13%)

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYT-WIDIGTPNVSFLVALD 121
           + L +   Q + ++ G    +LFP +G       N +   H+T  ++IG P+  F + +D
Sbjct: 21  KFLFADSEQVKTLRFGSS--VLFPVRG-------NVYPLGHFTVLLNIGNPSKVFELDID 71

Query: 122 AGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC- 178
            GSDL W+ CD  C+ C         +L RD+  Y P  ++ S+       L  LG    
Sbjct: 72  TGSDLTWVQCDVECIGC---------TLPRDM-LYRPHNNAVSREDPLCAALSSLGKFIF 121

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDI--LHLISGGDNALKNSVQASVIIGCGMKQSGG 236
           +NP   C Y ++ Y ++ SS G+LV+D+  + L +G        +  ++  GCG  Q  G
Sbjct: 122 KNPNDQCAYEVE-YADHGSSVGVLVKDLVPMRLTNG------KRISPNLGFGCGYDQENG 174

Query: 237 YLDG-VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFGDQGPATQQS 294
            L    +  G++GL   + ++ S L+  G + N    C   +      F GD  P++  S
Sbjct: 175 DLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGGFLFFGGDVVPSSGMS 234

Query: 295 TSFLASN--GKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETI 347
            + +  N  GKY +   G          +         DSGSS+T+   +VY  I
Sbjct: 235 WTPILRNSEGKYSS---GPAEVYFNGRAVGIGGLTLTFDSGSSYTYFNSQVYRAI 286


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/279 (27%), Positives = 111/279 (39%), Gaps = 26/279 (9%)

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-----CQNPKQPCPYTMDYYTENTSSSGL 201
           L  DL  Y P+ S TS  + C    C    S     C+     CPY++ Y  + +++SG 
Sbjct: 42  LGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITY-GDGSTTSGS 99

Query: 202 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGV--APDGLIGLGLGEISVPSL 259
            V D L       N       +SVI GCG KQSG        A DG+IG G    SV S 
Sbjct: 100 FVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQ 159

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI-----IGVETC 314
           LA +G ++  FS C D    G IF   Q    + +T+ L     +   I     +  E  
Sbjct: 160 LAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPI 219

Query: 315 CIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAE-FDRQVNDTITSFEGYPWKCCYK 372
            +        S +  I+DSG++  +LP  +Y  +  +   RQ    +   E      C+ 
Sbjct: 220 LLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVE--DQFTCFH 277

Query: 373 SSSQRLPKLPSVKLMF--------PQNNSFVVNNPVFVI 403
            S +     P VK  F        P +  F+    ++ I
Sbjct: 278 YSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDIYCI 316


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 105/383 (27%), Positives = 157/383 (40%), Gaps = 54/383 (14%)

Query: 49  NATSWP--AKKSFEYYQVLLSSDVQKQKMKTGPQFQML-FPSQGSKTMSLGNDFGWLHYT 105
           N++SW     +SFE     L++   K    +GP   M   P Q   T+  GN     +  
Sbjct: 88  NSSSWIDLVSQSFERDNARLNTIRSK---NSGPYTTMSNLPLQSGTTVGTGN-----YIV 139

Query: 106 WIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
               GTP  + L+ +D GSDL WI C  C  C       Y+ +D     + P  SS+ K 
Sbjct: 140 TAGFGTPAKNSLLIIDTGSDLTWIQCKPCADC-------YSQVDA---IFEPKQSSSYKT 189

Query: 165 LSCSHRLC-DLGTSCQNPKQ----PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           L C    C +L TS  NP       C Y ++ Y + +SS G   ++ L L   G ++ +N
Sbjct: 190 LPCLSATCTELITSESNPTPCLLGGCVYEIN-YGDGSSSQGDFSQETLTL---GSDSFQN 245

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL-LAKAGLIRNSFSMCF-DKD 277
                   GCG   +G +       GL+GLG   +S PS   +K G     F+ C  D  
Sbjct: 246 -----FAFGCGHTNTGLF---KGSSGLLGLGQNSLSFPSQSKSKYG---GQFAYCLPDFG 294

Query: 278 DSGRIFFGDQGPATQQSTSF---LASNGKYIT-YIIGVETCCIGSSCLKQTSF-----KA 328
            S        G  +  +++    L SN  Y T Y +G+    +G   L            
Sbjct: 295 SSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGST 354

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           IVDSG+  T L  + Y  +   F  +  D  ++        CY  S     ++P++   F
Sbjct: 355 IVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHF 414

Query: 389 PQNNSFVVNNPVFVIYGTQVGVS 411
            QNN+ V  + V ++   Q G S
Sbjct: 415 -QNNADVAVSDVGILVPVQNGGS 436


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 131/340 (38%), Gaps = 69/340 (20%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC----DCVRCAPLSASYYNSLDRDLNEYSPSA 158
           ++  + +GTP    L+  D GSDL+W+ C    +C R  P SA     L R    +SP+ 
Sbjct: 89  YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSA----FLARHSTTFSPNH 144

Query: 159 SSTS--------KHLSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED--IL 207
              S        KH  C+H RL            PC Y    Y + + +SG   ++   L
Sbjct: 145 CYDSACQLVPLPKHHRCNHARL----------HSPCRYEYS-YGDGSKTSGFFSKETTTL 193

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAG 264
           +  SG +  LK      +  GC  + SG  + G +     G++GLG G IS+ S L    
Sbjct: 194 NTSSGREAKLKG-----IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR- 247

Query: 265 LIRNSFSMCFDKDD-----SGRIFFG----DQGPATQQS--TSFLASNGKYITYIIGVET 313
              N FS C    D     +  +  G    D  P  ++   T    +      Y IG+E+
Sbjct: 248 -FGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIES 306

Query: 314 CCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 363
             +    L          +  +   IVDSG++ TFLP+  Y  I     R+V     +  
Sbjct: 307 VSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEP 366

Query: 364 GYPWKCCYKSSSQRLPKLPSVKLMF--------PQNNSFV 395
              +  C   S    P+LP +            P  N FV
Sbjct: 367 TPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFV 406


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/338 (23%), Positives = 132/338 (39%), Gaps = 54/338 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP   F +  D GSDL W+   C   +P               + P  S + 
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWV--KCAGASPPG-----------RVFRPKTSRSW 162

Query: 163 KHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLL-VEDILHLISGGDNA 216
             + CS   C L       +C +P  PC Y   Y   +  + G++  E     + GG  A
Sbjct: 163 APIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVA 222

Query: 217 -LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 274
            LK+     V++GC     G        DG++ LG  +IS  +    A     SFS C  
Sbjct: 223 QLKD-----VVLGCSSSHDGQSFRSA--DGVLSLGNAKISFAT--QAAARFGGSFSYCLV 273

Query: 275 ----DKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-------K 322
                ++ +G + FG  Q P T  + + L  + +   Y + V+   +    L        
Sbjct: 274 DHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWD 333

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR----- 377
             S   I+DSG++ T L    Y+ + A   + + D +      P++ CY  +++R     
Sbjct: 334 AKSGGVILDSGNTLTVLAAPAYKAVVAALSKHL-DGVPKVSFPPFEHCYNWTARRPGAPE 392

Query: 378 -LPKLP-----SVKLMFPQNNSFVVNNPVFVIYGTQVG 409
            +PKL      S +L  P  +  +   P     G Q G
Sbjct: 393 IIPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIGVQEG 430


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 134/317 (42%), Gaps = 46/317 (14%)

Query: 93  MSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDR 149
           ++ G   G  +Y T + +GTP  ++++ +D+GS L W+ C    V C P +   Y+    
Sbjct: 97  LASGASVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYD---- 152

Query: 150 DLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNPKQ-----PCPYTMDYYTENTSSSGLLV 203
                 P ASST   + CS   C +L  +  NP        C Y    Y + + S G L 
Sbjct: 153 ------PRASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQAS-YGDGSFSFGYLS 205

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           +D + L S G              GCG    G  L G A  GLIGL   ++S+ S LA +
Sbjct: 206 KDTVSLSSSGSF-------PGFYYGCGQDNVG--LFGRA-AGLIGLARNKLSLLSQLAPS 255

Query: 264 GLIRNSFSMCFDKD---DSGRIFFG----DQGPATQQSTSFLASNGKYITYIIGVETCCI 316
             + NSF+ C        +G + FG    ++ P     TS ++S+     Y + +    +
Sbjct: 256 --VGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSV 313

Query: 317 GSSCL-----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
             S L     +  S   I+DSG+  T LP  VY  ++      +            + C+
Sbjct: 314 AGSPLAVPSSEYGSLPTIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-LQTCF 372

Query: 372 KSSSQRLPKLPSVKLMF 388
           K    +LP +P+V + F
Sbjct: 373 KGQVAKLP-VPAVNMAF 388


>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 429

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/287 (27%), Positives = 131/287 (45%), Gaps = 28/287 (9%)

Query: 77  TGPQFQMLFPSQGSKTMSL-GNDF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD- 132
           T  + ++L P+  S  + L GN +  G+ + T ++IG P   + + +D GSDL W+ CD 
Sbjct: 41  TSSRSRLLNPAGSSIVLPLYGNVYPVGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCDA 99

Query: 133 -CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY 191
            C  C+      Y    R  N++ P        L  +        +C++P Q C Y ++ 
Sbjct: 100 PCTHCSETPHPLY----RPSNDFVPCRDPLCASLQPTEDY-----NCEHPDQ-CDYEIN- 148

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASV--IIGCGMKQSGGYLDGVAPDGLIGL 249
           Y +  S+ G+L+ D+  L         N VQ  V   +GCG  Q          DGL+GL
Sbjct: 149 YADQYSTFGVLLNDVYLL------NFTNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGL 202

Query: 250 GLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS-NGKYITYI 308
           G G+ S+ S L   GL+RN    C      G IFFG+   + + + + ++S + K+  Y 
Sbjct: 203 GRGKASLISQLNSQGLVRNVIGHCLSAQGGGYIFFGNAYDSARVTWTPISSVDSKH--YS 260

Query: 309 IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
            G      G       S  A+ D+GSS+T+     Y+ + +   +++
Sbjct: 261 AGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLSWLKKEL 307


>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Brachypodium distachyon]
          Length = 436

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 117/265 (44%), Gaps = 32/265 (12%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+   + +G P+  + +A   GSD++W+PC      P       SLD     Y P  SST
Sbjct: 75  LYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDCPTPDDIGFSLDL----YDPKNSST 130

Query: 162 -----------SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL- 209
                      +  L   H +C   TS  +  Q C Y   Y     +++G  V D +H  
Sbjct: 131 SSEISCSDDRCADALKTGHAICH--TSHSSGDQ-CGYNQIYADGVLATTGYYVSDDIHFD 187

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
           I  G+ +  +S  ASVI GC   +SG     +  DG+IG G    S+ S L   G + ++
Sbjct: 188 IFMGNESFASS-SASVIFGCSKSRSG----HLQADGVIGFGKDAPSLISQLNSQG-VSHA 241

Query: 270 FSMCFDK-DDSGRIFFGDQ-GPATQQSTSFLAS----NGKYITYIIGVETCCIGSSCLKQ 323
           FS C D  DD G +   D+ G    + TS +AS    N    +  +  +   I SS    
Sbjct: 242 FSRCLDDSDDGGGVLILDEVGEPGLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTT 301

Query: 324 TSFKA-IVDSGSSFTFLPKEVYETI 347
           +S +   +DSG+S  + P  VY+ +
Sbjct: 302 SSTQGTFLDSGTSLAYFPDGVYDPV 326


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 143/330 (43%), Gaps = 59/330 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I IGTP V  L   D GSDL+W  C+ C  C   ++  ++          P  SST + +
Sbjct: 90  ISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFD----------PKESSTYRKV 139

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN--ALKNSV 221
           SCS   C      SC   +  C YT+  Y +N+ + G +  D + + S G    +L+N  
Sbjct: 140 SCSSSQCRALEDASCSTDENTCSYTIT-YGDNSYTKGDVAVDTVTMGSSGRRPVSLRN-- 196

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 276
              +IIGCG + +G +    A  G+IGLG G  S+ S L K+  I   FS C      + 
Sbjct: 197 ---MIIGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSET 249

Query: 277 DDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSF------- 326
             + +I FG  G  +     STS +  +     Y + +E   +GS  ++ TS        
Sbjct: 250 GLTSKINFGTNGIVSGDGVVSTSMVKKD-PATYYFLNLEAISVGSKKIQFTSTIFGTGEG 308

Query: 327 KAIVDSGSSFTFLPKEVY--------ETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 378
             ++DSG++ T LP   Y         TI AE   Q  D I S        CY+ SS   
Sbjct: 309 NIVIDSGTTLTLLPSNFYYELESVVASTIKAE-RVQDPDGILSL-------CYRDSSSF- 359

Query: 379 PKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 408
            K+P + + F   +  + N   FV     V
Sbjct: 360 -KVPDITVHFKGGDVKLGNLNTFVAVSEDV 388


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/299 (22%), Positives = 131/299 (43%), Gaps = 37/299 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP       +D GS+++W+ C  C  C   ++  +N          PS SS+ K++ C
Sbjct: 95  VGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFN----------PSKSSSYKNIPC 144

Query: 168 SHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           +   C    D   SC N    C Y++  Y  +  S G L  D L L S   +++   +  
Sbjct: 145 TSSTCKDTNDTHISCSNGGDVCEYSIT-YGGDAKSQGDLSNDSLTLDSTSGSSV---LFP 200

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDD 278
           +++IGCG        D     G++G+G G +S+   +  +  + + FS C      D + 
Sbjct: 201 NIVIGCG--HINVLQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYNSDSNS 257

Query: 279 SGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAI 329
           S ++ FG+    + +   ST  +  NG+   Y + +E   +G++ ++       ++   +
Sbjct: 258 SSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNIL 317

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           +DSG+  T LP      + +   ++V         +    CY ++ ++L  +P +   F
Sbjct: 318 IDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQL-NVPDITAHF 375


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/318 (26%), Positives = 132/318 (41%), Gaps = 65/318 (20%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           IGTP ++    LD GSDL+W  CD  C RC P  A            Y+P+ S T  ++S
Sbjct: 106 IGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPL----------YAPARSVTYANVS 155

Query: 167 CSHRLCDLGTSCQ-------------NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
           C  RLCD   S +               +  C Y    Y + +S+ G+L  +     +G 
Sbjct: 156 CGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYS-YGDGSSTDGVLATETFTFGAG- 213

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
                 +    +  GCG    GG  +     GL+G+G G +   SL+++ G+ +  FS C
Sbjct: 214 ------TTVHDLAFGCGTDNLGGTDNS---SGLVGMGRGPL---SLVSQLGVTK--FSYC 259

Query: 274 F----DKDDSGRIFFGDQG---PATQQSTSFLASNG---KYITYIIGVETCCIGSSCL-- 321
           F    D   S  +F G      PA  +ST F+ S     +   Y + +E   +G + L  
Sbjct: 260 FTPFNDTTTSSPLFLGSSASLSPAA-KSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPI 318

Query: 322 KQTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS 373
               F+         I+DSG++FT L +  +  +A     +V   + S        C+ +
Sbjct: 319 DPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAA 378

Query: 374 SSQRLPK---LPSVKLMF 388
              R P+   +P + L F
Sbjct: 379 PQGRGPEAVDVPRLVLHF 396


>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
 gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
          Length = 408

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/269 (26%), Positives = 117/269 (43%), Gaps = 43/269 (15%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCD-----CVRCAPLSASYYNSLDRDLNEYSPSA 158
           Y  ++IG P   + + +D GS   W+ C      C  C  +    Y    + L       
Sbjct: 40  YVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLYRLTRKKL------- 92

Query: 159 SSTSKHLSCSHRLCD-----LGTS--CQN-PKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
                 + C+  LCD     LGT+  C +  K  C Y + Y  +  SS G+L+ D   L 
Sbjct: 93  ------VPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKY-QDGLSSLGVLLLDKFSLP 145

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGG----YLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +GG          ++  GCG  Q  G      + V  DG++GLG G + + S L  +G +
Sbjct: 146 TGG--------ARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAV 197

Query: 267 -RNSFSMCFDKDDSGRIFFGDQG-PATQQSTSFLA--SNGKYITYIIGVETCCIGSSCLK 322
            +N    C      G +F G++  P++  +   +A  + G+   Y  G  T  + S+ + 
Sbjct: 198 SKNVIGHCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIG 257

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEF 351
               KAI DSGS++T+LP+ ++  + +  
Sbjct: 258 TKPLKAIFDSGSTYTYLPENLHAQLVSAL 286


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 103/399 (25%), Positives = 169/399 (42%), Gaps = 59/399 (14%)

Query: 15  LLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQK 74
           L+   S A       K IH  + + +   V  N + +S   K  F Y     S+ + +Q 
Sbjct: 28  LVLRDSAARGGGIGFKAIHVAAPQFR---VKANPSPSSAAQKSLFPY-----SAHIFQQH 79

Query: 75  MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DC 133
            K     +       S T +LG  FG  +YT I +G+P    ++ +D GS+L W+ C  C
Sbjct: 80  TKNPAALR-------SSTTTLGRKFGE-YYTSIKLGSPGQEAILIVDTGSELTWLKCLPC 131

Query: 134 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS----HRLCDLGTSCQNPKQPCPYTM 189
             CAP   + Y++  R ++ Y P   + S+  S S    +  C  G+ CQ          
Sbjct: 132 KVCAPSVDTIYDAA-RSVS-YKPVTCNNSQLCSNSSQGTYAYCARGSQCQ--------FA 181

Query: 190 DYYTENTSSSGLLVED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 247
            +Y + + S G L  D  I+  + GG    K         GC   Q    L      G++
Sbjct: 182 AFYGDGSFSYGSLSTDTLIMETVVGG----KPVTVQDFAFGCA--QGDLELVPTGASGIL 235

Query: 248 GLGLGEISVPSLLAKAGLIRNSFSMCF-DK----DDSGRIFFGD-QGPATQ-QSTSFLAS 300
           GL  G++++P  L +       FS CF D+    + +G +FFG+ + P  Q Q TS   +
Sbjct: 236 GLNAGKMALPMQLGQR--FGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALT 293

Query: 301 NGKYIT--YIIGVETCCIGSS--CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           N +     Y + ++   I S    L       I+DSGSSF+   +  +  +   F +   
Sbjct: 294 NSELQRKFYHVALKGVSINSHELVLLPRGSVVILDSGSSFSSFVRPFHSQLREAFLKHRP 353

Query: 357 DTITSFEGYPW---KCCYKSSSQRLPK----LPSVKLMF 388
            ++   EG  +     C+K S+  + +    LPS+ L+F
Sbjct: 354 PSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVF 392


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 129/316 (40%), Gaps = 54/316 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP   F   +D GSDL+W  C  C +C   S   +N          P  SS+   L
Sbjct: 99  LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN----------PQGSSSFSTL 148

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            CS +LC    S       C YT   Y + + + G +  + L     G  ++ N     +
Sbjct: 149 PCSSQLCQALQSPTCSNNSCQYTYG-YGDGSETQGSMGTETLTF---GSVSIPN-----I 199

Query: 226 IIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGR 281
             GCG    G G  +G    GL+G+G G +S+PS L         FS C       +S  
Sbjct: 200 TFGCGENNQGFGQGNGA---GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSNSST 251

Query: 282 IFFG---DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------- 327
           +  G   +   A   +T+ + S+     Y I +    +GS+ L    + FK         
Sbjct: 252 LLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKL 386
            I+DSG++ T+     Y+ +   F  Q+N ++ +     +  C++  S Q   ++P+  +
Sbjct: 312 IIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVM 371

Query: 387 MF-------PQNNSFV 395
            F       P  N F+
Sbjct: 372 HFDGGDLVLPSENYFI 387


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 79/335 (23%), Positives = 139/335 (41%), Gaps = 64/335 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG P V +   +D GSDL+W  C  C  C           D+    + P  SS+   +
Sbjct: 112 LSIGNPAVKYAAIVDTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKV 161

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            CS  LC+    ++C   K  C Y +  Y + +S+ GLL  +            +NS+ +
Sbjct: 162 GCSSGLCNALPRSNCNEDKDSCEY-LYTYGDYSSTRGLLATETFTFED------ENSI-S 213

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDD 278
            +  GCG++  G G+  G    GL+GLG G +S+ S L +       FS C     D + 
Sbjct: 214 GIGFGCGVENEGDGFSQG---SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEA 265

Query: 279 SGRIFFGDQGPATQQST------------SFLASNGKYITYIIGVETCCIGSSCL--KQT 324
           S  +F G         T            S L +  +   Y + ++   +G+  L  +++
Sbjct: 266 SSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKS 325

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---- 372
           +F+         I+DSG++ T+L +  ++ +  EF  +++  +          C+K    
Sbjct: 326 TFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNA 385

Query: 373 SSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVI 403
           + +  +PKL        L  P  N  V ++   V+
Sbjct: 386 AKNIAVPKLIFHFKGADLELPGENYMVADSSTGVL 420


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 124/289 (42%), Gaps = 44/289 (15%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSL 147
            ++ + L +D  +L    + IGTP   +   LD GSDL+W  C  C+ C          +
Sbjct: 78  AARILVLASDGEYLM--EMGIGTPTRYYSAILDTGSDLIWTQCAPCLLC----------V 125

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           D+    + P+ S+T + L C+   C+        ++ C Y   +Y ++ S++G+L  +  
Sbjct: 126 DQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCVYQY-FYGDSASTAGVLANETF 184

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
                G N  + S+   +  GCG   +G   +G    G++G G G +   SL+++ G  R
Sbjct: 185 TF---GTNETRVSLPG-ISFGCGNLNAGSLANG---SGMVGFGRGSL---SLVSQLGSPR 234

Query: 268 NSFSMC-FDKDDSGRIFFG--------DQGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
            S+ +  F      R++FG        +      QST F+ +      Y + +    +G 
Sbjct: 235 FSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG 294

Query: 319 SCLK-----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
             L              +   I+DSG++ T+L +  Y+ + A F  Q+ 
Sbjct: 295 YLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQIT 343


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 77/275 (28%), Positives = 120/275 (43%), Gaps = 52/275 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRC----APLSASYYNSLDRDLNEYSPSASST 161
           + +GTP    L   D GSDL+W  C  C +C    APL              + P +S T
Sbjct: 97  LSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPL--------------FDPKSSKT 142

Query: 162 SKHLSCSHRLC-DLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNA 216
            + LSC  R C +LG  +SC + +Q C Y+  YY + + ++G L  D + L S  GG   
Sbjct: 143 YRDLSCDTRQCQNLGESSSCSS-EQLCQYSY-YYGDRSFTNGNLAVDTVTLPSTNGGPVY 200

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--- 273
              +V     IGCG + +G +       G+IGLG G +S+ S +  +  +   FS C   
Sbjct: 201 FPKTV-----IGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVP 251

Query: 274 FDKDDSG---RIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCL------ 321
           F  + +G   ++ FG     +    QST  ++ N     Y+  +E   +G   +      
Sbjct: 252 FSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLT-LEAMSVGDKKIEFGGSS 310

Query: 322 -KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
              +    I+DSG+S T  P   +   A   +  V
Sbjct: 311 FGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAV 345


>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
 gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
          Length = 681

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 139/324 (42%), Gaps = 54/324 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           HYTW+  GTP     V  D GS L+  PC  C  C   +   + + +          SST
Sbjct: 67  HYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCGHHTDQPFQAAN----------SST 116

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG-----DNA 216
             H++C+ +       C      C  +  Y  E +S    +VEDI++L  GG     D  
Sbjct: 117 LVHITCAQKSLFQCKECHVQSDTCGISQSYM-EGSSWKASVVEDIVYL--GGESSFDDKE 173

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFD 275
           ++N        GC   + G ++  VA DG++GL   E  + + L +   I  N FS+CF 
Sbjct: 174 MRNRYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLCF- 231

Query: 276 KDDSGRIFFGDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA- 328
            ++ G +  G    A  +        +A       Y + ++   IG   +  K+ ++   
Sbjct: 232 TENGGTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYTRG 291

Query: 329 --IVDSGSSFTFLPK-------EVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             IVDSG++ ++LP+       ++++ IA   D QV ++   F           +++ L 
Sbjct: 292 HYIVDSGTTDSYLPRALKTEFLQMFKEIAGR-DYQVGNSCKGF-----------TNKDLA 339

Query: 380 KLPSVKLMFP----QNNSFVVNNP 399
            LP+++L+      +N   +++ P
Sbjct: 340 SLPTIQLVMEAYGDENAEVILDVP 363


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 87/334 (26%), Positives = 132/334 (39%), Gaps = 61/334 (18%)

Query: 110 GTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH 169
           G+P  +  V +D GSDL W     V+C P SA Y     RD   + P+ S+T   + C+ 
Sbjct: 155 GSPAANLTVIVDTGSDLTW-----VQCKPCSACYAQ---RD-PLFDPAGSATYAAVRCNA 205

Query: 170 RLC--DLGTSCQNP---------KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
             C   L  +   P          + C Y +  Y + + S G+L  D +        AL 
Sbjct: 206 SACADSLRAATGTPGSCGSTGAGSEKCYYAL-AYGDGSFSRGVLATDTV--------ALG 256

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK--AGLIRNSFSMCFDK 276
            +     + GCG+    G   G A  GL+GLG  E+S+ S  A    G+           
Sbjct: 257 GASLGGFVFGCGLSNR-GLFGGTA--GLMGLGRTELSLVSQTASRYGGVFSYCLPAATSG 313

Query: 277 DDSGRIFF--GDQGPATQQSTS------FLASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
           D SG +    GD   ++ ++T+       +A   +   Y + V    +G + L      A
Sbjct: 314 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGA 373

Query: 329 ---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRL 378
              ++DSG+  T L   VY  + AEF RQ         GYP          CY  +    
Sbjct: 374 SNVLIDSGTVITRLAPSVYRAVRAEFMRQFG-----AAGYPAAPGFSILDTCYDLTGHDE 428

Query: 379 PKLPSVKLMFPQNNSFVVNNP--VFVIY--GTQV 408
            K+P + L         V+    +FV+   G+QV
Sbjct: 429 VKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQV 462


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 131/316 (41%), Gaps = 50/316 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIP-CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP    L+ +D  S+L W+    C  C+P     +N          P  SS+     C
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFN----------PGLSSSFISEPC 54

Query: 168 SHRLC----DLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           +  +C     LG  ++C      C + + Y  + + + G++  +I  L S    A   S 
Sbjct: 55  TSSVCLGRSKLGFQSACNRSTGSCSFQVAYL-DGSEAYGVIAREIFSLQSWDGAA---ST 110

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL---AKAGLIRNSFSMCFDK-- 276
              VI GC  K     +D     G +GL  G  S P+ +   +K+GL  + FS CF    
Sbjct: 111 LGDVIFGCASKDLQRPVD--FSSGTLGLNRGSFSFPAQIGSRSKSGL-SDRFSYCFPNRA 167

Query: 277 ---DDSGRIFFGDQG-PATQQSTSFLASNGKYIT----YIIGVETCCIGSSCLK--QTSF 326
              + SG I FGD G PA       L       +    Y +G++   +G   L   +++F
Sbjct: 168 EHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAF 227

Query: 327 K--------AIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSS-- 375
           K           DSG++ +FL +  +  +   F R+V +   TS   +  + CY  ++  
Sbjct: 228 KIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGD 287

Query: 376 QRLPKLPSVKLMFPQN 391
            RLP  P V L F  N
Sbjct: 288 ARLPTAPLVTLHFKNN 303


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 76/270 (28%), Positives = 117/270 (43%), Gaps = 43/270 (15%)

Query: 98  DFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR----------CAPLSASYYNSL 147
           DF +L    +++GTP V FL   D GSDL+W+ C+  +              ++S     
Sbjct: 79  DFEYL--AAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPP 136

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLC-DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVE 204
              +  ++P  SS+   + C    C  L T  SC      C +   Y  +  S++GLL  
Sbjct: 137 PEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYR-DGASATGLLAA 195

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           D      GG+     +  AS+  GC    +G        DG++GLG G +S+ S L +  
Sbjct: 196 DTFTF--GGNINNDTTSTASIDFGCATGTAGREFQA---DGMVGLGAGPLSLASQLGR-- 248

Query: 265 LIRNSFSMC---FDKDDSGRIF-FG------DQGPATQQSTSFLASNGKYIT-YIIGVET 313
                FS C   +D DD+  I  FG      D G AT   T  +AS+      Y I +++
Sbjct: 249 ----KFSFCLTAYDIDDASSILNFGARAVVSDPGAAT---TPLIASSSNAAAYYAISIDS 301

Query: 314 CCIGSSCLKQTS--FKAIVDSGSSFTFLPK 341
             +    +  T+   K IVD+G+  TFL +
Sbjct: 302 LKVAGQPVPGTTSVSKVIVDTGTVLTFLDR 331


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 72/293 (24%), Positives = 115/293 (39%), Gaps = 31/293 (10%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IG+P V  L  +D GS L+W+ C  C  C P          ++   + P  SST K+ +C
Sbjct: 95  IGSPPVERLAMVDTGSSLIWLQCSPCHNCFP----------QETPLFEPLKSSTYKYATC 144

Query: 168 SHRLCDL----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
             + C L       C    Q C Y +  Y + + S G+L  + L    G     +     
Sbjct: 145 DSQPCTLLQPSQRDCGKLGQ-CIYGI-MYGDKSFSVGILGTETLSF--GSTGGAQTVSFP 200

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSG 280
           + I GCG+  +          G+ GLG G +S+ S L     I + FS C   +D   + 
Sbjct: 201 NTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTS 258

Query: 281 RIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKAIVDSGSS 335
           ++ FG +   T     ST  +        Y + +E   IG   +   QT    ++DSG+ 
Sbjct: 259 KLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTP 318

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            T+L    Y    A     +   +      P K C+ + +     +P +   F
Sbjct: 319 LTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPNRANL--AIPDIAFQF 369


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 89/314 (28%), Positives = 132/314 (42%), Gaps = 49/314 (15%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 169 SPGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVVCYEQREK--- 220

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P+ SST  ++SC+   C DL T  C      C Y +  Y + + S G    D L L 
Sbjct: 221 LFDPARSSTYANVSCAAPACSDLDTRGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 277

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 278 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 324

Query: 270 FSMCFDKDDSGRIF--FGDQGPATQQSTS-FLASNGKYITYIIGVETCCIGSSCL--KQT 324
           F+ C     +G  +  FG   PA + +T+  L  NG    Y +G+    +G   L   Q+
Sbjct: 325 FAHCLPARSTGTGYLDFGAGSPAARLTTTPMLVDNGPTF-YYVGLTGIRVGGRLLYIPQS 383

Query: 325 SFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSS 374
            F     IVDSG+  T LP   Y ++ + F   +     S  GY           CY  +
Sbjct: 384 VFATAGTIVDSGTVITRLPPAAYSSLRSAFAAAM-----SARGYKKAPAVSLLDTCYDFA 438

Query: 375 SQRLPKLPSVKLMF 388
                 +P+V L+F
Sbjct: 439 GMSQVAIPTVSLLF 452


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 155/387 (40%), Gaps = 69/387 (17%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
              K + R  + +  +G  +N ++    AK+S +  +V+ ++ + +  M++      +  
Sbjct: 64  MQAKDLFRHEQMITMMGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSALNIAHV-- 121

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY-- 144
                          ++   + IGTP + + + LD  +DL WI C   R       +Y  
Sbjct: 122 --------------GMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRR---RKGKHYGR 164

Query: 145 NSLDRDL----------------NEYSPSASSTSKHLSCSHRLCDL--GTSCQNP--KQP 184
            S+ + +                N Y P+ SS+ + + CS + C +    +CQ+P   + 
Sbjct: 165 QSMGQTMSVGGEGATAAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAES 224

Query: 185 CPYTMDYYTENTSSSGLL-VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP 243
           C Y      + T + G+   E     +S G    + +    +I+GC + ++GG +D  A 
Sbjct: 225 CSY-FQKTQDGTVTIGIYGKEKATVTVSDG----RMAKLPGLILGCSVLEAGGSVD--AH 277

Query: 244 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ----GPATQQS 294
           DG++ LG G++S     AK       FS C       +D S  + FG      GP T ++
Sbjct: 278 DGVLSLGNGDMSFAVHAAKR--FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMET 335

Query: 295 TSFL------ASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYE 345
                     A   K    ++G E   I         F     I+D+ +S T L  E Y 
Sbjct: 336 DILYNVDVKPAYGAKVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYA 395

Query: 346 TIAAEFDRQVNDTITSFEGYPWKCCYK 372
            + A  DR ++     +E   ++ CYK
Sbjct: 396 PVTAALDRHLSHLPRVYELEGFEYCYK 422


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 128/315 (40%), Gaps = 60/315 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP + F V +D GS+L+W  C  C RC P                 P+ SST   L
Sbjct: 95  ISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRL 146

Query: 166 SCSHRLCD-LGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
            C+   C  L TS +    N    C Y   Y +  T  +G L  + L +   GD      
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTV---GDGTFPK- 200

Query: 221 VQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
               V  GC  +      +GV    G++GLG G +S+ S LA     R S+ +  D  D 
Sbjct: 201 ----VAFGCSTE------NGVDNSSGIVGLGRGPLSLVSQLAVG---RFSYCLRSDMADG 247

Query: 280 GR--IFFGDQGPATQQST---------SFLASNGKYITYIIGV-----ETCCIGSS-CLK 322
           G   I FG     T++S           +L  +  Y   + G+     E    GS+    
Sbjct: 248 GASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFT 307

Query: 323 QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVND----TITSFEGYPWKCCYKSSS- 375
           QT      IVDSG++ T+L K+ Y  +   F  Q+ +    T  S   Y    CYK S+ 
Sbjct: 308 QTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAG 367

Query: 376 --QRLPKLPSVKLMF 388
              +  ++P + L F
Sbjct: 368 GGGKAVRVPRLALRF 382


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 147/363 (40%), Gaps = 57/363 (15%)

Query: 3   RISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYY 62
           R  LT+       +   S A+   FS +LIHR S           ++    P +  ++Y+
Sbjct: 4   RSFLTLLFFSICFIVSFSHAQKNGFSVELIHRDSL----------KSPLYKPTQNKYQYF 53

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
                 D  ++ +     F     +   ++  +  D G    T+  +GTP       +D 
Sbjct: 54  -----VDAARRSINRANHFYKYSLANIPQSTVIP-DIGEYLMTY-SVGTPPFKLYGIVDT 106

Query: 123 GSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQ 179
           GSD++W+ C+ C  C   +   +N          PS SS+ K++ C  +LC     TSC 
Sbjct: 107 GSDIVWLQCEPCQECYNQTTPMFN----------PSKSSSYKNIPCPSKLCQSMEDTSC- 155

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           N K  C Y+  YY +N+ S G L  D L L S   N L  S   +++IGCG      Y +
Sbjct: 156 NDKNYCEYST-YYGDNSHSGGDLSVDTLTLES--TNGLTVSF-PNIVIGCGTNNILSY-E 210

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---------DKDDSGRIFFGDQGPA 290
           G A  G++G G G  S  + L  +      FS C            + + ++ FGD    
Sbjct: 211 G-ASSGIVGFGSGPASFITQLGSS--TGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATV 267

Query: 291 TQQ---STSFLASNGKYITYI------IGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPK 341
           +     +T  L  + +   Y+      +G     IG           I+DSG++ T L K
Sbjct: 268 SGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIIIDSGTTLTSLTK 327

Query: 342 EVY 344
           + Y
Sbjct: 328 DDY 330


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 77/317 (24%), Positives = 133/317 (41%), Gaps = 59/317 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           ++IG  N++ +V  D GSDL W+ C   R        YN  D   N   PS S + + + 
Sbjct: 71  VEIGGRNMTVIV--DTGSDLTWVQCQPCRLC------YNQQDPLFN---PSGSPSYQTIL 119

Query: 167 CSHRLC--------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
           C+   C        +LG  C +    C Y ++Y   + +   L +E +          L 
Sbjct: 120 CNSSTCQSLQYATGNLGV-CGSNTPTCNYVVNYGDGSYTRGDLGMEQL---------NLG 169

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--- 275
            +  ++ I GCG + + G   G +  GL+GLG  ++S+ S    + +    FS C     
Sbjct: 170 TTHVSNFIFGCG-RNNKGLFGGAS--GLMGLGKSDLSLVS--QTSAIFEGVFSYCLPTTA 224

Query: 276 KDDSGRIFFGDQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLKQTSFKA-- 328
            D SG +  G      + +T      + +N +  T Y + +    IG   L+  +++   
Sbjct: 225 ADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSG 284

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPK 380
            ++DSG+  T LP  VY  + AEF +Q       F G+P          C+  +      
Sbjct: 285 ILIDSGTVITRLPPPVYRDLKAEFLKQ-------FSGFPSAPPFSILDTCFNLNGYDEVD 337

Query: 381 LPSVKLMFPQNNSFVVN 397
           +P++++ F  N    V+
Sbjct: 338 IPTIRMQFEGNAELTVD 354


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 128/316 (40%), Gaps = 54/316 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP   F   +D GSDL+W  C  C +C   S   +N          P  SS+   L
Sbjct: 99  LSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN----------PQGSSSFSTL 148

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            CS +LC    S       C YT   Y + + + G +  + L     G  ++ N     +
Sbjct: 149 PCSSQLCQALQSPTCSNNSCQYTYG-YGDGSETQGSMGTETLTF---GSVSIPN-----I 199

Query: 226 IIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGR 281
             GCG    G G  +G    GL+G+G G +S+PS L         FS C        S  
Sbjct: 200 TFGCGENNQGFGQGNGA---GLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTSST 251

Query: 282 IFFG---DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK--------- 327
           +  G   +   A   +T+ + S+     Y I +    +GS+ L    + FK         
Sbjct: 252 LLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK-SSSQRLPKLPSVKL 386
            I+DSG++ T+     Y+ +   F  Q+N ++ +     +  C++  S Q   ++P+  +
Sbjct: 312 IIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVM 371

Query: 387 MF-------PQNNSFV 395
            F       P  N F+
Sbjct: 372 HFDGGDLVLPSENYFI 387


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 92/312 (29%), Positives = 133/312 (42%), Gaps = 41/312 (13%)

Query: 110 GTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           GTP  + L+ +D GSD+ WI C  C  C       Y+ +D     + P  SS+ KHLSC 
Sbjct: 145 GTPAKNSLLIIDTGSDVTWIQCKPCSDC-------YSQVDP---IFEPQQSSSYKHLSCL 194

Query: 169 HRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
              C +L T        C Y ++ Y + + S G   ++ L L  G D+        S   
Sbjct: 195 SSACTELTTMNHCRLGGCVYEIN-YGDGSRSQGDFSQETLTL--GSDSF------PSFAF 245

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL-AKAGLIRNSFSMC---FDKDDSGRIF 283
           GCG   + G   G A  GL+GLG   +S PS   +K G     FS C   F    S   F
Sbjct: 246 GCGHTNT-GLFKGSA--GLLGLGRTALSFPSQTKSKYG---GQFSYCLPDFVSSTSTGSF 299

Query: 284 FGDQG--PATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSF-----KAIVDSGSS 335
              QG  PAT      L SN  Y + Y +G+    +G   L            IVDSG+ 
Sbjct: 300 SVGQGSIPATATFVP-LVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTV 358

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFV 395
            T L  + Y+ +   F  +  +  ++        CY  SS    ++P++   F QNN+ V
Sbjct: 359 ITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHF-QNNADV 417

Query: 396 VNNPVFVIYGTQ 407
             + V +++  Q
Sbjct: 418 AVSAVGILFTIQ 429


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 145/387 (37%), Gaps = 72/387 (18%)

Query: 8   IYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSWPAKKSFEYY 62
           + +   + L E + A    FS  LIHR S        SK +     +A      +   + 
Sbjct: 13  VVVGFLFQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFR 72

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
              ++SD        G Q +++ PS G   M+L             IGTP V  +  +D 
Sbjct: 73  PTAMTSD--------GIQSRIV-PSAGEYLMNL------------YIGTPPVPVIAIVDT 111

Query: 123 GSDLLWIPCD-CVRC----APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT 176
           GSDL W  C  C  C     PL              + P  SST +  SC    C  LG 
Sbjct: 112 GSDLTWTQCRPCTHCYKQVVPL--------------FDPKNSSTYRDSSCGTSFCLALGK 157

Query: 177 --SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 234
             SC   K+ C +    Y + + + G L  + L + S    A K         GCG   S
Sbjct: 158 DRSCSKEKK-CTFRYS-YADGSFTGGNLASETLTVDS---TAGKPVSFPGFAFGCG-HSS 211

Query: 235 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGP 289
           GG  D  +  G++GLG GE+S+ S L     I   FS C      D   S RI FG  G 
Sbjct: 212 GGIFDK-SSSGIVGLGGGELSLISQLKST--INGLFSYCLLPVSTDSSISSRINFGASGR 268

Query: 290 ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAA 349
            +   T        Y  Y          S   +      IVDSG+++TFLP+E Y  +  
Sbjct: 269 VSGYGTVSTPLRLPYKGY----------SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEK 318

Query: 350 EFDRQVNDTITSFEGYPWKCCYKSSSQ 376
                +           +  CY ++++
Sbjct: 319 SVANSIKGKRVRDPNGIFSLCYNTTAE 345


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 81/341 (23%), Positives = 137/341 (40%), Gaps = 74/341 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++ GTP  +    +D GS L+W PC     C  C     ++ N     +  + P  SS+S
Sbjct: 87  LNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSEC-----NFPNIKKTGIPTFLPKLSSSS 141

Query: 163 KHLSCSHRLCDL-------------GTSCQNPKQPCP-YTMDYYTENTSSSGLLVEDILH 208
           K + C +  C +              ++ QN  Q CP Y + Y   + S++GLL+ + L 
Sbjct: 142 KLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQY--GSGSTAGLLLSETL- 198

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
                D   K ++    ++GC +           P+G+ G G    S+PS L        
Sbjct: 199 -----DFPNKKTI-PDFLVGCSI------FSIKQPEGIAGFGRSPESLPSQLGLKKFSYC 246

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYIT---------YIIGVETCCIGSS 319
             S  FD   +      D G  +  + +   S+  ++          Y + +    IG +
Sbjct: 247 LVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDT 306

Query: 320 CLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQ-----VNDTITSFE 363
            +K   +K            IVDSG++FTF+   VYE +A EF++Q     V   I +  
Sbjct: 307 HVK-VPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLT 365

Query: 364 GYPWKCCYKSSSQRLPKLPSV--------KLMFPQNNSFVV 396
           G   + CY  S ++   +P +        K+  P +N F +
Sbjct: 366 G--LRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSI 404


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 113/244 (46%), Gaps = 30/244 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C  C                ++ P AS T + + C
Sbjct: 99  IGTPPQRFALIVDTGSTVTYVPCSTCKHCG----------SHQDPKFRPEASETYQPVKC 148

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           + + C+    C + ++ C Y   Y  E ++SSG+L ED+   +S G+ +  +  +A  I 
Sbjct: 149 TWQ-CN----CDDDRKQCTYERRY-AEMSTSSGVLGEDV---VSFGNQSELSPQRA--IF 197

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 287
           GC   ++G   +  A DG++GLG G++S+   L +  +I ++FS+C+     G       
Sbjct: 198 GCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLG 256

Query: 288 GPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLP 340
           G +      F  S+  +   Y I ++   +    L             ++DSG+++ +LP
Sbjct: 257 GISPPADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316

Query: 341 KEVY 344
           +  +
Sbjct: 317 ESAF 320


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 135/314 (42%), Gaps = 40/314 (12%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P++   T+  GN     +   I IGTP     +  D GSDL W      +C P   S Y
Sbjct: 119 LPAKSGITLGSGN-----YIVTIGIGTPKHDLSLVFDTGSDLTW-----TQCEPCLGSCY 168

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
           +  +   N   PS+SST +++SCS  +C+   SC      C Y++  Y + + + G L +
Sbjct: 169 SQKEPKFN---PSSSSTYQNVSCSSPMCEDAESCS--ASNCVYSI-VYGDKSFTQGFLAK 222

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           +   L +       + V   V  GCG    G +      DG+ GL        SL A+  
Sbjct: 223 EKFTLTN-------SDVLEDVYFGCGENNQGLF------DGVAGLLGLGPGKLSLPAQTT 269

Query: 265 LIRNS-FSMC---FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
              N+ FS C   F  + +G + FG  G +     + ++S      Y I +    +G   
Sbjct: 270 TTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKE 329

Query: 321 LKQT--SFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS 374
           L  T  SF    AI+DSG+ FT LP +VY  + + F  +++ +  S  GY  +  CY  +
Sbjct: 330 LAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMS-SYKSTSGYGLFDTCYDFT 388

Query: 375 SQRLPKLPSVKLMF 388
                  P++   F
Sbjct: 389 GLDTVTYPTIAFSF 402


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 127/314 (40%), Gaps = 56/314 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  +  + LD GS+L W+ C   R          +     + + P AS+T   + 
Sbjct: 65  LAVGTPPQNVTMVLDTGSELSWLLCATGR----------AAAAAADSFRPRASATFAAVP 114

Query: 167 CSHRLC---DLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C    C   DL    SC    + C  ++  Y + ++S G L  D+         A+ ++ 
Sbjct: 115 CGSARCSSRDLPAPPSCDAASRRCRVSLS-YADGSASDGALATDVF--------AVGDAP 165

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 280
                 GC         D VA  GL+G+  G +   S + +A   R  FS C  D+DD+G
Sbjct: 166 PLRSAFGCMSAAYDSSPDAVATAGLLGMNRGAL---SFVTQASTRR--FSYCISDRDDAG 220

Query: 281 RIFFG---------DQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLKQTSF 326
            +  G         +  P  Q +        +A + + +   +G +   I  S L     
Sbjct: 221 VLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHT 280

Query: 327 KA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWKCCYKSSSQR 377
            A   +VDSG+ FTFL  + Y  + AEF +Q    + + E         +  C++    R
Sbjct: 281 GAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGR 340

Query: 378 LP---KLPSVKLMF 388
            P   +LP V L+F
Sbjct: 341 PPPSARLPPVTLLF 354


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 75/273 (27%), Positives = 117/273 (42%), Gaps = 39/273 (14%)

Query: 103 HYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASS 160
           HY   + IGTP        D GSDL W    CV C        N   +  N  + P  S+
Sbjct: 24  HYLMEVSIGTPPFKIYGIADTGSDLTWT--SCVPC--------NKCYKQRNPIFDPQKST 73

Query: 161 TSKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGDNAL 217
           + +++SC  +LC  L T   +P++ C YT  Y +    + G+L ++ + L S  G    L
Sbjct: 74  SYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAI-TQGVLAQETITLSSTKGESVPL 132

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--- 274
           K      ++ GCG   +GG+ D     G+IGLG G +S  S +  +      FS C    
Sbjct: 133 KG-----IVFGCGHNNTGGFND--REMGIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPF 184

Query: 275 --DKDDSGRIFFGDQGPATQQ---STSFLASNGK--YITYIIGVETCCI-----GSSCLK 322
             D   S ++  G     + +   ST  +A   K  Y   ++G+          GSS   
Sbjct: 185 HTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQS 244

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
                  +DSG+  T LP ++Y+ + A+   +V
Sbjct: 245 VEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEV 277


>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
          Length = 410

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 67/258 (25%), Positives = 110/258 (42%), Gaps = 24/258 (9%)

Query: 96  GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   H+   ++IG P   + + +D GS L W+ CD  C+ C  +    Y        
Sbjct: 30  GNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKP------ 83

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHL-I 210
           E   +   T +   C+    DL    +  PK  C Y + Y     SS G+L+ D   L  
Sbjct: 84  ELKYAVKCTEQR--CADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPA 139

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RN 268
           S G N        S+  GCG  Q     +   P +G++GLG G++++ S L   G+I ++
Sbjct: 140 SNGTNP------TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKH 193

Query: 269 SFSMCFDKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
               C      G +FFGD + P +  + S +    K+ +   G       S  +     +
Sbjct: 194 VLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPME 253

Query: 328 AIVDSGSSFTFLPKEVYE 345
            I DSG+++T+   + Y 
Sbjct: 254 VIFDSGATYTYFALQPYH 271


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 83/300 (27%), Positives = 127/300 (42%), Gaps = 47/300 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLN-EYSPSASSTSKH 164
           I IG P +  LV +D GSD+LW+ C  C  C           D DL   + PS SST   
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNC-----------DNDLGLLFDPSKSSTFSP 153

Query: 165 LSCSHRLCDL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           L  +   CD  G  C     P P+T+  Y +N+++SG    D +   +  +   + S   
Sbjct: 154 LCKTP--CDFEGCRC----DPIPFTVT-YADNSTASGTFGRDTVVFETTDEGTSRIS--- 203

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----DD 278
            V+ GCG   + G+      +G++GL  G     SL+ K G     FS C         +
Sbjct: 204 DVLFGCG--HNIGHDTDPGHNGILGLNNGP---DSLVTKLG---QKFSYCIGNLADPYYN 255

Query: 279 SGRIFFGDQGPATQQSTSFLASNGKYITYI----IGVETCCIGSSCLKQTSFKA---IVD 331
             ++  G+       ST F   NG Y   +    +G +   I     +    +A   I+D
Sbjct: 256 YHQLILGEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIID 315

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPW-KCCYKSSSQRLPKLPSVKLMF 388
           +GS+ TFL   V++ ++ E    +  +    + E  PW +C Y S S+ L   P V   F
Sbjct: 316 TGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHF 375


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 135/314 (42%), Gaps = 40/314 (12%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P++   T+  GN     +   I IGTP     +  D GSDL W      +C P   S Y
Sbjct: 119 LPAKSGITLGSGN-----YIVTIGIGTPKHDLSLVFDTGSDLTW-----TQCEPCLGSCY 168

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
           +  +   N   PS+SST +++SCS  +C+   SC      C Y++  Y + + + G L +
Sbjct: 169 SQKEPKFN---PSSSSTYQNVSCSSPMCEDAESCS--ASNCVYSIG-YGDKSFTQGFLAK 222

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           +   L +       + V   V  GCG    G +      DG+ GL        SL A+  
Sbjct: 223 EKFTLTN-------SDVLEDVYFGCGENNQGLF------DGVAGLLGLGPGKLSLPAQTT 269

Query: 265 LIRNS-FSMC---FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
              N+ FS C   F  + +G + FG  G +     + ++S      Y I +    +G   
Sbjct: 270 TTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKE 329

Query: 321 LKQT--SFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS 374
           L  T  SF    AI+DSG+ FT LP +VY  + + F  +++ +  S  GY  +  CY  +
Sbjct: 330 LAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMS-SYKSTSGYGLFDTCYDFT 388

Query: 375 SQRLPKLPSVKLMF 388
                  P++   F
Sbjct: 389 GLDTVTYPTIAFSF 402


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 86/322 (26%), Positives = 128/322 (39%), Gaps = 49/322 (15%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
            P++   T+  GN     +   + +GTP     +  D GSDL W  C  CVR        
Sbjct: 120 LPAKDGSTLGSGN-----YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR-------- 166

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSS 198
               D+    ++PS S++  ++SCS   C       G +       C Y +  Y + + S
Sbjct: 167 -TCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ-YGDQSFS 224

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G L +D   L S       + V   V  GCG + + G   GVA  GL+GLG  ++S PS
Sbjct: 225 VGFLAKDKFTLTS-------SDVFDGVYFGCG-ENNQGLFTGVA--GLLGLGRDKLSFPS 274

Query: 259 LLAKAGLIRNSFSMCFDKDDS--GRIFFGDQG----------PATQQSTSFLASNGKYIT 306
             A A      FS C     S  G + FG  G                TSF   N   IT
Sbjct: 275 QTATA--YNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAIT 332

Query: 307 YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
             +G +   I S+        A++DSG+  T LP + Y  + + F  +++   T+     
Sbjct: 333 --VGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSI 388

Query: 367 WKCCYKSSSQRLPKLPSVKLMF 388
              C+  S  +   +P V   F
Sbjct: 389 LDTCFDLSGFKTVTIPKVAFSF 410


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 132/317 (41%), Gaps = 47/317 (14%)

Query: 94  SLGNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G D G L+Y     +GTP V+  + +D GSDL W+ C     AP   S  + L     
Sbjct: 130 SWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPL----- 184

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHL 209
            + P+ SS+   + C   +C  G               Y   Y + ++++G+   D L L
Sbjct: 185 -FDPAQSSSYAAVPCGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRN 268
                 +  ++VQ     GCG  QS G  +GV  DGL+GLG  +   PSL+ + AG    
Sbjct: 243 ------SASSAVQG-FFFGCGHAQS-GLFNGV--DGLLGLGREQ---PSLVEQTAGTYGG 289

Query: 269 SFSMCFDKDDS--GRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            FS C     S  G +  G  GP+       +T  L S      Y++ +    +G   L 
Sbjct: 290 VFSYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349

Query: 323 --QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCY 371
              ++F    +VD+G+  T LP   Y  + + F       + S+ GYP          CY
Sbjct: 350 VPASAFAGGTVVDTGTVITRLPPTAYAALRSAF----RSGMASY-GYPTAPSNGILDTCY 404

Query: 372 KSSSQRLPKLPSVKLMF 388
             +      LP+V L F
Sbjct: 405 NFAGYGTVTLPNVALTF 421


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 134/323 (41%), Gaps = 32/323 (9%)

Query: 95  LGNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE 153
           LG+    L Y   + +GTP V+  V +D GSD+ W+ C+     P  A      D     
Sbjct: 118 LGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFD----- 172

Query: 154 YSPSASSTSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
             P+ SST + +SC+   C      G  C      C Y +  Y + ++++G    D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQ-YGDGSTTNGTYSRDTLTL 229

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
            SG  +A+K         GC   +S G+ D    DGL+GLG G  S+ S  A A    NS
Sbjct: 230 -SGASDAVKG-----FQFGCSHLES-GFSDQT--DGLMGLGGGAQSLVSQTAAA--YGNS 278

Query: 270 FSMCFDKDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSS--CLKQT 324
           FS C         F    G        +T  L S      Y   ++   +G     L  +
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPS 338

Query: 325 SFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            F A  +VDSG+  T LP   Y  +++ F   +    ++        C+  + Q    +P
Sbjct: 339 VFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIP 398

Query: 383 SVKLMFPQNNSFVVNNPVFVIYG 405
           +V L+F    + +  +P  ++YG
Sbjct: 399 TVALVF-SGGAAIDLDPNGIMYG 420


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 127/314 (40%), Gaps = 54/314 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP ++F V  D GSDL+W  C  C +C            +    + P++SST   L
Sbjct: 90  ISVGTPLLTFSVVADTGSDLIWTQCAPCTKC----------FQQPAPPFQPASSSTFSKL 139

Query: 166 SCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            C+   C       N  + C  T    +Y   +  ++G L  + L +   GD +      
Sbjct: 140 PCTSSFCQF---LPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKV---GDASFP---- 189

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG-- 280
            SV  GC  +       G +  G+ GLG G +   SL+ + G+ R  FS C     +   
Sbjct: 190 -SVAFGCSTENG----VGNSTSGIAGLGRGAL---SLIPQLGVGR--FSYCLRSGSAAGA 239

Query: 281 -RIFFGDQGPATQ---QSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK-------- 327
             I FG     T    QST F+ +   + + Y + +    +G + L  T+          
Sbjct: 240 SPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGL 299

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL--P 382
               IVDSG++ T+L K+ YE +   F  Q  D  T         C+KS+      +  P
Sbjct: 300 GGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVP 359

Query: 383 SVKLMFPQNNSFVV 396
           S+ L F     + V
Sbjct: 360 SLVLRFDGGAEYAV 373


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 71/301 (23%), Positives = 124/301 (41%), Gaps = 55/301 (18%)

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSA 141
           ++  S+G   MS+G            IGTP   +   LD GSDL+W  C  C+ C     
Sbjct: 81  LVLASEGEYLMSMG------------IGTPPRYYSAILDTGSDLIWTQCAPCMLC----- 123

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 201
                +D+    + P+ S +   L C+  +C+        +  C Y   +Y ++ +++G+
Sbjct: 124 -----VDQPTPFFDPAQSPSYAKLPCNSPMCNALYYPLCYRNVCVYQY-FYGDSANTAGV 177

Query: 202 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           L  +       G N  + +V   +  GCG   +G   +G    G++G G G +   SL++
Sbjct: 178 LSNETFTF---GTNDTRVTVP-RIAFGCGNLNAGSLFNG---SGMVGFGRGPL---SLVS 227

Query: 262 KAGLIRNSFSMC-FDKDDSGRIFFGDQGPATQ---------QSTSFLASNGKYITYIIGV 311
           + G  R S+ +  F      R++FG                QST F+ + G    Y + +
Sbjct: 228 QLGSPRFSYCLTSFMSPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNM 287

Query: 312 ETCCIGSSCL-----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
               +G   L              +   I+DSGS+ T+L +  Y+ +   F  QV   +T
Sbjct: 288 TGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLT 347

Query: 361 S 361
           +
Sbjct: 348 N 348


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 78/301 (25%), Positives = 130/301 (43%), Gaps = 81/301 (26%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++ GTP+ +F   LD GS L+W+PC     C +C   S         +  ++ P  SS+S
Sbjct: 90  LEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFS---------NTPKFIPKNSSSS 140

Query: 163 KHLSCSHRLC------DLGTSC--------QNPKQPCP-YTMDYYTENTSSSGLLVEDIL 207
           K + C++  C      D+ + C         N  Q CP YT+ Y   +T  +G L+ + L
Sbjct: 141 KFVGCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGST--AGFLLSENL 198

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           +              +  ++GC +      +    P G+ G G GE S+PS   +  L R
Sbjct: 199 N--------FPTKKYSDFLLGCSV------VSVYQPAGIAGFGRGEESLPS---QMNLTR 241

Query: 268 NSFSMCFDK-DDSGRI-----------------------FFGDQGPATQQSTSFLASNGK 303
            S+ +   + DDS  I                       F   + P T+++ +F A    
Sbjct: 242 FSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFL--KNPTTKKNPAFGAY--Y 297

Query: 304 YITY---IIGVETCCIGSSCLKQT---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
           YIT    ++G +   +    L+         IVDSGS+FTF+ + +++ +A EF +QV+ 
Sbjct: 298 YITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSY 357

Query: 358 T 358
           T
Sbjct: 358 T 358


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 119/283 (42%), Gaps = 55/283 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +  GTP+ +    +D GS L+W PC     C RC     S+ N     +  + P  SS++
Sbjct: 110 LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRC-----SFPNIDPAKIPTFIPKLSSSA 164

Query: 163 KHLSCSHRLCDLGTSCQNPK---QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           K + C +  C      +N     + CP     Y   T+   LL+E ++            
Sbjct: 165 KIVGCLNPKCGFVMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLV---------FAE 215

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-DD 278
             +   ++GC +      L    P G+ G G G  S+P    + GL + S+ +   + DD
Sbjct: 216 RTEPDFVVGCSI------LSSRQPSGIAGFGRGPSSLPK---QMGLKKFSYCLLSHRFDD 266

Query: 279 SGR-----IFFG----DQGPATQQSTSF----LASNGKYIT-YIIGVETCCIGSSCLK-Q 323
           S +     ++ G    D        T F    ++SN  +   Y + +    +G   +K  
Sbjct: 267 SPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVP 326

Query: 324 TSFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQVND 357
            SF           IVDSGS+FTF+ K V+E +A EFDRQ+ +
Sbjct: 327 YSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMAN 369


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 132/308 (42%), Gaps = 60/308 (19%)

Query: 118 VALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---- 173
           V +D  S+L W     V+CAP ++ +    D+    + P++S +   L C+   CD    
Sbjct: 140 VIVDTASELTW-----VQCAPCASCH----DQQGPLFDPASSPSYAVLPCNSSSCDALQV 190

Query: 174 ----LGTSCQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 228
                  +C   +QP C YT+ Y  + + S G+L  D L        +L   V    + G
Sbjct: 191 ATGSAAGACGGGEQPSCSYTLSY-RDGSYSQGVLAHDKL--------SLAGEVIDGFVFG 241

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCF---DKDDSGRIFF 284
           CG    G +       GL+GLG  ++S+ S  + + G +   FS C    + + SG +  
Sbjct: 242 CGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV---FSYCLPLKESESSGSLVL 295

Query: 285 GDQGPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           GD     + ST  + +        G +  Y + +    IG   ++ ++ K IVDSG+  T
Sbjct: 296 GDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGITIGGQEVESSAGKVIVDSGTIIT 353

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQ 390
            L   VY  + AEF       ++ F  YP          C+  +  R  ++PS+K +F  
Sbjct: 354 SLVPSVYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEG 406

Query: 391 NNSFVVNN 398
           N    V++
Sbjct: 407 NVEVEVDS 414


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 146/356 (41%), Gaps = 63/356 (17%)

Query: 80  QFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPL 139
           Q QM   SQ S  +S  ++        + +G+P     + LD GS+L W+ C   + +P 
Sbjct: 19  QTQMGLISQPSNKLSFHHNVTL--TVSLTVGSPPQQVTMVLDTGSELSWLHC---KKSPN 73

Query: 140 SASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSS 198
             S +N L    + YSP   S+     C  R  DL      +PK+ C + +  Y + +S 
Sbjct: 74  LTSVFNPLSS--SSYSPIPCSSP---VCRTRTRDLPNPVTCDPKKLC-HAIVSYADASSL 127

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD----GLIGLGLGEI 254
            G L  D   +   G +AL  +     + GC      G+      D    GL+G+  G +
Sbjct: 128 EGNLASDNFRI---GSSALPGT-----LFGC---MDSGFSSNSEEDAKTTGLMGMNRGSL 176

Query: 255 SVPSLLAKAGLIRNSFSMCFD-KDDSGRIFFGDQG----------PATQQSTSFLASNGK 303
           S    + + GL +  FS C   +D SG + FGD            P  Q ST     +  
Sbjct: 177 S---FVTQLGLPK--FSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFD-- 229

Query: 304 YITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR 353
            + Y + ++   +G+  L             + + +VDSG+ FTFL   VY  +  EF  
Sbjct: 230 RVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLE 289

Query: 354 QVNDTITS-------FEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 402
           Q    +         F+G    C    +  +LP+LP+V LMF +    VV   V +
Sbjct: 290 QTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMF-RGAEMVVGGEVLL 344


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 132/308 (42%), Gaps = 60/308 (19%)

Query: 118 VALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD---- 173
           V +D  S+L W     V+CAP ++ +    D+    + P++S +   L C+   CD    
Sbjct: 139 VIVDTASELTW-----VQCAPCASCH----DQQGPLFDPASSPSYAVLPCNSSSCDALQV 189

Query: 174 ----LGTSCQNPKQP-CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 228
                  +C   +QP C YT+ Y  + + S G+L  D L        +L   V    + G
Sbjct: 190 ATGSAAGACGGGEQPSCSYTLSY-RDGSYSQGVLAHDKL--------SLAGEVIDGFVFG 240

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCF---DKDDSGRIFF 284
           CG    G +       GL+GLG  ++S+ S  + + G +   FS C    + + SG +  
Sbjct: 241 CGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV---FSYCLPLKESESSGSLVL 294

Query: 285 GDQGPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFT 337
           GD     + ST  + +        G +  Y + +    IG   ++ ++ K IVDSG+  T
Sbjct: 295 GDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTGITIGGQEVESSAGKVIVDSGTIIT 352

Query: 338 FLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQ 390
            L   VY  + AEF       ++ F  YP          C+  +  R  ++PS+K +F  
Sbjct: 353 SLVPSVYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEG 405

Query: 391 NNSFVVNN 398
           N    V++
Sbjct: 406 NVEVEVDS 413


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/313 (27%), Positives = 130/313 (41%), Gaps = 45/313 (14%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 170 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVVCYEQREK--- 221

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P+ SST  ++SC+   C DL T  C      C Y +  Y + + S G    D L L 
Sbjct: 222 LFDPARSSTYANISCAAPACSDLDTRGCSGGN--CLYGVQ-YGDGSYSIGFFAMDTLTLS 278

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 279 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 325

Query: 270 FSMCFDKDDSGRIF--FGDQGPA---TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ- 323
           F+ C     SG  +  FG   PA    + +T  L  NG    Y +G+    +G   L   
Sbjct: 326 FAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIP 384

Query: 324 ----TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSS 375
               T+   IVDSG+  T LP   Y ++ + F   +      ++  P       CY  + 
Sbjct: 385 QSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAM--AARGYKKAPAVSLLDTCYDFTG 442

Query: 376 QRLPKLPSVKLMF 388
                +P+V L+F
Sbjct: 443 MSQVAIPTVSLLF 455


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 135/342 (39%), Gaps = 62/342 (18%)

Query: 93  MSLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD-RD 150
           M    D+G   Y+    +GTP+  F++  D GSDL W+ C    C   + S   +   R 
Sbjct: 72  MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRH 130

Query: 151 LNEYSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLL 202
              +  + SS+ K + C   +C +        T+C  P  PC Y  DY Y++ +++ G  
Sbjct: 131 KRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFF 188

Query: 203 VED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
             +   + L  G    L N     V+IGC     G      A DG++GLG  + S    +
Sbjct: 189 ANETVTVELKEGRKMKLHN-----VLIGCSESFQGQSFQ--AADGVMGLGYSKYSFA--I 239

Query: 261 AKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVET-- 313
             A      FS C       K+ S  + FG     + +S   L +N  Y   ++G+    
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFG-----SSRSKEALLNNMTYTELVLGMVNSF 294

Query: 314 -------CCIGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFD------ 352
                    IG + LK        + +   I+DSGSS TFL +  Y+ + A         
Sbjct: 295 YAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKF 354

Query: 353 RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 394
           R+V   I      P + C+ S+      +P +   F     F
Sbjct: 355 RKVEMDIG-----PLEYCFNSTGFEESLVPRLVFHFADGAEF 391


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 131/313 (41%), Gaps = 50/313 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +GTP     + +D GSD+ W+ C  C  C     + +N          PS+SS+
Sbjct: 16  YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFN----------PSSSSS 65

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL- 217
            K L CS  LC   D+   C + K  C Y  D Y + + + G LV D + L    D+A  
Sbjct: 66  FKVLDCSSSLCLNLDV-MGCLSNK--CLYQAD-YGDGSFTMGELVTDNVVL----DDAFG 117

Query: 218 -KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
               V  ++ +GCG    G +  G A  G++GLG G +S P+ L  +   RN FS C   
Sbjct: 118 PGQVVLTNIPLGCGHDNEGTF--GTAA-GILGLGRGPLSFPNNLDAS--TRNIFSYCLPD 172

Query: 275 ---DKDDSGRIFFGDQG-PATQQ-STSFLAS--NGKYIT-YIIGVETCCIGSSCLKQ--- 323
              D +    + FGD   P T   S  F+    N +  T Y + +    +G + L     
Sbjct: 173 RESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPA 232

Query: 324 TSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
           + F+         I DSG++ T L    Y  +   F        ++ +   +  CY  + 
Sbjct: 233 SVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTG 292

Query: 376 QRLPKLPSVKLMF 388
                +P+V   F
Sbjct: 293 MNSISVPTVTFHF 305


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 131/315 (41%), Gaps = 34/315 (10%)

Query: 87  SQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
           S  S  ++ G   G  +Y T + +GTP   +++ +D GS L W+     +C+P   S + 
Sbjct: 100 SLASVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWL-----QCSPCRVSCHR 154

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNPKQPCPYTM----DYYTENTSSSG 200
              +    + P  SS+   +SCS   CD L T+  NP    P  +      Y +++ S G
Sbjct: 155 ---QSGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVG 211

Query: 201 LLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
            L +D    +S G N++ N        GCG    G +       GL+GL   ++S+  L 
Sbjct: 212 YLSKDT---VSFGANSVPN-----FYYGCGQDNEGLFGRSA---GLMGLARNKLSL--LY 258

Query: 261 AKAGLIRNSFSMCF-DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
             A  +  SFS C      SG +  G   P     T  +++      Y I +    +   
Sbjct: 259 QLAPTLGYSFSYCLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGK 318

Query: 320 CL-----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKS 373
            L     + TS   I+DSG+  T LP  VY  ++      +  +      Y     C++ 
Sbjct: 319 PLAVSSSEYTSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEG 378

Query: 374 SSQRLPKLPSVKLMF 388
            + +L  +P+V + F
Sbjct: 379 QASKLRAVPAVSMAF 393


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 126/310 (40%), Gaps = 52/310 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP V  L+ALD  SDL W+ C  C RC P S   ++          P  S++ + +
Sbjct: 142 IAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFD----------PRHSTSYREM 191

Query: 166 SCSHRLCD-LGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           S +   C  LG S      +  C YT+  Y + +++ G  +E+ L   +GG    + S  
Sbjct: 192 SFNAADCQALGRSGGGDAKRGTCVYTVG-YGDGSTTVGDFIEETLTF-AGGVRLPRIS-- 247

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG-- 280
               IGCG    G  L G    G++GLG G +S P+ +   G    +FS C     SG  
Sbjct: 248 ----IGCGHDNKG--LFGAPAAGILGLGRGLMSFPNQIDHNG----TFSYCLVDFLSGPG 297

Query: 281 ----RIFFG----DQGPATQQSTSFLASNGKYITYII-------GVETCCIGSSCLKQTS 325
                + FG    D  P    + + L  N     Y+        GV    +    L+   
Sbjct: 298 SLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDP 357

Query: 326 FKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSSQRL 378
           +      IVDSG++ T L +  Y      F     D      G P   +  CY    + +
Sbjct: 358 YTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGM 417

Query: 379 PKLPSVKLMF 388
            K+P+V + F
Sbjct: 418 KKVPTVSMHF 427


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 128/314 (40%), Gaps = 49/314 (15%)

Query: 96  GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEY 154
           G + G L+Y   + +GTP V+  + +D GSDL W     V+C P +A    S    L  +
Sbjct: 132 GFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSW-----VQCTPCAAPACYSQKDPL--F 184

Query: 155 SPSASSTSKHLSCSHRLC-DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            P+ SS+   + C   +C  LG   +SC   +  C Y +  Y + + ++G+   D L L 
Sbjct: 185 DPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQ--CGYVVS-YGDGSKTTGVYSSDTLTLS 241

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
                   N        GCG  QSG   +    DGL+GLG  E S+  +   AG     F
Sbjct: 242 -------PNDAVRGFFFGCGHAQSGFTGN----DGLLGLGREEASL--VEQTAGTYGGVF 288

Query: 271 SMCFDKDDSGRIFFGDQGPATQ-----QSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
           S C     S   +    GP+        +T  L+S      Y++ +    +G   L   S
Sbjct: 289 SYCLPTRPSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPS 348

Query: 326 ----FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSS 374
                  +VD+G+  T LP   Y  + + F       + S+ GYP          CY  S
Sbjct: 349 SVFAGGTVVDTGTVITRLPPTAYAALRSAF----RSGMASY-GYPSAPATGILDTCYNFS 403

Query: 375 SQRLPKLPSVKLMF 388
                 LP+V L F
Sbjct: 404 GYGTVTLPNVALTF 417


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 75/299 (25%), Positives = 121/299 (40%), Gaps = 46/299 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +G+P     + +D+GSD++W+ C  C +C       Y   D     + P+ASS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 218
              +SC   +C   +             DY   Y + + + G L  + L L   G  A++
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQ 236

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
                 V IGCG + SG +   V   GL+GLG G +S+   L   G     FS C     
Sbjct: 237 G-----VAIGCGHRNSGLF---VGAAGLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286

Query: 279 SGRIFFGDQGPATQQSTSFLASNGKYITYI---IGVETCCIGSSCLKQTSFKA---IVDS 332
           +G                 LAS+  Y+      +G E   +  S  + T   A   ++D+
Sbjct: 287 AG-------------GAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDT 333

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 391
           G++ T LP+E Y  +   FD  +     S        CY  S     ++P+V   F Q 
Sbjct: 334 GTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQG 392


>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
 gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
          Length = 490

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 81/338 (23%), Positives = 143/338 (42%), Gaps = 44/338 (13%)

Query: 89  GSKTMSLGNDFGWLHY--TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           GS  M L +D     Y  + + IGTP   F + +D  S   ++    + C     S++  
Sbjct: 19  GSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDRSS---FVSPKTMFC-----SFFFL 70

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
            D     +SP+ SS+ K L C +  C  G  C   ++        Y E ++SSG+L +D+
Sbjct: 71  QD---PRFSPALSSSYKPLECGNE-CSTGF-CDGSRK----YQRQYAEKSTSSGVLGKDV 121

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +   +  D   +      ++ GC   ++G   D  A DG+IGLG G +S+   L +   +
Sbjct: 122 ISFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAM 175

Query: 267 RNSFSMCFDKDDSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK- 322
            + FS+C+   D G    I  G Q P     TS       Y  Y + ++   +G S L+ 
Sbjct: 176 EDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY--YNLMLKGIRVGGSPLRL 233

Query: 323 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK---CCYKSS 374
                   +  ++DSG+++ + P   ++   +    QV  ++    G   K    CY  +
Sbjct: 234 KPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVG-SLKEVPGPDEKFKDICYAGA 292

Query: 375 SQRLPKL----PSVKLMFPQNNSFVVNNPVFVIYGTQV 408
              +  L    PSV  +F    S  ++   ++   T++
Sbjct: 293 GTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKI 330


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 125/302 (41%), Gaps = 51/302 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLN-EYSPSASSTSKH 164
           + IG P++  LV +D GSD+LWI C+ C  C           D  L   + PS SST   
Sbjct: 105 LSIGQPSIPQLVVMDTGSDILWIMCNPCTNC-----------DNHLGLLFDPSMSSTFSP 153

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           L C       G  C     P P+T+  Y +N+S+SG    DIL   +  +     S  + 
Sbjct: 154 L-CKTPCGFKGCKC----DPIPFTIS-YVDNSSASGTFGRDILVFETTDEGT---SQISD 204

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----DDS 279
           VIIGCG   + G+      +G++GL  G    P+ LA    I   FS C         + 
Sbjct: 205 VIIGCG--HNIGFNSDPGYNGILGLNNG----PNSLATQ--IGRKFSYCIGNLADPYYNY 256

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAI 329
            ++  G+       ST F   +G Y   + G+    +G   L          +  +   I
Sbjct: 257 NQLRLGEGADLEGYSTPFEVYHGFYYVTMEGIS---VGEKRLDIALETFEMKRNGTGGVI 313

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITS--FEGYPWKCCYKS-SSQRLPKLPSVKL 386
           +DSG++ T+L    ++ +  E    +  +     FE  PWK CY    S+ L   P V  
Sbjct: 314 LDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTF 373

Query: 387 MF 388
            F
Sbjct: 374 HF 375


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 82/308 (26%), Positives = 130/308 (42%), Gaps = 67/308 (21%)

Query: 118 VALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--- 174
           V +D  S+L W     V+CAP  + +    D+    + PS+S +   + C+   CD    
Sbjct: 166 VIVDTASELTW-----VQCAPCESCH----DQQDPLFDPSSSPSYAAVPCNSSSCDALQL 216

Query: 175 ---GTS-----CQNPKQ---PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
              GTS     CQ   Q    C YT+ Y  + + S G+L  D L        +L   V  
Sbjct: 217 ATGGTSGGAAACQGQDQSAAACSYTLSY-RDGSYSRGVLAHDRL--------SLAGEVID 267

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCF---DKDDS 279
             + GCG    G    G +  GL+GLG  ++S V   + + G +   FS C    + D S
Sbjct: 268 GFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTMDQFGGV---FSYCLPLKESDSS 322

Query: 280 GRIFFGDQGPATQQST-----SFLASNGKYITYIIGVETCCIGSSCLKQTSF-------K 327
           G +  GD     + ST     S ++   +   Y + +    +G   ++ + F       K
Sbjct: 323 GSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGK 382

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPK 380
           AI+DSG+  T L   +Y  + AEF       ++ F  YP          C+  +  R  +
Sbjct: 383 AIIDSGTVITSLVPSIYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNMTGLREVQ 435

Query: 381 LPSVKLMF 388
           +PS+KL+F
Sbjct: 436 VPSLKLVF 443


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 129/315 (40%), Gaps = 60/315 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP + F V +D GS+L+W  C  C RC P                 P+ SST   L
Sbjct: 95  ISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTP--------APVLQPARSSTFSRL 146

Query: 166 SCSHRLCD-LGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
            C+   C  L TS +    N    C Y   Y +  T  +G L  + L +   GD      
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTV---GDGTFPK- 200

Query: 221 VQASVIIGCGMKQSGGYLDGV-APDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
               V  GC  +      +GV    G++GLG G +S+ S LA     R S+ +  D  D 
Sbjct: 201 ----VAFGCSTE------NGVDNSSGIVGLGRGPLSLVSQLAVG---RFSYCLRSDMADG 247

Query: 280 GR--IFFGDQGPATQ----QST-----SFLASNGKYITYIIGV-----ETCCIGSS-CLK 322
           G   I FG     T+    QST      +L  +  Y   + G+     E    GS+    
Sbjct: 248 GASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFT 307

Query: 323 QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVND----TITSFEGYPWKCCYKSSS- 375
           QT      IVDSG++ T+L K+ Y  +   F  Q+ +    T  S   Y    CYK S+ 
Sbjct: 308 QTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAG 367

Query: 376 --QRLPKLPSVKLMF 388
              +  ++P + L F
Sbjct: 368 GGGKAVRVPRLALRF 382


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 80/304 (26%), Positives = 129/304 (42%), Gaps = 58/304 (19%)

Query: 99  FGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPS 157
           FG LH+T  + IGTP     + LD GSDL+W  C           +     R+   Y P+
Sbjct: 84  FGRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKL---------FDTRQHREKPLYDPA 134

Query: 158 ASSTSKHLSCSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
            SS+     C  RLC+ G+    +C   K  C YT +Y +  T   G L  +       G
Sbjct: 135 KSSSFAAAPCDGRLCETGSFNTKNCSRNK--CIYTYNYGSATT--KGELASETFTF---G 187

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
           ++     V  S+  GCG K + G L G +  G++G+    +   SL+++  + R S+ + 
Sbjct: 188 EH---RRVSVSLDFGCG-KLTSGSLPGAS--GILGISPDRL---SLVSQLQIPRFSYCLT 238

Query: 274 --FDKDDSGRIFFGDQGPATQ-------QSTSFL----ASNGKYITYIIGVETCCIGSSC 320
              D++ +  IFFG     ++       Q+TS +     SN  Y   +IG+    +G+  
Sbjct: 239 PFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGIS---VGTKR 295

Query: 321 L----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF--EGYPWK 368
           L          +  S    VDSG +   LP  V E +       V   + +    GY ++
Sbjct: 296 LNVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYE 355

Query: 369 CCYK 372
            C++
Sbjct: 356 LCFQ 359


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 79/289 (27%), Positives = 118/289 (40%), Gaps = 69/289 (23%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++IGTP  +  V +D GSDL W+PC     DC+ C  L +   N+L +  + +SP  SS+
Sbjct: 15  LNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKS---NNL-KSSSIFSPLHSSS 70

Query: 162 SKHLSCSHRLCDLGTSCQNP-------------------KQPCPYTMDYYTENTSSSGLL 202
           S   SC+   C    S  NP                    +PCP     Y E    SG+L
Sbjct: 71  SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
             DIL          +         GC    +  Y +   P G+ G G G +S+PS L  
Sbjct: 131 TRDILK--------ARTRDVPRFSFGC---VTSTYHE---PIGIAGFGRGLLSLPSQL-- 174

Query: 263 AGLIRNSFSMCF-------DKDDSGRIFFGDQGPATQ-----QSTSFLASNGKYITYIIG 310
            G +   FS CF       + + S  +  G    +       Q T  L +     +Y IG
Sbjct: 175 -GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIG 233

Query: 311 VETCCIGSS--------CLKQTSFKA----IVDSGSSFTFLPKEVYETI 347
           +E+  IG++         L+Q   +     +VDSG+++T LP   Y  +
Sbjct: 234 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQL 282


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 134/342 (39%), Gaps = 62/342 (18%)

Query: 93  MSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD-RD 150
           M    D+G   Y     +GTP+  F++  D GSDL W+ C    C   + S   +   R 
Sbjct: 72  MHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRH 130

Query: 151 LNEYSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLL 202
              +  + SS+ K + C   +C +        T+C  P  PC Y  DY Y++ +++ G  
Sbjct: 131 KRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFF 188

Query: 203 VED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
             +   + L  G    L N     V+IGC     G      A DG++GLG  + S    +
Sbjct: 189 ANETVTVELKEGRKMKLHN-----VLIGCSESFQGQSFQ--AADGVMGLGYSKYSFA--I 239

Query: 261 AKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVET-- 313
             A      FS C       K+ S  + FG     + +S   L +N  Y   ++G+    
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFG-----SSRSKEALLNNMTYTELVLGMVNSF 294

Query: 314 -------CCIGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFD------ 352
                    IG + LK        + +   I+DSGSS TFL +  Y+ + A         
Sbjct: 295 YAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKF 354

Query: 353 RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 394
           R+V   I      P + C+ S+      +P +   F     F
Sbjct: 355 RKVEMDIG-----PLEYCFNSTGFEESLVPRLVFHFADGAEF 391


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 123/308 (39%), Gaps = 42/308 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +G+P     + +D+GSD++W+ C  C +C       Y   D     + P+ASS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 218
              +SC   +C   +             DY   Y + + + G L  + L L   G  A++
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQ 236

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---D 275
                 V IGCG + SG +   V   GL+GLG G +S+   L   G     FS C     
Sbjct: 237 G-----VAIGCGHRNSGLF---VGAAGLLGLGWGAMSLVGQL--GGAAGGVFSYCLASRG 286

Query: 276 KDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQ 323
              +G +  G  +  P        + +N     Y +G+    +G   L          + 
Sbjct: 287 AGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTED 346

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
            +   ++D+G++ T LP+E Y  +   FD  +     S        CY  S     ++P+
Sbjct: 347 GAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPT 406

Query: 384 VKLMFPQN 391
           V   F Q 
Sbjct: 407 VSFYFDQG 414


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 139/335 (41%), Gaps = 64/335 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG P V +   +D GSDL+W  C  C  C           D+    + P  SS+   +
Sbjct: 111 LSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKV 160

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            CS  LC+    ++C   K  C Y +  Y + +S+ GLL  +            +NS+ +
Sbjct: 161 GCSSGLCNALPRSNCNEDKDACEY-LYTYGDYSSTRGLLATETFTFED------ENSI-S 212

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDD 278
            +  GCG++  G G+  G    GL+GLG G +S+ S L +       FS C     D + 
Sbjct: 213 GIGFGCGVENEGDGFSQG---SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEA 264

Query: 279 SGRIFFGDQGPATQQST------------SFLASNGKYITYIIGVETCCIGSSCL--KQT 324
           S  +F G         T            S L +  +   Y + ++   +G+  L  +++
Sbjct: 265 SSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKS 324

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---- 372
           +F+         I+DSG++ T+L +  ++ +  EF  +++  +          C+K    
Sbjct: 325 TFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDA 384

Query: 373 SSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVI 403
           + +  +PK+        L  P  N  V ++   V+
Sbjct: 385 AKNIAVPKMIFHFKGADLELPGENYMVADSSTGVL 419


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 124/289 (42%), Gaps = 44/289 (15%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSL 147
            ++ + L +D  +L    + IGTP   +   LD GSDL+W  C  C+ C          +
Sbjct: 78  AARILVLASDGEYLM--EMGIGTPTRYYSAILDTGSDLIWTQCAPCLLC----------V 125

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           D+    + P+ S+T + L C+   C+        ++ C Y   +Y ++ S++G+L  +  
Sbjct: 126 DQPTPYFDPARSATYRSLGCASPACNALYYPLCYQKVCVYQY-FYGDSASTAGVLANETF 184

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
                G N  + S+   +  GCG   +G   +G    G++G G G +   SL+++ G  R
Sbjct: 185 TF---GTNETRVSLPG-ISFGCGNLNAGLLANG---SGMVGFGRGSL---SLVSQLGSPR 234

Query: 268 NSFSMC-FDKDDSGRIFFG--------DQGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
            S+ +  F      R++FG        +      QST F+ +      Y + +    +G 
Sbjct: 235 FSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG 294

Query: 319 SCLK-----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
             L              +   I+DSG++ T+L +  Y+ + A F  Q+ 
Sbjct: 295 YLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQIT 343


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 135/315 (42%), Gaps = 53/315 (16%)

Query: 118 VALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--- 174
           V +D  S+L W     V+C P  A +    D+    + PS+S +   + C+   CD    
Sbjct: 126 VIVDTASELTW-----VQCEPCDACH----DQQEPLFDPSSSPSYAAVPCNSSSCDALRV 176

Query: 175 -----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGC 229
                G +C +    C YT+ Y  + + S G+L  D L L +G D      +Q   + GC
Sbjct: 177 ATGMSGQACDDQPAACSYTLSY-RDGSYSRGVLAHDRLSL-AGED------IQG-FVFGC 227

Query: 230 GMKQSGGYLDGVAPDGLIGLGLGEISVPS-LLAKAGLIRNSFSMCF---DKDDSGRIFFG 285
           G    G +       GL+GLG  ++S+ S  + + G +   FS C    +   SG +  G
Sbjct: 228 GTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQFGGV---FSYCLPPKESGSSGSLVLG 281

Query: 286 DQGPATQQSTSFLAS-------NGKYITYIIGVETCCIGSSCLKQTSF------KAIVDS 332
           D     + ST  + +        G +  Y+  +    +G   ++   F      KAIVDS
Sbjct: 282 DDASVYRNSTPIVYTAMVSDPLQGPF--YLANLTGITVGGEDVQSPGFSAGGGGKAIVDS 339

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 392
           G+  T L   VY  + AEF  Q+ +   +        C+  +  R  ++PS+KL+F    
Sbjct: 340 GTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSLKLVFDGGA 399

Query: 393 SFVVNNP--VFVIYG 405
              V++   ++V+ G
Sbjct: 400 EVEVDSKGVLYVVTG 414


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 133/320 (41%), Gaps = 40/320 (12%)

Query: 83  MLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
           +  P++    +  GN     ++  + +GTP     +  D GSDL W      +C P + S
Sbjct: 130 VTLPAKSGSLIGSGN-----YFVVVGLGTPKRDLSLIFDTGSDLTW-----TQCEPCARS 179

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS------CQNPKQPCPYTMDYYTEN 195
            Y   D   +   PS S++  +++C+  LC  L T+      C    + C Y +  Y ++
Sbjct: 180 CYKQQDAIFD---PSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQ-YGDS 235

Query: 196 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 255
           + S G    + L + +         +  + + GCG + + G   G A  GLIGLG   IS
Sbjct: 236 SFSVGYFSRERLSVTA-------TDIVDNFLFGCG-QNNQGLFGGSA--GLIGLGRHPIS 285

Query: 256 VPSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVET 313
              +   A + R  FS C     S  GR+ FG    +  + T F   +     Y + +  
Sbjct: 286 F--VQQTAAVYRKIFSYCLPATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITG 343

Query: 314 CCIGSSCLKQTSFK-----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 368
             +G + L  +S       AI+DSG+  T LP   Y  + + F + ++   ++ E     
Sbjct: 344 ISVGGAKLPVSSSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILD 403

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            CY  S   +  +P +   F
Sbjct: 404 TCYDLSGYEVFSIPKIDFSF 423


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 94/325 (28%), Positives = 135/325 (41%), Gaps = 58/325 (17%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P++   +++ GN     +   I +GTP   F V  D GSD  W     V+C P  A  Y
Sbjct: 152 LPAKSGLSLNTGN-----YVVPIRLGTPAARFTVVFDTGSDTTW-----VQCQPCVAYCY 201

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLL 202
              +     ++P+ S+T  ++SC+   C DL T  C      C Y +  Y + + + G  
Sbjct: 202 QQKE---PLFTPTKSATYANISCTSSYCSDLDTRGCSGGH--CLYAVQ-YGDGSYTVGFY 255

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
            +D L L   G + +K+        GCG K  G  L G A  GL+GLG G+ SVP  +  
Sbjct: 256 AQDTLTL---GYDTVKD-----FRFGCGEKNRG--LFGKAA-GLMGLGRGKTSVP--VQA 302

Query: 263 AGLIRNSFSMCFDKDDSGRIFF----GDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
                  F+ C     SG  F     G    A  + T  L  NG    Y +G+    +G 
Sbjct: 303 YDKYSGVFAYCIPATSSGTGFLDFGPGAPAAANARLTPMLVDNGPTF-YYVGMTGIKVGG 361

Query: 319 SCLK--QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK----- 368
             L    T F    A+VDSG+  T LP   YE + + F +         EG  +K     
Sbjct: 362 HLLSIPATVFSDAGALVDSGTVITRLPPSAYEPLRSAFAK-------GMEGLGYKTAPAF 414

Query: 369 ----CCYK-SSSQRLPKLPSVKLMF 388
                CY  +  Q    LP+V L+F
Sbjct: 415 SILDTCYDLTGYQGSIALPAVSLVF 439


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 123/308 (39%), Gaps = 42/308 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +G+P     + +D+GSD++W+ C  C +C       Y   D     + P+ASS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQC-------YAQTD---PLFDPAASSS 179

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 218
              +SC   +C   +             DY   Y + + + G L  + L L   G  A++
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL---GGTAVQ 236

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---D 275
                 V IGCG + SG +   V   GL+GLG G +S+   L   G     FS C     
Sbjct: 237 G-----VAIGCGHRNSGLF---VGAAGLLGLGWGAMSLIGQL--GGAAGGVFSYCLASRG 286

Query: 276 KDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQ 323
              +G +  G  +  P        + +N     Y +G+    +G   L          + 
Sbjct: 287 AGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTED 346

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
            +   ++D+G++ T LP+E Y  +   FD  +     S        CY  S     ++P+
Sbjct: 347 GAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPT 406

Query: 384 VKLMFPQN 391
           V   F Q 
Sbjct: 407 VSFYFDQG 414


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 139/335 (41%), Gaps = 64/335 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG P V +   +D GSDL+W  C  C  C           D+    + P  SS+   +
Sbjct: 3   LSIGNPAVKYSAIVDTGSDLIWTQCKPCTEC----------FDQPTPIFDPEKSSSYSKV 52

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
            CS  LC+    ++C   K  C Y +  Y + +S+ GLL  +            +NS+ +
Sbjct: 53  GCSSGLCNALPRSNCNEDKDACEY-LYTYGDYSSTRGLLATETFTFED------ENSI-S 104

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDD 278
            +  GCG++  G G+  G    GL+GLG G +S+ S L +       FS C     D + 
Sbjct: 105 GIGFGCGVENEGDGFSQG---SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEA 156

Query: 279 SGRIFFGDQGPATQQST------------SFLASNGKYITYIIGVETCCIGSSCL--KQT 324
           S  +F G         T            S L +  +   Y + ++   +G+  L  +++
Sbjct: 157 SSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKS 216

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK---- 372
           +F+         I+DSG++ T+L +  ++ +  EF  +++  +          C+K    
Sbjct: 217 TFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDA 276

Query: 373 SSSQRLPKL----PSVKLMFPQNNSFVVNNPVFVI 403
           + +  +PK+        L  P  N  V ++   V+
Sbjct: 277 AKNIAVPKMIFHFKGADLELPGENYMVADSSTGVL 311


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 89/336 (26%), Positives = 137/336 (40%), Gaps = 67/336 (19%)

Query: 54  PAKKSFEYYQVLLSSDVQKQKM----KTGPQFQMLFPSQGSKT-MSLGNDFGWLHY-TWI 107
           P K   +  + LL SD  +++M    + G + +    S  ++  +  G D G   Y   I
Sbjct: 64  PPKSRLDGTRQLLQSDNARRQMISSLRHGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSI 123

Query: 108 DIGTPN-VSFLVALDAGSDLLWIPCD-----CVRCAPLSASYYNSLDRDLNEYSPSASST 161
            IGTP    F++  D GSDL W+ C+     C +  P     + + D          SS+
Sbjct: 124 RIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRAND----------SSS 173

Query: 162 SKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGG 213
            + + CS   C +        T C NP  PC +  DY Y     + G+   +    ++ G
Sbjct: 174 FRTIPCSSDDCKIELQDYFSLTECPNPNAPCLF--DYRYLNGPRAIGVFANET---VTVG 228

Query: 214 DNALKNSVQASVIIGC--GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
            N  K      V+IGC     ++ G+     PDG++GLG  + S+   LA+  +  N FS
Sbjct: 229 LNDHKKIRLFDVLIGCTESFNETNGF-----PDGVMGLGYRKHSLALRLAE--IFGNKFS 281

Query: 272 MCF-----DKDDSGRIFFGD----QGPATQQSTSFLASNGKYIT--YIIGVETCCIGSSC 320
            C        +    + FGD    + P  Q +   L     YI   Y + V    +G S 
Sbjct: 282 YCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLG----YINAFYPVNVSGISVGGSM 337

Query: 321 LKQTS--------FKAIVDSGSSFTFLPKEVYETIA 348
           L  +S           IVDSG+S T L  E Y+ + 
Sbjct: 338 LSISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVV 373


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 137/319 (42%), Gaps = 39/319 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C  C    A +          + P  SS+ + +SC
Sbjct: 105 IGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDP-------RFKPDNSSSYQTVSC 157

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-VI 226
           +   C +   C      C Y    Y E +SS G+L +D+L   +G      + +Q   ++
Sbjct: 158 NSPDC-ITKMCDARVHQCKYER-VYAEMSSSKGVLGKDLLGFGNG------SRLQPHPLL 209

Query: 227 IGCGMKQSGG-YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIF 283
            GC   ++G  YL     DG++GLG G +S+   L   G + +SFS+C+   D   G + 
Sbjct: 210 FGCETAETGDLYLQHA--DGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMV 267

Query: 284 FGDQGPATQQSTSFLASNGKYITYI------IGVETCCIG-SSCLKQTSFKAIVDSGSSF 336
            G   P    +  F  S+     Y       I V+   +   S +       ++DSG+++
Sbjct: 268 LGAIPPPP--AMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTTY 325

Query: 337 TFLPKEVYETIAAEFDRQVNDTITSFEG----YPWKC--CYKSSSQRLPK-LPSVKLMFP 389
            +LP + ++       +Q+  ++ +  G    YP  C     S S+ L K  P V  +F 
Sbjct: 326 AYLPDKAFDAFKDAITQQLG-SLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFS 384

Query: 390 QNNSFVVNNPVFVIYGTQV 408
            N    +    ++   T+V
Sbjct: 385 GNQKVFLAPENYLFKHTKV 403


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 90/327 (27%), Positives = 140/327 (42%), Gaps = 50/327 (15%)

Query: 86  PSQGSKTMSL----GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           PS  SK +SL    G   G  +Y   + +GTP    LV  D GSDL W     V+C P  
Sbjct: 116 PSSASKGVSLPARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSW-----VQCKPCD 170

Query: 141 ASY--YNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTEN 195
             Y  ++ L      + PS S+T   + C  + C   D G SC + K  C Y +  Y + 
Sbjct: 171 GCYQQHDPL------FDPSQSTTYSAVPCGAQECRRLDSG-SCSSGK--CRYEV-VYGDM 220

Query: 196 TSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 255
           + + G L  D L L     ++  + +Q   + GCG   +G  L G A DGL GLG   +S
Sbjct: 221 SQTDGNLARDTLTLGPSSSSSSSDQLQ-EFVFGCGDDDTG--LFGKA-DGLFGLGRDRVS 276

Query: 256 VPS-LLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGK---YITYII 309
           + S   AK G     FS C     +  G +  G   P   + T+ +  +     Y   ++
Sbjct: 277 LASQAAAKYGA---GFSYCLPSSSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLV 333

Query: 310 GVE----TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
           G++    T  +  +  +      ++DSG+  T LP   Y  + + F   +     S++  
Sbjct: 334 GIKVAGRTVRVSPAVFRTPG--TVIDSGTVITRLPSRAYAALRSSFAGLMRR--YSYKRA 389

Query: 366 P----WKCCYKSSSQRLPKLPSVKLMF 388
           P       CY  + +   ++PSV L+F
Sbjct: 390 PALSILDTCYDFTGRNKVQIPSVALLF 416


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 113/244 (46%), Gaps = 30/244 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C  C     S+ +       ++ P  S T + + C
Sbjct: 99  IGTPPQRFALIVDTGSTVTYVPCSTCRHCG----SHQDP------KFRPEDSETYQPVKC 148

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           + + C+    C N ++ C Y   Y  E ++SSG L ED+   +S G+    +  +A  I 
Sbjct: 149 TWQ-CN----CDNDRKQCTYERRY-AEMSTSSGALGEDV---VSFGNQTELSPQRA--IF 197

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQ 287
           GC   ++G   +  A DG++GLG G++S+   L +  +I +SFS+C+     G       
Sbjct: 198 GCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLG 256

Query: 288 GPATQQSTSFLASNG-KYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTFLP 340
           G +      F  S+  +   Y I ++   +    L             ++DSG+++ +LP
Sbjct: 257 GISPPADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316

Query: 341 KEVY 344
           +  +
Sbjct: 317 ESAF 320


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 93/383 (24%), Positives = 166/383 (43%), Gaps = 50/383 (13%)

Query: 25  VMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQML 84
           + FS  L   FS E+       +R+++  P  ++ E  Q    ++  ++ M     F  +
Sbjct: 17  ICFSEALKSGFSVEII------HRDSSRSPFYRATET-QFQRVTNAVRRSMNRANHFNQI 69

Query: 85  --FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSA 141
             + +     ++L +D  +L      +GTP       +D  SD++W+ C  C  C     
Sbjct: 70  SVYSNAVESPVTLLDDGDYL--MSYSLGTPPFPVYGIVDTASDIIWVQCQLCETC----- 122

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQ-NPKQPCPYTMDYYTENTSS 198
             YN        + PS S T K+L CS   C    GTSC  + ++ C +T++Y  + + S
Sbjct: 123 --YNDTSP---MFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNY-KDGSHS 176

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G L+ + + L S  D  +        +IGC ++ +    D +   G++GLG G +S+  
Sbjct: 177 QGDLIVETVTLGSYNDPFVH---FPRTVIGC-IRNTNVSFDSI---GIVGLGGGPVSLVP 229

Query: 259 LLAKAGLIRNSFSMCFD--KDDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVET 313
            L+ +  I   FS C     D S ++ FGD    +     ST  +  + K   Y + +E 
Sbjct: 230 QLSSS--ISKKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKF-YYLTLEA 286

Query: 314 CCIGSSCLKQTSF--------KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
             +G++ ++  S           I+DSG++FT LP +VY  + +     V          
Sbjct: 287 FSVGNNRIEFRSSSSRSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLK 346

Query: 366 PWKCCYKSSSQRLPKLPSVKLMF 388
            +  CYKS+  ++  +P +   F
Sbjct: 347 QFSLCYKSTYDKV-DVPVITAHF 368


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 126/290 (43%), Gaps = 42/290 (14%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSL 147
            ++ + L +D  +L    + IGTP   +   LD GSDL+W  C  C+ C          +
Sbjct: 80  AARILVLASDGEYLM--EMGIGTPARFYSAILDTGSDLIWTQCAPCLLC----------V 127

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
           D+    + P+ SST + L CS   C+        ++ C Y   +Y ++ S++G+L  +  
Sbjct: 128 DQPTPYFDPANSSTYRSLGCSAPACNALYYPLCYQKTCVYQY-FYGDSASTAGVLANETF 186

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
                G N  + ++   +  GCG   +G   +G    G++G G G +   SL+++ G  R
Sbjct: 187 TF---GTNDTRVTLP-RISFGCGNLNAGSLANG---SGMVGFGRGSL---SLVSQLGSPR 236

Query: 268 NSFSMC-FDKDDSGRIFFG------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
            S+ +  F      R++FG          +T QST F+ +      Y + +    +G + 
Sbjct: 237 FSYCLTSFLSPVRSRLYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNR 296

Query: 321 L-----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
           L              +   I+DSG++ T+L +  Y  +   F   +N T+
Sbjct: 297 LPIDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTL 346


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 96/380 (25%), Positives = 158/380 (41%), Gaps = 56/380 (14%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATS-WPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLF 85
           FST L H   ++ +A  ++     TS  P+++         ++ ++K K   G     L 
Sbjct: 67  FSTVLTH---DDARAAHLASRLATTSNAPSRRP--------TTSLRKPKAAAGASGGPLD 115

Query: 86  PSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            S  S  ++ G   G  +Y T + +GTP  S+ + +D GS L W+     +C+P   S +
Sbjct: 116 DSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWL-----QCSPCVVSCH 170

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNP-----KQPCPYTMDYYTENTSS 198
             +      Y P ASST   + CS   CD L  +  NP     +  C Y    Y +++ S
Sbjct: 171 RQVG---PLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQAS-YGDSSFS 226

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G L  D    +S G  +  N        GCG    G +       GLIGL   ++S+  
Sbjct: 227 VGYLSRDT---VSFGSGSYPN-----FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLY 275

Query: 259 LLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
            LA +  +  SFS C     S G +  G         T   +S+     Y + +    +G
Sbjct: 276 QLAPS--LGYSFSYCLPTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVG 333

Query: 318 SSCL-----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WK 368
            S L     + +S   I+DSG+  T LP  VY  ++    + V   +   +  P      
Sbjct: 334 GSPLAVSPAEYSSLPTIIDSGTVITRLPTAVYTALS----KAVAAAMVGVQSAPAFSILD 389

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            C++  + +L ++P+V + F
Sbjct: 390 TCFQGQASQL-RVPAVAMAF 408


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 135/342 (39%), Gaps = 62/342 (18%)

Query: 93  MSLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD-RD 150
           M    D+G   Y+    +GTP+  F++  D GSDL W+ C    C   + S   +   R 
Sbjct: 1   MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCK-YHCRSRNCSNRKARRIRH 59

Query: 151 LNEYSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENTSSSGLL 202
              +  + SS+ K + C   +C +        T+C  P  PC Y  DY Y++ +++ G  
Sbjct: 60  KRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALGFF 117

Query: 203 VED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
             +   + L  G    L N     V+IGC     G      A DG++GLG  + S    +
Sbjct: 118 ANETVTVELKEGRKMKLHN-----VLIGCSESFQGQSFQ--AADGVMGLGYSKYSFA--I 168

Query: 261 AKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVET-- 313
             A      FS C       K+ S  + FG     + +S   L +N  Y   ++G+    
Sbjct: 169 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFG-----SSRSKEALLNNMTYTELVLGMVNSF 223

Query: 314 -------CCIGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFD------ 352
                    IG + LK        + +   I+DSGSS TFL +  Y+ + A         
Sbjct: 224 YAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKF 283

Query: 353 RQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 394
           R+V   I      P + C+ S+      +P +   F     F
Sbjct: 284 RKVEMDIG-----PLEYCFNSTGFEESLVPRLVFHFADGAEF 320


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 54/173 (31%), Positives = 79/173 (45%), Gaps = 25/173 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP V  L+A+D GSD+ W+ C  C RC P S   ++          P  S++ + +
Sbjct: 138 IAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFD----------PRHSTSYREM 187

Query: 166 SCSHRLCD-LGTSCQN--PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
                 C  LG S      +  C Y + Y  + +++ G  +E+ L    G        VQ
Sbjct: 188 GYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGG--------VQ 239

Query: 223 ASVI-IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
              + IGCG    G +    A  G++GLG G+IS PS +A  G    SFS C 
Sbjct: 240 VPHMSIGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCL 290


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 95/385 (24%), Positives = 153/385 (39%), Gaps = 60/385 (15%)

Query: 30  KLIHRF-----SEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQML 84
           +L HR      + +  ALG   +   T    ++  EY Q  +S           P  Q+ 
Sbjct: 68  RLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSG-----AAAAAPGMQLA 122

Query: 85  FPSQGSKTMSLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
                +   +LG   G L Y   + +GTP V+  + +D GSD+ W+ C      P     
Sbjct: 123 GSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPC---- 178

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD----LGTSCQNPKQPCPYTMDYYTENTSSS 199
           Y+  D     + P+ SS+   + C+   C         C   +  C Y +  Y + ++++
Sbjct: 179 YSQRD---PLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQ--CGYVVS-YGDGSTTT 232

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL-GEISVPS 258
           G+   D L L   G NALK       + GCG  Q  G   GV  DGL+GLG  G+    S
Sbjct: 233 GVYSSDTLTLT--GSNALKG-----FLFGCGHAQQ-GLFAGV--DGLLGLGRQGQ----S 278

Query: 259 LLAKAGLIRNS-FSMCFDKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETC 314
           L+++A       FS C     +   +    GP++     +T  L ++     YI+ +   
Sbjct: 279 LVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGI 338

Query: 315 CIGSSCLK--QTSFK--AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---- 366
            +G   L    + F   A+VD+G+  T LP   Y  + + F   +        GYP    
Sbjct: 339 SVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRSAFRAAMAP-----YGYPSAPA 393

Query: 367 ---WKCCYKSSSQRLPKLPSVKLMF 388
                 CY  +      LP++ + F
Sbjct: 394 TGILDTCYDFTRYGTVTLPTISIAF 418


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 135/323 (41%), Gaps = 32/323 (9%)

Query: 95  LGNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE 153
           LG+    L Y   + +GTP V+  V +D GSD+ W+ C+     P  A      D     
Sbjct: 118 LGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFD----- 172

Query: 154 YSPSASSTSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
             P+ SST + +SC+   C      G  C      C Y +  Y + ++++G    D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQ-YGDGSTTNGTYSRDTLTL 229

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
            SG  +A+K         GC   +S G+ D    DGL+GLG G  S+ S  A A    NS
Sbjct: 230 -SGASDAVKG-----FQFGCSHVES-GFSDQT--DGLMGLGGGAQSLVSQTAAA--YGNS 278

Query: 270 FSMCF---DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQT 324
           FS C              G  G +   +T  L S      Y   ++   +G     L  +
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPS 338

Query: 325 SFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            F A  +VDSG+  T LP   Y  +++ F   +    ++        C+  + Q    +P
Sbjct: 339 VFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIP 398

Query: 383 SVKLMFPQNNSFVVNNPVFVIYG 405
           +V L+F    + +  +P  ++YG
Sbjct: 399 TVALVF-SGGAAIDLDPNGIMYG 420


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 74/285 (25%), Positives = 126/285 (44%), Gaps = 38/285 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP +     +D GSDL+W+ C  C+ C       YN ++     + P  SST  ++SC
Sbjct: 70  IGTPPIKISGTVDTGSDLIWVQCVPCLGC-------YNQINP---MFDPLKSSTYTNISC 119

Query: 168 SHRLC--DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
              LC       C +P++ C YT   Y +++ + G+L ++ + L S   N  K      +
Sbjct: 120 DSPLCYKPYIGEC-SPEKRCDYTYG-YADSSLTKGVLAQETVTLTS---NTGKPISLQGI 174

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI--RNSFSMCF-----DKDD 278
           + GCG   +G + D     GLIGLG G     SL+++ G +     FS C      D   
Sbjct: 175 LFGCGHNNTGNFNDHEM--GLIGLGGGPT---SLVSQIGPLFGGKKFSQCLVPFLTDITI 229

Query: 279 SGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSF----KAIVD 331
           S ++ FG       +   +T  +       +Y + +    +  + L   S       +VD
Sbjct: 230 SSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVD 289

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSS 375
           SG+    LP+++Y+ +  E   +V  + IT       + CY++ +
Sbjct: 290 SGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQT 334


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 130/340 (38%), Gaps = 46/340 (13%)

Query: 81  FQMLFPSQGSKTMSLGNDFGWL--------HYTWIDIGTPNVSFLVALDAGSDLLWIPCD 132
           F ML P     TMS  ++ G          +   + IGTP V F+   D GSDL W  C 
Sbjct: 67  FMMLLPRY--STMSTSSNAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCK 124

Query: 133 -CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY 191
            C  C P     Y++         P AS+T   +  S R C   T+      PC Y    
Sbjct: 125 PCKLCFPQDTPIYDTAASASFSPVPCASATCLPIWRSSRNCTATTT-----SPCRYRYA- 178

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLG 250
           Y +   S+G+L  + L        A    V    V  GCG+   G   +     G +GLG
Sbjct: 179 YDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNST---GTVGLG 235

Query: 251 LGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRIFFGDQ---------GPATQQSTSFLA 299
            G +   SL+A+ G+ + S+ +   F+      + FG           G A  QST  + 
Sbjct: 236 RGSL---SLVAQLGVGKFSYCLTDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQ 292

Query: 300 SNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAA 349
                  Y + +E   +G + L             S   IVDSG+ FT L +  +  +  
Sbjct: 293 GPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVN 352

Query: 350 EFDRQVNDTITSFEGYPWKCCYKSSS-QRLPKLPSVKLMF 388
                +N  + +       C   ++  Q+LP +P + L F
Sbjct: 353 HVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMPDMLLHF 392


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 102/399 (25%), Positives = 167/399 (41%), Gaps = 59/399 (14%)

Query: 15  LLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQK 74
           L+   S A       K IH  + + +   V  N + +S   K  F Y     S+ + +Q 
Sbjct: 28  LVLRDSAARGGGIGFKAIHVAAPQSR---VKANPSPSSAAQKSLFPY-----SAHIFQQH 79

Query: 75  MKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DC 133
            K     +       S T +LG  FG  +YT I +G+P    ++ +D GS+L W+ C  C
Sbjct: 80  TKNPAALR-------SSTTTLGRKFGE-YYTSIKLGSPGQEAILIVDTGSELTWLQCLPC 131

Query: 134 VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS----HRLCDLGTSCQNPKQPCPYTM 189
             CAP   + Y++       Y P   + S+  S S    +  C  G+ CQ          
Sbjct: 132 KVCAPSVDTIYDAARS--ASYRPVTCNNSQLCSNSSQGTYAYCARGSQCQ--------FA 181

Query: 190 DYYTENTSSSGLLVED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 247
            +Y + + S G L  D  I+  + GG    K         GC   Q    L      G++
Sbjct: 182 AFYGDGSFSYGSLSTDTLIMETVVGG----KPVTVQDFAFGCA--QGDLELVPTGASGIL 235

Query: 248 GLGLGEISVPSLLAKAGLIRNSFSMCF-DK----DDSGRIFFGD-QGPATQ-QSTSFLAS 300
           GL  G++++P  L +       FS CF D+    + +G +FFG+ + P  Q Q TS   +
Sbjct: 236 GLNAGKMALPMQLGQR--FGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALT 293

Query: 301 NGKYIT--YIIGVETCCIGSSCLKQTSFKAIV--DSGSSFTFLPKEVYETIAAEFDRQVN 356
           N +     Y + ++   I S  L      ++V  DSGSSF+   +  +  +   F +   
Sbjct: 294 NSELQRKFYHVALKGVSINSHELVFLPRGSVVILDSGSSFSSFVRPFHSQLREAFLKHRP 353

Query: 357 DTITSFEGYPW---KCCYKSSSQRLPK----LPSVKLMF 388
            ++   EG  +     C+K S+  + +    LPS+ L+F
Sbjct: 354 PSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVF 392


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 123/314 (39%), Gaps = 56/314 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP     + LD GSDL+W  C  C  C            R L    PS SST   L
Sbjct: 419 LAIGTPPQPVQLILDTGSDLVWTQCRPCPVC----------FSRALGPLDPSNSSTFDVL 468

Query: 166 SCSHRLCDLGT--SCQNP---KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
            CS  +CD  T  SC       Q C Y   Y   + ++  L  E      + G      +
Sbjct: 469 PCSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTG---QA 525

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---D 277
               +  GCG+  +G +       G+ G G G +S+PS L       ++FS CF      
Sbjct: 526 TVPDLAFGCGLFNNGIFTSNET--GIAGFGRGALSLPSQLKV-----DNFSHCFTAITGS 578

Query: 278 DSGRIFFG------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK-- 327
           +   +  G             QST  + +      Y + ++   +GS+ L   +++F   
Sbjct: 579 EPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALK 638

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQV-----NDTITSFEGYPWKCCYKSSSQ 376
                  I+DSG+  T LP++ Y+ +   F  QV     N T +S      + C+  S  
Sbjct: 639 QDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLS----RLCFSFSVP 694

Query: 377 RL--PKLPSVKLMF 388
           R   P +P + L F
Sbjct: 695 RRAKPDVPKLVLHF 708


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/333 (27%), Positives = 139/333 (41%), Gaps = 46/333 (13%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 169 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVVCYEQQEK--- 220

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P+ SST  ++SC+   C DL T  C      C Y +  Y + + S G    D L L 
Sbjct: 221 LFDPARSSTYANVSCAAPACFDLDTRGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 277

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 278 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 324

Query: 270 FSMCFDKDDSGRIF--FGDQGPA---TQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
           F+ C     SG  +  FG   PA    + +T  L  NG    Y +G+    +G   L   
Sbjct: 325 FAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIP 383

Query: 323 QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSS 375
           Q+ F     IVDSG+  T LP   Y ++ + F   +      ++  P       CY  + 
Sbjct: 384 QSVFATAGTIVDSGTVITRLPPPAYSSLRSAFVSAM--AARGYKKAPAVSLLDTCYDFTG 441

Query: 376 QRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 408
                +P+V L+F Q  + +  +   ++Y   V
Sbjct: 442 MSQVAIPTVSLLF-QGGAILDVDASGIMYAASV 473


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/347 (25%), Positives = 137/347 (39%), Gaps = 68/347 (19%)

Query: 93  MSLGN--DFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDR 149
           M LG+  D+G   Y T I +GTP   F V +D GS+L W+ C            Y +  +
Sbjct: 93  MDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCR-----------YRARGK 141

Query: 150 DLNE-YSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENTSSSG 200
           D    +    S + K + C  + C +        T+C  P  PC Y  DY Y + +++ G
Sbjct: 142 DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY--DYRYADGSAAQG 199

Query: 201 LLVEDILHLISGGDNALKNSVQASV---IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
           +  ++ + +       L N   A +   +IGC    +G    G   DG++GL   + S  
Sbjct: 200 VFAKETITV------GLTNGRMARLPGHLIGCSSSFTGQSFQGA--DGVLGLAFSDFSFT 251

Query: 258 SLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI---- 308
           S      L    FS C      +K+ S  + FG    +    T+F  +    +T I    
Sbjct: 252 S--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGS---SRSTKTAFRRTTPLDLTRIPPFY 306

Query: 309 --------IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQ-VNDT 358
                   +G +   I S     TS    I+DSG+S T L    Y+ +     R  V   
Sbjct: 307 AINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELK 366

Query: 359 ITSFEGYPWKCCYK-SSSQRLPKLPSVKLMF-------PQNNSFVVN 397
               EG P + C+  +S   + KLP +           P   S++V+
Sbjct: 367 RVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD 413


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 116/272 (42%), Gaps = 43/272 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP    +   D GSDLLW      +CAP     Y  +D     + P  SST K +S
Sbjct: 94  VSIGTPPFPIMAIADTGSDLLW-----TQCAPCD-DCYTQVDP---LFDPKTSSTYKDVS 144

Query: 167 CSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           CS   C   +   SC      C Y++  Y +N+ + G +  D L L S     ++     
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLS-YGDNSYTKGNIAVDTLTLGSSDTRPMQ---LK 200

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAG-LIRNSFSMCF-----DK 276
           ++IIGCG   +G +      +      +G    P SL+ + G  I   FS C       K
Sbjct: 201 NIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKK 254

Query: 277 DDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLK-------QTSF 326
           D + +I FG     +     ST  +A   +   Y + +++  +GS  ++        +  
Sbjct: 255 DQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEG 314

Query: 327 KAIVDSGSSFTFLPKEVY----ETIAAEFDRQ 354
             I+DSG++ T LP E Y    + +A+  D +
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAE 346


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/347 (25%), Positives = 137/347 (39%), Gaps = 68/347 (19%)

Query: 93  MSLGN--DFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDR 149
           M LG+  D+G   Y T I +GTP   F V +D GS+L W+ C            Y +  +
Sbjct: 71  MDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCR-----------YRARGK 119

Query: 150 DLNE-YSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENTSSSG 200
           D    +    S + K + C  + C +        T+C  P  PC Y  DY Y + +++ G
Sbjct: 120 DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY--DYRYADGSAAQG 177

Query: 201 LLVEDILHLISGGDNALKNSVQASV---IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
           +  ++ + +       L N   A +   +IGC    +G    G   DG++GL   + S  
Sbjct: 178 VFAKETITV------GLTNGRMARLPGHLIGCSSSFTGQSFQGA--DGVLGLAFSDFSFT 229

Query: 258 SLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYI---- 308
           S      L    FS C      +K+ S  + FG    +    T+F  +    +T I    
Sbjct: 230 S--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGS---SRSTKTAFRRTTPLDLTRIPPFY 284

Query: 309 --------IGVETCCIGSSCLKQTSFKA-IVDSGSSFTFLPKEVYETIAAEFDRQ-VNDT 358
                   +G +   I S     TS    I+DSG+S T L    Y+ +     R  V   
Sbjct: 285 AINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELK 344

Query: 359 ITSFEGYPWKCCYK-SSSQRLPKLPSVKLMF-------PQNNSFVVN 397
               EG P + C+  +S   + KLP +           P   S++V+
Sbjct: 345 RVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD 391


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 84/315 (26%), Positives = 131/315 (41%), Gaps = 60/315 (19%)

Query: 107  IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
            + +G+P     + LD GS+L W+ C   + +P   S +N L    + YSP   S+     
Sbjct: 1004 LTVGSPPQQVTMVLDTGSELSWLHC---KKSPNLTSVFNPLSS--SSYSPIPCSSP---I 1055

Query: 167  CSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C  R  DL      +PK+ C + +  Y + +S  G L  D   +   G +AL  +     
Sbjct: 1056 CRTRTRDLPNPVTCDPKKLC-HAIVSYADASSLEGNLASDNFRI---GSSALPGT----- 1106

Query: 226  IIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KDDSG 280
            + GC      G+      D    GL+G+  G +S    + + GL +  FS C   +D SG
Sbjct: 1107 LFGC---MDSGFSSNSEEDAKTTGLMGMNRGSLS---FVTQLGLPK--FSYCISGRDSSG 1158

Query: 281  RIFFGD----------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--------- 321
             + FGD            P  Q ST     +   + Y + ++   +G+  L         
Sbjct: 1159 VLLFGDLHLSWLGNLTYTPLVQISTPLPYFD--RVAYTVQLDGIRVGNKILPLPKSIFAP 1216

Query: 322  -KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKS 373
                + + +VDSG+ FTFL   VY  +  EF  Q    +         F+G    C   +
Sbjct: 1217 DHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVA 1276

Query: 374  SSQRLPKLPSVKLMF 388
            +  +LP LPSV LMF
Sbjct: 1277 AGGKLPTLPSVSLMF 1291


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 116/272 (42%), Gaps = 43/272 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP    +   D GSDLLW      +CAP     Y  +D     + P  SST K +S
Sbjct: 94  VSIGTPPFPIMAIADTGSDLLW-----TQCAPCD-DCYTQVDP---LFDPKTSSTYKDVS 144

Query: 167 CSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           CS   C   +   SC      C Y++  Y +N+ + G +  D L L S     ++     
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLS-YGDNSYTKGNIAVDTLTLGSSDTRPMQ---LK 200

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAG-LIRNSFSMCF-----DK 276
           ++IIGCG   +G +      +      +G    P SL+ + G  I   FS C       K
Sbjct: 201 NIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKK 254

Query: 277 DDSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLK-------QTSF 326
           D + +I FG     +     ST  +A   +   Y + +++  +GS  ++        +  
Sbjct: 255 DQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEG 314

Query: 327 KAIVDSGSSFTFLPKEVY----ETIAAEFDRQ 354
             I+DSG++ T LP E Y    + +A+  D +
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAE 346


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 138/332 (41%), Gaps = 50/332 (15%)

Query: 90  SKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLD 148
           S  ++LG   G  +Y  + +GTP V  ++ +D GSD+ WI C  C  C P     +N   
Sbjct: 127 SPVVTLGQA-GLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRH 185

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
                  P ASST     C++    +   C    + C +++  Y + + SSGLL    + 
Sbjct: 186 SSSFFKLPCASST-----CTNVYQGVKPFCSPSGRTCLFSIQ-YGDGSLSSGLLA---ME 236

Query: 209 LISGGDNALKNSVQ---ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            I+G      +      +++ +GC      G   G +  GL+G+    IS PS L+    
Sbjct: 237 TIAGNTPNFGDGEPVKLSNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR-- 292

Query: 266 IRNSFSMCF-DK----DDSGRIFFGDQG---------PATQQSTSFLASNGKYITYIIGV 311
               FS CF DK    + SG +FFG+           P  Q      AS   Y   ++G+
Sbjct: 293 YARKFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGI 352

Query: 312 ETCCIGSSCLKQT-----------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
               +  S L  +           S   I+DSG++FT+L K  ++ +  EF  + +    
Sbjct: 353 S---VDESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK 409

Query: 361 SFEGYPWKCCYK----SSSQRLPKLPSVKLMF 388
             +   +  CY     +++     LPS+ L F
Sbjct: 410 VDDNSGFTPCYNITSGTAALESTILPSITLHF 441


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 82/297 (27%), Positives = 119/297 (40%), Gaps = 36/297 (12%)

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
           +  +    GP   +  P++   ++  GN     +   + +GTP     V  D GSDL W 
Sbjct: 128 ITNETSAVGPGVSL--PAERGISVGTGN-----YVVSVGLGTPARDLTVVFDTGSDLSW- 179

Query: 130 PCDCVRCAPLSA-SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNP--KQPCP 186
               V+C P S+   Y   D     ++PS SST   + C  R C    SC        CP
Sbjct: 180 ----VQCGPCSSGGCYKQQD---PLFAPSDSSTFSAVRCGARECRARQSCGGSPGDDRCP 232

Query: 187 YTMDYYTENTSSSGLLVEDILHL---ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP 243
           Y +  Y + + + G L  D L L        +A  ++     + GCG   +G  L G A 
Sbjct: 233 YEV-VYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTG--LFGQA- 288

Query: 244 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS---GRIFFGD--QGPATQQSTSFL 298
           DGL GLG G++S+ S    AG     FS C     S   G +  G     PA  Q T  L
Sbjct: 289 DGLFGLGRGKVSLSS--QAAGKFGEGFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPML 346

Query: 299 ASNGKYITYIIGVETCCIGSSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEF 351
                   Y + +    +    ++ +S +     IVDSG+  T L    Y  + A F
Sbjct: 347 NRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVITRLAPRAYRALRAAF 403


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 89/345 (25%), Positives = 143/345 (41%), Gaps = 80/345 (23%)

Query: 59  FEYYQVLLSSDVQKQKMKTGPQFQM--------LFP-SQGSKTMSLGNDFGWLHYTWIDI 109
           F+   +LLS+ + + +    PQ +         LFP S G+ ++SL              
Sbjct: 91  FKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRSYGAYSVSLA------------F 138

Query: 110 GTP--NVSFLVALDAGSDLLWIPCDC-VRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           GTP  N+SF+   D GS L+W PC    RC+  S  Y +     ++++ P  SS+ K + 
Sbjct: 139 GTPPQNLSFI--FDTGSSLVWFPCTAGYRCSRCSFPYVDP--ATISKFVPKLSSSVKVVG 194

Query: 167 CSHRLC------DLGTSCQNPKQP-------CP-YTMDYYTENTSSSGLLVEDILHLISG 212
           C +  C      +L + C+N           CP Y + Y +  T+  G+L+ + L L   
Sbjct: 195 CRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA--GILLSETLDL--- 249

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
                +N      ++GC +      +    P G+ G G G  S+PS +          S 
Sbjct: 250 -----ENKRVPDFLVGCSV------MSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSR 298

Query: 273 CFDKDDSGRIFFGDQGPATQQST--SFL---------ASNGKYITYI-IGVETCCIGSSC 320
            FD          D G  + +S   SF+          SN  +  Y  + +    IG   
Sbjct: 299 GFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKP 358

Query: 321 LK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
           +K            +  AI+DSGS+FTFL K ++E IA E ++Q+
Sbjct: 359 VKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQL 403


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 133/309 (43%), Gaps = 46/309 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +GTP     + +D GSD+LW+ C  CV C       Y+  D     + P  SST
Sbjct: 37  YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSC-------YHQCDE---VFDPYKSST 86

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL--ISGGDNA 216
              L C+ R C   D+G    N    C Y +D Y + + S+G    D + L   SGG   
Sbjct: 87  YSTLGCNSRQCLNLDVGGCVGN---KCLYQVD-YGDGSFSTGEFATDAVSLNSTSGGGQV 142

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
           + N +     +GCG    G +   V   GL+GLG G +S P+ +      R  FS C   
Sbjct: 143 VLNKIP----LGCGHDNEGYF---VGAAGLLGLGKGPLSFPNQINSENGGR--FSYCLTG 193

Query: 275 -DKDDSGR--IFFGDQG--PA----TQQSTSFLASNGKYITYI---IGVETCCIGSSCLK 322
            D D + R  + FGD    PA    T Q+++   S   Y+      +G     I +S  +
Sbjct: 194 RDTDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQ 253

Query: 323 QTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP 379
             S      I+DSG+S T L    Y ++   F    +D + + E   +  CY  S     
Sbjct: 254 LDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSV 313

Query: 380 KLPSVKLMF 388
            +P+V L F
Sbjct: 314 DVPTVTLHF 322


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 95/385 (24%), Positives = 153/385 (39%), Gaps = 60/385 (15%)

Query: 30  KLIHRF-----SEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQML 84
           +L HR      + +  ALG   +   T    ++  EY Q  +S           P  Q+ 
Sbjct: 57  RLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSG-----AAAAAPGMQLA 111

Query: 85  FPSQGSKTMSLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
                +   +LG   G L Y   + +GTP V+  + +D GSD+ W+ C      P     
Sbjct: 112 GSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPC---- 167

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD----LGTSCQNPKQPCPYTMDYYTENTSSS 199
           Y+  D     + P+ SS+   + C+   C         C   +  C Y +  Y + ++++
Sbjct: 168 YSQRD---PLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQ--CGYVVS-YGDGSTTT 221

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL-GEISVPS 258
           G+   D L L   G NALK       + GCG  Q  G   GV  DGL+GLG  G+    S
Sbjct: 222 GVYSSDTLTLT--GSNALKG-----FLFGCGHAQQ-GLFAGV--DGLLGLGRQGQ----S 267

Query: 259 LLAKAGLIRNS-FSMCFDKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETC 314
           L+++A       FS C     +   +    GP++     +T  L ++     YI+ +   
Sbjct: 268 LVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGI 327

Query: 315 CIGSSCLK--QTSFK--AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---- 366
            +G   L    + F   A+VD+G+  T LP   Y  + + F   +        GYP    
Sbjct: 328 SVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRSAFRAAMAP-----YGYPSAPA 382

Query: 367 ---WKCCYKSSSQRLPKLPSVKLMF 388
                 CY  +      LP++ + F
Sbjct: 383 TGILDTCYDFTRYGTVTLPTISIAF 407


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 84/326 (25%), Positives = 131/326 (40%), Gaps = 44/326 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP  + LVA+D  +D  W+PC  C+ CAP ++S           + P+ SST + + C
Sbjct: 106 LGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS---------PSFDPTQSSTYRPVRC 156

Query: 168 SHRLC----DLGTSC-QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
               C        SC   P   C + + Y +    +  +L +D L L      A+ +   
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA--VLGQDALSLSDSNGAAVPDD-- 212

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCF----DKD 277
                GC ++   G    V P GL+G G G +   S L++      S FS C       +
Sbjct: 213 -HYTFGC-LRVVTGSGGSVPPQGLVGFGRGPL---SFLSQTKATYGSIFSYCLPSYKSSN 267

Query: 278 DSGRIFFGDQGPATQQSTSFLASNGK----YITYIIGV----ETCCIGSSCLKQTSFKA- 328
            SG +  G  G   +  T+ L SN      Y   ++GV    +   I +S L   +    
Sbjct: 268 FSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGR 327

Query: 329 ---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
              IVD+G+ FT L    Y  +   F R V+       G    C Y + ++    +P+V 
Sbjct: 328 GGTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALGGFDTCYYVNGTK---SVPAVA 384

Query: 386 LMFPQNNSFVVNNPVFVIYGTQVGVS 411
            +F       +     VI  T  GV+
Sbjct: 385 FVFAGGARVTLPEENVVISSTSGGVA 410


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 129/318 (40%), Gaps = 56/318 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLN-EYSPSASSTSKH 164
           I IG P +  LV +D GSD+LW+ C  C  C           D  L   + PS SST   
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNC-----------DNHLGLLFDPSMSSTFSP 153

Query: 165 L---SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           L    C  + C   + C     P P+T+  Y +N+++SG+   D +   +  +     S 
Sbjct: 154 LCKTPCDFKGC---SRC----DPIPFTVT-YADNSTASGMFGRDTVVFETTDEGT---SR 202

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS- 279
              V+ GCG   + G       +G++GL  G    P  LA    I   FS C  D  D  
Sbjct: 203 IPDVLFGCG--HNIGQDTDPGHNGILGLNNG----PDSLATK--IGQKFSYCIGDLADPY 254

Query: 280 ---GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSF 326
               ++  G+       ST F   NG Y   + G+    +G   L          K  + 
Sbjct: 255 YNYHQLILGEGADLEGYSTPFEVHNGFYYVTMEGIS---VGEKRLDIAPETFEMKKNRTG 311

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYPW-KCCYKSSSQRLPKLPS 383
             I+D+GS+ TFL   V+  ++ E    +  +   T+ E  PW +C Y S S+ L   P 
Sbjct: 312 GVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPV 371

Query: 384 VKLMFPQNNSFVVNNPVF 401
           V   F       +++  F
Sbjct: 372 VTFHFADGADLALDSGSF 389


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 92/339 (27%), Positives = 140/339 (41%), Gaps = 83/339 (24%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V+F V  D GS L+W  C  C  CA           R    + P++SST   L
Sbjct: 94  LSIGTPPVTFSVLADTGSSLIWTQCAPCTECA----------ARPAPPFQPASSSTFSKL 143

Query: 166 SCSHRLCDLGTSCQNPKQPC---------PYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            C+  LC   TS   P   C         PY M +      ++G L  + LH+  GG + 
Sbjct: 144 PCASSLCQFLTS---PYLTCNATGCVYYYPYGMGF------TAGYLATETLHV--GGAS- 191

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                   V  GC  +       G +  G++GLG   +   SL+++ G+ R  FS C   
Sbjct: 192 -----FPGVAFGCSTENG----VGNSSSGIVGLGRSPL---SLVSQVGVGR--FSYCLRS 237

Query: 277 D-DSGR--IFFGDQGPATQ---QSTSFL-----ASNGKYITYIIGVETCCIGSSCLKQTS 325
           D D+G   I FG     T    QST  L      S+  Y   + G+    +G++ L  TS
Sbjct: 238 DADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGIT---VGATDLPVTS 294

Query: 326 FK--------------AIVDSGSSFTFLPKEVYETIAAEFDRQV--NDTITSFEG--YPW 367
                            IVDSG++ T+L KE Y  +   F  Q+   +  T+  G  + +
Sbjct: 295 TTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGF 354

Query: 368 KCCYKSSS----QRLPKLPSVKLMFPQNNSFVVNNPVFV 402
             C+ +++      +P +P++ L F     + V    +V
Sbjct: 355 DLCFDATAAGGGSGVP-VPTLVLRFAGGAEYAVRRRSYV 392


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 102/259 (39%), Gaps = 26/259 (10%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP V  L   D GSDL+W+ C  C  C P S   +           P  SST    +C
Sbjct: 96  IGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQ----------PLKSSTFMPTTC 145

Query: 168 SHRLCDLGTSCQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
             + C L    Q        C YT  Y  + + S GLL  + L   S G   ++     +
Sbjct: 146 RSQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQG--GVQTVAFPN 203

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGR 281
              GCG+  +          G++GLG G +S+ S +     I + FS C        + +
Sbjct: 204 SFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSK 261

Query: 282 IFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKAIVDSGSSF 336
           + FG++   T +   ST  +        Y + +E   +    +    T    I+DSG+  
Sbjct: 262 LKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLL 321

Query: 337 TFLPKEVYETIAAEFDRQV 355
           T+L +  Y   AA     +
Sbjct: 322 TYLGESFYYNFAASLQESL 340


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/257 (26%), Positives = 107/257 (41%), Gaps = 38/257 (14%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP   F + +D GS + ++PC  C  C                ++ P  S T + + C
Sbjct: 95  IGTPPQRFALIVDTGSTVTYVPCSTCEHCG----------RHQDPKFQPDLSETYQPVKC 144

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           +   C+    C      C Y   Y  E +SSSG+L ED+   +S G+  L        + 
Sbjct: 145 TPD-CN----CDGDTNQCMYDRQY-AEMSSSSGVLGEDV---VSFGN--LSELAPQRAVF 193

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFF 284
           GC   ++G      A DG++GLG G++S+   L    +I +SFS+C+   D G    I  
Sbjct: 194 GCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG 252

Query: 285 GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------QTSFKAIVDSGSSFTF 338
           G   P     T        Y  Y I ++   +    L+            ++DSG+++ +
Sbjct: 253 GISPPEDMVFTHSDPDRSPY--YNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAY 310

Query: 339 LPKEVYETIAAEFDRQV 355
           LP    ET    F R +
Sbjct: 311 LP----ETAFLAFKRAI 323


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/320 (27%), Positives = 124/320 (38%), Gaps = 43/320 (13%)

Query: 95  LGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRD 150
           LG  F  L Y   I IGTP  +F V  D GSDL W+   PC    C P     ++     
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFD----- 167

Query: 151 LNEYSPSASSTSKHLSCSHRLCDLGTSCQNP--KQPCPYTMDYYTENTSSSGLLVEDILH 208
                PS SST   + CS   C +G   Q       C Y++ Y  E + + G L E+   
Sbjct: 168 -----PSKSSTYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDE-SETHGSLAEETFT 221

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLD-GVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           L      A        V+ GC  +    + D G+   GL+GLG G+ S+   L++     
Sbjct: 222 LSPPSPLA---PAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSI---LSQTRRSI 275

Query: 268 NS----FSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYIT-------YIIGVETC 314
           NS    FS C     S  G +  G    A QQ  S L+      T       Y++ +   
Sbjct: 276 NSGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGV 335

Query: 315 CIGSSCL----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP--WK 368
            +  + +       S  A++DSG+  T +P   Y  +  EF   +       EG      
Sbjct: 336 SVNGAAVDIPASAFSLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLD 395

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            CY  + Q +   P V L F
Sbjct: 396 TCYDVTGQDVVTAPRVALEF 415


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 104/248 (41%), Gaps = 43/248 (17%)

Query: 167 CSHRLCD--LGTSCQN----PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           C   LC   L  SC N    P Q C YT  YY + + ++GL+  D     +G        
Sbjct: 38  CDSTLCQGLLVASCGNTKFWPNQTCVYTY-YYNDKSVTTGLIEVDKFTFGAGAS------ 90

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK---- 276
               V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF      
Sbjct: 91  -VPGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGL 142

Query: 277 -------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK 327
                  D    ++    G    QST  + ++     Y + ++   +GS+ L   +++F 
Sbjct: 143 KQSTVLLDLPADLY--KNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFA 200

Query: 328 -------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
                   I+DSG+S T LP +VY+ +  EF  Q+   +          C+ + SQ  P 
Sbjct: 201 LTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPD 260

Query: 381 LPSVKLMF 388
           +P + L F
Sbjct: 261 VPKLVLHF 268


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 138/332 (41%), Gaps = 50/332 (15%)

Query: 90  SKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLD 148
           S  ++LG   G  +Y  + +GTP V  ++ +D GSD+ WI C  C  C P     +N   
Sbjct: 126 SPVVTLGQA-GLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRH 184

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
                  P ASST     C++    +   C    + C +++  Y + + SSGLL    + 
Sbjct: 185 SSSFFKLPCASST-----CTNVYQGVKPFCSPSGRTCLFSIQ-YGDGSLSSGLLA---ME 235

Query: 209 LISGGDNALKNSVQ---ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL 265
            I+G      +      +++ +GC      G   G +  GL+G+    IS PS L+    
Sbjct: 236 TIAGNTPNFGDGEPVKLSNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR-- 291

Query: 266 IRNSFSMCF-DK----DDSGRIFFGDQG---------PATQQSTSFLASNGKYITYIIGV 311
               FS CF DK    + SG +FFG+           P  Q      AS   Y   ++G+
Sbjct: 292 YARKFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGI 351

Query: 312 ETCCIGSSCLKQT-----------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
               +  S L  +           S   I+DSG++FT+L K  ++ +  EF  + +    
Sbjct: 352 S---VDESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK 408

Query: 361 SFEGYPWKCCYK----SSSQRLPKLPSVKLMF 388
             +   +  CY     +++     LPS+ L F
Sbjct: 409 VDDNSGFTPCYNITSGTAALESTILPSITLHF 440


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/299 (25%), Positives = 124/299 (41%), Gaps = 50/299 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP    L+A+D  SD+ WIPC  CV C   +A            +SP+ S++ K++SC
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 168

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           S   C    +     + C + + Y + + +++  L +D + L +    A           
Sbjct: 169 SAPQCKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKAF--------TF 218

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIF 283
           GC  K +GG   G  P     LGLG   +  +     + +++FS C         SG + 
Sbjct: 219 GCVNKVAGG---GTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLR 275

Query: 284 FGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDS 332
            G    P   + T  L +  +   Y + +    +G   +            T    I DS
Sbjct: 276 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 335

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDT---ITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           G+ +T L K VYE +  EF ++V  T   +TS  G+    CY        K+P++  MF
Sbjct: 336 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGF--DTCYSGQV----KVPTITFMF 388


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 84/322 (26%), Positives = 128/322 (39%), Gaps = 49/322 (15%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
            P++   T+  GN     +   + +GTP     +  D GSDL W  C  CVR        
Sbjct: 91  LPAKDGSTLGSGN-----YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTC------ 139

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSS 198
               D+    ++PS S++  ++SCS   C       G +       C Y +  Y + + S
Sbjct: 140 ---YDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ-YGDQSFS 195

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G L ++   L +       + V   V  GCG + + G   GVA  GL+GLG  ++S PS
Sbjct: 196 VGFLAKEKFTLTN-------SDVFDGVYFGCG-ENNQGLFTGVA--GLLGLGRDKLSFPS 245

Query: 259 LLAKAGLIRNSFSMCFDKDDS--GRIFFGDQG----------PATQQSTSFLASNGKYIT 306
             A A      FS C     S  G + FG  G                TSF   N   IT
Sbjct: 246 QTATA--YNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAIT 303

Query: 307 YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
             +G +   I S+        A++DSG+  T LP + Y  + + F  +++   T+     
Sbjct: 304 --VGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSI 359

Query: 367 WKCCYKSSSQRLPKLPSVKLMF 388
              C+  S  +   +P V   F
Sbjct: 360 LDTCFDLSGFKTVTIPKVAFSF 381


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/383 (24%), Positives = 159/383 (41%), Gaps = 58/383 (15%)

Query: 12  VFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQ 71
           +F+L   + G     FS ++IHR S          +R+    P +  F+       ++  
Sbjct: 19  IFYLEAFNGG-----FSVEMIHRDS----------SRSPFFSPTETQFQRV-----ANAV 58

Query: 72  KQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC 131
            + +         F S  S   ++ +  G    ++  +GTP++     LD GSD++W+ C
Sbjct: 59  HRSINRANHLNQSFVSPNSPETTVISALGEYLISY-SVGTPSLQVFGILDTGSDIIWLQC 117

Query: 132 D-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYT 188
             C +C   +   ++S          S S T K L C    C    GT C + K  C Y+
Sbjct: 118 QPCKKCYEQTTPIFDS----------SKSQTYKTLPCPSNTCQSVQGTFCSSRKH-CLYS 166

Query: 189 MDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLI 247
           + +Y + + S G L  + L L  G  N   + VQ    +IGCG   + G  +     G++
Sbjct: 167 I-HYVDGSQSLGDLSVETLTL--GSTNG--SPVQFPGTVIGCGRYNAIGIEE--KNSGIV 219

Query: 248 GLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQGPATQQ---STSFLASN 301
           GLG G +S+ + L+ +      FS C        S ++ FG+    + +   ST   + N
Sbjct: 220 GLGRGPMSLITQLSPS--TGGKFSYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKN 277

Query: 302 GKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
           G  + Y + +E   +G + ++  S         I+DSG++ T LP  VY  + A   + V
Sbjct: 278 G-LVFYFLTLEAFSVGRNRIEFGSPGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTV 336

Query: 356 NDTITSFEGYPWKCCYKSSSQRL 378
                         CYK +  +L
Sbjct: 337 ILQRVRDPNQVLGLCYKVTPDKL 359


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 104/417 (24%), Positives = 170/417 (40%), Gaps = 72/417 (17%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVM--FSTKLIHRFSEEVKALGVSKNRNATSWPAKKS 58
           MN +S  + L+ F+L    S ++ V   FS +LIHR S +      ++N+          
Sbjct: 1   MNTVSF-LTLSFFFLCFSISFSQAVSNGFSIELIHRDSSKSPFYKPTQNK---------- 49

Query: 59  FEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLV 118
              YQ ++ +  +            L  +  S  +S   D+  + Y+   +GTP +    
Sbjct: 50  ---YQHVVDAVHRSINRVNHSNKNSLASTPESTVISYEGDY-IMSYS---VGTPPIKSYG 102

Query: 119 ALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LG 175
            +D GSD++W+ C+ C +C       YN      N   PS SS+ K++SCS +LC     
Sbjct: 103 IVDTGSDIVWLQCEPCEQC-------YNQTTPKFN---PSKSSSYKNISCSSKLCQSVRD 152

Query: 176 TSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVIIGCGMKQS 234
           TSC N K+ C Y+++Y  ++ S   L +E + L   +G   +   +V     IGCG    
Sbjct: 153 TSC-NDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTV-----IGCGTNNI 206

Query: 235 GGY--------LDGVAPDGLIGLGLGEISVPSLLAKAG--LIRNSFSMCFDKDDSGRIFF 284
           G +          G  P  LI   LG    PS+  K    L+R S ++      S ++ F
Sbjct: 207 GSFKRVSSGVVGLGGGPASLI-TQLG----PSIGGKFSYCLVRMSITLKNMSMGSSKLNF 261

Query: 285 GDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA----------IVD 331
           GD    +     ST  +  +  +  Y+  +E   +G    K+  F            I+D
Sbjct: 262 GDVAIVSGHNVLSTPIVKKDHSFFYYLT-IEAFSVGD---KRVEFAGSSKGVEEGNIIID 317

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           S +  TF+P +VY  + +     V           +  CY  SS      P +   F
Sbjct: 318 SSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPYMTAHF 374


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/282 (27%), Positives = 111/282 (39%), Gaps = 53/282 (18%)

Query: 107 IDIGTPNVSFL-VALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP    + + LD GSDL+W  C C  C       +++L          AS T+  +
Sbjct: 104 LSIGTPRPQRVALTLDTGSDLVWTQCACHVCFAQPFPTFDAL----------ASQTTLAV 153

Query: 166 SCSHRLCDLG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS----GGDNAL 217
            CS  +C  G    + C      C Y  D Y + + +SG +VED     S     G  A 
Sbjct: 154 PCSDPICTSGKYPLSGCTFNDNTCFYLYD-YADKSITSGRIVEDTFTFRSPQGNNGSKAH 212

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                 +V  GCG    G +    +  G+ G   G +S+PS L  A      FS CF   
Sbjct: 213 AGVAVPNVRFGCGQYNKGIFKSNES--GIAGFSRGPMSLPSQLKVA-----RFSHCFTAI 265

Query: 278 DSGR---IFFGDQ-GP--------ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
              R   +F G   GP           QST F  SNG    Y + ++   +G + L   +
Sbjct: 266 ADARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSL--YYLTLKGITVGKTRLPLNA 323

Query: 326 FK------------AIVDSGSSFTFLPKEVYETIAAEFDRQV 355
                          I+DSG+    LP  +Y ++ A F  +V
Sbjct: 324 LAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARV 365


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 124/303 (40%), Gaps = 49/303 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL--------------DRDLN 152
           +  GTP + + + LD  +DL WI C   R       +Y                  R  N
Sbjct: 131 VRFGTPALPYNLVLDTANDLTWINCRLRR---RKGKHYGRTMSVGAGDDGAAAKEARRKN 187

Query: 153 EYSPSASSTSKHLSCSHRLCDL--GTSCQNP--KQPCPYTMDYYTENTSSSGLL-VEDIL 207
            Y P+ SS+ + + CS + C L    +CQ+P   + C Y      + T + G+   E   
Sbjct: 188 WYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQ-MQDGTLTMGIYGKEKAT 246

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
             +S G    + +    +I+GC + ++GG +D  A DG++ LG GE+S     AK     
Sbjct: 247 VTVSDG----RMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FG 298

Query: 268 NSFSMCF-----DKDDSGRIFFGDQ----GPATQQS-----TSFLASNGKYITYI-IGVE 312
             FS C       +D S  + FG      GP T ++          + G  +T I +G E
Sbjct: 299 QRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGE 358

Query: 313 TCCIGSS---CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
              I        K      I+D+ +S T L  E Y  + +  DR ++     +E   ++ 
Sbjct: 359 RLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEY 418

Query: 370 CYK 372
           CY+
Sbjct: 419 CYR 421


>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
          Length = 947

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 135/315 (42%), Gaps = 52/315 (16%)

Query: 100 GW-LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 157
           GW  H+ ++  GTP     V +D GS     PC +C  C   +  +++           S
Sbjct: 122 GWGTHFAYVYAGTPPQRVSVIIDTGSHFTAFPCSECENCGSHTDPHWDQ----------S 171

Query: 158 ASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
            S++S  ++C    C     CQ  K+ C ++   Y+E +S     VED+L +   G+  L
Sbjct: 172 KSTSSHIVTCED--CHGSFRCQKDKR-CGFSQ-RYSEGSSWRAYQVEDVLWV---GELTL 224

Query: 218 KNSVQ---------ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
           + S +            + GC   Q+G +   +A DG++G+     ++   LAKAG I+ 
Sbjct: 225 QQSEKINHDESAYSVEFMFGCIESQTGLFKTQLA-DGIMGMSADSHTLVWQLAKAGKIKE 283

Query: 269 -SFSMCFDKDDSGRIFFG-----DQGPATQQSTSFLASNGKYITYI--IGVETCCIGSS- 319
            +FS+CF K+    +  G     ++       T    +NG +   +  I V    I    
Sbjct: 284 RTFSLCFGKNGGTMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQDP 343

Query: 320 CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS------ 373
            + Q     IVDSG++ T+LP+ V +  +A ++R          G P+  C  +      
Sbjct: 344 AIFQRGKGIIVDSGTTDTYLPRSVAKGFSAAWERAT--------GSPYANCKDNHFCMIL 395

Query: 374 SSQRLPKLPSVKLMF 388
           +S  L  LP+V +  
Sbjct: 396 TSAELEALPTVTIHM 410


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 136/318 (42%), Gaps = 40/318 (12%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + +GTP    +V LD GSD  W+ C  C  C       Y   D     + P+ASST   +
Sbjct: 143 LRLGTPATELVVELDTGSDQSWVQCKPCADC-------YEQRD---PVFDPTASSTYSAV 192

Query: 166 SCSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            C  R C              +  + CPY +  Y +++ + G L  D L L      +  
Sbjct: 193 PCGARECQELASSSSSRNCSSDNNKNCPYEVS-YDDDSHTVGDLARDTLTLSPSPSPSPA 251

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
           ++V    + GCG   +G + +    DGL+GLGLG+ S+PS +  A     +FS C     
Sbjct: 252 DTVPG-FVFGCGHSNAGTFGE---VDGLLGLGLGKASLPSQV--AARYGAAFSYCLPSSP 305

Query: 279 SGRIFFGDQGPATQQSTSF--LASNGKYITYIIGVETCCIGSSCLK------QTSFKAIV 330
           S   +    G A + +  F  + +     +Y + +    +    +K       T+   I+
Sbjct: 306 SAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTII 365

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKL 386
           DSG++F+ LP   Y  + + F   +      ++  P    +  CY  +     ++P+V+L
Sbjct: 366 DSGTAFSRLPPSAYAALRSSFRSAMGR--YRYKRAPSSPIFDTCYDFTGHETVRIPAVEL 423

Query: 387 MFPQNNSFVVNNPVFVIY 404
           +F  + + V  +P  V+Y
Sbjct: 424 VF-ADGATVHLHPSGVLY 440


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/334 (24%), Positives = 134/334 (40%), Gaps = 59/334 (17%)

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYN 145
           + G + +++GN     +   + +GTP  +  + LD  +D  W PC  C+ C+        
Sbjct: 84  ASGQQVLNVGN-----YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCS-------- 130

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQ-PCPYTMDYYTENTSSSGLL 202
                   +S   SST   L CS   C    G SC       C +   Y  ++T S+  L
Sbjct: 131 ----STTTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQTYGGDSTFSA-TL 185

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           V+D LHL   G N + N        GC    SG     + P GL+GLG G +   SL+++
Sbjct: 186 VQDSLHL---GPNVIPN-----FSFGCISSASG---SSIPPQGLMGLGRGPL---SLISQ 231

Query: 263 AG-LIRNSFSMCFDKDD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCI 316
           +G L    FS C         SG +  G  G P   ++T  L +  +   Y + +    +
Sbjct: 232 SGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISV 291

Query: 317 GSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
           G   +            T    I+DSG+  T     +Y  +  EF +QV  + +    + 
Sbjct: 292 GRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSPLGAF- 350

Query: 367 WKCCYKSSSQ-RLP----KLPSVKLMFPQNNSFV 395
              C+ ++++   P     L  + L  P  NS +
Sbjct: 351 -DTCFATNNEVSAPAITLHLSGLDLKLPMENSLI 383


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/299 (25%), Positives = 124/299 (41%), Gaps = 50/299 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP    L+A+D  SD+ WIPC  CV C   +A            +SP+ S++ K++SC
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 152

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           S   C    +     + C + + Y + + +++  L +D + L +    A           
Sbjct: 153 SAPQCKQVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKAFT--------F 202

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIF 283
           GC  K +GG   G  P     LGLG   +  +     + +++FS C         SG + 
Sbjct: 203 GCVNKVAGG---GTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLR 259

Query: 284 FGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDS 332
            G    P   + T  L +  +   Y + +    +G   +            T    I DS
Sbjct: 260 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 319

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDT---ITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           G+ +T L K VYE +  EF ++V  T   +TS  G+    CY        K+P++  MF
Sbjct: 320 GTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGF--DTCYSGQV----KVPTITFMF 372


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 124/303 (40%), Gaps = 49/303 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL--------------DRDLN 152
           +  GTP + + + LD  +DL WI C   R       +Y                  R  N
Sbjct: 131 VRFGTPALPYNLVLDTANDLTWINCRLRR---RKGKHYGRTMSVGAGDDGAAAKEARRKN 187

Query: 153 EYSPSASSTSKHLSCSHRLCDL--GTSCQNP--KQPCPYTMDYYTENTSSSGLL-VEDIL 207
            Y P+ SS+ + + CS + C L    +CQ+P   + C Y      + T + G+   E   
Sbjct: 188 WYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQ-MQDGTLTMGIYGKEKAT 246

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
             +S G    + +    +I+GC + ++GG +D  A DG++ LG GE+S     AK     
Sbjct: 247 VTVSDG----RMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FG 298

Query: 268 NSFSMCF-----DKDDSGRIFFGDQ----GPATQQS-----TSFLASNGKYITYI-IGVE 312
             FS C       +D S  + FG      GP T ++          + G  +T I +G E
Sbjct: 299 QRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGE 358

Query: 313 TCCIGSS---CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
              I        K      I+D+ +S T L  E Y  + +  DR ++     +E   ++ 
Sbjct: 359 RLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEY 418

Query: 370 CYK 372
           CY+
Sbjct: 419 CYR 421


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 113/287 (39%), Gaps = 59/287 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++ GTP  +    +D GS L+W PC     C RC      + N     +  + P  SS+S
Sbjct: 96  LNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRC-----DFPNIEVTGIPTFIPKQSSSS 150

Query: 163 KHLSCSHRLCD--LGTSCQNPKQPC------------PYTMDYYTENTSSSGLLVEDILH 208
             + C +  C    G   Q+  Q C            PY + Y   +T  +GLL+ + L 
Sbjct: 151 NLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGST--AGLLLSETL- 207

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
                D   K ++    ++GC +           P+G+ G G    S+PS L        
Sbjct: 208 -----DFPHKKTIPG-FLVGCSL------FSIRQPEGIAGFGRSPESLPSQLGLKKFSYC 255

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYIT---------YIIGVETCCIGSS 319
             S  FD   +      D G  +  + +   S   +           Y + +    IG +
Sbjct: 256 LVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDT 315

Query: 320 CLKQTSFK-----------AIVDSGSSFTFLPKEVYETIAAEFDRQV 355
            +K   +K            IVDSG++FTF+ K VYE +A EF++QV
Sbjct: 316 HVK-VPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQV 361


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 107/440 (24%), Positives = 179/440 (40%), Gaps = 86/440 (19%)

Query: 23  ETVMFSTKLIHRFSEEVKALGVSKNRNATSWPA--KKSFEYYQVLLSSDVQKQKMKTGPQ 80
           +TV  + K     +E+ +++GVSK ++        K+  E       S ++KQ+ K  PQ
Sbjct: 90  QTVKLNLKRRSAGTEKKESVGVSKMKDLARIQTLYKRMTEKKNQNTVSRLKKQQSK--PQ 147

Query: 81  FQM----------LFPSQGSKTMSLGNDFGWLHYTWIDI--GTPNVSFLVALDAGSDLLW 128
                        +F  Q   T+  G   G   Y +ID+  GTP   F + LD GSDL W
Sbjct: 148 VAPPAAAPESSASVFSGQLIATLESGVSLGSGEY-FIDVFVGTPPKHFSLILDTGSDLNW 206

Query: 129 IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS------CQNPK 182
           I   CV C       Y   +++   Y P  SS+ +++ C    C L +S      C+   
Sbjct: 207 I--QCVPC-------YECFEQNGPHYDPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAEN 257

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHL---ISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
           Q CPY   +Y ++++++G    +   +   +S G   L+     +V+ GCG    G +  
Sbjct: 258 QTCPYYY-WYGDSSNTTGDFALETFTVNLTMSSGKPELRRV--ENVMFGCGHWNRGLFHG 314

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQS 294
                 L+GLG G +S  S L    L  +SFS C      D + S ++ FG+        
Sbjct: 315 AAG---LLGLGRGPLSFSSQLQS--LYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHP 369

Query: 295 ----TSFLASNGKYIT--YIIGVETCCIGSSCLKQTSFK----------AIVDSGSSFTF 338
               T+ +A     +   Y + +++  +G   +     K           I+DSG++ ++
Sbjct: 370 ELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSY 429

Query: 339 LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLM---- 387
             +  Y+ I   F  +V       +GYP        + CY  +    P LP   ++    
Sbjct: 430 FAEPAYQVIKEAFMAKV-------KGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDG 482

Query: 388 ----FPQNNSFVVNNPVFVI 403
               FP  N F+   P  V+
Sbjct: 483 AVWNFPVENYFIEIEPREVV 502


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 117/290 (40%), Gaps = 50/290 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + +GTP       +D GS++ W  C  CV C   +A  ++          PS SST K  
Sbjct: 384 LQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFD----------PSKSSTFKEK 433

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C                 CPY +DY+ + T + G L  D + + S         V A  
Sbjct: 434 RCH-------------DHSCPYEVDYF-DKTYTKGTLATDTVTIHSTSGEPF---VMAET 476

Query: 226 IIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIF 283
           IIGCG   S        P  +G +GL  G +S+  +    G      S CF  + + +I 
Sbjct: 477 IIGCGRNNS-----WFRPSFEGFVGLNWGPLSL--ITQMGGEYPGLMSYCFAGNGTSKIN 529

Query: 284 FGDQ---GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSG 333
           FG     G     ST+   +  +   Y + ++   +G + ++   T F A     ++DSG
Sbjct: 530 FGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSG 589

Query: 334 SSFTFLPKEVYETIAAEFDRQVNDTITSFE--GYPWKCCYKSSSQRLPKL 381
           ++ T+ P E Y  +  +    V   + + +  G    C Y ++++  P +
Sbjct: 590 TTLTYFP-ESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVI 638



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 107/287 (37%), Gaps = 64/287 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP       LD GS+L+W  C  C+ C           D+    + PS SST K  
Sbjct: 69  LQIGTPPFEVEAVLDTGSELIWTQCLPCLHC----------YDQKAPIFDPSKSSTFKE- 117

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQAS 224
                     T C  P   CPY + Y  ++ +   L  E + +H  SG        V   
Sbjct: 118 ----------TRCNTPDHSCPYKLVYDDKSYTQGTLATETVTIHSTSG-----VPFVMPE 162

Query: 225 VIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 282
            IIGC    SG    G  P   G++GL  G +S+ S +  A                   
Sbjct: 163 TIIGCSRNNSG---SGFRPSSSGIVGLSRGSLSLISQMGGA------------------- 200

Query: 283 FFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGSS 335
           + GD       ST+  A   K   Y + ++   +G + ++   T F A     ++DSG+ 
Sbjct: 201 YPGDG----VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTP 256

Query: 336 FTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKL 381
            T+ P      +    +R V  D +         C Y ++ +  P +
Sbjct: 257 LTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEIFPVI 303


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 122/293 (41%), Gaps = 65/293 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +  GTP+ +    +D GS L+W PC     C RC     S+ N     +  + P  SS++
Sbjct: 94  LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRC-----SFPNIDPAKIPTFIPKLSSSA 148

Query: 163 KHLSCSHRLC------DLGTSC-------QNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
           K + C +  C      ++ T C        N  + CP     Y   T+   LL+E ++  
Sbjct: 149 KIVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLV-- 206

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
                       +   ++GC +      L    P G+ G G G  S+P    + GL + S
Sbjct: 207 -------FAERTEPDFVVGCSI------LSSRQPSGIAGFGRGPSSLPK---QMGLKKFS 250

Query: 270 FSMCFDK-DDSGR-----IFFG----DQGPATQQSTSF----LASNGKYIT-YIIGVETC 314
           + +   + DDS +     ++ G    D        T F    ++SN  +   Y + +   
Sbjct: 251 YCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHI 310

Query: 315 CIGSSCLKQT-SFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQVND 357
            +G   +K   SF           IVDSGS+FTF+ K V+E +A EFDRQ+ +
Sbjct: 311 IVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMAN 363


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 93/393 (23%), Positives = 157/393 (39%), Gaps = 57/393 (14%)

Query: 45  SKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGW--- 101
           S N +    P   SF+ +  + SS  +    K  P F+ +  ++ S+  +     GW   
Sbjct: 18  SINVHCEKQPVSSSFDKHDNVSSSLAELFSGKRIPLFRYI-SNKTSRLSTQAVQVGWDRG 76

Query: 102 ----LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPS 157
               L+   + +GTP  + +V +D GS   W+ C+C  C     ++             S
Sbjct: 77  LQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------S 125

Query: 158 ASSTSKHLSCSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISG 212
            S+T   +SC   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L     
Sbjct: 126 RSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF--- 181

Query: 213 GDNALKNSVQA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
                 + VQ   S   GC +   G    G   DGL+G+G G +SV   L ++    + F
Sbjct: 182 ------SDVQKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGF 231

Query: 271 SMCFDKDDSGRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSC 320
           S C     S R FF         G     T  + T  +A       + + +    +    
Sbjct: 232 SYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGER 291

Query: 321 LKQT----SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
           L  +    S K +V DSGS  +++P      ++    R++     + E    + CY   S
Sbjct: 292 LGLSPSIFSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRS 350

Query: 376 QRLPKLPSVKLMFPQNNSFVV-NNPVFVIYGTQ 407
                +P++ L F     F + ++ VFV    Q
Sbjct: 351 VDEGDMPAISLHFDDGARFDLGSHGVFVERSVQ 383


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 80/342 (23%), Positives = 143/342 (41%), Gaps = 55/342 (16%)

Query: 85  FPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSAS 142
           F    SK +S G D G   Y   + +G+P     + +D+GSD++W+ C  C+ C      
Sbjct: 153 FSGSESKVVS-GLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLEC------ 205

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNPK-QPCPYTMDYYTENTSSS 199
            Y   D     + P+ S+T   +SC   +C +   ++C + +   C Y +  Y + + + 
Sbjct: 206 -YVQAD---PLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVS-YADGSYTK 260

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G L  + L L   G  A++      V+IGCG +  G +   V   GL+GLG G +S+   
Sbjct: 261 GALALETLTL---GGTAVEG-----VVIGCGHRNRGLF---VGAAGLMGLGWGPMSLVGQ 309

Query: 260 LAKAGLIRNSFSMCF----------DKDDSGRIFFGDQGPATQQSTSF-LASNGKYIT-Y 307
           L   G +  +FS C             DD+G +  G      + +    L  N +  + Y
Sbjct: 310 L--GGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFY 367

Query: 308 IIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
            +G+    +G   L          +  +   ++D+G++ T LP+E Y  +   F   +  
Sbjct: 368 YVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAG 427

Query: 358 TITSFEGYP---WKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
            +   +G        CY  S     ++P+V   F  +   ++
Sbjct: 428 AVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGDARLIL 469


>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
          Length = 410

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/258 (25%), Positives = 109/258 (42%), Gaps = 24/258 (9%)

Query: 96  GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   H+   ++I  P   + + +D GS L W+ CD  C+ C  +    Y        
Sbjct: 30  GNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKP------ 83

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHL-I 210
           E   +   T +   C+    DL    +  PK  C Y + Y     SS G+L+ D   L  
Sbjct: 84  ELKYAVKCTEQR--CADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPA 139

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RN 268
           S G N        S+  GCG  Q     +   P +G++GLG G++++ S L   G+I ++
Sbjct: 140 SNGTNP------TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKH 193

Query: 269 SFSMCFDKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK 327
               C      G +FFGD + P +  + S +    K+ +   G       S  +     +
Sbjct: 194 VLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNSKPISAAPME 253

Query: 328 AIVDSGSSFTFLPKEVYE 345
            I DSG+++T+   + Y 
Sbjct: 254 VIFDSGATYTYFALQPYH 271


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 134/313 (42%), Gaps = 57/313 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IG+P  +  + LD GS+L W+ C   +  P   S +N L    + Y+P+  ++S    
Sbjct: 63  LTIGSPPQNVTMVLDTGSELSWLHC---KKLPNLNSTFNPLLS--SSYTPTPCNSS---V 114

Query: 167 CSHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C  R  DL    SC +P     + +  Y + +S+ G L  +          +L  + Q  
Sbjct: 115 CMTRTRDLTIPASC-DPNNKLCHVIVSYADASSAEGTLAAETF--------SLAGAAQPG 165

Query: 225 VIIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS- 279
            + GC    S GY   +  D    GL+G+  G +S+ +      ++   FS C   +D+ 
Sbjct: 166 TLFGC--MDSAGYTSDINEDAKTTGLMGMNRGSLSLVT-----QMVLPKFSYCISGEDAF 218

Query: 280 GRIFFGD--QGPATQQSTSFLASNGK-----YITYIIGVETCCIGSSCLK--QTSF---- 326
           G +  GD    P+  Q T  + +         + Y + +E   +    L+  ++ F    
Sbjct: 219 GVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDH 278

Query: 327 ----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKSSS 375
               + +VDSG+ FTFL   VY ++  EF  Q    +T        FEG     CY + +
Sbjct: 279 TGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGA-MDLCYHAPA 337

Query: 376 QRLPKLPSVKLMF 388
             L  +P+V L+F
Sbjct: 338 S-LAAVPAVTLVF 349


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 91/337 (27%), Positives = 141/337 (41%), Gaps = 45/337 (13%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P++   T+  GN     +   + +GTP     +  D GSDL W      +C P +   Y
Sbjct: 118 IPAKSGATIGSGN-----YIVSVGLGTPKKYLSLIFDTGSDLTW-----TQCQPCARYCY 167

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQ---NPKQPCPYTMDYYTENTSS 198
           N  D     + PS S+T  ++SCS   C   + GT  Q   +  + C Y +  Y + + S
Sbjct: 168 NQKDP---VFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQ-YGDQSFS 223

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G   ++ L L S         V  + + GCG    G  L G A  GLIGLG  +IS+  
Sbjct: 224 VGYFAKETLTLTS-------TDVIENFLFGCGQNNRG--LFGSAA-GLIGLGQDKISIVK 273

Query: 259 LLA-KAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
             A K G +   FS C  K  S      F G  G    + T    ++G    Y + +   
Sbjct: 274 QTAQKYGQV---FSYCLPKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGM 330

Query: 315 CIG------SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK 368
            +G      SS +  TS  AI+DSG+  T LP + Y  + + F++ +     + E     
Sbjct: 331 KVGGTQIPISSSVFSTS-GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILD 389

Query: 369 CCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 405
            CY  S     ++P V  +F       ++  + ++YG
Sbjct: 390 TCYDLSKYSTIQIPKVGFVFKGGEELDLDG-IGIMYG 425


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 112/290 (38%), Gaps = 53/290 (18%)

Query: 100 GWLHYTWID--IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSP 156
           G L Y  ID  IGTP       LD GSDL+W  C  C  C          L +    ++P
Sbjct: 99  GDLEY-LIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASC----------LAQPDPLFAP 147

Query: 157 SASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
           +ASS+   + CS +LC+  L  SCQ P   C Y  +Y    T+      E      S G+
Sbjct: 148 AASSSYVPMRCSGQLCNDILHHSCQRPDT-CTYRYNYGDGTTTLGVYATERFTFASSSGE 206

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
                 +   +  GCG    G   +G    G++G G   +S+ S L+    IR  FS C 
Sbjct: 207 K-----LSVPLGFGCGTMNVGSLNNG---SGIVGFGRDPLSLVSQLS----IRR-FSYCL 253

Query: 275 DKDDSGR------------IFFGDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCL 321
               S R            +F GD     Q Q+T  L S      Y +      +G+  L
Sbjct: 254 TPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRL 313

Query: 322 K----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 361
           +            S   IVDSG++ T  P  V   +   F  Q+    TS
Sbjct: 314 RIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTS 363


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 76/303 (25%), Positives = 126/303 (41%), Gaps = 28/303 (9%)

Query: 94  SLGNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDL 151
           +LG     L Y   + IG+P V+  +++D GSD+ W+ C  C +C     S ++      
Sbjct: 112 TLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSST 171

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
                 +S+    LS S      G  C + +  C Y ++Y   ++++     + +     
Sbjct: 172 YSPFSCSSAPCAQLSQSQE----GNGCMSSQ--CQYIVNYGDSSSTTGTYSSDTL----- 220

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
                L +S       GC   +SGG+ D    DGL+GLG G  S+ S    AG    +FS
Sbjct: 221 ----TLGSSAMTDFQFGCSQSESGGFNDQT--DGLMGLGGGAQSLAS--QTAGTFGTAFS 272

Query: 272 MCF--DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            C       SG +  G  G +    T  L S      Y++ +E+  +GS  L       S
Sbjct: 273 YCLPPTSGSSGFLTLG-TGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFS 331

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
             +++DSG+  T LP   Y  +++ F   +     +        C+  S Q    +P+V 
Sbjct: 332 AGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVT 391

Query: 386 LMF 388
           L+F
Sbjct: 392 LVF 394


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 90/332 (27%), Positives = 130/332 (39%), Gaps = 56/332 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++    +GTP   F + +D GSDL +     V+CAP    Y    ++D   Y PS SST 
Sbjct: 34  YFVDFSLGTPEQKFHLIVDTGSDLAF-----VQCAPCDLCY----EQDGPLYQPSNSSTF 84

Query: 163 KHLSCSHRLCDL-----GTSCQN------PKQPCPYTMDYYTENTSSSGLLVEDILHLIS 211
             + C    C L     G  C +      P+  C Y    Y +N+S+ G+   +   +  
Sbjct: 85  TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYR-YGDNSSTVGVFAYETATV-- 141

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
           GG           V  GCG +  G +   V+  G++GLG G +S  S    A    N F+
Sbjct: 142 GGIRV------NHVAFGCGNRNQGSF---VSAGGVLGLGQGALSFTSQAGYA--FENKFA 190

Query: 272 MCFDKDDS-----GRIFFGDQGPATQQSTSF--LASNG--------KYITYIIGVETCCI 316
            C     S       + FGD   +T     F  L SN         + +    G ET  I
Sbjct: 191 YCLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLI 250

Query: 317 GSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQV--NDTITSFEGYPWKCCY 371
             S  K  S      I DSG++ T+   + Y  I A F++ V       S +G P   C 
Sbjct: 251 PDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL--CV 308

Query: 372 KSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
             S    P  PS  + F Q  ++  N   + I
Sbjct: 309 NVSGIDHPIYPSFTIEFDQGATYRPNQGNYFI 340


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 87/363 (23%), Positives = 139/363 (38%), Gaps = 54/363 (14%)

Query: 50  ATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDI 109
           A S   K     Y+ ++       K    PQ     P    + +S  N     +   +  
Sbjct: 76  AVSESIKGDTARYRAMVKGGWSAGKTMVNPQEDADIPLASGQAISSSN-----YIIKLGF 130

Query: 110 GTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           GTP  SF   LD GS++ WIPC+ C  C+                + PS SST  +L+C+
Sbjct: 131 GTPPQSFYTVLDTGSNIAWIPCNPCSGCS-----------SKQQPFEPSKSSTYNYLTCA 179

Query: 169 HRLCDLGTSCQNPKQP--CPYTMDYYTENTSSSGLLVEDIL--HLISGGDNALKNSVQAS 224
            + C L   C        C  T  Y  ++       V++IL    +S G   ++N     
Sbjct: 180 SQQCQLLRVCTKSDNSVNCSLTQRYGDQSE------VDEILSSETLSVGSQQVEN----- 228

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSG 280
            + GC     G  L    P  L+G G   +S  S    A L  ++FS C    F    +G
Sbjct: 229 FVFGCSNAARG--LIQRTPS-LVGFGRNPLSFVS--QTATLYDSTFSYCLPSLFSSAFTG 283

Query: 281 RIFFGDQGPATQQ-STSFLASNGKYIT-YIIGVETCCIGSSCL----------KQTSFKA 328
            +  G +  + Q    + L SN +Y + Y +G+    +G   +          + T    
Sbjct: 284 SLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGT 343

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           I+DSG+  T L +  Y  +   F  Q+++   +     +  CY   S  + + P + L F
Sbjct: 344 IIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDV-EFPLITLHF 402

Query: 389 PQN 391
             N
Sbjct: 403 DDN 405


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 91/391 (23%), Positives = 155/391 (39%), Gaps = 77/391 (19%)

Query: 30  KLIHRFS--------EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
           KL HR+S         E   LG+SK+             + Q L+  + ++ +   G   
Sbjct: 25  KLQHRYSGLEGSSKQNEKLGLGMSKH-------------HLQHLVEHNDRRGRFLQG--- 68

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRC---- 136
            + FP +G+ +     D G L+YT I +G P     V +D GSD+LW+ C  C  C    
Sbjct: 69  -ISFPLKGNYS-----DLG-LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQ 121

Query: 137 ---APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT 193
               PLS    ++              T +   CS                C Y + Y  
Sbjct: 122 DIIPPLSIYNLSASSTSSVSSCSDPLCTGEQAVCSR---------SGSNSACAYGISYQD 172

Query: 194 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 253
           ++TS    + +D+ +++ GG     N+  + +  GC +  +G +      DG++G G   
Sbjct: 173 KSTSIGAYVKDDMHYVLQGG-----NATTSHIFFGCAINITGSW----PADGIMGFGQIS 223

Query: 254 ISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV 311
            +VP+ +A    +   FS C   +K   G + FG++   T+   + L +   +  Y + +
Sbjct: 224 KTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEEPNTTEMVFTPLLNVTTH--YNVDL 281

Query: 312 ETCCIGSSCL----KQTSF--------KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
            +  + S  L    K+ S+          I+DSG+SF  L  +    + +E        +
Sbjct: 282 LSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKL 341

Query: 360 -TSFEGYPWKCCY-KSSSQRLPKLPSVKLMF 388
               EG   +C Y KS        P+V L F
Sbjct: 342 GPKLEG--LQCFYLKSGLTVETSFPNVTLTF 370


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 126/313 (40%), Gaps = 61/313 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP + +   +D GSDL+W  C  CV CA          D+    + P+ S+T + +
Sbjct: 96  LAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCA----------DQPTPYFRPARSATYRLV 145

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            C   LC  L       +  C Y   YY +  S++G+L  +      G  N+ K  V + 
Sbjct: 146 PCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTF--GAANSSKVMV-SD 201

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIF 283
           V  GCG   SG   +     G++GLG G +   SL+++ G  R S+ +  F   +  R+ 
Sbjct: 202 VAFGCGNINSGQLANS---SGMVGLGRGPL---SLVSQLGPSRFSYCLTSFLSPEPSRLN 255

Query: 284 FG----------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------- 326
           FG              +  QST  + +      Y + ++   +G   L            
Sbjct: 256 FGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDD 315

Query: 327 ---KAIVDSGSSFTFLPKEVYETIAAEFDRQV------NDTITSFEG-YPWKCCYKSSSQ 376
                 +DSG+S T+L ++ Y+ +  E    +      NDT    E  +PW         
Sbjct: 316 GTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPW--------- 366

Query: 377 RLPKLPSVKLMFP 389
             P  PSV +  P
Sbjct: 367 --PPPPSVAVTVP 377


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 84/322 (26%), Positives = 128/322 (39%), Gaps = 49/322 (15%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
            P++   T+  GN     +   + +GTP     +  D GSDL W  C  CVR        
Sbjct: 119 LPAKDGSTLGSGN-----YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR-------- 165

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSS 198
               D+    ++PS S++  ++SCS   C       G +       C Y +  Y + + S
Sbjct: 166 -TCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ-YGDQSFS 223

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G L ++   L +       + V   V  GCG + + G   GVA  GL+GLG  ++S PS
Sbjct: 224 VGFLAKEKFTLTN-------SDVFDGVYFGCG-ENNQGLFTGVA--GLLGLGRDKLSFPS 273

Query: 259 LLAKAGLIRNSFSMCFDKDDS--GRIFFGDQG----------PATQQSTSFLASNGKYIT 306
             A A      FS C     S  G + FG  G                TSF   N   IT
Sbjct: 274 QTATA--YNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAIT 331

Query: 307 YIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
             +G +   I S+        A++DSG+  T LP + Y  + + F  +++   T+     
Sbjct: 332 --VGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSI 387

Query: 367 WKCCYKSSSQRLPKLPSVKLMF 388
              C+  S  +   +P V   F
Sbjct: 388 LDTCFDLSGFKTVTIPKVAFSF 409


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 126/313 (40%), Gaps = 61/313 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP + +   +D GSDL+W  C  CV CA          D+    + P+ S+T + +
Sbjct: 96  LAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCA----------DQPTPYFRPARSATYRLV 145

Query: 166 SCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            C   LC  L       +  C Y   YY +  S++G+L  +      G  N+ K  V + 
Sbjct: 146 PCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTF--GAANSSKVMV-SD 201

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIF 283
           V  GCG   SG   +     G++GLG G +   SL+++ G  R S+ +  F   +  R+ 
Sbjct: 202 VAFGCGNINSGQLANS---SGMVGLGRGPL---SLVSQLGPSRFSYCLTSFLSPEPSRLN 255

Query: 284 FG----------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------- 326
           FG              +  QST  + +      Y + ++   +G   L            
Sbjct: 256 FGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDD 315

Query: 327 ---KAIVDSGSSFTFLPKEVYETIAAEFDRQV------NDTITSFEG-YPWKCCYKSSSQ 376
                 +DSG+S T+L ++ Y+ +  E    +      NDT    E  +PW         
Sbjct: 316 GTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPW--------- 366

Query: 377 RLPKLPSVKLMFP 389
             P  PSV +  P
Sbjct: 367 --PPPPSVAVTVP 377


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 81/313 (25%), Positives = 127/313 (40%), Gaps = 53/313 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP ++F V  D GSDL+W  C  C +C            +    + P++SST   L
Sbjct: 90  ISVGTPLLTFPVVADTGSDLIWTQCAPCTKC----------FQQPAPPFQPASSSTFSKL 139

Query: 166 SCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            C+   C       N  + C  T    +Y   +  ++G L  + L +   GD +      
Sbjct: 140 PCTSSFCQF---LPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKV---GDASFP---- 189

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR- 281
            SV  GC  +       G +  G+ GLG G +   SL+ + G+ R  FS C     +   
Sbjct: 190 -SVAFGCSTENG----VGNSTSGIAGLGRGAL---SLIPQLGVGR--FSYCLRSGSAAGA 239

Query: 282 --IFFGDQGPATQ---QSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK-------- 327
             I FG     T    QST F+ +   + + Y + +    +G + L  T+          
Sbjct: 240 SPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGL 299

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLP-KLPS 383
               IVDSG++ T+L K+ YE +   F  Q  +  T         C+KS+       +PS
Sbjct: 300 GGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPS 359

Query: 384 VKLMFPQNNSFVV 396
           + L F     + V
Sbjct: 360 LVLRFDGGAEYAV 372


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 81/333 (24%), Positives = 136/333 (40%), Gaps = 69/333 (20%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P++    +  GN     ++  + +GTP     +  D GSDL W      +C P + S Y
Sbjct: 133 LPAKSGSLIGSGN-----YFVVVGLGTPKRDLSLIFDTGSDLTW-----TQCEPCARSCY 182

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS------CQNPKQPCPYTMDYYTENTS 197
              D     + PS S++  +++C+  LC  L T+      C    + C Y +  Y +++ 
Sbjct: 183 KQQDV---IFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQ-YGDSSF 238

Query: 198 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
           S G    + L + +         V  + + GCG + + G   G A  GLIGLG   IS  
Sbjct: 239 SVGYFSRERLTVTA-------TDVVDNFLFGCG-QNNQGLFGGSA--GLIGLGRHPISF- 287

Query: 258 SLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFL----ASNGKYITY------ 307
            +   A   R  FS C               P+T  ST  L    A+ G+Y+ Y      
Sbjct: 288 -VQQTAAKYRKIFSYCL--------------PSTSSSTGHLSFGPAATGRYLKYTPFSTI 332

Query: 308 -----IIGVETCCIGSSCLK----QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQV 355
                  G++   I    +K     ++F    AI+DSG+  T LP   Y  + + F + +
Sbjct: 333 SRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGM 392

Query: 356 NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           +   ++ E      CY  S  ++  +P+++  F
Sbjct: 393 SKYPSAGELSILDTCYDLSGYKVFSIPTIEFSF 425


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 89/349 (25%), Positives = 145/349 (41%), Gaps = 64/349 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP    L+ LD GSD++W+ C  C RC           D+    + P  S +
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRC----------YDQSGQVFDPRRSRS 191

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
              + CS  LC   D G  C   ++ C Y +  Y + + ++G    + L    G      
Sbjct: 192 YGAVGCSAPLCRRLDSG-GCDLRRKACLYQV-AYGDGSVTAGDFATETLTFAGG------ 243

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
            +  A + +GCG    G +   VA  GL+GLG G +S P+ +++      SFS C  D+ 
Sbjct: 244 -ARVARIALGCGHDNEGLF---VAAAGLLGLGRGSLSFPAQISR--RYGRSFSYCLVDRT 297

Query: 278 DSGR-------IFFGDQGPATQQSTSF--LASNGKYITY----IIGVETCCIGSSCLKQT 324
            S         + FG     +  + SF  +  N +  T+    ++G+       S +  +
Sbjct: 298 SSANPASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADS 357

Query: 325 SFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS 374
             +          IVDSG+S T L +  Y  +   F         S  G+  +  CY  S
Sbjct: 358 DLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLS 417

Query: 375 SQRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
            +++ K+P+V + F         P+N    V++     F   GT  GVS
Sbjct: 418 GRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVS 466


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/325 (24%), Positives = 128/325 (39%), Gaps = 48/325 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + IG P  S L+  D GSDL+W+ C  C  C+  S +           + P  SST
Sbjct: 84  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRHSST 134

Query: 162 SKHLSCSHRLC------DLGTSCQNPKQPCPYTMDY-YTENTSSSGLLVEDI--LHLISG 212
                C   +C      D    C + +       +Y Y + + +SGL   +   L   SG
Sbjct: 135 FSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSG 194

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRNS 269
            +  LK     SV  GCG + SG  + G +    +G++GLG G IS  S L +     N 
Sbjct: 195 KEARLK-----SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNK 247

Query: 270 FSMC-----FDKDDSGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLK 322
           FS C          +  +  G+ G    +   T  L +      Y + +++  +  + L+
Sbjct: 248 FSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLR 307

Query: 323 ----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
                       +   +VDSG++  FL +  Y ++ A   R+V   I       +  C  
Sbjct: 308 IDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVN 367

Query: 373 SSSQRLPK--LPSVKLMFPQNNSFV 395
            S    P+  LP +K  F     FV
Sbjct: 368 VSGVTKPEKILPRLKFEFSGGAVFV 392


>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 453

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 135/337 (40%), Gaps = 56/337 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           + IG P   + + +D+GSDL W+ CD  CV C                   P        
Sbjct: 72  LRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCT--------------KAPHPPYKPNKGP 117

Query: 165 LSCSHRLCDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           ++C+  +C          C+   + C Y + Y  ++ SS G+LV DI  L       L N
Sbjct: 118 ITCNDPMCSALHWPSKPPCKASHEQCDYEVSY-ADHGSSLGVLVHDIFSL------QLTN 170

Query: 220 SVQAS--VIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
              A+  +  GCG  QS  Y    AP   DG++GLG G+ S+ + L   GLIR+    C 
Sbjct: 171 GTLAAPRLAFGCGYDQS--YPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCL 228

Query: 275 DKDDSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSG 333
                G +F GD    T     + ++       Y +G                + + DSG
Sbjct: 229 SGRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSG 288

Query: 334 SSFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYP--WKCC------------YKSSSQR 377
           SS+T+   + Y+T  +   + +N  +  T+ E  P  W+              +K  +  
Sbjct: 289 SSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALS 348

Query: 378 LPKLPSVKLMFPQNNSFVV----NNPVFVIYGTQVGV 410
             K  S +L  P  +  ++    N  + ++ G++VG+
Sbjct: 349 FTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGL 385


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 134/329 (40%), Gaps = 72/329 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +  GTP+ +    +D GS L+W PC     C RC     S+ N     +  + P  SS++
Sbjct: 94  LSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRC-----SFPNIDPAKIPTFIPKLSSSA 148

Query: 163 KHLSCSHRLC------DLGTSC-------QNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
           K + C +  C      ++ T C        N  + CP     Y   T+   LL+E ++  
Sbjct: 149 KIVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLV-- 206

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
                       +   ++GC +      L    P G+ G G G  S+P    + GL + S
Sbjct: 207 -------FAERTEPDFVVGCSI------LSSRQPSGIAGFGRGPSSLPK---QMGLKKFS 250

Query: 270 FSMCFDK-DDSGR-----IFFG----DQGPATQQSTSF----LASNGKYIT-YIIGVETC 314
           + +   + DDS +     ++ G    D        T F    ++SN  +   Y + +   
Sbjct: 251 YCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHI 310

Query: 315 CIGSSCLK-QTSFKA---------IVDSGSSFTFLPKEVYETIAAEFDRQVND-----TI 359
            +G   +K   SF           IVDSGS+FTF+ K V+E +A EFDRQ+ +      +
Sbjct: 311 IVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADV 370

Query: 360 TSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            +  G   K C+  S      LPS+   F
Sbjct: 371 EALSG--LKPCFNLSGVGSVALPSLVFQF 397


>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/336 (25%), Positives = 133/336 (39%), Gaps = 54/336 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS---- 160
           + IG P   + + +D+GSDL W+ CD  CV C       Y      +    P  S+    
Sbjct: 39  LRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCSALHWP 98

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +      SH  CD   S              Y ++ SS G+LV DI  L       L N 
Sbjct: 99  SKPPCKASHEQCDYEVS--------------YADHGSSLGVLVHDIFSL------QLTNG 138

Query: 221 VQAS--VIIGCGMKQSGGYLDGVAP---DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             A+  +  GCG  QS  Y    AP   DG++GLG G+ S+ + L   GLIR+    C  
Sbjct: 139 TLAAPRLAFGCGYDQS--YPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLS 196

Query: 276 KDDSGRIFFGDQGPATQQST-SFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGS 334
               G +F GD    T     + ++       Y +G                + + DSGS
Sbjct: 197 GRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGS 256

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTI--TSFEGYP--WKCC------------YKSSSQRL 378
           S+T+   + Y+T  +   + +N  +  T+ E  P  W+              +K  +   
Sbjct: 257 SYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSF 316

Query: 379 PKLPSVKLMFPQNNSFVV----NNPVFVIYGTQVGV 410
            K  S +L  P  +  ++    N  + ++ G++VG+
Sbjct: 317 TKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGL 352


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 76/309 (24%), Positives = 121/309 (39%), Gaps = 50/309 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP       LD GS+ +W  C  CV C   +A  ++          PS SST K +
Sbjct: 69  LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFD----------PSKSSTFKEI 118

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQAS 224
            C                 CPY + Y  ++ +   L+ E + +H  SG     +  V   
Sbjct: 119 RCDTH-----------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSG-----QPFVMPE 162

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
            IIGCG   S G+  G A  G++GL  G  S+  +    G      S CF    + +I F
Sbjct: 163 TIIGCGRNNS-GFKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYCFAGKGTSKINF 217

Query: 285 GDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGS 334
           G           ST+      K   Y + ++   +G++ ++   T F A     ++DSGS
Sbjct: 218 GANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 277

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP-----SVKLMFP 389
           + T+ P+     +    ++ V  T   F      C Y  +    P +         L+  
Sbjct: 278 TLTYFPESYCNLVRKAVEQVV--TAVRFPRSDILCYYSKTIDIFPVITMHFSGGADLVLD 335

Query: 390 QNNSFVVNN 398
           + N +V +N
Sbjct: 336 KYNMYVASN 344


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 83/314 (26%), Positives = 123/314 (39%), Gaps = 56/314 (17%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +GTP    L+ +D GSD++W+ C  CV C       Y  L      Y P  SST
Sbjct: 99  YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHC-------YRQLS---PLYDPRGSST 148

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
                CS   C    +C      C Y +  Y + +S+SG L  D   L+   D ++ N  
Sbjct: 149 YAQTPCSPPQCRNPQTCDGTTGGCGYRI-VYGDASSTSGNLATD--RLVFSNDTSVGN-- 203

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSG 280
              V +GCG    G  L G A  GL+G+  G  S  + +A +      F+ C  D+  SG
Sbjct: 204 ---VTLGCGHDNEG--LFGSA-AGLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRSG 255

Query: 281 R----IFFGDQGPATQQST-SFLASNGK----YITYIIGVETCCIGSSCLKQTSFK---- 327
                + FG   P    S  + L SN +    Y   ++G        +     S      
Sbjct: 256 SSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPA 315

Query: 328 -----AIVDSGSSFTFLPKEVYETIAAEFD--------RQVNDTITSFEGYPWKCCYKSS 374
                 +VDSG+S T   ++ Y  +   FD        R+V   I+ F+      CY   
Sbjct: 316 TGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFD-----ACYDLR 370

Query: 375 SQRLPKLPSVKLMF 388
              +   P V L F
Sbjct: 371 GVAVADAPGVVLHF 384


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 76/309 (24%), Positives = 121/309 (39%), Gaps = 50/309 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP       LD GS+ +W  C  CV C   +A  ++          PS SST K +
Sbjct: 63  LQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFD----------PSKSSTFKEI 112

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQAS 224
            C                 CPY + Y  ++ +   L+ E + +H  SG     +  V   
Sbjct: 113 RCDTH-----------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSG-----QPFVMPE 156

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
            IIGCG   S G+  G A  G++GL  G  S+  +    G      S CF    + +I F
Sbjct: 157 TIIGCGRNNS-GFKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYCFAGKGTSKINF 211

Query: 285 GDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDSGS 334
           G           ST+      K   Y + ++   +G++ ++   T F A     ++DSGS
Sbjct: 212 GANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGS 271

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP-----SVKLMFP 389
           + T+ P+     +    ++ V  T   F      C Y  +    P +         L+  
Sbjct: 272 TLTYFPESYCNLVRKAVEQVV--TAVRFPRSDILCYYSKTIDIFPVITMHFSGGADLVLD 329

Query: 390 QNNSFVVNN 398
           + N +V +N
Sbjct: 330 KYNMYVASN 338


>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
          Length = 566

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 106/263 (40%), Gaps = 55/263 (20%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L+YT + +GTP   F V +D GSD+LW+ C      P ++     L   L+ + P  SS+
Sbjct: 131 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTS----ELQIQLSFFDPGVSSS 186

Query: 162 SKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           +  +SCS R C       + C +P   C Y+   Y + + +SG  + D +          
Sbjct: 187 ASLVSCSDRRCYSNFQTESGC-SPNNLCSYSFK-YGDGSGTSGYYISDFM---------- 234

Query: 218 KNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
                      C   QSG       A DG+ GLG G +SV S LA  GL    FS C   
Sbjct: 235 -----------CSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKG 283

Query: 275 DKDDSGRIFFGD-------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 321
           DK   G +  G                P    +   +A NG+ +     V T   G    
Sbjct: 284 DKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDG-- 341

Query: 322 KQTSFKAIVDSGSSFTFLPKEVY 344
                  I+D+G++  +LP E Y
Sbjct: 342 ------TIIDTGTTLAYLPDEAY 358


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 78/336 (23%), Positives = 132/336 (39%), Gaps = 50/336 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP   F +  D GS+L W+ C      P               + P AS + 
Sbjct: 91  YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLV------------FRPEASKSW 138

Query: 163 KHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-ISGGDNA 216
             + CS   C L       +C +   PC Y   Y   +  + G++  D   + + GG   
Sbjct: 139 APVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGG--- 195

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
            K +    V++GC     G     V  DG++ LG  +IS  S    A     SFS C   
Sbjct: 196 -KVAQLQDVVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASR--AAARFGGSFSYCLVD 250

Query: 275 ---DKDDSGRIFFGD-QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL-------KQ 323
               ++ +G + FG  Q P T  + + L  +     Y + V+   +    L         
Sbjct: 251 HLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP 310

Query: 324 TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR--LPKL 381
            S   I+DSG++ T L    Y+ + A   + +   +   +  P++ CY  ++ R   P++
Sbjct: 311 KSGGVILDSGTTLTVLATPAYKAVVAALTKLLAG-VPKVDFPPFEHCYNWTAPRPGAPEI 369

Query: 382 PSVKLMF-------PQNNSFVVN-NPVFVIYGTQVG 409
           P + + F       P   S+V++  P     G Q G
Sbjct: 370 PKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEG 405


>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 436

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 87/333 (26%), Positives = 140/333 (42%), Gaps = 69/333 (20%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + T I   TP V   + +D G    W+ CD         SY               SST 
Sbjct: 47  YTTQIKQRTPLVPINLTIDLGGGYFWVNCD--------KSY--------------VSSTL 84

Query: 163 KHLSCSHRLCDL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           K + CS   C L G+   + K+ C  +        S+SG +  DI+ + S   N     V
Sbjct: 85  KPILCSSSQCSLFGSHGCSDKKICGRSPYNIVTGVSTSGDIQSDIVSVQSTNGNYSGRFV 144

Query: 222 QAS---VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
                  I G  + Q+G    GV   G+ GLG  ++S+PS  + A   +N F++C    +
Sbjct: 145 SVPNFLFICGSNVVQNG-LAKGV--KGMAGLGRTKVSLPSQFSSAFSFKNKFAICLGTQN 201

Query: 279 SGRIFFGD-------------------QGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
            G +FFGD                     P +   +SFL    K + Y IGV++  + S 
Sbjct: 202 -GVLFFGDGPYLFNFDESKNLIYTPLITNPVSTSPSSFLGE--KSVEYFIGVKSIRVSSK 258

Query: 320 CLK-QTSFKAIVDSG---------SSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWK 368
            +K  T+  +I  +G         + +T +   +Y+ +A  F + +N  +++ E   P+ 
Sbjct: 259 NVKLNTTLLSIDQNGFGGTKISTVNPYTIMETSIYKAVADAFVKALN--VSTVEPVAPFG 316

Query: 369 CCYKS---SSQRL-PKLPSVKLMFPQNNSFVVN 397
            C+ S   SS R+ P +PS+ L+  QN + V N
Sbjct: 317 TCFASQSISSSRMGPDVPSIDLVL-QNENVVWN 348


>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
 gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
          Length = 406

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 126/313 (40%), Gaps = 62/313 (19%)

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIP---CDCVRCAPLSASYY 144
           +G   + L N     ++T I +GTP  +F V LD GS  LW+P   C  + C  L A   
Sbjct: 80  KGGHGVPLTNFMNAQYFTEITLGTPPQNFKVILDTGSSNLWVPSSKCTSIACF-LHA--- 135

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
                   +Y  SASST K               QN  +   +++ Y   + S  G + +
Sbjct: 136 --------KYDSSASSTYK---------------QNGTE---FSIQY--GSGSMEGFVSQ 167

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL----- 259
           D+L +   GD  +     A  +   G+  + G  DG+     +GLG   ISV  +     
Sbjct: 168 DVLTI---GDLTIPGQDFAEAVKEPGLTFAFGKFDGI-----LGLGYDTISVNHIVPPHY 219

Query: 260 -LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
            +   GL+     SF +   ++D G   FG    +  +         +   + + +E   
Sbjct: 220 NMINKGLLDEPVFSFRLGKSEEDGGEAIFGGVDKSAYKGDLTYVPVRRKAYWEVELEKIS 279

Query: 316 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
            GS  L+  S  A +D+G+S   LP ++ E I AE   + +          W   Y+   
Sbjct: 280 FGSEELELESTGAAIDTGTSLIALPTDMAEMINAEIGAKKS----------WNGQYQVEC 329

Query: 376 QRLPKLPSVKLMF 388
            ++P LP + L F
Sbjct: 330 SKVPDLPELSLYF 342


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 130/319 (40%), Gaps = 58/319 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASST 161
           ++  I++G P    LV +D GSDL+W+     +C P    Y     R +   Y P +SST
Sbjct: 88  YFAVINVGDPPTRALVVIDTGSDLIWL-----QCVPCRHCY-----RQVTPLYDPRSSST 137

Query: 162 SKHLSCSHRLCD---LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            + + C+   C        C      C Y M  Y + ++SSG L  D   L+   D  + 
Sbjct: 138 HRRIPCASPRCRDVLRYPGCDARTGGCVY-MVVYGDGSASSGDLATD--RLVFPDDTHVH 194

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--- 275
           N     V +GCG    G  L+  A  GL+G+G G++S P+ LA A    + FS C     
Sbjct: 195 N-----VTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRL 244

Query: 276 ---KDDSGRIFFGDQGPATQQSTSFLASNGK----YITYIIG-------VETCCIGSSCL 321
              ++ S  + FG        + + L +N +    Y   ++G       V      S  L
Sbjct: 245 SRAQNGSSYLVFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLAL 304

Query: 322 KQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQ------VNDTITSFEGYPWKCCYKS 373
              + +   +VDSG++ +   ++ Y  +   FD        +    T F  +    CY  
Sbjct: 305 NPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVF--DACYDL 362

Query: 374 SSQRLP----KLPSVKLMF 388
                P    ++PS+ L F
Sbjct: 363 RGNGAPAAAVRVPSIVLHF 381


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 131/307 (42%), Gaps = 47/307 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + IG+P     + +D GSD+ WI     +C+P  + Y     ++   + P ASS+ 
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWI-----QCSPCKSCY----KQNDAVFDPRASSSF 64

Query: 163 KHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           + LSCS   C L    +C +    C Y +  Y + + + G L  D   L+S G       
Sbjct: 65  RRLSCSTPQCKLLDVKACASTDNRCLYQVS-YGDGSFTVGDLASDSF-LVSRGRT----- 117

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
             + V+ GCG    G +   V   GL+GLG G++S PS L+        FS C    D+G
Sbjct: 118 --SPVVFGCGHDNEGLF---VGAAGLLGLGAGKLSFPSQLSS-----RKFSYCLVSRDNG 167

Query: 281 -----RIFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--- 327
                 + FGD    T  S ++  L  N K  T Y  G+    IG + L    T+FK   
Sbjct: 168 VRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSS 227

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
                  I+DSG+S T LP   Y  +   F         + +   +  CY  S+     +
Sbjct: 228 STGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTI 287

Query: 382 PSVKLMF 388
           P+V   F
Sbjct: 288 PTVSFHF 294


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 123/301 (40%), Gaps = 40/301 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G+P  SF V +D GSDL W     V+C P    Y     +   ++ PS S + +  +
Sbjct: 43  LTLGSPPQSFDVIVDTGSDLNW-----VQCLPCRVCY----QQPGPKFDPSKSRSFRKAA 93

Query: 167 CSHRLCDLGT----SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           C+  LC++      +C      C Y   Y  ++ ++  L  E I      G  ++ N   
Sbjct: 94  CTDNLCNVSALPLKACA--ANVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPN--- 148

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDS 279
                GCG  Q+ G   G A  GL+GLG G +S+ S L+      N FS C    +   +
Sbjct: 149 --FAFGCG-TQNLGTFAGAA--GLVGLGQGPLSLNSQLSHT--FANKFSYCLVSLNSLSA 201

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYI--------IGVETCCIGSS--CLKQTSFKA- 328
             + FG    A     + +  N ++ TY         +G +   +  S   + Q++ +  
Sbjct: 202 SPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGG 261

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
            I+DSG++ T L    Y  +   ++  VN        Y    C+  +    P +P +   
Sbjct: 262 TIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVFK 321

Query: 388 F 388
           F
Sbjct: 322 F 322


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 70/265 (26%), Positives = 114/265 (43%), Gaps = 47/265 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP + +   +D GSDL+W  C  C+ CA     Y+     D+ +     S+T + L
Sbjct: 93  LAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYF-----DVKK-----SATYRAL 142

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS- 224
            C    C   +S    K+ C Y   YY +  S++G+L  +      G  N+ K  V+A+ 
Sbjct: 143 PCRSSRCASLSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTF--GAANSTK--VRATN 197

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---R 281
           +  GCG   +G   D     G++G G G +S+ S L       + FS C     S    R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLGP-----SRFSYCLTSYLSATPSR 249

Query: 282 IFFG---------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------ 326
           ++FG             +  QST F+ +      Y + ++   +G+  L           
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIND 309

Query: 327 ----KAIVDSGSSFTFLPKEVYETI 347
                 I+DSG+S T+L ++ YE +
Sbjct: 310 DGTGGVIIDSGTSITWLQQDAYEAV 334


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 135/313 (43%), Gaps = 57/313 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G+P  +  + LD GS+L W+ C   +  P   S +N L    + Y+P+  ++S    
Sbjct: 64  LTVGSPPQNVTMVLDTGSELSWLHC---KKLPNLNSTFNPLLS--SSYTPTPCNSSI--- 115

Query: 167 CSHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C+ R  DL    SC +P     + +  Y + +S+ G L  +          +L  + Q  
Sbjct: 116 CTTRTRDLTIPASC-DPNNKLCHVIVSYADASSAEGTLAAETF--------SLAGAAQPG 166

Query: 225 VIIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS- 279
            + GC    S GY   +  D    GL+G+  G +S   L+ +  L +  FS C   +D+ 
Sbjct: 167 TLFGC--MDSAGYTSDINEDSKTTGLMGMNRGSLS---LVTQMSLPK--FSYCISGEDAL 219

Query: 280 GRIFFGD--QGPATQQSTSFLASNG-----KYITYIIGVETCCIGSSCLK--QTSF---- 326
           G +  GD    P+  Q T  + +         + Y + +E   +    L+  ++ F    
Sbjct: 220 GVLLLGDGTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDH 279

Query: 327 ----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS-------FEGYPWKCCYKSSS 375
               + +VDSG+ FTFL   VY ++  EF  Q    +T        FEG     CY + +
Sbjct: 280 TGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEG-AMDLCYHAPA 338

Query: 376 QRLPKLPSVKLMF 388
                +P+V L+F
Sbjct: 339 S-FAAVPAVTLVF 350


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 115/285 (40%), Gaps = 48/285 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + IG P  S L+  D GSDL+W+ C  C  C+  S +           + P  SST
Sbjct: 83  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPA---------TVFFPRHSST 133

Query: 162 SKHLSCSHRLCDL------GTSCQNPK--QPCPYTMDYYTENTSSSGLLVEDI--LHLIS 211
                C   +C L         C + +    CPY    Y + + +SGL   +   L   S
Sbjct: 134 FSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYG-YADGSLTSGLFARETTSLKTSS 192

Query: 212 GGDNALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGLGLGEISVPSLLAKAGLIRN 268
           G +  LK     SV  GCG + SG  + G +    +G++GLG G IS  S L +     N
Sbjct: 193 GKEAKLK-----SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGN 245

Query: 269 SFSMC-----FDKDDSGRIFFGDQGPATQQ--STSFLASNGKYITYIIGVETCCIGSSCL 321
            FS C          +  +  GD G A  +   T  L +      Y + +++  +  + L
Sbjct: 246 KFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKL 305

Query: 322 K----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           +            +   ++DSG++  FL    Y  + A   +++ 
Sbjct: 306 RIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIK 350


>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
          Length = 445

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 135/344 (39%), Gaps = 76/344 (22%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + T I+  TP V   + ++ G + LW+ C+          Y               SST 
Sbjct: 47  YLTQINQRTPLVPVKLTVNLGGEFLWVDCE--------KGY--------------VSSTY 84

Query: 163 KHLSCSHRLCDL------GTSCQNPKQPCPY-TMDYYTEN----TSSSGLLVEDILHLIS 211
           K   C    C+L      G     PK  C   T   +  N    TS+SG L +DI+ + S
Sbjct: 85  KPARCRSAQCNLAGSKSCGECFDGPKPGCNNNTCGLFPYNPFIRTSTSGELAQDIISIQS 144

Query: 212 -GGDNALKNSVQASVIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKAGLIRN 268
             G N  K     +VI  CG   S   L+G+A    G+ GLG  +I++PS  A A   + 
Sbjct: 145 TNGSNPSKVVSFPNVIFTCG---STFLLEGLASGVTGIAGLGRKKIALPSQFAAAFSFKR 201

Query: 269 SFSMCFDKDD--SGRIFFGDQGPATQQSTSFLASNGKYIT-------------------- 306
            F++C       +G +FFGD GP        ++ N  Y                      
Sbjct: 202 KFALCLSSSTRATGVVFFGD-GPYIMLPNKDVSQNLIYTPLILNPVSTAGASFEGEPSAD 260

Query: 307 YIIGVETCCIGSSCLK-QTSFKAIVDSGSS---------FTFLPKEVYETIAAEFDRQVN 356
           Y IGV+   +    +K  TS  +I   G+          +T L   +Y+ +   F + V 
Sbjct: 261 YFIGVKGIKVNGEDVKLNTSLLSIAKDGTGGTKISTTQPYTSLETSIYKAVIGAFGKAVA 320

Query: 357 DTITSFEGYPWKCCYKS---SSQRL-PKLPSVKLMFPQNNSFVV 396
                    P++ C+ S   SS R+ P +P + L+ P N ++ +
Sbjct: 321 KVPRVTAVAPFELCFNSTSFSSTRVGPGVPQIDLVLPNNKAWTI 364


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 140/325 (43%), Gaps = 54/325 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +G+P   F + LD GSDL WI C  C  C   + ++Y+          P AS++ K+++C
Sbjct: 176 VGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYD----------PKASASYKNITC 225

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNS 220
           + + C+L +S      C++  Q CPY   Y   + ++    VE   ++L + G ++   +
Sbjct: 226 NDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYN 285

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----D 275
           V+ +++ GCG    G +        L+GLG G +S  S L    L  +SFS C      D
Sbjct: 286 VE-NMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSD 339

Query: 276 KDDSGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIGSSCLK------- 322
            + S ++ FG+            TSF+A     +   Y + +++  +    L        
Sbjct: 340 TNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWN 399

Query: 323 ---QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRL 378
                +   I+DSG++ ++  +  YE I  +   +       +  +P    C+  S    
Sbjct: 400 ISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHN 459

Query: 379 PKLPSVKLM--------FPQNNSFV 395
            +LP + +         FP  NSF+
Sbjct: 460 VQLPELGIAFADGAVWNFPTENSFI 484


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 92/341 (26%), Positives = 147/341 (43%), Gaps = 58/341 (17%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T + +GTP     + LD GSD++WI C  C++C       Y+  D     + P+ S +
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKC-------YSQTD---PVFDPTKSRS 194

Query: 162 SKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             ++ C   LC       C   KQ C Y +  Y + + + G    + L          + 
Sbjct: 195 FANIPCGSPLCRRLDYPGCSTKKQICLYQVS-YGDGSFTVGEFSTETL--------TFRG 245

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD 278
           +    V++GCG    G +   V   GL+GLG G +S PS + +     + FS C  D+  
Sbjct: 246 TRVGRVVLGCGHDNEGLF---VGAAGLLGLGRGRLSFPSQIGRR--FNSKFSYCLGDRSA 300

Query: 279 SGR---IFFGDQGPATQQSTSF--LASNGKYITY----IIGVETCCIGSSCLKQTSFK-- 327
           S R   I FGD   A  ++T F  L SN K  T+    ++G+       S +  + FK  
Sbjct: 301 SSRPSSIVFGDS--AISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLD 358

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
                  I+DSG+S T L +  Y  +   F    ++   + E   +  C+  S +   K+
Sbjct: 359 STGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKV 418

Query: 382 PSVKLMF-------PQNNSFV-VNNP---VFVIYGTQVGVS 411
           P+V L F       P +N  + V+N     F   GT  G+S
Sbjct: 419 PTVVLHFRGADVPLPASNYLIPVDNSGSFCFAFAGTASGLS 459


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 121/300 (40%), Gaps = 50/300 (16%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G  ++  I IGTP +  LV  D GSDL+W+ C  C  C    +  +N          P  
Sbjct: 91  GGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFN----------PKQ 140

Query: 159 SSTSKHLSCSHRLCDL------GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           SST + + C  R C+         S     + C Y+  Y   + +   L  E     I G
Sbjct: 141 SSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATE---RFIIG 197

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL-IRNSFS 271
             N   NS+Q  +  GCG   +GG  D    +   G+        SL+++ G  I N FS
Sbjct: 198 STN---NSIQ-ELAFGCG-NSNGGNFD----EVGSGIVGLGGGSLSLISQLGTKIDNKFS 248

Query: 272 MC----FDKDDS--GRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCL 321
            C     +K +   G+I FGD     G  T  ST  L S      Y + +E   +G+  L
Sbjct: 249 YCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTP-LVSKEPETFYYLTLEAISVGNERL 307

Query: 322 KQTSFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
              + +          I+DSG++ TFL  ++Y  +    ++ V     S     +  C++
Sbjct: 308 AYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFR 367


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 113/262 (43%), Gaps = 41/262 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP + +   +D GSDL+W  C  C+ CA     Y++ + R         S+T + L
Sbjct: 93  LAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFD-VKR---------SATYRAL 142

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C    C   +S    K+ C Y   YY +  S++G+L  +     +     ++    A++
Sbjct: 143 PCRSSRCAALSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTFGAASSTKVR---AANI 198

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC-FDKDDSGRIFF 284
             GCG   +G   +     G++G G G +   SL+++ G  R S+ +  +      R++F
Sbjct: 199 SFGCGSLNAGELANS---SGMVGFGRGPL---SLVSQLGPSRFSYCLTSYLSPTPSRLYF 252

Query: 285 G---------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF--------- 326
           G             +  QST F+ +      Y + V+   +G+  L              
Sbjct: 253 GVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGT 312

Query: 327 -KAIVDSGSSFTFLPKEVYETI 347
              I+DSG+S T+L ++ YE +
Sbjct: 313 GGVIIDSGTSITWLQQDAYEAV 334


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 118/285 (41%), Gaps = 37/285 (12%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY- 143
            P++   ++  GN     +   + +GTP     V  D GSDL W     V+C P S+   
Sbjct: 72  LPAERGISVGTGN-----YVVSVGLGTPARDLTVVFDTGSDLSW-----VQCGPCSSGGC 121

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT-SCQNP--KQPCPYTMDYYTENTSSSG 200
           Y+  D     ++PS+SST   + C    C     SC +      CPY +  Y + + + G
Sbjct: 122 YHQQD---PLFAPSSSSTFSAVRCGEPECPRARQSCSSSPGDDRCPYEV-VYGDKSRTVG 177

Query: 201 LLVEDILHL-ISGGDNALKNSVQ--ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
            L  D L L  +   NA +N+       + GCG   +G  L G A DGL GLG G++S+ 
Sbjct: 178 HLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTG--LFGKA-DGLFGLGRGKVSLS 234

Query: 258 SLLAKAGLIRNSFSMCFDKDDS---GRIFFGDQGPATQQS--TSFLASNGKYITYIIGVE 312
           S    AG     FS C     S   G +  G   PA   +  T  L  +     Y + + 
Sbjct: 235 S--QAAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLV 292

Query: 313 TCCIGSSCLKQTSFKA------IVDSGSSFTFLPKEVYETIAAEF 351
              +    +K +S  A      IVDSG+  T L    Y  +   F
Sbjct: 293 GIRVAGRAIKVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAF 337


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 91/398 (22%), Positives = 151/398 (37%), Gaps = 63/398 (15%)

Query: 15  LLTESSGAETVMFSTKLIHRFSEEVKALGVSKN-----RNATSWPAKKSFEYYQVLLSSD 69
            L+ ++    + F+  LIHR S +      ++      RNA      + F +  +     
Sbjct: 19  FLSNANAKSKLGFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDI----- 73

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
            QK      PQ   L  + G   M+            I +GTP    +   D GSDLLW 
Sbjct: 74  SQKDASDNAPQID-LTSNSGEYLMN------------ISLGTPPFPIMAIADTGSDLLWT 120

Query: 130 PCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQNPKQPC 185
            C  C  C       Y  +D     + P ASST K +SCS   C   +   SC      C
Sbjct: 121 QCKPCDDC-------YTQVDP---LFDPKASSTYKDVSCSSSQCTALENQASCSTEDNTC 170

Query: 186 PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDG 245
            Y+   Y + + + G +  D L L   G    +     ++IIGCG   +G +    +   
Sbjct: 171 SYSTS-YGDRSYTKGNIAVDTLTL---GSTDTRPVQLKNIIIGCGHNNAGTFNKKGS--- 223

Query: 246 LIGLGLGEISVPSLLAKAG-LIRNSFSMCF-----DKDDSGRIFFGDQG--PATQQSTSF 297
             G+        SL+ + G  I   FS C      + D + +I FG       T   ++ 
Sbjct: 224 --GIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTP 281

Query: 298 LASNGKYITYIIGVETCCIGSSCLKQTSFKA-------IVDSGSSFTFLPKEVYETIAAE 350
           L +  +   Y + +++  +GS  ++     +       I+DSG++ T LP E Y  +   
Sbjct: 282 LIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDA 341

Query: 351 FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
               ++             CY ++     K+P++ + F
Sbjct: 342 VASSIDAEKKQDPQTGLSLCYSATGDL--KVPAITMHF 377


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 136/334 (40%), Gaps = 59/334 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + LD GSDL WI C  C+ C   S  YY+          P  SS+ +++SC
Sbjct: 203 VGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYD----------PKDSSSFRNISC 252

Query: 168 SHRLCDLGTSCQNPK------QPCPYTMDYYTENTSSSGLLVEDILHL---ISGGDNALK 218
               C L ++   PK      Q CPY   +Y + ++++G    +   +      G + LK
Sbjct: 253 HDPRCQLVSAPDPPKPCKAENQSCPYFY-WYGDGSNTTGDFALETFTVNLTTPNGTSELK 311

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
           +    +V+ GCG    G +       GL    L   S         L   SFS C  D++
Sbjct: 312 HV--ENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCLVDRN 364

Query: 278 D----SGRIFFG-DQGPATQQSTSFLASNGKY-----ITYIIGVETCCIGSSCLK----- 322
                S ++ FG D+   +  + +F +  G         Y + +++  +    LK     
Sbjct: 365 SNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEET 424

Query: 323 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKSSSQ 376
                + +   I+DSG++ T+  +  YE I   F R++       EG  P K CY  S  
Sbjct: 425 WHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKG-YQLVEGLPPLKPCYNVSGI 483

Query: 377 RLPKLPSVKLM--------FPQNNSFVVNNPVFV 402
              +LP   ++        FP  N F+  +P  V
Sbjct: 484 EKMELPDFGILFADEAVWNFPVENYFIWIDPEVV 517


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 121/300 (40%), Gaps = 40/300 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP     +  D GSDL W      +C P + S Y   D     + PS SS+  +++
Sbjct: 50  VGLGTPKRDLSLVFDTGSDLTW-----TQCEPCAGSCYKQQDA---IFDPSKSSSYTNIT 101

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDY-------YTENTSSSGLLVEDILHLISGGDNALKN 219
           C+  LC   TS    K  C  + D        Y +N++S G L ++ L + +        
Sbjct: 102 CTSSLCTQLTS-DGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITA-------T 153

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
            +    + GCG     G  +G A  GL+GLG   IS+  +   +      FS C     S
Sbjct: 154 DIVDDFLFGCGQDNE-GLFNGSA--GLMGLGRHPISI--VQQTSSNYNKIFSYCLPATSS 208

Query: 280 --GRIFFGDQGPATQQS---TSFLASNGKYITYIIGVETCCIGSSCLKQTS---FKA--- 328
             G + FG    AT  S   T     +G    Y + + +  +G + L   S   F A   
Sbjct: 209 SLGHLTFG-ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGS 267

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           I+DSG+  T L   VY  + + F R +     + E      CY  S  +   +P +   F
Sbjct: 268 IIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEF 327


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 77/305 (25%), Positives = 130/305 (42%), Gaps = 34/305 (11%)

Query: 117 LVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-- 173
            + +D GSD+ WI CD C +C       Y   D   + + P+ S+T K L C+  +C   
Sbjct: 2   FLLIDTGSDITWIQCDPCPQC-------YKQQD---SLFQPAGSATYKPLPCNSTMCQQL 51

Query: 174 --LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGM 231
                SC N    C Y + Y  ++T+     +E    L    D+ +  SV  +   GCG 
Sbjct: 52  QSFSHSCLNSS--CNYMVSYGDKSTTRGDFALET---LTLRSDDTILVSV-PNFAFGCG- 104

Query: 232 KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIFFGDQ 287
             + G  +G A  GL+GLG   I  P+  + A      FS C         SG + FG+ 
Sbjct: 105 HANKGLFNGAA--GLMGLGKSSIGFPAQTSVA--FGKVFSYCLPSVSSTIPSGILHFGEA 160

Query: 288 GPATQ--QSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYE 345
                  + T  + S+     Y + +    +G   L   S   +VDSG+  +   +  YE
Sbjct: 161 AMLDYDVRFTPLVDSSSGPSQYFVSMTGINVGDELLP-ISATVMVDSGTVISRFEQSAYE 219

Query: 346 TIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYG 405
            +   F + +    T+    P+  C++ S+     +P + L F ++++ +  +PV ++Y 
Sbjct: 220 RLRDAFTQILPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHF-RDDAELRLSPVHILYP 278

Query: 406 TQVGV 410
              GV
Sbjct: 279 VDDGV 283


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 86/334 (25%), Positives = 137/334 (41%), Gaps = 59/334 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + LD GSDL WI C  C+ C   S  YY+          P  SS+ +++SC
Sbjct: 201 VGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYD----------PKDSSSFRNISC 250

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHL---ISGGDNALK 218
               C L +S      C+   Q CPY   +Y + ++++G    +   +      G + LK
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFY-WYGDGSNTTGDFALETFTVNLTTPNGKSELK 309

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
           +    +V+ GCG    G +       GL    L   S         L   SFS C  D++
Sbjct: 310 HV--ENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCLVDRN 362

Query: 278 D----SGRIFFG-DQGPATQQSTSFLA----SNGKYIT-YIIGVETCCIGSSCLK----- 322
                S ++ FG D+   +  + +F +     +G   T Y + + +  +    LK     
Sbjct: 363 SNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEET 422

Query: 323 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-PWKCCYKSSSQ 376
                + +   I+DSG++ T+  +  YE I   F R++       EG  P K CY  S  
Sbjct: 423 WHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKG-YELVEGLPPLKPCYNVSGI 481

Query: 377 RLPKLPSVKLM--------FPQNNSFVVNNPVFV 402
              +LP   ++        FP  N F+  +P  V
Sbjct: 482 EKMELPDFGILFADGAVWNFPVENYFIQIDPDVV 515


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 75/283 (26%), Positives = 117/283 (41%), Gaps = 56/283 (19%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP       +D G+D +W  C    C P        L++    + PS SST K + C+
Sbjct: 96  IGTPPFQLYSLIDTGNDNIWFQCK--PCKP-------CLNQTSPMFHPSKSSTYKTIPCT 146

Query: 169 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQASVII 227
             +C                     +N     L V+ + L+  +G   + KN     ++I
Sbjct: 147 SPIC---------------------KNADGHYLGVDTLTLNSNNGTPISFKN-----IVI 180

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDD-SGRI 282
           GCG +  G  L+G    G IGL  G +S  S L  +  I   FS C    F K++ S ++
Sbjct: 181 GCGHRNQGP-LEGYV-SGNIGLARGPLSFISQLNSS--IGGKFSYCLVPLFSKENVSSKL 236

Query: 283 FFGDQGPAT---QQSTSFLASNGKYITYIIGVETCCIGSSCLK----QTSFKAIVDSGSS 335
            FGD+   +     ST     NG    Y + +E   +G   +K         +I+DSG++
Sbjct: 237 HFGDKSTVSGLGTVSTPIKEENG----YFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTT 292

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 378
            T LPK+VY  + +     V           +  CY+++S  L
Sbjct: 293 MTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTL 335


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 74/303 (24%), Positives = 120/303 (39%), Gaps = 39/303 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP     +  D GSDL W      +C P + S Y   D     + PS SS+ 
Sbjct: 136 YFVVVGLGTPKRDLSLVFDTGSDLTW-----TQCEPCAGSCYKQQDA---IFDPSKSSSY 187

Query: 163 KHLSCSHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            +++C+  LC   TS      C +    C Y +  Y + ++S G L ++ L + +     
Sbjct: 188 INITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQ-YGDKSTSVGFLSQERLTITA----- 241

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
               +    + GCG     G   G A  GLIGLG   IS   +   + +    FS C   
Sbjct: 242 --TDIVDDFLFGCGQDNE-GLFSGSA--GLIGLGRHPISF--VQQTSSIYNKIFSYCLPS 294

Query: 277 DDS--GRIFFGDQGPATQQSTSFL------ASNGKYITYIIGVETCCIGSSCLKQTSFKA 328
             S  G + FG    AT  +  +         N  Y   I+G+         +  ++F A
Sbjct: 295 TSSSLGHLTFG-ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA 353

Query: 329 ---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
              I+DSG+  T L    Y  + + F + +     + E   +  CY  S  +   +P + 
Sbjct: 354 GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKID 413

Query: 386 LMF 388
             F
Sbjct: 414 FEF 416


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 113/283 (39%), Gaps = 68/283 (24%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I IG P V  L+ +D GSDL WI C   +C P +  +++          PS SST ++ S
Sbjct: 82  ISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPFFH----------PSRSSTYRNAS 131

Query: 167 CSHRLCDLGTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           C      +    ++ K   C Y + Y  + +++ G+L E+ L   +  D  +    + ++
Sbjct: 132 CVSAPHAMPQIFRDEKTGNCQYHLRY-RDFSNTRGILAEEKLTFETSDDGLIS---KQNI 187

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN---SFSMCFDKDDSGRI 282
           + GCG   SG         G++GLG G  S+        + RN    FS CF        
Sbjct: 188 VFGCGQDNSGF----TKYSGVLGLGPGTFSI--------VTRNFGSKFSYCF-------- 227

Query: 283 FFGDQGPATQQSTSFLASNGKYIT------------YIIGVETCCIGSSCLK-------- 322
             G     T      +  NG  I             Y + ++    G   L         
Sbjct: 228 --GSLTNPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQR 285

Query: 323 -QTSFKAIVDSGSSFTFLPKEVYETIAAEFD-------RQVND 357
            ++    ++D+G S T L +E YET++ E D       R+V D
Sbjct: 286 YRSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKD 328


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 112/265 (42%), Gaps = 47/265 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP + +   +D GSDL+W  C  C+ CA     Y++             S+T + L
Sbjct: 93  LAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDV----------KKSATYRAL 142

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS- 224
            C    C   +S    K+ C Y   YY +  S++G+L  +      G  N+ K  V+A+ 
Sbjct: 143 PCRSSRCASLSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTF--GAANSTK--VRATN 197

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---R 281
           +  GCG   +G   D     G++G G G +S+ S L  +      FS C     S    R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSR 249

Query: 282 IFFG---------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF------ 326
           ++FG             +  QST F+ +      Y + ++   +G+  L           
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIND 309

Query: 327 ----KAIVDSGSSFTFLPKEVYETI 347
                 I+DSG+S T+L ++ YE +
Sbjct: 310 DGTGGVIIDSGTSITWLQQDAYEAV 334


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 81/301 (26%), Positives = 133/301 (44%), Gaps = 56/301 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP  ++   LD GSDL+W  C  C +C   S   ++          P  SS+   L
Sbjct: 101 LAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFD----------PKKSSSFSKL 150

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           SCS +LC+    +SC N    C Y +  Y + +S+ G+L  + L     G  ++ N    
Sbjct: 151 SCSSQLCEALPQSSCNN---GCEY-LYSYGDYSSTQGILASETLTF---GKASVPN---- 199

Query: 224 SVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDS 279
            V  GCG    G G+  G    GL+GLG G +S+ S L +       FS C    D   +
Sbjct: 200 -VAFGCGADNEGSGFSQGA---GLVGLGRGPLSLVSQLKEP-----KFSYCLTTVDDTKT 250

Query: 280 GRIFFG-----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK----- 327
             +  G     +   +  ++T  + S      Y + +E   +G + L  K+++F      
Sbjct: 251 STLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDG 310

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPK 380
               I+DSG++ T+L +  +  +A EF  ++N  + S        C+     S++  +PK
Sbjct: 311 SGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPK 370

Query: 381 L 381
           L
Sbjct: 371 L 371


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 76/299 (25%), Positives = 123/299 (41%), Gaps = 50/299 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP    L+A+D  SD+ WIPC  CV C   +A            +SP+ S++ K++SC
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA------------FSPAKSTSFKNVSC 152

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           S   C    +     + C + + Y + + +++  L +D + L +    A           
Sbjct: 153 SAPQCKQVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKAFT--------F 202

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIF 283
           GC  K +GG   G  P     LGLG   +  +     + +++FS C         SG + 
Sbjct: 203 GCVNKVAGG---GTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLR 259

Query: 284 FGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDS 332
            G    P   + T  L +  +   Y + +    +G   +            T    I DS
Sbjct: 260 LGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDS 319

Query: 333 GSSFTFLPKEVYETIAAEFDRQVND---TITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           G+ +T L K VYE +  EF ++V      +TS  G+    CY        K+P++  MF
Sbjct: 320 GTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGF--DTCYSGQV----KVPTITFMF 372


>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 260

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 83/170 (48%), Gaps = 20/170 (11%)

Query: 105 TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           T + IGTP   F + +D GS++ ++PC         +  Y     D   +   +SST + 
Sbjct: 52  TKLYIGTPPQEFTLVVDTGSNMTFVPC-------CGSEEYCGKHED-PAFQTESSSTYQP 103

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           ++C H  CD    C   +  C Y M +Y + + S G+L EDI   IS G+ +        
Sbjct: 104 VNC-HPSCD----CDYLRSQCSYKM-HYGDGSYSRGVLAEDI---ISFGNES--EFAPQR 152

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
           ++ GC +   G  L  +  DG+IGLG G  ++   L   G+I +SFS+C+
Sbjct: 153 LVFGCELDAIGS-LYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLCY 201


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 125/311 (40%), Gaps = 51/311 (16%)

Query: 98  DFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYS 155
           D G   Y   + IGTP +S    +D GSDL+W  C+ C  C+  S               
Sbjct: 36  DIGSGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDP----------- 84

Query: 156 PSASSTSKHLSCSHRLCDLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
            S+SST   + C   LC   +  SC N    C Y    Y + +S+SG+L ++   + S  
Sbjct: 85  -SSSSTYSKVLCQSSLCQPPSIFSCNNDGD-CEYVYP-YGDRSSTSGILSDETFSISS-- 139

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
             +L N     +  GCG    G   D V   GL+G G G +S+ S L  +  + N FS C
Sbjct: 140 -QSLPN-----ITFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLGPS--MGNKFSYC 187

Query: 274 F----DKDDSGRIFFGDQG--PATQQSTSFLASNGKYITYIIGVETCCIGSSCL------ 321
                D   +  +F G+     AT   ++ L  +     Y + +E   +G   L      
Sbjct: 188 LVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGT 247

Query: 322 ----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR 377
                  S   I+DSG++ TFL +  Y+ +       +N  +   +G     C+      
Sbjct: 248 FDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSIN--LPQADGQ-LDLCFNQQGSS 304

Query: 378 LPKLPSVKLMF 388
            P  PS+   F
Sbjct: 305 NPGFPSMTFHF 315


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 81/180 (45%), Gaps = 27/180 (15%)

Query: 103 HYTWI---DIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSA 158
           HY ++    IGTP V      D GSDL+W+ C  C  C       Y  L+   +  S   
Sbjct: 56  HYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNC-------YKQLNPMFDSQS--- 105

Query: 159 SSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS--GGD 214
           SST  +++C    C     TSC   +  C Y    Y + + + G+L ++ L L S  G  
Sbjct: 106 SSTFSNIACGSESCSKLYSTSCSPDQINCKYNYS-YVDGSETQGVLAQETLTLTSTTGEP 164

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            A K      VI GCG   +G + D     G+IGLG G +S+ S +  + L  N FS C 
Sbjct: 165 VAFK-----GVIFGCGHNNNGAFNDKEM--GIIGLGRGPLSLVSQIGSS-LGGNMFSQCL 216


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 87/390 (22%), Positives = 153/390 (39%), Gaps = 56/390 (14%)

Query: 5   SLTIYLAVFWLLTESSGAETV--MFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYY 62
           S+ I L+ F  +   S AE     FS  LIHR S +      S   N +  PA++   ++
Sbjct: 12  SIVIALS-FVSVAHISAAEVKNGRFSIDLIHRDSPK------SPLYNPSETPAERLDRFF 64

Query: 63  QVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDA 122
           +  +S                + P+     +S  N     +   I IGTP        D 
Sbjct: 65  RRFMSFSEAS-----------ISPNTPEPPVSSNN---GEYLMKISIGTPPFDVYGIYDT 110

Query: 123 GSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQ 179
           GSDL+W  C  C+ C       ++          PS S++ K +SC  + C L    SC 
Sbjct: 111 GSDLMWTQCLPCLSCYKQKNPMFD----------PSKSTSFKEVSCESQQCRLLDTVSCS 160

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLD 239
            P++ C ++   Y + + + G++  + L L S   N+ + +   +++ GCG   SG + +
Sbjct: 161 QPQKLCDFSYG-YGDGSLAQGVIATETLTLNS---NSGQPTSILNIVFGCGHNNSGTFNE 216

Query: 240 GVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQS 294
                GL G G   +S+ S +         FS C      D   + +I FG +   +   
Sbjct: 217 NEM--GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSD 274

Query: 295 --TSFLASNGKYITYIIGVETCCIG-------SSCLKQTSFKAIVDSGSSFTFLPKEVYE 345
             ++ L +      Y + ++   +G       SS    T     +D+G+  T LP++ Y 
Sbjct: 275 VVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYN 334

Query: 346 TIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
            +       +            + CY+S++
Sbjct: 335 RLVQGVKEAIPMEPVQDPDLQPQLCYRSAT 364


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 93/405 (22%), Positives = 159/405 (39%), Gaps = 100/405 (24%)

Query: 1   MNRISLTI--YLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNR-----NATSW 53
           MN  SL I  Y ++ ++++ S       FS +LIHR S +      ++N+     NA   
Sbjct: 1   MNTCSLLILFYFSLCFIISLSHALNN-GFSVELIHRDSSKSPLYQPTQNKYQHIVNAARR 59

Query: 54  PAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPN 113
              ++  +Y+  L++  Q            + P  G   M+              +GTP 
Sbjct: 60  SINRANHFYKTALTNTPQ----------STVIPDHGEYLMTYS------------VGTPP 97

Query: 114 VSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
                  D GSD++W+ C+ C  C       YN   +   ++ PS SST K++ CS  LC
Sbjct: 98  FKLYGIADTGSDIVWLQCEPCKEC-------YN---QTTPKFKPSKSSTYKNIPCSSDLC 147

Query: 173 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 232
             G                        G L  D L L S   + +        +IGCG  
Sbjct: 148 KSG----------------------QQGNLSVDTLTLESSTGHPIS---FPKTVIGCGTD 182

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ 287
            +  + +G A  G++GLG G  S+ + L  +  I   FS C      + + + ++ FGD 
Sbjct: 183 NTVSF-EG-ASSGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPVESNTTSKLNFGDT 238

Query: 288 GPATQQS--TSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA----------IVDSGSS 335
              +     ++ +      + Y + +E   +G+   K+  F+           I+DSG++
Sbjct: 239 AVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGN---KRIEFEGSSNGGHEGNIIIDSGTT 295

Query: 336 FTFLPKEVYETIAAEF-----DRQVNDTITSFEGYPWKCCYKSSS 375
            T +P +VY  + +        ++VND    F       CY  +S
Sbjct: 296 LTVIPTDVYNNLESAVLELVKLKRVNDPTRLFN-----LCYSVTS 335


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/312 (25%), Positives = 124/312 (39%), Gaps = 36/312 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +  G+P  ++ +++D GSD+ WI     +C P S   Y   D     + P+ S+T   + 
Sbjct: 165 VGFGSPAQNYTLSIDTGSDVSWI-----QCLPCSGHCYKQHD---PVFDPTKSATYSAVP 216

Query: 167 CSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           C H  C   G  C N    C Y +  Y + +S++G+L  + L L S  D           
Sbjct: 217 CGHPQCAAAGGKCSNSGT-CLYKVT-YGDGSSTAGVLSHETLSLSSTRD-------LPGF 267

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--GRIF 283
             GCG    G +        L+GLG G +S+PS    A     +FS C    D+  G + 
Sbjct: 268 AFGCGQTNLGEFGGVDG---LVGLGRGALSLPS--QAAATFGATFSYCLPSYDTTHGYLT 322

Query: 284 FGDQGPATQ------QSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDS 332
            G   PA        Q T+ +        Y + V +  IG   L       T    + DS
Sbjct: 323 MGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDS 382

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 392
           G+  T+LP E Y ++   F   +     +    P+  CY  +      +P+V   F    
Sbjct: 383 GTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGA 442

Query: 393 SFVVNNPVFVIY 404
            F ++    +IY
Sbjct: 443 VFDLSPVAILIY 454


>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
 gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
          Length = 408

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 128/328 (39%), Gaps = 57/328 (17%)

Query: 71  QKQKMKTG-PQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
           ++  M+ G P F      +G  ++ L N     ++T I IG P  SF V LD GS  LW+
Sbjct: 64  RRVAMQNGEPLFWTQDELKGGHSVPLSNFMNAQYFTEISIGNPPQSFKVILDTGSSNLWV 123

Query: 130 PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTM 189
           P   V+C  ++   +   D        SASS++   + S      G+             
Sbjct: 124 P--SVKCTSIACFLHTKYD--------SASSSTFKANGSEFSIHYGSG------------ 161

Query: 190 DYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGL 249
                  S  G +  D+L +   GD  +K    A  +   G+  + G  DG+     +GL
Sbjct: 162 -------SMEGFVSNDLLSI---GDITIKGQDFAEAVKEPGLAFAFGKFDGI-----LGL 206

Query: 250 GLGEISVPSL------LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLAS 300
           G   ISV  +      +   GLI +   SF +   ++D G   FG    +  +       
Sbjct: 207 GYDTISVNHIIPPFYSMINQGLIDSPVFSFRLGSSEEDGGEAVFGGIDESAYKGKITYVP 266

Query: 301 NGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
             +   + + +E    G+  L+  S  A +D+G+S   LP ++ E +  +   + +    
Sbjct: 267 VRRKAYWEVELEKVSFGNDDLELESTGAAIDTGTSLIVLPTDIAEMLNTQIGAKKS---- 322

Query: 361 SFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
                 W   Y+    ++P LP +   F
Sbjct: 323 ------WNGQYQVDCAKVPSLPELSFYF 344


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/298 (27%), Positives = 119/298 (39%), Gaps = 51/298 (17%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
            +GTP  + L+ALD   D  WIPC  CV C   S++ +N++           S+T K L 
Sbjct: 40  KVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVK----------STTFKTLG 86

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C    C      Q P   C  +   +     SS +L       I     AL         
Sbjct: 87  CGAPQCK-----QVPNPICGGSTCTWNTTYGSSTILSNLTRDTI-----ALSMDPVPYYA 136

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRI 282
            GC  K +G     V P GL+G G G +S   L     L +++FS C       + SG +
Sbjct: 137 FGCIQKATG---SSVPPQGLLGFGRGPLSF--LSQTQNLYKSTFSYCLPSFRTLNFSGSL 191

Query: 283 FFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVD 331
             G  G P   ++T  L +  +   Y + +    +G   +            T    I D
Sbjct: 192 RLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFD 251

Query: 332 SGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           SG+ FT L    Y  +  EF ++V N T++S  G+    CY  S   +P  P++  MF
Sbjct: 252 SGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGGF--DTCY--SVPIVP--PTITFMF 303


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 84/307 (27%), Positives = 129/307 (42%), Gaps = 47/307 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + IG+P     + +D GSD+ WI     +C+P  + Y     ++   + P ASS+ 
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWI-----QCSPCKSCY----KQNDAVFDPRASSSF 64

Query: 163 KHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           + LSCS   C L    +C +    C Y +  Y + + + G L  D   +  G        
Sbjct: 65  RRLSCSTPQCKLLDVKACASTDNRCLYQVS-YGDGSFTVGDLASDSFSVSRG-------- 115

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
             + V+ GCG    G +   V   GL+GLG G++S PS L+        FS C    D+G
Sbjct: 116 RTSPVVFGCGHDNEGLF---VGAAGLLGLGAGKLSFPSQLSS-----RKFSYCLVSRDNG 167

Query: 281 -----RIFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--- 327
                 + FGD    T  S ++  L  N K  T Y  G+    IG + L    T+FK   
Sbjct: 168 VRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSS 227

Query: 328 ------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
                  I+DSG+S T LP   Y  +   F         + +   +  CY  S+     +
Sbjct: 228 STGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTI 287

Query: 382 PSVKLMF 388
           P+V   F
Sbjct: 288 PTVSFHF 294


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 134/326 (41%), Gaps = 51/326 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G++SV   L ++    + FS C     S
Sbjct: 104 QKIPGFTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMS 159

Query: 280 GRIFF---------GDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT--- 324
            R FF         G +  AT+   + T  +A       + + +    +    L  +   
Sbjct: 160 ERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSI 219

Query: 325 -SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            S K +V DSGS  +++P      ++    R++     + E    + CY   S     +P
Sbjct: 220 FSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMP 278

Query: 383 SVKLMFPQNNSFVV-NNPVFVIYGTQ 407
           ++ L F     F +  + VFV    Q
Sbjct: 279 AISLHFDDGARFDLGRHGVFVERSVQ 304


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 127/327 (38%), Gaps = 47/327 (14%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P+Q    +  GN     +   + +GTP     +  D GSDL W      +C P   S Y
Sbjct: 141 LPAQSGLPLGTGN-----YIVNVGLGTPKKDLSLIFDTGSDLTW-----TQCQPCVKSCY 190

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSS 199
               +    + PSAS T  ++SC+   C       G S       C Y +  Y +++ + 
Sbjct: 191 ---AQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQ-YGDSSFTV 246

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G   +D L L        +N V    + GCG    G +       GLIGLG   +S+   
Sbjct: 247 GFFAKDTLTLT-------QNDVFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQ 296

Query: 260 LAKAGLIRNSFSMCF--DKDDSGRIFFGD-QGPATQQS-------TSFLASNGKYITYII 309
            A+       FS C    +  +G + FG+  G  T ++       T F +S G    Y I
Sbjct: 297 TAQK--FGKYFSYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATF-YFI 353

Query: 310 GVETCCIGSSCLKQTSF-----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
            V    +G   L  +         I+DSG+  T LP  VY ++ + F + ++   T+   
Sbjct: 354 DVLGISVGGKALSISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPAL 413

Query: 365 YPWKCCYKSSSQRLPKLPSVKLMFPQN 391
                CY  S+     +P +   F  N
Sbjct: 414 SLLDTCYDLSNYTSISIPKISFNFNGN 440


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 101/428 (23%), Positives = 171/428 (39%), Gaps = 64/428 (14%)

Query: 7   TIYLAVFWLLTESS--GAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQV 64
           ++ L + W L   S   A    FS ++IHR S          +R+    P +  F+    
Sbjct: 9   SLALVLLWCLYNISFLKANDGGFSVEMIHRDS----------SRSPLYRPTETPFQRV-- 56

Query: 65  LLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGS 124
              ++  ++ +  G  F+  F S  S   ++    G     +  +G+P    L  +D GS
Sbjct: 57  ---ANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLMRY-SVGSPPFQVLGIVDTGS 112

Query: 125 DLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNP 181
           D+LW+ C+ C  C   +   ++          PS S T K L CS   C+    T+C + 
Sbjct: 113 DILWLQCEPCEDCYKQTTPIFD----------PSKSKTYKTLPCSSNTCESLRNTACSS- 161

Query: 182 KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDG 240
              C Y++DY   + S   L VE +    + G     +SV     +IGCG    G + + 
Sbjct: 162 DNVCEYSIDYGDGSHSDGDLSVETLTLGSTDG-----SSVHFPKTVIGCGHNNGGTFQE- 215

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQ-- 293
              +G   +GLG   V  +   +  I   FS C      + + S ++ FGD    + +  
Sbjct: 216 ---EGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGT 272

Query: 294 -STSFLASNGKYITYIIGVETCCIGSSCLK----------QTSFKAIVDSGSSFTFLPKE 342
            ST     NG+ + Y + +E   +G + ++                I+DSG++ T LP+E
Sbjct: 273 VSTPLDPLNGQ-VFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQE 331

Query: 343 VYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFV 402
            Y  + +     +              CYK++S  L  LP +   F   +  V  NP+  
Sbjct: 332 DYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDEL-DLPVITAHFKGAD--VELNPIST 388

Query: 403 IYGTQVGV 410
               + GV
Sbjct: 389 FVPVEKGV 396


>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
          Length = 191

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 58/111 (52%), Gaps = 11/111 (9%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L++T + +G+P   + V +D GSD+LW+ C +C RC   S      +  DL  Y P  S 
Sbjct: 69  LYFTKLGLGSPKKDYYVQVDTGSDILWVNCVECSRCPTKS-----QIGMDLTLYDPKGSH 123

Query: 161 TSKHLSCSHRLCDLGTSCQNP----KQPCPYTMDYYTENTSSSGLLVEDIL 207
           TS+ +SC H  C        P    + PCPY++  Y + ++++G  V D L
Sbjct: 124 TSELISCDHEFCSSTYDGPIPGCRAETPCPYSIT-YGDGSATTGYYVRDYL 173


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/366 (21%), Positives = 142/366 (38%), Gaps = 53/366 (14%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFP 86
           FS  LIHR S +      S   N +  PA++   +++  +S                + P
Sbjct: 35  FSIDLIHRDSPK------SPLYNPSETPAERLDRFFRRFMSFSEAS-----------ISP 77

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYN 145
           +     +S  N     +   I IGTP        D GSDL+W  C  C+ C       ++
Sbjct: 78  NTPEPPVSSNN---GEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFD 134

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
                     PS S++ K +SC  + C L    SC  P++ C ++   Y + + + G++ 
Sbjct: 135 ----------PSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYG-YGDGSLAQGVIA 183

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
            + L L S   N+ +     +++ GCG   SG + +     GL G G   +S+ S +   
Sbjct: 184 TETLTLNS---NSGQPXSIXNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238

Query: 264 GLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQS--TSFLASNGKYITYIIGVETCCI 316
                 FS C      D   + +I FG +   +     ++ L +      Y + ++   +
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 317 G-------SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
           G       SS    T     +D+G+  T LP++ Y  +       +            + 
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358

Query: 370 CYKSSS 375
           CY+S++
Sbjct: 359 CYRSAT 364


>gi|392568782|gb|EIW61956.1| aspartic peptidase A1 [Trametes versicolor FP-101664 SS1]
          Length = 415

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/350 (23%), Positives = 136/350 (38%), Gaps = 67/350 (19%)

Query: 70  VQKQKMKTGPQF---QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDL 126
           V +  +K G +    Q  F ++G  T+ L N     ++  I +GTP  SF V LD GS  
Sbjct: 67  VSRPTVKDGEELFWTQDEFSTEGGHTVPLSNFMNAQYFAEITLGTPPQSFKVILDTGSSN 126

Query: 127 LWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 186
           LW+P    +C  ++   +        +Y  SASST K                       
Sbjct: 127 LWVP--STKCTSIACFLH-------AKYDSSASSTYK------------------ANGSE 159

Query: 187 YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 246
           +++ Y   + S  G +  D+L +   GD  +KN   A      G+  + G  DG+     
Sbjct: 160 FSIQY--GSGSMEGFVSRDVLTI---GDLTVKNLDFAEATKEPGLAFAFGKFDGI----- 209

Query: 247 IGLGLGEISVPSL------LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSF 297
           +GLG   ISV  +      L   GL+ +   SF +   ++D G   FG    +       
Sbjct: 210 LGLGYDTISVNHIVPPFYALVNQGLLDSPVFSFRLGDSEEDGGEAIFGGIDDSAYSGKIE 269

Query: 298 LASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
                +   + + +E   +G   L+  +  A +D+G+S   LP ++ E + A+   + + 
Sbjct: 270 YVPVRRKAYWEVELEKIRLGDEELELENTGAAIDTGTSLIALPSDLAEMLNAQIGAKKS- 328

Query: 358 TITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQ 407
                    W   Y     ++P LP +   F        N   +V+ GT 
Sbjct: 329 ---------WNGQYTVDCAKVPDLPDLTFFF--------NGKPYVLKGTD 361


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/286 (24%), Positives = 122/286 (42%), Gaps = 50/286 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           +  GTP+V  ++ +D GSD+ W+   PC+  +C P     ++          PS SST  
Sbjct: 135 LGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFD----------PSKSSTYA 184

Query: 164 HLSCSHRLC-DLG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            ++C+   C  LG      C +    C Y+++ Y + + S G+   + L L  G      
Sbjct: 185 PIACNTDACRKLGDHYHNGCTSGGTQCGYSVE-YADGSHSRGVYSNETLTLAPG------ 237

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
                    GCG  Q G        DGL+GLG   +S+  ++  + +   +FS C    +
Sbjct: 238 -ITVEDFHFGCGRDQRG---PSDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSYCLPALN 291

Query: 279 SGRIFFGDQGPATQQSTSFLASNGKYIT-----YIIGVETCCIGSSCLK--QTSFKA--I 329
           S   F     P +   ++F+ +  +++      Y++ +    +G   L   Q++F+   I
Sbjct: 292 SEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMI 351

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVN----------DTITSFEGY 365
           +DSG+  T LP+  Y  + A   + +           DT  +F GY
Sbjct: 352 IDSGTVDTELPETAYNALEAALRKALKAYPLVPSDDFDTCYNFTGY 397


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/322 (23%), Positives = 138/322 (42%), Gaps = 53/322 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP  ++   +D GSDL+W  C  C  C           D+    + P  SS+   L
Sbjct: 101 LAIGTPAETYSAIMDTGSDLIWTQCKPCKVC----------FDQPTPIFDPEKSSSFSKL 150

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            CS  LC +     +    C Y    Y +++S+ G+L  +       GD ++     + +
Sbjct: 151 PCSSDLC-VALPISSCSDGCEYRYS-YGDHSSTQGVLATETFTF---GDASV-----SKI 200

Query: 226 IIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--RI 282
             GCG    G  Y  G    GL+GLG G +   SL+++ G+ + S+ +    D  G   +
Sbjct: 201 GFGCGEDNRGRAYSQGA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTL 254

Query: 283 FFGDQGPATQQS---TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------I 329
             G +  AT +S   T  + +  +   Y + +E   +G + L  ++++F          I
Sbjct: 255 LVGSE--ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLI 312

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKL---- 381
           +DSG++ T+L    +  +  EF  Q+   + +      + C+      S   +P+L    
Sbjct: 313 IDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHF 372

Query: 382 PSVKLMFPQNNSFVVNNPVFVI 403
             V L  P+ N  + ++ + VI
Sbjct: 373 EGVDLKLPKENYIIEDSALRVI 394


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 131/312 (41%), Gaps = 41/312 (13%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G   Y   + +GTP V+ ++++D GSD+ W     V+CAP +A   +S    L 
Sbjct: 120 SSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSW-----VQCAPCAAQSCSSQKDKL- 173

Query: 153 EYSPSASSTSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
            + P+ S+T    SCS   C      G  C N    C Y +  Y ++++++G    D L 
Sbjct: 174 -FDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSH--CQYIVK-YVDHSNTTGTYGSDTLG 229

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           L +   +A+KN        GC  + +G  G LDG+       +GLG  +   +   A   
Sbjct: 230 LTT--SDAVKN-----FQFGCSHRANGFVGQLDGL-------MGLGGDTESLVSQTAATY 275

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYII----GVETCCIGSSCLK 322
             +FS C     S    F   G A   ++S   S    + + +    GV    I  +  K
Sbjct: 276 GKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTK 335

Query: 323 QT------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
                   S  ++VDSG+  T LP   Y+ +   F +++    ++        C+  S  
Sbjct: 336 LNVPASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGI 395

Query: 377 RLPKLPSVKLMF 388
           +  ++P V L F
Sbjct: 396 KTVRVPVVTLTF 407


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 133/328 (40%), Gaps = 49/328 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + T + +GTP  + +V +D GS + W+ C+C  C     ++             S S+T 
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFCECDGCHTNPRTFLQ-----------SRSTTC 49

Query: 163 KHLSCSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
             +SC   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L          
Sbjct: 50  AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF-------- 100

Query: 218 KNSVQA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
            + VQ   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C  
Sbjct: 101 -SDVQKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLP 155

Query: 276 KDDSGRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT- 324
              S R FF         G     T  + T  +A       + + +    +    L  + 
Sbjct: 156 LQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSP 215

Query: 325 ---SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
              S K +V DSGS  +++P      ++    R++     + E    + CY   S     
Sbjct: 216 SIFSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGD 274

Query: 381 LPSVKLMFPQNNSFVVNNP-VFVIYGTQ 407
           +P++ L F     F + +  VFV    Q
Sbjct: 275 MPAISLHFDDGARFDLGSSGVFVERSVQ 302


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 86/319 (26%), Positives = 126/319 (39%), Gaps = 57/319 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP V  L+ALD  SDL W+ C  C RC P S   ++          P  S++   +
Sbjct: 145 IAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFD----------PRHSTSYGEM 194

Query: 166 SCSHRLCD-LGTSCQN--PKQPCPYTM-----DYYTENTSSSGLLVEDILHLISGGDNAL 217
           +     C  LG S      +  C YT+     D +   ++S G LVE+ L    G     
Sbjct: 195 NYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGG----- 249

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
               QA + IGCG    G  L G    G++GL  G+IS+P  +A  G    SFS C    
Sbjct: 250 --VRQAYLSIGCGHDNKG--LFGAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLVDF 304

Query: 278 DSG------RIFFG----DQGPATQQSTSFLASNGK--YITYIIGVETCCIGSSCLKQTS 325
            SG       + FG    D  P    + + L  N    Y   +IGV    +    + +  
Sbjct: 305 ISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERD 364

Query: 326 FK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCY-- 371
            +          I+DSG++ T L +  Y      F            G P   +  CY  
Sbjct: 365 LQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTV 424

Query: 372 --KSSSQRLPKLPSVKLMF 388
             ++  +   K+P+V + F
Sbjct: 425 GGRAGLRHCVKVPAVSMHF 443


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 122/299 (40%), Gaps = 47/299 (15%)

Query: 112 PNVSFLVALDAGSDLLW---IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           P V   V LD+ SD+ W   +PC    C P   S+Y+          PS S TS   SCS
Sbjct: 25  PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYD----------PSRSPTSAAFSCS 74

Query: 169 HRLCD-LG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
              C  LG     C N +  C Y +  Y + +S+SG  + D+L L +G  NA+       
Sbjct: 75  SPTCTALGPYANGCANNQ--CQYLVR-YPDGSSTSGAYIADLLTLDAG--NAVSG----- 124

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
              GC   + G +    A  G++ LG G  S+  L   A    N+FS C     S   FF
Sbjct: 125 FKFGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFF 180

Query: 285 GDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSF 336
               P    S    T  +        Y + + T  +G   L      F A  ++DS ++ 
Sbjct: 181 TLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAI 240

Query: 337 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWK----CCYKSSSQRLPKLPSVKLMFPQN 391
           T LP   Y+ + A F      ++T +   P K     CY  +     +LP + L+F +N
Sbjct: 241 TRLPPTAYQALRAAF----RSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRN 295


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 93/391 (23%), Positives = 152/391 (38%), Gaps = 77/391 (19%)

Query: 30  KLIHRFS--------EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQF 81
           KL HR+S         E   LG+SK             ++ Q L+  + ++ +   G   
Sbjct: 25  KLQHRYSGLEGSSKQNEKLGLGMSK-------------QHLQHLVEHNDRRGRFLQG--- 68

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRC---- 136
            + FP +G+ +     D G L+YT I +G P     V +D GSD+LW+ C  C  C    
Sbjct: 69  -ISFPLKGNYS-----DLG-LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQ 121

Query: 137 ---APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT 193
               PLS    ++              T + + CS                C Y +  Y 
Sbjct: 122 DIIPPLSIYNLSASSTSSVSSCSDPLCTGEEVVCSR---------SGNNSACAY-VSSYQ 171

Query: 194 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 253
           + ++S G  V D +H +  G NA      + +  GC    +G +      DG++G GL  
Sbjct: 172 DKSASVGAYVRDDMHYVLHGGNA----TTSRIFFGCATNITGSW----PVDGIMGFGLIS 223

Query: 254 ISVPSLLAKAGLIRNSFSMCF--DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGV 311
            +VP+ +A    +   FS C   +K   G + FG+    T+   + L +   +  Y + +
Sbjct: 224 KTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEAPNTTEMVFTPLLNVTTH--YNVDL 281

Query: 312 ETCCIGSSCL----KQTSF--------KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
            +  + S  L    K+ S+          I+DSG++F  L  +    +  E        +
Sbjct: 282 LSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKL 341

Query: 360 -TSFEGYPWKCCY-KSSSQRLPKLPSVKLMF 388
               EG   +C Y KS        P+V L F
Sbjct: 342 GPKLEG--LECFYLKSGLTMETSFPNVTLTF 370


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 135/319 (42%), Gaps = 35/319 (10%)

Query: 84  LFPSQGSKTMSL--GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           +FP + + T+ +  G   G   Y   + +GTP   F +  D GSD+ W      +C P  
Sbjct: 97  MFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITW-----TQCEPCV 151

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT----ENT 196
            + Y   +  LN   PS S++ K++SCS  LC L  S +   Q C  +   Y     + +
Sbjct: 152 KTCYKQKEPRLN---PSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGS 208

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G    + L L S   N  KN      + GCG + +          GL+GLG  ++++
Sbjct: 209 YSIGFFATETLTLSS--SNVFKN-----FLFGCGQQNN---GLFGGAAGLLGLGRTKLAL 258

Query: 257 PSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           PS  AK    +  FS C     S  G +  G Q   + + T   A       Y + +   
Sbjct: 259 PSQTAKT--YKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGL 316

Query: 315 CIGSSCLK--QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKC 369
            +G   L   +++F A  ++DSG+  T L    Y  +++ F   + D   S  GY  +  
Sbjct: 317 SVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTD-YPSTSGYSIFDT 375

Query: 370 CYKSSSQRLPKLPSVKLMF 388
           CY  S     ++P V + F
Sbjct: 376 CYDFSKYDTVRIPKVGVTF 394


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/300 (28%), Positives = 124/300 (41%), Gaps = 47/300 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP   F V  D GSD  W     V+C P  A  Y   +     + P+ S+T  ++S
Sbjct: 100 VRLGTPAERFTVVFDTGSDTTW-----VQCQPCVAYCYRQKE---PLFDPTKSATYANIS 151

Query: 167 CSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           CS   C DL  S C      C Y +  Y + + + G   +D L L     + +KN     
Sbjct: 152 CSSSYCSDLYVSGCSGGH--CLYGIQ-YGDGSYTIGFYAQDTLTLAY---DTIKN----- 200

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF 283
              GCG K  G  L G A  GL+GLG G+ S+P     K G +   F+ C     +G  F
Sbjct: 201 FRFGCGEKNRG--LFGRA-AGLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGF 254

Query: 284 FGDQGP----ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGS 334
             D GP    A  + T  L   G    Y +G+    +G   L       ++   +VDSG+
Sbjct: 255 L-DLGPGAPAANARLTPMLVDRGPTF-YYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGT 312

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLP--KLPSVKLMF 388
             T LP   Y  + + F + +      +   P       CY  +  +     LP+V L+F
Sbjct: 313 VITRLPPSAYAPLRSAFSKAMQG--LGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 370


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 89/349 (25%), Positives = 144/349 (41%), Gaps = 64/349 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP    L+ LD GSD++W+ C  C RC   S   ++          P  S +
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFD----------PRRSRS 189

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
              + C+  LC   D G  C   +  C Y +  Y + + ++G    + L    G      
Sbjct: 190 YNAVGCAAPLCRRLDSG-GCDLRRSACLYQV-AYGDGSVTAGDFATETLTFAGG------ 241

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
            +  A V +GCG    G +   VA  GL+GLG G +S P+ +++      SFS C  D+ 
Sbjct: 242 -ARVARVALGCGHDNEGLF---VAAAGLLGLGRGSLSFPTQISR--RYGRSFSYCLVDRT 295

Query: 278 DSGR-------IFFGDQGPATQQSTSF--LASNGKYITY----IIGVETCCIGSSCLKQT 324
            S         + FG     +  ++SF  +  N +  T+    +IG+         +  +
Sbjct: 296 SSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANS 355

Query: 325 SFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS 374
             +          IVDSG+S T L +  Y  +   F         S  G+  +  CY  S
Sbjct: 356 DLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLS 415

Query: 375 SQRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
            +++ K+P+V + F         P+N    V++     F   GT  GVS
Sbjct: 416 GRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVS 464


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/310 (23%), Positives = 122/310 (39%), Gaps = 64/310 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I IG P V  L+ +D GSDL WI C   +C P +  +++          PS SST ++ S
Sbjct: 92  ISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQTIPFFH----------PSRSSTYRNAS 141

Query: 167 CSHRLCDLGTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           C      +    ++ K   C Y +  Y + +++ G+L ++ L   +  +  +    + ++
Sbjct: 142 CESAPHAMPQIFRDEKTGNCRYHLR-YRDFSNTRGILAKEKLTFQTSDEGLIS---KPNI 197

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN---SFSMCFDKDDSGRI 282
           + GCG   SG         G++GLG G  S+        + RN    FS C         
Sbjct: 198 VFGCGQDNSG----FTQYSGVLGLGPGTFSI--------VTRNFGSKFSYC--------- 236

Query: 283 FFGDQGPATQQSTSFLASNGKYI------------TYIIGVETCCIGSSCLK-------- 322
            FG     T      +  NG  I             Y + ++   +G   L         
Sbjct: 237 -FGSLIDPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQR 295

Query: 323 -QTSFKAIVDSGSSFTFLPKEVYETIAAEFD---RQVNDTITSFEGYPWKCCYKSSSQRL 378
            ++    ++D+G S T L +E YET++ E D    +V   +  +E Y   C   +    L
Sbjct: 296 YRSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDL 355

Query: 379 PKLPSVKLMF 388
              P V   F
Sbjct: 356 YGFPVVTFHF 365


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 108/281 (38%), Gaps = 32/281 (11%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I  G+P     + +D GS L W      +C P S  Y   +     +Y P+AS T +   
Sbjct: 62  IHFGSPQKKQFLHMDTGSSLTW-----TQCFPCSDCYAQKI---YPKYRPAASITYRDAM 113

Query: 167 C--SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C  SH   +   +     + C Y   +Y + T+  G L ++++  +   D   K      
Sbjct: 114 CEDSHPKSNPHFAFDPLTRICTY-QQHYLDETNIKGTLAQEMI-TVDTHDGGFKRV--HG 169

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSG 280
           V  GC     G Y  G    G++GLG+G+ S+       G   + FS C     +   S 
Sbjct: 170 VYFGCNTLSDGSYFTGT---GILGLGVGKYSI------IGEFGSKFSFCLGEISEPKASH 220

Query: 281 RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLP 340
            +  GD        T    + G  I     +E+  +G         +  VD+GS+ + L 
Sbjct: 221 NLILGDGANVQGHPTVINITEGHTI---FQLESIIVGEEITLDDPVQVFVDTGSTLSHLS 277

Query: 341 KEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
             +Y      FD  +     S+E  P  C    + +RL K+
Sbjct: 278 TNLYYKFVDAFDDLIGSRPLSYE--PTLCYKADTIERLEKM 316


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 77/280 (27%), Positives = 121/280 (43%), Gaps = 45/280 (16%)

Query: 100 GWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 158
           G L Y   + IGTP       LD GSDL+W      +CAP +    + L +    ++P  
Sbjct: 98  GDLEYVVDLAIGTPPQPVSALLDTGSDLIW-----TQCAPCA----SCLAQPDPLFAPGE 148

Query: 159 SSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           S++ + + C+ +LC   L   C+ P   C Y  + Y + T + G+   +     S G + 
Sbjct: 149 SASYEPMRCAGQLCSDILHHGCEMPDT-CTYRYN-YGDGTMTMGVYATERFTFTSSGGDR 206

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
           L   +   +  GCG    G   +G    G++G G   +S+ S L+    IR  FS C   
Sbjct: 207 L---MTVPLGFGCGSMNVGSLNNG---SGIVGFGRNPLSLVSQLS----IRR-FSYCLTS 255

Query: 277 DDSGR---IFFGD-----QGPATQ--QSTSFLASNGKYITYIIGVETCCIGSSCLK--QT 324
             SGR   + FG       G AT   Q+T  L S      Y + +    +G+  L+  ++
Sbjct: 256 YGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPES 315

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           +F          IVDSG++ T LP  V   +   F +Q+ 
Sbjct: 316 AFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLR 355


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 130/318 (40%), Gaps = 38/318 (11%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASY 143
           F    SK +S  ++    ++  + IG+P     + +D+GSD++W+ C  C+ C       
Sbjct: 109 FSGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLEC------- 161

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLL 202
           Y   D     + P+ S+T   + C   +C  L TS       C Y +  Y + + + G L
Sbjct: 162 YAQAD---PLFDPATSATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVS-YGDGSYTKGAL 217

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
             + L L   G  A++      V IGCG +  G +   V   GL+GLG G +S+   L  
Sbjct: 218 ALETLTL---GGTAVEG-----VAIGCGHRNRGLF---VGAAGLLGLGWGPMSLVGQLGG 266

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF-LASNGKYIT-YIIGVETCCIGSSC 320
           A     +FS C     +G +  G      + +    L  N +  + Y +G+    +G   
Sbjct: 267 A--AGGAFSYCLASRGAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDER 324

Query: 321 L----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC 370
           L          +  +   ++D+G++ T LP+E Y  +   F   V     +        C
Sbjct: 325 LPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTC 384

Query: 371 YKSSSQRLPKLPSVKLMF 388
           Y  S     ++P+V   F
Sbjct: 385 YDLSGYTSVRVPTVSFYF 402


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 134/326 (41%), Gaps = 51/326 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G++SV   L ++    + FS C     S
Sbjct: 104 QKIPGFTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMS 159

Query: 280 GRIFF---------GDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT--- 324
            R FF         G +  AT+   + T  +A       + + +    +    L  +   
Sbjct: 160 ERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSI 219

Query: 325 -SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            S K +V DSGS  +++P      ++    R++     + E    + CY   S     +P
Sbjct: 220 FSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMP 278

Query: 383 SVKLMFPQNNSFVV-NNPVFVIYGTQ 407
           ++ L F     F + ++ VFV    Q
Sbjct: 279 AISLHFDDGARFDLGSHGVFVERSVQ 304


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 129/308 (41%), Gaps = 45/308 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +  GTP  +  + +D GSDL+W PC     C  C+      +++ +   N + P +SS+S
Sbjct: 94  LSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCS------FSTSNPSSNIFIPKSSSSS 147

Query: 163 KHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           K L C +  C    G+  Q+  + C  T    T+       +    L+ +   D+     
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQ-------ICPPYLNFLRFWDH---RR 197

Query: 221 VQASVIIGCGMKQS-----GGYLDGVAPDGLIG-LGLGEISVPSLLAKAGLIRNSFSMCF 274
            Q    + C + QS      G+  G  P  L   LGL + S   L  +      S S+  
Sbjct: 198 SQFHRRMLCPLHQSTRREISGF--GRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVL 255

Query: 275 D-KDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------ 327
           D + DSG    G       Q+      +   + Y +G+    +G   +K   +K      
Sbjct: 256 DGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVK-IPYKYLIPGA 314

Query: 328 -----AIVDSGSSFTFLPKEVYETIAAEFDRQV-NDTITSFEGYP-WKCCYKSSSQRLPK 380
                 I+DSG++FT++  E++E +AAEF++QV +   T  EG    + C+  S    P 
Sbjct: 315 DGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPS 374

Query: 381 LPSVKLMF 388
            P + L F
Sbjct: 375 FPELTLKF 382


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 135/319 (42%), Gaps = 35/319 (10%)

Query: 84  LFPSQGSKTMSL--GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           +FP + + T+ +  G   G   Y   + +GTP   F +  D GSD+ W      +C P  
Sbjct: 109 MFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITW-----TQCEPCV 163

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYT----ENT 196
            + Y   +  LN   PS S++ K++SCS  LC L  S +   Q C  +   Y     + +
Sbjct: 164 KTCYKQKEPRLN---PSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGS 220

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G    + L L S   N  KN      + GCG + +          GL+GLG  ++++
Sbjct: 221 YSIGFFATETLTLSS--SNVFKN-----FLFGCGQQNN---GLFGGAAGLLGLGRTKLAL 270

Query: 257 PSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           PS  AK    +  FS C     S  G +  G Q   + + T   A       Y + +   
Sbjct: 271 PSQTAKT--YKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGL 328

Query: 315 CIGSSCLK--QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKC 369
            +G   L   +++F A  ++DSG+  T L    Y  +++ F   + D   S  GY  +  
Sbjct: 329 SVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTD-YPSTSGYSIFDT 387

Query: 370 CYKSSSQRLPKLPSVKLMF 388
           CY  S     ++P V + F
Sbjct: 388 CYDFSKYDTVRIPKVGVTF 406


>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
          Length = 411

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 108/260 (41%), Gaps = 27/260 (10%)

Query: 96  GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   H+   ++I  P   + + +D GS L W+ CD  C+ C  +    Y        
Sbjct: 30  GNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKP------ 83

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHL-I 210
           E   +   T +   C+    DL    +  PK  C Y + Y     SS G+L+ D   L  
Sbjct: 84  ELKYAVKCTEQR--CADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPA 139

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP-DGLIGLGLGEISVPSLLAKAGLI-RN 268
           S G N        S+  GCG  Q     +   P +G++GLG G++++ S L   G+I ++
Sbjct: 140 SNGTNP------TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKH 193

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS---SCLKQTS 325
               C      G +FFGD    T   T +   N ++  Y     T    S   S +    
Sbjct: 194 VLGHCISSKGKGFLFFGDAKVPTSGVT-WSPMNREHKHYSPRQGTLHFNSNKQSPISAAP 252

Query: 326 FKAIVDSGSSFTFLPKEVYE 345
            + I DSG+++T+   + Y 
Sbjct: 253 MEVIFDSGATYTYFALQPYH 272


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 147/343 (42%), Gaps = 56/343 (16%)

Query: 93  MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDL 151
           ++ G +   L+Y  + IG  N +  V +D GSDL W+ CD C+ C       +N  +   
Sbjct: 122 LASGINLETLNYI-VTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSS 180

Query: 152 NEYSPSASSTSKHLSCSHRLCDLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
                  SST ++L  +    +   +C+ N    C +T+ Y   + +   L VE   HL 
Sbjct: 181 YNSLLCNSSTCQNLQFTTGNTE---ACESNNPSSCNHTVSYGDGSFTDGELGVE---HLS 234

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSF 270
            GG +       ++ + GCG + + G   GV+  G++GLG   +S+ S           F
Sbjct: 235 FGGISV------SNFVFGCG-RNNKGLFGGVS--GIMGLGRSNLSMISQTNTT--FGGVF 283

Query: 271 SMCF---DKDDSGRIFFGDQGPATQQSTSF----LASNGK----YITYIIGVETCCIGSS 319
           S C    D   SG +  G++    +  T      + SN +    Y+  + G++   +G  
Sbjct: 284 SYCLPTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGID---VGGV 340

Query: 320 CLKQTSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKC 369
            ++ TSF     ++DSG+  T L   +Y  + AEF +Q       F GYP          
Sbjct: 341 AIQDTSFGNGGILIDSGTVITRLAPSLYNALKAEFLKQ-------FSGYPIAPALSILDT 393

Query: 370 CYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIY----GTQV 408
           C+  +      +P++ + F  N    V + V ++Y    G+QV
Sbjct: 394 CFNLTGIEEVSIPTLSMHFENNVDLNV-DAVGILYMPKDGSQV 435


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 75/299 (25%), Positives = 125/299 (41%), Gaps = 49/299 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP VS+   LD GSDL+W  C  C RC       ++          P  SS+   +
Sbjct: 112 LAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFD----------PKKSSSFSKV 161

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           SC   LC     ++C +    C Y    Y + + + G+L  +       G +  K SV  
Sbjct: 162 SCGSSLCSALPSSTCSD---GCEYVYS-YGDYSMTQGVLATETFTF---GKSKNKVSVH- 213

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR-- 281
           ++  GCG    G   +  +  GL+GLG G +S+ S L +       FS C    D  +  
Sbjct: 214 NIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQLKE-----QRFSYCLTPIDDTKES 266

Query: 282 -IFFGDQGPATQQ----STSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK------- 327
            +  G  G         +T  L +  +   Y + +E   +G + L  ++++F+       
Sbjct: 267 VLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNG 326

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLPKL 381
             I+DSG++ T++ ++ YE +  EF  Q    +          C+     S+   +PKL
Sbjct: 327 GVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKL 385


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 135/319 (42%), Gaps = 35/319 (10%)

Query: 84  LFPSQGSKTMSL--GNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           +FP + + T+ +  G   G   Y   + +GTP   F +  D GSD+ W      +C P  
Sbjct: 49  MFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITW-----TQCEPCV 103

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYT----MDYYTENT 196
            + Y   +  LN   PS S++ K++SCS  LC L  S +   Q C  +       Y + +
Sbjct: 104 KTCYKQKEPRLN---PSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGS 160

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G    + L L S   N  KN      + GCG + +          GL+GLG  ++++
Sbjct: 161 YSIGFFATETLTLSS--SNVFKN-----FLFGCGQQNN---GLFGGAAGLLGLGRTKLAL 210

Query: 257 PSLLAKAGLIRNSFSMCFDKDDS--GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETC 314
           PS  AK    +  FS C     S  G +  G Q   + + T   A       Y + +   
Sbjct: 211 PSQTAKT--YKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGL 268

Query: 315 CIGSSCLK--QTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKC 369
            +G   L   +++F A  ++DSG+  T L    Y  +++ F   + D   S  GY  +  
Sbjct: 269 SVGGRQLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTD-YPSTSGYSIFDT 327

Query: 370 CYKSSSQRLPKLPSVKLMF 388
           CY  S     ++P V + F
Sbjct: 328 CYDFSKYDTVRIPKVGVTF 346


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 131/323 (40%), Gaps = 61/323 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP   F + +D+GSDLLW     V+CAP    Y     +D   Y+PS SST   + C 
Sbjct: 71  LGTPPQKFSLIVDSGSDLLW-----VQCAPCLQCY----AQDTPLYAPSNSSTFNPVPCL 121

Query: 169 HRLCDLGTSCQNPKQPCPY------TMDY-YTENTSSSGLLVEDILHLISGGDNALKNSV 221
              C L  + +    PC +        +Y Y + + S G+            ++A  + V
Sbjct: 122 SPECLLIPATEG--FPCDFHYPGACAYEYRYADTSLSKGVFAY---------ESATVDDV 170

Query: 222 QAS-VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----D 275
           +   V  GCG    G +    A  G++GLG G +S  S +  A    N F+ C       
Sbjct: 171 RIDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDP 225

Query: 276 KDDSGRIFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLKQT-------- 324
              S  + FGD+  +T     F  + SN +  T Y + +E   +G   L  +        
Sbjct: 226 TSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDF 285

Query: 325 --SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKL 381
             +  +I DSG++ T+     Y  I A FD+ V      S +G     C   +    P  
Sbjct: 286 LGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQG--LDLCVDVTGVDQPSF 343

Query: 382 PSVKLMF-------PQNNSFVVN 397
           PS  ++        PQ  ++ V+
Sbjct: 344 PSFTIVLGGGAVFQPQQGNYFVD 366


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 77/322 (23%), Positives = 138/322 (42%), Gaps = 53/322 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP  ++   +D GSDL+W  C  C  C           D+    + P  SS+   L
Sbjct: 101 LAIGTPAETYSAIMDTGSDLIWTQCKPCKVC----------FDQPTPIFDPEKSSSFSKL 150

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            CS  LC +     +    C Y    Y +++S+ G+L  +       GD ++     + +
Sbjct: 151 PCSSDLC-VALPISSCSDGCEYRYS-YGDHSSTQGVLATETFTF---GDASV-----SKI 200

Query: 226 IIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG--RI 282
             GCG    G  Y  G    GL+GLG G +   SL+++ G+ + S+ +    D  G   +
Sbjct: 201 GFGCGEDNRGRAYSQGA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTL 254

Query: 283 FFGDQGPATQQS---TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--------I 329
             G +  AT +S   T  + +  +   Y + +E   +G + L  ++++F          I
Sbjct: 255 LVGSE--ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLI 312

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKL---- 381
           +DSG++ T+L    +  +  EF  Q+   + +      + C+      S   +P+L    
Sbjct: 313 IDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHF 372

Query: 382 PSVKLMFPQNNSFVVNNPVFVI 403
             V L  P+ N  + ++ + VI
Sbjct: 373 EGVDLKLPKENYIIEDSALRVI 394


>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 435

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 71/240 (29%), Positives = 100/240 (41%), Gaps = 51/240 (21%)

Query: 73  QKMKTGPQFQMLFPSQGSKT-------MSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGS 124
           Q++ T   F ++ PS   ++       + +  D   L Y T I   TP V   + LD G 
Sbjct: 7   QRLFTLFLFSLIAPSLAQQSFRPRALVVPVKKDASTLQYITQIKQRTPLVPENLVLDIGG 66

Query: 125 DLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS--CQN-- 180
             LW+ CD                   N Y    SST +   C    C L  S  C N  
Sbjct: 67  QFLWVDCD-------------------NNY---VSSTYRPARCGSAQCSLARSDSCGNCF 104

Query: 181 --PK-----QPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNSVQASVIIGCG-- 230
             PK       C  T D     T++SG L +D++ L S  G N ++N+  +  +  C   
Sbjct: 105 SAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVSLQSTNGFNPIQNATVSRFLFSCAPT 164

Query: 231 -MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGP 289
            + Q  G   GV+  G+ GLG   I++PS LA A   R  F++C    + G  FFGD GP
Sbjct: 165 FLLQ--GLATGVS--GMAGLGRTRIALPSQLASAFSFRRKFAVCLSSSN-GVAFFGD-GP 218


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 46/318 (14%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P+   + +S GN     +   + +GTP   + V  D GSD  W     V+C P     Y
Sbjct: 150 LPATSGRAVSTGN-----YVVTVGLGTPASKYTVVFDTGSDTTW-----VQCRPCVVKCY 199

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLL 202
              +     + P+ SST  ++SC+   C DL T+ C      C Y +  Y + + + G  
Sbjct: 200 KQKE---PLFDPAKSSTYANVSCTDSACADLDTNGCTGGH--CLYAVQ-YGDGSYTVGFF 253

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
            +D L +     +A+K         GCG K +G +       GL+GLG G+ S+   +  
Sbjct: 254 AQDTLTIA---HDAIKG-----FRFGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQA 300

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS----TSFLASNGKYITYI------IGVE 312
                 +F+ C     +G  +  D GP +  +    T  L   G+   Y+      +G +
Sbjct: 301 YNKYGGAFAYCLPALTTGTGYL-DFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQ 359

Query: 313 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR-QVNDTITSFEGYP-WKCC 370
              +  S    ++   +VDSG+  T LP   Y  +++ FD+  +        GY     C
Sbjct: 360 QVPVAESVF--STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTC 417

Query: 371 YKSSSQRLPKLPSVKLMF 388
           Y  +     +LP+V L+F
Sbjct: 418 YDFTGLSDVELPTVSLVF 435


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 133/328 (40%), Gaps = 49/328 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + T + +GTP+ + +V +D GS   W+ C+C  C     ++             S S+T 
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTC 49

Query: 163 KHLSCSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
             +SC   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L          
Sbjct: 50  AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF-------- 100

Query: 218 KNSVQA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
            + VQ   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C  
Sbjct: 101 -SDVQKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLP 155

Query: 276 KDDSGRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT- 324
              S R FF         G     T  + T  +A       + + +    +    L  + 
Sbjct: 156 LQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSP 215

Query: 325 ---SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
              S K +V DSGS  +++P      ++    R++     + E    + CY   S     
Sbjct: 216 SIFSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGD 274

Query: 381 LPSVKLMFPQNNSFVVNNP-VFVIYGTQ 407
           +P++ L F     F + +  VFV    Q
Sbjct: 275 MPAISLHFDDGARFDLGSRGVFVERSVQ 302


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/325 (24%), Positives = 131/325 (40%), Gaps = 39/325 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +Y  + +G+P   + + +D GS L W+ C  CV         Y  +  D   + PSAS T
Sbjct: 13  YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCV--------VYCHVQAD-PLFDPSASKT 63

Query: 162 SKHLSCSHRLCDLGTS-------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
            K LSC+   C            C+     C YT   Y +++ S G L +D+L L     
Sbjct: 64  YKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTAS-YGDSSYSMGYLSQDLLTLA---- 118

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
               +      + GCG    G  L G A  G++GLG  ++S+   ++       +FS C 
Sbjct: 119 ---PSQTLPGFVYGCGQDSEG--LFGRAA-GILGLGRNKLSMLGQVSSK--FGYAFSYCL 170

Query: 275 -DKDDSGRIFFGDQGPA--TQQSTSFLASNGKYITYIIGVETCCIGSSCL----KQTSFK 327
             +   G +  G    A    + T      G    Y + +    +G   L     Q    
Sbjct: 171 PTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP 230

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKL 386
            I+DSG+  T LP  VY      F + ++       G+     C+K + + +  +P V+L
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRL 290

Query: 387 MFPQNNSFVVNNPVFVIYGTQVGVS 411
           +F Q  + +   PV V+     G++
Sbjct: 291 IF-QGGADLNLRPVNVLLQVDEGLT 314


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 128/323 (39%), Gaps = 59/323 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP  ++   +D GSDL+W  C  C  C           D+    + P  SS+   L C
Sbjct: 103 IGTPAETYSAIMDTGSDLIWTQCKPCKDC----------FDQPTPIFDPKKSSSFSKLPC 152

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           S  LC        P   C    +Y   Y + +S+ G+L  +          A  ++  + 
Sbjct: 153 SSDLC-----AALPISSCSDGCEYLYSYGDYSSTQGVLATETF--------AFGDASVSK 199

Query: 225 VIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR-- 281
           +  GCG    G G+  G    GL+GLG G +S+ S L +       FS C    D  +  
Sbjct: 200 IGFGCGEDNDGSGFSQGA---GLVGLGRGPLSLISQLGEP-----KFSYCLTSMDDSKGI 251

Query: 282 --IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL--KQTSFKA-------- 328
             +  G +       T+ L  N    + Y + +E   +G + L  ++++F          
Sbjct: 252 SSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGL 311

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY----KSSSQRLPKL--- 381
           I+DSG++ T+L    +  +  EF  Q+   +          C+     +S+  +P+L   
Sbjct: 312 IIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFH 371

Query: 382 -PSVKLMFPQNNSFVVNNPVFVI 403
                L  P  N  + ++ + VI
Sbjct: 372 FEGADLKLPAENYIIADSGLGVI 394


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 92/397 (23%), Positives = 167/397 (42%), Gaps = 70/397 (17%)

Query: 40  KALGVSKNRNATSWP-AKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGND 98
           K +   KN+N  S    KK+ E     ++S V++Q        Q++   +   T+  G  
Sbjct: 102 KRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAG------QLVATLESGMTLGSGE- 154

Query: 99  FGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPS 157
               ++  + +G+P   F + LD GSDL WI C  C  C   + ++Y+          P 
Sbjct: 155 ----YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYD----------PK 200

Query: 158 ASSTSKHLSCSHRLCDLGT------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL-- 209
           AS++ K+++C+   C+L +       C++  Q CPY   +Y ++++++G    +   +  
Sbjct: 201 ASASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYY-WYGDSSNTTGDFAVETFTVNL 259

Query: 210 -ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
             SGG + L N    +++ GCG    G +        L+GLG G +S  S L    L  +
Sbjct: 260 TTSGGSSELYNV--ENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGH 312

Query: 269 SFSMCF-----DKDDSGRIFFGDQGPATQQS----TSFLASNGKYIT--YIIGVETCCIG 317
           SFS C      D + S ++ FG+            TSF+A     +   Y + +++  + 
Sbjct: 313 SFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVA 372

Query: 318 SSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP- 366
              L             +   I+DSG++ ++  +  YE I  +   +       +  +P 
Sbjct: 373 GEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPI 432

Query: 367 WKCCYKSSSQRLPKLPSVKLM--------FPQNNSFV 395
              C+  S     +LP + +         FP  NSF+
Sbjct: 433 LDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFI 469


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 137/341 (40%), Gaps = 41/341 (12%)

Query: 82  QMLFPSQGSKTMSLGNDFG-WLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
            +L P+  S  ++ G   G   +Y  + +GTP   + + LD GS L W+ C    CA   
Sbjct: 103 HLLEPNSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQ--PCAVYC 160

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-------CQNPKQPCPYTMDYYT 193
            +  + L      Y PS S T K LSC+   C    +       C+     C YT   Y 
Sbjct: 161 HAQADPL------YDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTAS-YG 213

Query: 194 ENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGE 253
           + + S G L +D+L L S       +        GCG    G  L G A  G+IGL   +
Sbjct: 214 DTSFSIGYLSQDLLTLTS-------SQTLPQFTYGCGQDNQG--LFGRA-AGIIGLARDK 263

Query: 254 ISVPSLLA-KAGLIRNSFSMCFDKDDSGRIFFGDQ-----GPATQQSTSFLASNGKYITY 307
           +S+ + L+ K G   ++FS C    +SG    G        P + + T  L  +     Y
Sbjct: 264 LSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLY 320

Query: 308 IIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 363
            + +    +    L   +       ++DSG+  T LP  +Y  +   F + ++       
Sbjct: 321 FLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAP 380

Query: 364 GYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
            Y     C+K S + +  +P +K++F       +  P  +I
Sbjct: 381 AYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILI 421


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 75/322 (23%), Positives = 127/322 (39%), Gaps = 54/322 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + +D GSDL W+ C  C+ C           D+    + P AS++ ++++C
Sbjct: 156 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FDQRGPVFDPMASTSYRNVTC 205

Query: 168 SHRLCDLGTSCQNPK-------QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
               C L +    P+        PCPY   +Y + ++++G L    L   +    A  + 
Sbjct: 206 GDTRCGLVSPPAAPRTCRSSRSDPCPYYY-WYGDQSNTTGDLA---LEAFTVNLTASSSR 261

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG 280
               V++GCG +  G +       GL    L   S   L A  G   ++FS C     S 
Sbjct: 262 RVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HAFSYCLVDHGSA 316

Query: 281 ---RIFFGDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCL-----------K 322
              +I FGD            T+F  S  +   Y + ++   +G   L           +
Sbjct: 317 VGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKE 376

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKL 381
             S   I+DSG++ ++ P+  Y+ I   F  +++        +P    CY  S     ++
Sbjct: 377 DGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEV 436

Query: 382 PSVKLM--------FPQNNSFV 395
           P   L+        FP  N F+
Sbjct: 437 PEFSLLFADGAVWDFPAENYFI 458


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 132/328 (40%), Gaps = 49/328 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + T + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T 
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTC 49

Query: 163 KHLSCSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
             +SC   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L          
Sbjct: 50  AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF-------- 100

Query: 218 KNSVQA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
            + VQ   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C  
Sbjct: 101 -SDVQKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLP 155

Query: 276 KDDSGRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT- 324
              S R FF         G     T  + T  +A       + + +    +    L  + 
Sbjct: 156 LQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSP 215

Query: 325 ---SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
              S K +V DSGS  +++P      ++    R++     + E    + CY   S     
Sbjct: 216 SIFSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGD 274

Query: 381 LPSVKLMFPQNNSFVV-NNPVFVIYGTQ 407
           +P++ L F     F +  + VFV    Q
Sbjct: 275 MPAISLHFDDGARFDLGRHGVFVERSVQ 302


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 134/350 (38%), Gaps = 61/350 (17%)

Query: 60  EYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVA 119
            Y+ +  SSD    ++++G         Q    M L             IGTP V F+  
Sbjct: 71  RYFTMSTSSDAGPARLRSG---------QAEYLMELA------------IGTPPVPFVAL 109

Query: 120 LDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 178
            D GSDL W  C  C  C P     Y++         P AS+T   +  S        +C
Sbjct: 110 ADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASATCLPIWSSR-------NC 162

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 238
                PC Y    Y +   S+G+L  + L        A   SV   +  GCG+   G   
Sbjct: 163 TASSSPCRYRYA-YGDGAYSAGVLGTETLTF----PGAPGVSV-GGIAFGCGVDNGGLSY 216

Query: 239 DGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC--FDKDDSGRIFFGD----QGPATQ 292
           +     G +GLG G +   SL+A+ G+ + S+ +   F+      + FG       P+T 
Sbjct: 217 NST---GTVGLGRGSL---SLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPSTG 270

Query: 293 ---QSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFL 339
              QST  + S      Y + +E   +G + L             S   IVDSG++FTFL
Sbjct: 271 AAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTFL 330

Query: 340 PKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSS-SQRLPKLPSVKLMF 388
            +  +  +       +   + +       C   ++  Q+LP +P + L F
Sbjct: 331 VESAFRVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHF 380


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 127/308 (41%), Gaps = 56/308 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP + FL   D GSDL+W      +CAP S   +    +    Y+PS+S+T   L 
Sbjct: 89  LAIGTPPLPFLAIADTGSDLIW-----TQCAPCSRQCFQ---QPTPLYNPSSSTTFSALP 140

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA-SV 225
           C+  L     +C      C Y M Y      S    V       + G +   + V+   +
Sbjct: 141 CNSSLGLCAPACA-----CMYNMTY-----GSGWTYVFQGTETFTFGSSTPADQVRVPGI 190

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGR 281
             GC    SG   +  +  GL+GLG G +S+ S L         FS C     D + +  
Sbjct: 191 AFGCSNASSG--FNASSASGLVGLGRGSLSLVSQLGAP-----KFSYCLTPYQDTNSTST 243

Query: 282 IFFG------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----KQTSFKA--- 328
           +  G      D G     ST F+AS    I Y + +    +G++ L       S KA   
Sbjct: 244 LLLGPSASLNDTG--VVSSTPFVASPSS-IYYYLNLTGISLGTTALPIPPNAFSLKADGT 300

Query: 329 ---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYK--SSSQRLPK 380
              I+DSG++ T L    Y+ + A     V  T+ + +G        C++  SS+   P 
Sbjct: 301 GGLIIDSGTTITMLGNTAYQQVRAAVLSLV--TLPTTDGSAATGLDLCFELPSSTSAPPS 358

Query: 381 LPSVKLMF 388
           +PS+ L F
Sbjct: 359 MPSMTLHF 366


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 85/300 (28%), Positives = 124/300 (41%), Gaps = 47/300 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP   F V  D GSD  W     V+C P  A  Y   +     + P+ S+T  ++S
Sbjct: 165 VRLGTPAERFTVVFDTGSDTTW-----VQCQPCVAYCYRQKE---PLFDPTKSATYANIS 216

Query: 167 CSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           CS   C DL  S C      C Y +  Y + + + G   +D L L     + +KN     
Sbjct: 217 CSSSYCSDLYVSGCSGGH--CLYGIQ-YGDGSYTIGFYAQDTLTLAY---DTIKN----- 265

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNSFSMCFDKDDSGRIF 283
              GCG K  G  L G A  GL+GLG G+ S+P     K G +   F+ C     +G  F
Sbjct: 266 FRFGCGEKNRG--LFGRA-AGLLGLGRGKTSLPVQAYDKYGGV---FAYCLPATSAGTGF 319

Query: 284 FGDQGP----ATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAIVDSGS 334
             D GP    A  + T  L   G    Y +G+    +G   L       ++   +VDSG+
Sbjct: 320 L-DLGPGAPAANARLTPMLVDRGPTF-YYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGT 377

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLP--KLPSVKLMF 388
             T LP   Y  + + F + +      +   P       CY  +  +     LP+V L+F
Sbjct: 378 VITRLPPSAYAPLRSAFSKAMQG--LGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 435


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/322 (25%), Positives = 136/322 (42%), Gaps = 62/322 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP  ++   +D GSDL+W  C  C +C           D+    + P  SS+   L
Sbjct: 101 LAIGTPPETYSAIMDTGSDLIWTQCKPCTQC----------FDQPTPIFDPKKSSSFSKL 150

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           SCS +LC+       P+  C    +Y   Y + +S+ G+L  + L          K SV 
Sbjct: 151 SCSSKLCE-----ALPQSTCSDGCEYLYGYGDYSSTQGMLASETLTFG-------KVSV- 197

Query: 223 ASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDD 278
             V  GCG    G G+  G    GL+GLG G +S+ S L +       FS C    D   
Sbjct: 198 PEVAFGCGEDNEGSGFSQG---SGLVGLGRGPLSLVSQLKEP-----KFSYCLTSVDDTK 249

Query: 279 SGRIFFGDQGPATQ-----QSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--- 328
           +  +  G            ++T  + ++ +   Y + +E   +G + L  K+++F     
Sbjct: 250 ASTLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQED 309

Query: 329 -----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLP 379
                I+DSG++ T+L +  ++ +A EF  Q+N  + +      + C+     S+   +P
Sbjct: 310 GSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVP 369

Query: 380 KL----PSVKLMFPQNNSFVVN 397
           KL        L  P  N  + +
Sbjct: 370 KLVFHFDGADLELPAENYMIAD 391


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 139/344 (40%), Gaps = 52/344 (15%)

Query: 90  SKTMSLGNDFGW---LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
           +K + +G D G    L+   + +GTP  + +V +D GS   W+ C+C  C     ++   
Sbjct: 66  TKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ- 124

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGL 201
                     S S+T   +SC   +C LG S   CQ+ +    CP+ +  Y + ++S G+
Sbjct: 125 ----------SRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGI 173

Query: 202 LVEDILHLISGGDNALKNSVQA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           L +D L           + VQ       GC M   G    G   DGL+G+G G +SV   
Sbjct: 174 LYQDTLTF---------SDVQKIPGFSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV--- 220

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFF---------GDQGPATQ-QSTSFLASNGKYITYII 309
           L ++    + FS C     S R FF         G     T  + T  +A       + +
Sbjct: 221 LKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFV 280

Query: 310 GVETCCIGSSCLKQT----SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
            +    +    L  +    S K +V DSGS  +++P      ++    R++     + E 
Sbjct: 281 DLTAISVDGERLGLSPSVFSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLKRGAAEE 339

Query: 365 YPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV-NNPVFVIYGTQ 407
              + CY   S     +P++ L F     F + ++ VFV    Q
Sbjct: 340 ESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQ 383


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 132/328 (40%), Gaps = 49/328 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + T + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T 
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTC 49

Query: 163 KHLSCSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
             +SC   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L          
Sbjct: 50  AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF-------- 100

Query: 218 KNSVQA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
            + VQ   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C  
Sbjct: 101 -SDVQKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLP 155

Query: 276 KDDSGRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT- 324
              S R FF         G     T  + T  +A       + + +    +    L  + 
Sbjct: 156 LQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSP 215

Query: 325 ---SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
              S K +V DSGS  +++P      ++    R++     + E    + CY   S     
Sbjct: 216 SIFSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGD 274

Query: 381 LPSVKLMFPQNNSFVVN-NPVFVIYGTQ 407
           +P++ L F     F +  + VFV    Q
Sbjct: 275 MPAISLHFDDGARFDLGIHGVFVERSVQ 302


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 132/318 (41%), Gaps = 46/318 (14%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P+   + +S GN     +   + +GTP   + V  D GSD  W     V+C P     Y
Sbjct: 150 LPATSGRAVSTGN-----YVVTVGLGTPASKYTVVFDTGSDTTW-----VQCRPCVVKCY 199

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSSSGLL 202
               +    + P+ SST  ++SC+   C DL T+ C      C Y +  Y + + + G  
Sbjct: 200 K---QKGPLFDPAKSSTYANVSCTDSACADLDTNGCTGGH--CLYAVQ-YGDGSYTVGFF 253

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
            +D L +     +A+K         GCG K +G +       GL+GLG G+ S+   +  
Sbjct: 254 AQDTLTIA---HDAIKG-----FRFGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQA 300

Query: 263 AGLIRNSFSMCFDKDDSGRIFFGDQGPATQQS----TSFLASNGKYITYI------IGVE 312
                 +F+ C     +G  +  D GP +  +    T  L   G+   Y+      +G +
Sbjct: 301 YNKYGGAFAYCLPALTTGTGYL-DFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQ 359

Query: 313 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDR-QVNDTITSFEGYP-WKCC 370
              +  S    ++   +VDSG+  T LP   Y  +++ FD+  +        GY     C
Sbjct: 360 QVPVAESVF--STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTC 417

Query: 371 YKSSSQRLPKLPSVKLMF 388
           Y  +     +LP+V L+F
Sbjct: 418 YDFTGLSDVELPTVSLVF 435


>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
 gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
          Length = 478

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 118/273 (43%), Gaps = 42/273 (15%)

Query: 115 SFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
           +F + +D GS   ++PC  C  C    A  Y         Y   AS+    + CS     
Sbjct: 46  TFELIVDTGSSRTYLPCKGCASCGAHEAGRY---------YDYDASADFSRVECS-ACAG 95

Query: 174 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 233
           +G  C      C Y + +Y E + S G LV D++ L  GG         A+V+ GC  ++
Sbjct: 96  IGGKC-GTSGVCRYDV-HYLEGSGSEGYLVRDVVSL--GGSVG-----NATVVFGCEERE 146

Query: 234 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS------------GR 281
            G  +   + DGL G G    ++ + LA A +I + FSMC +  +             G 
Sbjct: 147 LGS-IKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGN 205

Query: 282 IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT-SFKAIVDSGSSFTFLP 340
             FG   PA   +   + S+  Y  Y +   +  +G+S ++ +     I+DSG+S+T++P
Sbjct: 206 FDFGADAPALVYTP--MVSSAMY--YQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVP 261

Query: 341 KEVYET---IAAEFDRQVN-DTITSFEGYPWKC 369
             ++     +A +  R+   + +   E YP  C
Sbjct: 262 GNMHARFLQLAEDAARESGLEKVAPPEDYPDLC 294


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 137/357 (38%), Gaps = 87/357 (24%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNSLD-RDLNEYSPSASS 160
           + IGTP     V +D GSDL W+PC     DC  C      Y N++    L  + P+ SS
Sbjct: 25  LSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDC----EEYQNNISGPRLAAFLPTHSS 80

Query: 161 TSKHLSCSHRLCDLGTSCQNP-------------------KQPCPYTMDYYTENTSSSGL 201
           TS   +C    C    S  NP                    +PCP     Y  +   +G 
Sbjct: 81  TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140

Query: 202 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           L  D+  L + G+    N+    +   C       Y +   P G+ G G G +S+P  L 
Sbjct: 141 LTRDV--LFTHGNYNNNNNNNKQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPFQL- 194

Query: 262 KAGLIRNSFSMCF-------DKDDSGRIFFGDQGPATQ----QSTSFLASNGKYITYIIG 310
             G     FS CF       + + S  +  G+   +++    Q T  L S      Y IG
Sbjct: 195 --GFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIG 252

Query: 311 VETCCIG----------SSCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           +E+  IG          S  L++   K     ++DSG+++T LP+ +Y  + +  +  + 
Sbjct: 253 LESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVI- 311

Query: 357 DTITSFEGYP----------WKCCYK-------SSSQRLPKLPSVKLMFPQNNSFVV 396
                  GYP          +  CYK       SS     +LPS+   F  N S V+
Sbjct: 312 -------GYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVL 361


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 131/321 (40%), Gaps = 55/321 (17%)

Query: 96  GNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEY 154
           G +   L+Y    +G       V +D  S+L W+ C  C  C           D+    +
Sbjct: 112 GANLRTLNYVAT-VGLGAAEATVVVDTASELTWVQCQPCESCH----------DQQDPLF 160

Query: 155 SPSASSTSKHLSCSHRLCDL-------GTS-C--QNPKQP-CPYTMDYYTENTSSSGLLV 203
            PS+S +   + C+   CD        GTS C   N +QP C Y + Y  + + S G+L 
Sbjct: 161 DPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSY-RDGSYSRGVLA 219

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAK 262
            D L L +G D           + GCG    G    G +  GL+GLG   +S V   + +
Sbjct: 220 RDKLRL-AGQD-------IEGFVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMDQ 269

Query: 263 AGLIRNSFSMCF---DKDDSGRIFFGDQGPATQQSTSFLAS---------NGKYITYIIG 310
            G +   FS C    +   SG +  GD   A + ST  + +          G +  Y + 
Sbjct: 270 FGGV---FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPF--YFLN 324

Query: 311 VETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
           +    +G   ++   F A   I+DSG+  T L   VY  + AEF  Q+ +   +      
Sbjct: 325 LTGITVGGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSIL 384

Query: 368 KCCYKSSSQRLPKLPSVKLMF 388
             C+  +  +  ++PS+K +F
Sbjct: 385 DTCFNLTGLKEVQVPSLKFVF 405


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 93/351 (26%), Positives = 143/351 (40%), Gaps = 84/351 (23%)

Query: 84  LFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLS 140
           L PS G   M+L             IGTP    L   D GSDL W+   PCD  +C P  
Sbjct: 73  LLPSGGEYMMNLS------------IGTPPFPILAIADTGSDLTWLQSKPCD--QCYPQK 118

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----GTSCQNPKQPCPYTMDYYTENT 196
              ++          PS S+T   L C+   C+       SC +P   C YT   Y +++
Sbjct: 119 GPIFD----------PSNSTTFHKLPCTTAPCNALDESARSCTDPTT-CGYTYS-YGDHS 166

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS 255
            ++G L  D + +     NA   SVQ  +V  GCG +  G + +  +  G++GLG G +S
Sbjct: 167 YTTGYLASDTVTV----GNA---SVQIRNVAFGCGTRNGGNFDEQGS--GIVGLGGGNLS 217

Query: 256 VPSLLAKAGLIRNSFSMCF------------DKDDSGRIFFGDQGPATQQS-------TS 296
             S L     I   FS C             D   + RI FGD    +  S       T+
Sbjct: 218 FVSQLGDT--IGKKFSYCLLPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATT 275

Query: 297 FLASNGKYITYIIGVETCCIGSSCL-------KQTSFKA-----------IVDSGSSFTF 338
            L +      Y + +E   +G   L       K  S+ +           I+DSG++ TF
Sbjct: 276 PLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTF 335

Query: 339 LPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           L +E Y  + A    ++  + +   +   +  C+KS  + + +LP +K+ F
Sbjct: 336 LEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKSGKEEV-ELPLMKVHF 385


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 79/348 (22%), Positives = 136/348 (39%), Gaps = 82/348 (23%)

Query: 65  LLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGN----DFGWLHYTWIDIGTPNVSFLVAL 120
           LL   +Q+ + +       L P+     + +        G  +   + +GTP   F  A+
Sbjct: 46  LLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFTAAI 105

Query: 121 DAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGT-S 177
           D  SDL+W  C  CV+C       Y  LD   N   P AS++   + C+   CD L T  
Sbjct: 106 DTASDLIWTQCQPCVKC-------YKQLDPVFN---PVASTSYAVVPCNSDTCDELDTHR 155

Query: 178 C-----QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 232
           C      + +  C YT   Y  N ++ G+L  D L +   GD+  +      V+ GC   
Sbjct: 156 CARDGDSDDEDACQYTYS-YGGNATTRGILAVDRLAI---GDDVFRG-----VVFGCSSS 206

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIFFGDQGP 289
             GG    V+  G++GLG G +S+ S L+    +R  F  C        +GR+  G    
Sbjct: 207 SVGGPPPQVS--GVVGLGRGALSLVSQLS----VRR-FMYCLPPPVSRSAGRLVLGADAA 259

Query: 290 ATQQSTSF-----LASNGKYIT-YIIGVETCCIGSSCLK--------------------- 322
           AT ++ S      +++  +Y + Y + ++   IG   +                      
Sbjct: 260 ATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPAS 319

Query: 323 --------------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
                           ++  I+D  S+ TFL + +YE +  + + ++ 
Sbjct: 320 PVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIR 367


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 132/327 (40%), Gaps = 61/327 (18%)

Query: 112 PNVSFLVALDAGSDLLWIPC---DCVRCA-------PLSASYYNSLDRDLNEYSPSASST 161
           P+ S  + +D GSDL+W PC   +C+ C        PL+ +  + +       S + SS 
Sbjct: 29  PSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSHRVSCQSPACSTAHSSV 88

Query: 162 SKHLSCSHRLCDLG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           S H  C+   C L     + C +   P  Y   Y   + S    L  D L +       L
Sbjct: 89  SSHDLCAIARCPLDNIETSDCSSATCPPFY---YAYGDGSFIAHLHRDTLSM---SQLFL 142

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC--- 273
           KN        GC       +     P G+ G G G +S+P+ LA  +  + N FS C   
Sbjct: 143 KN-----FTFGCA------HTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVS 191

Query: 274 --FDKDDSGR---IFFGDQGPATQQSTSF----LASNGKY-ITYIIGVETCCIGSSCL-- 321
             FDK+   +   +  G     + +   F    +  N K+   Y +G+    +G   +  
Sbjct: 192 HSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTILA 251

Query: 322 --------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC---- 369
                   ++     +VDSG++FT LP  +Y ++ AEFDR+V            K     
Sbjct: 252 PEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLGP 311

Query: 370 CYKSSSQRLPKLPSVKLMFPQNNSFVV 396
           CY    + L ++P+V   F  NNS V+
Sbjct: 312 CY--FLEGLVEVPTVTWHFLGNNSNVM 336


>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
 gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
          Length = 500

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 89/206 (43%), Gaps = 41/206 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I+  TP V   + +D G   LW+ C+                   N Y+   SST + + 
Sbjct: 53  INQRTPLVPLNLVVDLGGKFLWVDCE-------------------NHYT---SSTYRPVR 90

Query: 167 CSHRLCDLGTS-----C-QNPKQPCPYTMDYYTENT----SSSGLLVEDILHLIS-GGDN 215
           C    C L  S     C  +PK  C  T     +NT    ++ G L ED+L + S  G N
Sbjct: 91  CPSAQCSLAKSDSCGDCFSSPKPGCNNTCGLIPDNTITHSATRGDLAEDVLSIQSTSGFN 150

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
             +N V +  +  C        L G A  G+ GLG  +I++PS LA A + +  F+ CF 
Sbjct: 151 TGQNVVVSRFLFSCAPTSLLRGLAGGA-SGMAGLGRTKIALPSQLASAFIFKRKFAFCFS 209

Query: 276 KDDSGRIFFGDQGPATQQSTSFLASN 301
             D G I FGD GP      SFLA N
Sbjct: 210 SSD-GVIIFGD-GPY-----SFLADN 228


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 73/289 (25%), Positives = 113/289 (39%), Gaps = 35/289 (12%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP V  L   D  SDL+W+ C  C  C P          +D   + P  SST  +LSC
Sbjct: 96  IGTPPVERLAIADTASDLIWVQCSPCETCFP----------QDTPLFEPHKSSTFANLSC 145

Query: 168 SHRLCDLGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
             + C       C      C YT + Y + +S+ G+L  + +H  S      +       
Sbjct: 146 DSQPCTSSNIYYCPLVGNLCLYT-NTYGDGSSTKGVLCTESIHFGS------QTVTFPKT 198

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRI 282
           I GCG      +       G++GLG G +S+ S L     I + FS C   F    + ++
Sbjct: 199 IFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKL 256

Query: 283 FFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLK-----QTSFKAIVDSGS 334
            FG+    T     ST  +        Y + +    IG   L+      T+   I+D G+
Sbjct: 257 KFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGT 316

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQ-RLPKL 381
             T+L    Y          +  + T  +  YP+  C+ + +    PK+
Sbjct: 317 VLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITFPKI 365


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 71/275 (25%), Positives = 112/275 (40%), Gaps = 55/275 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP  +  + LD GS L WI   C +  P +AS+            PS SST   L 
Sbjct: 79  LPIGTPPQTQPMVLDTGSQLSWI--QCHKKQPPTASF-----------DPSLSSTFSILP 125

Query: 167 CSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           C+H LC        L TSC   +  C Y+  +Y + T + G LV +            ++
Sbjct: 126 CTHPLCKPRIPDFTLPTSCDQNRL-CHYSY-FYADGTYAEGNLVREKFTFS-------RS 176

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-------------VPSLLAKAGLI 266
                +I+GC  + +        P G++G+ LG +S             VP    + G  
Sbjct: 177 VSTPPLILGCATESTD-------PRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFT 229

Query: 267 -RNSFSMCFDKDDSGRIFFGDQGPATQQSTSF--LASNGKYITYIIGVETCCIGSSCLKQ 323
              SF +  +    G  + G    + Q+  +F  LA     +   I  +   I  +  + 
Sbjct: 230 PTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRA 289

Query: 324 T---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
               S + ++DSGS FT+L  E Y+ + A+  R V
Sbjct: 290 DAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAV 324


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 141/333 (42%), Gaps = 62/333 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + +D GSDL W+ C  C+ C           ++    + P+ASS+ ++L+C
Sbjct: 152 VGTPPRRFQMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNLTC 201

Query: 168 SHRLCD--------LGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNAL 217
               C            +C+ P + PCPY   Y  ++ S+  L +E   ++L + G    
Sbjct: 202 GDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPG---- 257

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--- 274
            +S    V+ GCG +  G +        L+GLG G +S  S L +A    ++FS C    
Sbjct: 258 ASSRVDGVVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-RAVYGGHTFSYCLVDH 313

Query: 275 DKDDSGRIFFGDQ----------------GPATQQSTSFLASNGKYITYIIGVETCCIGS 318
             D + ++ FG+                  PA+  + +F     +    ++G E   I S
Sbjct: 314 GSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYV--RLTGVLVGGELLNISS 371

Query: 319 SCLKQT---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS 374
                +   S   I+DSG++ ++  +  Y+ I   F  +++ +      +P    CY  S
Sbjct: 372 DTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVS 431

Query: 375 SQRLPKLPSVKLM--------FPQNNSFVVNNP 399
               P++P + L+        FP  N F+  +P
Sbjct: 432 GVERPEVPELSLLFADGAVWDFPAENYFIRLDP 464


>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
 gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
          Length = 453

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 107/263 (40%), Gaps = 34/263 (12%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 156
           NDF +L    + +GTP V +LV +D GS L W     V+C P +   +    +    + P
Sbjct: 49  NDFAFL--IPVKLGTPAVQYLVTMDTGSSLSW-----VQCRPCTIKCHVQPAKVGPIFDP 101

Query: 157 SASSTSKHLSCSHRLCD-LGTS-------CQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
           S SST +H+ CS  +C  LG +       C   +  C YTM Y      S G  V D L 
Sbjct: 102 SNSSTFRHVGCSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRL- 160

Query: 209 LISGGDNALKNSVQASVIIGCGM-KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
           ++ GG+        A+ + GC M  Q   + +     G+ GLG    S   +     L  
Sbjct: 161 VLGGGETTRTTLSLANFVFGCSMDTQYSTHKEA----GIFGLGTSNYSFEQIAPL--LSY 214

Query: 268 NSFSMCFDKDDS--GRIFFGDQG---------PATQQSTSFLASNGKYITYIIGVETCCI 316
            +FS C   D++  G +  G            P T +    +   G  +T    V +   
Sbjct: 215 KAFSYCLPSDEAHQGYLSIGPDSSGGVPTSMFPGTPRPVYSIGMTGLTVTVNGEVRSLVS 274

Query: 317 GSSCLKQTSFKAIVDSGSSFTFL 339
           GS      S   +VDSG+  T L
Sbjct: 275 GSGSSPSPSSLMVVDSGAKLTLL 297


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 130/317 (41%), Gaps = 40/317 (12%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G   Y   + IGTP V+ ++++D GSD+ W     V+CAP +A   +S    L 
Sbjct: 119 SSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSW-----VQCAPCAAQSCSSQKDKL- 172

Query: 153 EYSPSASSTSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
            + P+ S+T    SC    C    D G  C   K  C Y +  Y + ++++G    D L 
Sbjct: 173 -FDPAMSATYSAFSCGSAQCAQLGDEGNGCL--KSQCQYIVK-YGDGSNTAGTYGSDTLS 228

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           L S   +A+K     S   GC  + +G  G LDG+       +GLG  +   +   A   
Sbjct: 229 LTS--SDAVK-----SFQFGCSHRAAGFVGELDGL-------MGLGGDTESLVSQTAATY 274

Query: 267 RNSFSMCFDKDDS---GRIFFGDQGPATQQSTSFLASNGKYITYIIGV--ETCCIGSSCL 321
             +FS C     S   G +  G  G A+    S        +    GV  +   +  + L
Sbjct: 275 GKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTML 334

Query: 322 KQT----SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQR 377
                  S  ++VDSG+  T LP   Y+ +   F +++    ++        C+  S   
Sbjct: 335 NVPASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFN 394

Query: 378 LPKLPSVKLMFPQNNSF 394
              +P+V L F +  + 
Sbjct: 395 TITVPTVTLTFSRGAAM 411


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 151/358 (42%), Gaps = 52/358 (14%)

Query: 56  KKSFEYYQVLLS--SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPN 113
           ++   Y+   L+  SD      K GP+   + P +   +M  GN     +Y  + +G+P 
Sbjct: 60  EERIRYFHSRLAKNSDANASSKKVGPKLAGI-PLKSGLSMGSGN-----YYVKMGLGSPT 113

Query: 114 VSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
             + + +D GS   W+     +C P   + Y  +  D   ++PSAS T K + CS   C 
Sbjct: 114 KYYTMIVDTGSSFSWL-----QCQP--CTIYCHIQED-PVFNPSASKTYKTVPCSSSQCS 165

Query: 174 LGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
                     +C      C Y    Y +++ S G L +D+L L         +   +S +
Sbjct: 166 SLKSATLNEPTCSKQSNACVYKAS-YGDSSFSLGYLSQDVLTLT-------PSQTLSSFV 217

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRI 282
            GCG    G  L G   DG+IGL   E+S+ S L  +G   N+FS C    F   +S + 
Sbjct: 218 YGCGQDNQG--LFGRT-DGIIGLANNELSMLSQL--SGKYGNAFSYCLPTSFSTPNSPKE 272

Query: 283 FFGDQG------PATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDS 332
            F   G       ++ + T  L +      Y I +E+  +    L    +S+K   I+DS
Sbjct: 273 GFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDS 332

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKL-PSVKLMF 388
           G+  T LP  VY T+   +   ++       G      C+K S   + ++ P ++++F
Sbjct: 333 GTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIF 390


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 70/255 (27%), Positives = 107/255 (41%), Gaps = 49/255 (19%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           ++   + +GTP    +  +D GSDL+W  C  C  C    A  ++          PS SS
Sbjct: 60  IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFD----------PSKSS 109

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           T K   C       G S       CPY + Y  E+ S+  L  E +    + G+      
Sbjct: 110 TFKEKRCH------GNS-------CPYEIIYADESYSTGILATETVTIQSTSGEPF---- 152

Query: 221 VQASVIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSL--LAKAGLIRNSFSMCFDK 276
           V A   IGCG+  S     G A    G++GL +G  S+ S   L   GLI    S CF  
Sbjct: 153 VMAETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLI----SYCFSS 208

Query: 277 DDSGRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-- 328
             + +I FG      G  T  +  F+  +  +  Y + ++   +G   ++   T F A  
Sbjct: 209 QGTSKINFGTNAVVAGDGTVAADMFIKKDQPF--YYLNLDAVSVGDKRIETLGTPFHAQD 266

Query: 329 ---IVDSGSSFTFLP 340
               +DSG+++T+LP
Sbjct: 267 GNIFIDSGTTYTYLP 281


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 132/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQIVEIDTGSSASWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 91/350 (26%), Positives = 143/350 (40%), Gaps = 65/350 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP    L+ LD GSD++W+ C  C RC           D+    + P AS +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRC----------YDQSGQMFDPRASHS 196

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
              + C+  LC   D G  C   ++ C Y +  Y + + ++G    + L   SG      
Sbjct: 197 YGAVDCAAPLCRRLDSG-GCDLRRKACLYQV-AYGDGSVTAGDFATETLTFASG------ 248

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---- 274
            +    V +GCG    G +   VA  GL+GLG G +S PS +++      SFS C     
Sbjct: 249 -ARVPRVALGCGHDNEGLF---VAAAGLLGLGRGSLSFPSQISR--RFGRSFSYCLVDRT 302

Query: 275 -----DKDDSGRIFFGDQ--GPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSF 326
                    S  + FG    GP+   S + +  N +  T Y + +    +G + +   + 
Sbjct: 303 SSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAV 362

Query: 327 K------------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKS 373
                         IVDSG+S T L +  Y  +   F         S  G+  +  CY  
Sbjct: 363 SDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDL 422

Query: 374 SSQRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
           S  ++ K+P+V + F         P+N    V++     F   GT  GVS
Sbjct: 423 SGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS 472


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 78/303 (25%), Positives = 128/303 (42%), Gaps = 81/303 (26%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +  GTP  +F   LD GS L+W+PC     C +C   S       + +  ++ P  S +S
Sbjct: 220 LKFGTPPQTFPFVLDTGSSLVWLPCYSHYLCSKCNSFS-------NNNTPKFIPKDSFSS 272

Query: 163 KHLSCSHRLC------DLGTSC-----------QNPKQPCP-YTMDYYTENTSSSGLLVE 204
           K + C +  C      D+ + C            N  Q CP YT+ Y     S++G L+ 
Sbjct: 273 KFVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGL--GSTAGFLLS 330

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           + L+  +      KN   +  ++GC +      +    P G+ G G GE S+P   A+  
Sbjct: 331 ENLNFPA------KNV--SDFLVGCSV------VSVYQPGGIAGFGRGEESLP---AQMN 373

Query: 265 LIRNSFSMC-----FDK--DDSGRIFFGDQGPATQQS-----TSFLASN-------GKYI 305
           L R  FS C     FD+  ++S  +         +++     T+FL +        G Y 
Sbjct: 374 LTR--FSYCLLSHQFDESPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAY- 430

Query: 306 TYIIGVETCCIGSSCLKQTSFKA----------IVDSGSSFTFLPKEVYETIAAEFDRQV 355
            Y I +    +G   ++                IVDSGS+ TF+ + +++ +A EF +QV
Sbjct: 431 -YYITLRKIVVGEKRVRVPRRMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQV 489

Query: 356 NDT 358
           N T
Sbjct: 490 NYT 492


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 133/323 (41%), Gaps = 63/323 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           +  GTP   F + LD GS + W  C  CVRC   S  +++          PSAS T    
Sbjct: 166 VAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFD----------PSASLTYSLG 215

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS-VQAS 224
           SC      + ++  N      Y M Y  ++TS      + +          L++S V   
Sbjct: 216 SC------IPSTVGN-----TYNMTYGDKSTSVGNYGCDTM---------TLEHSDVFPK 255

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIF 283
              GCG    G +  G   DG++GLG G++S  S  A     +  FS C  ++DS G + 
Sbjct: 256 FQFGCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLL 311

Query: 284 FGDQGPATQQS-------------TSFLASNGKYITYI----IGVETCCIGSSCLKQTSF 326
           FG++  AT QS             TS L  +G Y   +    +G +   I SS     S 
Sbjct: 312 FGEK--ATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF--ASP 367

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS----FEGYPWKCCYKSSSQRLPKLP 382
             I+DSG+  T LP+  Y  + A F + +     S     +G     CY  S ++   LP
Sbjct: 368 GTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLP 427

Query: 383 SVKLMFPQNNSFVVNNPVFVIYG 405
            + L F +     +N    VI+G
Sbjct: 428 EIVLHFGEGADVRLNGKR-VIWG 449


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 114/283 (40%), Gaps = 46/283 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC----DCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           + +GTP     V LD GS L W+PC     C  C+        S    +  + P  SS+S
Sbjct: 95  VSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCS-----SSPSAMSAMAVFHPKNSSSS 149

Query: 163 KHLSCSHRLCD---------LGTSCQNPK-QPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           + + C +  C           G++  N     CP  +  Y    S+SGLL+ D L L   
Sbjct: 150 RLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSG-STSGLLISDTLRLSPS 208

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
             ++     + +  IGC +           P GL G G G  SVPS L          S 
Sbjct: 209 SSSSAPAPFR-NFAIGCSIVSV-----HQPPSGLAGFGRGAPSVPSQLKVPKFSYCLLSR 262

Query: 273 CFDKDD--SGRIFFGDQG-PATQQSTSF--------LASNGKY-ITYIIGVETCCIGSSC 320
            FD +   SG +  GD   PA ++ T+          AS   Y + Y + +    +G   
Sbjct: 263 RFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKP 322

Query: 321 LKQTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQV 355
           +   S          AI+DSG++FT+L   V++ +AA  +  V
Sbjct: 323 VNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAV 365


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 97/413 (23%), Positives = 156/413 (37%), Gaps = 76/413 (18%)

Query: 6   LTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVL 65
           L +Y+ + +L     G     FS ++IHR S          +R+    P +  F+     
Sbjct: 15  LCLYINISFLNALDGGG----FSVEIIHRDS----------SRSPYYRPTETQFQRVANA 60

Query: 66  LSSDVQKQKMKTGPQF--------QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFL 117
           L   + +      P            +  SQG   MS              +GTP    L
Sbjct: 61  LRRSINRANHFNKPNLVASTNTAESTVIASQGEYLMSYS------------VGTPPFQIL 108

Query: 118 VALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---D 173
             +D GSD++W+ C  C  C       YN   +    + PS S T K L CS  +C    
Sbjct: 109 GIVDTGSDIIWLQCQPCEDC-------YN---QTTPIFDPSQSKTYKTLPCSSNICQSVQ 158

Query: 174 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-ASVIIGCGMK 232
              SC +    C YT+  Y +N+ S G L  + L L S       +SVQ    +IGCG  
Sbjct: 159 SAASCSSNNDECEYTIT-YGDNSHSQGDLSVETLTLGSTD----GSSVQFPKTVIGCGHN 213

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ 287
             G +      +G   +GLG   V  +   +  I   FS C        + S ++ FGD+
Sbjct: 214 NKGTF----QREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDE 269

Query: 288 GPATQQ---STSFLASNGKYITYIIGVETCCIGSSCL---------KQTSFKAIVDSGSS 335
              + +   ST  +  NG    Y + +E   +G + +                I+DSG++
Sbjct: 270 AVVSGRGTVSTPIVPKNGLGF-YFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTT 328

Query: 336 FTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            T LP++ Y  + +     +            + CY+++S     +P +   F
Sbjct: 329 LTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAHF 381


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/348 (25%), Positives = 142/348 (40%), Gaps = 61/348 (17%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP  + L+ LD GSD++W+     +CAP    Y  S       + P  S + 
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL-----QCAPCRHCYAQS----GRVFDPRRSRSY 172

Query: 163 KHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
             + C   +C       C   +  C Y +  Y + + ++G    + L    G        
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQV-AYGDGSVTAGDFASETLTFARGA------R 225

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS 279
           VQ  V IGCG    G +   +A  GL+GLG G +S PS +A++     SFS C  D+  S
Sbjct: 226 VQ-RVAIGCGHDNEGLF---IAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSS 279

Query: 280 GR--------IFFGDQGPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTS 325
            R        + FG    A     SF  +  N +    Y  +++G          + Q+ 
Sbjct: 280 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 339

Query: 326 FK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSS 375
            +          I+DSG+S T L + VYE +   F         S  G+  +  CY  S 
Sbjct: 340 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 399

Query: 376 QRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
           +R+ K+P+V +           P+N    V+      F + GT  GVS
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 447


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 135/343 (39%), Gaps = 69/343 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           + IGTP     V +D GSDL W PC     DC+ C     +Y N  +R +  +SPS SS+
Sbjct: 84  LSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECD----NYRN--NRMMASFSPSHSSS 137

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPC-------------------PYTMDYYTENTSSSGLL 202
           S   SC+   C    S  NP  PC                   P     Y      +G L
Sbjct: 138 SHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTL 197

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
             D L +   G N            GC    +  Y +   P G+ G G G +S+PS L  
Sbjct: 198 TRDTLRV--HGRNLGVTQEIPRFCFGC---VASSYRE---PIGIAGFGRGALSLPSQL-- 247

Query: 263 AGLIRNSFSMCF-------DKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVE 312
            G +R  FS CF       + + S  +  GD    ++   Q T  L S      Y +G+E
Sbjct: 248 -GFLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLE 306

Query: 313 TCCIGS-------SCLKQ----TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTIT 360
              +G+       S L++     +   +VDSG+++T LP+  Y  + +     +N    T
Sbjct: 307 AITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRAT 366

Query: 361 SFEGYP-WKCCYKSSSQRLP-----KLPSVKLMFPQNNSFVVN 397
             E    +  CYK   Q         LPS+   F  N S V++
Sbjct: 367 DMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLS 409


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 134/338 (39%), Gaps = 49/338 (14%)

Query: 74  KMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD- 132
           K++ GP      P +   ++  GN     +Y  I +GTP   F + +D GS L W+ C  
Sbjct: 89  KLRGGPSLVSTTPLKSGLSIGSGN-----YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQP 143

Query: 133 CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL-------GTSCQNPKQPC 185
           CV         Y  +  D   ++PS S T K L CS   C            C N    C
Sbjct: 144 CV--------IYCHVQVD-PIFTPSTSKTYKALPCSSSQCSSLKSSTLNAPGCSNATGAC 194

Query: 186 PYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDG 245
            Y    Y + + S G L +D+L L          +  +  + GCG    G  L G +  G
Sbjct: 195 VYKAS-YGDTSFSIGYLSQDVLTLTP------SEAPSSGFVYGCGQDNQG--LFGRS-SG 244

Query: 246 LIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYI 305
           +IGL   +IS+   L+K     N+FS C     S        G  +  ++S  +S  K+ 
Sbjct: 245 IIGLANDKISMLGQLSKK--YGNAFSYCLPSSFSAPNSSSLSGFLSIGASSLTSSPYKFT 302

Query: 306 ----------TYIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEVYETIAAEF 351
                      Y + + T  +    L  ++       I+DSG+  T LP  VY  +   F
Sbjct: 303 PLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVAVYNALKKSF 362

Query: 352 DRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMF 388
              ++       G+     C+K S + +  +P ++++F
Sbjct: 363 VLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIF 400


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 78/326 (23%), Positives = 134/326 (41%), Gaps = 54/326 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + LD GSDL W+ C  C  C   + ++Y+          P  S++ K+++C
Sbjct: 168 VGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYD----------PKTSASFKNITC 217

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           +   C L +S      C++  Q CPY   Y   + ++    VE     ++  +       
Sbjct: 218 NDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYK 277

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 276
             +++ GCG    G +        L+GLG G +S  S L    L  +SFS C      D 
Sbjct: 278 VENMMFGCGHWNRGLFSGASG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDT 332

Query: 277 DDSGRIFFGDQGPATQQS----TSFL--ASNGKYITYIIGVETCCIGSSCLK-------- 322
           + S ++ FG+       +    TSF+    N     Y I +++  +G   L         
Sbjct: 333 NVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNI 392

Query: 323 --QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS--SQR 377
               +   I+DSG++ ++  +  YE I  +F  ++ +    F  +P    C+  S   + 
Sbjct: 393 SPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEEN 452

Query: 378 LPKLPSVKLM--------FPQNNSFV 395
              LP + +         FP  NSF+
Sbjct: 453 NIHLPELGIAFADGAVWNFPAENSFI 478


>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
 gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
          Length = 107

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 33/79 (41%), Positives = 48/79 (60%), Gaps = 1/79 (1%)

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGR 281
            +V   C    +G +LDG A +GL+GLG  ++SV  +L  +GL+  +SFSMCF +D  GR
Sbjct: 12  GAVAKACRCGPTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGR 71

Query: 282 IFFGDQGPATQQSTSFLAS 300
           I FGD G   Q    F+++
Sbjct: 72  INFGDAGIRGQGEMPFIST 90


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 80/348 (22%), Positives = 143/348 (41%), Gaps = 52/348 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP   F++  D GSDL W+ C        S+S   +       + P+ S + 
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSP----SSSSSSPAASPPQRVFRPAGSKSW 159

Query: 163 KHLSCSHRLCDLG-----TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHL-ISGGDN 215
             L C    C         +C +P  PC Y  DY Y +N+S+ G++  D   + +SG D 
Sbjct: 160 SPLPCDSDTCKSYVPFSLANCSSPPDPCSY--DYRYKDNSSARGVVGLDSATVSLSGNDG 217

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF- 274
             K  +Q  V++GC     G      + DG++ LG   IS  S    A      FS C  
Sbjct: 218 TRKAKLQ-EVVLGCTTSYDGQSFK--SSDGVLSLGNSNISFAS--RAASRFGGRFSYCLV 272

Query: 275 ----DKDDSGRIFFGD------QGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK- 322
                ++ +  + FG+         +++++   L  + +    Y + V+   +    L+ 
Sbjct: 273 DHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEI 332

Query: 323 -------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
                  + +  AI+DSG+S T L    Y+ +     +Q    +      P++ CY  + 
Sbjct: 333 LPDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAG-VPRVNMDPFEYCYNWTG 391

Query: 376 QRLPKLPSVKLMF-------PQNNSFVVNNP-----VFVIYGTQVGVS 411
               ++P ++L F       P   S+V++       + V+ G   GVS
Sbjct: 392 VSA-EIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVS 438


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 133/335 (39%), Gaps = 56/335 (16%)

Query: 84  LFPSQGSKTMSLGNDF---GWLHYTWIDIGTPNVSFLVALDAGSDLLWI---PCD-CVR- 135
           +F ++     S  ND    G  ++  + IGTP V  +V  D GSDL W+   PCD C R 
Sbjct: 72  VFKTKAVDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQ 131

Query: 136 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL----GTSCQNPKQPCPYTMDY 191
            +PL              + PS SS+ +H+ C  R C+       +C      C Y   Y
Sbjct: 132 KSPL--------------FDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSY 177

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 251
             ++ ++  L  E      + G  + +    + ++ GCG   +GG  D +    +   G 
Sbjct: 178 GDKSYTNGNLATEKF----TIGSTSSRPVHLSPIVFGCGTG-NGGTFDELGSGIVGLGGG 232

Query: 252 GEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQ----GPATQQSTSFLASNG 302
               V  L   + +I+  FS C        + + +I FG      GP  Q  ++ L S  
Sbjct: 233 ALSLVSQL---SSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGP--QVVSTPLVSKQ 287

Query: 303 KYITYIIGVETCCIGSSCLKQTSF---------KAIVDSGSSFTFLPKEVYETIAAEFDR 353
               Y + +E   +G+  L  T+            I+DSG++ TFL  E +  +    + 
Sbjct: 288 PDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEE 347

Query: 354 QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            V     S     +  C++S+      LP + + F
Sbjct: 348 TVKAERVSDPRGLFSVCFRSAGDI--DLPVIAVHF 380


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 86/325 (26%), Positives = 132/325 (40%), Gaps = 63/325 (19%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCA--PLSASYYNSLDRDLNEYSP--- 156
           ++  I +GTP  S L+  D GSDL+W+ C  C  C+  P S+++   L R  + +SP   
Sbjct: 88  YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAF---LPRHSSSFSPFHC 144

Query: 157 -----SASSTSKHLSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL- 209
                     + H  C+H RL            PC + +  Y + + SSG   ++   L 
Sbjct: 145 FDPHCRLLPHAPHHLCNHTRL----------HSPCRF-LYSYADGSLSSGFFSKETTTLK 193

Query: 210 -ISGGDNALKNSVQASVIIGCGMKQSGGYLDGV---APDGLIGLGLGEISVPSLLAKAGL 265
            +SG +  LK      +  GCG + SG  + G       G++GLG G IS  S L +   
Sbjct: 194 SLSGSEIHLKG-----LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR-- 246

Query: 266 IRNSFSMC-----FDKDDSGRIFFGDQ------GPATQQSTSFLASNGKYIT-YIIGVET 313
             N FS C          +  +  G          AT+ S + L  N    T Y I + +
Sbjct: 247 FGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHS 306

Query: 314 CCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN--DTITS 361
             I    L          +Q +   +VDSG++ T+L K  YE +     R+V   +    
Sbjct: 307 ITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAEL 366

Query: 362 FEGYPWKCCYKSSSQRLPKLPSVKL 386
             G+   C   S   R P LP ++ 
Sbjct: 367 TPGFDL-CVNASGESRRPSLPRLRF 390


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 85/280 (30%), Positives = 122/280 (43%), Gaps = 45/280 (16%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           ++GTP  +FL+ALD  +D  WIPC+ CV C   S++ +NS+           S+T K L 
Sbjct: 95  NVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLG 141

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C    C      Q P   C  +    T NT+  G     IL  ++    AL   +     
Sbjct: 142 CDAPQCK-----QVPNPTCGGST--CTWNTTYGG---STILSNLTRDTIALSTDIVPGYT 191

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRI 282
            GC  K +G     V P GL+GLG G +S   L     L +++FS C       + SG +
Sbjct: 192 FGCIQKTTG---SSVPPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTL 246

Query: 283 FFGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVD 331
             G  G   +  T+ L  N +     Y+  I   +G +   I +S L     T    I D
Sbjct: 247 RLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFD 306

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
           SG+ FT L   VY  +  EF ++V + I S  G  +  CY
Sbjct: 307 SGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG-FDTCY 345


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/348 (25%), Positives = 142/348 (40%), Gaps = 61/348 (17%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP  + L+ LD GSD++W+     +CAP    Y  S       + P  S + 
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWL-----QCAPCRHCYAQS----GRVFDPRRSRSY 178

Query: 163 KHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
             + C   +C       C   +  C Y +  Y + + ++G    + L    G        
Sbjct: 179 AAVDCVAPICRRLDSAGCDRRRNSCLYQV-AYGDGSVTAGDFASETLTFARGA------R 231

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS 279
           VQ  V IGCG    G +   +A  GL+GLG G +S PS +A++     SFS C  D+  S
Sbjct: 232 VQ-RVAIGCGHDNEGLF---IAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSS 285

Query: 280 GR--------IFFGDQGPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTS 325
            R        + FG    A     SF  +  N +    Y  +++G          + Q+ 
Sbjct: 286 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 345

Query: 326 FK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSS 375
            +          I+DSG+S T L + VYE +   F         S  G+  +  CY  S 
Sbjct: 346 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 405

Query: 376 QRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
           +R+ K+P+V +           P+N    V+      F + GT  GVS
Sbjct: 406 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 453


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 100/248 (40%), Gaps = 49/248 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + +GTP       +D GS++ W  C  CV C   +A  ++          PS SST K  
Sbjct: 69  LQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFD----------PSKSSTFK-- 116

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKNSVQAS 224
               + CD           CPY +DY+    +   L  E I LH  SG     +  V   
Sbjct: 117 ---EKRCD--------GHSCPYEVDYFDHTYTMGTLATETITLHSTSG-----EPFVMPE 160

Query: 225 VIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRI 282
            IIGCG   S        P   G++GL  G  S+  +    G      S CF    + +I
Sbjct: 161 TIIGCGHNNS-----WFKPSFSGMVGLNWGPSSL--ITQMGGEYPGLMSYCFSGQGTSKI 213

Query: 283 FFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-----IVDS 332
            FG           ST+   +  K   Y + ++   +G++ ++   T+F A     ++DS
Sbjct: 214 NFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDS 273

Query: 333 GSSFTFLP 340
           G++ T+ P
Sbjct: 274 GTTLTYFP 281


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 85/280 (30%), Positives = 122/280 (43%), Gaps = 45/280 (16%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           ++GTP  +FL+ALD  +D  WIPC+ CV C   S++ +NS+           S+T K L 
Sbjct: 95  NVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSV----------TSTTFKTLG 141

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C    C      Q P   C  +    T NT+  G     IL  ++    AL   +     
Sbjct: 142 CDAPQCK-----QVPNPTCGGST--CTWNTTYGG---STILSNLTRDTIALSTDIVPGYT 191

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGRI 282
            GC  K +G     V P GL+GLG G +S   L     L +++FS C       + SG +
Sbjct: 192 FGCIQKTTG---SSVPPQGLLGLGRGPLSF--LSQTQDLYKSTFSYCLPSFRTLNFSGTL 246

Query: 283 FFGDQGPATQQSTSFLASNGK-----YITYI---IGVETCCIGSSCLK---QTSFKAIVD 331
             G  G   +  T+ L  N +     Y+  I   +G +   I +S L     T    I D
Sbjct: 247 RLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFD 306

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCY 371
           SG+ FT L   VY  +  EF ++V + I S  G  +  CY
Sbjct: 307 SGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG-FDTCY 345


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 135/323 (41%), Gaps = 51/323 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + +D GSDL W+ C  C+ C   S   ++          P+AS + ++++C
Sbjct: 155 LGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFD----------PAASISYRNVTC 204

Query: 168 SHRLCDLGT--------SCQNPK-QPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNAL 217
               C L +         C+ P+  PCPY   Y  ++ ++  L +E   ++L   G   +
Sbjct: 205 GDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRV 264

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKD 277
                  V  GCG +  G +        L+GLG G +S  S L +     ++FS C  + 
Sbjct: 265 DG-----VAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-RGVYGGHAFSYCLVEH 315

Query: 278 DSG---RIFFGDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCLKQTSFK--- 327
            S    +I FG             T+F  +      Y + +++  +G   +  +S     
Sbjct: 316 GSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSA 375

Query: 328 --AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSV 384
              I+DSG++ ++ P+  Y+ I   F  +++ +     G+P    CY  S     ++P +
Sbjct: 376 GGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPEL 435

Query: 385 KLM--------FPQNNSFVVNNP 399
            L+        FP  N F+   P
Sbjct: 436 SLVFADGAAWEFPAENYFIRLEP 458


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 130/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTTWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVVNNP-VFVIYGTQ 407
            L F     F + +  VFV    Q
Sbjct: 279 SLHFDDGARFDLGSRGVFVERSVQ 302


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 131/332 (39%), Gaps = 44/332 (13%)

Query: 73  QKMKTGPQFQMLFPSQGSKTMSL----GNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLL 127
            +M  GP       S  SK +SL    G   G  +Y   + +GTP    LV  D GSDL 
Sbjct: 155 HRMTAGPW--TAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLS 212

Query: 128 WIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP 186
           W+ C  C  C       Y   D     + PS S+T   + C  + C    +C + K  C 
Sbjct: 213 WVQCKPCNNC-------YKQHD---PLFDPSQSTTYSAVPCGAQECLDSGTCSSGK--CR 260

Query: 187 YTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGL 246
           Y +  Y + + + G L  D L L    D           + GCG   +G  L G A DGL
Sbjct: 261 YEV-VYGDMSQTDGNLARDTLTLGPSSDQL------QGFVFGCGDDDTG--LFGRA-DGL 310

Query: 247 IGLGLGEISVPSLLAKAGLIRNSFSMCFDKD--DSGRIFFGD-QGPATQQSTSFLASNGK 303
            GLG   +S+ S    A      FS C        G +  G    P   Q T+ +  +  
Sbjct: 311 FGLGRDRVSLAS--QAAARYGAGFSYCLPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDT 368

Query: 304 ---YITYIIGVETCCIGSSC-LKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFDRQVN 356
              Y   ++G++    G +  +    FKA   ++DSG+  T LP   Y  + + F   + 
Sbjct: 369 PSFYYLDLVGIKVA--GRTVRVAPAVFKAPGTVIDSGTVITRLPSRAYSALRSSFAGFMR 426

Query: 357 DTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
               +        CY  + +   ++PSV L+F
Sbjct: 427 RYKRAPALSILDTCYDFTGRTKVQIPSVALLF 458


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 110/272 (40%), Gaps = 46/272 (16%)

Query: 107 IDIGTPNVSFLV-ALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           + IG P    +V  LD GSD++W  C+ C  C            + L  +  +AS+T + 
Sbjct: 96  LSIGAPRSQPVVLTLDTGSDVVWTQCEPCAEC----------FTQPLPRFDTAASNTVRS 145

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           ++CS  LC+  +        C Y +  Y + + S G  + D       G    K +V   
Sbjct: 146 VACSDPLCNAHSEHGCFLHGCTY-VSGYGDGSLSFGHFLRDSF-TFDDGKGGGKVTV-PD 202

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGR 281
           +  GCGM  +G +L      G+ G G G +S+PS L     +R  FS CF    +  S  
Sbjct: 203 IGFGCGMYNAGRFLQ--TETGIAGFGRGPLSLPSQLK----VRQ-FSYCFTTRFEAKSSP 255

Query: 282 IFFGDQGPATQQ------STSFLAS------NGKYITYIIGVETCCIGSSCLKQTSFKA- 328
           +F G  G           ST F+ S      N  Y+    GV    +G + L     KA 
Sbjct: 256 VFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVT---VGKTRLPVPEIKAD 312

Query: 329 -----IVDSGSSFTFLPKEVYETIAAEFDRQV 355
                 +DSG+  T  P  V+  + + F  Q 
Sbjct: 313 GSGATFIDSGTDITTFPDAVFRQLKSAFIAQA 344


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 91/207 (43%), Gaps = 32/207 (15%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G  +   + IGTP   F  A+D  SDL+W  C  C  C       Y+ +D   N   P  
Sbjct: 86  GGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGC-------YHQVDPMFN---PRV 135

Query: 159 SSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           SST   L CS   C   D+     +  + C YT   Y+ N ++ G L  D L +   G++
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYT-YSGNATTEGTLAVDKLVI---GED 191

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           A +      V  GC    +GG     A  G++GLG G +S+ S L+    +R  F+ C  
Sbjct: 192 AFRG-----VAFGCSTSSTGGAPPPQA-SGVVGLGRGPLSLVSQLS----VRR-FAYCLP 240

Query: 276 KDDS---GRIFFGDQGPATQQSTSFLA 299
              S   G++  G    A + +T+ +A
Sbjct: 241 PPASRIPGKLVLGADADAARNATNRIA 267


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 116/272 (42%), Gaps = 44/272 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IG+P  SF   +D GSDL+W  C  C +C           D+    + P  SS+   +
Sbjct: 115 LAIGSPPRSFSAIMDTGSDLIWTQCKPCQQC----------FDQSTPIFDPKQSSSFYKI 164

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           SCS  LC    +       C Y +  Y +++S+ G+L  +       GD+         +
Sbjct: 165 SCSSELCGALPTSTCSSDGCEY-LYTYGDSSSTQGVLAFETFTF---GDSTEDQISIPGL 220

Query: 226 IIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR--- 281
             GCG   +G G+  G    GL+GLG G +S+ S L +       F+ C    D  +   
Sbjct: 221 GFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKE-----QKFAYCLTAIDDSKPSS 272

Query: 282 IFFGDQGPAT-------QQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK----- 327
           +  G     T        ++T  + +  +   Y + ++   +G + L   +++F+     
Sbjct: 273 LLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDG 332

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
               I+DSG++ T++    + ++  EF  Q+N
Sbjct: 333 SGGVIIDSGTTITYVENSAFTSLKNEFIAQMN 364


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 111/276 (40%), Gaps = 52/276 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +G+P+   L+ALD  +D  W       C+P      +SL      ++P+ SS+   L CS
Sbjct: 85  LGSPSQQLLLALDTSADATW-----AHCSPCGTCPSSSL------FAPANSSSYASLPCS 133

Query: 169 HRLCDL--GTSCQNPK-----QPCPYTMDYYT-----ENTSSSGLLVEDILHLISGGDNA 216
              C L  G +C  P+      P P T+          + S    L  D L L   G +A
Sbjct: 134 SSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRL---GKDA 190

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFD 275
           + N        GC +    G    +   GL+GLG G +   +LL++AG + N  FS C  
Sbjct: 191 IPN-----YTFGC-VSSVTGPTTNMPRQGLLGLGRGPM---ALLSQAGSLYNGVFSYCLP 241

Query: 276 K------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------- 322
                    S R+  G   P + + T  L +  +   Y + V    +G + +K       
Sbjct: 242 SYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFA 301

Query: 323 ---QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
               T    +VDSG+  T     VY  +  EF RQV
Sbjct: 302 FDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQV 337


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 91/207 (43%), Gaps = 32/207 (15%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G  +   + IGTP   F  A+D  SDL+W  C  C  C       Y+ +D   N   P  
Sbjct: 86  GGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGC-------YHQVDPMFN---PRV 135

Query: 159 SSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           SST   L CS   C   D+     +  + C YT   Y+ N ++ G L  D L +   G++
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYT-YSGNATTEGTLAVDKLVI---GED 191

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           A +      V  GC    +GG     A  G++GLG G +S+ S L+    +R  F+ C  
Sbjct: 192 AFRG-----VAFGCSTSSTGGAPPPQA-SGVVGLGRGPLSLVSQLS----VRR-FAYCLP 240

Query: 276 KDDS---GRIFFGDQGPATQQSTSFLA 299
              S   G++  G    A + +T+ +A
Sbjct: 241 PPASRIPGKLVLGADADAARNATNRIA 267


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 111/276 (40%), Gaps = 52/276 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +G+P+   L+ALD  +D  W       C+P      +SL      ++P+ SS+   L CS
Sbjct: 87  LGSPSQQLLLALDTSADATW-----AHCSPCGTCPSSSL------FAPANSSSYASLPCS 135

Query: 169 HRLCDL--GTSCQNPK-----QPCPYTMDYYT-----ENTSSSGLLVEDILHLISGGDNA 216
              C L  G +C  P+      P P T+          + S    L  D L L   G +A
Sbjct: 136 SSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRL---GKDA 192

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFD 275
           + N        GC +    G    +   GL+GLG G +   +LL++AG + N  FS C  
Sbjct: 193 IPN-----YTFGC-VSSVTGPTTNMPRQGLLGLGRGPM---ALLSQAGSLYNGVFSYCLP 243

Query: 276 K------DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK------- 322
                    S R+  G   P + + T  L +  +   Y + V    +G + +K       
Sbjct: 244 SYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFA 303

Query: 323 ---QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
               T    +VDSG+  T     VY  +  EF RQV
Sbjct: 304 FDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQV 339


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 83/306 (27%), Positives = 124/306 (40%), Gaps = 52/306 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP   F + +D+GSDLLW     V+C+P    Y     +D   Y PS SST   + C 
Sbjct: 70  LGTPPQKFSLIVDSGSDLLW-----VQCSPCRQCY----AQDSPLYVPSNSSTFSPVPCL 120

Query: 169 HRLCDL-----GTSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
              C L     G  C   + P     +Y Y + +SS G+            ++A  + V+
Sbjct: 121 SSDCLLIPATEGFPCDF-RYPGACAYEYLYADTSSSKGVFAY---------ESATVDGVR 170

Query: 223 AS-VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DK 276
              V  GCG    G +    A  G++GLG G +S  S +  A    N F+ C        
Sbjct: 171 IDKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPT 225

Query: 277 DDSGRIFFGDQGPATQQSTSF--LASNGKYIT-YIIGVETCCIGSSCLKQTSFK------ 327
             S  + FGD+  +T     +  + SN K  T Y + +E   +G   L  +         
Sbjct: 226 SVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLL 285

Query: 328 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLPKLP 382
               +I DSG++ T+     Y  I A FD  V+     S +G     C + +    P  P
Sbjct: 286 GNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQG--LDLCVELTGVDQPSFP 343

Query: 383 SVKLMF 388
           S  + F
Sbjct: 344 SFTIEF 349


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 132/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 82/355 (23%), Positives = 143/355 (40%), Gaps = 38/355 (10%)

Query: 65  LLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFG-WLHYTWIDIGTPNVSFLVALDAG 123
           L   DVQ           +L P+  +  ++ G   G   +Y  + +G+P   + + LD G
Sbjct: 81  LRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKLGLGSPPKYYTMILDTG 140

Query: 124 SDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQNP 181
           S L W+     +C P     ++ +D     + PSAS+T + L CS   C L    +  +P
Sbjct: 141 SSLSWL-----QCKPCVVYCHSQVD---PLFEPSASNTYRPLYCSSSECSLLKAATLNDP 192

Query: 182 ----KQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 237
                  C YT   Y + + S G L  D+L L         +    S   GCG    G  
Sbjct: 193 LCTASGVCVYTAS-YGDASYSMGYLSRDLLTLT-------PSQTLPSFTYGCGQDNEG-- 242

Query: 238 LDGVAPDGLIGLGLGEISVPSLLA-KAGLIRNSFSMCFDKDDS---GRIFFGDQGPATQQ 293
           L G A  G++GL   ++S+ + L+ K G    +FS C     S   G +  G   P++ +
Sbjct: 243 LFGKAA-GIVGLARDKLSMLAQLSPKYGY---AFSYCLPTSTSSGGGFLSIGKISPSSYK 298

Query: 294 STSFLASNGKYITYIIGVETCCIGSSCLKQTS----FKAIVDSGSSFTFLPKEVYETIAA 349
            T  + ++     Y + +    +    +   +       I+DSG+  T LP  +Y  +  
Sbjct: 299 FTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLPISIYAALRE 358

Query: 350 EFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
            F + ++        Y     C+K S + +   P ++++F       +  P  +I
Sbjct: 359 AFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILI 413


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 131/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 65/270 (24%), Positives = 115/270 (42%), Gaps = 44/270 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IG+P  SF   +D GSDL+W  C  C +C           D+    + P  SS+   +SC
Sbjct: 372 IGSPPRSFSAIMDTGSDLIWTQCKPCQQC----------FDQSTPIFDPKQSSSFYKISC 421

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
           S  LC    +       C Y +  Y +++S+ G+L        + GD+         +  
Sbjct: 422 SSELCGALPTSTCSSDGCEY-LYTYGDSSSTQGVLA---FETFTFGDSTEDQISIPGLGF 477

Query: 228 GCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR---IF 283
           GCG   +G G+  G    GL+GLG G +S+ S L +       F+ C    D  +   + 
Sbjct: 478 GCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKE-----QKFAYCLTAIDDSKPSSLL 529

Query: 284 FGDQGPAT-------QQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK------- 327
            G     T        ++T  + +  +   Y + ++   +G + L   +++F+       
Sbjct: 530 LGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSG 589

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
             I+DSG++ T++    + ++  EF  Q+N
Sbjct: 590 GVIIDSGTTITYVENSAFTSLKNEFIAQMN 619


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 131/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSASWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 132/315 (41%), Gaps = 59/315 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE-YSPSASSTSKHL 165
           + +GTP  +  + LD GS+L W+ C+  +      ++  + D + +  YSP   S+   L
Sbjct: 89  LTVGTPPQNVSMVLDTGSELSWLRCNKTQ------TFQTTFDPNRSSSYSPVPCSS---L 139

Query: 166 SCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           +C+ R  D  +  SC +  Q C + +  Y + +SS G L  D  ++         NS   
Sbjct: 140 TCTDRTRDFPIPASCDS-NQLC-HAILSYADASSSEGNLASDTFYI--------GNSDMP 189

Query: 224 SVIIGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR 281
             I GC     S    +     GL+G+  G +S  S +         FS C  D D SG 
Sbjct: 190 GTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFP-----KFSYCISDSDFSGV 244

Query: 282 IFFGDQG----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF--- 326
           +  GD            P  Q ST     +   + Y + +E   + S  L   ++ F   
Sbjct: 245 LLLGDANFSWLMPLNYTPLIQISTPLPYFD--RVAYTVQLEGIKVSSKLLPLPKSVFVPD 302

Query: 327 -----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE--GYPWK----CCYKS-- 373
                + +VDSG+ FTFL   VY  +  EF  Q +  +   E   Y ++     CY+   
Sbjct: 303 HTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPL 362

Query: 374 SSQRLPKLPSVKLMF 388
           S   LP LP+V LMF
Sbjct: 363 SQTSLPWLPTVSLMF 377


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 60/200 (30%), Positives = 89/200 (44%), Gaps = 32/200 (16%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP   F  A+D  SDL+W  C  C  C       Y+ +D   N   P  SST   L
Sbjct: 93  LGIGTPPYKFTAAIDTASDLIWTQCQPCTGC-------YHQVDPMFN---PRVSSTYAAL 142

Query: 166 SCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            CS   C   D+     +  + C YT   Y+ N ++ G L  D L +   G++A +    
Sbjct: 143 PCSSDTCDELDVHRCGHDDDESCQYTYT-YSGNATTEGTLAVDKLVI---GEDAFRG--- 195

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--- 279
             V  GC    +GG     A  G++GLG G +S+ S L+    +R  F+ C     S   
Sbjct: 196 --VAFGCSTSSTGGAPPPQA-SGVVGLGRGPLSLVSQLS----VRR-FAYCLPPPASRIP 247

Query: 280 GRIFFGDQGPATQQSTSFLA 299
           G++  G    A + +T+ +A
Sbjct: 248 GKLVLGADADAARNATNRIA 267


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 82/299 (27%), Positives = 123/299 (41%), Gaps = 47/299 (15%)

Query: 112 PNVSFLVALDAGSDLLW---IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           P V   V LD+ SD+ W   +PC    C P   S+Y+          PS S +S   SCS
Sbjct: 155 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYD----------PSRSPSSAPFSCS 204

Query: 169 HRLCD-LG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
              C  LG     C N +  C Y +  Y + +S+SG  + D+L L +G  NA+     + 
Sbjct: 205 SPTCTALGPYANGCANNQ--CQYLV-RYPDGSSTSGAYIADLLTLDAG--NAV-----SG 254

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
              GC   + G +    A  G++ LG G  S+  L   A    N+FS C     S   FF
Sbjct: 255 FKFGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNAFSYCIPATASDSGFF 310

Query: 285 GDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDSGSSF 336
               P    S    T  +        Y + + T  +G   L      F A  ++DS ++ 
Sbjct: 311 TLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAI 370

Query: 337 TFLPKEVYETIAAEFDRQVNDTITSFEGYPWK----CCYKSSSQRLPKLPSVKLMFPQN 391
           T LP   Y+ + + F      ++T +   P K     CY  +     +LP + L+F +N
Sbjct: 371 TRLPPTAYQALRSAF----RSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRN 425


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 151/358 (42%), Gaps = 52/358 (14%)

Query: 56  KKSFEYYQVLLS--SDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPN 113
           ++   Y+   L+  SD      K GP+   + P +   +M  GN     +Y  + +G+P 
Sbjct: 60  EERIRYFHSRLAKNSDANASFKKVGPKLAGI-PLKSGLSMGSGN-----YYVKMGLGSPT 113

Query: 114 VSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
             + + +D GS   W+     +C P   + Y  +  D   ++PSAS T K + CS   C 
Sbjct: 114 KYYTMIVDTGSSFSWL-----QCQP--CTIYCHIQED-PVFNPSASKTYKTVPCSSSQCS 165

Query: 174 LGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
                     +C      C Y    Y +++ S G L +D+L L         +   +S +
Sbjct: 166 SLKSATLNEPTCSKQSNACVYKAS-YGDSSFSLGYLSQDVLTLT-------PSQTLSSFV 217

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC----FDKDDSGRI 282
            GCG    G  L G   DG+IGL   E+S+ S L  +G   N+FS C    F   +S + 
Sbjct: 218 YGCGQDNQG--LFGRT-DGIIGLANNELSMLSQL--SGKYGNAFSYCLPTSFSTPNSPKE 272

Query: 283 FFGDQG------PATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA--IVDS 332
            F   G       ++ + T  L +      Y I +E+  +    L    +S+K   I+DS
Sbjct: 273 GFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDS 332

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKL-PSVKLMF 388
           G+  T LP  VY T+   +   ++       G      C+K S   + ++ P ++++F
Sbjct: 333 GTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIF 390


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 84/306 (27%), Positives = 130/306 (42%), Gaps = 37/306 (12%)

Query: 94  SLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDL 151
           +LG     L Y   + +G+P  S  + +D GSD+ W+ C  C +C   +   ++      
Sbjct: 123 TLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD------ 176

Query: 152 NEYSPSASSTSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
               PS+SST    SCS   C      G  C + +  C YT+  Y + +S++G    D L
Sbjct: 177 ----PSSSSTYSPFSCSSAACAQLGQEGNGCSSSQ--CQYTVT-YGDGSSTTGTYSSDTL 229

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
            L   G NA++         GC   +S G+ D    DGL+GLG G  S+ S    AG   
Sbjct: 230 AL---GSNAVRK-----FQFGCSNVES-GFNDQT--DGLMGLGGGAQSLVS--QTAGTFG 276

Query: 268 NSFSMCFDKDDSGRIFFG-DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QT 324
            +FS C     S   F     G +    T  L S+     Y + ++   +G   L    +
Sbjct: 277 AAFSYCLPATSSSSGFLTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTS 336

Query: 325 SFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            F A  I+DSG+  T LP   Y  +++ F   +    ++        C+  S Q    +P
Sbjct: 337 VFSAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIP 396

Query: 383 SVKLMF 388
           +V L+F
Sbjct: 397 TVALVF 402


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 89/348 (25%), Positives = 142/348 (40%), Gaps = 61/348 (17%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP  + L+ LD GSD++W+     +CAP    Y  S       + P  S + 
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL-----QCAPCRHCYAQS----GRVFDPRRSRSY 172

Query: 163 KHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
             + C   +C       C   +  C Y +  Y + + ++G    + L    G        
Sbjct: 173 AAVDCVAPICRRLDSAGCDRRRNSCLYQVA-YGDGSVTAGDFASETLTFARGA------R 225

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS 279
           VQ  V IGCG    G +   +A  GL+GLG G +S P+ +A++     SFS C  D+  S
Sbjct: 226 VQ-RVAIGCGHDNEGLF---IAASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSS 279

Query: 280 GR--------IFFGDQGPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQTS 325
            R        + FG    A     SF  +  N +    Y  +++G          + Q+ 
Sbjct: 280 VRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSD 339

Query: 326 FK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSS 375
            +          I+DSG+S T L + VYE +   F         S  G+  +  CY  S 
Sbjct: 340 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSG 399

Query: 376 QRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
           +R+ K+P+V +           P+N    V+      F + GT  GVS
Sbjct: 400 RRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 447


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 81/354 (22%), Positives = 130/354 (36%), Gaps = 53/354 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           ++    +GTP   FL+  D GSDL W+ C       A  ++S   S       + P  S 
Sbjct: 95  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154

Query: 161 TSKHLSCSHRLCDLG-----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           T   + C+   C        ++C  P  PC Y   Y   + +   +  E     +S   +
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214

Query: 216 ALKNSVQAS----VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
           + KN V+ +    +++GC    +G   +  A DG++ LG   +S  S    A      FS
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFAS--HAASRFGGRFS 270

Query: 272 MCF-----DKDDSGRIFFGDQ-----------GPATQQSTSFLASNGKYITYIIGVETCC 315
            C       ++ +  + FG             GP  +Q+   L S  +   Y + ++   
Sbjct: 271 YCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPF-YDVSIKAIS 329

Query: 316 IGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
           +    LK              IVDSG+S T L K  Y  + A   +++          P+
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLA-RFPRVAMDPF 388

Query: 368 KCCY-------KSSSQRLPKLP-----SVKLMFPQNNSFVVNNPVFVIYGTQVG 409
           + CY       K     LPKL      S +L  P  +  +   P     G Q G
Sbjct: 389 EYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEG 442


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 98/237 (41%), Gaps = 39/237 (16%)

Query: 177 SCQNPK----QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 232
           SC +PK    Q C YT  Y  + + ++G L  D    +  G +         V  GCG+ 
Sbjct: 50  SCGSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFVGAGASV------PGVAFGCGLF 102

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-----------DDSGR 281
            +G +       G+ G G G +S+PS L K G    +FS CF             D    
Sbjct: 103 NNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPSTVLLDLPAD 155

Query: 282 IFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFK-------AIVD 331
           +F   QG          A N    T Y + ++   +GS+ L   +++F         I+D
Sbjct: 156 LFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIID 215

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           SG+S T LP +VY+ +  EF  Q+   +          C+ + SQ  P +P + L F
Sbjct: 216 SGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 272


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 130/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVVNNP-VFVIYGTQ 407
            L F     F + +  VFV    Q
Sbjct: 279 SLHFDDGARFDLGSKGVFVERSVQ 302


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/268 (24%), Positives = 116/268 (43%), Gaps = 38/268 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP V     +D GSDL+W      +C P    Y          + P  S+T   + 
Sbjct: 54  LTLGTPPVDVYGLVDTGSDLVW-----AQCTPCQGCYRQKSPM----FEPLRSNTYTPIP 104

Query: 167 CSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C    C+   G SC +P++ C Y+   Y +++ + G+L  + +   S     +   V   
Sbjct: 105 CDSEECNSLFGHSC-SPQKLCAYSY-AYADSSVTKGVLARETVTFSSTDGEPV---VVGD 159

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS--FSMCF-----DKD 277
           ++ GCG   SG +      +  +G+        SL+++ G +  S  FS C      D  
Sbjct: 160 IVFGCGHSNSGTF-----NENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPH 214

Query: 278 DSGRIFFGDQGPATQQ---STSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI----- 329
             G I FGD    + +   +T  ++  G+   Y++ +E   +G + +   S + +     
Sbjct: 215 TLGTISFGDASDVSGEGVAATPLVSEEGQ-TPYLVTLEGISVGDTFVSFNSSEMLSKGNI 273

Query: 330 -VDSGSSFTFLPKEVYETIAAEFDRQVN 356
            +DSG+  T+LP+E Y+ +  E   Q N
Sbjct: 274 MIDSGTPATYLPQEFYDRLVKELKVQSN 301


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 131/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPGFSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 131/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 140/328 (42%), Gaps = 62/328 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP  ++   +D GSDL+W  C  C +C           D+    + P  SS+   L
Sbjct: 104 LAIGTPPETYSAIMDTGSDLIWTQCKPCTQC----------FDQPSPIFDPKKSSSFSKL 153

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           SCS +LC        P+  C  + +Y   Y + +S+ G +  +       G  ++ N   
Sbjct: 154 SCSSQLCK-----ALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTF---GKVSIPN--- 202

Query: 223 ASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDD 278
             V  GCG    G G+  G    GL+GLG G +S+ S L +A      FS C    D   
Sbjct: 203 --VGFGCGEDNEGDGFTQG---SGLVGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTK 252

Query: 279 SGRIFFG-----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK---- 327
           +  +  G     +   A  ++T  + +  +   Y + +E   +G + L  K+++F+    
Sbjct: 253 TSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDD 312

Query: 328 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK----SSSQRLP 379
                I+DSG++ T+L +  ++ +  EF  Q+   + +      + CY     +S   +P
Sbjct: 313 GTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSELEVP 372

Query: 380 KL----PSVKLMFPQNNSFVVNNPVFVI 403
           KL        L  P  N  + ++ + VI
Sbjct: 373 KLVLHFTGADLELPGENYMIADSSMGVI 400


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 88/325 (27%), Positives = 136/325 (41%), Gaps = 69/325 (21%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           +LG+    L Y   + IGTP ++  V +D GSD+ W+ C   R    S+ +++       
Sbjct: 115 TLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH-ARAGAGSSLFFD------- 166

Query: 153 EYSPSASSTSKHLSCSHRLC------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
              P  SST    SCS   C      D G S  +    C YT+  Y + ++++G    D 
Sbjct: 167 ---PGKSSTYTPFSCSSAACTRLEGRDNGCSLNS---TCQYTV-RYGDGSNTTGTYGSDT 219

Query: 207 LHLISGGDNALKNSVQASVIIGCG-MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AG 264
           L L S     ++N        GC      G  LD    DGL+GLG G    PSL+++ A 
Sbjct: 220 LALNS--TEKVEN-----FQFGCSETSDPGEGLDEDQTDGLMGLGGG---APSLVSQTAA 269

Query: 265 LIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFL---ASNGK--YIT------------Y 307
              ++FS C               PAT +S+ FL   AS G   ++T            Y
Sbjct: 270 TYGSAFSYCL--------------PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFY 315

Query: 308 IIGVETCCIGSS--CLKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 363
            + ++   +G     +  T F A  I+DSG+  T LP   Y  ++A F   +     +  
Sbjct: 316 FVILQGINVGGDPVAISPTVFAAGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARA 375

Query: 364 GYPWKCCYKSSSQRLPKLPSVKLMF 388
                 C+  + Q    +P+V+L+F
Sbjct: 376 FSILDTCFDFTGQDNVSIPAVELVF 400


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 131/322 (40%), Gaps = 47/322 (14%)

Query: 90  SKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLD 148
           SK +S  ++    ++  + IG+P     + +D+GSD++W+ C  C+ C       Y   D
Sbjct: 112 SKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLEC-------YAQAD 164

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
                + P++S+T   +SC   +C  L TS       C Y +  Y + + + G L  + L
Sbjct: 165 ---PLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVS-YGDGSYTKGTLALETL 220

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
            L   G  A++      V IGCG +  G +   V   GL+GLG G +S+   L  A    
Sbjct: 221 TL---GGTAVEG-----VAIGCGHRNRGLF---VGAAGLLGLGWGPMSLVGQLGGA--AG 267

Query: 268 NSFSMCF---------DKDDSGRIFFGDQGPATQQSTSF-LASNGKYIT-YIIGVETCCI 316
            +FS C            D +G +  G      + +    L  N +  + Y +GV    +
Sbjct: 268 GAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGV 327

Query: 317 GSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP 366
           G   L          +      ++D+G++ T LP+E Y  +   F   V     +     
Sbjct: 328 GDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSL 387

Query: 367 WKCCYKSSSQRLPKLPSVKLMF 388
              CY  S     ++P+V   F
Sbjct: 388 LDTCYDLSGYTSVRVPTVSFYF 409


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/326 (24%), Positives = 134/326 (41%), Gaps = 51/326 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + ++ +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPGFSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMS 159

Query: 280 GRIFF---------GDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGSSCLKQT--- 324
            R FF         G +  AT+   + T  +A       + + +    +    L  +   
Sbjct: 160 ERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSI 219

Query: 325 -SFKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
            S K +V DSGS  +++P      ++    R++     + E    + CY   S     +P
Sbjct: 220 FSRKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMP 278

Query: 383 SVKLMFPQNNSFVV-NNPVFVIYGTQ 407
           ++ L F     F + ++ VFV    Q
Sbjct: 279 AISLHFDDGARFDLGSHGVFVERSVQ 304


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 125/311 (40%), Gaps = 69/311 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP  +  + LD GS L WI C   +  P          +    + PS SS+   L 
Sbjct: 76  LPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----------KPKTSFDPSLSSSFSTLP 125

Query: 167 CSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CSH LC        L TSC + +  C Y+  +Y + T + G LV++ +   +        
Sbjct: 126 CSHPLCKPRIPDFTLPTSCDSNRL-CHYSY-FYADGTFAEGNLVKEKITFSN-------T 176

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD- 278
            +   +I+GC  + S          G++G+  G +   S +++A +  + FS C      
Sbjct: 177 EITPPLILGCATESSDD-------RGILGMNRGRL---SFVSQAKI--SKFSYCIPPKSN 224

Query: 279 ------SGRIFFGDQG-------------PATQQSTSF--LASNGKYITYIIGVETCCIG 317
                 +G  + GD               P +Q+  +   LA     I    G++   I 
Sbjct: 225 RPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNIS 284

Query: 318 SSCLKQT---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW----KCC 370
            S  +     S + +VDSGS FT L    Y+ + AE   +V   +   +GY +      C
Sbjct: 285 GSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLK--KGYVYGGTADMC 342

Query: 371 YKSSSQRLPKL 381
           +  +   +P+L
Sbjct: 343 FDGNVAMIPRL 353


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 125/311 (40%), Gaps = 69/311 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP  +  + LD GS L WI C   +  P          +    + PS SS+   L 
Sbjct: 76  LPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP----------KPKTSFDPSLSSSFSTLP 125

Query: 167 CSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CSH LC        L TSC + +  C Y+  +Y + T + G LV++ +   +        
Sbjct: 126 CSHPLCKPRIPDFTLPTSCDSNRL-CHYSY-FYADGTFAEGNLVKEKITFSN-------T 176

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD- 278
            +   +I+GC  + S          G++G+  G +   S +++A +  + FS C      
Sbjct: 177 EITPPLILGCATESSDD-------RGILGMNRGRL---SFVSQAKI--SKFSYCIPPKSN 224

Query: 279 ------SGRIFFGDQG-------------PATQQSTSF--LASNGKYITYIIGVETCCIG 317
                 +G  + GD               P +Q+  +   LA     I    G++   I 
Sbjct: 225 RPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNIS 284

Query: 318 SSCLKQT---SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW----KCC 370
            S  +     S + +VDSGS FT L    Y+ + AE   +V   +   +GY +      C
Sbjct: 285 GSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLK--KGYVYGGTADMC 342

Query: 371 YKSSSQRLPKL 381
           +  +   +P+L
Sbjct: 343 FDGNVAMIPRL 353


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 129/324 (39%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVVNNP-VFVIYGTQ 407
            L F     F +    VFV    Q
Sbjct: 279 SLHFDDGARFDLGRRGVFVERSVQ 302


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/327 (23%), Positives = 135/327 (41%), Gaps = 55/327 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + +D GSDL W+ C  C+ C           ++    + P+ASS+ ++++C
Sbjct: 155 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTC 204

Query: 168 SHRLCDL------GTSCQNPKQ-PCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKN 219
             + C L        +C+ P +  CPY   Y  ++ ++  L +E   ++L + G +   +
Sbjct: 205 GDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 264

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DK 276
                V+ GCG +  G +       GL    L   S   L A  G   ++FS C      
Sbjct: 265 ----GVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVEHGS 315

Query: 277 DDSGRIFFGDQ----GPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCL---------- 321
           D   ++ FG+          + T+F  ++    T Y + ++   +G   L          
Sbjct: 316 DAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVG 375

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPK 380
           K  S   I+DSG++ ++  +  Y+ I   F   ++        +P    CY  S    P+
Sbjct: 376 KDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVERPE 435

Query: 381 LPSVKLM--------FPQNNSFVVNNP 399
           +P + L+        FP  N FV  +P
Sbjct: 436 VPELSLLFADGAVWDFPAENYFVRLDP 462


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 148/349 (42%), Gaps = 53/349 (15%)

Query: 19  SSGAETVMFSTKLIHRFS-----EEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQ 73
           S+ +++  FST LIH  S     + VKA  ++K+    S  ++ ++            +Q
Sbjct: 35  SAASDSKGFSTNLIHIHSPSSPYKNVKAESLAKDTALESTLSRHAYLR---------ARQ 85

Query: 74  KMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD- 132
           +    P   +  P    K+  L N         + IG P  +  V LD GSDL WI C+ 
Sbjct: 86  QKALQPADFVPPPLIRDKSAFLAN---------LSIGNPPTNVYVVLDTGSDLFWIQCEP 136

Query: 133 CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQ-NPKQPCPYTMD 190
           C  C       YN           + S +   + C+   C  LG   Q +    C Y   
Sbjct: 137 CDVCYKQKDPIYNR----------TKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTS 186

Query: 191 YYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLG 250
            Y + + +SGLL  + +   S   +  K    A V  GCG+ Q+  ++      G++GLG
Sbjct: 187 -YADGSRTSGLLSYEKVAFTSHYSDEDKT---AQVGFGCGL-QNLNFVTSSRDGGVLGLG 241

Query: 251 LGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQQSTSFLASNGKYIT 306
            G +S+ S L+  G +  SF+ CF    + +  G + FGD        T  + +   Y+ 
Sbjct: 242 PGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVN 301

Query: 307 YI---IGVET--CCIGSSCLKQT---SFKAIVDSGSSFTFLPKEVYETI 347
            +   +GVE     I SS  ++    S   I+DSGS+ +  P EVYE +
Sbjct: 302 LLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVV 350


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 118/314 (37%), Gaps = 70/314 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +GTP ++F V  D GSDL+W  C  C +C            +    + P++SST   L
Sbjct: 90  ISVGTPLLTFSVVADTGSDLIWTQCAPCTKC----------FQQPAPPFQPASSSTFSKL 139

Query: 166 SCSHRLCDLGTSCQNPKQPCPYT---MDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
            C+   C       N  + C  T    +Y   +  ++G L  + L +   GD +      
Sbjct: 140 PCTSSFCQF---LPNSIRTCNATGCVYNYKYGSGYTAGYLATETLKV---GDASFP---- 189

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG-- 280
            SV  GC  +   G LD         LG+G                 FS C     +   
Sbjct: 190 -SVAFGCSTENGLGQLD---------LGVGR----------------FSYCLRSGSAAGA 223

Query: 281 -RIFFGDQGPATQ---QSTSFLASNGKYIT-YIIGVETCCIGSSCLKQTSFK-------- 327
             I FG     T    QST F+ +   + + Y + +    +G + L  T+          
Sbjct: 224 SPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGL 283

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL--P 382
               IVDSG++ T+L K+ YE +   F  Q  D  T         C+KS+      +  P
Sbjct: 284 GGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVP 343

Query: 383 SVKLMFPQNNSFVV 396
           S+ L F     + V
Sbjct: 344 SLVLRFDGGAEYAV 357


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 132/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + ++ +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q   S   GC M   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPSFSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 129/302 (42%), Gaps = 41/302 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +G P       LD GSD+ W+   C+ CA  +  Y    ++    + P  SS+   +SC 
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWL--QCLPCAGKNGCY----EQITPIFDPELSSSYNPVSCD 56

Query: 169 HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIG 228
              C L          C Y ++Y  + + + G L  + L  +    N++ N     + IG
Sbjct: 57  SEQCQLLDEAGCNVNSCIYKVEY-GDGSFTIGELATETLTFVHS--NSIPN-----ISIG 108

Query: 229 CGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD-- 286
           CG    G +   V  DGLIGLG G IS+ S L  +     SFS C    DS      D  
Sbjct: 109 CGHDNEGLF---VGADGLIGLGGGAISISSQLKAS-----SFSYCLVDIDSPSFSTLDFN 160

Query: 287 QGPATQQSTSFLASNGKYITY----IIGVETCCIGSSCLKQTSFK----------AIVDS 332
             P +    S L  N ++ ++    +IG+    +G   L  +S +           IVDS
Sbjct: 161 TDPPSDSLISPLVKNDRFPSFRYVKVIGMS---VGGKPLPISSSRFEIDESGLGGIIVDS 217

Query: 333 GSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNN 392
           G++ T LP +VYE +   F     +   + E  P+  CY  SSQ   ++P++  + P  N
Sbjct: 218 GTTITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGEN 277

Query: 393 SF 394
           S 
Sbjct: 278 SL 279


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/306 (24%), Positives = 114/306 (37%), Gaps = 65/306 (21%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +   + IGTP     + LD GSDL+W      +C P  A +    D+ L  + PS SST 
Sbjct: 89  YLVHLAIGTPPQPVQLTLDTGSDLIW-----TQCQPCPACF----DQALPYFDPSTSSTL 139

Query: 163 KHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
              SC   LC                     +    + L   D    +  G +       
Sbjct: 140 SLTSCDSTLC---------------------QGLPVASLPRSDKFTFVGAGASV------ 172

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK------ 276
             V  GCG+  +G +       G+ G G G +S+PS L K G    +FS CF        
Sbjct: 173 PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIP 225

Query: 277 -----DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS---------CLK 322
                D    +F   QG    Q+T  + +      Y + ++   +GS+          LK
Sbjct: 226 STVLLDLPADLFSNGQG--AVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK 283

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
             +   I+DSG++ T LP  VY  +   F  QV   + S        C  +  +  P +P
Sbjct: 284 NGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVP 343

Query: 383 SVKLMF 388
            + L F
Sbjct: 344 KLVLHF 349


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 148/366 (40%), Gaps = 51/366 (13%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFS-----EEVKALGVSKNRNATSWPA 55
           +N + L I     +    S+ +++  FST LIH  S     + VKA  ++K+    S  +
Sbjct: 4   VNNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAKDTALESTLS 63

Query: 56  KKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVS 115
           + ++            +Q+    P   +  P    K+  L N         + IG P  +
Sbjct: 64  RHAYLR---------ARQQKALQPADFVPPPLIRDKSAFLAN---------LSIGNPPTN 105

Query: 116 FLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-D 173
             V LD GSDL WI C+ C  C       YN           + S +   + C+   C  
Sbjct: 106 VYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNR----------TKSDSYTEMLCNEPPCVS 155

Query: 174 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQ 233
           LG   Q            Y +   +SGLL  + +   S   +  K    A V  GCG+ Q
Sbjct: 156 LGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKT---AQVGFGCGL-Q 211

Query: 234 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGP 289
           +  ++      G++GLG G +S+ S L+  G +  SF+ CF    + +  G + FGD   
Sbjct: 212 NLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDATY 271

Query: 290 ATQQSTSFLASNGKYITYI-----IGVETCCIGSSCLKQT---SFKAIVDSGSSFTFLPK 341
                T  + +   Y+  +     +G     I SS  ++    S   I+DSGS+ +  P 
Sbjct: 272 LNGDMTPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVFPP 331

Query: 342 EVYETI 347
           EVYE +
Sbjct: 332 EVYEVV 337


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 129/324 (39%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP+ + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPGFSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLRRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F +    VFV    Q
Sbjct: 279 SLHFDDGARFDLGRGGVFVERSVQ 302


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 134/329 (40%), Gaps = 60/329 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + LD GSDL W+ C  C  C   +  +Y+          P  S++ K+++C
Sbjct: 166 VGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD----------PKTSASFKNITC 215

Query: 168 SHRLCDLGTS------CQNPKQPCPYTMDYYTENTSSSGLLVEDI---LHLISGGDNALK 218
           +   C L +S      C++  Q CPY   Y   + ++    VE     L    GG +  K
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---- 274
                +++ GCG    G +        L+GLG G +S  S L    L  +SFS C     
Sbjct: 276 ---VGNMMFGCGHWNRGLFSGASG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRN 327

Query: 275 -DKDDSGRIFFGDQGPATQQS----TSFL--ASNGKYITYIIGVETCCIGSSCLK----- 322
            + + S ++ FG+       +    TSF+    N     Y I +++  +G   L      
Sbjct: 328 SNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEET 387

Query: 323 -----QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSS-- 374
                      I+DSG++ ++  +  YE I  +F  ++ +    F  +P    C+  S  
Sbjct: 388 WNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGI 447

Query: 375 SQRLPKLPSVKLM--------FPQNNSFV 395
            +    LP + +         FP  NSF+
Sbjct: 448 EENNIHLPELGIAFVDGTVWNFPAENSFI 476


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 125/302 (41%), Gaps = 41/302 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  I +G+P  +  V +D+GSD++W+ C+ C +C       Y+  D   N   P+ SS+
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQC-------YHQSDPVFN---PADSSS 183

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
              +SC+  +C    +    +  C Y +  Y + + + G L    L  ++ G   ++N  
Sbjct: 184 YAGVSCASTVCSHVDNAGCHEGRCRYEVS-YGDGSYTKGTLA---LETLTFGRTLIRN-- 237

Query: 222 QASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPSLLAKAGLIRNSFSMCF---DKD 277
              V IGCG    G +   V   GL+GLG G +S V  L  +AG    +FS C       
Sbjct: 238 ---VAIGCGHHNQGMF---VGAAGLLGLGSGPMSFVGQLGGQAG---GTFSYCLVSRGIQ 288

Query: 278 DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC---LKQTSFK------- 327
            SG + FG +      +   L  N +  ++     +          + +  FK       
Sbjct: 289 SSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDG 348

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             ++D+G++ T LP   YE     F  Q  +   +     +  CY        ++P+V  
Sbjct: 349 GVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSF 408

Query: 387 MF 388
            F
Sbjct: 409 YF 410


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 109/261 (41%), Gaps = 27/261 (10%)

Query: 96  GNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCA-PLSASY--YNSLDR 149
           GN +   +YT  + IG P   + + +D GSDL W+ CD  C  C  P +  Y  +  L +
Sbjct: 56  GNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHGDLVK 115

Query: 150 DLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
            ++    +  S   H             C  P + C Y ++Y  + +S   LL ++I   
Sbjct: 116 CVDPLCAAIQSAPNH------------HCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLK 163

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
            + G  A     +  +  GCG  Q+  G     +  G++GLG G  S+ S L   GLIRN
Sbjct: 164 FTNGSLA-----RPMLAFGCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRN 218

Query: 269 SFSMCFDKDDSGRIFFGDQ--GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
               C      G +FFGDQ   P+    T  L S+     Y  G                
Sbjct: 219 VVGHCLSGRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQ-HYKTGPADLFFDRKTTSVKGL 277

Query: 327 KAIVDSGSSFTFLPKEVYETI 347
           + I DSGSS+T+   + ++ +
Sbjct: 278 ELIFDSGSSYTYFNSQAHKAL 298


>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 416

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/312 (24%), Positives = 118/312 (37%), Gaps = 58/312 (18%)

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIP-CDCVRCAPLSASYYN 145
           + G   + L N     +YT IDIGTP  +F V LD GS  LW+P   C   A    + Y+
Sbjct: 89  ANGGHGVPLTNFMNAQYYTEIDIGTPPQTFKVILDTGSSNLWVPSSQCTSIACFLHTKYD 148

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
           S          SASS+ K       +           Q    +M+ +  N        +D
Sbjct: 149 S----------SASSSYKANGTEFSI-----------QYGSGSMEGFVSN--------DD 179

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL------ 259
           I+     GD +L +   A      G+  + G  DG+     +GL    I+V  +      
Sbjct: 180 IVF----GDMSLSSVDFAEATKEPGLAFAFGKFDGI-----LGLAYDTIAVNHITPVFYE 230

Query: 260 LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 316
           L   G+I     SF +   +DD G   FG   P+        A   +   + + +E    
Sbjct: 231 LVNQGIISEPVFSFRLGSSEDDGGEAIFGGIDPSAYSGKIDYAPVRRKAYWEVELEKVSF 290

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
           G   L+  +  A +D+G+S   LP +V E +  +   + +          W   Y     
Sbjct: 291 GDDDLELENTGAAIDTGTSLIALPTDVAEMLNTQIGAKKS----------WNGQYTVDCA 340

Query: 377 RLPKLPSVKLMF 388
           ++P LP +   F
Sbjct: 341 KVPDLPDLTFYF 352


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/295 (24%), Positives = 118/295 (40%), Gaps = 40/295 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I +G P   F +  D  +D  W+ C  C++C           D+  + + PS SS+   L
Sbjct: 191 IGVGGPPQKFYMIFDLQTDFTWLQCQPCIKC----------YDQPDSIFDPSQSSSYTLL 240

Query: 166 SCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           SC  + C+L   +SC +    C Y + Y  + T++ G+L+ + +   S G          
Sbjct: 241 SCETKHCNLLPNSSCSDDGY-CRYNITY-KDGTNTEGVLINETVSFESSG-------WVD 291

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-SGRI 282
            V +GC  K  G +   V  DG  GLG G +S PS +  + +   S+ +   KD  S   
Sbjct: 292 RVSLGCSNKNQGPF---VGSDGTFGLGRGSLSFPSRINASSM---SYCLVESKDGYSSST 345

Query: 283 FFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK--QTSFK--------AIVD 331
              +  P +    + L  N K    Y +G++   +G   +    ++F          IV 
Sbjct: 346 LEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVS 405

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
           S S  T L  + Y  +   F  +            +  CY  SS    +LP ++ 
Sbjct: 406 SSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEF 460


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 127/316 (40%), Gaps = 51/316 (16%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 170 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVVCYEQREK--- 221

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P+ SST  ++SC+   C DL    C      C Y +  Y + + S G    D L L 
Sbjct: 222 LFDPARSSTYANVSCAAPACSDLNIHGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 278

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 279 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 325

Query: 270 FSMCFDKDDSGRIF--FGDQGPATQQS---TSFLASNGKYITYIIGVETCCIGSSCLK-- 322
           F+ C     +G  +  FG    A  ++   T  L  NG    Y +G+    +G   L   
Sbjct: 326 FAHCLPARSTGTGYLDFGAGSLAAARARLTTPMLTENGPTF-YYVGMTGIRVGGQLLSIP 384

Query: 323 QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYK 372
           Q+ F     IVDSG+  T LP   Y ++     R       +  GY           CY 
Sbjct: 385 QSVFATAGTIVDSGTVITRLPPAAYSSL-----RYAFAAAMAARGYKKAPAVSLLDTCYD 439

Query: 373 SSSQRLPKLPSVKLMF 388
            +      +P+V L+F
Sbjct: 440 FTGMSQVAIPTVSLLF 455


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 114/292 (39%), Gaps = 68/292 (23%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +++GTP  +    LD GS L+W PC     C  C     ++ N     +  + P  SST+
Sbjct: 92  LNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHC-----NFPNIDPTKIPTFIPKNSSTA 146

Query: 163 KHLSCSHRLC------DLGTSCQNPKQP--------CPYTMDYYTENTSSSGLLVEDILH 208
           K L C +  C      D+ + C   K+P        CP  +  Y    ++  LL++++  
Sbjct: 147 KLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNL-- 204

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
                            ++GC +      L    P G+ G G G+ S+PS   +  L R 
Sbjct: 205 -------NFPGKTVPQFLVGCSI------LSIRQPSGIAGFGRGQESLPS---QMNLKR- 247

Query: 269 SFSMCF------DKDDSGRIFF-----GDQGPATQQSTSFLA--SNGKYIT--YIIGVET 313
            FS C       D   S  +       GD        T F +  SN       Y + +  
Sbjct: 248 -FSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRK 306

Query: 314 CCIGSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
             +G   +K            +   IVDSGS+FTF+ + VY  +A EF RQ+
Sbjct: 307 LIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQL 358


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 131/328 (39%), Gaps = 50/328 (15%)

Query: 88  QGSKTMSLGN--DFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           +G   M LG+  D+G   Y T + +GTP   F V +D GS+L W+ C             
Sbjct: 70  KGGVKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR-------YRGRG 122

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLG-------TSCQNPKQPCPYTMDY-YTENT 196
               ++   +    S + K + C  + C +        ++C  P  PC Y  DY Y + +
Sbjct: 123 KGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSY--DYRYADGS 180

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
           ++ G+  ++ + +  G  N  K  ++  +++GC    S         DG++GL   + S 
Sbjct: 181 AAQGVFAKETITV--GLTNGRKARLRG-LLVGCSSSFS--GQSFQGADGVLGLAFSDFSF 235

Query: 257 PSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGDQGPATQQSTSFLASNGKYIT----- 306
            S      L     S C      +K+ S  + FG    +T   T+   +    +T     
Sbjct: 236 TS--TATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPF 293

Query: 307 YIIGVETCCIGSSCL--------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQ-VND 357
           Y I +    IG   L          T    I+DSG+S T L +  Y+ +     R  V  
Sbjct: 294 YAINIIGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVEL 353

Query: 358 TITSFEGYPWKCCYKSSS----QRLPKL 381
                EG P + C+ S+S     +LP+L
Sbjct: 354 KRVKPEGIPIEYCFSSTSGFNESKLPQL 381


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 91/405 (22%), Positives = 163/405 (40%), Gaps = 61/405 (15%)

Query: 4   ISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKK-----S 58
           + L++ L    +L+  +    + F+T LIHR S +      S   N    P+++      
Sbjct: 8   VLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPK------SPFYNPAETPSQRIRNAIH 61

Query: 59  FEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLV 118
             + +V   +D+ +           + P  G   M+L             +GTP    + 
Sbjct: 62  RSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMNLS------------LGTPPSPIMA 109

Query: 119 ALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DL 174
             D GS+L+W  C  C  C       Y  +D     + P ASST K +SCS   C   + 
Sbjct: 110 VADTGSNLIWTQCKPCDDC-------YTQVDP---LFDPKASSTYKDVSCSSSQCTALEN 159

Query: 175 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 234
             SC    + C Y +  Y + + + G    D L L S  +  ++     ++IIGCG   +
Sbjct: 160 QASCSTEDKTCSYLVS-YADGSYTMGKFAVDTLTLGSTDNRPVQ---LKNIIIGCGQNNA 215

Query: 235 GGYLDGVAPDGLIGLGLGEISVPSLLAKAG-LIRNSFSMCF--DKDDSGRIFFGDQ---- 287
             + +  +     G+        SL+ + G  I   FS C   + D + +I FG      
Sbjct: 216 VTFRNKSS-----GVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVS 270

Query: 288 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--IVDSGSSFTFLPKEV 343
           GP T  +   + S   +  Y + +++  +GS  ++   ++ K   ++DSG++ T LP + 
Sbjct: 271 GPGTVSTPLVVKSRDTF--YYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKY 328

Query: 344 YETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           Y  I       +N   +  E      CY +++     +P + + F
Sbjct: 329 YIEIENAVASLINADKSKDERIGSSLCYNATADL--NIPVITMHF 371


>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
          Length = 412

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/310 (23%), Positives = 119/310 (38%), Gaps = 56/310 (18%)

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
            G   + L N     ++T I +GTP   F V LD GS  LW+P    +C  ++   +   
Sbjct: 86  NGGHNVPLTNFMNAQYFTTITLGTPPQEFKVILDTGSSNLWVP--STKCTSIACFLH--- 140

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
                +Y  SASST K                  K    + ++Y   + S  G +  D+L
Sbjct: 141 ----AKYDSSASSTHK------------------KNGTSFKIEY--GSGSMEGFVSNDVL 176

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL------LA 261
            +   GD  + +   A      G+  + G  DG+     +GLG   ISV  +      + 
Sbjct: 177 SI---GDLKIHDQDFAEATKEPGLAFAFGKFDGI-----LGLGYDTISVNHITPPFYSMV 228

Query: 262 KAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGS 318
             GL+     SF +   ++D G   FG    +        A   +   + + +     G 
Sbjct: 229 NKGLLDAPVFSFRLGSSEEDGGEAVFGGIDESAYSGKINYAPVRRKAYWEVELPKVAFGD 288

Query: 319 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL 378
             L+  +  A +D+G+S   LP +V E + A    Q+  T +      W   Y    +++
Sbjct: 289 DVLELENTGAAIDTGTSLIALPSDVAEMLNA----QIGATKS------WNGQYTVDCKKV 338

Query: 379 PKLPSVKLMF 388
           P LP   L F
Sbjct: 339 PDLPDFTLWF 348


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 80/321 (24%), Positives = 138/321 (42%), Gaps = 57/321 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IG+   +    +D GS+ + + C   R  P+              + P+AS + + + 
Sbjct: 3   LGIGSLQKNLSAIIDTGSEAVLVQCGS-RSRPV--------------FDPAASQSYRQVP 47

Query: 167 CSHRLC-------DLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           C  +LC         G+S  C N    C Y++ Y  ++ +S+G   +D++ L S   N+ 
Sbjct: 48  CISQLCLAVQQQTSNGSSQPCVNSSAACTYSLSY-GDSRNSTGDFSQDVIFLNS--TNSS 104

Query: 218 KNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
             +VQ   V  GC     G  +D +   G++G   G +S+PS L K  L  + FS CF  
Sbjct: 105 SQAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KDRLGGSKFSYCFPS 162

Query: 277 D-----DSGRIFFGDQG-PATQQSTSFLASN----GKYITYIIGVETCCIGSSCLK--QT 324
                  +G IF GD G   ++ S + L  N     +   Y +G+ +  +    L   ++
Sbjct: 163 QPWQPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPES 222

Query: 325 SFK---------AIVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYPWKCCY 371
           +FK          ++DSG++FT +  + Y       AA     +   + +  G+   C  
Sbjct: 223 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFD-DCYN 281

Query: 372 KSSSQRLPKLPSVKLMFPQNN 392
            S+   LP +P V+L   QNN
Sbjct: 282 ISAGSSLPGVPEVRLSL-QNN 301


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 80/324 (24%), Positives = 132/324 (40%), Gaps = 46/324 (14%)

Query: 82  QMLFPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLS 140
           ++L P   S  +S G   G   Y + + +G P+  F + LD GSD+ W+     +C P S
Sbjct: 135 ELLRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWL-----QCKPCS 189

Query: 141 ASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTS-CQNPKQPCPYTMDYYTENTSS 198
             Y  S       + P+ASS+   L+C  + C DL  S C+N K  C Y +  Y + + +
Sbjct: 190 DCYQQSDPI----FDPTASSSYNPLTCDAQQCQDLEMSACRNGK--CLYQVS-YGDGSFT 242

Query: 199 SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPS 258
            G  V + +   +G  N         V IGCG    G ++            L  +    
Sbjct: 243 VGEYVTETVSFGAGSVN--------RVAIGCGHDNEGLFVGSAG--------LLGLGGGP 286

Query: 259 LLAKAGLIRNSFSMCFDKDDSGR---IFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
           L   + +   SFS C    DSG+   + F    P        L +      Y + +    
Sbjct: 287 LSLTSQIKATSFSYCLVDRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVS 346

Query: 316 IGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
           +G   +          +  +   IVDSG++ T L  + Y ++   F R+ ++ +   EG 
Sbjct: 347 VGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSN-LRPAEGV 405

Query: 366 P-WKCCYKSSSQRLPKLPSVKLMF 388
             +  CY  SS +  ++P+V   F
Sbjct: 406 ALFDTCYDLSSLQSVRVPTVSFHF 429


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 78/305 (25%), Positives = 124/305 (40%), Gaps = 43/305 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP     + LD GSD++WI C  C RC   S   ++          P  S +
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFD----------PRKSRS 175

Query: 162 SKHLSCSHRLCDLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
              ++C   LC    S  C   KQ C Y + Y   + +      E +           + 
Sbjct: 176 FASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETL---------TFRR 226

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD 278
           +  A V +GCG    G +   V   GL+GLG G +S PS   +     + FS C  D+  
Sbjct: 227 TRVARVALGCGHDNEGLF---VGAAGLLGLGRGRLSFPSQTGRR--FNHKFSYCLVDRSA 281

Query: 279 SGR---IFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIGSSCLKQTSFK---- 327
           S +   + FGD   +     + L SN K  T+    ++G+         +  + FK    
Sbjct: 282 SSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQT 341

Query: 328 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
                I+DSG+S T L +  Y      F    ++   + +   +  C+  S +   K+P+
Sbjct: 342 GNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPT 401

Query: 384 VKLMF 388
           V L F
Sbjct: 402 VVLHF 406


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 92/341 (26%), Positives = 134/341 (39%), Gaps = 66/341 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNS-------LDRDLNEY 154
           ++IGTP     V +D GSDL W+PC     DC+ C      Y NS            + Y
Sbjct: 16  LNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDC----DDYRNSKLMSAFSPSHSSSSY 71

Query: 155 SPSASS---TSKHLS------CSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVE 204
             S +S   T  H S      C+   C L T  +    +PCP     Y      +G L  
Sbjct: 72  RDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTR 131

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           D L +  G     K+  +     GC       Y +   P G+ G   G +S PS L   G
Sbjct: 132 DTLRVHEGPARVTKDIPK--FCFGC---VGSTYHE---PIGIAGFVRGTLSFPSQL---G 180

Query: 265 LIRNSFSMCF-------DKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETC 314
           L++  FS CF       + + S  +  GD   +++   Q T  L S      Y IG+E  
Sbjct: 181 LLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAI 240

Query: 315 CIGSSC-----LKQTSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSF 362
            +G+       L    F +      ++DSG+++T LP+  Y  + + F   +     T  
Sbjct: 241 TVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEV 300

Query: 363 EGYP-WKCCYK--SSSQRLPK----LPSVKLMFPQNNSFVV 396
           E    +  CYK    + RL       PS+   F  N SFV+
Sbjct: 301 EMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVL 341


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 133/315 (42%), Gaps = 56/315 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP VS+    D GSDL+W      +CAP S+  +    +    Y+PS+S+T   L 
Sbjct: 90  LAIGTPPVSYQAIADTGSDLIW-----TQCAPCSSQCFQ---QPTPLYNPSSSTTFAVLP 141

Query: 167 CSHRL----CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           C+  L      L  +   P   C Y M Y +  TS    + +       G       +  
Sbjct: 142 CNSSLSMCAAALAGTTPPPGCTCMYNMTYGSGWTS----VYQGSETFTFGSSTPANQTGV 197

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDD 278
             +  GC    SGG+    A  GL+GLG G +   SL+++ G+ +  FS C     D + 
Sbjct: 198 PGIAFGCS-NASGGFNTSSA-SGLVGLGRGSL---SLVSQLGVPK--FSYCLTPYQDTNS 250

Query: 279 SGRIFFG------DQGPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK----QTS 325
           +  +  G      D G  +  ST F+AS         Y + +    +G++ L       S
Sbjct: 251 TSTLLLGPSASLNDTGGVS--STPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALS 308

Query: 326 FKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG----YPWKCCYK--S 373
            KA      I+DSG++ T L    Y+ + A     V  T+ + +G         C++  S
Sbjct: 309 LKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTDGGSAATGLDLCFELPS 366

Query: 374 SSQRLPKLPSVKLMF 388
           S+   P +PS+ L F
Sbjct: 367 STSAPPTMPSMTLHF 381


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 127/341 (37%), Gaps = 55/341 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++    +GTP   FL+  D GSDL W+ C     A  S S  +S       + P  S T 
Sbjct: 97  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156

Query: 163 KHLSCSHRLCDLG-----TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNA 216
             +SC+   C         +C  P  PC Y  DY Y + +++ G +  +   +   G   
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSGREE 214

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
            K  ++  +++GC    +G   +  A DG++ LG   IS  S  A        FS C   
Sbjct: 215 RKAKLKG-LVLGCSSSYTGPSFE--ASDGVLSLGYSGISFASHAAS--RFGGRFSYCLVD 269

Query: 275 ---DKDDSGRIFFGDQGPATQ----------------QSTSFLASNGKYITYIIGVETCC 315
               ++ +  + FG   PA                  + T  L        Y + ++   
Sbjct: 270 HLSPRNATSYLTFGPN-PAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAIS 328

Query: 316 IGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
           +    LK        +     I+DSG+S T L K  Y  + A   + +   +      P+
Sbjct: 329 VAGEFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAG-LPRVTMDPF 387

Query: 368 KCCY-------KSSSQRLPKL----PSVKLMFPQNNSFVVN 397
           + CY       K +   +PK+         + P   S+V++
Sbjct: 388 EYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVID 428


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/308 (25%), Positives = 125/308 (40%), Gaps = 58/308 (18%)

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR--CA--PLSAS 142
           S  S  MS  +     ++  I +G+P  + L+  D GSDL W+ C   +  C+  P  ++
Sbjct: 67  SSKSPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGST 126

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY--------YTE 194
           +   L R    +SP+         C   LC L     NP  PC +T  +        Y++
Sbjct: 127 F---LARHSTTFSPT--------HCFSSLCQL-VPQPNP-NPCNHTRLHSTCRYEYVYSD 173

Query: 195 NTSSSGLLVED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA---PDGLIGL 249
            + +SG   ++   L+  SG +  LK     S+  GCG   SG  L G +     G++GL
Sbjct: 174 GSKTSGFFSKETTTLNTSSGREMKLK-----SIAFGCGFHASGPSLIGSSFNGASGVMGL 228

Query: 250 GLGEISVPSLLAKAGLIRNSFSMC-----FDKDDSGRIFFGDQGPATQQSTSFLASNGKY 304
           G G IS  S L +      SFS C          +  +  GD     + + S ++     
Sbjct: 229 GRGPISFASQLGRR--FGRSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLL 286

Query: 305 IT------YIIGVETCCIGSSCL----------KQTSFKAIVDSGSSFTFLPKEVYETIA 348
           I       Y I ++   +    L          +  +   ++DSG++ TFL +  Y  I 
Sbjct: 287 INPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREIL 346

Query: 349 AEFDRQVN 356
           + F R+V 
Sbjct: 347 SAFKREVK 354


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 87/385 (22%), Positives = 143/385 (37%), Gaps = 77/385 (20%)

Query: 6   LTIYLAVFWLLTESSGAETVMFSTK--LIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQ 63
           L  Y  +F LL  ++   T   + +  L H          V K R  T W          
Sbjct: 9   LLAYALIFTLLFTAAATPTAGLTMRADLTH----------VDKGRGFTRWERLSRMAVRS 58

Query: 64  VLLSSDVQKQKMKTG-PQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFL-VALD 121
              ++ + ++    G P      PS G            +H+   +IGTP    + + +D
Sbjct: 59  RARAASLYQRGGHYGQPVTATAVPSSGEY---------LIHF---NIGTPRPQRVALTMD 106

Query: 122 AGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT---- 176
            GSDL+W  C  C  C           D+    + PS SST + ++C   +C   +    
Sbjct: 107 TGSDLVWTQCTPCPVC----------FDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSV 156

Query: 177 -SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 235
            +C      C Y +  Y + + ++G + +D    +S           + +  GCG   +G
Sbjct: 157 SACALKTFRCFY-LCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTG 215

Query: 236 GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD------SGRIFFG---- 285
            +    +  G+ G G G +S+PS L + G     FS C    D      +  +F G    
Sbjct: 216 VFASNES--GIAGFGRGPLSLPSQL-RVG----RFSYCLTSHDETESNKTSAVFLGTPPN 268

Query: 286 -----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIV 330
                  GP   +ST  + S      Y + +E   +G + L          K  S   ++
Sbjct: 269 GLRAHSSGPF--RSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVI 326

Query: 331 DSGSSFTFLPKEVYETIAAEFDRQV 355
           DSG+  T  P  V+E +  EF  Q+
Sbjct: 327 DSGTGVTTFPAAVFEQLKNEFVAQL 351


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 130/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPGFSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 136/308 (44%), Gaps = 45/308 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +++ + IG P     + LD GSD+ W     V+CAP +  Y    ++    + P++S++ 
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSW-----VQCAPCAECY----EQTDPXFEPTSSASF 201

Query: 163 KHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             LSC    C   D+ + C+N    C Y + Y  + + + G  V + + L   G  +L N
Sbjct: 202 TSLSCETEQCKSLDV-SECRNGT--CLYEVSY-GDGSYTVGDFVTETVTL---GSTSLGN 254

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD 278
                + IGCG    G ++       L+GLG G +S PS L  +     SFS C  D+D 
Sbjct: 255 -----IAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYCLVDRDS 301

Query: 279 SGRIFFGDQGPATQQS-TSFLASNGKYITYI-IGVETCCIGSSCLK--QTSFKA------ 328
                     P T  + T+ L  N    T+  +G+    +G + L   +TSF+       
Sbjct: 302 DSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNG 361

Query: 329 --IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             IVDSG++ T L   VY  +   F +  +D  T+     +  CY  SS+   ++P+V  
Sbjct: 362 GIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSF 421

Query: 387 MFPQNNSF 394
            F   N  
Sbjct: 422 HFANGNEL 429


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 161/375 (42%), Gaps = 50/375 (13%)

Query: 32  IHRF-SEEVKALGVSK-NRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQG 89
           IH+   ++ KAL +S+ +R+++   A  +    Q++L+  V K  +K  P    + P   
Sbjct: 90  IHKTPHKDYKALVLSRLHRDSSRVQAITT--RLQLILNG-VSKSDLK--PLQTEIQPQDL 144

Query: 90  SKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD 148
           S  +S G   G   Y T + +G P  S+ + LD GSD+ WI     +C P S  Y  S  
Sbjct: 145 STPVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWI-----QCQPCSDCYQQSDP 199

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
                ++P+ASS+   L+C  + C+    +SC+N +  C Y ++ Y + + + G  V + 
Sbjct: 200 I----FTPAASSSYSPLTCDSQQCNSLQMSSCRNGQ--CRYQVN-YGDGSFTFGDFVTET 252

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           +    GG   +      S+ +GCG    G ++      GL G  L      SL ++  L 
Sbjct: 253 MSF--GGSGTVN-----SIALGCGHDNEGLFVGAAGLLGLGGGPL------SLTSQ--LK 297

Query: 267 RNSFSMCFDKDDSGRIFFGD--QGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSCLK- 322
             SFS C    DS      D    P      + L  + K  T Y +G+    +G   L+ 
Sbjct: 298 ATSFSYCLVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRI 357

Query: 323 -QTSFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS 373
            Q  FK         IVD G++ T L  E Y ++   F        ++     +  CY  
Sbjct: 358 PQEVFKLDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDL 417

Query: 374 SSQRLPKLPSVKLMF 388
           S Q   K+P+V   F
Sbjct: 418 SGQSSVKVPTVSFHF 432


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 115/300 (38%), Gaps = 53/300 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP      ALD  SDL+W  C     AP               ++P  S+T   + C+
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGAT--AP---------------FNPVRSTTVADVPCT 148

Query: 169 HRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
              C      +C      C YT  Y     +++GLL  +       GD  +       V+
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTF---GDTRIDG-----VV 200

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----GRI 282
            GCG+K  G +  GV+  G+IGLG G +S+ S L       + FS  F  DDS      I
Sbjct: 201 FGCGLKNVGDF-SGVS--GVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDTQSFI 252

Query: 283 FFGDQG-PATQQ--STSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKAIVDSGSSFT 337
            FGD   P T    ST  LAS+     Y + +    +      +   +F      GS   
Sbjct: 253 LFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGV 312

Query: 338 FLPKEVYETIAAEFD-RQVNDTITSFEGYP--------WKCCYKSSSQRLPKLPSVKLMF 388
           FL      T+  E   + +   + S  G P           CY   S    K+PS+ L+F
Sbjct: 313 FLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVF 372


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 107/257 (41%), Gaps = 56/257 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           +  GTP   F + LD GS + W  C  CV C   S  +++SL          ASST    
Sbjct: 131 VAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSL----------ASSTYSFG 180

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           SC      + ++  N      Y M  Y + ++S G    D + L         + V    
Sbjct: 181 SC------IPSTVGN-----TYNMT-YGDKSTSVGNYGCDTMTL-------EPSDVFQKF 221

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFF 284
             GCG    G +  G   DG++GLG G++S  S  A     +  FS C  +++S G + F
Sbjct: 222 QFGCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLF 277

Query: 285 GDQGPATQQSTSF-------------LASNGKYITYI----IGVETCCIGSSCLKQTSFK 327
           G++  AT QS+S              L  +G Y   +    +G +   I SS     S  
Sbjct: 278 GEK--ATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF--ASPG 333

Query: 328 AIVDSGSSFTFLPKEVY 344
            I+DSG+  T LP+  Y
Sbjct: 334 TIIDSGTVITRLPQRAY 350


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/271 (27%), Positives = 110/271 (40%), Gaps = 46/271 (16%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +G+P    L+ALD  +D  W  C      P S S           ++P+ S++   L CS
Sbjct: 83  LGSPAQPILLALDTSADATWAHCSPCGTCPSSGSL----------FAPANSTSYAPLPCS 132

Query: 169 HRLCDL--GTSC--QNP-KQPCPYTMDYYTE---NTSSSGLLVEDILHLISGGDNALKNS 220
             +C +  G  C  Q+P     P  M  +T+   + S    L  D LHL   G +A+ N 
Sbjct: 133 STMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHL---GKDAIPN- 188

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDKDD- 278
                  GC    SG   + +   GL+GLG G +   +LL++ G + N  FS C      
Sbjct: 189 ----YAFGCVSAVSGPTAN-LPKQGLLGLGRGPM---ALLSQVGNMYNGVFSYCLPSYKS 240

Query: 279 ---SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK----------QT 324
              SG +  G  G P   + T  L +  +   Y + V    +G + +K           T
Sbjct: 241 YYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPAT 300

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
               +VDSG+  T     VY  +  EF R V
Sbjct: 301 GAGTVVDSGTVITRWTPPVYAALREEFRRHV 331


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 85/200 (42%), Gaps = 37/200 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  I +G P    LV +D GSDL+W+ C  C RC       Y  +      Y P  S T
Sbjct: 92  YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRC-------YRQV---TPLYDPRNSKT 141

Query: 162 SKHLSCSHRLCD---LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            + + C+   C        C      C Y M  Y + ++SSG L  D L L    D  + 
Sbjct: 142 HRRIPCASPQCRGVLRYPGCDARTGGCVY-MVVYGDGSASSGDLATDTLVLPD--DTRVH 198

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
           N     V +GCG     G L   A  GL+G G G++S P+ LA A    + FS C     
Sbjct: 199 N-----VTLGCGHDNE-GLLASAA--GLLGAGRGQLSFPTQLAPA--YGHVFSYC----- 243

Query: 279 SGRIFFGDQGPATQQSTSFL 298
                 GD+    + S+S+L
Sbjct: 244 -----LGDRMSRARNSSSYL 258


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 138/326 (42%), Gaps = 57/326 (17%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           L    + IG+   +    +D GS+ + + C   R  P+              + P+AS +
Sbjct: 99  LFSMQLGIGSLQKNLSAIIDTGSEAVLVQCGS-RSRPV--------------FDPAASQS 143

Query: 162 SKHLSCSHRLC-------DLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
            + + C  +LC         G+S  C N    C Y++ Y  ++ +S+G   +D++ L S 
Sbjct: 144 YRQVPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSY-GDSRNSTGDFSQDVIFLNS- 201

Query: 213 GDNALKNSVQ-ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFS 271
             N+   +VQ   V  GC     G  +D +   G++G   G +S+PS L K  L  + FS
Sbjct: 202 -TNSSGQAVQFRDVAFGCAHSPQGFLVD-LGSLGIVGFNRGNLSLPSQL-KDRLGGSKFS 258

Query: 272 MCF-----DKDDSGRIFFGDQGPATQQS--TSFL---ASNGKYITYIIGVETCCIGSSCL 321
            CF         +G IF GD G +  +   T  L    +  +   Y +G+ +  +    L
Sbjct: 259 YCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTL 318

Query: 322 K--QTSFK---------AIVDSGSSFTFLPKEVY----ETIAAEFDRQVNDTITSFEGYP 366
              +++FK          ++DSG++FT +  + Y       AA     +   + +  G+ 
Sbjct: 319 AIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFD 378

Query: 367 WKCCYKSSSQRLPKLPSVKLMFPQNN 392
             C   S+   LP +P V+L   QNN
Sbjct: 379 -DCYNISAGSSLPGVPEVRLSL-QNN 402


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 70/307 (22%), Positives = 128/307 (41%), Gaps = 55/307 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSK 163
           +  GTP+V  ++ +D GSD+ W+   PC+   C P     ++          PS SST  
Sbjct: 129 LGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFD----------PSKSSTYA 178

Query: 164 HLSCSHRLCD-LG----TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            ++C    C+ LG      C +    C Y ++ Y + +S+ G+   + +    G      
Sbjct: 179 PIACGADACNKLGDHYRNGCTSGGTQCGYRVE-YGDGSSTRGVYSNETITFAPG------ 231

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD--K 276
                    GCG  Q G        DGL+GLG    S+  ++  A +   +FS C     
Sbjct: 232 -ITVKDFHFGCGHDQRG---PSDKFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALN 285

Query: 277 DDSGRIFFGDQGPATQQSTSFLASNGKYI-----TYIIGVETCCIGSSCLK--QTSFKA- 328
            ++G +  G +  A   +++F+ +   ++     +Y++ +    +G   L   +++F+  
Sbjct: 286 SEAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGG 345

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP------WKCCYKSSSQRLPKL 381
            ++DSG+  T LP+  Y  + A   +       +F  YP      +  CY  +      +
Sbjct: 346 MLIDSGTIVTELPETAYNALNAALRK-------AFAAYPMVASEDFDTCYNFTGYSNVTV 398

Query: 382 PSVKLMF 388
           P V L F
Sbjct: 399 PRVALTF 405


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 124/306 (40%), Gaps = 45/306 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T + +GTP     + LD GSD++W+ C  C +C   S   +N          P  S +
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFN----------PYKSKS 159

Query: 162 SKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
              + CS  LC     + C   +  C Y + Y   + ++     E +           + 
Sbjct: 160 FAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETL---------TFRG 210

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGL-IRNSFSMCF-DKD 277
           +  A V +GCG    G +   V   GL+GLG G +S PS   + G+   + FS C  D+ 
Sbjct: 211 NKIAKVALGCGHHNEGLF---VGAAGLLGLGRGRLSFPS---QTGIRFNHKFSYCLVDRS 264

Query: 278 DSGR---IFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIGSSCLKQTSFK--- 327
            S +   + FGD   +     + L  N K  T+    +IG+    +    +  + FK   
Sbjct: 265 ASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDS 324

Query: 328 -----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP 382
                 I+DSG+S T L +  Y  +   F           E   +  CY  S Q   K+P
Sbjct: 325 AGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVP 384

Query: 383 SVKLMF 388
           +V L F
Sbjct: 385 TVVLHF 390


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 149/356 (41%), Gaps = 61/356 (17%)

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLW 128
           +Q    K     Q+   S+    ++ G  F  L+Y   + +G+ N+S +V  D GSDL W
Sbjct: 88  IQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIVTMGLGSQNMSVIV--DTGSDLTW 145

Query: 129 IPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC---DLGTSCQNP--KQ 183
           + C+  R      S YN   ++   + PS S + + + C+   C   +LG    +P    
Sbjct: 146 VQCEPCR------SCYN---QNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSA 196

Query: 184 PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAP 243
            C Y ++Y   + +S  L +E    L  GG +       ++ + GCG + + G   G + 
Sbjct: 197 TCDYVVNYGDGSYTSGELGIE---KLGFGGISV------SNFVFGCG-RNNKGLFGGAS- 245

Query: 244 DGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD----SGRIFFGDQGPATQQSTSF-- 297
            GL+GLG  E+S+ S           FS C    D    SG +  G+Q    +  T    
Sbjct: 246 -GLMGLGRSELSMIS--QTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAY 302

Query: 298 ------LASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGSSFTFLPKEVYETIA 348
                 L  +  YI  + G++   + S  ++ +SF     I+DSG+  + L   VY+ + 
Sbjct: 303 TRMLPNLQLSNFYILNLTGIDVGGV-SLHVQASSFGNGGVILDSGTVISRLAPSVYKALK 361

Query: 349 AEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
           A+F  Q       F G+P          C+  +      +P++ + F  N    V+
Sbjct: 362 AKFLEQ-------FSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVD 410


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 124/322 (38%), Gaps = 70/322 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I +GTP  +  + +D GS+L W+ C+    A +   ++N          P+ SS+   +S
Sbjct: 70  ITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPFFN----------PNISSSYTPIS 119

Query: 167 CSHRLCDLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CS   C   T       SC +    C  T+  Y + +SS G L  D             +
Sbjct: 120 CSSPTCTTRTRDFPIPASCDS-NNLCHATLS-YADASSSEGNLASDTF--------GFGS 169

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           S    ++ GC    +  Y      D    GL+G+ LG +S+ S L         FS C  
Sbjct: 170 SFNPGIVFGC---MNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP-----KFSYCIS 221

Query: 276 KDD-SGRIFFGDQG----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 324
             D SG +  G+            P  Q ST     +     Y + +E   I    L  +
Sbjct: 222 GSDFSGILLLGESNFSWGGSLNYTPLVQISTPLPYFDRS--AYTVRLEGIKISDKLLNIS 279

Query: 325 ----------SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG------YPWK 368
                     + + + D G+ F++L   VY  +  EF  Q N T+ + +           
Sbjct: 280 GNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMD 339

Query: 369 CCYK--SSSQRLPKLPSVKLMF 388
            CY+   +   LP+LPSV L+F
Sbjct: 340 LCYRVPVNQSELPELPSVSLVF 361


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 125/320 (39%), Gaps = 69/320 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP  +  + LD GS+L W+ C   +  P   S +N          P AS T   + 
Sbjct: 71  LTIGTPPQNITMVLDTGSELSWLRC---KKEPNFTSIFN----------PLASKTYTKIP 117

Query: 167 CSHRLCDLGTS-------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CS + C   TS       C +P + C + + Y   ++    L  E              +
Sbjct: 118 CSSQTCKTRTSDLTLPVTC-DPAKLCHFIISYADASSVEGHLAFETF---------RFGS 167

Query: 220 SVQASVIIGC-GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-D 277
             + + + GC     S    +     GL+G+  G +   S + + G     FS C    D
Sbjct: 168 LTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSL---SFVNQMGF--RKFSYCISGLD 222

Query: 278 DSGRIFFGDQ----------GPATQQSTSFLASNGKYITYIIGVETCCIGSSC--LKQTS 325
            +G +  G+            P  Q ST     +   + Y + +E   + +    L ++ 
Sbjct: 223 STGFLLLGEARYSWLKPLNYTPLVQISTPLPYFD--RVAYSVQLEGIKVNNKVLPLPKSV 280

Query: 326 F--------KAIVDSGSSFTFLPKEVYETIAAEF-------DRQVNDTITSFEGYPWKCC 370
           F        + +VDSG+ FTFL   VY  +  EF        R +N+    F+G     C
Sbjct: 281 FVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQG-AMDLC 339

Query: 371 Y--KSSSQRLPKLPSVKLMF 388
           Y   S+S  LP LP VKLMF
Sbjct: 340 YLIDSTSSTLPNLPVVKLMF 359


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 135/302 (44%), Gaps = 42/302 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +++ + IG+P     + +D GSD+ W     V+CAP  A  Y   D     + PS SS+ 
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNW-----VQCAPC-ADCYQQADP---IFEPSFSSSY 205

Query: 163 KHLSC-SHRLCDLGTS-CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
             L+C +H+   L  S C+N    C Y +  Y + + + G    + + L   G  +L N 
Sbjct: 206 APLTCETHQCKSLDVSECRN--DSCLYEVS-YGDGSYTVGDFATETITL--DGSASLNN- 259

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKD 277
               V IGCG    G +   V   GL+GLG G +S PS +  +     SFS C    D D
Sbjct: 260 ----VAIGCGHDNEGLF---VGAAGLLGLGGGSLSFPSQINAS-----SFSYCLVNRDTD 307

Query: 278 DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA------- 328
            +  + F    P+   +   L +N     Y +G+    +G   L   ++SF+        
Sbjct: 308 SASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGG 367

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLM 387
            IVDSG++ T L  +VY ++   F R      ++     +  CY  SS+   ++P+V   
Sbjct: 368 IIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFH 427

Query: 388 FP 389
           FP
Sbjct: 428 FP 429


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 66/258 (25%), Positives = 101/258 (39%), Gaps = 40/258 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           IGTP      A+D GS+++WIPC +C  C   S+S +N          P ASST +   C
Sbjct: 104 IGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFN----------PLASSTYQDAPC 153

Query: 168 SHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVII 227
               C+  +S       C Y+ D   +    +G +  D + L S             V  
Sbjct: 154 DSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFVCG 213

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC---FDKDDSGRIFF 284
               K   G        G+IGLG G +S+ S L    L    FS C   +      +I F
Sbjct: 214 NSIYKTFAGV-------GVIGLGRGALSLTSKLYH--LSDGKFSYCLADYYSKQPSKINF 264

Query: 285 GDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCLKQTSF-----------KAI 329
           G Q   +       ++ L  +     Y + +E   +G    +Q  +             +
Sbjct: 265 GLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEK--RQDLYYVDDPFAPPVGNML 322

Query: 330 VDSGSSFTFLPKEVYETI 347
           +DSG+ FT LPK+ Y+ +
Sbjct: 323 IDSGTMFTLLPKDFYDYL 340


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 84/336 (25%), Positives = 123/336 (36%), Gaps = 80/336 (23%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPS-----ASS 160
           + IGTP       +D GSDL+W+ CD C  C             DL+ +  +     ASS
Sbjct: 9   LSIGTPPQLIPAMIDTGSDLVWLKCDNCDHC-------------DLDHHGETIFFSDASS 55

Query: 161 TSKHLSCSHRLCD------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
           + K L C+   C       +G  C+   + C Y  +Y  + + +SG +  D +   S G 
Sbjct: 56  SYKKLPCNSTHCSGMSSAGIGPRCE---ETCKYKYEY-GDGSRTSGDVGSDRISFRSHGA 111

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
                S     + GCG K  G   D     GLIGLG    S+   L     +   FS C 
Sbjct: 112 GEDHRSFFDGFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCL 166

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF-------- 326
              DS         P + +S  FL S+     + + V T  +    L QT +        
Sbjct: 167 VSYDS---------PPSAKSFLFLGSSAALRGHDV-VSTPILHGDHLDQTLYYVDLQSIT 216

Query: 327 -------------------------KAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTIT 360
                                    K ++DSG+++T L   VYE +    + QV   T+ 
Sbjct: 217 VGGVPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLG 276

Query: 361 SFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
           +  G     C+ SS       PSV   F      V+
Sbjct: 277 NSAG--LDLCFNSSGDTSYGFPSVTFYFANQVQLVL 310


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 145/350 (41%), Gaps = 65/350 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP+   L+ LD GSD++W+ C  C RC           D+    + P  SS+
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRC----------YDQSGPVFDPRRSSS 189

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
              + C+  LC   D G  C   ++ C Y +  Y + + ++G    + L    G      
Sbjct: 190 YGAVDCAAPLCRRLDSG-GCDLRRRACLYQV-AYGDGSVTAGDFATETLTFAGG------ 241

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
            +  A V +GCG    G +   VA  GL+GLG G +S P+ +++      SFS C  D+ 
Sbjct: 242 -ARVARVALGCGHDNEGLF---VAAAGLLGLGRGSLSFPTQISR--RYGKSFSYCLVDRT 295

Query: 278 DSGRIFFGDQ--------GPATQQSTSF--LASNGK----YITYIIGVETCCIGSSCLKQ 323
            S       +        GP +  + SF  +  N +    Y   ++G+         + +
Sbjct: 296 SSSSSGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAE 355

Query: 324 TSFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKS 373
           +  +          IVDSG+S T L +  Y  +   F         S  G+  +  CY  
Sbjct: 356 SDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDL 415

Query: 374 SSQRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
             +++ K+P+V + F         P+N    V++     F   GT  GVS
Sbjct: 416 GGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS 465


>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 131/324 (40%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC M   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPGFSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKS 159

Query: 280 GRIFF---------GDQGPATQQSTSFLASNGK-----YITYI-IGVETCCIGSSCLKQT 324
            R FF         G     T    + + +  K     ++  I I V+   +G S    +
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFS 219

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
               + DSGS  +++P      ++    R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDAARFDLGSHGVFVERSVQ 302


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 134/303 (44%), Gaps = 45/303 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +++ + IG P     + LD GSD+ W     V+CAP  A  Y   D     + P++S++ 
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNW-----VQCAPC-ADCYQQADP---IFEPASSASF 199

Query: 163 KHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             LSC+ R C   D+ + C+N    C Y + Y   + +    + E I          L +
Sbjct: 200 STLSCNTRQCRSLDV-SECRN--DTCLYEVSYGDGSYTVGDFVTETI---------TLGS 247

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DK 276
           +   +V IGCG    G +   V   GL+GLG G +S PS +        SFS C    D 
Sbjct: 248 APVDNVAIGCGHNNEGLF---VGAAGLLGLGGGSLSFPSQINAT-----SFSYCLVDRDS 299

Query: 277 DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFK------- 327
           + +  + F    P    S   L ++     Y +G+    +G     + +++F+       
Sbjct: 300 ESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNG 359

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             IVDSG++ T L  +VY ++   F ++  D  ++     +  CY  SS+   ++P+V  
Sbjct: 360 GVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSF 419

Query: 387 MFP 389
            FP
Sbjct: 420 HFP 422


>gi|115463625|ref|NP_001055412.1| Os05g0384300 [Oryza sativa Japonica Group]
 gi|50511407|gb|AAT77330.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578963|dbj|BAF17326.1| Os05g0384300 [Oryza sativa Japonica Group]
 gi|222631434|gb|EEE63566.1| hypothetical protein OsJ_18383 [Oryza sativa Japonica Group]
          Length = 477

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 124/329 (37%), Gaps = 61/329 (18%)

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCV---RCAPLSA 141
           PSQ   T       G  +   + +GTP      A D  S  +W+PC +CV    C     
Sbjct: 76  PSQAPATT------GGTYLITVGVGTPPQYVYGAFDISSQFVWVPCEECVSPYSCPSDKT 129

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQNPKQ-PCPYTMDYYTENTS 197
             Y +L R+L              SC  + C        C  P   PC YT  Y     +
Sbjct: 130 GVYKTLPREL-------------YSCGEQRCRTIVGQPDCGAPYNGPCKYTCRYGGAGGT 176

Query: 198 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
            +   +   L   + GDN +      ++I GCG++    +       G+IGL  G +   
Sbjct: 177 ETEGHLG--LQPFTLGDNTMP----VNMIFGCGLEPETNF-------GVIGLNRGRL--- 220

Query: 258 SLLAKAGLIRNSFSMCFDKDDSGR-----IFFGDQG-PATQ--QSTSFLA-SNGKY-ITY 307
           SL+++  L R S+    + DD+       I FG+   P T   + T F +  NG Y   Y
Sbjct: 221 SLISQLQLGRFSYYFAPEYDDTAAGNASFILFGEYAVPQTSNPRYTQFWSYENGAYSYLY 280

Query: 308 IIGVETCCIGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
           ++G+    +GS+ L         +    A + +    TFL K  Y+ +  E    V    
Sbjct: 281 LVGLSGMRVGSNNLNMLGAGSGGRDPLVAYLSTSVPITFLEKNAYDLLRRELVSTVGSDT 340

Query: 360 TSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
                     CY S      K P++ L+F
Sbjct: 341 VDGSALGLDLCYTSQYLAKAKFPAMALVF 369


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 75/255 (29%), Positives = 107/255 (41%), Gaps = 54/255 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           +  GTP     + LD GS + W  C  CV C   S  Y++S          SASST    
Sbjct: 132 VAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDS----------SASSTYSFG 181

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           SC      + ++ +N      Y M  Y ++++S G    D + L         + V    
Sbjct: 182 SC------IPSTVEN-----NYNMT-YGDDSTSVGNYGCDTMTL-------EPSDVFQKF 222

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFF 284
             GCG    G +  GV  DG++GLG G++S  S  A        FS C  ++DS G + F
Sbjct: 223 QFGCGRNNKGDFGSGV--DGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLF 278

Query: 285 GDQGPATQQSTSF-----------LASNGKYITYI----IGVETCCIGSSCLKQTSFKAI 329
           G++  AT QS+S            L  +G Y   +    +G E   I SS     S   I
Sbjct: 279 GEK--ATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF--ASPGTI 334

Query: 330 VDSGSSFTFLPKEVY 344
           +DS +  T LP+  Y
Sbjct: 335 IDSRTVITRLPQRAY 349


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 63/328 (19%)

Query: 120 LDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN--EYSPSASSTSKHLSCSHRLCDLGTS 177
           +D GSDL+W PC    C      Y  +    L+    + SAS + K  +CS     L +S
Sbjct: 91  MDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSAAHTSLSSS 150

Query: 178 CQNPKQPCPYTMDYYTENTSSS--------------GLLVEDILHLISGGDNALKNSVQA 223
                  CP  +   ++ +S S                L  D L + +     L N    
Sbjct: 151 DLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRDSLSMPASSPLVLHN---- 206

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC-----FDKD 277
               GC     G       P G+ G G G +S+P+ LA  +  + N FS C     FD D
Sbjct: 207 -FTFGCAHTALG------EPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDAD 259

Query: 278 DSGR---IFFGDQGPATQQSTSFLASNGKYIT------------YIIGVETCCIGS---- 318
              R   +  G      ++        G+++             Y +G+E   +G+    
Sbjct: 260 RVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKIP 319

Query: 319 --SCLKQTSFKA----IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC--- 369
               LK+   +     +VDSG++FT LP  +YE++  EF+ ++            +    
Sbjct: 320 VPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLG 379

Query: 370 -CYKSSSQRLPKLPSVKLMFPQNNSFVV 396
            CY S      K+P+V L F  N++ ++
Sbjct: 380 PCYYSDDS-AAKVPAVALHFVGNSTVIL 406


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 132/328 (40%), Gaps = 65/328 (19%)

Query: 94  SLGNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDL 151
           +LG+    L Y   + +G+P ++  V +D GSD+ W+ C+ C   +P  A +  +L    
Sbjct: 125 TLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHA-HAGAL---- 179

Query: 152 NEYSPSASSTSKHLSCSHRLC-DLGTSCQ----NPKQPCPYTMDYYTENTSSSGLLVEDI 206
             + P+ASST    +CS   C  LG S +    + K  C Y +  Y + ++++G    D+
Sbjct: 180 --FDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVK-YGDGSNTTGTYSSDV 236

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
           L L SG D      V      GC   + G  +D    DGLIGLG    S+ S  A     
Sbjct: 237 LTL-SGSD------VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSLVSQTAA--RY 286

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFL----------ASNGKYIT---------- 306
             SFS C               PAT  S+ FL              ++ T          
Sbjct: 287 GKSFSYCL--------------PATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVP 332

Query: 307 --YIIGVETCCIGSS--CLKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT 360
             Y   +E   +G     L  + F A  +VDSG+  T LP   Y  +++ F   +     
Sbjct: 333 TYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYAR 392

Query: 361 SFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           +        C+  +      +P+V L+F
Sbjct: 393 AEPLGILDTCFNFTGLDKVSIPTVALVF 420


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 101/221 (45%), Gaps = 23/221 (10%)

Query: 67  SSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDL 126
           +S + ++ + T   FQ   P+  S+ +S  +     ++  + +GTP   F + +D GSDL
Sbjct: 23  NSTLPRESLATIQDFQGEDPALFSRLVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDL 82

Query: 127 LWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQ-N 180
            WI C+     P + +  NS       Y  S+SS+ + + C+   C      +G+SC   
Sbjct: 83  TWIQCN----PPNTTA--NSSSPPAPWYDKSSSSSYREIPCTDDECQFLPAPIGSSCSIT 136

Query: 181 PKQPCPYTMDYYTENTSSSGLLVEDILHLIS-------GGDNALKNSVQASVIIGCGMKQ 233
              PC YT   Y++ + ++G+L  + + + S        G++  +     +V +GC  + 
Sbjct: 137 SPSPCDYTYG-YSDQSRTTGILAYETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRES 195

Query: 234 SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            G    G +  G++GLG G IS+ +      L    FS C 
Sbjct: 196 VGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIFSYCL 233


>gi|125552158|gb|EAY97867.1| hypothetical protein OsI_19787 [Oryza sativa Indica Group]
          Length = 477

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 124/329 (37%), Gaps = 61/329 (18%)

Query: 86  PSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCV---RCAPLSA 141
           PSQ   T       G  +   + +GTP      A D  S  +W+PC +CV    C     
Sbjct: 76  PSQAPATT------GGTYLITVGVGTPPQYVYGAFDISSQFVWVPCEECVSPYSCPSDKT 129

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL---GTSCQNPKQ-PCPYTMDYYTENTS 197
             Y +L R+L              SC  + C        C  P   PC YT  Y     +
Sbjct: 130 GVYKTLPREL-------------YSCGEQRCRTIVGQPDCGAPYNGPCKYTCRYGGAGGT 176

Query: 198 SSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP 257
            +   +   L   + GDN +      ++I GCG++    +       G+IGL  G +   
Sbjct: 177 ETEGHLG--LQPFTLGDNTMP----VNMIFGCGLEPETNF-------GVIGLNRGRL--- 220

Query: 258 SLLAKAGLIRNSFSMCFDKDDSGR-----IFFGDQG-PATQ--QSTSFLA-SNGKY-ITY 307
           SL+++  L R S+    + DD+       I FG+   P T   + T F +  NG Y   Y
Sbjct: 221 SLISQLQLGRFSYYFAPEYDDTAAGNASFILFGEYAVPQTSNPRYTQFWSYENGAYSYLY 280

Query: 308 IIGVETCCIGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
           ++G+    +GS+ L         +    A + +    TFL K  Y+ +  E    V    
Sbjct: 281 LVGLSGMRVGSNNLNMLGAGSGGRDPLVAYLSTSVPVTFLEKNAYDLLRRELVSTVGSDT 340

Query: 360 TSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
                     CY S      K P++ L+F
Sbjct: 341 VDGSALGLDLCYTSQYLAKAKFPAMALVF 369


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 158/379 (41%), Gaps = 52/379 (13%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQ---FQM 83
           FST L H   ++ +   ++    A+  P+++          + ++KQK   G       +
Sbjct: 66  FSTVLTH---DDARVAHLASRLAASDPPSRRP---------TSLRKQKKAAGGASGGHHL 113

Query: 84  LFPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
              S  S  +S G   G  +Y T + +GTP+ S+ + +D GS L W+     +C+P   S
Sbjct: 114 DDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL-----QCSPCVVS 168

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNP-----KQPCPYTMDYYTENT 196
            +  +      + P ASST   + CS   CD L  +  NP        C Y    Y +++
Sbjct: 169 CHRQVG---PLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQAS-YGDSS 224

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G L  D +        +  ++   S   GCG    G +       GLIGL   ++S+
Sbjct: 225 FSVGYLSTDTV--------SFGSTSYPSFYYGCGQDNEGLFGRSA---GLIGLARNKLSL 273

Query: 257 PSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETC 314
              LA +  +  SFS C     S G +  G        S + +AS+    + Y I +   
Sbjct: 274 LYQLAPS--LGYSFSYCLPTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGM 331

Query: 315 CIGSSCL-----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
            +G S L     + +S   I+DSG+  T LP  V+  ++    + +     +        
Sbjct: 332 SVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT 391

Query: 370 CYKSSSQRLPKLPSVKLMF 388
           C++  + +L ++P+V + F
Sbjct: 392 CFEGQASQL-RVPTVVMAF 409


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 136/308 (44%), Gaps = 45/308 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +++ + IG P     + LD GSD+ W     V+CAP +  Y    ++    + P++S++ 
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSW-----VQCAPCAECY----EQTDPIFEPTSSASF 201

Query: 163 KHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             LSC    C   D+ + C+N    C Y + Y  + + + G  V + + L   G  +L N
Sbjct: 202 TSLSCETEQCKSLDV-SECRNGT--CLYEVSY-GDGSYTVGDFVTETVTL---GSTSLGN 254

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD 278
                + IGCG    G ++       L+GLG G +S PS L  +     SFS C  D+D 
Sbjct: 255 -----IAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYCLVDRDS 301

Query: 279 SGRIFFGDQGPATQQS-TSFLASNGKYITYI-IGVETCCIGSSCLK--QTSFKA------ 328
                     P T  + T+ L  N    T+  +G+    +G + L   +TSF+       
Sbjct: 302 DSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNG 361

Query: 329 --IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             IVDSG++ T L   VY  +   F +  +D  T+     +  CY  SS+   ++P+V  
Sbjct: 362 GIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSF 421

Query: 387 MFPQNNSF 394
            F   N  
Sbjct: 422 HFANGNEL 429


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 77/321 (23%), Positives = 129/321 (40%), Gaps = 55/321 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP   F V +D GSDL W     V+C+P    Y     ++   + P+ S++   L+
Sbjct: 17  VRLGTPERVFSVIVDTGSDLTW-----VQCSPCGKCY----SQNDALFLPNTSTSFTKLA 67

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C   LC+        +  C Y    Y + + ++G  V D + +   G N  K  V  +  
Sbjct: 68  CGSALCNGLPFPMCNQTTCVYWYS-YGDGSLTTGDFVYDTITM--DGINGQKQQV-PNFA 123

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGR 281
            GCG    G +      DG++GLG G +S  S L    +    FS C          +  
Sbjct: 124 FGCGHDNEGSF---AGADGILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSP 178

Query: 282 IFFGDQGPATQQSTSFLA--SNGKYIT-YIIGVETCCIGSSCLKQTS----------FKA 328
           + FGD          +L   +N K  T Y + +    +G + L  +S             
Sbjct: 179 LLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGT 238

Query: 329 IVDSGSSFTFLPKEVYETIAA-------EFDRQVNDTITSFEGYPWKCCYKSSSQRLPKL 381
           I DSG++ T L +  Y+ + A        + R+++D I+  +     C       +LP +
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDD-ISRLD----LCLSGFPKDQLPTV 293

Query: 382 PSVKLMF-------PQNNSFV 395
           P++   F       P +N F+
Sbjct: 294 PAMTFHFEGGDMVLPPSNYFI 314


>gi|222640101|gb|EEE68233.1| hypothetical protein OsJ_26421 [Oryza sativa Japonica Group]
          Length = 439

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 124/324 (38%), Gaps = 61/324 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAP----LSASYYNSLDRDLNEYSPSASS 160
           I +G+   + LV+ D   +++W+ C   C  C P     S +YYN+          S S 
Sbjct: 66  IAVGSLGKTRLVSFDTAVNMVWLQCSDYCRDCNPSQVGTSTTYYNA----------SMSI 115

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMD----YYTENTSSSGLLVEDIL--HLISGGD 214
           +   LSC H LC  G +  + +Q     MD    +  ++  ++G  V+ IL    IS  D
Sbjct: 116 SYNPLSCDHPLCGAGDN--HDQQVLAECMDGTCTFKVDSLDNNGGWVQGILGSDRISISD 173

Query: 215 NALKNSVQASVIIGCGMKQSGGY-LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
           +        ++I GC       Y LD     G++GLGLG+ S+P  ++        FS C
Sbjct: 174 HFFF-LFDTNIIFGCATVDHSKYTLDQYGSSGVVGLGLGKYSLPQQISVT-----RFSYC 227

Query: 274 FDKDDSGRIF------FGDQGPATQQSTSFLASNGKYITYIIGVETCCI-----GSSC-- 320
                   +F      FG         T FL    KY   + G+    +     GS+   
Sbjct: 228 LPSWVKNELFSPPYVLFGSNAVLQGDMTPFLPGFPKYYLKLEGISYGIVRLDIFGSNAAA 287

Query: 321 ---------------LKQTSFKAI-VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
                          L    F A+ V+S +    LP   YE +  EF+ Q N  +     
Sbjct: 288 ADQYHQQAQFCRGPYLPDAQFYAMSVESATFPLMLPSRAYELLEKEFE-QDNPLLIKSRL 346

Query: 365 YPWKCCYKSSSQRLPKLPSVKLMF 388
            P   CYK S   +    ++ L F
Sbjct: 347 QPMNTCYKGSVDDIADNATITLHF 370


>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
 gi|255630909|gb|ACU15817.1| unknown [Glycine max]
          Length = 244

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 66/142 (46%), Gaps = 6/142 (4%)

Query: 272 MCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVD 331
           MCF  D +GRI FGD G   Q+ T F      + TY I +    +  S +    F AI D
Sbjct: 1   MCFGPDGAGRITFGDTGSPDQRKTPFNVRK-LHPTYNITITQIVVEDS-VADLEFHAIFD 58

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFE----GYPWKCCYKSSSQRLPKLPSVKLM 387
           SG+SFT++    Y  +   ++ +V     S +      P++ CY  S  +  ++P + L 
Sbjct: 59  SGTSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQTIEVPFLNLT 118

Query: 388 FPQNNSFVVNNPVFVIYGTQVG 409
               + + V +P+  ++  + G
Sbjct: 119 MKGGDDYYVMDPIVQVFSEEEG 140


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 93/383 (24%), Positives = 160/383 (41%), Gaps = 60/383 (15%)

Query: 27  FSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQ---FQM 83
           FST L H   ++ +   ++    A+  P+++          + ++KQK   G       +
Sbjct: 66  FSTVLTH---DDARVAHLASRLAASDPPSRRP---------TSLRKQKKAAGGASGGHHL 113

Query: 84  LFPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSAS 142
              S  S  +S G   G  +Y T + +GTP+ S+ + +D GS L W+     +C+P   S
Sbjct: 114 DDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL-----QCSPCVVS 168

Query: 143 YYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-LGTSCQNP-----KQPCPYTMDYYTENT 196
            +  +      + P ASST   + CS   CD L  +  NP        C Y    Y +++
Sbjct: 169 CHRQVG---PLFDPRASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQAS-YGDSS 224

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G L  D +        +  ++   S   GCG    G +       GLIGL   ++S+
Sbjct: 225 FSVGSLSTDTV--------SFGSTRYPSFYYGCGQDNEGLFGRSA---GLIGLARNKLSL 273

Query: 257 PSLLAKAGLIRNSFSMCFDKDDS-GRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETC 314
              LA +  +  SFS C     S G +  G        S + +AS+    + Y I +   
Sbjct: 274 LYQLAPS--LGYSFSYCLPTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGM 331

Query: 315 CIGSSCL-----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP--- 366
            +G S L     + +S   I+DSG+  T LP  V+  ++    + V   +   +  P   
Sbjct: 332 SVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALS----KAVAQAMAGAQRAPAFS 387

Query: 367 -WKCCYKSSSQRLPKLPSVKLMF 388
               C++  + +L ++P+V + F
Sbjct: 388 ILDTCFEGQASQL-RVPTVAMAF 409


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 93/354 (26%), Positives = 145/354 (40%), Gaps = 70/354 (19%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP    L+ LD GSD++W+ C  C RC   S   ++          P  SS+
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFD----------PRRSSS 178

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
              + C   LC   D G  C   +  C Y +  Y + + ++G  V + L    G      
Sbjct: 179 YGAVGCGAALCRRLDSG-GCDLRRGACMYQV-AYGDGSVTAGDFVTETLTFAGG------ 230

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
            +  A V +GCG    G +   VA  GL+GLG G +S P+ +++      SFS C  D+ 
Sbjct: 231 -ARVARVALGCGHDNEGLF---VAAAGLLGLGRGGLSFPTQISR--RYGRSFSYCLVDRT 284

Query: 278 DSGR-----------IFFGDQGPATQQSTSF--LASNGK----YITYIIGVETCCIGSSC 320
            SG            + FG  G     S SF  +  N +    Y   ++G+         
Sbjct: 285 SSGAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPG 343

Query: 321 LKQTSFK---------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT-SFEGYP-WKC 369
           + ++  +          IVDSG+S T L +  Y  +   F       +  S  G+  +  
Sbjct: 344 VAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDT 403

Query: 370 CYKSSSQRLPKLPSVKLMF---------PQNNSFVVNNP---VFVIYGTQVGVS 411
           CY    +R+ K+P+V + F         P+N    V++     F   GT  GVS
Sbjct: 404 CYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS 457


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 130/313 (41%), Gaps = 62/313 (19%)

Query: 113 NVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
           N+S ++  D GS+L W+ C+            +S    +N + P+ SS+   + CS   C
Sbjct: 85  NISMVI--DTGSELSWLRCN-----------RSSNPNPVNNFDPTRSSSYSPIPCSSPTC 131

Query: 173 DLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
              T       SC + K  C  T+  Y + +SS G L  +I H      N+  +S   ++
Sbjct: 132 RTRTRDFLIPASCDSDKL-CHATLS-YADASSSEGNLAAEIFHF----GNSTNDS---NL 182

Query: 226 IIGCGMKQSGG-YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
           I GC    SG    +     GL+G+  G +   S +++ G  + S+ +    D  G +  
Sbjct: 183 IFGCMGSVSGSDPEEDTKTTGLLGMNRGSL---SFISQMGFPKFSYCISGTDDFPGFLLL 239

Query: 285 GDQG----------PATQQSTSF-LASNGKYITYIIGVET----CCIGSSCL---KQTSF 326
           GD            P  + ST         Y   + G++       I  S L      + 
Sbjct: 240 GDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAG 299

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR--- 377
           + +VDSG+ FTFL   VY  + ++F  Q N  +T +E   +        CY+ S  R   
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRT 359

Query: 378 --LPKLPSVKLMF 388
             L +LP+V L+F
Sbjct: 360 GILHRLPTVSLVF 372


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 80/170 (47%), Gaps = 33/170 (19%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  + +GTP+   ++ +D GSDL+W+ C  C RC       ++          P  SST
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFD----------PRRSST 135

Query: 162 SKHLSCSH------RL--CDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
            + + CS       R   CD G +       C Y M  Y + +SS+G L  D L   +  
Sbjct: 136 YRRVPCSSPQCRALRFPGCDSGGAAGG---GCRY-MVAYGDGSSSTGDLATDKLAFAN-- 189

Query: 214 DNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
           D  + N     V +GCG + + G  D  A  GL+G+G G+IS+ + +A A
Sbjct: 190 DTYVNN-----VTLGCG-RDNEGLFDSAA--GLLGVGRGKISISTQVAPA 231


>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
 gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
          Length = 260

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 62/137 (45%), Gaps = 6/137 (4%)

Query: 272 MCFDK--DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAI 329
           MCF    D  GRI FGD+G   Q  T  L +     TY + V    +G   +      A+
Sbjct: 1   MCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS-PTYAVSVTEVSVGGDAVG-VQLLAL 58

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE-GYPWKCCYKSSSQRLPKL-PSVKLM 387
            D+G+SFT L +  Y  I   FD  V D     +   P++ CY  S  +   L P V + 
Sbjct: 59  FDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMT 118

Query: 388 FPQNNSFVVNNPVFVIY 404
           F   +   + NP+F+++
Sbjct: 119 FEGGSQMFLRNPLFIVW 135


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 114/288 (39%), Gaps = 65/288 (22%)

Query: 120 LDAGSDLLWIPC---DCVRCA--PLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR---- 170
           LD GSDL+W PC   +C+ C     +AS  ++    L++ +   S  S   S  H     
Sbjct: 97  LDTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTATPVSCKSSACSAVHSNLPS 156

Query: 171 --LCDLG---------TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             LC +          + C+  K  CP     Y + +  + L  + I   +S   N + N
Sbjct: 157 SDLCAISNCPLESIEISDCR--KHSCPQFYYAYGDGSLIARLYRDSIRLPLSNQTNLIFN 214

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC----- 273
           +       GC       +     P G+ G G G +S+P+ LA  +  + N FS C     
Sbjct: 215 NF----TFGCA------HTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHS 264

Query: 274 ----------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
                           +D D+  R   G + P+    TS L +      Y +G+E   IG
Sbjct: 265 FDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVY-TSMLDNPRHPYFYCVGLEGISIG 323

Query: 318 SSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
              +    F            +VDSG++FT LP  +Y+ + AEF+ +V
Sbjct: 324 RKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRV 371


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 80/302 (26%), Positives = 129/302 (42%), Gaps = 43/302 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +++ + +G+P     + LD GSD+ W+ C  C  C       Y   D     + PS S++
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSTS 216

Query: 162 SKHLSCSHRLC-DL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
              ++C +  C DL   +C+N    C Y +  Y + + + G    + L L   GD+A  +
Sbjct: 217 YASVACDNPRCHDLDAAACRNSTGACLYEV-AYGDGSYTVGDFATETLTL---GDSAPVS 272

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD- 277
           SV     IGCG    G +   V   GL+ LG G +S PS ++       +FS C  D+D 
Sbjct: 273 SVA----IGCGHDNEGLF---VGAAGLLALGGGPLSFPSQISA-----TTFSYCLVDRDS 320

Query: 278 -DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK------- 327
             S  + FGD   A + +   + S      Y +G+    +G   L    ++F        
Sbjct: 321 PSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAG 379

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             IVDSG++ T L    Y  +   F R       +     +  CY  S +   ++P+V L
Sbjct: 380 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 439

Query: 387 MF 388
            F
Sbjct: 440 RF 441


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 141/373 (37%), Gaps = 67/373 (17%)

Query: 71  QKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIP 130
           Q QK     + Q+  P      +S G+D+  L +T       +VS    LD GSDL+W P
Sbjct: 60  QHQKRHLRNRHQVSLP------LSPGSDY-TLSFTLNSNPPQHVSLY--LDTGSDLVWFP 110

Query: 131 CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC-----DLGTSCQNPKQPC 185
           C    C        N+     +   P  SST++ + C    C     +L TS       C
Sbjct: 111 CKPFECILCEGKAENT---TASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCAIADC 167

Query: 186 PY----TMDYYTENTSS------SGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 235
           P     T D ++ +  S       G LV  + H       A  +    +   GC      
Sbjct: 168 PLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATPSLSLHNFTFGCA----- 222

Query: 236 GYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC-----FDKDD---SGRIFFGD 286
            +     P G+ G G G +S+P+ LA  A  + N FS C     F+ D       +  G 
Sbjct: 223 -HTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGH 281

Query: 287 QGPATQQS---------TSFLASNGKYITYIIGVETCCIGSSCLKQTSF----------K 327
                ++          TS L +      Y +G+E   IG   +    F           
Sbjct: 282 SDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGG 341

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC----CYKSSSQRLPKLPS 383
            +VDSG++FT LP  +Y ++ AEFD +V       +    K     CY   +  +  +PS
Sbjct: 342 VVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCYYYDT--VVNIPS 399

Query: 384 VKLMFPQNNSFVV 396
           + L F  N S VV
Sbjct: 400 LVLHFVGNESSVV 412


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 112/276 (40%), Gaps = 58/276 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + +GTP + +   +D GSDL+W  C  CV C           ++    + P+ASST   L
Sbjct: 120 LSVGTPALPYAAIVDTGSDLVWTQCKPCVEC----------FNQTTPVFDPAASSTYAAL 169

Query: 166 SCSHRLC-DLGTSCQNPKQPCP-------YTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
            CS  LC DL TS                YT   Y + +S+ G+L  +           L
Sbjct: 170 PCSSALCADLPTSTCASSSSSSSASSPCGYTYT-YGDASSTQGVLATETF--------TL 220

Query: 218 KNSVQASVIIGCGMKQSG-GYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                  V  GCG    G G+  G    GL+GLG G +   SL+++ G+ R S+ +    
Sbjct: 221 ARQKVPGVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGIDRFSYCLTSLD 274

Query: 277 DDSGR-----------IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQ 323
           D +GR                  PA  Q+T  + +  +   Y + +    +GS+   L  
Sbjct: 275 DAAGRSPLLLGSAAGISASAATAPA--QTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPS 332

Query: 324 TSFK--------AIVDSGSSFTFLPKEVYETIAAEF 351
           ++F          IVDSG+S T+L    Y  +   F
Sbjct: 333 SAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAF 368


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 87/335 (25%), Positives = 135/335 (40%), Gaps = 65/335 (19%)

Query: 87  SQGSKTMSLGNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYY 144
           S+ S   +LG+    L Y   + +G+P V+  V +D GSD+ W+ C+ C   +P  A + 
Sbjct: 91  SKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHA-HA 149

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQ----NPKQPCPYTMDYYTENTSSS 199
            +L      + P+ASST    +CS   C  LG S +    + K  C Y +  Y + ++++
Sbjct: 150 GAL------FDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVK-YGDGSNTT 202

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G    D+L L SG D      V      GC   + G  +D    DGLIGLG G+   P +
Sbjct: 203 GTYSSDVLTL-SGSD------VVRGFQFGCSHAELGAGMDDKT-DGLIGLG-GDAQSP-V 252

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSFL----------ASNGKYIT--- 306
              A     SF  C               PAT  S+ FL              ++ T   
Sbjct: 253 SQTAARYGKSFFYCL--------------PATPASSGFLTLGAPASGGGGGASRFATTPM 298

Query: 307 ---------YIIGVETCCIGSS--CLKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDR 353
                    Y   +E   +G     L  + F A  +VDSG+  T LP   Y  +++ F  
Sbjct: 299 LRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 358

Query: 354 QVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
            +     +        C+  +      +P+V L+F
Sbjct: 359 GMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVF 393


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 121/322 (37%), Gaps = 66/322 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRD------LNEYSPSASS 160
           +D+ TP V  L   D GS L+W+ C        ++S Y  L  D      L + +   ++
Sbjct: 80  LDVSTPPVRMLALADTGSSLVWLKCKLPAAHTPASSSYARLPCDAFACKALGDAASCRAT 139

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
            S +  C +R      SC       P T+D +T +T                        
Sbjct: 140 GSGNNICVYRYAFADGSCTA----GPVTVDAFTFSTR----------------------- 172

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----D 275
               +  GC  +  G     V  DGL+GL  G IS+ S L+      + FS C       
Sbjct: 173 ----LDFGCATRTEG---LSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSS 225

Query: 276 KDDSGRIFFGDQGPATQQ----STSFLASNGKYITYIIGVETCCIGSSC--LKQTSFKAI 329
           +  S  + FG     +      +T  +A   K   Y I +++  +      L+ T+ K I
Sbjct: 226 ETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSF-YTIALDSIKVAGKPVPLQTTTTKLI 284

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK-----LPSV 384
           VDSG+  T+LPK V + + A     +           +  CY    +R P+     +P V
Sbjct: 285 VDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCY-DVRRRAPEDVGKSIPDV 343

Query: 385 KLM--------FPQNNSFVVNN 398
            L+         P  N+FVV N
Sbjct: 344 TLVLGGGGEVRLPWGNTFVVEN 365


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 129/308 (41%), Gaps = 44/308 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++  I +GTP     + +D GSD+LW+ C  CV C   S + ++          P  SST
Sbjct: 58  YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFD----------PYKSST 107

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNAL 217
              L CS R C   D+GT CQ  K  C Y +DY   + ++     +D+ L+  SG    +
Sbjct: 108 YSTLGCSTRQCLNLDIGT-CQANK--CLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVV 164

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF--- 274
            N +     +GCG    G +   V   GL+GLG G +S P+ +      R  FS C    
Sbjct: 165 LNKIP----LGCGHDNEGYF---VGAAGLLGLGKGPLSFPNQVDPQNGGR--FSYCLTDR 215

Query: 275 --DKDDSGRIFFGDQG--PA----TQQSTSFLASNGKYITYI---IGVETCCIGSSCLKQ 323
             D  +   + FG+    PA    T Q ++       Y+      +G     I +S  + 
Sbjct: 216 ETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQL 275

Query: 324 TSF---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPK 380
            S      I+DSG+S T L    Y ++   F    +D   +     +  CY  S      
Sbjct: 276 DSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVD 335

Query: 381 LPSVKLMF 388
           +P+V L F
Sbjct: 336 VPTVTLHF 343


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 90/403 (22%), Positives = 146/403 (36%), Gaps = 91/403 (22%)

Query: 64  VLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAG 123
             +SS  +++  +T   F M   S G+ T +        ++    +GTP   FL+  D G
Sbjct: 55  AFISSRGRRRAAETASAFAMPL-SSGAYTGT------GQYFVRFRVGTPAQPFLLVADTG 107

Query: 124 SDLLWIPCDCVRC-------------APLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 170
           SDL W+ C                  AP  AS   +       + P  S T   + CS  
Sbjct: 108 SDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT-------FRPDKSRTWAPIPCSSA 160

Query: 171 LCDLG-----TSCQNPKQPCPYTMDY-YTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
            C         +C  P  PC Y  DY Y + +++ G +  D   +   G  A K  ++  
Sbjct: 161 TCRESLPFSLAACATPANPCAY--DYRYKDGSAARGTVGVDSATIALSGRAARKAKLRG- 217

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 279
           V++GC    +G     +A DG++ LG   IS  S    A      FS C       ++ +
Sbjct: 218 VVLGCTTSYNGQSF--LASDGVLSLGYSNISFASR--AASRFGGRFSYCLVDHLAPRNAT 273

Query: 280 GRIFFG----------DQGPAT----------------QQSTSFLASNGKYITYIIGVET 313
             + FG           +G A+                 + T  +  +     Y + V+ 
Sbjct: 274 SYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKG 333

Query: 314 CCIGSSCLK--------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
             +    LK        +    AI+DSG+S T L K  Y  + A   +++   +      
Sbjct: 334 VSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAG-LPRVTMD 392

Query: 366 PWKCCYK----SSSQRLPKLPSVKLMF-------PQNNSFVVN 397
           P+  CY     S S     LP + + F       P   S+V++
Sbjct: 393 PFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVID 435


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 102/439 (23%), Positives = 173/439 (39%), Gaps = 86/439 (19%)

Query: 18  ESSGAETVMFSTK--------LIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSD 69
           +S G E+ + ST         L  R  E+     +S+ +     P K+     + ++++ 
Sbjct: 6   KSEGKESFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERPEKQ----IKTVVATA 61

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
              +   TG   Q++   +   T+  G      ++  + IGTP   + + LD GSDL WI
Sbjct: 62  ASPESYGTGLSGQLMATLESGVTLGSGE-----YFMDVFIGTPPKHYSLILDTGSDLNWI 116

Query: 130 PC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS------CQNPK 182
            C  C  C   +  YY+          P  SS+ +++ C    C L +S      C+   
Sbjct: 117 QCVPCHDCFEQNGPYYD----------PKESSSFRNIGCHDPRCHLVSSPDPPLPCKAEN 166

Query: 183 QPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDG 240
           Q CPY   +Y ++++++G    +   ++L S    +    V+ +V+ GCG   + G   G
Sbjct: 167 QTCPYFY-WYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVE-NVMFGCG-HWNRGLFHG 223

Query: 241 VAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDSGRIFFGD-----QGPA 290
            +    +G G    S  S L    L  +SFS C      D + S ++ FG+       P 
Sbjct: 224 ASGLLGLGRGPLSFS--SQLQS--LYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPE 279

Query: 291 TQQSTSFLASNGKYITY--------IIGVETCCIGSSCLKQTS---FKAIVDSGSSFTFL 339
              +T          T+        ++G E   I  S    TS      IVDSG++ ++ 
Sbjct: 280 LNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYF 339

Query: 340 PKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLM----- 387
            +  Y+ I   F ++V       +GYP          CY  S      LP   ++     
Sbjct: 340 TEPAYQIIKDAFVKKV-------KGYPIVQDFPILDPCYNVSGVEKIDLPDFGILFADGA 392

Query: 388 ---FPQNNSFVVNNPVFVI 403
              FP  N F+  +P  V+
Sbjct: 393 VWNFPVENYFIRLDPEEVV 411


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 80/302 (26%), Positives = 129/302 (42%), Gaps = 43/302 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +++ + +G+P     + LD GSD+ W+ C  C  C       Y   D     + PS S++
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADC-------YQQSD---PVFDPSLSTS 212

Query: 162 SKHLSCSHRLC-DL-GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
              ++C +  C DL   +C+N    C Y +  Y + + + G    + L L   GD+A  +
Sbjct: 213 YASVACDNPRCHDLDAAACRNSTGACLYEV-AYGDGSYTVGDFATETLTL---GDSAPVS 268

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD- 277
           SV     IGCG    G +   V   GL+ LG G +S PS ++       +FS C  D+D 
Sbjct: 269 SVA----IGCGHDNEGLF---VGAAGLLALGGGPLSFPSQISA-----TTFSYCLVDRDS 316

Query: 278 -DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK------- 327
             S  + FGD   A + +   + S      Y +G+    +G   L    ++F        
Sbjct: 317 PSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAG 375

Query: 328 -AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             IVDSG++ T L    Y  +   F R       +     +  CY  S +   ++P+V L
Sbjct: 376 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 435

Query: 387 MF 388
            F
Sbjct: 436 RF 437


>gi|328865865|gb|EGG14251.1| hypothetical protein DFA_12021 [Dictyostelium fasciculatum]
          Length = 698

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 81/305 (26%), Positives = 126/305 (41%), Gaps = 49/305 (16%)

Query: 116 FLVALDAGSDLLWIPCDCVRCAPLSASYYN-----------SLDRDLNEYSPSASSTSKH 164
           F+V +D GS  L IP D       +  +YN           +LD DL +   SA +    
Sbjct: 121 FMVQVDTGSTALAIPGD-------NCYFYNQRKTKCKCDQGALD-DLYQQGSSAET---- 168

Query: 165 LSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSV 221
           LSC    C  G S   P    P T  +   Y + +   G LV D + +      A+  ++
Sbjct: 169 LSCRSSQCKRGCSFITPYASHPSTCGFKISYQDGSFIGGDLVTDYVTVAGLTVKAIFGNM 228

Query: 222 QASVIIGCGMKQSGGYLDGVAP----DGLIGLGLGEIS------VPSLLAKAGLIRNSFS 271
           QA  +      QS    D  A     DG++GL    +       + SLL K   I NSFS
Sbjct: 229 QAQSL---NFSQSSCPADPFAAPRKRDGIMGLSYQSLDPNNGDDIFSLLVKTHEIHNSFS 285

Query: 272 MCFDKDDSGRIFFGDQGPATQQSTSFLA--SNGKYITYIIGVETCCIGSSCLKQTSFK-- 327
           MC   D+ G +  G   P    +       +N +Y  Y +      I  + L   SF+  
Sbjct: 286 MCL-SDEGGMLVLGGVDPKMNSTLMKYTPITNERY--YSVNCTGLRIDGNNLNSKSFQSI 342

Query: 328 AIVDSGSSFTFLPKEVYETIAAEFDRQVNDT--ITSFEGYPWKC-CYKSSSQRLPKLPSV 384
           +IVDSG++  FL  +++  +     +  +    IT+     W   C+  S ++L K P++
Sbjct: 343 SIVDSGTTIMFLKLDIFNDLIYYLVQHYSHLPGITTQSESLWNHQCFTLSDRQLEKYPTI 402

Query: 385 KLMFP 389
            ++FP
Sbjct: 403 SMVFP 407


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 118/303 (38%), Gaps = 41/303 (13%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP    L  +D GS + W+ C   RC        +  ++    + PS S T K L CS
Sbjct: 103 VGTPPFEILGVVDTGSGITWMQCQ--RCE-------DCYEQTTPIFDPSKSKTYKTLPCS 153

Query: 169 HRLCD--LGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ-AS 224
             +C   + T SC + K  C YT+ Y   + S   L VE    L  G  N   +SVQ  +
Sbjct: 154 SNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVET---LTLGSTNG--SSVQFPN 208

Query: 225 VIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 279
            +IGCG    G +    +    +G G   +      +  G     FS C        + S
Sbjct: 209 TVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGG----KFSYCLAPMFSQSNSS 264

Query: 280 GRIFFGDQGPAT---QQSTSFLASNGKYITYIIGVETCCIGSSCLK-----------QTS 325
            ++ FGD    +     ST  ++  G  + Y + +E   +G   ++              
Sbjct: 265 SKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGE 324

Query: 326 FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
              I+DSG++ T LP+E Y  + +     +     S        CY+++      +P + 
Sbjct: 325 GNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVIT 384

Query: 386 LMF 388
             F
Sbjct: 385 AHF 387


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 139/339 (41%), Gaps = 54/339 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T + +GTP     + LD GSD++WI C  C +C       Y+  D   N   P+ S +
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKC-------YSQTDPVFN---PTKSRS 196

Query: 162 SKHLSCSHRLCDLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             ++ C   LC    S  C   K  C Y +  Y + + + G    + L          + 
Sbjct: 197 FANIPCGSPLCRRLDSPGCSTKKHICLYQVS-YGDGSFTYGEFSTETL--------TFRG 247

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDD 278
           +    V +GCG    G ++       L+GLG G +S PS + +       FS C  D+  
Sbjct: 248 TRVGRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSA 302

Query: 279 SGR---IFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIGSSCLKQTSFK---- 327
           S +   + FGD   +     + L SN K  T+    ++GV         +  + FK    
Sbjct: 303 SSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDST 362

Query: 328 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
                I+DSG+S T L +  Y  +   F    ++   + E   +  C+  S +   K+P+
Sbjct: 363 GNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPT 422

Query: 384 VKLMF-------PQNNSFV-VNNP---VFVIYGTQVGVS 411
           V L F       P +N  + V+N     F   GT  G+S
Sbjct: 423 VVLHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLS 461


>gi|32482806|gb|AAP84703.1| putative xyloglucanase inhibitor [Solanum tuberosum]
          Length = 437

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 86/346 (24%), Positives = 132/346 (38%), Gaps = 77/346 (22%)

Query: 98  DFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP 156
           D   L Y T I   TP V   + LD G   LW+ CD          Y             
Sbjct: 41  DASTLQYLTQIQQRTPLVPISLTLDLGGQFLWVDCD--------QGY------------- 79

Query: 157 SASSTSKHLSCSHRLCDLGTS-----CQNPKQP------CPYTMDYYTENTSSSGLLVED 205
             SS+ K   C    C LG +     C +P +P      C    D     T++SG L  D
Sbjct: 80  -VSSSYKPARCRSAQCSLGGASGCGECFSPPRPGCNNNTCGLLPDNTVTRTATSGELASD 138

Query: 206 ILHLIS-GGDNALKNSVQASVIIGCGMKQSGGYLDGVAP--DGLIGLGLGEISVPSLLAK 262
           I+ + S  G N  ++    + +  CG       L G+A    G+ GLG   IS+PS  + 
Sbjct: 139 IVSVQSTNGKNPGRSVSDKNFLFVCGATF---LLQGLASGVKGMAGLGRTRISLPSQFSA 195

Query: 263 AGLIRNSFSMCFDKDDS-GRIFFGDQGP--------------------ATQQSTSFLASN 301
                  F++C    +S G + FGD GP                        ST+   S+
Sbjct: 196 EFSFPRKFALCLTSSNSKGVVLFGD-GPYFFLPNREFSNNDFQYTPLFINPVSTASAFSS 254

Query: 302 GKYIT-YIIGVETCCIGSSCLK-QTSFKAIVDSG---------SSFTFLPKEVYETIAAE 350
           G+  + Y IGV++  I    +   T+  +I + G         + +T L   +Y  I   
Sbjct: 255 GQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNF 314

Query: 351 FDRQVNDTITSFEGYPWKCCYKS----SSQRLPKLPSVKLMFPQNN 392
           F +++ +        P+K C+ S    S++  P +PS+ L+    N
Sbjct: 315 FVKELANVTRVAAVAPFKVCFDSRNIGSTRVGPAVPSIDLVLQNEN 360


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 126/309 (40%), Gaps = 45/309 (14%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP     + LD  +D +W+PC    C+  S +  +      + YS  + ST++     
Sbjct: 111 LGTPPQLMFMVLDTSNDAVWLPCS--GCSGCSNASTSFNTNSSSTYSTVSCSTTQCTQAR 168

Query: 169 HRLCDLGTSCQNPKQP--CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
              C   T      QP  C +   Y  +++ S+ L V+D L         L   V  +  
Sbjct: 169 GLTCPSST-----PQPSICSFNQSYGGDSSFSANL-VQDTL--------TLSPDVIPNFS 214

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS----GRI 282
            GC    SG  L    P GL+GLG G +S+ S      L    FS C     S    G +
Sbjct: 215 FGCINSASGNSL---PPQGLMGLGRGPMSLVS--QTTSLYSGVFSYCLPSFRSFYFSGSL 269

Query: 283 FFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCL----------KQTSFKAIVD 331
             G  G P + + T  L +  +   Y + +    +GS  +            +    I+D
Sbjct: 270 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIID 329

Query: 332 SGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRL-PK----LPSVKL 386
           SG+  T   + VYE I  EF +QVN + ++   +    C+ + ++ + PK    + S+ L
Sbjct: 330 SGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGAF--DTCFSADNENVTPKITLHMTSLDL 387

Query: 387 MFPQNNSFV 395
             P  N+ +
Sbjct: 388 KLPMENTLI 396


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 78/326 (23%), Positives = 123/326 (37%), Gaps = 45/326 (13%)

Query: 85  FPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
            P+Q    +  GN     +   + +GTP     +  D GSDL W      +C P   S Y
Sbjct: 141 LPAQSGLPLGTGN-----YIVNVGLGTPKKDLSLIFDTGSDLTW-----TQCQPCVKSCY 190

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENTSSS 199
               +    + PS S T  ++SC+   C       G S       C Y +  Y +++ + 
Sbjct: 191 ---AQQQPIFDPSTSKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQ-YGDSSFTI 246

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G   +D L L        +N V    + GCG    G +       GLIGLG   +S+   
Sbjct: 247 GFFAKDKLTLT-------QNDVFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQ 296

Query: 260 LAKAGLIRNSFSMCF--DKDDSGRIFFGD-----QGPATQQSTSF--LASNGKYITYIIG 310
            A+       FS C    +  +G + FG+        A +   +F   AS+     Y I 
Sbjct: 297 TAQK--FGKYFSYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFID 354

Query: 311 VETCCIGSSCLKQTSF-----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
           V    +G   L  +         I+DSG+  T LP   Y ++ + F + ++   T+    
Sbjct: 355 VLGISVGGKALSISPMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALS 414

Query: 366 PWKCCYKSSSQRLPKLPSVKLMFPQN 391
               CY  S+     +P +   F  N
Sbjct: 415 LLDTCYDLSNYTSISIPKISFNFNGN 440


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 66/293 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLD-RDLNEYSPSASST 161
           +  GTP+ +     D GS L+W PC     C  C       ++ LD   +  + P  SS+
Sbjct: 94  LSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCN------FSGLDPTQIPRFIPKNSSS 147

Query: 162 SKHLSCSH-------------RLCDLGTSCQNPKQPC-PYTMDYYTENTSSSGLLVEDIL 207
           S+ + C +             R CD  T  +N   PC PY + Y   +T  +G+L+ + L
Sbjct: 148 SRVIGCQNPKCQFLFGANVQCRGCDPNT--RNCTVPCPPYILQYGLGST--AGILISEKL 203

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
                      +      ++GC +      +    P G+ G G G  S+PS +       
Sbjct: 204 D--------FPDLTVPDFVVGCSV------ISTRTPAGIAGFGRGPESLPSQMKLKSFSH 249

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQS--------TSFL----ASNGKYIT-YIIGVETC 314
              S  FD  +       D G   +          T F      SN  ++  Y + +   
Sbjct: 250 CLVSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRI 309

Query: 315 CIGSSCLK----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND 357
            +GS  +K            +  +IVDSGS+FTF+ + V+E +A EF  Q+++
Sbjct: 310 YVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSN 362


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 133/318 (41%), Gaps = 55/318 (17%)

Query: 108 DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +IG+P V      D GS+++WI C    C        N   + +  ++P+ SST     C
Sbjct: 113 NIGSPPVETYAIPDTGSNIVWIQCGSPICT-------NCYKQKIPLFNPTKSSTYAIRLC 165

Query: 168 SHRLCD-----LGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDIL----HLISGGDNA 216
            HR C      LG    C++  Q C Y +  Y +++ S G +  DI+    H+   G+ +
Sbjct: 166 GHRECKQALWGLGEYLGCKSSVQVCRYHIS-YEDHSFSEGTISTDIITFPEHIAEFGNYS 224

Query: 217 LKNSVQASVIIGCGMKQS---GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC 273
           L+      +  GCG   S   G   +     G++GLG     + SL+ +  L    FS C
Sbjct: 225 LR------MFFGCGYNNSETPGQDPNSFTAPGVVGLG---NEMASLVGQ--LTLGQFSYC 273

Query: 274 FDKDDSGR------IFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK---QT 324
               D  +      I FG     +  ST+ LA+N +       V+   +  + +K   + 
Sbjct: 274 ISTPDVQKPNGTIEIRFGLAASISGHSTA-LANNLEGWYIFQNVDGIYVDDTKVKGYPEW 332

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVN---DTITSFEGYPWKCCYKS 373
            F+         I+DSG+++T L     + +  E   Q+    DT        +  CY +
Sbjct: 333 VFQFAEGGIGGLIMDSGTTYTELYFSALDALIGELKEQIELAPDT-QDHSNSNYSLCYNA 391

Query: 374 SSQRLPKLPSVKLMFPQN 391
           ++  L  +P+++L F  N
Sbjct: 392 ANFLLTYVPAIELKFTDN 409


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 129/324 (39%), Gaps = 49/324 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  + +V +D GS   W+ C+C  C     ++             S S+T   +S
Sbjct: 5   VGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQ-----------SRSTTCAKVS 53

Query: 167 CSHRLCDLGTS---CQNPKQ--PCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
           C   +C LG S   CQ+ +    CP+ +  Y + ++S G+L +D L           + V
Sbjct: 54  CGTSMCLLGGSDPHCQDSENYPDCPFRVS-YQDGSASYGILYQDTLTF---------SDV 103

Query: 222 QA--SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
           Q       GC +   G    G   DGL+G+G G +SV   L ++    + FS C     S
Sbjct: 104 QKIPGFTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMS 159

Query: 280 GRIFF---------GDQGPATQ-QSTSFLASNGKYITYIIGVETCCIGSSCLKQT----S 325
            R FF         G     T  + T  +A       + + +    +    L  +    S
Sbjct: 160 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS 219

Query: 326 FKAIV-DSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
            K +V DSGS  +++P      +     R++     + E    + CY   S     +P++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLRQRI-RELLLKRGAAEEESERNCYDMRSVDEGDMPAI 278

Query: 385 KLMFPQNNSFVV-NNPVFVIYGTQ 407
            L F     F + ++ VFV    Q
Sbjct: 279 SLHFDDGARFDLGSHGVFVERSVQ 302


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/286 (26%), Positives = 117/286 (40%), Gaps = 60/286 (20%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IG P       +D GS+L+W  C   R             +DL  Y PS S T+K ++C+
Sbjct: 90  IGDPPQQAAAIIDTGSNLIWTQCSTCRA-------NGCFGQDLTFYDPSRSRTAKPVACN 142

Query: 169 HRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
              C LG  T C    + C     Y     +  G L  ++     G   + +N+V  S+ 
Sbjct: 143 DTACLLGSETRCARDGKACAVLTAYGA--GAIGGFLGTEVFTF--GHGQSSENNV--SLA 196

Query: 227 IGC--GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDDS 279
            GC    + + G LDG +  G+IGLG G++S+PS L       N FS C      D  ++
Sbjct: 197 FGCITASRLTPGSLDGAS--GIIGLGRGKLSLPSQLGD-----NKFSYCLTPYFSDAANT 249

Query: 280 GRIFF-------GDQGPATQQSTSFLAS------NGKYITYIIGVETCCIGSSCLKQTSF 326
             +F        G   PAT  S  FL +      +  Y   + G+    +G++ L   + 
Sbjct: 250 STLFVGASAGLSGGGAPAT--SVPFLKNPDDDPFDSFYYLPLTGIT---VGTAKLDVPAA 304

Query: 327 K-------------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTI 359
                          ++DSGS FT L    Y+ +  E  RQ+  ++
Sbjct: 305 AFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASV 350


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 72/299 (24%), Positives = 117/299 (39%), Gaps = 34/299 (11%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +Y  + +GTP     +  D GS L W      +C P + S Y   D     + PS SS+ 
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTW-----TQCEPCAGSCYKQQDPI---FDPSKSSSY 191

Query: 163 KHLSCSHRLCDLGTSC---QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
            ++ C+  LC    S     +    C Y +  Y +N+ S G L ++ L + +        
Sbjct: 192 TNIKCTSSLCTQFRSAGCSSSTDASCIYDVK-YGDNSISRGFLSQERLTITA-------T 243

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
            +    + GCG     G   G A  GL+GL    IS   +   + +    FS C     S
Sbjct: 244 DIVHDFLFGCGQDNE-GLFRGTA--GLMGLSRHPISF--VQQTSSIYNKIFSYCLPSTPS 298

Query: 280 --GRIFFGDQGP--ATQQSTSFLASNGKYITY---IIGVETCCIGSSCLKQTSFKA---I 329
             G + FG      A  + T F   +G+   Y   I+G+         +  ++F A   I
Sbjct: 299 SLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSI 358

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMF 388
           +DSG+  T LP   Y  + + F + +     ++       CY  S  +   +P +   F
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEF 417


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 124/319 (38%), Gaps = 65/319 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G+P  +  + LD GS+L W+ C   + +P   S +N          P +SST   + 
Sbjct: 65  LAVGSPPQNISMVLDTGSELSWLHC---KKSPNLGSVFN----------PVSSSTYSPVP 111

Query: 167 CSHRLCDLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CS  +C   T       SC +PK    +    Y + TS  G L  D           + +
Sbjct: 112 CSSPICRTRTRDLPIPASC-DPKTHFCHVAISYADATSIEGNLAHDTF--------VIGS 162

Query: 220 SVQASVIIGC-GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KD 277
             +   + GC     S    +     GL+G+  G +S  + L  +      FS C    D
Sbjct: 163 VTRPGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYCISGSD 217

Query: 278 DSGRIFFGDQ-----GPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK--QTSF- 326
            SG +  GD      GP          +   Y   + Y + +E   +GS  L   ++ F 
Sbjct: 218 SSGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFV 277

Query: 327 -------KAIVDSGSSFTFLPKEVYETIAAEFDRQ-------VNDTITSFEGYPWKCCYK 372
                  + +VDSG+ FTFL   VY  +  EF  Q       V+D    F+G     CY+
Sbjct: 278 PDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGT-MDLCYR 336

Query: 373 SSSQRLPK---LPSVKLMF 388
             S   P    LP + LMF
Sbjct: 337 VGSSTRPNFTGLPVISLMF 355


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 68/271 (25%), Positives = 114/271 (42%), Gaps = 47/271 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP VS+   LD GSDL+W  C  C +C       ++          P  SS+   +
Sbjct: 112 LAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFD----------PKKSSSFSKV 161

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           SC   LC        P   C    +Y   Y + + + G+L  +       G +  K SV 
Sbjct: 162 SCGSSLCS-----AVPSSTCSDGCEYVYSYGDYSMTQGVLATETFTF---GKSKNKVSVH 213

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGR- 281
            ++  GCG    G   +  +  GL+GLG G +S+ S L +       FS C    D  + 
Sbjct: 214 -NIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQLKEP-----RFSYCLTPMDDTKE 265

Query: 282 --IFFGDQGPATQQ----STSFLASNGKYITYIIGVETCCIGSSCL--KQTSFK------ 327
             +  G  G         +T  L +  +   Y + +E   +G + L  ++++F+      
Sbjct: 266 SILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGN 325

Query: 328 --AIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
              I+DSG++ T++ ++ +E +  EF  Q  
Sbjct: 326 GGVIIDSGTTITYIEQKAFEALKKEFISQTK 356


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 30/190 (15%)

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRI 282
           I GCG + + G   GV+  GL+GLG  ++S+ S    +G+    FS C    ++  SG +
Sbjct: 108 IFGCG-RNNKGLFGGVS--GLMGLGRSDLSLIS--QTSGIFGGVFSYCLPSTERKGSGSL 162

Query: 283 FFGDQGPATQQST-----SFLASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGS 334
             G      + S+       + +   Y  Y I +    IG   L+  S    + +VDSG+
Sbjct: 163 ILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGT 222

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLM 387
             T LP  +Y+ + AEF +Q       F G+P          C+  S+ +   +P++K+ 
Sbjct: 223 VITRLPPTIYKALKAEFLKQ-------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMH 275

Query: 388 FPQNNSFVVN 397
           F  N    V+
Sbjct: 276 FEGNAELTVD 285


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 83/319 (26%), Positives = 124/319 (38%), Gaps = 65/319 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G P  +  + LD GS+L W+ C   + +P   S +N          P +SST   + 
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHC---KKSPNLGSVFN----------PVSSSTYSPVP 115

Query: 167 CSHRLCDLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CS  +C   T       SC +PK    +    Y + TS  G L  +           + +
Sbjct: 116 CSSPICRTRTRDLPIPASC-DPKTHLCHVAISYADATSIEGNLAHETF--------VIGS 166

Query: 220 SVQASVIIGC-GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD-KD 277
             +   + GC     S    +     GL+G+  G +S  + L  +      FS C    D
Sbjct: 167 VTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYCISGSD 221

Query: 278 DSGRIFFGDQ-----GPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK--QTSF- 326
            SG +  GD      GP         ++   Y   + Y + +E   +GS  L   ++ F 
Sbjct: 222 SSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFV 281

Query: 327 -------KAIVDSGSSFTFLPKEVYETIAAEFD-------RQVNDTITSFEGYPWKCCYK 372
                  + +VDSG+ FTFL   VY  +  EF        R V+D    F+G     CYK
Sbjct: 282 PDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGT-MDLCYK 340

Query: 373 SSSQRLPK---LPSVKLMF 388
             S   P    LP V LMF
Sbjct: 341 VGSTTRPNFSGLPMVSLMF 359


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 125/319 (39%), Gaps = 60/319 (18%)

Query: 107 IDIGTP---NVSF--LVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           I +GTP   + SF  L++ D GSD+ W+ C  C RC       YN L           SS
Sbjct: 129 ITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLK----------SS 178

Query: 161 TSKHLSCSHRLCD-LGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
           ++  + C    C  LG+S  C      C Y ++Y   ++S+    VE +           
Sbjct: 179 SASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETL---------TF 229

Query: 218 KNSVQA-SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
              V+   V IGCG    G +    A  G++GLG G +S PS +  AG    SFS C   
Sbjct: 230 PPGVRVPGVAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQI--AGRYGRSFSYCLAG 285

Query: 277 DDSG----RIFFGDQGPA------TQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF 326
             +G     + FG    A          T  L ++  Y  Y +G+    +G   ++  + 
Sbjct: 286 QGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTE 345

Query: 327 K------------AIVDSGSSFTFLPKEVYETIAAEFD-RQVNDTITSFEGYPWK---CC 370
                         IVDSG++ T L    Y      F    V +      G P+     C
Sbjct: 346 SDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTC 405

Query: 371 YKSSSQR-LPKLPSVKLMF 388
           Y S   R + K+P+V + F
Sbjct: 406 YSSVRGRVMKKVPAVSMHF 424


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 120/294 (40%), Gaps = 52/294 (17%)

Query: 109 IGTPNVS-FLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           IGTP      + +D GSD++W  C  C  C            + L  +  SAS T   + 
Sbjct: 98  IGTPRPQQVALEVDTGSDVVWTQCRPCFDC----------FTQPLPRFDTSASDTVHGVL 147

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C+  +C            C Y ++Y  +N+ + G L +D       G   +       ++
Sbjct: 148 CTDPICRALRPHACFLGGCTYQVNY-GDNSVTIGQLAKDSFTFDGKGGGKV---TVPDLV 203

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD---KDDSGRIF 283
            GCG   +G +       G+ G G G +S+P  L  +     SFS CF    +  S  +F
Sbjct: 204 FGCGQYNTGNFHSNET--GIAGFGRGPLSLPRQLGVS-----SFSYCFTTIFESKSTPVF 256

Query: 284 FGDQ----------GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF----- 326
            G            GP    ST FL ++ +Y  Y + ++   +G + L   +++F     
Sbjct: 257 LGGAPADGLRAHATGPIL--STPFLPNHPEY--YYLSLKGITVGKTRLAVPESAFVVKAD 312

Query: 327 ---KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE--GYPWKCCYKSSS 375
                I+DSG++ T  P+ V+ ++   F  QV    TS+   G P   C+ + S
Sbjct: 313 GSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTES 366


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 135/360 (37%), Gaps = 75/360 (20%)

Query: 70  VQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWI 129
           ++ QK +T      LF +    T  L           +  GTP  +  + LD GS+L W+
Sbjct: 34  LRTQKHRTPISTPRLFSTTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWL 93

Query: 130 PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-------LGTSCQNPK 182
            C            +NS+      ++P AS T   + CS   C+       L  SC +P 
Sbjct: 94  HCK-------KEPNFNSI------FNPLASKTYTKIPCSSPTCETRTRDLPLPVSC-DPA 139

Query: 183 QPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVA 242
           + C + + Y   ++    L  E            + +    + + GC      G+     
Sbjct: 140 KLCHFIISYADASSVEGNLAFETF---------RVGSVTGPATVFGC---MDSGFSSNSE 187

Query: 243 PD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFGDQG--------- 288
            D    GL+G+  G +S    + + G     FS C  D+D SG +  G+           
Sbjct: 188 EDAKTTGLMGMNRGSLS---FVNQMGF--RKFSYCISDRDSSGVLLLGEASFSWLKPLNY 242

Query: 289 -PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSF--------KAIVDSGSSFT 337
            P  + ST     +   + Y + +E   +    L   ++ F        + +VDSG+ FT
Sbjct: 243 TPLVEMSTPLPYFD--RVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFT 300

Query: 338 FLPKEVYETIAAEF-------DRQVNDTITSFEGYPWKCCYKSSSQR--LPKLPSVKLMF 388
           FL   VY  +  EF        R +N+    F+G     CY     R  LP LP V LMF
Sbjct: 301 FLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQG-AMDLCYLIEPTRAALPNLPVVNLMF 359


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 54/173 (31%), Positives = 78/173 (45%), Gaps = 26/173 (15%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSA 158
           G  +   + IGTP   F  A+D  SDL+W  C  C  C       Y+ +D   N   P  
Sbjct: 86  GGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGC-------YHQVDPMFN---PRV 135

Query: 159 SSTSKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
           SST   L CS   C   D+     +  + C YT   Y+ N ++ G L  D L +   G++
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYT-YSGNATTEGTLAVDKLVI---GED 191

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA--KAGLI 266
           A +      V  GC    +GG     A  G++GLG G +S+ S L+  + G+I
Sbjct: 192 AFRG-----VAFGCSTSSTGGAPPPQA-SGVVGLGRGPLSLVSQLSVRRYGMI 238


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 140/335 (41%), Gaps = 49/335 (14%)

Query: 86  PSQGSKTMSLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           P   S  ++ GN     +Y     +GTP     + LD  +D +W+PC    C+  S +  
Sbjct: 12  PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS--GCSGCSNAST 69

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGL 201
           +      + YS  + ST++   C+      G +C +   P P    +   Y  ++S S  
Sbjct: 70  SFNTNSSSTYSTVSCSTAQ---CTQAR---GLTCPS-SSPQPSVCSFNQSYGGDSSFSAS 122

Query: 202 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
           LV+D L         L   V  +   GC    SG   + + P GL+GLG G +S+ S   
Sbjct: 123 LVQDTL--------TLAPDVIPNFSFGCINSASG---NSLPPQGLMGLGRGPMSLVS--Q 169

Query: 262 KAGLIRNSFSMCFDKDDS----GRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCI 316
              L    FS C     S    G +  G  G P + + T  L +  +   Y + +    +
Sbjct: 170 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 229

Query: 317 GSSCLK----QTSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY- 365
           GS  +       +F A      I+DSG+  T   + VYE I  EF +QVN  ++SF    
Sbjct: 230 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN--VSSFSTLG 287

Query: 366 PWKCCYKSSSQRL-PK----LPSVKLMFPQNNSFV 395
            +  C+ + ++ + PK    + S+ L  P  N+ +
Sbjct: 288 AFDTCFSADNENVAPKITLHMTSLDLKLPMENTLI 322


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 122/312 (39%), Gaps = 42/312 (13%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASSTSKH 164
           +  GTP  +  V  D GSD+ W+ C    VRC       ++          PS SST ++
Sbjct: 20  VGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFD----------PSLSSTYRN 69

Query: 165 LSCSHRLCDLGTSCQN-PKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           +SC+   C +G S +      C Y + +Y + +S+ G L  D   L        KN    
Sbjct: 70  VSCTEPAC-VGLSTRGCSSSTCLYGV-FYGDGSSTIGFLAMDTFMLTPA--QKFKN---- 121

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEI-SVPSLLAKAGLIRNSFSMCFDKDDSGRI 282
             I GCG   + G   G A  GL+GLG     S+ S +A +  + N FS C     S   
Sbjct: 122 -FIFGCGQNNT-GLFQGTA--GLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATG 175

Query: 283 FFGDQGPA-TQQSTSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKA---IVDSGSSF 336
           +     P  T   T+ L        Y I +    +G +   L  T F++   I+DSG+  
Sbjct: 176 YLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVI 235

Query: 337 TFLPKEVYETIAAEFDRQVNDTITSFEGYP----WKCCYKSSSQRLPKLPSVKLMFPQNN 392
           T LP   Y  +       V   +T +   P       CY  S       P + L F   +
Sbjct: 236 TRLPPTAYSALKTA----VRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLD 291

Query: 393 SFVVNNPVFVIY 404
             +    VF ++
Sbjct: 292 VRIPATGVFFVF 303


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 138/344 (40%), Gaps = 68/344 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRC-----APLSASYYNSLDRDLNEYSP 156
           ++IGTP     V +D GSDL W+PC     DC+ C       L A++  S        S 
Sbjct: 86  LNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASC 145

Query: 157 SA-------SSTSKHLSCSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDILH 208
           ++       SS +   +C+   C L T  +    +PCP     Y      +G+L  D L 
Sbjct: 146 ASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTRDTLR 205

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
            ++G    +   +      GC       Y +   P G+ G G G +   S++++ G ++ 
Sbjct: 206 -VNGSSPGVAKEI-PKFCFGC---VGSAYRE---PIGIAGFGRGTL---SMVSQLGFLQK 254

Query: 269 SFSMCF-------DKDDSGRIFFGDQGPATQ---QSTSFLASNGKYITYIIGVETCCIGS 318
            FS CF       + + S  +  GD    ++   Q T  L S      Y +G+E   +G+
Sbjct: 255 GFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGN 314

Query: 319 SCLKQT-----SFKAI------VDSGSSFTFLPKEVYETIAAEFDRQVN---DTITSFEG 364
               +       F ++      +DSG+++T LP+  Y  + +     +N   DT    + 
Sbjct: 315 VSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQ- 373

Query: 365 YPWKCCYK---------SSSQRLPK-----LPSVKLMFPQNNSF 394
             +  CYK         +S   LP      L +V L+ PQ N F
Sbjct: 374 TGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHF 417


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 123/319 (38%), Gaps = 68/319 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  S  + LD GS+L W+ C             NS+      ++P  SS+   + 
Sbjct: 74  LTVGTPPQSVTMVLDTGSELSWLHCK-------KQQNINSV------FNPHLSSSYTPIP 120

Query: 167 CSHRLCDLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           C   +C   T       SC +    C  T+ Y  + TS  G L  D          A+  
Sbjct: 121 CMSPICKTRTRDFLIPVSCDS-NNLCHVTVSY-ADFTSLEGNLASDTF--------AISG 170

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPD----GLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
           S Q  +I G       G+      D    GL+G+  G +S    + + G  +  FS C  
Sbjct: 171 SGQPGIIFG---SMDSGFSSNANEDSKTTGLMGMNRGSLS---FVTQMGFPK--FSYCIS 222

Query: 276 -KDDSGRIFFGDQ-----GPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK---- 322
            KD SG + FGD      GP        + +   Y   + Y + +    +GS  L+    
Sbjct: 223 GKDASGVLLFGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKE 282

Query: 323 ------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTIT-------SFEGYPWKC 369
                   + + +VDSG+ FTFL   VY  +  EF  Q    +T        FEG    C
Sbjct: 283 IFAPDHTGAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLC 342

Query: 370 CYKSSSQRLPKLPSVKLMF 388
                   +P +P+V ++F
Sbjct: 343 FRVRRGGVVPAVPAVTMVF 361


>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
          Length = 178

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 56/128 (43%), Gaps = 8/128 (6%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWI-PCDCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L+YT I IGTP V + V LD GS   W+    C +C      + + + R L  Y P +S 
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCP-----HESDILRKLTFYDPRSSV 112

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +SK + C   +C     C N    CPY    Y +   + G+L  D+LH      N     
Sbjct: 113 SSKEVKCDDTICTSRPPC-NMTLRCPYITG-YADGGLTMGILFTDLLHYHQLYGNGQTQP 170

Query: 221 VQASVIIG 228
              SV  G
Sbjct: 171 TSTSVTFG 178


>gi|219120658|ref|XP_002181063.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407779|gb|EEC47715.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 448

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 69/266 (25%), Positives = 111/266 (41%), Gaps = 39/266 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           H+    +G P  +  + +D GS L    C+ C +C    A  +  LD       P  SST
Sbjct: 86  HHVTAWMGEPPQAQTLIVDTGSRLTATACEPCSQCGTTHAHPFPHLD-------PQRSST 138

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
            ++  C   L      C   +Q C     Y TE +S + + V D   L     ++L+  V
Sbjct: 139 LRYTQCGSCLLSGIQECAA-EQKCGINQRY-TEGSSWTAVEVSDTFVLGGPEISSLEQYV 196

Query: 222 QASVII--GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDD 278
             ++I   GC  K  G +    A +G++GL   ++S+   L K  +I R SFS+C    +
Sbjct: 197 SFTIIFAFGCQQKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVIPRESFSLCMTPFE 255

Query: 279 SGRIFFG----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------- 327
            G I  G    D+   + + T F ++   Y  +++ V    +G  CL             
Sbjct: 256 -GYIGLGGPLRDKHTESMKYTPFTSTQSWYAVHVVRV---FVGDECLTSNDQHDTVVEHA 311

Query: 328 ----------AIVDSGSSFTFLPKEV 343
                      I+DSG++ T+LPK V
Sbjct: 312 LVEAFAEGKGTILDSGTTDTYLPKAV 337


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 72/312 (23%), Positives = 122/312 (39%), Gaps = 51/312 (16%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           ++   + +GTP    +  +D GSD++W  C  C  C    A  ++          PS SS
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFD----------PSKSS 469

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           T +   C+      G SC        Y +  Y + T S G+L  + + + S         
Sbjct: 470 TFREQRCN------GNSCH-------YEI-IYADKTYSKGILATETVTIPSTSGEPF--- 512

Query: 221 VQASVIIGCGMKQSGGYLDGVA--PDGLIGLGLGEISVPSL--LAKAGLIRNSFSMCFDK 276
           V A   IGCG+  +     G A    G++GL +G +S+ S   L   GLI    S CF  
Sbjct: 513 VMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI----SYCFSG 568

Query: 277 DDSGRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA-- 328
             + +I FG      G  T  +  F+  +  +  Y + ++   +  + +    T F A  
Sbjct: 569 QGTSKINFGTNAIVAGDGTVAADMFIKKDNPF--YYLNLDAVSVEDNLIATLGTPFHAED 626

Query: 329 ---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVK 385
               +DSG++ T+ P      +    ++ V        G     CY S +  +   P + 
Sbjct: 627 GNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTIDI--FPVIT 684

Query: 386 LMFPQNNSFVVN 397
           + F      V++
Sbjct: 685 MHFSGGADLVLD 696



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 107/257 (41%), Gaps = 53/257 (20%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASS 160
           ++   + +GTP       +D GSDL+W  C  C  C       Y+  D     + PS SS
Sbjct: 81  IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDC-------YSQFDP---IFDPSKSS 130

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED--ILHLISGGDNALK 218
           T     C       G SC        Y +  Y +NT S G+L  +   +H  SG     +
Sbjct: 131 TFNEQRCH------GKSCH-------YEI-IYEDNTYSKGILATETVTIHSTSG-----E 171

Query: 219 NSVQASVIIGCGMKQSGGYLDGVA--PDGLIGLGLGEISVPSL--LAKAGLIRNSFSMCF 274
             V A   IGCG+  +     G A    G++GL +G  S+ S   L   GLI    S CF
Sbjct: 172 PFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLI----SYCF 227

Query: 275 DKDDSGRIFFGDQ----GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQ--TSFKA 328
               + +I FG      G  T  +  F+  +  +  Y + ++   +  + ++   T F A
Sbjct: 228 SGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPF--YYLNLDAVSVEDNRIETLGTPFHA 285

Query: 329 -----IVDSGSSFTFLP 340
                ++DSGS+ T+ P
Sbjct: 286 EDGNIVIDSGSTVTYFP 302


>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like, partial [Brachypodium distachyon]
          Length = 364

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 64/227 (28%), Positives = 101/227 (44%), Gaps = 43/227 (18%)

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 251
           Y + +SS G L  D+  + S        S++A+   GC         DGVA  GL+G+  
Sbjct: 65  YADGSSSDGALATDVFAVGSA-----TPSLRAA--FGCMASAFDSSPDGVASAGLLGMNR 117

Query: 252 GEISVPSLLAKAGLIRNSFSMCF-DKDDSGRIFFG----------DQGPATQQSTSF--- 297
           G +S    +++AG  R  FS C  D+DD+G +  G          +  P  Q S      
Sbjct: 118 GALS---FVSQAGTRR--FSYCISDRDDAGVLLLGHSDLPNFLPLNYTPLYQPSLPLPYF 172

Query: 298 --LASNGKYITYIIGVETCCIGSSCLKQTSFKA---IVDSGSSFTFLPKEVYETIAAEFD 352
             +A + + +  ++G +   I +S L      A   +VDSG+ FTFL  + Y  + AEF 
Sbjct: 173 DRVAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKAEFY 232

Query: 353 RQ-------VNDTITSFEGYPWKCCYKSSSQRLPK----LPSVKLMF 388
           RQ       +++   +F+G  +  C++      P     LPSV L F
Sbjct: 233 RQSTPFLRALDEPSFAFQGA-FDTCFRVPRGMSPPPGRLLPSVTLRF 278


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/309 (24%), Positives = 124/309 (40%), Gaps = 47/309 (15%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IG+P V      D+GS L+W+ C    C        N   + +  ++PS S T     C+
Sbjct: 107 IGSPAVDTYAIPDSGSSLVWLQCGTPYCR-------NCYRQKIPLFNPSKSVTYMKRLCN 159

Query: 169 HRLCDLGT-----SCQNPKQPCPYTMDYYTENTSSSGLLVEDIL----HLISGGDNALKN 219
              C +        C+ P Q C Y  D Y +++ + G++  DI     H+   G+  L+ 
Sbjct: 160 TAECRVALGDEYWRCKKPNQICKYHED-YLDDSYTEGVISTDIFTFPEHISGFGNYTLR- 217

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD- 278
                +I GCG   S        P GL+GL   +    SL+ +  + + S+ +  D +  
Sbjct: 218 -----IIFGCGYNNSDP--QHFYPPGLVGLTNNK---ASLVGQMDVDQFSYCVSIDTEQN 267

Query: 279 ---SGRIFFGDQGPATQQSTSFLA-SNGKYI------TYIIGVETCCIGSSCLKQTSFKA 328
              S  I FG     +  ST  +  S+G YI       Y+   E     +   K T    
Sbjct: 268 LKGSMEIRFGLAASISGHSTQLVPNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQ 327

Query: 329 ---IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP---WKCCYKSSSQRLPKLP 382
               +D+G+++T L   V + +    +  +  TI   + Y    ++ CY S       LP
Sbjct: 328 GGLTMDTGTTYTELHNSVMDPLIKLLEEHI--TIVPEKDYSNSGFELCYFSDDFLGATLP 385

Query: 383 SVKLMFPQN 391
            ++L F  N
Sbjct: 386 DIELRFTDN 394


>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 435

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 140/326 (42%), Gaps = 50/326 (15%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN-SLDRDLNEYSPSASST 161
           + T I+  TP V+  + +D G   +W+ CD      +S+SY     D  L + + S S T
Sbjct: 49  YVTQINQRTPLVAVKLTVDLGGTFMWVDCDNY----VSSSYTPVRCDSALCKLADSHSCT 104

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNS 220
           ++  S     C   T    P  P          + S+SG +  D++ L S  G    +N 
Sbjct: 105 TECYSSPKPGCYNNTCSHIPYNP--------VVHVSTSGDIGLDVVSLQSMDGKYPGRNV 156

Query: 221 VQASVIIGCGMKQSGGYLDGVAPD--GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK-- 276
              +V   CG   +G  L+ +A    G+ GLG G IS+P+  + A  +++ F++C     
Sbjct: 157 SVPNVPFVCG---TGFMLENLADGVLGVAGLGRGNISLPAYFSSALGLQSKFAICLSSLT 213

Query: 277 DDSGRIFFGDQ-GPATQQSTSF-------LASNGKYIT------YIIGVETCCIGSSCLK 322
           + SG I+FGD  GP +     +       +++ G Y        Y I V+T  +G   +K
Sbjct: 214 NSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQSSTDYFIAVKTLRVGGKEIK 273

Query: 323 -QTSFKAIVDSGSS---------FTFLPKEVYETIAAEFDRQVNDTI-TSFEGYPWKCCY 371
              +  +I + G           +T L   +Y+ +   F +Q+   I  +    P+  CY
Sbjct: 274 FNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAKQMKFLIEVNPPIAPFGLCY 333

Query: 372 KSSSQRL----PKLPSVKLMFPQNNS 393
           +S++  +    P +P + L+     S
Sbjct: 334 QSAAMDINEYGPVVPFIDLVLESQGS 359


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/338 (22%), Positives = 136/338 (40%), Gaps = 66/338 (19%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + +D GSDL W+ C  C+ C           ++    + P+ASS+ ++++C
Sbjct: 157 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FEQRGPVFDPAASSSYRNVTC 206

Query: 168 SHRLCD-----------LGTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGD 214
               C               +C+ P + PCPY   Y  ++ ++  L +E   ++L + G 
Sbjct: 207 GDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 266

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
           +   +     V+ GCG +  G +       GL    L   S   L A  G   ++FS C 
Sbjct: 267 SRRVD----GVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCL 317

Query: 275 ---DKDDSGRIFFGDQGPATQ-------QSTSFLASNGKYIT----YIIGVETCCIGSSC 320
                D   ++ FG+   A         + T+F  ++         Y + ++   +G   
Sbjct: 318 VDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGEL 377

Query: 321 L----------KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKC 369
           L          K  S   I+DSG++ ++  +  Y+ I   F  +++ +      +P    
Sbjct: 378 LNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSP 437

Query: 370 CYKSSSQRLPKLPSVKLM--------FPQNNSFVVNNP 399
           CY  S    P++P + L+        FP  N F+  +P
Sbjct: 438 CYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDP 475


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 70/279 (25%), Positives = 113/279 (40%), Gaps = 46/279 (16%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC----DCVRC-APLSASY------YN 145
           + +G   +T   +GTP     V LD GS L W+PC    DC  C +P +A+        +
Sbjct: 98  HSYGGYAFT-ASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNS 156

Query: 146 SLDRDLNEYSPSA--SSTSKHLSCSHRLCDLGTSCQNPKQPC-PYTMDYYTENTSSSGLL 202
           S  R +   +PS     +++H++     C  G +C      C PY + Y   + S++GLL
Sbjct: 157 SSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVY--GSGSTAGLL 214

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
           + D L               +  ++GC +           P GL G G G  SVP+ L  
Sbjct: 215 IADTLR--------APGRAVSGFVLGCSLVSV-----HQPPSGLAGFGRGAPSVPAQLGL 261

Query: 263 AGLIRNSFSMCFDKDD--SGRIFFGDQGPATQQSTSFLASNGK----YITYIIGVETCCI 316
           +       S  FD +   SG +  G      Q      ++ G      + Y + +    +
Sbjct: 262 SKFSYCLLSRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTV 321

Query: 317 GSSCLK----------QTSFKAIVDSGSSFTFLPKEVYE 345
           G   ++            S  AIVDSG++FT+L   V++
Sbjct: 322 GGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQ 360


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/300 (25%), Positives = 118/300 (39%), Gaps = 68/300 (22%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD----CVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           +++GTP  +    LD GS L+W PC     C  C     ++ N     +  + P  SST+
Sbjct: 96  LNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHC-----NFPNIDTTKIPTFIPKNSSTA 150

Query: 163 KHLSCSHRLC------DLGTSC-------QNPKQPCPYTMDYYTENTSSSGLLVEDILHL 209
           K L C +  C      D+   C       QN    CP  +  Y   +++  LL++++   
Sbjct: 151 KLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNL--- 207

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
                           ++GC +      L    P G+ G G G+ S+PS   +  L R  
Sbjct: 208 ------NFPGKTVPQFLVGCSI------LSIRQPSGIAGFGRGQESLPS---QMNLKR-- 250

Query: 270 FSMCF------DKDDSGRIFF-----GDQGPATQQSTSFLA----SNGKYITY------- 307
           FS C       D   S  +       GD        T F +    +N  +  Y       
Sbjct: 251 FSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRK 310

Query: 308 -IIGVETCCIGSSCLKQTS---FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFE 363
            I+G +   I  + L+  S      IVDSGS+FTF+ + VY  +A EF +Q+    +  E
Sbjct: 311 VIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAE 370


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 80/187 (42%), Gaps = 29/187 (15%)

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
           ++  PS+   T+  GN     +   + +GTP        D GSDL W      +C P + 
Sbjct: 122 KVTLPSKSGSTIGTGN-----YVVTVGLGTPKRDLTFIFDTGSDLTW-----TQCEPCAR 171

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD-----LGTSCQNPKQPCPYTMDYYTENT 196
             Y+  +   N   PS S++  ++SCS   CD      G S       C Y +  Y + +
Sbjct: 172 YCYHQQEPIFN---PSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQ-YGDQS 227

Query: 197 SSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV 256
            S G   +D L L S         V  + + GCG    G ++ GVA  GLIGLG   +S+
Sbjct: 228 YSVGFFAQDKLALTS-------TDVFNNFLFGCGQNNRGLFV-GVA--GLIGLGRNALSL 277

Query: 257 PSLLAKA 263
            S   KA
Sbjct: 278 MSKYPKA 284


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/332 (22%), Positives = 134/332 (40%), Gaps = 60/332 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP   F + +D GSDL W+ C  C+ C           D+    + P+ASS+ ++++C
Sbjct: 157 VGTPPRRFRMIMDTGSDLNWLQCAPCLDC----------FDQVGPVFDPAASSSYRNVTC 206

Query: 168 SHRLCDL------GTSCQNP-KQPCPYTMDYYTENTSSSGLLVEDI-LHLISGGDNALKN 219
             + C L        +C+ P +  CPY   Y  ++ ++  L +E   ++L + G +   +
Sbjct: 207 GDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 266

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DK 276
                V+ GCG    G +       GL    L   S   L A  G   ++FS C      
Sbjct: 267 ----DVVFGCGHWNRGLFHGAAGLLGLGRGPLSFAS--QLRAVYG---HTFSYCLVDHGS 317

Query: 277 DDSGRIFFGDQG--------PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--- 325
           D + ++ FG+          P    +    AS+     Y + ++   +G   L  +S   
Sbjct: 318 DVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTW 377

Query: 326 ---------FKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSS 375
                       I+DSG++ ++  +  Y+ I   F  ++  +      +P    CY  S 
Sbjct: 378 GVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSG 437

Query: 376 QRLPKLPSVKLM--------FPQNNSFVVNNP 399
              P++P + L+        FP  N F+  +P
Sbjct: 438 VDRPEVPELSLLFADGAVWDFPAENYFIRLDP 469


>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
          Length = 412

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 73/331 (22%), Positives = 126/331 (38%), Gaps = 57/331 (17%)

Query: 82  QMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSA 141
           Q  F  +G   + L N     ++  I +GTP  +F V LD GS  LW+P   V+C  ++ 
Sbjct: 80  QEEFSVEGGHNVPLSNFMNAQYFAEISLGTPPQTFKVILDTGSSNLWVP--SVKCTSIAC 137

Query: 142 SYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGL 201
             +        +Y  S+SST K                       +++ Y   + S  G 
Sbjct: 138 FLH-------TKYDSSSSSTYK------------------ANGTEFSIQY--GSGSMEGF 170

Query: 202 LVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL-- 259
           + +D   +   GD  +     A      G+  + G  DG+     +GL    I+V  +  
Sbjct: 171 VSQDTFRI---GDLTVDGLDFAEATKEPGLAFAFGKFDGI-----LGLAYDTIAVNHITP 222

Query: 260 ----LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVE 312
               L   GL+     SF +   +DD G   FG    +            +   + + +E
Sbjct: 223 PFYHLINKGLVDEPVFSFRLGSSEDDGGEAIFGGVDDSAYTGKIQYVPVRRKAYWEVELE 282

Query: 313 TCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYK 372
              +G   L+  S  A +D+G+S   LP ++ E I    + Q+  T +      W   Y 
Sbjct: 283 KVSLGDDVLELESTGAAIDTGTSLIALPTDIAEMI----NTQIGATKS------WNGQYT 332

Query: 373 SSSQRLPKLPSVKLMFPQNNSFVVNNPVFVI 403
               ++P LP +   F   N +V+    +++
Sbjct: 333 VDCAKVPSLPDLTFTF-GGNPYVLKGTDYIL 362


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 137/337 (40%), Gaps = 53/337 (15%)

Query: 86  PSQGSKTMSLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYY 144
           P   S  ++ GN     +Y     +GTP     + LD  +D +W+PC    C+  S +  
Sbjct: 86  PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCS--GCSGCSNAST 143

Query: 145 NSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDY---YTENTSSS 199
           +      + YS         +SCS   C    G +C +   P P    +   Y  ++S S
Sbjct: 144 SFNTNSSSTYS--------TVSCSTAQCTQARGLTCPS-SSPQPSVCSFNQSYGGDSSFS 194

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
             LV+D L         L   V  +   GC    SG  L    P GL+GLG G +S+ S 
Sbjct: 195 ASLVQDTL--------TLAPDVIPNFSFGCINSASGNSL---PPQGLMGLGRGPMSLVS- 242

Query: 260 LAKAGLIRNSFSMCFDKDDS----GRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETC 314
                L    FS C     S    G +  G  G P + + T  L +  +   Y + +   
Sbjct: 243 -QTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGV 301

Query: 315 CIGSSCLK----QTSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG 364
            +GS  +       +F A      I+DSG+  T   + VYE I  EF +QVN  ++SF  
Sbjct: 302 SVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVN--VSSFST 359

Query: 365 Y-PWKCCYKSSSQRL-PK----LPSVKLMFPQNNSFV 395
              +  C+ + ++ + PK    + S+ L  P  N+ +
Sbjct: 360 LGAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLI 396


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/252 (25%), Positives = 105/252 (41%), Gaps = 47/252 (18%)

Query: 120 LDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSC 178
           +D GSDL+W  C  C+ CA     Y++             S+T + L C    C   +S 
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDV----------KKSATYRALPCRSSRCASLSSP 50

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS-VIIGCGMKQSGGY 237
              K+ C Y   YY +  S++G+L  +      G  N+ K  V+A+ +  GCG   +G  
Sbjct: 51  SCFKKMCVYQY-YYGDTASTAGVLANETFTF--GAANSTK--VRATNIAFGCGSLNAG-- 103

Query: 238 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSG---RIFFG--------- 285
            D     G++G G G +S+ S L  +      FS C     S    R++FG         
Sbjct: 104 -DLANSSGMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGVYANLSSTN 157

Query: 286 DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSF----------KAIVDSGSS 335
               +  QST F+ +      Y + ++   +G+  L                 I+DSG+S
Sbjct: 158 TSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTS 217

Query: 336 FTFLPKEVYETI 347
            T+L ++ YE +
Sbjct: 218 ITWLQQDAYEAV 229


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 119/298 (39%), Gaps = 75/298 (25%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC----DCVRCAPLSASYYNSLDRDLN 152
           + +G   +T + +GTP     V LD GS L W+PC     C  C+ LSA+        L+
Sbjct: 84  HSYGGYAFT-VSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAA------SPLH 136

Query: 153 EYSPSASSTSKHLSCSHRLC------DLGTSCQ---------------NPKQPCPYTMDY 191
            + P  SS+S+ + C +  C      D  + C+               N    CP  +  
Sbjct: 137 VFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVV 196

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 251
           Y    S++GLL+ D L        A++N      +IGC +           P GL G G 
Sbjct: 197 YGSG-STAGLLISDTLRTPG---RAVRN-----FVIGCSLASVHQ-----PPSGLAGFGR 242

Query: 252 GEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD---------------QGPATQQSTS 296
           G  SVPS L   GL + S+ +   + D      G+               Q     +S S
Sbjct: 243 GAPSVPSQL---GLTKFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSAS 299

Query: 297 FLASNGKYITYIIGVETCCIG--SSCLKQTSF-------KAIVDSGSSFTFLPKEVYE 345
             A     + Y + +    +G  S  L + +F        AIVDSG++F++  + V+E
Sbjct: 300 --ARPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFE 355


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 156/356 (43%), Gaps = 76/356 (21%)

Query: 92  TMSLGNDFGWLHYTWIDI--GTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDR 149
           T+  G + G   Y ++D+  G P   FL+ +D GSDL W+     +C P  A +    D+
Sbjct: 159 TVESGAELGAGEY-FMDVFVGNPPRHFLLIIDTGSDLTWL-----QCKPCKACF----DQ 208

Query: 150 DLNEYSPSASSTSKHLSCSHRLCDL--GTSCQ-NPKQPCPYTMDY---YTENTSSSGLLV 203
               + PS S++ K + C+   CDL     C+ N  +  P T  Y   Y +++ +SG L 
Sbjct: 209 SGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLA 268

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
            + L  +S  D+     ++  ++IGCG    G +        L+GLG G +S PS L ++
Sbjct: 269 LESLS-VSLSDHPSSLEIR-DMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RS 322

Query: 264 GLIRNSFSMCF-DKDD----SGRIFFG---------DQGPATQQSTSFLASNGKYIT-YI 308
             I  SFS C  D+ +    S  I FG         DQ     + T F+ +N    T Y 
Sbjct: 323 SPIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQ----MRFTPFVRTNNSVETFYY 378

Query: 309 IGVETCCIGSSCLKQTSFK----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDT 358
           +G++   I    L   + +           I+DSG++ T+L ++ Y  + + F  +++  
Sbjct: 379 LGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS-- 436

Query: 359 ITSFEGYPWK-------CCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNP 399
                 YP          CY ++ +     P++ ++F        PQ N F+  +P
Sbjct: 437 ------YPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDP 486


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 127/326 (38%), Gaps = 60/326 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPS-----ASS 160
           + IGTP       +D GSDL+W+ CD C  C             DL+ +  +     ASS
Sbjct: 9   LSIGTPPQLIPAMIDTGSDLVWLKCDNCDHC-------------DLDHHGETIFFSDASS 55

Query: 161 TSKHLSCSHRLCD------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
           + K L C+   C       +G  C+   + C Y  +Y  + + +SG +  D +   S G 
Sbjct: 56  SYKKLPCNSTHCSGMSSAGIGPRCE---ETCKYKYEY-GDGSRTSGDVGSDRISFRSHGA 111

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMC- 273
                S     + GC  K  G   D     GLIGLG    S+   L     +   FS C 
Sbjct: 112 GEDHRSFFDGFLFGCARKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCL 166

Query: 274 --FDKDDSGRIFFGDQGPATQQSTSFLAS---NGKYIT---YIIGVETCCIG-------- 317
             +D   S + F      A  +    +++   +G ++    Y + +++  IG        
Sbjct: 167 VSYDSPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYD 226

Query: 318 ------SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCC 370
                 +S     + K ++DSG+++T L   VYE +    + QV   T+ +  G     C
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAG--LDLC 284

Query: 371 YKSSSQRLPKLPSVKLMFPQNNSFVV 396
           + SS       PSV   F      V+
Sbjct: 285 FNSSGDTSYGFPSVTFYFANQVQLVL 310


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 129/313 (41%), Gaps = 62/313 (19%)

Query: 113 NVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLC 172
           N+S ++  D GS+L W+ C+            +S    +N + P+ SS+   + CS   C
Sbjct: 85  NISMVI--DTGSELSWLRCN-----------RSSNPNPVNNFDPTRSSSYSPIPCSSPTC 131

Query: 173 DLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
              T       SC + K  C  T+  Y + +SS G L  +I H      N+  +S   ++
Sbjct: 132 RTRTRDFLIPASCDSDKL-CHATLS-YADASSSEGNLAAEIFHF----GNSTNDS---NL 182

Query: 226 IIGCGMKQSGG-YLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFF 284
           I GC    SG    +     GL+G+  G +   S +++ G  + S+ +    D  G +  
Sbjct: 183 IFGCMGSVSGSDPEEDTKTTGLLGMNRGSL---SFISQMGFPKFSYCISGTDDFPGFLLL 239

Query: 285 GDQG----------PATQQSTSF-LASNGKYITYIIGVET----CCIGSSCL---KQTSF 326
           GD            P  + ST         Y   + G++       I  S L      + 
Sbjct: 240 GDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG 299

Query: 327 KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYKSSSQR--- 377
           + +VDSG+ FTFL   VY  + + F  + N  +T +E   +        CY+ S  R   
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRS 359

Query: 378 --LPKLPSVKLMF 388
             L +LP+V L+F
Sbjct: 360 GILHRLPTVSLVF 372


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 126/324 (38%), Gaps = 50/324 (15%)

Query: 95  LGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRD 150
           LG     L Y   +  GTP V  +V +D GSD+ W+   PC   +C P     Y+     
Sbjct: 70  LGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYD----- 124

Query: 151 LNEYSPSASSTSKHLSCSHRLCD------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
                PS SST   + C+  +C        G+ C + KQ C + +  Y + TS+ G   +
Sbjct: 125 -----PSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAIS-YADGTSTVGAYSQ 177

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAK 262
           D L L  G       ++  +   GCG  +    G  DGV       LGLG +   SL A+
Sbjct: 178 DKLTLAPG-------AIVQNFYFGCGHGKHAVRGLFDGV-------LGLGRLR-ESLGAR 222

Query: 263 AGLIRNSFSMCFDKDDSGRIFF---GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
            G +   FS C     S   F      + P+    T      G+     + +    +G  
Sbjct: 223 YGGV---FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGK 279

Query: 320 C--LKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
              L+ ++F    IVDSG+  T L    Y  + + F R+  +            CY  + 
Sbjct: 280 KLDLRPSAFSGGMIVDSGTVITGLQSTAYRALRSAF-RKAMEAYRLLPNGDLDTCYNLTG 338

Query: 376 QRLPKLPSVKLMFPQNNSFVVNNP 399
            +   +P + L F    +  ++ P
Sbjct: 339 YKNVVVPKIALTFTGGATINLDVP 362


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 82/334 (24%), Positives = 132/334 (39%), Gaps = 48/334 (14%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSL 147
           G+K     +  G  +     IG P +     +D GSDL+W+ C  C  C P  +  Y+  
Sbjct: 73  GTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYD-- 130

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCD-------LGTSCQNPKQPCPYTMDY-YTENTSSS 199
                   P+ S +S  L CS +LC        +   C +    C Y   Y ++ + S+ 
Sbjct: 131 --------PARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQ 182

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G+L  +       GD  + N+V      G      G    G A  GL+GLG G +   SL
Sbjct: 183 GVLGTETFTF---GDGYVANNVS----FGRSDTIDGSQFGGTA--GLVGLGRGHL---SL 230

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQGP----ATQQSTSFLASNGK---YITYIIGVE 312
           +++ G  R ++ +  D +    I FG        A   S++ L +N K      Y + ++
Sbjct: 231 VSQLGAGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQ 290

Query: 313 TCCIGSSCL--KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQ-VNDTITS------FE 363
              +G S L  K  +F AI   GS   F      +T   +   Q V   ITS      ++
Sbjct: 291 GISVGGSRLPIKDGTF-AINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYD 349

Query: 364 GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVVN 397
                C   ++ Q + ++P + L F       +N
Sbjct: 350 AGDDTCFVAANQQAVAQMPPLVLHFDDGADMSLN 383


>gi|46488451|gb|AAS99547.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488453|gb|AAS99548.1| aspartic protease PM5 [Plasmodium vivax]
          Length = 536

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 114/288 (39%), Gaps = 61/288 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           IDIGTP     + LD GS  L  PC  C  C               N ++ + S TS  L
Sbjct: 66  IDIGTPEQRISLILDTGSSSLSFPCAGCKNCGVHME----------NPFNLNNSKTSSIL 115

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C +  C    +C   K  C Y M  Y E +  SG    D++ ++S  +      V    
Sbjct: 116 YCENEECPFKLNCVKGK--CEY-MQSYCEGSQISGFYFSDVVSVVSYNN----ERVTFRK 168

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPS----LLAKAGLIRNSFSMCFDKDDSG 280
           ++GC M +   +L   A  G++G+ L +   +P+    L   A  ++  F++C   ++ G
Sbjct: 169 LMGCHMHEESLFLYQQA-TGVLGMSLSKPQGIPTFVNLLFDNAPQLKQVFTICIS-ENGG 226

Query: 281 RIFFGDQGPA---TQQSTSFLASNG--------------------------------KYI 305
            +  G   PA    ++ +  ++  G                                KY 
Sbjct: 227 ELIAGGYDPAYIVRRRGSKSVSGQGSGPVSESLSESGEDPQVALREAEKIVWENVTRKYY 286

Query: 306 TYIIGVETCCIGSSCLKQTS-FKAIVDSGSSFTFLPKEVYETIAAEFD 352
            YI        G++ +  +   + +VDSGS+FT +P+++Y  +   FD
Sbjct: 287 YYIKVRGLDMFGTNMMSSSKGLEMLVDSGSTFTHIPEDLYNKLNYFFD 334


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 131/341 (38%), Gaps = 66/341 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNS-------------LD 148
           +++GTP     V +D GSDL W+PC     DC+ C      Y N+               
Sbjct: 33  LNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCN----DYRNNKLMSTYSPSYSSSSL 88

Query: 149 RDL---NEYSPSASSTSKHLSCSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVE 204
           RDL      S   SS + +  C+   C L T  +    +PCP     Y       G L  
Sbjct: 89  RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 148

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           D L    G   +    V  +   GC       Y +   P G+ G G G +S+PS L   G
Sbjct: 149 DTL-TTHGSSPSFTREV-PNFCFGC---VGSTYRE---PIGIAGFGRGVLSLPSQL---G 197

Query: 265 LIRNSFSMCF-------DKDDSGRIFFGDQGPATQQSTSF--LASNGKYITYI-IGVETC 314
            ++  FS CF       + + S  +  GD   ++     F  L  N  Y  Y  IG+E  
Sbjct: 198 FLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAI 257

Query: 315 CIGSSCLKQ--TSFKA---------IVDSGSSFTFLPKEVY-------ETIAAEFDRQVN 356
            +G++   Q  +S +          I+DSG+++T LP   Y       ++I      Q  
Sbjct: 258 TVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQ 317

Query: 357 DTITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
           +  T F+  Y   C     +     LPS+   F  N S V+
Sbjct: 318 EARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVL 358


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 156/353 (44%), Gaps = 70/353 (19%)

Query: 92  TMSLGNDFGWLHYTWIDI--GTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDR 149
           T+  G + G   Y ++D+  G P   FL+ +D GSDL W+     +C P  A +    D+
Sbjct: 75  TVESGAELGAGEY-FMDVFVGNPPRHFLLIIDTGSDLTWL-----QCKPCKACF----DQ 124

Query: 150 DLNEYSPSASSTSKHLSCSHRLCDL--GTSCQ-NPKQPCPYTMDY---YTENTSSSGLLV 203
               + PS S++ K + C+   CDL     C+ N  +  P T  Y   Y +++ +SG L 
Sbjct: 125 SGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLA 184

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
            + L  +S  D+     ++  ++IGCG    G +        L+GLG G +S PS L ++
Sbjct: 185 LESLS-VSLSDHPSSLEIR-DMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RS 238

Query: 264 GLIRNSFSMCF-DKDD----SGRIFFGDQGPATQQS------TSFLASNGKYIT-YIIGV 311
             I  SFS C  D+ +    S  I FG  G A  +       T F+ +N    T Y +G+
Sbjct: 239 SPIGQSFSYCLVDRTNNLSVSSAISFG-AGFALSRHFDQMKFTPFVRTNNSVETFYYLGI 297

Query: 312 ETCCIGSSCLKQTSFK----------AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITS 361
           +   I    L   + +           I+DSG++ T+L ++ Y  + + F  +++     
Sbjct: 298 QGIKIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS----- 352

Query: 362 FEGYPWK-------CCYKSSSQRLPKLPSVKLMF--------PQNNSFVVNNP 399
              YP          CY ++ +     P++ ++F        PQ N F+  +P
Sbjct: 353 ---YPRADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDP 402


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 30/190 (15%)

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDSGRI 282
           I GCG + + G   GV+  GL+GLG  ++S+ S    +G+    FS C    ++  SG +
Sbjct: 165 IFGCG-RNNKGLFGGVS--GLMGLGRSDLSLIS--QTSGIFGGVFSYCLPSTERKGSGSL 219

Query: 283 FFGDQGPATQQST-----SFLASNGKYITYIIGVETCCIGSSCLKQTSF---KAIVDSGS 334
             G      + S+       + +   Y  Y I +    IG   L+  S    + +VDSG+
Sbjct: 220 ILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGT 279

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYKSSSQRLPKLPSVKLM 387
             T LP  +Y+ + AEF +Q       F G+P          C+  S+ +   +P++K+ 
Sbjct: 280 VITRLPPTIYKALKAEFLKQ-------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMH 332

Query: 388 FPQNNSFVVN 397
           F  N    V+
Sbjct: 333 FEGNAELTVD 342


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/319 (24%), Positives = 123/319 (38%), Gaps = 58/319 (18%)

Query: 76  KTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVR 135
           K GP+   +  + G + +S+ +     +     +GTP  + LVA+D  +D  W+P     
Sbjct: 85  KKGPRRSFVPIAPGRQLLSIPS-----YVARARLGTPAQALLVAIDPSNDAAWVP----- 134

Query: 136 CAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCP--------Y 187
                     +       + P+ SST + + C    C      Q P   CP        +
Sbjct: 135 ------CAACAGCARAPSFDPTRSSTYRPVRCGAPQCS-----QAPAPSCPGGLGSSCAF 183

Query: 188 TMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLI 247
            + Y    ++   LL +D L L    D        A+   GC    +GG    V P GL+
Sbjct: 184 NLSY--AASTFQALLGQDALALHDDVDAV------AAYTFGCLHVVTGG---SVPPQGLV 232

Query: 248 GLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPATQQSTSFLASNGK 303
           G G G +S PS      +  + FS C       + SG +  G  G   +  T+ L SN  
Sbjct: 233 GFGRGPLSFPSQTKD--VYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPH 290

Query: 304 -----YITYI---IGVETCCIGSSCLK--QTSFKA-IVDSGSSFTFLPKEVYETIAAEFD 352
                Y+  +   +G     + +S L    TS +  IVD+G+ FT L   VY  +   F 
Sbjct: 291 RPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFR 350

Query: 353 RQVNDTITSFEGYPWKCCY 371
            +V   +    G  +  CY
Sbjct: 351 SRVRAPVAGPLGG-FDTCY 368


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 131/341 (38%), Gaps = 66/341 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-----DCVRCAPLSASYYNS-------------LD 148
           +++GTP     V +D GSDL W+PC     DC+ C      Y N+               
Sbjct: 16  LNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCN----DYRNNKLMSTYSPSYSSSSL 71

Query: 149 RDL---NEYSPSASSTSKHLSCSHRLCDLGTSCQNP-KQPCPYTMDYYTENTSSSGLLVE 204
           RDL      S   SS + +  C+   C L T  +    +PCP     Y       G L  
Sbjct: 72  RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 131

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           D L    G   +    V  +   GC       Y +   P G+ G G G +S+PS L   G
Sbjct: 132 DTL-TTHGSSPSFTREV-PNFCFGC---VGSTYRE---PIGIAGFGRGVLSLPSQL---G 180

Query: 265 LIRNSFSMCF-------DKDDSGRIFFGDQGPATQQSTSF--LASNGKYITYI-IGVETC 314
            ++  FS CF       + + S  +  GD   ++     F  L  N  Y  Y  IG+E  
Sbjct: 181 FLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAI 240

Query: 315 CIGSSCLKQ--TSFKA---------IVDSGSSFTFLPKEVY-------ETIAAEFDRQVN 356
            +G++   Q  +S +          I+DSG+++T LP   Y       ++I      Q  
Sbjct: 241 TVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQ 300

Query: 357 DTITSFE-GYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
           +  T F+  Y   C     +     LPS+   F  N S V+
Sbjct: 301 EARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVL 341


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 127/308 (41%), Gaps = 50/308 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T + +GTP     + LD GSD++WI C  C +C       Y   D   N   P+ASST
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKC-------YGQTDPLFN---PAASST 202

Query: 162 SKHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            + + C+  LC   D+ + C+N K+ C Y + Y   + +      E +           +
Sbjct: 203 YRKVPCATPLCKKLDI-SGCRN-KRYCEYQVSYGDGSFTVGDFSTETL---------TFR 251

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
             V   V +GCG    G +   +   GL+GLG G +S PS           FS C  D+ 
Sbjct: 252 GQVIRRVALGCGHDNEGLF---IGAAGLLGLGRGSLSFPS--QTGAQFSKRFSYCLVDRS 306

Query: 278 DSG---RIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKA------ 328
            SG    + FG          + L SN K  T+   VE   I     + TS  A      
Sbjct: 307 ASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYY-VELVGISVGGRRLTSIPASVFRMD 365

Query: 329 -------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPK 380
                  I+DSG+S T L    Y T+   F R     + S  G+  +  CY  S  +  K
Sbjct: 366 ATGNGGVIIDSGTSVTRLVDSAYSTMRDAF-RVGTGNLKSAGGFSLFDTCYDLSGLKTVK 424

Query: 381 LPSVKLMF 388
           +P++   F
Sbjct: 425 VPTLVFHF 432


>gi|409050032|gb|EKM59509.1| hypothetical protein PHACADRAFT_250062 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 407

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/304 (21%), Positives = 113/304 (37%), Gaps = 52/304 (17%)

Query: 92  TMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIP---CDCVRCAPLSASYYNSLD 148
           T+ L N     ++T I+IGTP  SF V LD GS  LW+P   C  + C            
Sbjct: 86  TLPLQNFMNAQYFTTIEIGTPPQSFNVILDTGSSNLWVPSTQCTSIAC------------ 133

Query: 149 RDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
                               H+  D G+S         +++ Y   + S  G +  D+L 
Sbjct: 134 ------------------FLHKKYDSGSSSTYKPNGSEFSIQY--GSGSMEGFVSRDVLT 173

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL-LAKAGLIR 267
           +   GD  +     A      G+  + G  DG+       + +  I+ P   + + GLI 
Sbjct: 174 M---GDITIGQQDFAEATKEPGLAFAFGKFDGILGLAYDTIAVNHITPPHYNMFEKGLIE 230

Query: 268 N---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQT 324
               +F +   ++D+G   FG    +  +         +   + + +E   +G   L+  
Sbjct: 231 KPVFAFRLGSTEEDAGEATFGGIDESAFEGKLHRVPVRRKAYWEVELEKVRLGDDELELE 290

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
              A +D+G+S   LP ++ E I A+   +            W   Y      +P LP++
Sbjct: 291 DTGAAIDTGTSLIALPTDMAEMINAQIGAKRG----------WNGQYTVECSTVPDLPAL 340

Query: 385 KLMF 388
            L F
Sbjct: 341 TLYF 344


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/311 (24%), Positives = 123/311 (39%), Gaps = 60/311 (19%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL---------DRDL------ 151
           +  GTP + + + LD  +DL WI C   R       +Y            D D+      
Sbjct: 144 VRFGTPALPYNLVLDTANDLTWINC---RLRRRKGKHYGRQSSKTMSVGGDDDVVAALAK 200

Query: 152 -----NEYSPSASSTSKHLSCSHRLCDL--GTSCQNPK--QPCPYTMDYYTENTSSSGLL 202
                N Y P+ SS+ + + CS + C      +CQ+P   + C Y         +     
Sbjct: 201 KEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIGIYG 260

Query: 203 VEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK 262
            E     +S G    + +    +++GC + ++G  +D  A DG++ LG G +S     A 
Sbjct: 261 NEKATVTVSDG----RMAKLPGLVLGCSVLEAGASVD--AHDGVLSLGNGHMS----FAI 310

Query: 263 AGLIR--NSFSMCF-----DKDDSGRIFFGDQ----GPATQQS-----TSFLASNGKYIT 306
             ++R    FS C       +D S  + FG      GP T ++         A+ G  +T
Sbjct: 311 HAVLRFGGRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVT 370

Query: 307 YI-IGVETCCIGSSCL---KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVND-TITS 361
            + +G E   I        K      I+D+ +S T L  E YE + A  DR +      S
Sbjct: 371 AVLVGGERLDIPDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAHLPRES 430

Query: 362 FEGYPWKCCYK 372
           F G+ +  CY+
Sbjct: 431 FAGFEY--CYR 439


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/300 (25%), Positives = 131/300 (43%), Gaps = 39/300 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++T + IG P     + LD GSD+ W+     +C P +  Y+ +       + PS+SS+ 
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWL-----QCTPCADCYHQTEPI----FEPSSSSSY 201

Query: 163 KHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
           + LSC    C+     +     C Y +  Y + + + G    + L +   G   ++N   
Sbjct: 202 EPLSCDTPQCNALEVSECRNATCLYEVS-YGDGSYTVGDFATETLTI---GSTLVQN--- 254

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DKDDS 279
             V +GCG    G +   V   GL+GLG G +++PS L        SFS C    D D +
Sbjct: 255 --VAVGCGHSNEGLF---VGAAGLLGLGGGLLALPSQLNTT-----SFSYCLVDRDSDSA 304

Query: 280 GRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFKA--------I 329
             + FG   P        L ++     Y +G+    +G   L+  Q+SF+         I
Sbjct: 305 STVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGII 364

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFP 389
           +DSG++ T L   +Y ++   F +  +D   +     +  CY  S++   ++P+V   FP
Sbjct: 365 IDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFP 424


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 131/305 (42%), Gaps = 45/305 (14%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSC 167
           +GTP    L+A+D  +D  WIPC  C  C   SA+ ++          P+AS++ + + C
Sbjct: 118 LGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFD----------PAASASYRTVPC 167

Query: 168 SHRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
              LC      +C    + C +++ Y   ++S    L +D L +     NA+K     + 
Sbjct: 168 GSPLCAQAPNAACPPGGKACGFSLTY--ADSSLQAALSQDSLAV---AGNAVK-----AY 217

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK----DDSGR 281
             GC  + +G       P GL+GLG G +S   L     +   +FS C       + SG 
Sbjct: 218 TFGCLQRATG---TAAPPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGT 272

Query: 282 IFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------AIVDSGS 334
           +  G  G P   ++T  LA+  +   Y + +    +G   +   +F        ++DSG+
Sbjct: 273 LRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGT 332

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLP----SVKLMFPQ 390
            FT L    Y  +  E  R+V   ++S  G+    C+ +++   P +      +++  P+
Sbjct: 333 MFTRLVAPAYVAVRDEVRRRVGAPVSSLGGF--DTCFNTTAVAWPPMTLLFDGMQVTLPE 390

Query: 391 NNSFV 395
            N  +
Sbjct: 391 ENVVI 395


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 90/211 (42%), Gaps = 47/211 (22%)

Query: 76  KTGPQFQMLFPSQGSKTMSLGNDFGWLH--------------YTWIDIGTPNVSFLVALD 121
           K G   +    +  ++  SL +  G LH              +  + +GTP+   ++ +D
Sbjct: 45  KRGSLLRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVID 104

Query: 122 AGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSH------RL--C 172
            GSDL+W+ C  C RC       ++          P  SST + + CS       R   C
Sbjct: 105 TGSDLVWLQCSPCRRCYAQRGQVFD----------PRRSSTYRRVPCSSPQCRALRFPGC 154

Query: 173 DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMK 232
           D G +       C Y M  Y + +SS+G L  D L   +  D  + N     V +GCG +
Sbjct: 155 DSGGAAGG---GCRY-MVAYGDGSSSTGELATDKLAFAN--DTYVNN-----VTLGCG-R 202

Query: 233 QSGGYLDGVAPDGLIGLGLGEISVPSLLAKA 263
            + G  D  A  GL+G+  G+IS+ + +A A
Sbjct: 203 DNEGLFDSAA--GLLGVARGKISISTQVAPA 231


>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
          Length = 547

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/271 (26%), Positives = 116/271 (42%), Gaps = 40/271 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           H+ +I  GTP     V ++ GS     PC +C  C   +  Y++          PS SST
Sbjct: 108 HFAYIYAGTPPQRASVIINTGSHFSAFPCSECRSCGNHTDPYWD----------PSQSST 157

Query: 162 SKHLSCSH-RLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           +  ++C     C     CQ+ K+ C    ++YTE +S     V+D+L +   G+  L +S
Sbjct: 158 AHIVTCDETERCHGAYKCQSDKK-C-VLREHYTEGSSWRAKQVDDLLWV---GERTLSDS 212

Query: 221 VQ-------ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSM 272
            +            GC    +G +   +A DG++GL     ++ + LA AG I    FS+
Sbjct: 213 QKHDDSAFSVDFTFGCIESLTGLFKTQLA-DGIMGLNADSRTLITQLATAGKISERKFSL 271

Query: 273 CFDKDDSGRIFFGDQGPATQQSTS---FLASNGKYITYII--------GVETCCIGSSCL 321
           CF  +  G +  G   P   +  S   +  S G+     +        GV      S   
Sbjct: 272 CF-SETGGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVTDVTLNGVSITTDASVFQ 330

Query: 322 KQTSFKAIVDSGSSFTFLPKEVYETIAAEFD 352
           K T  K +  SG++ T+LP+ V E  +A ++
Sbjct: 331 KGTGIKIV--SGTTNTYLPRAVAEGFSAAWE 359


>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
          Length = 443

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 75/326 (23%), Positives = 124/326 (38%), Gaps = 60/326 (18%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPS-ASST 161
           + T I   TP V   V LD G + LWI C+          Y +S  R +   SP    S 
Sbjct: 48  YITQITQRTPPVQLKVVLDVGGEFLWIDCE--------KGYKSSTKRPVPCGSPQCVLSG 99

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLIS-GGDNALKNS 220
           S   + S    D+G     P  P          +  +SG L EDIL++ S  G N  K  
Sbjct: 100 SGACTTSDNPSDVGVCGVMPNNPF--------SSVGTSGDLFEDILYIQSTNGFNPGKQV 151

Query: 221 VQASVIIGCGMKQSGGYLDGVAPD--GLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
              +++  C        L+G+A    G+ G G  ++++PSL + A      F +C    +
Sbjct: 152 SVPNLLFSCAPNS---LLEGLASGIIGMAGFGRNKVALPSLFSSAFSFPRKFGVCLSSSN 208

Query: 279 SGRIFFGDQ------------------GPATQQSTSFLAS--NGKYITYIIGVETCCIGS 318
            G IFFG +                   P  Q   S ++S        Y IGV++  +  
Sbjct: 209 -GVIFFGKEPYVLLPGIDVSDPTSLTYTPLIQNPRSLVSSFEGNPSAEYFIGVKSIKVDG 267

Query: 319 SCLK-QTSFKAIVDSGS----------SFTFLPKEVYETIAAEFDRQVNDTITSFEGY-P 366
             L+  T+     + G            FT L   +Y+ +   F + +   +   +   P
Sbjct: 268 KPLRLNTTLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFVKALGPKVPRVKAVAP 327

Query: 367 WKCCYKS----SSQRLPKLPSVKLMF 388
           +  C+ +    +++  P +P + L+ 
Sbjct: 328 FGACFNAKYIGNTRVGPAVPQIDLVL 353


>gi|46488413|gb|AAS99528.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488415|gb|AAS99529.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488417|gb|AAS99530.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488419|gb|AAS99531.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488421|gb|AAS99532.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488423|gb|AAS99533.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488425|gb|AAS99534.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488427|gb|AAS99535.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488429|gb|AAS99536.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488431|gb|AAS99537.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488433|gb|AAS99538.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488435|gb|AAS99539.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488437|gb|AAS99540.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488439|gb|AAS99541.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488441|gb|AAS99542.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488443|gb|AAS99543.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488445|gb|AAS99544.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488447|gb|AAS99545.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488449|gb|AAS99546.1| aspartic protease PM5 [Plasmodium vivax]
 gi|46488455|gb|AAS99549.1| aspartic protease PM5 [Plasmodium vivax]
          Length = 536

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 61/288 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           IDIGTP     + LD GS  L  PC  C  C               N ++ + S TS  L
Sbjct: 66  IDIGTPEQRISLILDTGSSSLSFPCAGCKNCGVHME----------NPFNLNNSKTSSIL 115

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C +  C    +C   K  C Y M  Y E +  SG    D++ ++S  +      V    
Sbjct: 116 YCENEECPFKLNCVKGK--CEY-MQSYCEGSQISGFYFSDVVSVVSYNN----ERVTFRK 168

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPS----LLAKAGLIRNSFSMCFDKDDSG 280
           ++GC M +   +L   A  G++G+ L +   +P+    L   A  ++  F++C   ++ G
Sbjct: 169 LMGCHMHEESLFLYQQA-TGVLGMSLSKPQGIPTFVNLLFDNAPQLKQVFTICIS-ENGG 226

Query: 281 RIFFGDQGP---------------------------------ATQQSTSFLASN--GKYI 305
            +  G   P                                 A +++   +  N   KY 
Sbjct: 227 ELIAGGYDPAYIVRRGGSKSVSGQGSGPVSESLSESGEDPQVALREAEKIVWENVTRKYY 286

Query: 306 TYIIGVETCCIGSSCLKQTS-FKAIVDSGSSFTFLPKEVYETIAAEFD 352
            YI        G++ +  +   + +VDSGS+FT +P+++Y  +   FD
Sbjct: 287 YYIKVRGLDMFGTNMMSSSKGLEMLVDSGSTFTHIPEDLYNKLNYFFD 334


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 81/338 (23%), Positives = 135/338 (39%), Gaps = 54/338 (15%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNE----YSPSASSTS 162
           + IGTP +S+    D GSDL+W      +CAP   +  ++ ++   +    Y+PS+S+T 
Sbjct: 91  LSIGTPPLSYRAIADTGSDLIW-----TQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145

Query: 163 KHLSCSHRLCDLGTSCQNPKQP----CPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
             L C+  L  +  +   P  P    C Y   Y T  T+     V+ +     G  +   
Sbjct: 146 GVLPCNSPL-SMCAAMAGPSPPPGCACMYNQTYGTGWTAG----VQSVETFTFGSSSTPP 200

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---- 274
                ++  GC    S  + +G A  GL+GLG G +S+ S L        +FS C     
Sbjct: 201 AVRVPNIAFGCSNASSNDW-NGSA--GLVGLGRGSMSLVSQLGA-----GAFSYCLTPFQ 252

Query: 275 DKDDSGRIFFGD------QGPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK--- 322
           D + +  +  G       +G    +ST F+A   K      Y + +    +G + L    
Sbjct: 253 DANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPP 312

Query: 323 -QTSFKA------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK-----CC 370
              S +A      I+DSG++ T L    Y+ + A     +   +    G         C 
Sbjct: 313 DAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCF 372

Query: 371 YKSSSQRLPKLPSVKLMFPQNNSFVVNNPVFVIYGTQV 408
              +S   P +PS+ L F      V+    ++I G+ V
Sbjct: 373 ALKASTPPPAMPSMTLHFEGGADMVLPVENYMILGSGV 410


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 120/320 (37%), Gaps = 48/320 (15%)

Query: 95  LGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRD 150
           LG     L Y   + IGTP V   V +D GSDL W+   PC+   C P     ++     
Sbjct: 116 LGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSS 175

Query: 151 LNEYSPSASSTSKHLSCS--HRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILH 208
                P AS   K L        C   TS   P+  C Y ++ Y     + G+   + L 
Sbjct: 176 TFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQ--CGYAIE-YGNGAITEGVYSTETLA 232

Query: 209 LISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN 268
           L S       ++V  S   GCG  Q G Y D    DGL+GLG    S+ S  A   +   
Sbjct: 233 LGS-------SAVVKSFRFGCGSDQHGPY-DKF--DGLLGLGGAPESLVSQTAS--VYGG 280

Query: 269 SFSMCFDKDDSGRIFFGDQGPATQQS-------TSFLASNGKYIT-YIIGVETCCIGSSC 320
           +FS C    +SG  F     P +  +       T   A + K  T Y++ +    +G   
Sbjct: 281 AFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKA 340

Query: 321 LKQT----SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP--------WK 368
           L       +   IVDSG+  T +P   Y+ +   F   + +       YP          
Sbjct: 341 LDIPPAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAE-------YPLLPPADSALD 393

Query: 369 CCYKSSSQRLPKLPSVKLMF 388
            CY  +      +P V L F
Sbjct: 394 TCYNFTGHGTVTVPKVALTF 413


>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 873

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 127/312 (40%), Gaps = 38/312 (12%)

Query: 55  AKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNV 114
           A ++ E  ++ L    ++ K +T P    L  +     + LG      HY  + IG P  
Sbjct: 6   ASRNLEPLKIELKRKTRQLKNQTSPP---LVYNDAPLGVGLGT-----HYAELYIGIPPQ 57

Query: 115 SFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD 173
              V LD GS L   PCD CV C   +   +++               +K  S +   C 
Sbjct: 58  RASVILDTGSGLTAFPCDKCVDCGTHTDPKFDA---------------TKSTSINFVQCK 102

Query: 174 LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL---HLISGGDNALKNSVQASVIIGCG 230
               C   +         Y+E +    ++++D++   ++ S     +          GC 
Sbjct: 103 YEEGCDTCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMIMRRYGIRFKFGCQ 162

Query: 231 MKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFGDQ-- 287
            +++G ++  V  +G++GLG+G  ++ + + KA  +  + F++CF +     +  G    
Sbjct: 163 TRETGLFITQV-ENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQKGGSFVIGGVDYS 221

Query: 288 GPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QTSFK----AIVDSGSSFTFLPK 341
              T+ + + LA +G    Y I V+   IG   L+     FK    AIVDSG++ T+ P 
Sbjct: 222 HHTTKIAYTPLAKHGTS-NYPIEVKDVRIGGISLQVDAEHFKSGRGAIVDSGTTDTYFPS 280

Query: 342 EVYETIAAEFDR 353
                    F R
Sbjct: 281 AAATPFQEAFKR 292


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 87/336 (25%), Positives = 140/336 (41%), Gaps = 56/336 (16%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSP--SASS 160
           ++  + +GTP  + L+ LD GSD++W P   VR  P        L R + + S   +A +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAP---VRALP-------PLLRAVRQGSSTGAAPA 171

Query: 161 TSKHLSCSHRLCDL--GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALK 218
            +   +C   +C       C   +  C Y +  Y + + ++G    + L    G      
Sbjct: 172 PTPRWNCVAPICRRLDSAGCDRRRNSCLYQV-AYGDGSVTAGDFASETLTFARGA----- 225

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKD 277
             VQ  V IGCG    G +   +A  GL+GLG G +S PS +A++     SFS C  D+ 
Sbjct: 226 -RVQ-RVAIGCGHDNEGLF---IAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRT 278

Query: 278 DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK---------A 328
            S R     +   T +  +F      Y  +++G          + Q+  +          
Sbjct: 279 SSRRARPSRRWGGTPRMATF------YYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 332

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-WKCCYKSSSQRLPKLPSVKLM 387
           I+DSG+S T L + VYE +   F         S  G+  +  CY  S +R+ K+P+V + 
Sbjct: 333 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 392

Query: 388 F---------PQNNSFVVNNP---VFVIYGTQVGVS 411
                     P+N    V+      F + GT  GVS
Sbjct: 393 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 428


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 74/255 (29%), Positives = 104/255 (40%), Gaps = 54/255 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           +  GTP  +F++ LD GS + W  C  CV C   S  Y+N                S   
Sbjct: 132 VAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFN---------------WSASS 176

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
           + S   C  GT   N      Y M  Y ++++S G    D + L         + V    
Sbjct: 177 TYSSGSCIPGTVENN------YNMT-YGDDSTSVGNYGCDTMTL-------EPSDVFQKF 222

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFF 284
             GCG    G +  GV  DG++GLG G++S  S  A        FS C  ++DS G + F
Sbjct: 223 QFGCGRNNKGDFGSGV--DGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLF 278

Query: 285 GDQGPATQQSTSF-----------LASNGKYITYI----IGVETCCIGSSCLKQTSFKAI 329
           G++  AT QS+S            L  +G Y   +    +G E   I SS     S   I
Sbjct: 279 GEK--ATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF--ASPGTI 334

Query: 330 VDSGSSFTFLPKEVY 344
           +DS +  T LP+  Y
Sbjct: 335 IDSRTVITRLPQRAY 349


>gi|219120652|ref|XP_002181060.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407776|gb|EEC47712.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 453

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 69/266 (25%), Positives = 111/266 (41%), Gaps = 39/266 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           H+    +G P  +  + +D GS L    C+ C +C    A  +  LD       P  SST
Sbjct: 82  HHVTAWMGEPPQAQTLIVDTGSRLTATACEPCSQCGTTHAHPFPHLD-------PQRSST 134

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSV 221
            ++  C   L      C   +Q C      YTE +S + + V D   L     ++L+  V
Sbjct: 135 LRYTQCGSCLLSGIQECAA-EQKCGIN-QRYTEGSSWTAVEVSDTFVLGGPEISSLEQYV 192

Query: 222 QASVII--GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDD 278
             ++I   GC  K  G +    A +G++GL   ++S+   L K  +I R SFS+C    +
Sbjct: 193 SFTIIFAFGCQQKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVIPRESFSLCMTPFE 251

Query: 279 SGRIFFG----DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFK------- 327
            G I  G    D+   + + T F ++   Y  +++ V    +G  CL             
Sbjct: 252 -GYIGLGGPLRDKHTESMKYTPFTSTQSWYAVHVVRV---FVGDECLTSNDQHDTVVEHA 307

Query: 328 ----------AIVDSGSSFTFLPKEV 343
                      I+DSG++ T+LPK V
Sbjct: 308 LVEAFAEGKGTILDSGTTDTYLPKAV 333


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 126/324 (38%), Gaps = 50/324 (15%)

Query: 95  LGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRD 150
           LG     L Y   +  GTP V  +V +D GSD+ W+   PC   +C P     Y+     
Sbjct: 104 LGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYD----- 158

Query: 151 LNEYSPSASSTSKHLSCSHRLCD------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVE 204
                PS SST   + C+  +C        G+ C + KQ C + +  Y + TS+ G   +
Sbjct: 159 -----PSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAIS-YADGTSTVGAYSQ 211

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSG--GYLDGVAPDGLIGLGLGEISVPSLLAK 262
           D L L  G       ++  +   GCG  +    G  DGV       LGLG +   SL A+
Sbjct: 212 DKLTLAPG-------AIVQNFYFGCGHGKHAVRGLFDGV-------LGLGRLR-ESLGAR 256

Query: 263 AGLIRNSFSMCFDKDDSGRIFF---GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
            G +   FS C     S   F      + P+    T      G+     + +    +G  
Sbjct: 257 YGGV---FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGK 313

Query: 320 C--LKQTSFKA--IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
              L+ ++F    IVDSG+  T L    Y  + + F R+  +            CY  + 
Sbjct: 314 KLDLRPSAFSGGMIVDSGTVITGLQSTAYRALRSAF-RKAMEAYRLLPNGDLDTCYNLTG 372

Query: 376 QRLPKLPSVKLMFPQNNSFVVNNP 399
            +   +P + L F    +  ++ P
Sbjct: 373 YKNVVVPKIALTFTGGATINLDVP 396


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 104/290 (35%), Gaps = 69/290 (23%)

Query: 120 LDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-- 177
           LD GSDL+W PC    C        N+     +   P  S T+  +SC    C    S  
Sbjct: 97  LDTGSDLVWFPCQPFECILCEGKAENT--SLASTPPPKLSKTATPVSCKSSACSAAHSNL 154

Query: 178 --------------------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
                               CQ  K  CP     Y + +  + L  + I   +S   N +
Sbjct: 155 PSSDLCAISNCPLESIETSDCQ--KHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTNLI 212

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC--- 273
            N+       GC       +     P G+ G G G +S+P+ LA  +  + N FS C   
Sbjct: 213 VNNF----TFGCA------HTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVS 262

Query: 274 ------------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
                             +D D+  R   G   P     TS L +      Y +G+E   
Sbjct: 263 HSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGIS 321

Query: 316 IGSSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
           IG   +    F            +VDSG++FT LP  +Y ++ AEF+ +V
Sbjct: 322 IGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRV 371


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 126/317 (39%), Gaps = 47/317 (14%)

Query: 94  SLGNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G D G L+Y     +GTP V+  + +D GSDL W+ C     AP   S  + L     
Sbjct: 130 SWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPL----- 184

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHL 209
            + P+ SS+   + C   +C  G               Y   Y + ++++G+   D L L
Sbjct: 185 -FDPAQSSSYAAVPCGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRN 268
                 +  ++VQ     GCG  QS G  +GV  DGL+GLG  +   PSL+ + AG    
Sbjct: 243 ------SASSAVQG-FFFGCGHAQS-GLFNGV--DGLLGLGREQ---PSLVEQTAGTYGG 289

Query: 269 SFSMCFDKDDS--GRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            FS C     S  G +  G  GP+       +T  L S      Y++ +    +G   L 
Sbjct: 290 VFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349

Query: 323 --QTSFKAIVDSGSSFTF--LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCY 371
              ++F       +      LP   Y  + + F       + S+ GYP          CY
Sbjct: 350 VPASAFAGGTVVDTGTVVTRLPPTAYAALRSAF----RSGMASY-GYPTAPSNGILDTCY 404

Query: 372 KSSSQRLPKLPSVKLMF 388
             +      LP+V L F
Sbjct: 405 NFAGYGTVTLPNVALTF 421


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 104/290 (35%), Gaps = 69/290 (23%)

Query: 120 LDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTS-- 177
           LD GSDL+W PC    C        N+     +   P  S T+  +SC    C    S  
Sbjct: 97  LDTGSDLVWFPCQPFECILCEGKAENT--SLASTPPPKLSKTATPVSCKSSACSAAHSNL 154

Query: 178 --------------------CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNAL 217
                               CQ  K  CP     Y + +  + L  + I   +S   N +
Sbjct: 155 PSSDLCAISNCPLESIETSDCQ--KHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTNLI 212

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC--- 273
            N+       GC       +     P G+ G G G +S+P+ LA  +  + N FS C   
Sbjct: 213 VNNF----TFGCA------HTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVS 262

Query: 274 ------------------FDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCC 315
                             +D D+  R   G   P     TS L +      Y +G+E   
Sbjct: 263 HSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGIS 321

Query: 316 IGSSCLKQTSF----------KAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
           IG   +    F            +VDSG++FT LP  +Y ++ AEF+ +V
Sbjct: 322 IGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRV 371


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 116/280 (41%), Gaps = 43/280 (15%)

Query: 100 GWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSA 158
           G L Y   + IGTP       LD GSDL+W      +CAP +    + L +    ++P  
Sbjct: 92  GDLEYVVDLAIGTPPQPVSALLDTGSDLIW-----TQCAPCA----SCLSQPDPLFAPGQ 142

Query: 159 SSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           S++ + + C+  LC   L  SC+ P   C Y  + Y + T + G+   +     S     
Sbjct: 143 SASYEPMRCAGTLCSDILHHSCERPDT-CTYRYN-YGDGTMTVGVYATERFTFAS-SGGG 199

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
              +    +  GCG    G   +G    G++G G   +S+ S L+    IR  FS C   
Sbjct: 200 GLTTTTVPLGFGCGSVNVGSLNNG---SGIVGFGRNPLSLVSQLS----IRR-FSYCLTS 251

Query: 277 DDSGR---IFFGD-----QGPATQ--QSTSFLASNGKYITYIIGVETCCIGSSCLK--QT 324
             S R   + FG       G AT   Q+T  L S      Y +      +G+  L+  ++
Sbjct: 252 YASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPES 311

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
           +F          IVDSG++ T LP  V   +   F +Q+ 
Sbjct: 312 AFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLR 351


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 133/345 (38%), Gaps = 83/345 (24%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLD-----RDLNEYSPSASST 161
           +++GTP   F V LD GSDL W+PC        S+S Y  LD     +    + PS S++
Sbjct: 29  LNLGTPPQVFQVYLDTGSDLTWVPCG-------SSSSYQCLDCGSSVKPTPTFLPSESTS 81

Query: 162 SKHLSCSHRLC-DLGTS-----------CQNP-------KQPCPYTMDYYTENTSSSGLL 202
           +    C  R C D+ +S           C  P        +PCP     Y       G L
Sbjct: 82  NTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSL 141

Query: 203 VED--ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLL 260
             D   LH  + G  A    +  +   G G    G  +    P G+ G G G +S+PS L
Sbjct: 142 SRDSVTLHGSTHGSGAGAGPLPVA-FPGFGFGCVGSSIR--EPLGIAGFGRGALSLPSQL 198

Query: 261 AKAGLIRNSFSMCF-------DKDDSGRIFFGDQGPATQQS------TSFLASNGKYITY 307
              G +   FS CF       + + +  +  GD   ++  +      T  L S      Y
Sbjct: 199 ---GFLGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFY 255

Query: 308 IIGVETCCIG--------------SSCLKQTSFKAIVDSGSSFTFLPKEVYETI------ 347
            +G+E   +G              S    Q +   +VD+G+++T LP   Y ++      
Sbjct: 256 YVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLIS 315

Query: 348 -AAEFDRQVN-DTITSFEGYPWKCCYKSSSQRLP----KLPSVKL 386
            A  ++R  + +  T F+      C+K    R P    +LP + L
Sbjct: 316 AAPPYERSRDLEARTGFD-----LCFKVPCARAPCADDELPPITL 355


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 81/305 (26%), Positives = 130/305 (42%), Gaps = 44/305 (14%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP     + LD GSD++W+ C  C  C       Y+  D   N   P  S +
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNC-------YSQTDPVFN---PVKSGS 178

Query: 162 SKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
              + C   LC  L +   N +Q C Y +  Y + + ++G  V + L          + +
Sbjct: 179 FAKVLCRTPLCRRLESPGCNQRQTCLYQVS-YGDGSYTTGEFVTETL--------TFRRT 229

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRN-SFSMCF-DKDD 278
               V +GCG    G +   V   GL+GLG G +S PS   +AG   N  FS C  D+  
Sbjct: 230 KVEQVALGCGHDNEGLF---VGAAGLLGLGRGGLSFPS---QAGRTFNQKFSYCLVDRSA 283

Query: 279 SGR---IFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIGSSCLKQTSFK---- 327
           S +   + FG+   +     + L +N +  T+    ++G+       S +  + FK    
Sbjct: 284 SSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRT 343

Query: 328 ----AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPS 383
                I+D G+S T L K  Y  +   F    +   ++ E   +  CY  S +   K+P+
Sbjct: 344 GNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPT 403

Query: 384 VKLMF 388
           V L F
Sbjct: 404 VVLHF 408


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 125/317 (39%), Gaps = 47/317 (14%)

Query: 94  SLGNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G D G L+Y     +GTP V+  + +D GSDL W+ C     AP   S Y+  D    
Sbjct: 38  SWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAP---SCYSQKDP--- 91

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHL 209
            + P+ SS+   + C   +C  G               Y   Y + ++++G+   D L L
Sbjct: 92  LFDPAQSSSYAAVPCGGPVCA-GLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 150

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRN 268
                 +  ++VQ     GCG  QS G  +GV  DGL+GLG  +   PSL+ + AG    
Sbjct: 151 ------SASSAVQG-FFFGCGHAQS-GLFNGV--DGLLGLGREQ---PSLVEQTAGTYGG 197

Query: 269 SFSMCFDKDDS--GRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            FS C     S  G +  G  GP+       +T  L S      Y++ +    +G   L 
Sbjct: 198 VFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 257

Query: 323 --QTSFKAIVDSGSSFTF--LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCY 371
              ++F       +      LP   Y  + + F   +        GYP          CY
Sbjct: 258 VPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMAS-----YGYPTAPSNGILDTCY 312

Query: 372 KSSSQRLPKLPSVKLMF 388
             +      LP+V L F
Sbjct: 313 NFAGYGTVTLPNVALTF 329


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 125/316 (39%), Gaps = 51/316 (16%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 170 SSGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTW-----VQCQPCVVVCYEQREK--- 221

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P+ SST  ++SC+   C DL    C      C Y +  Y + + S G    D L L 
Sbjct: 222 LFDPARSSTYANVSCAAPACSDLNIHGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 278

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 279 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 325

Query: 270 FSMCFDKDDSGRIFF-----GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
           F+ C     +G  +           + + +T  L  NG    Y +G+    +G   L   
Sbjct: 326 FAHCLPARSTGTGYLDFGAGSLAAASARLTTPMLTDNGPTF-YYVGMTGIRVGGQLLSIP 384

Query: 323 QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYK 372
           Q+ F     IVDSG+  T LP   Y ++     R       +  GY           CY 
Sbjct: 385 QSVFATAGTIVDSGTVITRLPPAAYSSL-----RYAFAAAMAARGYKKAPAVSLLDTCYD 439

Query: 373 SSSQRLPKLPSVKLMF 388
            +      +P+V L+F
Sbjct: 440 FTGMSQVAIPTVSLLF 455


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 74/326 (22%), Positives = 122/326 (37%), Gaps = 56/326 (17%)

Query: 100 GWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSAS 159
           G ++Y+ I +G+P   F + +D GSDL W+ CD   C+P  +S ++ L          AS
Sbjct: 121 GGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCD--PCSPDCSSTFDRL----------AS 168

Query: 160 STSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           +T K L+C+  L              P  +  +      SG  + D L +     + L+ 
Sbjct: 169 NTYKALTCADDL------------RLPVLLRLW-RRLFHSGRSLRDTLKMAGAASDELEE 215

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS 279
                 + GCG    G     V   G++ L  G +S PS + +     N FS C  +  +
Sbjct: 216 F--PGFVFGCGSLLKGLISGEV---GILALSPGSLSFPSQIGEK--YGNKFSYCLLRQTA 268

Query: 280 GR------IFFGDQ-------GPATQQSTSFLASNGKYITYIIGVETCCIG--------S 318
                   + FG+        G    Q   +       I Y + ++   +G        S
Sbjct: 269 QNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPS 328

Query: 319 SCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKSSSQR 377
           + L       I DSG++ T LP  V ++I       V+     + +G     C++     
Sbjct: 329 TFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKG--LDACFRVPPSS 386

Query: 378 LPKLPSVKLMFPQNNSFVVNNPVFVI 403
              LP +   F     FV     +VI
Sbjct: 387 GQGLPDITFHFNGGADFVTRPSNYVI 412


>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
 gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
          Length = 475

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 47/180 (26%), Positives = 80/180 (44%), Gaps = 19/180 (10%)

Query: 178 CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGY 237
           C N K  C Y+  Y  E +SS G +VED             +     ++ GC   ++G  
Sbjct: 2   CNNEK--CYYSRTY-AERSSSEGWMVEDAFGFP-------DDQPPVRMVFGCENGETGEI 51

Query: 238 LDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGDQGPATQQSTSF 297
              +A DG++G+G    +  S L   G+I + FS+CF     G +  GD       +T +
Sbjct: 52  YRQLA-DGIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVY 110

Query: 298 --LASNGKYITYIIGVETCCIGSSCLKQTS------FKAIVDSGSSFTFLPKEVYETIAA 349
             L +N     Y + ++   +    L   +      +  ++DSG++FT+LP E +  +AA
Sbjct: 111 TPLLNNLHLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAA 170


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 119/298 (39%), Gaps = 75/298 (25%)

Query: 97  NDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPC----DCVRCAPLSASYYNSLDRDLN 152
           + +G   +T + +GTP     V LD GS L W+PC     C  C+ LSA+        L+
Sbjct: 84  HSYGGYAFT-VSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAA------SPLH 136

Query: 153 EYSPSASSTSKHLSCSHRLC------DLGTSCQ---------------NPKQPCPYTMDY 191
            + P  SS+S+ + C +  C      D  + C+               N    CP  +  
Sbjct: 137 VFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVV 196

Query: 192 YTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGL 251
           Y    S++GLL+ D L        A++N      +IGC +           P GL G G 
Sbjct: 197 YGSG-STAGLLISDTLRTPG---RAVRN-----FVIGCSLASV-----HQPPSGLAGFGR 242

Query: 252 GEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD---------------QGPATQQSTS 296
           G  SVPS L   GL + S+ +   + D      G+               Q     +S S
Sbjct: 243 GAPSVPSQL---GLTKFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSAS 299

Query: 297 FLASNGKYITYIIGVETCCIG--SSCLKQTSF-------KAIVDSGSSFTFLPKEVYE 345
             A     + Y + +    +G  S  L + +F        AIVDSG++F++  + V+E
Sbjct: 300 --ARPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFE 355


>gi|156099262|ref|XP_001615633.1| aspartic protease PM5 [Plasmodium vivax Sal-1]
 gi|148804507|gb|EDL45906.1| aspartic protease PM5 [Plasmodium vivax]
          Length = 536

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 61/288 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           IDIGTP     + LD GS  L  PC  C  C               N ++ + S TS  L
Sbjct: 66  IDIGTPEQRISLILDTGSSSLSFPCAGCKNCGVHME----------NPFNLNNSKTSSIL 115

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C +  C    +C   K  C Y M  Y E +  SG    D++ ++S  +      V    
Sbjct: 116 YCENEECPFKLNCVKGK--CEY-MQSYCEGSQISGFYFSDVVSVVSYNN----ERVTFRK 168

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-VPS----LLAKAGLIRNSFSMCFDKDDSG 280
           ++GC M +   +L   A  G++G+ L +   +P+    L   A  ++  F++C   ++ G
Sbjct: 169 LMGCHMHEESLFLYQQA-TGVLGMSLSKPQGIPTFVNLLFDNAPQLKQVFTICIS-ENGG 226

Query: 281 RIFFGDQGP---------------------------------ATQQSTSFLASN--GKYI 305
            +  G   P                                 A +++   +  N   KY 
Sbjct: 227 ELIAGGYDPAYIVRRGGSKSVSGQGSGPVSESLSESGEDPQVALREAEKVVWENVTRKYY 286

Query: 306 TYIIGVETCCIGSSCLKQTS-FKAIVDSGSSFTFLPKEVYETIAAEFD 352
            YI        G++ +  +   + +VDSGS+FT +P+++Y  +   FD
Sbjct: 287 YYIKVRGLDMFGTNMMSSSKGLEMLVDSGSTFTHIPEDLYNKLNYFFD 334


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 126/317 (39%), Gaps = 47/317 (14%)

Query: 94  SLGNDFGWLHYTWI-DIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G D G L+Y     +GTP V+  + +D GSDL W+ C     AP   S  + L     
Sbjct: 130 SWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPL----- 184

Query: 153 EYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHL 209
            + P+ SS+   + C   +C  G               Y   Y + ++++G+   D L L
Sbjct: 185 -FDPAQSSSYAAVPCGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242

Query: 210 ISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRN 268
                 +  ++VQ     GCG  QS G  +GV  DGL+GLG  +   PSL+ + AG    
Sbjct: 243 ------SASSAVQG-FFFGCGHAQS-GLFNGV--DGLLGLGREQ---PSLVEQTAGTYGG 289

Query: 269 SFSMCFDKDDS--GRIFFGDQGPATQ----QSTSFLASNGKYITYIIGVETCCIGSSCLK 322
            FS C     S  G +  G  GP+       +T  L S      Y++ +    +G   L 
Sbjct: 290 VFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS 349

Query: 323 --QTSFKAIVDSGSSFTF--LPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCY 371
              ++F       +      LP   Y  + + F       + S+ GYP          CY
Sbjct: 350 VPASAFAGGTVVDTGTVVTRLPPTAYAALRSAF----RSGMASY-GYPTAPSNGILDTCY 404

Query: 372 KSSSQRLPKLPSVKLMF 388
             +      LP+V L F
Sbjct: 405 NFAGYGTVTLPNVALTF 421


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 114/304 (37%), Gaps = 57/304 (18%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IGTP      ALD  SDL+W  C     AP               ++P  S+T   + C+
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGAT--AP---------------FNPVRSTTVADVPCT 148

Query: 169 HRLCD------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQ 222
              C        G         C YT  Y     +++GLL  +       GD  +     
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTF---GDTRIDG--- 202

Query: 223 ASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS--- 279
             V+ GCG++  G +  GV+  G+IGLG G +S+ S L       + FS  F  DDS   
Sbjct: 203 --VVFGCGLQNVGDF-SGVS--GVIGLGRGNLSLVSQLQV-----DRFSYHFAPDDSVDT 252

Query: 280 -GRIFFGDQG-PATQQ--STSFLASNGKYITYIIGVETCCIGSS--CLKQTSFKAIVDSG 333
              I FGD   P T    ST  LAS+     Y + +    +      +   +F      G
Sbjct: 253 QSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDG 312

Query: 334 SSFTFLPKEVYETIAAEFD-RQVNDTITSFEGYP--------WKCCYKSSSQRLPKLPSV 384
           S   FL      T+  E   + +   + S  G P           CY   S    K+PS+
Sbjct: 313 SGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSM 372

Query: 385 KLMF 388
            L+F
Sbjct: 373 ALVF 376


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 80/315 (25%), Positives = 138/315 (43%), Gaps = 56/315 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +GTP  +  + +D GS+L W+ C+  + +  S+S +N +    + YSP   S+S   +
Sbjct: 77  LTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWS--SSYSPIPCSSS---T 131

Query: 167 CSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQAS 224
           C+ +  D  +  SC +  Q C  T+  Y + +SS G L  D  ++ S G          +
Sbjct: 132 CTDQTRDFPIRPSCDS-NQFCHATLS-YADASSSEGNLATDTFYIGSSG--------IPN 181

Query: 225 VIIGCGMKQ--SGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD-SGR 281
           V+ GC M    S    +     GL+G+  G +   S +++ G  +  FS C  + D SG 
Sbjct: 182 VVFGC-MDSIFSSNSEEDSKNTGLMGMNRGSL---SFVSQMGFPK--FSYCISEYDFSGL 235

Query: 282 IFFGDQG----------PATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSF--- 326
           +  GD            P  + ST     +   + Y + +E   +    L   ++ F   
Sbjct: 236 LLLGDANFSWLAPLNYTPLIEMSTPLPYFD--RVAYTVQLEGIKVAHKLLPIPESVFEPD 293

Query: 327 -----KAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW------KCCYK--S 373
                + +VDSG+ FTFL    Y  +   F  +   ++  +E   +        CY+  +
Sbjct: 294 HTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPT 353

Query: 374 SSQRLPKLPSVKLMF 388
           +  RLP LPSV L+F
Sbjct: 354 NQTRLPPLPSVTLVF 368


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 111/280 (39%), Gaps = 60/280 (21%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP     + LD GS L WI   C + AP       S D       PS SST   L 
Sbjct: 101 LPIGTPPQVQPMVLDTGSQLSWI--QCHKKAPAKPPPTASFD-------PSLSSTFSTLP 151

Query: 167 CSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           C+H +C        L TSC   +  C Y+  +Y + T + G LV +            ++
Sbjct: 152 CTHPVCKPRIPDFTLPTSCDQNRL-CHYSY-FYADGTYAEGNLVREKFTFS-------RS 202

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-------------VPSLLAKAGLI 266
                +I+GC  + +        P G++G+  G +S             VP+ + + G  
Sbjct: 203 LFTPPLILGCATESTD-------PRGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYT 255

Query: 267 -RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS 325
              SF +  + + +   +      A  Q       N   + Y + ++   IG   L  + 
Sbjct: 256 PTGSFYLGHNPNSNTFRYIEMLTFARSQRM----PNLDPLAYTVALQGIRIGGRKLNISP 311

Query: 326 --FKA--------IVDSGSSFTFLPKEVYETIAAEFDRQV 355
             F+A        ++DSGS FT+L  E Y+ + AE  R V
Sbjct: 312 AVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAV 351


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/319 (26%), Positives = 130/319 (40%), Gaps = 45/319 (14%)

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNS 146
           QG     +G   G  +++ + IG+P     + LD GSD+ W+ C  C  C       Y  
Sbjct: 155 QGPVVSGVGQGSGE-YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADC-------YQQ 206

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVE 204
            D     + PS S++   +SC    C DL T +C+N    C Y +  Y + + + G    
Sbjct: 207 SD---PVFDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEV-AYGDGSYTVGDFAT 262

Query: 205 DILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
           + L L  G    + N     V IGCG    G +   V   GL+ LG G +S PS ++   
Sbjct: 263 ETLTL--GDSTPVTN-----VAIGCGHDNEGLF---VGAAGLLALGGGPLSFPSQISA-- 310

Query: 265 LIRNSFSMCF-DKDD--SGRIFFGDQGPATQQSTSFLASNGKYIT-YIIGVETCCIGSSC 320
              ++FS C  D+D   +  + FG  G      T+ L  + +  T Y + +    +G   
Sbjct: 311 ---STFSYCLVDRDSPAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQA 367

Query: 321 LK-----------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC 369
           L              S   IVDSG++ T L    Y  +   F R       +     +  
Sbjct: 368 LSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDT 427

Query: 370 CYKSSSQRLPKLPSVKLMF 388
           CY  S +   ++P+V L F
Sbjct: 428 CYDLSDRTSVEVPAVSLRF 446


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 107/267 (40%), Gaps = 39/267 (14%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I IGTP V+ +   D GSDL W  C  C  C   S   +N          P  SS+ + +
Sbjct: 94  IFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFN----------PRRSSSYRKV 143

Query: 166 SCSHRLCDLGTS--CQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQA 223
           SC+   C    S  C    Q C Y    Y + + + G L  D    I+ G   L  +V  
Sbjct: 144 SCASDTCRSLESYHCGPDLQSCSYGYS-YGDRSFTYGDLASD---QITIGSFKLPKTV-- 197

Query: 224 SVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-----DKDD 278
              IGCG  Q+GG   GV    +   G     V  +   AG ++  FS C      + + 
Sbjct: 198 ---IGCG-HQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAG-VKPRFSYCLPTFFSNANI 252

Query: 279 SGRIFFGDQGPAT--QQSTSFLASNGKYITYIIGVETCCIGSSCLKQTS--------FKA 328
           +G I FG +   +  Q  ++ L        Y + +E   +G    K  +           
Sbjct: 253 TGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNI 312

Query: 329 IVDSGSSFTFLPKEVYETIAAEFDRQV 355
           I+DSG++ T LP+ +Y  + +   R +
Sbjct: 313 IIDSGTTLTLLPRSLYYGVFSTLARVI 339


>gi|68071623|ref|XP_677725.1| aspartyl (acid) protease [Plasmodium berghei strain ANKA]
 gi|56497949|emb|CAH98861.1| aspartyl (acid) protease, putative [Plasmodium berghei]
          Length = 518

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 108/281 (38%), Gaps = 50/281 (17%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           I+IGTP     + +D GS  L  PC +C  C               N ++ + SSTS  L
Sbjct: 59  INIGTPGQKLSLIVDTGSSSLSFPCSECKDCGVHME----------NPFNLNNSSTSSIL 108

Query: 166 SCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASV 225
            C+  +C     C   K  C Y +  Y E +  +G    DI+ L S  +N    ++    
Sbjct: 109 YCNDNICPYNLKC--VKGRCEY-LQSYCEGSRINGFYFSDIVRLES-NNNTKNGNITFKK 164

Query: 226 IIGCGMKQSGGYLDGVAPDGLIGLGLGE-ISVPS----LLAKAGLIRNSFSMCFDKDDSG 280
            +GC M + G +L   A  G++GL L +   VP+    L   +  +   FS+C  +    
Sbjct: 165 HMGCHMHEEGLFLHQHAT-GVLGLSLTKPKGVPTFIDLLFKSSPKLNKIFSLCISEYGGE 223

Query: 281 RIFFGDQGPATQQSTS----------------------------FLASNGKYITYIIGVE 312
            I  G       +  S                            + A   KY  YI    
Sbjct: 224 LILGGYSKDYIVKEVSIDEKKDNIEHNKNENINSINKSIVDGILWEAITRKYYYYIRVKG 283

Query: 313 TCCIGSS-CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFD 352
               G++      S + +VDSGS+FT LP ++Y  +   FD
Sbjct: 284 FQLFGTTFSHNNKSMEMLVDSGSTFTHLPDDLYNNLNFFFD 324


>gi|342876649|gb|EGU78232.1| hypothetical protein FOXB_11258 [Fusarium oxysporum Fo5176]
          Length = 588

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 135/342 (39%), Gaps = 41/342 (11%)

Query: 28  STKLIHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPS 87
           +T++I R++    A  +  + +A S+P  K+ + Y  +    V++ K K   + Q     
Sbjct: 68  TTEVIMRWTTITTAAALLGSVDAVSFPRSKAGKGYLSMHVGTVERNKNKQHDKRQ----- 122

Query: 88  QGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSL 147
            G        +  + + T I+IGTP     V LD GS+ LW+  DC      SA  YN  
Sbjct: 123 DGDAIAVRLENKDFFYATDIEIGTPPQKVTVLLDTGSNELWVNPDCQEAQ--SALQYNQC 180

Query: 148 DRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
             D  +Y P  S T           + G +  +       T+ YYT+             
Sbjct: 181 -LDFGQYDPRKSKTPPIGPFGGETLNYGDASDSSTH-TSATIRYYTD------------- 225

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL--LAKAGL 265
            L++ GD+ L+N     ++   G+ Q  G L G+APD   G    E     L  +A+ GL
Sbjct: 226 -LMTFGDSKLRNQTFGVLVESNGISQ--GIL-GLAPDLRAGFDGDEPYSLLLTSMAEQGL 281

Query: 266 IRN---SFSMCFDKDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSC 320
           I +   +  +    D  G + +G  D+     +  +     G+   Y + VE   +G + 
Sbjct: 282 INSRVFALDLRHSDDTEGALIYGGIDRSKYIGKLETRPIIRGEGGEYRLAVELNSLGVTI 341

Query: 321 LKQTSF--------KAIVDSGSSFTFLPKEVYETIAAEFDRQ 354
              T            ++DSG++ T +   V   I    D Q
Sbjct: 342 SGDTQHIRVSSSDSNVMLDSGTTLTRMHMSVARPILEALDAQ 383


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 69/304 (22%), Positives = 115/304 (37%), Gaps = 49/304 (16%)

Query: 100 GWLHYTWIDIGTPNVSF---LVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYS 155
           G  +   + IGTP        V  D GSDL W  C+ C  C+  +             + 
Sbjct: 120 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTP---------YPPHD 170

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           PS S T + LSC   +C+L T+  +       C +    Y +  + SG LV D+ H  + 
Sbjct: 171 PSKSRTFRRLSCFDPMCELCTAVVDGGGGSAGCLFRR-RYGDGGAVSGELVSDVFHFGAA 229

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           GD      ++  V  GC   +    + G +  G++ LG+G+   PS + + G+ R  FS 
Sbjct: 230 GDGG-GYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGVDR--FSY 282

Query: 273 CF-------------DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           C              ++  +  + FG     T +   F      Y   +  V     G  
Sbjct: 283 CIPASEITDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRL 342

Query: 320 CLKQ------------TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
             +Q             +   +VDSG++  +LP  V+  +    +  ++ T      +P 
Sbjct: 343 NQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPS 402

Query: 368 KCCY 371
             CY
Sbjct: 403 LYCY 406


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 85/185 (45%), Gaps = 23/185 (12%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + +GTP   F + +D GSDL WI C+     P + +  NS       Y  S+SS+ 
Sbjct: 27  YFVELRVGTPAKKFPLIIDTGSDLTWIQCN----PPNTTA--NSSSPPAPWYDKSSSSSY 80

Query: 163 KHLSCSHRLC-----DLGTSCQ-NPKQPCPYTMDYYTENTSSSGLLVEDILHLIS----- 211
           + + C+   C      +G+SC      PC YT   Y++ + ++G+L  + + + S     
Sbjct: 81  REIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYG-YSDQSRTTGILAYETISMKSRKRSG 139

Query: 212 --GGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS 269
              G++  +     +V +GC  +  G    G +  G++GLG G IS+ +      L    
Sbjct: 140 KRAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGI 196

Query: 270 FSMCF 274
           FS C 
Sbjct: 197 FSYCL 201


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 69/304 (22%), Positives = 115/304 (37%), Gaps = 49/304 (16%)

Query: 100 GWLHYTWIDIGTPNVSF---LVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYS 155
           G  +   + IGTP        V  D GSDL W  C+ C  C+  +             + 
Sbjct: 99  GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTP---------YPPHD 149

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           PS S T + LSC   +C+L T+  +       C +    Y +  + SG LV D+ H  + 
Sbjct: 150 PSKSRTFRRLSCFDPMCELCTAVVDGGGGSAGCLFRR-RYGDGGAVSGELVSDVFHFGAA 208

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           GD      ++  V  GC   +    + G +  G++ LG+G+   PS + + G+ R  FS 
Sbjct: 209 GDGG-GYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGVDR--FSY 261

Query: 273 CF-------------DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSS 319
           C              ++  +  + FG     T +   F      Y   +  V     G  
Sbjct: 262 CIPASEITDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRL 321

Query: 320 CLKQ------------TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPW 367
             +Q             +   +VDSG++  +LP  V+  +    +  ++ T      +P 
Sbjct: 322 NQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPS 381

Query: 368 KCCY 371
             CY
Sbjct: 382 LYCY 385


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 129/313 (41%), Gaps = 46/313 (14%)

Query: 94  SLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDL 151
           +LG     L Y   + +G+P  S  + +D GSD+ W+ C  C +C   +   ++      
Sbjct: 118 ALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD------ 171

Query: 152 NEYSPSASSTSKHLSCSHRLC----DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
               PS+SST    SC    C      G  C +  Q C Y +  Y + +S++G    D L
Sbjct: 172 ----PSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQ-CQYIVT-YGDGSSTTGTYSSDTL 225

Query: 208 HLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIR 267
            L   G +A+K     S   GC   +S G+ D    DGL+GLG G  S+ S    AG + 
Sbjct: 226 AL---GSSAVK-----SFQFGCSNVES-GFNDQT--DGLMGLGGGAQSLVS--QTAGTLG 272

Query: 268 NSFSMCFDKDDSGRIFFGDQGPATQQSTSF-----LASNGKYITYIIGVETCCIGSSCLK 322
            +FS C     S   F          ++ F     L S+     Y + ++   +G    +
Sbjct: 273 RAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGG---R 329

Query: 323 QTSFKA-------IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
           Q S  A       ++DSG+  T LP   Y  +++ F   +     +        C+  S 
Sbjct: 330 QLSIPASVFSAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSG 389

Query: 376 QRLPKLPSVKLMF 388
           Q    +PSV L+F
Sbjct: 390 QSSVSIPSVALVF 402


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 133/313 (42%), Gaps = 52/313 (16%)

Query: 104 YTWIDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLNEYSPSASST 161
           +  I +GTP V  LV +D GS + W+ C    V C       Y    R    ++ S+SST
Sbjct: 24  FMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHC-------YTQDQRAGPTFNTSSSST 76

Query: 162 SKHLSCSHRLC-------DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD 214
            + + CS ++C       ++ + C   +  C Y++  Y     S+G L +D L L     
Sbjct: 77  YRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLR-YASGEYSAGYLSQDRLTL----- 130

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            A   S+Q   I GCG   S    +G +  G+IG G    S  + +A+     ++FS CF
Sbjct: 131 -ANSYSIQ-KFIFGCG---SDNRYNGHSA-GIIGFGNKSYSFFNQIAQL-TNYSAFSYCF 183

Query: 275 DKDDSGRIFFGDQGPATQQSTSFLASN----GKYI-TYIIGVETCCIGSSCLK-----QT 324
             +     F    GP  + S   + +     G ++  Y +      +    L+      T
Sbjct: 184 PSNQENEGFLS-IGPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYT 242

Query: 325 SFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY-----PWKCCYKSSSQRL- 378
           +   +VDSG+  TF+   V+  +    DR +   + + EGY       + C+ S+   + 
Sbjct: 243 TRMTVVDSGTVETFVLSPVFRAL----DRALTKAMVA-EGYVRGSDSKEICFHSNGDSVD 297

Query: 379 -PKLPSVKLMFPQ 390
             KLP V++ F +
Sbjct: 298 WSKLPVVEIKFSR 310


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 88/176 (50%), Gaps = 27/176 (15%)

Query: 62  YQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALD 121
           +  + SSD + ++ ++     +L  +Q S  +SLG+     ++  + IG+P  S+ + LD
Sbjct: 12  HHRIQSSDHRHRRGRS-----LLQTAQVSSGLSLGSG---EYFARMGIGSPQRSYYLELD 63

Query: 122 AGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDL--GTSCQ 179
            GSD+ WI     +CAP S S Y+ +D     Y PS SS+ + + C   LC     ++CQ
Sbjct: 64  TGSDVTWI-----QCAPCS-SCYSQVD---PIYDPSNSSSYRRVYCGSALCQALDYSACQ 114

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSG 235
                C Y +  Y ++++SSG L  +  +L      A++N     +  GCG   SG
Sbjct: 115 G--MGCSYRV-VYGDSSASSGDLGIESFYLGPNSSTAMRN-----IAFGCGHSNSG 162


>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 410

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 112/265 (42%), Gaps = 36/265 (13%)

Query: 96  GNDFGWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD--CVRCAPLSASYYNSLDRDLN 152
           GN +   H+T  + IG P   F + +D GSDL W+ CD  C  C         +L  D  
Sbjct: 47  GNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGC---------TLPHD-R 96

Query: 153 EYSPSASSTSKHLSCSHRLCDL-----GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDIL 207
            Y P     +  + C   LC        + C+NP   C Y ++ Y ++ SS G+LV+D +
Sbjct: 97  LYKPH----NNVVRCGEPLCSALFSASKSPCKNPNDQCDYEVE-YADHGSSIGVLVKDPV 151

Query: 208 HLISGGDNALKNS--VQASVIIGCGMKQ-SGGYLDGVAPDGLIGLGLGEISVPSLLAKAG 264
            L       L N   +  ++  GCG  Q +GG        G++GLG  + ++ + L+   
Sbjct: 152 PL------RLTNGTILAPNLGFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALS 205

Query: 265 LIRNSFSMC-FDKDDSGRIFFGDQGPATQQS-TSFLASNGKYITYIIGVETCCIGSSCLK 322
            +RN    C   +      F GD  P++  S    L + G    Y  G      G + + 
Sbjct: 206 HVRNVLGHCFSGQGGGFLFFGGDLVPSSGMSWMPILRTPGG--KYSAGPAEVYFGGNPVG 263

Query: 323 QTSFKAIVDSGSSFTFLPKEVYETI 347
                   DSGSS+T+   +VY  +
Sbjct: 264 IRGLILTFDSGSSYTYFNSQVYGAV 288


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 124/316 (39%), Gaps = 51/316 (16%)

Query: 94  SLGNDFGWLHYT-WIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLN 152
           S G   G  +Y   + +GTP   + V  D GSD  W     V+C P     Y   ++   
Sbjct: 168 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW-----VQCQPCVVVCYEQQEK--- 219

Query: 153 EYSPSASSTSKHLSCSHRLC-DLGT-SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLI 210
            + P  SST  ++SC+   C DL    C      C Y +  Y + + S G    D L L 
Sbjct: 220 LFDPVRSSTYANVSCAAPACSDLNIHGCSGGH--CLYGVQ-YGDGSYSIGFFAMDTLTLS 276

Query: 211 SGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP-SLLAKAGLIRNS 269
           S   +A+K         GCG +  G + +     GL+GLG G+ S+P     K G +   
Sbjct: 277 S--YDAVKG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV--- 323

Query: 270 FSMCFDKDDSGRIFF-----GDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
           F+ C     +G  +           + + +T  L  NG    Y IG+    +G   L   
Sbjct: 324 FAHCLPARSTGTGYLDFGAGSPAAASARLTTPMLTDNGPTF-YYIGMTGIRVGGQLLSIP 382

Query: 323 QTSFK---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYP-------WKCCYK 372
           Q+ F     IVDSG+  T LP   Y ++     R       +  GY           CY 
Sbjct: 383 QSVFATAGTIVDSGTVITRLPPPAYSSL-----RYAFAAAMAARGYKKAPAVSLLDTCYD 437

Query: 373 SSSQRLPKLPSVKLMF 388
            +      +P+V L+F
Sbjct: 438 FTGMSQVAIPTVSLLF 453


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 103/282 (36%), Gaps = 49/282 (17%)

Query: 100 GWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPS 157
           G L Y   + +GTP       LD GSDL+W  CD C  C          L +    +SP 
Sbjct: 94  GDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTAC----------LRQPDPLFSPR 143

Query: 158 ASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
            SS+ + + C+ +LC   L  SC  P   C Y   Y    T+      E      S G+ 
Sbjct: 144 MSSSYEPMRCAGQLCGDILHHSCVRPDT-CTYRYSYGDGTTTLGYYATERFTFASSSGET 202

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                    +  GCG    G   +     G++G G   +S+ S L+    IR  FS C  
Sbjct: 203 Q-----SVPLGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLS----IRR-FSYCLT 249

Query: 276 KDDSGR---IFFG--------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
              S R   + FG        D      Q+T  L S      Y +      +G+  L+  
Sbjct: 250 PYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIP 309

Query: 323 --------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
                     S   I+DSG++ T  P  V   +   F  Q+ 
Sbjct: 310 ASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLR 351


>gi|403414885|emb|CCM01585.1| predicted protein [Fibroporia radiculosa]
          Length = 414

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 114/312 (36%), Gaps = 62/312 (19%)

Query: 89  GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIP---CDCVRCAPLSASYYN 145
           G   + L N     ++  I +GTP  SF V LD GS  LW+P   C  + C  L A Y  
Sbjct: 88  GGHNVPLSNFMNAQYFAEIQLGTPAQSFKVILDTGSSNLWVPSSKCTSIACF-LHAKY-- 144

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVED 205
                      S+SST+   + S      G+                    S  G + +D
Sbjct: 145 ----------DSSSSTTYKANGSEFSIQYGSG-------------------SMEGFVSQD 175

Query: 206 ILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL------ 259
           +L +   GD ++K+   A      G+  + G  DG+     +GLG   ISV  +      
Sbjct: 176 LLKI---GDLSIKHQDFAEATKEPGLAFAFGKFDGI-----LGLGYDTISVNHMTPPFYE 227

Query: 260 LAKAGLIRN---SFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCI 316
           +    LI     +F +   ++D G   FG         +       +   + + ++   +
Sbjct: 228 MVAQKLIDEPVFAFRLGSSEEDGGEAVFGGIDRTAYTGSIDYVPVRRKAYWEVELQKVAL 287

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQ 376
           G   L      A +D+G+S   LP ++ E I  +   Q            W   Y     
Sbjct: 288 GDDELDLEHTGAAIDTGTSLIALPTDIAEMINTQIGAQKQ----------WNGQYTVDCS 337

Query: 377 RLPKLPSVKLMF 388
           ++P LP + L F
Sbjct: 338 KVPSLPELVLTF 349


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 103/274 (37%), Gaps = 55/274 (20%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           +GTP    L+ALD  +D  W       CAP       S       + P++SS+   L C+
Sbjct: 85  LGTPVQQLLLALDTSADATW-----SHCAPCDTCPAGS------RFIPASSSSYASLPCA 133

Query: 169 HRLCDLGTSCQNPKQPCPYTMDYYTE-----------NTSSSGLLVEDILHLISGGDNAL 217
              C L        QPCP   D               +TS    L  D L L   G +A+
Sbjct: 134 SDWCPLFEG-----QPCPANQDASAPLPACAFSKPFADTSFQASLGSDTLRL---GKDAI 185

Query: 218 KNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNS-FSMCFDK 276
                A    GC +    G    +   GL+GLG G +   SLL++ G   N  FS C   
Sbjct: 186 -----AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPM---SLLSQTGSTYNGVFSYCLPS 236

Query: 277 DD----SGRIFFGDQG-PATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--------- 322
                 SG +  G  G P   + T  L +  +   Y + V    +G + +K         
Sbjct: 237 YRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFD 296

Query: 323 -QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQV 355
             T    ++DSG+  T     VY  +  EF RQV
Sbjct: 297 PATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 118/295 (40%), Gaps = 40/295 (13%)

Query: 114 VSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 170
           V+  + LD  SD+ W+   PC    C P          +D+  Y P+ SS+S   SC+  
Sbjct: 167 VTQTMVLDTASDVTWVQCSPCPTPPCYP---------QKDV-LYDPTKSSSSGVFSCNSP 216

Query: 171 LC-DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
            C  LG     C N  Q C Y +  Y + TS++G  + D+L +      A++     S  
Sbjct: 217 TCTQLGPYANGCTNNNQ-CQYRVR-YPDGTSTAGTYISDLLTITPA--TAVR-----SFQ 267

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 286
            GC     G +  G +  G++ LG G  S+ S    A      FS CF    + R FF  
Sbjct: 268 FGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRVFSHCF-PPPTRRGFFTL 324

Query: 287 QGPATQQSTSFLASNGKYIT-----YIIGVETCCIGSS--CLKQTSFK--AIVDSGSSFT 337
             P        L    K        Y++ +E   +      +  T F   A +DS ++ T
Sbjct: 325 GVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAIT 384

Query: 338 FLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQN 391
            LP   Y+ +   F DR         +G P   CY  +  R   LP + L+F +N
Sbjct: 385 RLPPTAYQALRQAFRDRMAMYQPAPPKG-PLDTCYDMAGVRSFALPRITLVFDKN 438


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/304 (23%), Positives = 117/304 (38%), Gaps = 40/304 (13%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASS 160
           L +    +G P V     +D GS LLWI C  C  C+       N +   +  ++P+ SS
Sbjct: 67  LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSS------NHMIHPV--FNPALSS 118

Query: 161 TSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
           T    SC  R C    +       C Y    Y   T S G+L ++ L   +   N +   
Sbjct: 119 TFVECSCDDRFCRYAPNGHCSSNKCVYEQ-VYISGTGSKGVLAKERLTFTTPNGNTV--- 174

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DK 276
           V   +  GCG  ++G  L+     G++GLG    S+   L       + FS C     +K
Sbjct: 175 VTQPIAFGCG-HENGEQLESEF-TGILGLGAKPTSLAVQLG------SKFSYCIGDLANK 226

Query: 277 D-DSGRIFFGDQGPATQQST--SFLASNGKYITYIIGVETCCIGSSCL--KQTSFK---- 327
           +    ++  G+        T   F   NG    Y + +E   +G   L  +   FK    
Sbjct: 227 NYGYNQLVLGEDADILGDPTPIEFETENG---IYYMNLEGISVGDKQLNIEPVVFKRRGS 283

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
               I+D+G+ +T+L    Y  +  E    ++  +  F    + C +   ++ L   P V
Sbjct: 284 RTGVILDTGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVV 343

Query: 385 KLMF 388
              F
Sbjct: 344 TFHF 347


>gi|325087547|gb|EGC40857.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus H88]
          Length = 398

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/312 (25%), Positives = 124/312 (39%), Gaps = 67/312 (21%)

Query: 87  SQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIP---CDCVRCAPLSASY 143
           + G  ++ + N     +++ I IGTP  +F V LD GS  LW+P   C  + C      Y
Sbjct: 69  ASGGHSLPVDNFLNAQYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECGSIAC------Y 122

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
            +      N+Y  SASST K                  K    +++ Y   + S +G + 
Sbjct: 123 LH------NKYDSSASSTHK------------------KNGSEFSITY--GSGSLTGFVS 156

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVP------ 257
           +D L +   GD  ++N V A      G+  + G  DG+     +GLG   ISV       
Sbjct: 157 QDCLTI---GDLVVENQVFAEATSEPGLAFAFGRFDGI-----LGLGYDTISVNKIVPPF 208

Query: 258 -SLLAKAGLIRNSFSMCFD----KDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIG 310
             +L K  L    FS         DD   + FG  ++   T + T        Y  + + 
Sbjct: 209 YEMLNKDLLDEPMFSFYLGDANIDDDQSEVVFGGMNKDRFTGELTKIPLRRKAY--WEVD 266

Query: 311 VETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEG-YPWKC 369
           +++   G      T+   I+D+G+S   LP  + E +  E   +      SF G Y  +C
Sbjct: 267 LDSITFGKQTAMMTNTGVILDTGTSLIALPSTIAELLNKEIGAK-----KSFNGQYTVEC 321

Query: 370 CYKSSSQRLPKL 381
             + S   LP L
Sbjct: 322 AKRDS---LPNL 330


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 127/304 (41%), Gaps = 42/304 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++T I +GTP     + LD GSD++W+ C  C  C       Y+  D   N   P  S +
Sbjct: 42  YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNC-------YSQTDPVFN---PVKSGS 91

Query: 162 SKHLSCSHRLC-DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNS 220
              + C   LC  L +   N +Q C Y +  Y + + ++G  V + L          + +
Sbjct: 92  FAKVLCRTPLCRRLESPGCNQRQTCLYQVS-YGDGSYTTGEFVTETL--------TFRRT 142

Query: 221 VQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDS 279
               V +GCG    G +   V   GL+GLG G +S PS   +       FS C  D+  S
Sbjct: 143 KVEQVALGCGHDNEGLF---VGAAGLLGLGRGGLSFPSQAGRT--FNQKFSYCLVDRSAS 197

Query: 280 GR---IFFGDQGPATQQSTSFLASNGKYITY----IIGVETCCIGSSCLKQTSFK----- 327
            +   + FG+   +     + L +N +  T+    ++G+       S +  + FK     
Sbjct: 198 SKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTG 257

Query: 328 ---AIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
               I+D G+S T L K  Y  +   F    +   ++ E   +  CY  S +   K+P+V
Sbjct: 258 NGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTV 317

Query: 385 KLMF 388
            L F
Sbjct: 318 VLHF 321


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 88/208 (42%), Gaps = 52/208 (25%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPC-DCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHL 165
           + IGTP V+F V  D GS L+W  C  C  CA           R    + P++SST   L
Sbjct: 94  LSIGTPPVTFSVLADTGSSLIWTQCAPCTECA----------ARPAPPFQPASSSTFSKL 143

Query: 166 SCSHRLCDLGTSCQNPKQPC---------PYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
            C+  LC   TS   P + C         PY M +      ++G L  + LH+  GG + 
Sbjct: 144 PCASSLCQFLTS---PYRTCNATGCVYYYPYGMGF------TAGYLATETLHV--GGASF 192

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDK 276
                   V  GC  +       G +  G++GLG   +   SL+++ G+ R  FS C   
Sbjct: 193 ------PGVTFGCSTENG----VGNSSSGIVGLGRSPL---SLVSQVGVAR--FSYCLRS 237

Query: 277 D-DSGR--IFFGDQGPATQ---QSTSFL 298
           + D+G   I FG     T    QST  L
Sbjct: 238 NADAGDSPILFGSLAKVTGGNVQSTPLL 265


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 81/308 (26%), Positives = 124/308 (40%), Gaps = 41/308 (13%)

Query: 114 VSFLVALDAGSDLLWI---PCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHR 170
           V+  + LD  SD+ W+   PC    C P          +D+  Y P+ SS+S   SC+  
Sbjct: 142 VTQTMVLDTASDVTWVQCSPCPTPPCYP---------QKDV-LYDPTKSSSSGVFSCNSP 191

Query: 171 LC-DLG---TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
            C  LG     C N  Q C Y +  Y + TS++G  + D+L +      A++     S  
Sbjct: 192 TCTQLGPYANGCTNNNQ-CQYRVR-YPDGTSTAGTYISDLLTITPA--TAVR-----SFQ 242

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDSGRIFFGD 286
            GC     G +  G +  G++ LG G  S+  +   A      FS CF    + R FF  
Sbjct: 243 FGCSHGVQGSFSFGSSAAGIMALGGGPESL--VSQTAATYGRVFSHCF-PPPTRRGFFTL 299

Query: 287 QGPATQQSTSFLASNGKYIT-----YIIGVETCCIGSS--CLKQTSFK--AIVDSGSSFT 337
             P        L    K        Y++ +E   +      +  T F   A +DS ++ T
Sbjct: 300 GVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAIT 359

Query: 338 FLPKEVYETIAAEF-DRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSFVV 396
            LP   Y+ +   F DR         +G P   CY  +  R   LP + L+F   N+ V 
Sbjct: 360 RLPPTAYQALRQAFRDRMAMYQPAPPKG-PLDTCYDMAGVRSFALPRITLVF-DKNAAVE 417

Query: 397 NNPVFVIY 404
            +P  V++
Sbjct: 418 LDPSGVLF 425


>gi|345568347|gb|EGX51242.1| hypothetical protein AOL_s00054g478 [Arthrobotrys oligospora ATCC
           24927]
          Length = 392

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 135/338 (39%), Gaps = 67/338 (19%)

Query: 62  YQVLLSSDVQKQKMKTGPQ--FQMLFPSQGSKTMSLGNDFGWLHYTWIDIGTPNVSFLVA 119
           +Q  + +  QK   + G Q  F     + G  ++ + N     +Y+ I +GTP  +F V 
Sbjct: 39  FQTQVQALAQKYINRAGNQQAFTNDVNADGGHSVPVNNFLNAQYYSEITLGTPPQTFKVV 98

Query: 120 LDAGSDLLWIP---CDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGT 176
           LD GS  LW+P   C  + C   +            +Y  S SST K             
Sbjct: 99  LDTGSSNLWVPSKSCSSIACFLHT------------KYDSSESSTYK------------- 133

Query: 177 SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGG 236
                     +++ Y   + S  G + +D L +   GD  +KN + A      G+  + G
Sbjct: 134 -----ANGTEFSIQY--GSGSMEGFISQDTLTI---GDLTIKNQLFAEATKEPGLAFAFG 183

Query: 237 YLDGVAPDGLIGLGLGEISVPSL------LAKAGLIRN---SFSMCFDKDDSGRIFFG-D 286
             DG+     +GLG   ISV  +      +    L+     +F +  ++D+S  +F G D
Sbjct: 184 KFDGI-----LGLGYDTISVNKIPPPFYQMISQKLVDEPVFAFYLGREEDESEAVFGGID 238

Query: 287 QGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLKQTSFKAIVDSGSSFTFLPKEVYET 346
           +   T   T        Y  + +  ++   G    +  S+ A++D+G+S   LP +  E 
Sbjct: 239 KSHYTGDITWVDVRRKAY--WEVPFDSISFGDQTAELDSWGAVLDTGTSLITLPSDYAEM 296

Query: 347 IAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSV 384
           +        N  I + +G  W   Y    +++P LPS+
Sbjct: 297 L--------NSAIGATKG--WNGQYSVPCEKVPDLPSL 324


>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
          Length = 642

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 73/313 (23%), Positives = 134/313 (42%), Gaps = 48/313 (15%)

Query: 95  LGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNE 153
           LG  +G  HY  I +G P     V +D GS L  +PC  C  C   +   ++        
Sbjct: 88  LGVGYG-THYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGCGQHTDPLFDV------- 139

Query: 154 YSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGG 213
              S S+T+K+L+C H       SC++ +Q   Y    Y E +    ++V++++ +  GG
Sbjct: 140 ---SKSTTAKYLAC-HDF----DSCRSCEQDRCYISQSYMEGSMWEAVMVDELVWV--GG 189

Query: 214 DNALKNSVQASVI-------IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLI 266
            ++  + ++  +        +GC  K++G ++     +G++GLG    +V S +  AG +
Sbjct: 190 FSSPADEMEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRV 248

Query: 267 -RNSFSMCFDKDDSGRIFFGDQGPATQQS----TSFLASNGKYITYIIGVETCCIGSSCL 321
            +N F++CF   D G + FG    +   S    T  L+    Y  Y + V+   +    L
Sbjct: 249 TQNLFTLCF-AGDGGELVFGGVDYSHHTSDVGYTPLLSDKSAY--YPVHVKDILLNGVSL 305

Query: 322 K------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
                   +    IVDSG++ TF   +      + F +      +       +   K +S
Sbjct: 306 GIDTGTINSGRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYS-------ESRMKLTS 358

Query: 376 QRLPKLPSVKLMF 388
           + L  LP + ++ 
Sbjct: 359 EELAALPVISIIL 371


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 121/313 (38%), Gaps = 43/313 (13%)

Query: 120 LDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQ 179
           +D GSDL+W PC    C         +   ++ + + S S  S   S +H        C 
Sbjct: 93  MDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSCQSPACSAAHASMSSSNLCA 152

Query: 180 NPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGD---NALKNSVQASVIIGCGMKQSGG 236
             +  CP     Y E +  S        +    G    N  + ++  S +          
Sbjct: 153 ISR--CPLD---YIETSDCSSFSCPPFYYAYGDGSFVANLYQQTLSLSSLHLQNFTFGCA 207

Query: 237 YLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMC-----FDKDDSGR---IFFGDQ 287
           +     P G+ G G G +S+P+ L+  +  + N FS C     FD D   R   +  G  
Sbjct: 208 HTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGDRLRRPSPLILGRH 267

Query: 288 -----GPATQQSTSF----LASNGKY-ITYIIGVETCCIGS------SCLKQTSFKA--- 328
                G    +S  F    + SN K+   Y +G+    +G         LK+   K    
Sbjct: 268 NDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTVPAPEILKRVDEKGNGG 327

Query: 329 -IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKC----CYKSSSQRLPKLPS 383
            +VDSG++FT LP+  Y  +  EFD++VN           K     CY  +   L ++P 
Sbjct: 328 MVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLGPCYYLNG--LSQIPV 385

Query: 384 VKLMFPQNNSFVV 396
           +KL F  NNS VV
Sbjct: 386 LKLHFVGNNSDVV 398


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 74/327 (22%), Positives = 123/327 (37%), Gaps = 66/327 (20%)

Query: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASST 161
           ++Y+ I +G+P   F + +D GSDL W+ CD   C+P  +S ++ L          AS+T
Sbjct: 2   VYYSTITLGSPPKDFSLVMDTGSDLTWVRCD--PCSPDCSSTFDRL----------ASNT 49

Query: 162 SKHLSCSHRLCDLGTSCQNPKQPCPYTMDY---YTENTSSSGLLVEDILHLISGGDNALK 218
            K L+C+                     DY   Y + + + G L  D L +     + L+
Sbjct: 50  YKALTCAD--------------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELE 89

Query: 219 NSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
                  + GCG     G + G    G++ L  G +S PS + +     N FS C  +  
Sbjct: 90  EF--PGFVFGCGSLLK-GLISGEV--GILALSPGSLSFPSQIGEK--YGNKFSYCLLRQT 142

Query: 279 SGR------IFFGDQ-------GPATQQSTSFLASNGKYITYIIGVETCCIG-------- 317
           +        + FG+        G    Q   +       I Y + ++   +G        
Sbjct: 143 AQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSP 202

Query: 318 SSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDT-ITSFEGYPWKCCYKSSSQ 376
           S+ L       I DSG++ T LP  V ++I       V+     + +G     C++    
Sbjct: 203 SAFLNGQDKPTIFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKG--LDACFRVPPS 260

Query: 377 RLPKLPSVKLMFPQNNSFVVNNPVFVI 403
               LP +   F     FV     +VI
Sbjct: 261 SGQGLPDITFHFNGGADFVTRPSNYVI 287


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 87/200 (43%), Gaps = 32/200 (16%)

Query: 228 GCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDDS-GRIFFGD 286
           GCG    G +  G   DG++GLG G++S  S  A     +  FS C  ++DS G + FG+
Sbjct: 172 GCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGE 227

Query: 287 QGPATQQS------------TSFLASNGKYITYIIGVETCCIGSSCLKQ-----TSFKAI 329
           +  AT QS            TS L  +G Y   ++ +    +G+  L        S   I
Sbjct: 228 K--ATSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDIS---VGNKRLNVPSSVFASPGTI 282

Query: 330 VDSGSSFTFLPKEVYETIAAEFDRQVNDTITS----FEGYPWKCCYKSSSQRLPKLPSVK 385
           +DSG+  T LP+  Y  + A F + +     S     +G     CY  S ++   LP + 
Sbjct: 283 IDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIV 342

Query: 386 LMFPQNNSFVVNNPVFVIYG 405
           L F +     +N    VI+G
Sbjct: 343 LHFGEGADVRLNGKR-VIWG 361


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/332 (21%), Positives = 134/332 (40%), Gaps = 64/332 (19%)

Query: 95  LGNDFGWLHYTWI---DIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRD 150
             N   W +Y+++    +GTP  +  V +D  S L W+ C+ C+    +           
Sbjct: 115 FANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINACLIPT--------- 165

Query: 151 LNEYSPSASSTSKHLSCSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
              ++P+ASST K + C   LC+          SC  P + C Y   Y+ + + S G++ 
Sbjct: 166 ---FNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYH-DYSLSVGVVS 221

Query: 204 EDILHLISGGDNALKNSVQASVIIGCG--MKQSGGYLDGVAPDGLIGLGLGEISVPSLLA 261
            D L    G             I GC    +  GG   G+     +G+ + + S+ S + 
Sbjct: 222 SDTLTYGLGSQK---------FIFGCCNLFRGVGGRYSGI-----LGMSVNKFSLFSQMT 267

Query: 262 KAGLIRNSFSMCF-DKDDSGRIFFG--DQGPATQQSTSFLASNGKYITYI--IGVETCCI 316
                R + S CF    + G + FG  D+  +  + T        Y  ++  + VET  +
Sbjct: 268 VGHRYR-AMSYCFPHPRNQGFLQFGRYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSL 326

Query: 317 GSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSF-EGY------PWKC 369
                   + +   D+G+ +T LP+ ++ +++        DT+ +  EGY        + 
Sbjct: 327 DVQSSGNQTMRCFFDTGTPYTMLPQSLFVSLS--------DTVGNLVEGYYRVGASTGQT 378

Query: 370 CYKSSSQRLPK---LPSVKLMFPQNNSFVVNN 398
           C+++    +     +P+VK+ F       +N+
Sbjct: 379 CFQADGNWIEGDLYMPTVKIEFQNGARITLNS 410


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 105/437 (24%), Positives = 172/437 (39%), Gaps = 70/437 (16%)

Query: 1   MNRISLTIYLAVFWLLTESSGAETVMFSTKLIHRFSEEVKALGVSKNRNATSWPAKKSFE 60
           M  +++ ++L V   L  +SGA +V      IH                  S P   + E
Sbjct: 8   MASLAVLVFLVVCATL--ASGAASVRVGLTRIH------------------SDPDITAPE 47

Query: 61  YYQVLLSSDVQKQKMKTGPQFQMLFPSQGSKTMSLGNDF--GWLHYTWIDIGTPNVSFLV 118
           + +  L  D+ +Q+ ++    ++      + +     D   G  +   + IGTP +S+  
Sbjct: 48  FVRDALRRDMHRQQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLSYPA 107

Query: 119 ALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRL--CD--L 174
             D GSDL+W      +CAP S     +    L  Y+P++S+T   L C+  L  C   L
Sbjct: 108 IADTGSDLIW-----TQCAPCSGDQCFAQPAPL--YNPASSTTFGVLPCNSSLSMCAGVL 160

Query: 175 GTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQS 234
                 P   C Y   Y T  T  +G+   +       G  A   +    +  GC    S
Sbjct: 161 AGKAPPPGCACMYNQTYGTGWT--AGVQGSETFTF---GSAAADQARVPGIAFGCSNASS 215

Query: 235 GGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF----DKDDSGRIFFGDQGPA 290
             + +G A  GL+GLG G +   SL+++ G  R  FS C     D + +  +  G     
Sbjct: 216 SDW-NGSA--GLVGLGRGSL---SLVSQLGAGR--FSYCLTPFQDTNSTSTLLLGPSAAL 267

Query: 291 TQ---QSTSFLASNGKY---ITYIIGVETCCIGSSCLKQT----SFKA------IVDSGS 334
                +ST F+AS  K      Y + +    +G+  L  +    S KA      I+DSG+
Sbjct: 268 NGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGT 327

Query: 335 SFTFLPKEVYETIAAEFDRQVN-DTITSFEGYPWKCCYK--SSSQRLPKLPSVKLMFPQN 391
           + T L    Y+ + A     V    I   +      CY   + +   P +PS+ L F   
Sbjct: 328 TITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHF-DG 386

Query: 392 NSFVVNNPVFVIYGTQV 408
              V+    ++I G+ V
Sbjct: 387 ADMVLPADSYMISGSGV 403


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 86/300 (28%), Positives = 122/300 (40%), Gaps = 56/300 (18%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           +  GTP   F + +D GSD  WI C+       S S  N  ++    ++PS SS+  + S
Sbjct: 133 VGFGTPQQKFNLIIDTGSDTTWIQCN-------SCSLGNCHNK--KTFNPSLSSSYSNRS 183

Query: 167 CSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
           C             P     YTM  Y +N+ S G+ V D        +  LK  V     
Sbjct: 184 CI------------PSTDTNYTMK-YEDNSYSKGVFVCD--------EVTLKPDVFPKFQ 222

Query: 227 IGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAK-AGLIRNSFSMCFDKDDS--GRIF 283
            GCG   SGG   G A  G++GL  GE    SL+++ A   +  FS CF   +   G + 
Sbjct: 223 FGCG--DSGGGEFGTA-SGVLGLAKGEQY--SLISQTASKFKKKFSYCFPPKEHTLGSLL 277

Query: 284 FGDQGPATQQSTSFL-----ASNGKYITYIIGVETC----CIGSSCLKQTSFKAIVDSGS 334
           FG++  +   S  F       S   Y   +IG+        + SS     S   I+DSG+
Sbjct: 278 FGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLF--ASPGTIIDSGT 335

Query: 335 SFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWK----CCY--KSSSQRLPKLPSVKLMF 388
             T LP   YE +   F +++     S    P +     CY  K    R  KLP + L F
Sbjct: 336 VITRLPTAAYEALRTAFQQEMLH-CPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHF 394


>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 435

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 93/354 (26%), Positives = 139/354 (39%), Gaps = 82/354 (23%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I+  TP V   V +D G   LW+ C+                   N+Y    SST +   
Sbjct: 51  INQRTPLVPLNVIVDLGGQFLWVDCE-------------------NKY---ISSTYRPAR 88

Query: 167 CSHRLCDLGTS-----CQNPKQP------CPYTMDYYTENTSSSGLLVEDILHL-ISGGD 214
           C    C L  S     C +  +P      C  T D    +T++SG L ED+L +  S G 
Sbjct: 89  CRSAQCSLANSDGCGDCFSSPKPGCNNNTCGVTPDNSITHTATSGELAEDVLSIQSSNGF 148

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVA--PDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           N  +N V +  +  C        L G+A    G+ GLG  +I++PS LA A      F++
Sbjct: 149 NPGQNVVVSRFLFSCAPTF---LLKGLATGASGMAGLGRTKIALPSQLASAFSFARKFAI 205

Query: 273 CFDKDDSGRIFFGDQGP--------------------ATQQSTSFLASNGK-YITYIIGV 311
           C      G + FGD GP                        ST+   S G+    Y IGV
Sbjct: 206 CLSSSK-GVVLFGD-GPYGFLPNVVFDSDSLTYTPLLINPVSTASAFSQGQPSAEYFIGV 263

Query: 312 ETCCIGSSCLK-QTSFKAIVDSG---------SSFTFLPKEVYETIAAEFDRQVNDTITS 361
           +T  I    +   TS  +I ++G           +T L   +Y+ +   F +        
Sbjct: 264 KTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKASAARNIK 323

Query: 362 FEG--YPWKCCYKS-SSQRL-PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVGVS 411
             G   P++ CY + +  RL   +P+++L F QN      N V+ I+G    VS
Sbjct: 324 RVGSVAPFEFCYTNLTGTRLGAAVPTIEL-FLQN-----ENVVWRIFGANSMVS 371


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 69/306 (22%), Positives = 115/306 (37%), Gaps = 51/306 (16%)

Query: 100 GWLHYTWIDIGTPNVSF---LVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYS 155
           G  +   + IGTP        V  D GSDL W  C+ C  C+  +             + 
Sbjct: 119 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTP---------YPPHD 169

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           PS S T + LSC   +C+L T+  +       C +    Y +  + SG LV D+ H  + 
Sbjct: 170 PSKSRTFRRLSCFDPMCELCTAVVDGGGGSAGCLFRR-RYGDGGAVSGELVSDVFHFGAA 228

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           GD      ++  V  GC   +    + G +  G++ LG+G+   PS + + G+ R  FS 
Sbjct: 229 GDGG-GYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGVDR--FSY 281

Query: 273 CF---------------DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
           C                ++  +  + FG     T +   F      Y   +  V     G
Sbjct: 282 CIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGG 341

Query: 318 SSCLKQ------------TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
               +Q             +   +VDSG++  +LP  V+  +    +  ++ T      +
Sbjct: 342 RLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTH 401

Query: 366 PWKCCY 371
           P   CY
Sbjct: 402 PSLYCY 407


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 74/285 (25%), Positives = 117/285 (41%), Gaps = 63/285 (22%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPC-------DCVRCAPLSASYYNSLD-RDLNEYSPSASS 160
           +GTP     + LD GS L+W PC        C  C       ++ +D   +  Y+ + SS
Sbjct: 80  LGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCT------FSGVDPTKIPIYARNKSS 133

Query: 161 TSKHLSCSHRLCD--LGT--SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNA 216
           T + L C    C+   G+  +C   K+ CPY    Y    S++G LV D+L L       
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKR-CPYYGLEYGLG-STTGQLVSDVLGLS------ 185

Query: 217 LKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF-- 274
            K +     + GC +      +    P+G+ G G G  S+P   A+ GL +  FS C   
Sbjct: 186 -KLNRIPDFLFGCSL------VSNRQPEGIAGFGRGLASIP---AQLGLTK--FSYCLVS 233

Query: 275 ----DKDDSGRIFFGD-------------QGPATQQSTSFLASNGKYITY---IIGVETC 314
               D   SG +                   P T+       S   YI+    ++G +  
Sbjct: 234 HRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDV 293

Query: 315 CIGSSCL---KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
            I    L   K+     IVDSGS+FTF+ + +++ +A E ++ + 
Sbjct: 294 PIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMT 338


>gi|336373584|gb|EGO01922.1| hypothetical protein SERLA73DRAFT_177556 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336386403|gb|EGO27549.1| hypothetical protein SERLADRAFT_461213 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 413

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 80/373 (21%), Positives = 145/373 (38%), Gaps = 65/373 (17%)

Query: 32  IHRFSEEVKALGVSKNRNATSWPAKKSFEYYQVLLSSDVQKQKMKTGPQFQMLFPSQ--- 88
           +H+  +     G+     A  + A+ +++   ++ +    +      P+   LF +Q   
Sbjct: 26  LHKLPKVSPNHGLESAYLAEKYGAETTYQQLPLMGAGGAGRHIRPDRPEDSDLFWTQEEL 85

Query: 89  --GSKTMSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNS 146
             G   + L N     +YT I +G+P  +F V LD GS  LW+P    +C  ++   +  
Sbjct: 86  VKGGHGVPLTNFMNAQYYTEITLGSPAQTFKVILDTGSSNLWVPSS--KCTSIACFLHT- 142

Query: 147 LDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDI 206
                 +Y  S+SST K                       +++ Y   + S  G + ++ 
Sbjct: 143 ------KYDSSSSSTYK------------------ANGTEFSIQY--GSGSMEGFVSQES 176

Query: 207 LHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL------L 260
           + +   GD ++++   A      G+  + G  DG+     +GLG   ISV  +      +
Sbjct: 177 MKI---GDLSIQHQDFAEATKEPGLAFAFGKFDGI-----LGLGYDTISVNHITPPFYNM 228

Query: 261 AKAGLIRN---SFSMCFDKDDSGRIFFG--DQGPATQQSTSFLASNGKYITYIIGVETCC 315
              GL+     SF +   +DD G   FG  D    T   T        Y  + + +E   
Sbjct: 229 IDQGLLDEPLFSFRLGSSEDDGGEAVFGGIDSSAYTGSITYVPVRRKAY--WEVELEKVS 286

Query: 316 IGSSCLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSS 375
            G   L   +  A +D+G+S   LP +V E +    + Q+  T +      W   Y+   
Sbjct: 287 FGGDELDLENTGAAIDTGTSLIALPTDVAEML----NTQIGATRS------WNGQYQVDC 336

Query: 376 QRLPKLPSVKLMF 388
            ++P LP +   F
Sbjct: 337 AKVPSLPELSFYF 349


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 69/306 (22%), Positives = 115/306 (37%), Gaps = 51/306 (16%)

Query: 100 GWLHYTWIDIGTPNVSF---LVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYS 155
           G  +   + IGTP        V  D GSDL W  C+ C  C+  +             + 
Sbjct: 98  GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTP---------YPPHD 148

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           PS S T + LSC   +C+L T+  +       C +    Y +  + SG LV D+ H  + 
Sbjct: 149 PSKSRTFRRLSCFDPMCELCTAVVDGGGGSAGCLFRR-RYGDGGAVSGELVSDVFHFGAA 207

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           GD      ++  V  GC   +    + G +  G++ LG+G+   PS + + G+ R  FS 
Sbjct: 208 GDGG-GYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGVDR--FSY 260

Query: 273 CF---------------DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
           C                ++  +  + FG     T +   F      Y   +  V     G
Sbjct: 261 CIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGG 320

Query: 318 SSCLKQ------------TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
               +Q             +   +VDSG++  +LP  V+  +    +  ++ T      +
Sbjct: 321 RLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTH 380

Query: 366 PWKCCY 371
           P   CY
Sbjct: 381 PSLYCY 386


>gi|297705581|ref|XP_002829653.1| PREDICTED: napsin-A, partial [Pongo abelii]
          Length = 392

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 87/318 (27%), Positives = 125/318 (39%), Gaps = 61/318 (19%)

Query: 86  PSQGSKT--MSLGNDFGWLHYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASY 143
           PS G K   + L N +   ++  I +GTP  +F VA D GS  LW+P    RC   S   
Sbjct: 31  PSPGDKPTFVPLSNYWDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSR--RCHFFSVPC 88

Query: 144 YNSLDRDLNEYSPSASSTSKHLSCSHRLCDLGTSCQNPKQPCPYTMDYYTENTSSSGLLV 203
           +       + ++PSASS+ K           GT          + + Y T      G+L 
Sbjct: 89  WFH-----HRFNPSASSSFK---------PNGTK---------FAIQYGTGRV--DGILS 123

Query: 204 EDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISV----PSL 259
           ED L +  GG         ASVI G  + +S        PDG++GLG   ++V    P L
Sbjct: 124 EDKLTI--GGIKG------ASVIFGEALWESSLVFTVSRPDGILGLGFPILAVEGVRPPL 175

Query: 260 --LAKAGLI-RNSFSMCFDKD----DSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVE 312
             L K GL+ +  FS   ++D    D G +  G   PA                + I +E
Sbjct: 176 DVLVKQGLLDKPIFSFYLNRDPKVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHME 235

Query: 313 TCCIGSS-CLKQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCC- 370
              +GS   L      AI+D+G+     P E    + A           +  G P     
Sbjct: 236 RVKVGSGLTLCARGCAAILDTGTPVIVGPTEEIRALHA-----------AIGGIPLLAGE 284

Query: 371 YKSSSQRLPKLPSVKLMF 388
           Y      +PKLP+V L+ 
Sbjct: 285 YIIRCSEIPKLPAVSLLI 302


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 129/315 (40%), Gaps = 36/315 (11%)

Query: 87  SQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYN 145
           S  S  ++ G  +G  +Y T + +GTP   +++ +D GS L W+     +C+P   S + 
Sbjct: 120 SLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWL-----QCSPCRVSCHR 174

Query: 146 SLDRDLNEYSPSASSTSKHLSCSHRLC-DLGTSCQNP-----KQPCPYTMDYYTENTSSS 199
              +    + P  SS+   +SCS   C DL T+  NP        C Y    Y +++ S 
Sbjct: 175 ---QSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQAS-YGDSSFSV 230

Query: 200 GLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSL 259
           G L +D    +S G N++ N        GCG    G +       GL+GL   ++S+  L
Sbjct: 231 GYLSKDT---VSFGSNSVPN-----FYYGCGQDNEGLFGRSA---GLMGLARNKLSL--L 277

Query: 260 LAKAGLIRNSFSMCFDKDDSGRIFFGDQ-GPATQQSTSFLASNGKYITYIIGVETCCIGS 318
              A  +  SFS C     S          P     T  ++S      Y I +    +  
Sbjct: 278 YQLAPTLGYSFSYCLPSSSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAG 337

Query: 319 SCL-----KQTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKS 373
             L     + +S   I+DSG+  T LP  VY+ ++      +  T  +        C+  
Sbjct: 338 KPLAVSSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCFVG 397

Query: 374 SSQRLPKLPSVKLMF 388
            +  L ++P+V + F
Sbjct: 398 QASSL-RVPAVSMAF 411


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 123/319 (38%), Gaps = 65/319 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + +G P  +  + LD GS+L W+ C   + +P   S +N          P +SST   + 
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHC---KKSPNLGSVFN----------PVSSSTYSPVP 115

Query: 167 CSHRLCDLGT-------SCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           CS  +C   T       SC +PK    +    Y + TS  G L  +           + +
Sbjct: 116 CSSPICRTRTRDLPIPASC-DPKTHLCHVAISYADATSIEGNLAHETF--------VIGS 166

Query: 220 SVQASVIIGC-GMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFDKDD 278
             +   + GC     S    +     GL+G+  G +S  + L  +      FS C    D
Sbjct: 167 VTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYCISGSD 221

Query: 279 SGR-IFFGDQ-----GPATQQSTSFLASNGKY---ITYIIGVETCCIGSSCLK--QTSF- 326
           S   +  GD      GP         ++   Y   + Y + +E   +GS  L   ++ F 
Sbjct: 222 SSVFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFV 281

Query: 327 -------KAIVDSGSSFTFLPKEVYETIAAEFD-------RQVNDTITSFEGYPWKCCYK 372
                  + +VDSG+ FTFL   VY  +  EF        R V+D    F+G     CYK
Sbjct: 282 PDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGT-MDLCYK 340

Query: 373 SSSQRLPK---LPSVKLMF 388
             S   P    LP V LMF
Sbjct: 341 VGSTTRPNFSGLPMVSLMF 359


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 69/306 (22%), Positives = 115/306 (37%), Gaps = 51/306 (16%)

Query: 100 GWLHYTWIDIGTPNVSF---LVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYS 155
           G  +   + IGTP        V  D GSDL W  C+ C  C+  +             + 
Sbjct: 101 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTP---------YPPHD 151

Query: 156 PSASSTSKHLSCSHRLCDLGTSCQN---PKQPCPYTMDYYTENTSSSGLLVEDILHLISG 212
           PS S T + LSC   +C+L T+  +       C +    Y +  + SG LV D+ H  + 
Sbjct: 152 PSKSRTFRRLSCFDPMCELCTAVVDGGGGSAGCLFRR-RYGDGGAVSGELVSDVFHFGAA 210

Query: 213 GDNALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           GD      ++  V  GC   +    + G +  G++ LG+G+   PS + + G+ R  FS 
Sbjct: 211 GDGG-GYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGVDR--FSY 263

Query: 273 CF---------------DKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIG 317
           C                ++  +  + FG     T +   F      Y   +  V     G
Sbjct: 264 CIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGG 323

Query: 318 SSCLKQ------------TSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGY 365
               +Q             +   +VDSG++  +LP  V+  +    +  ++ T      +
Sbjct: 324 RLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTH 383

Query: 366 PWKCCY 371
           P   CY
Sbjct: 384 PSLYCY 389


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 74/170 (43%), Gaps = 30/170 (17%)

Query: 109 IGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCS 168
           IG P       +D GS+L+W    C RC P          ++L  Y PS S  ++ + C+
Sbjct: 77  IGDPPQRAEAIIDTGSNLIWT--QCSRCRP------TCFRQNLPYYDPSRSRAARAVGCN 128

Query: 169 HRLCDLG--TSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVI 226
              C LG  T C +  + C     Y   N + + L  E++             S   S++
Sbjct: 129 DAACALGSETQCLSDNKTCAVVTGYGAGNIAGT-LATENLTF----------QSETVSLV 177

Query: 227 IGCGM--KQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF 274
            GC +  K S G L+G +  G+IGLG G++S+PS L         FS C 
Sbjct: 178 FGCIVVTKLSPGSLNGAS--GIIGLGRGKLSLPSQLGD-----TRFSYCL 220


>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
           partial [Brachypodium distachyon]
          Length = 354

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 80/175 (45%), Gaps = 17/175 (9%)

Query: 179 QNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYL 238
           +NP Q C Y + Y     SS G+L+ D   L  G D       + ++  GCG  Q GG  
Sbjct: 73  ENPNQ-CDYDVRY-AGGESSLGVLIADKFSL-PGRD------ARPTLTFGCGYDQEGGKA 123

Query: 239 DGVAPDGLIGLGLGEISVPSLLAKAGLI-RNSFSMCFDKDDSGRIFFG-DQGPATQQSTS 296
           + +  DG++G+G G   + S L + G I  N    C      G +FFG ++ P++  +  
Sbjct: 124 E-MPVDGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHEKVPSSVVTWV 182

Query: 297 FLASNGKYITYIIGVETCCIGSSC---LKQTSFKAIVDSGSSFTFLPKEVYETIA 348
            +  N  Y  Y  G+       +    +     + ++DSGS++T++P E Y  + 
Sbjct: 183 PMVPNNHY--YSPGLAALHFNGNLGNPISVAPMEVVIDSGSTYTYMPTETYRRLV 235


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 103/282 (36%), Gaps = 49/282 (17%)

Query: 100 GWLHYTW-IDIGTPNVSFLVALDAGSDLLWIPCD-CVRCAPLSASYYNSLDRDLNEYSPS 157
           G L Y   + +GTP       LD GSDL+W  CD C  C          L +    +SP 
Sbjct: 94  GDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTAC----------LRQPDPLFSPR 143

Query: 158 ASSTSKHLSCSHRLCD--LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDN 215
            SS+ + + C+ +LC   L  SC  P   C Y   Y    T+      E      S G+ 
Sbjct: 144 MSSSYEPMRCAGQLCGDILHHSCVRPDT-CTYRYSYGDGTTTLGYYATERFTFASSSGET 202

Query: 216 ALKNSVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCFD 275
                    +  GCG    G   +     G++G G   +S+ S L+    IR  FS C  
Sbjct: 203 Q-----SVPLGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLS----IRR-FSYCLT 249

Query: 276 KDDSGR---IFFG--------DQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK-- 322
              S R   + FG        D      Q+T  L S      Y +      +G+  L+  
Sbjct: 250 PYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIP 309

Query: 323 --------QTSFKAIVDSGSSFTFLPKEVYETIAAEFDRQVN 356
                     S   I+DSG++ T  P  V   +   F  Q+ 
Sbjct: 310 ASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLR 351


>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
          Length = 435

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 93/354 (26%), Positives = 139/354 (39%), Gaps = 82/354 (23%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           I+  TP V   V +D G   LW+ C+                   N+Y    SST +   
Sbjct: 51  INQRTPLVPLNVIVDLGGQFLWVDCE-------------------NKY---ISSTYRPAR 88

Query: 167 CSHRLCDLGTS-----CQNPKQP------CPYTMDYYTENTSSSGLLVEDILHL-ISGGD 214
           C    C L  S     C +  +P      C  T D    +T++SG L ED+L +  S G 
Sbjct: 89  CRSAQCSLANSDGCGDCFSSPKPGCNNNTCGVTPDNSITHTATSGELAEDVLSIQSSNGF 148

Query: 215 NALKNSVQASVIIGCGMKQSGGYLDGVA--PDGLIGLGLGEISVPSLLAKAGLIRNSFSM 272
           N  +N V +  +  C        L G+A    G+ GLG  +I++PS LA A      F++
Sbjct: 149 NPGQNVVVSRFLFSCAPTF---LLKGLATGASGMAGLGRTKIALPSQLASAFSFARKFAI 205

Query: 273 CFDKDDSGRIFFGDQGP--------------------ATQQSTSFLASNGK-YITYIIGV 311
           C      G + FGD GP                        ST+   S G+    Y IGV
Sbjct: 206 CLSSSK-GVVLFGD-GPYGFLPNVVFDSDSLTYTPLLINPVSTASAFSQGQPSAEYFIGV 263

Query: 312 ETCCIGSSCLK-QTSFKAIVDSG---------SSFTFLPKEVYETIAAEFDRQVNDTITS 361
           +T  I    +   TS  +I ++G           +T L   +Y+ +   F +        
Sbjct: 264 KTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKAPAARNIK 323

Query: 362 FEG--YPWKCCYKS-SSQRL-PKLPSVKLMFPQNNSFVVNNPVFVIYGTQVGVS 411
             G   P++ CY + +  RL   +P+++L F QN      N V+ I+G    VS
Sbjct: 324 RVGSVAPFEFCYTNLTGTRLGAAVPTIEL-FLQN-----ENVVWRIFGANSMVS 371


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 71/279 (25%), Positives = 108/279 (38%), Gaps = 57/279 (20%)

Query: 107 IDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLS 166
           + IGTP  +  + LD GS L WI C                      + PS SS+   L 
Sbjct: 84  LPIGTPPQTQQMVLDTGSQLSWIQCH--------KKSVPKKPPPTTSFDPSLSSSFSVLP 135

Query: 167 CSHRLCD-------LGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
           C+H LC        L T+C   +  C Y+  +Y + T + G LV + +   S       +
Sbjct: 136 CNHPLCKPRIPDFTLPTTCDQNRL-CHYSY-FYADGTYAEGSLVREKITFSS-------S 186

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEIS-------------VPSLLAKAGLI 266
                +I+GC    +          G++G+ LG  S             VP+  A+AGL 
Sbjct: 187 QSTPPLILGCAEASTD-------EKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLS 239

Query: 267 RNSFSMCFDKDDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCLK--QT 324
                   +  +SGR  + +    T    S    N   + Y I ++   +G++ L    T
Sbjct: 240 STGSFYLGNNPNSGRFQYINLLTFTPSQRS---PNLDPLAYTIPMQGIRMGNARLNISAT 296

Query: 325 SFK--------AIVDSGSSFTFLPKEVYETIAAEFDRQV 355
            F+         I+DSGS FT+L  E Y  +  E  R V
Sbjct: 297 LFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLV 335


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 136/323 (42%), Gaps = 45/323 (13%)

Query: 103 HYTWIDIGTPNVSFLVALDAGSDLLWIPCDCVRCAPLSASYYNSLDRDLNEYSPSASSTS 162
           ++  + IG P     V LD GSD+ WI     +CAP S  Y  S       + P +S++ 
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWI-----QCAPCSECYQQSDPI----FDPISSNSY 199

Query: 163 KHLSCSHRLC---DLGTSCQNPKQPCPYTMDYYTENTSSSGLLVEDILHLISGGDNALKN 219
             + C    C   DL + C+N    C Y +  Y + + + G    + + L   G  A++N
Sbjct: 200 SPIRCDEPQCKSLDL-SECRNGT--CLYEVS-YGDGSYTVGEFATETVTL---GSAAVEN 252

Query: 220 SVQASVIIGCGMKQSGGYLDGVAPDGLIGLGLGEISVPSLLAKAGLIRNSFSMCF---DK 276
                V IGCG    G +   V   GL+GLG G++S P     A +   SFS C    D 
Sbjct: 253 -----VAIGCGHNNEGLF---VGAAGLLGLGGGKLSFP-----AQVNATSFSYCLVNRDS 299

Query: 277 DDSGRIFFGDQGPATQQSTSFLASNGKYITYIIGVETCCIGSSCL--KQTSFKA------ 328
           D    + F    P    +   + +      Y +G++   +G   L   ++SF+       
Sbjct: 300 DAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGG 359

Query: 329 --IVDSGSSFTFLPKEVYETIAAEFDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKL 386
             I+DSG++ T L  EVY+ +   F +       +     +  CY  SS+   ++P+V  
Sbjct: 360 GIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSF 419

Query: 387 MFPQNNSFVVNNPVFVIYGTQVG 409
            FP+     +    ++I    VG
Sbjct: 420 RFPEGRELPLPARNYLIPVDSVG 442


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 139/344 (40%), Gaps = 48/344 (13%)

Query: 69  DVQKQKMKTGPQFQMLFPSQGSKTMSLGNDFGWLHY-TWIDIGTPNVSFLVALDAGSDLL 127
           D+ K  +K  P    + P   S  ++ G   G   Y T + +G P   F + LD GSD+ 
Sbjct: 128 DISKSDLK--PLETEIKPEDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDIN 185

Query: 128 WIPCD-CVRCAPLSASYYNSLDRDLNEYSPSASSTSKHLSCSHRLCD--LGTSCQNPKQP 184
           W+ C  C  C       Y   D     + P+ASST   ++C  + C     +SC++ +  
Sbjct: 186 WLQCQPCTDC-------YQQTDP---IFDPTASSTYAPVTCQSQQCSSLEMSSCRSGQ-- 233

Query: 185 CPYTMDYYTENTSSSGLLVEDILHLISGGDNALKNSVQASVIIGCGMKQSGGYLDGVAPD 244
           C Y ++Y   + +      E +     G   ++KN     V +GCG    G ++      
Sbjct: 234 CLYQVNYGDGSYTFGDFATESVSF---GNSGSVKN-----VALGCGHDNEGLFVGAAGLL 285

Query: 245 GLIGLGLGEISVPSLLAKAGLIRNSFSMCF-DKDDSGR--IFFGDQGPATQQSTSFLASN 301
           GL G  L      SL  +  L   SFS C  ++D +G   + F          T+ L  N
Sbjct: 286 GLGGGPL------SLTNQ--LKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKN 337

Query: 302 GKYIT-YIIGVETCCIGSS--CLKQTSFK--------AIVDSGSSFTFLPKEVYETIAAE 350
            K  T Y +G+    +G     + +++F+         IVD G++ T L  + Y  +   
Sbjct: 338 RKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDA 397

Query: 351 FDRQVNDTITSFEGYPWKCCYKSSSQRLPKLPSVKLMFPQNNSF 394
           F R   +   +     +  CY  S Q   ++P+V   F    S+
Sbjct: 398 FVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 441


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.407 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,537,764,499
Number of Sequences: 23463169
Number of extensions: 278806506
Number of successful extensions: 661735
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 354
Number of HSP's successfully gapped in prelim test: 2192
Number of HSP's that attempted gapping in prelim test: 657193
Number of HSP's gapped (non-prelim): 3124
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)