BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047535
         (376 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 228/372 (61%), Positives = 277/372 (74%), Gaps = 16/372 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  +  VS+ NGEY+MK SIGTPP  D+YGI DTGSDLMW QCLPC+ CYKQ  P+++P+
Sbjct: 78  NTPEPPVSSNNGEYLMKISIGTPPF-DVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPS 136

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
            S+S+KE+SC+S+QC LLDTVSCS  Q+LC+++YGY D SL +GV+ATE +T  NSN+  
Sbjct: 137 KSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTL-NSNSGQ 195

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-NKFSYCLVPFHTD 184
                N+VFGCGHNN+G FNENEMGL G G   LSL SQI+S LG+  KFS CLVPF TD
Sbjct: 196 PXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTD 255

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            SITSK+ FG  +EVSG  VVST LV+K+D TYYFVTL+GISVG+     KL P+ +SS 
Sbjct: 256 PSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGD-----KLFPFSSSSP 310

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
             +KGN+FID G PPTLLP+DFYNRL + V+ AI + P QDP L  QLCY++ ++    P
Sbjct: 311 MATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLID-GP 369

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           ILTAHFD GA V L   +TFI P  EGV+CFAMQPIDGD GIFGNF Q +  IG+D D +
Sbjct: 370 ILTAHFD-GADVQLKPLNTFISPK-EGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGK 427

Query: 365 MVSFKPTDCTKQ 376
            VSFK  DCTKQ
Sbjct: 428 KVSFKAVDCTKQ 439


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 228/372 (61%), Positives = 277/372 (74%), Gaps = 16/372 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  +  VS+ NGEY+MK SIGTPP  D+YGI DTGSDLMW QCLPC+ CYKQ  P+++P+
Sbjct: 78  NTPEPPVSSNNGEYLMKISIGTPPF-DVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPS 136

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
            S+S+KE+SC+S+QC LLDTVSCS  Q+LC+++YGY D SL +GV+ATE +T  NSN+  
Sbjct: 137 KSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTL-NSNSGQ 195

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-NKFSYCLVPFHTD 184
                N+VFGCGHNN+G FNENEMGL G G   LSL SQI+S LG+  KFS CLVPF TD
Sbjct: 196 PTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTD 255

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            SITSK+ FG  +EVSG  VVST LV+K+D TYYFVTL+GISVG+     KL P+ +SS 
Sbjct: 256 PSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGD-----KLFPFSSSSP 310

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
             +KGN+FID G PPTLLP+DFYNRL + V+ AI + P QDP L  QLCY++ ++    P
Sbjct: 311 MATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLID-GP 369

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           ILTAHFD GA V L   +TFI P  EGV+CFAMQPIDGD GIFGNF Q +  IG+D D +
Sbjct: 370 ILTAHFD-GADVQLKPLNTFISPK-EGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGK 427

Query: 365 MVSFKPTDCTKQ 376
            VSFK  DCTKQ
Sbjct: 428 KVSFKAVDCTKQ 439


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  390 bits (1001), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 207/367 (56%), Positives = 250/367 (68%), Gaps = 45/367 (12%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  +  VS+ NGEY+MK SIGTPP  D+YGI DTGSDLMW QCLPC+ CYKQ  P+++P+
Sbjct: 11  NTPEPPVSSNNGEYLMKISIGTPPF-DVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPS 69

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
            S+S+KE+SC+S+QC LLDT +                     +L               
Sbjct: 70  KSTSFKEVSCESQQCRLLDTPT--------------------SIL--------------- 94

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-NKFSYCLVPFHTDSSITS 189
           N+VFGCGHNN+G FNENEMGL G G   LSL SQI+S LG+  KFS CLVPF TD SITS
Sbjct: 95  NIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITS 154

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
           K+ FG  +EVSG  VVST LV+K+D TYYFVTL+GISVG+     KL P+ +SS   +KG
Sbjct: 155 KIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGD-----KLFPFSSSSPMATKG 209

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAH 309
           N+FID G PPTLLP+DFYNRL + V+ AI + P QDP L  QLCY++ ++    PILTAH
Sbjct: 210 NVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLID-GPILTAH 268

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           FD GA V L   +TFI  P EGV+CFAMQPIDGD GIFGNF Q +  IG+D D + VSFK
Sbjct: 269 FD-GADVQLKPLNTFI-SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFK 326

Query: 370 PTDCTKQ 376
             DCTKQ
Sbjct: 327 AVDCTKQ 333


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 197/367 (53%), Positives = 249/367 (67%), Gaps = 12/367 (3%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           QS +    G Y+M+ SIGTPP   IYGI DTGSDL W  C+PC +CYKQ  PI++P  S+
Sbjct: 15  QSPIYAYLGHYLMEVSIGTPPF-KIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKST 73

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
           SY+ +SC S+ CH LDT  CS Q+ CNYTY YA +++T+GVLA E IT  ++        
Sbjct: 74  SYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLK 133

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            +VFGCGHNNTG FN+ EMG++GLG   +S  SQI S  G  +FS CLVPFHTD S++SK
Sbjct: 134 GIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSK 193

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G GSEVSG GVVST LV+K+DKT YFVTL GISVGN    + L    +SS ++ KGN
Sbjct: 194 MSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGN----TYLHFNGSSSQSVEKGN 249

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPILTAH 309
           +F+D+G PPT+LP   Y+RL  QVR+ + + P   D  LG QLCY+T +     P+LTAH
Sbjct: 250 VFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNL-RGPVLTAH 308

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F+GG  V L+ T TF+ P  +GVFC        D G++GNFAQS+  IG+D D Q+VSFK
Sbjct: 309 FEGG-DVKLLPTQTFVSPK-DGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFK 366

Query: 370 PTDCTKQ 376
           P DCTK 
Sbjct: 367 PMDCTKH 373


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 194/367 (52%), Positives = 248/367 (67%), Gaps = 13/367 (3%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           QS +    G Y+M+ SIGTPP   IYGI DTGSDL W  C+PC  CYKQ  P+++P  S+
Sbjct: 62  QSPIYAYLGHYLMELSIGTPPF-KIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKST 120

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
           +Y+ +SC S+ CH LDT  CS Q+ CNYTY YA +++T+GVLA E IT  ++        
Sbjct: 121 TYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLK 180

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            +VFGCGHNNTG FN++EMG++GLG   +SL SQ+ S  G  +FS CLVPFHTD S++SK
Sbjct: 181 GIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSK 240

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M FG GS+VSG GVVST LV+K+DKT YFVTL GISV N       + +  SS  + KGN
Sbjct: 241 MSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVEN-----TYLHFNGSSQNVEKGN 295

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPILTAH 309
           MF+D+G PPT+LP   Y+++  QVR+ + + P   DP LG QLCY+T +     P+LTAH
Sbjct: 296 MFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLR-GPVLTAH 354

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F+ GA V L  T TFI P  +GVFC        D G++GNFAQS+  IG+D D Q+VSFK
Sbjct: 355 FE-GADVKLSPTQTFISPK-DGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFK 412

Query: 370 PTDCTKQ 376
           P DCTK 
Sbjct: 413 PKDCTKH 419


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 191/373 (51%), Positives = 248/373 (66%), Gaps = 18/373 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N+VQ+ ++   G+++M+  IGTPP+  I G+VDTGSDL+W+QC PC+ CYKQ+KP+++P 
Sbjct: 55  NIVQAPINAYIGQHLMEIYIGTPPI-KITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPL 113

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
            SS+Y  +SC S  CH LDT  CS ++ CNYTYGY D+SLTKGVLA +  TF ++     
Sbjct: 114 KSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPV 173

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
                +FGCGHNNTG FN++EMGL+GLG    SL SQI    G  KFS CLVPF TD  I
Sbjct: 174 SLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKI 233

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           +S+M FG GS+V G GVV+T LV +E  T YFVTL GISV +         Y+  +  I 
Sbjct: 234 SSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDT--------YFPMNSTIG 285

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPIL 306
           K NM +D+G PP LLP+  Y+++  +VRN + L P   DP LG+QLCY+T +     P L
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNLK-GPTL 344

Query: 307 TAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDS 363
           T HF  GA V L    TFIPP    +G+FC A+    + D G++GNFAQS+  IG+D D 
Sbjct: 345 TFHFV-GANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDR 403

Query: 364 QMVSFKPTDCTKQ 376
           Q+VSFKPTDCTKQ
Sbjct: 404 QVVSFKPTDCTKQ 416


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 197/371 (53%), Positives = 250/371 (67%), Gaps = 20/371 (5%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+ VS  + +Y+M+ SIGTPP+   Y  VDTGSDL+W+QC+PC  CYKQ+ P+++P SSS
Sbjct: 49  QTPVSVHHYDYLMELSIGTPPV-KTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSS 107

Query: 74  SYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
           +Y  ++  SE C  L + SCS  Q  CNYTY Y D S+T+GVLA E +T  ++       
Sbjct: 108 TYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVAL 167

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
             V+FGCGHNN GVFN+ EMG++GLGR  LSL SQI S  G   FS CLVPFHT+ SITS
Sbjct: 168 KGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITS 227

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG--AI 246
            M FG GSEV G GVVST LVSK   + +YFVTL GISV +++     +P+ + S    I
Sbjct: 228 PMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDIN-----LPFNDGSSLEPI 282

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPS-MAGIAP 304
           +KGNM ID+G P TLLP+DFY+RL E+VRN + L P   DP LG QLCY+TP+ + G   
Sbjct: 283 TKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTNLKGTT- 341

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDS 363
            LTAHF+ GA V L  T  FIP   +G+FCFA       + GI+GN AQS+  IG+D + 
Sbjct: 342 -LTAHFE-GADVLLTPTQIFIPVQ-DGIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEK 398

Query: 364 QMVSFKPTDCT 374
           Q+VSFK TDCT
Sbjct: 399 QLVSFKATDCT 409


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 189/367 (51%), Positives = 257/367 (70%), Gaps = 17/367 (4%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           + V++ NG+Y+MK ++G+PP+ DIYG+VDTGSDL+W QC PC  CY+Q  P++ P  S +
Sbjct: 73  TRVTSNNGDYLMKLTLGSPPV-DIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKT 131

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDN 131
           Y  + C+SEQC      SCS Q++C Y+Y YADSS+TKGVLA E ITF +++       +
Sbjct: 132 YSPIPCESEQCSFFG-YSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGD 190

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           ++FGCGH+N+G FNEN+MG++G+G   LSL SQI +  G+ +FS CLVPFHTD+  +  +
Sbjct: 191 IIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTI 250

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG  S+VSG GVV+T L S+E +T Y VTLEGISVG+          +NSS  +SKGN+
Sbjct: 251 NFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGD------TFVRFNSSETLSKGNI 304

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIAPILTAHF 310
            ID+G P T +P++FY RL E+++    L P + DP LG+QLCY++ +     PILTAHF
Sbjct: 305 MIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYRSETNLE-GPILTAHF 363

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           + GA V L+   TFIPP  +GVFCFAM    DGD  IFGNFAQS++ +G+D D + +SFK
Sbjct: 364 E-GADVQLLPIQTFIPPK-DGVFCFAMAGSTDGDY-IFGNFAQSNILMGFDLDRKTISFK 420

Query: 370 PTDCTKQ 376
           PTDCT Q
Sbjct: 421 PTDCTNQ 427


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  350 bits (898), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 189/374 (50%), Positives = 247/374 (66%), Gaps = 19/374 (5%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           ++VQ+ ++   G+Y+M+  IGTPP+  I G VDTGSDL+WVQC+PC+ CY Q+ P+++P 
Sbjct: 51  DIVQAPINAYIGQYLMELYIGTPPI-KISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPL 109

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
            SS+Y  +SC S  C+      CS ++ C+YTYGYADSSLTKGVLA E +T  ++     
Sbjct: 110 KSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPI 169

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
               ++FGCGHNNTG FN++EMGL+GLG    SL SQI    G  KFS CLVPF TD +I
Sbjct: 170 SLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITI 229

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           +S+M FG GSEV G GVV+T LV +E D T Y+VTL GISV +         Y   +  I
Sbjct: 230 SSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDT--------YLPMNSTI 281

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPI 305
            KGNM +D+G PP +LP+  Y+R+  +V+N + L P   DP LG QLCY+T +     P 
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLK-GPT 340

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVE--GVFCFAMQP-IDGDVGIFGNFAQSDLFIGYDFD 362
           LT HF+ GA + L    TFIPP  E  GVFC A+    + D GI+GNFAQ++  IG+D D
Sbjct: 341 LTYHFE-GANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLD 399

Query: 363 SQMVSFKPTDCTKQ 376
            Q+VSFKPTDCTKQ
Sbjct: 400 RQIVSFKPTDCTKQ 413


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 21/376 (5%)

Query: 8   YPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
           Y  + +QS VS  + EY+M+ SIGTPP+  IY   DTGSDL+W QC+PC +CYKQ  P++
Sbjct: 44  YKPSTIQSPVSAYDCEYLMELSIGTPPI-KIYAEADTGSDLVWFQCIPCTKCYKQQNPMF 102

Query: 68  NPASSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           +P SSSSY  ++C +E C+ LD+  CS+ Q+ CNYTY YAD+S+T+GVLA E +T  ++ 
Sbjct: 103 DPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTT 162

Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA--NKFSYCLVPF 181
                F  ++FGCGHNN+G FN+ EMGL+GLGR  LSL SQI S LGA  N FS CLVPF
Sbjct: 163 GEPVAFQGIIFGCGHNNSG-FNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPF 221

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
           +TD SITS+M FG GSEV G G VST L+SK D T YF TL GISV +++     +P+ N
Sbjct: 222 NTDPSITSQMNFGKGSEVLGNGTVSTPLISK-DGTGYFATLLGISVEDIN-----LPFSN 275

Query: 242 SS--GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
            S  G I+KGN+ ID+G   T LP++FY+RL EQVRN + L P++    G +LCY+TP+ 
Sbjct: 276 GSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRID--GYELCYQTPTN 333

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P LT HF+GG  V L     FIP   +  FCFA+   + +   +GN+AQS+  IG+
Sbjct: 334 LN-GPTLTIHFEGG-DVLLTPAQMFIPVQDDN-FCFAVFDTNEEYVTYGNYAQSNYLIGF 390

Query: 360 DFDSQMVSFKPTDCTK 375
           D + Q+VSFK TDCTK
Sbjct: 391 DLERQVVSFKATDCTK 406


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  346 bits (888), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 187/371 (50%), Positives = 255/371 (68%), Gaps = 14/371 (3%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           +N V + V++ NG+Y+MK ++GTPP+ D+YG+VDTGSDL+W QC PC  CY+Q  P++ P
Sbjct: 36  SNGVFTRVTSNNGDYLMKLTLGTPPV-DVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEP 94

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
             S++Y  + C SE+C+ L   SCS Q+LC Y+Y YADSS+TKGVLA E +TF +++   
Sbjct: 95  LRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEP 154

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
               ++VFGCGH+N+G FNEN+MG++GLG   LSL SQ  +  G+ +FS CLVPFH D  
Sbjct: 155 VVVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPH 214

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
               + FG+ S+VSG GV +T LVS+E +T Y VTLEGISVG+   S      +NSS  +
Sbjct: 215 TLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVS------FNSSEML 268

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPI 305
           SKGN+ ID+G P T LP++FY+RL ++++    + P   DP LG+QLCY++ +     PI
Sbjct: 269 SKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLE-GPI 327

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           L AHF+ GA V L+   TFIPP  +GVFCFAM        IFGNFAQS++ IG+D D + 
Sbjct: 328 LIAHFE-GADVQLMPIQTFIPPK-DGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKT 385

Query: 366 VSFKPTDCTKQ 376
           VSFK TDC+ Q
Sbjct: 386 VSFKATDCSNQ 396


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 185/371 (49%), Positives = 251/371 (67%), Gaps = 25/371 (6%)

Query: 8   YPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
           Y +N   + V++ NG+Y+MK ++GTPP+ D+YG+VDT SDL+W QC PC  CYKQ  P++
Sbjct: 15  YASNGPFTRVTSNNGDYLMKLTLGTPPV-DVYGLVDTDSDLVWAQCTPCQGCYKQKNPMF 73

Query: 68  NPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
           +P             ++C+     SCS ++ C+Y Y YAD S TKG+LA E  TF +++ 
Sbjct: 74  DPL------------KECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDG 121

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
               ++++FGCGHNNTGVFNEN+MGL+GLG   LSL SQ+ +  G+ +FS CLVPFH D 
Sbjct: 122 KPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADP 181

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             +  +  G  S+VSG GVV+T LVS+E +T Y VTLEGISVG+       +P +NSS  
Sbjct: 182 HTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGD-----TFVP-FNSSEM 235

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP-YQDPRLGSQLCYKTPSMAGIAP 304
           +SKGN+ ID+G P T LP++FY+RL E+++  I L P + DP LG+QLCYK+ +     P
Sbjct: 236 LSKGNIMIDSGTPETYLPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLE-GP 294

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           ILTAHF+ GA V L+   TFIPP  +GVFCFAM      + IFGNFAQS++ IG+D D +
Sbjct: 295 ILTAHFE-GADVKLLPLQTFIPPK-DGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKR 352

Query: 365 MVSFKPTDCTK 375
           +V FKPTD TK
Sbjct: 353 IVFFKPTDFTK 363


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 173/389 (44%), Positives = 240/389 (61%), Gaps = 24/389 (6%)

Query: 1   MSPATYFYPN-------NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC 53
           MS   +F P        +  QS + +  GEY+MKFS+GTP   DI  I DTGSDL+W QC
Sbjct: 62  MSRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMKFSLGTPAF-DILAIADTGSDLIWTQC 120

Query: 54  LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLL-DTVSCSSQ--QLCNYTYGYADSSL 110
            PC QCY+Q  P+++P SSS+Y+++SC ++QC LL +  SCS +  + C+Y+Y Y D S 
Sbjct: 121 KPCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSF 180

Query: 111 TKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
           T G +A + IT G+++         + GCGHNN G F E   G+VGLG   +SL SQ+ S
Sbjct: 181 TSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGS 240

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV 227
            +   KFSYCLVP  ++++ +SK+ FG+   VSGGGV ST L+SK+  T+YF+TLE +SV
Sbjct: 241 TIDG-KFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSV 299

Query: 228 GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR 287
           G     S+ I +  SS   S+GN+ ID+G   TL P+DF++ L   V++A+  TP +DP 
Sbjct: 300 G-----SERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPS 354

Query: 288 LGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIF 347
               LCY   +     P +TAHFD GA V L   +TF+    + V CFA  PI+    IF
Sbjct: 355 GILSLCYSIDADLKF-PSITAHFD-GADVKLNPLNTFVQVS-DTVLCFAFNPINSG-AIF 410

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
           GN AQ +  +GYD + + VSFKPTDCT+ 
Sbjct: 411 GNLAQMNFLVGYDLEGKTVSFKPTDCTQD 439


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 166/367 (45%), Positives = 221/367 (60%), Gaps = 15/367 (4%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           +S V    G Y+M +S+GTPP   IYGI DTGSD++W+QC PC QCY Q  PI+NP+ SS
Sbjct: 77  ESTVIPDRGGYLMTYSVGTPPT-KIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSS 135

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFD 130
           SYK + C S+ CH +   SCS Q  C Y   Y DSS ++G L+ + ++  +++     F 
Sbjct: 136 SYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFP 195

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSITS 189
            +V GCG +N G F     G+VGLG   +SL +Q+ S +G  KFSYCLVP  + +S+ +S
Sbjct: 196 KIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGG-KFSYCLVPLLNKESNASS 254

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GAISK 248
            + FG+ + VSG GVVST L+ K+D  +YF+TL+  SVGN     K + +  SS G   +
Sbjct: 255 ILSFGDAAVVSGDGVVSTPLI-KKDPVFYFLTLQAFSVGN-----KRVEFGGSSEGGDDE 308

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTA 308
           GN+ ID+G   TL+P D Y  LE  V + +KL    DP     LCY   S     PI+T 
Sbjct: 309 GNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITV 368

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  GA V L   STF+ P  +G+ CFA QP      IFGN AQ +L +GYD   + VSF
Sbjct: 369 HFK-GADVELHSISTFV-PITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426

Query: 369 KPTDCTK 375
           KPTDCTK
Sbjct: 427 KPTDCTK 433


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  291 bits (745), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 166/367 (45%), Positives = 221/367 (60%), Gaps = 15/367 (4%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           +S V    G Y+M +S+GTPP   IYGI DTGSD++W+QC PC QCY Q  PI+NP+ SS
Sbjct: 77  ESTVIPDRGGYLMTYSVGTPPT-KIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSS 135

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFD 130
           SYK + C S+ CH +   SCS Q  C Y   Y DSS ++G L+ + ++  +++     F 
Sbjct: 136 SYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFP 195

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSITS 189
             V GCG +N G F     G+VGLG   +SL +Q+ S +G  KFSYCLVP  + +S+ +S
Sbjct: 196 KTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGG-KFSYCLVPLLNKESNASS 254

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GAISK 248
            + FG+ + VSG GVVST L+ K+D  +YF+TL+  SVGN     K + +  SS G   +
Sbjct: 255 ILSFGDAAVVSGDGVVSTPLI-KKDPVFYFLTLQAFSVGN-----KRVEFGGSSEGGDDE 308

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTA 308
           GN+ ID+G   TL+P D Y  LE  V + +KL    DP     LCY   S     PI+TA
Sbjct: 309 GNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITA 368

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  GA + L   STF+ P  +G+ CFA QP      IFGN AQ +L +GYD   + VSF
Sbjct: 369 HFK-GADIELHSISTFV-PITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426

Query: 369 KPTDCTK 375
           KPTDCTK
Sbjct: 427 KPTDCTK 433


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  290 bits (742), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 178/384 (46%), Positives = 236/384 (61%), Gaps = 15/384 (3%)

Query: 1   MSPATYFYPN----NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC 56
           +S A +F  N    N +QS V + NGEY+M  S+GTPP+  ++GI DTGSDL+W QC PC
Sbjct: 68  ISRANHFRANGVSTNSIQSPVISNNGEYLMNISLGTPPV-SMHGIADTGSDLLWRQCKPC 126

Query: 57  VQCYKQVKPIYNPASSSSYKELSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVL 115
             CY+Q++PI++PA S +Y+ LSC+ + C +L     CS    C Y+Y Y D S T G L
Sbjct: 127 DSCYEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDL 186

Query: 116 ATERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
           A + +T G++         VVFGCGHNN G F  +  GLVGLG   LS+ SQ+   +G  
Sbjct: 187 AVDTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGG- 245

Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSN 232
           +FSYCLVP   D S++SKM+FG+   VSG G VST L S++  T+Y++TLE +SVG+   
Sbjct: 246 RFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKL 305

Query: 233 SSKLIPYYNSSGA-ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
           + K      S  A   +GN+ ID+G   TLLP+DFY  LE  V +AI   P +DP     
Sbjct: 306 AYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFS 365

Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFA 351
           LCY   S   I P +TAHF  GA + L   +TF+    E +FCFAM P+  D+ IFGN A
Sbjct: 366 LCYSNLSGLRI-PTITAHFV-GADLELKPLNTFVQVQ-EDLFCFAMIPVS-DLAIFGNLA 421

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
           Q +  +GYD  S+ VSFKPTDCTK
Sbjct: 422 QMNFLVGYDLKSRTVSFKPTDCTK 445


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 162/370 (43%), Positives = 226/370 (61%), Gaps = 17/370 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS +  + GEY+M   IGTPP+  +  IVDTGSDL W QC PC  CYKQV P+++P +S
Sbjct: 81  IQSRIVPSAGEYLMNLYIGTPPV-PVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNS 139

Query: 73  SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
           S+Y++ SC +  C  L    SCS ++ C + Y YAD S T G LA+E +T  ++      
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVS 199

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F    FGCGH++ G+F+++  G+VGLG   LSL SQ+ S +    FSYCL+P  TDSSI+
Sbjct: 200 FPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING-LFSYCLLPVSTDSSIS 258

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY--YNSSGAI 246
           S++ FG    VSG G VST LV K   T+Y++TLEGISVG      K +PY  Y+    +
Sbjct: 259 SRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGK-----KRLPYKGYSKKTEV 313

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
            +GN+ +D+G   T LP++FY++LE+ V N+IK    +DP     LCY T +    API+
Sbjct: 314 EEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEIN-APII 372

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           TAHF   A V L   +TF+    E + CF + P   D+G+ GN AQ +  +G+D   + V
Sbjct: 373 TAHFK-DANVELQPLNTFMRMQ-EDLVCFTVAPTS-DIGVLGNLAQVNFLVGFDLRKKRV 429

Query: 367 SFKPTDCTKQ 376
           SFK  DCT+ 
Sbjct: 430 SFKAADCTQH 439


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 163/369 (44%), Positives = 216/369 (58%), Gaps = 16/369 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  +S V    GEY+M +S+GTPP  ++YG+VDTGSD++W+QC PC QCYKQ  PI+NP+
Sbjct: 74  NTPESTVYVNGGEYLMTYSVGTPPF-NVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPS 132

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-- 128
            SSSYK + C S  C  +   SC+ Q  C YT  ++D S ++G L+ E +T  ++     
Sbjct: 133 KSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSV 192

Query: 129 -FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F   V GCGHNN G+F     G+VGLG   +SL +Q+ S +G  KFSYCL+P   DS+ 
Sbjct: 193 SFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGG-KFSYCLLPLLVDSNK 251

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           TSK+ FG+ + VSG GVVST  V K+ + +Y++TLE  SVGN     K I +     +  
Sbjct: 252 TSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGN-----KRIEFEVLDDS-E 305

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
           +GN+ +D+G   TLLP   Y  LE  V   +KL    DP     LCY   S     PI+T
Sbjct: 306 EGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPIIT 365

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDFDSQMV 366
           AHF  GA + L   STF     +GV C A        G IFGN AQ +L +GYD    +V
Sbjct: 366 AHFK-GADIKLNPISTF-AHVADGVVCLAF--TSSQTGPIFGNLAQLNLLVGYDLQQNIV 421

Query: 367 SFKPTDCTK 375
           SFKP+DC K
Sbjct: 422 SFKPSDCIK 430


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 174/396 (43%), Positives = 233/396 (58%), Gaps = 31/396 (7%)

Query: 1   MSPATYFYPNNV-----VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP 55
           +S A  F PN++     VQS++    GEY+M+ SIG P + +I  I DTGSDL+WVQC P
Sbjct: 65  ISRANRFKPNSISARALVQSDIVPGGGEYLMRISIGNPQV-EILAIADTGSDLIWVQCQP 123

Query: 56  CVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ---QLCNYTYGYADSSL 110
           C  CYKQ  PI++P  SSSY+ + C +E C+ LD    SC ++   + C YTY Y D S 
Sbjct: 124 CEMCYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSF 183

Query: 111 TKGVLATERITFGNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLAS 163
           + G LA ER   G++N+       +F  V FGCG  N G F+E   G++GLG   +SL S
Sbjct: 184 SDGHLAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVS 243

Query: 164 QILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG--GVVSTSLVSKEDKTYYFVT 221
           Q+  +L + KFSYCLVP    S+ TSK+ FGN   +SG    VVST L+ K+ +TYY++T
Sbjct: 244 QLGPKL-SGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLT 302

Query: 222 LEGISVGNLSNSSKLIPYYN-SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL 280
           LE ISV N     K +PY N  +G + KGN+ ID+G   T L  +F+N L+  V  A+K 
Sbjct: 303 LEAISVEN-----KRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKG 357

Query: 281 TPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI 340
               DP     +C+K      + PI+TAHF  GA V L   +TF     E + CF M P 
Sbjct: 358 ERVSDPHGLFNICFKDEKAIEL-PIITAHFT-GADVELQPVNTFAKVE-EDLLCFTMIP- 413

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
             D+ IFGN AQ +  +GYD + + VSF PTDCTKQ
Sbjct: 414 SNDIAIFGNLAQMNFLVGYDLEKKAVSFLPTDCTKQ 449


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 166/369 (44%), Positives = 224/369 (60%), Gaps = 15/369 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  QS +++  GEY+M  SIGTPP+  I  I DTGSDL+W QC PC  CY+Q  P+++P 
Sbjct: 73  NSPQSFITSNRGEYLMNISIGTPPV-PILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPK 131

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNF- 128
            SS+Y+++SC S QC  L+  SCS+ +  C+YT  Y D+S TKG +A + +T G+S    
Sbjct: 132 ESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRP 191

Query: 129 --FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
               N++ GCGH NTG F+    G++GLG    SL SQ+   +   KFSYCLVPF +++ 
Sbjct: 192 VSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSING-KFSYCLVPFTSETG 250

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           +TSK+ FG    VSG GVVSTS+V K+  TYYF+ LE ISVG     SK I + ++    
Sbjct: 251 LTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVG-----SKKIQFTSTIFGT 305

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
            +GN+ ID+G   TLLP +FY  LE  V + IK    QDP     LCY+  S   + P +
Sbjct: 306 GEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKV-PDI 364

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           T HF GG  V L + +TF+    E V CFA    +  + IFGN AQ +  +GYD  S  V
Sbjct: 365 TVHFKGG-DVKLGNLNTFVAVS-EDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDTVSGTV 421

Query: 367 SFKPTDCTK 375
           SFK TDC++
Sbjct: 422 SFKKTDCSQ 430


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 168/374 (44%), Positives = 224/374 (59%), Gaps = 19/374 (5%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  Q+++    GEY MK SIGTP L+++  I DTGSDL WVQCLPC  CY+Q  P+++P+
Sbjct: 81  NSFQNDLVPNGGEYFMKMSIGTP-LVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPS 139

Query: 71  SSSSYKELSCQSEQCHLLDTV--SCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
            SSSY+ + C S  C+ LD    +C+    +C Y Y Y D S T G LATE+ T G++++
Sbjct: 140 RSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSS 199

Query: 128 ---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
                  +VFGCG  N G F+E   G+VGLG   LSL SQ LS +   KFSYCLVP    
Sbjct: 200 RPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQ-LSSIIKGKFSYCLVPLSEQ 258

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS-- 242
           S++TSK+ FG  S +SG  VVST LVSK+  TYY+VTLE ISVGN     K +PY N   
Sbjct: 259 SNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGN-----KRLPYTNGLL 313

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
           +G + KGN+ ID+G   T L  +F+  LE  +   +K     DPR    +C+++     +
Sbjct: 314 NGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDL 373

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            P++  HF+  A V L   +TF+    E + CF M      +GIFGN AQ D  +GYD +
Sbjct: 374 -PVIAVHFN-DADVKLQPLNTFVKAD-EDLLCFTMIS-SNQIGIFGNLAQMDFLVGYDLE 429

Query: 363 SQMVSFKPTDCTKQ 376
            + VSFKPTDCTK 
Sbjct: 430 KRTVSFKPTDCTKH 443


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 160/367 (43%), Positives = 218/367 (59%), Gaps = 13/367 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS +  + GEY+M  SIGTPP+  +  IVDTGSDL W QC PC  CYKQV P ++P +S
Sbjct: 81  IQSRLVPSAGEYIMNLSIGTPPV-PVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNS 139

Query: 73  SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
           S+Y++ SC +  C  L +  SC + + C + Y YAD S T G LA E +T  ++      
Sbjct: 140 STYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVS 199

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F    FGC H + G+F+E+  G+VGLG   LS+ SQ+ S +   +FSYCL+P  TDSS++
Sbjct: 200 FPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTING-RFSYCLLPVFTDSSMS 258

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYF-VTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           S++ FG    VSG G VST LV K   TYY+ +TLEG SVG    S K    ++    + 
Sbjct: 259 SRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK---GFSKKAEVE 315

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
           +GN+ +D+G   T LP +FY +LEE V ++IK    +DP   S LCY T      API+T
Sbjct: 316 EGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIIT 375

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
           AHF   A V L   +TF+    E + CF + P   D+GI GN AQ +  +G+D   + VS
Sbjct: 376 AHFK-DANVELQPWNTFLRMQ-EDLVCFTVLPTS-DIGILGNLAQVNFLVGFDLRKKRVS 432

Query: 368 FKPTDCT 374
           FK  DCT
Sbjct: 433 FKAADCT 439


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 156/370 (42%), Positives = 221/370 (59%), Gaps = 15/370 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
              +S+V++  GEY+M  S+GTPP   I GI DTGSDL+W QC PC +CYKQV P+++P 
Sbjct: 82  KAAESDVTSNRGEYLMSLSLGTPPF-KIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPK 140

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
           SS +Y++ SC + QC LLD  +CS   +C Y Y Y D S T G +A++ IT  ++     
Sbjct: 141 SSKTYRDFSCDARQCSLLDQSTCSG-NICQYQYSYGDRSYTMGNVASDTITLDSTTGSPV 199

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F   V GCGH N G F++   G+VGLG   LSL SQ+ S +G  KFSYCLVP  + +  
Sbjct: 200 SFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGG-KFSYCLVPLSSRAGN 258

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           +SK+ FG+ + VSG GV ST L+S E   ++YF+TLE +SVGN     + I + +SS   
Sbjct: 259 SSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGN-----ERIKFGDSSLGT 313

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
            +GN+ ID+G   T++P DF++ L   V N ++    +DP     +CY   S   + P +
Sbjct: 314 GEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLKV-PAI 372

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           TAHF  GA V L   +TF+    + V C A       + I+GN AQ +  + Y+   + +
Sbjct: 373 TAHFT-GADVKLKPINTFVQVS-DDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSL 430

Query: 367 SFKPTDCTKQ 376
           SFKPTDCTK+
Sbjct: 431 SFKPTDCTKK 440


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 161/371 (43%), Positives = 229/371 (61%), Gaps = 16/371 (4%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           +N  Q ++++ +GEY+M  S+GTPP   I  I DTGSDL+W QC PC  CY QV P+++P
Sbjct: 80  DNAPQIDLTSNSGEYLMNISLGTPPF-PIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDP 138

Query: 70  ASSSSYKELSCQSEQCHLLDT-VSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNN 127
            +SS+YK++SC S QC  L+   SCS++   C+Y+  Y D S TKG +A + +T G+++ 
Sbjct: 139 KASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDT 198

Query: 128 F---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
                 N++ GCGHNN G FN+   G+VGLG   +SL +Q+   +   KFSYCLVP  ++
Sbjct: 199 RPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDG-KFSYCLVPLTSE 257

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
           +  TSK+ FG  + VSG GVVST L++K  +T+Y++TL+ ISVG     SK + Y  S  
Sbjct: 258 NDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVG-----SKEVQYPGSDS 312

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
              +GN+ ID+G   TLLP +FY+ LE+ V ++I     QDP+ G  LCY       + P
Sbjct: 313 GSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKV-P 371

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +T HFD GA V L  ++ F+    E + CFA +       I+GN AQ +  +GYD  S+
Sbjct: 372 AITMHFD-GADVNLKPSNCFVQIS-EDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSK 428

Query: 365 MVSFKPTDCTK 375
            VSFKPTDC K
Sbjct: 429 TVSFKPTDCAK 439


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 168/373 (45%), Positives = 224/373 (60%), Gaps = 12/373 (3%)

Query: 7   FYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
           F   +  +S V  + GEY+M++S+G+PP   + GIVDTGSD++W+QC PC  CYKQ  PI
Sbjct: 74  FVSTDSAESTVVASQGEYLMRYSVGSPPF-QVLGIVDTGSDILWLQCEPCEDCYKQTTPI 132

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           ++P+ S +YK L C S  C  L   +CSS  +C Y+  Y D S + G L+ E +T G+++
Sbjct: 133 FDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTD 192

Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
                F   V GCGHNN G F E   G+VGLG   +SL SQ+ S +G  KFSYCL P  +
Sbjct: 193 GSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGG-KFSYCLAPIFS 251

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
           +S+ +SK+ FG+ + VSG G VST L     + +YF+TLE  SVG+  N  +     +S 
Sbjct: 252 ESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGD--NRIEFSGSSSSG 309

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
                GN+ ID+G   TLLP++ Y  LE  V + IKL   +DP     LCYKT S     
Sbjct: 310 SGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDELDL 369

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDFD 362
           P++TAHF  GA V L   STF+P   +GV CFA   I   +G IFGN AQ +L +GYD  
Sbjct: 370 PVITAHFK-GADVELNPISTFVPVE-KGVVCFAF--ISSKIGAIFGNLAQQNLLVGYDLV 425

Query: 363 SQMVSFKPTDCTK 375
            + VSFKPTDCTK
Sbjct: 426 KKTVSFKPTDCTK 438


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 165/370 (44%), Positives = 226/370 (61%), Gaps = 19/370 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S +    GEY+M  S+GTPP  +I  I DTGSDL+W QC PC +CYKQ+ P+++P SS
Sbjct: 82  VESEIIANGGEYLMSLSLGTPPF-EILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSS 140

Query: 73  SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
            +Y++LSC + QC  L ++ SCSS+QLC Y+Y Y D S T G LA + +T  ++N    +
Sbjct: 141 KTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVY 200

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-I 187
           F   V GCG  N G F++ + G++GLG   +SL SQ+ S +G  KFSYCLVPF ++S+  
Sbjct: 201 FPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGG-KFSYCLVPFSSESAGN 259

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           +SK++FG  + VSG GV ST L+SK   T+Y++TLE +SVG+     K I +  SS   S
Sbjct: 260 SSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGD-----KKIEFGGSSFGGS 314

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNA-IKLTPYQDPRLGSQLCYK-TPSMAGIAPI 305
           +GN+ ID+G   TL P +F+      V NA I     QD       CY+ TP +    P+
Sbjct: 315 EGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLK--VPV 372

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +TAHF+ GA V L   +TFI    + V C A         IFGN AQ +  IGYD   + 
Sbjct: 373 ITAHFN-GADVVLQTLNTFILIS-DDVLCLAFNSTQSG-AIFGNVAQMNFLIGYDIQGKS 429

Query: 366 VSFKPTDCTK 375
           VSFKPTDCT+
Sbjct: 430 VSFKPTDCTQ 439


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 159/385 (41%), Positives = 225/385 (58%), Gaps = 20/385 (5%)

Query: 1   MSPATYFYP---NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV 57
           ++ A +FY     N+ QS V    GEY+M +S+GTPP   +YGIVDTGSD++W+QC PC 
Sbjct: 61  INRANHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPF-KLYGIVDTGSDIVWLQCEPCQ 119

Query: 58  QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
           +CY Q  P++NP+ SSSYK + C S+ C  ++  SC+ +  C Y+  Y D+S + G L+ 
Sbjct: 120 ECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSV 179

Query: 118 ERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           + +T  ++N     F N+V GCG NN   +     G+VG G    S  +Q+ S  G  KF
Sbjct: 180 DTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGG-KF 238

Query: 175 SYCLVPFHTDSSI----TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNL 230
           SYCL P  + ++I    TSK+ FG+ + VSG GVV+T ++ K+ +T+Y++TLE  SVGN 
Sbjct: 239 SYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNR 298

Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
                 +P        ++GN+ ID+G   T L KD Y+ LE  V + +KL    DP    
Sbjct: 299 RVEIGGVP-----NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTL 353

Query: 291 QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
            LCY   +     PI+T HF  GA V L   STF+    +GVFC A +    D  IFGN 
Sbjct: 354 NLCYSVKAEGYDFPIITMHFK-GADVDLHPISTFV-SVADGVFCLAFES-SQDHAIFGNL 410

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
           AQ +L +GYD   ++VSFKP+DCTK
Sbjct: 411 AQQNLMVGYDLQQKIVSFKPSDCTK 435


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 164/370 (44%), Positives = 228/370 (61%), Gaps = 11/370 (2%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N +QS+V +  G Y+M  S+GTPP+  + GI DTGSDL+W QCLPC  CY+QV+P+++P 
Sbjct: 81  NDIQSDVISGGGAYLMNISLGTPPV-PMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPK 139

Query: 71  SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
            S +YK L C +E C  L    SC     C Y+Y Y D S T+G L+++ +T G++    
Sbjct: 140 ESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDP 199

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F  + FGCGH+N G FNE + GL+GLG   LSL  Q+ S++G  +FSYCLVP  +DS+
Sbjct: 200 ASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGG-QFSYCLVPLSSDST 258

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GA 245
           ++SK+ FG    VSG G VST L+     T+Y++TLEG+SVG+ + + K      SS  A
Sbjct: 259 VSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAA 318

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
           + +GN+ ID+G   TLLP+DFY  +E  + NAI      DP     LCY + +   I P 
Sbjct: 319 VEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEI-PT 377

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +TAHF  GA V L   +TF+    E + CF+M P   ++ IFGN AQ +  +GYD  +  
Sbjct: 378 ITAHFT-GADVQLPPLNTFVQVQ-EDLVCFSMIP-SSNLAIFGNLAQINFLVGYDLKNNK 434

Query: 366 VSFKPTDCTK 375
           VSFK TDCT+
Sbjct: 435 VSFKQTDCTE 444


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 156/386 (40%), Positives = 231/386 (59%), Gaps = 23/386 (5%)

Query: 1   MSPATYFY-PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC 59
           ++ A +F+  +   ++ ++  +GEY++ +S+G PP   +YGI+DTGSD++W+QC PC +C
Sbjct: 62  VNRANHFHKAHKAAKATITQNDGEYLISYSVGIPPF-QLYGIIDTGSDMIWLQCKPCEKC 120

Query: 60  YKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLAT 117
           Y Q   I++P+ S++YK L   S  C  ++  SCSS  +++C YT  Y D S ++G L+ 
Sbjct: 121 YNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSV 180

Query: 118 ERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA--N 172
           E +T G++N     F   V GCG NNT  F     G+VGLG   +SL +Q+  +  +   
Sbjct: 181 ETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGR 240

Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSN 232
           KFSYCL      S+I+SK+ FG+ + VSG G VST +V+ + K +Y++TLE  SVGN   
Sbjct: 241 KFSYCLASM---SNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGN--- 294

Query: 233 SSKLIPYYNSSGAI-SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
               I + +SS     KGN+ ID+G   TLLP D Y++LE  V + ++L   +DP     
Sbjct: 295 --NRIEFTSSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLS 352

Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNF 350
           LCY++      AP++ AHF  GA V L   +TFI    +GV C A   I   +G IFGN 
Sbjct: 353 LCYRSTFDELNAPVIMAHF-SGADVKLNAVNTFIEVE-QGVTCLAF--ISSKIGPIFGNM 408

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTKQ 376
           AQ +  +GYD   ++VSFKPTDC+KQ
Sbjct: 409 AQQNFLVGYDLQKKIVSFKPTDCSKQ 434


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  272 bits (695), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 164/368 (44%), Positives = 226/368 (61%), Gaps = 17/368 (4%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q ++++ +GEY+M  SIGTPP   I  I DTGSDL+W QC PC  CY QV P+++P +SS
Sbjct: 80  QIDLTSNSGEYLMNVSIGTPPF-PIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSS 138

Query: 74  SYKELSCQSEQCHLLDT-VSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF--- 128
           +YK++SC S QC  L+   SCS+    C+Y+  Y D+S TKG +A + +T G+S+     
Sbjct: 139 TYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQ 198

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N++ GCGHNN G FN+   G+VGLG   +SL  Q+   +   KFSYCLVP  +    T
Sbjct: 199 LKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYCLVPLTSKKDQT 257

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           SK+ FG  + VSG GVVST L++K   +T+Y++TL+ ISVG     SK I Y  S    S
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVG-----SKQIQYSGSDSESS 312

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
           +GN+ ID+G   TLLP +FY+ LE+ V ++I     QDP+ G  LCY       + P++T
Sbjct: 313 EGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKV-PVIT 371

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HFD GA V L  ++ F+    E + CFA +       I+GN AQ +  +GYD  S+ VS
Sbjct: 372 MHFD-GADVKLDSSNAFVQVS-EDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVS 428

Query: 368 FKPTDCTK 375
           FKPTDC K
Sbjct: 429 FKPTDCAK 436


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 164/368 (44%), Positives = 226/368 (61%), Gaps = 17/368 (4%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q ++++ +GEY+M  SIGTPP   I  I DTGSDL+W QC PC  CY QV P+++P +SS
Sbjct: 80  QIDLTSNSGEYLMNVSIGTPPF-PIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSS 138

Query: 74  SYKELSCQSEQCHLLDT-VSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF--- 128
           +YK++SC S QC  L+   SCS+    C+Y+  Y D+S TKG +A + +T G+S+     
Sbjct: 139 TYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQ 198

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N++ GCGHNN G FN+   G+VGLG   +SL  Q+   +   KFSYCLVP  +    T
Sbjct: 199 LKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYCLVPLTSKKDQT 257

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           SK+ FG  + VSG GVVST L++K   +T+Y++TL+ ISVG     SK I Y  S    S
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVG-----SKQIQYSGSDSESS 312

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
           +GN+ ID+G   TLLP +FY+ LE+ V ++I     QDP+ G  LCY       + P++T
Sbjct: 313 EGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKV-PVIT 371

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HFD GA V L  ++ F+    E + CFA +       I+GN AQ +  +GYD  S+ VS
Sbjct: 372 MHFD-GADVKLDSSNAFVQVS-EDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVS 428

Query: 368 FKPTDCTK 375
           FKPTDC K
Sbjct: 429 FKPTDCAK 436


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 169/389 (43%), Positives = 232/389 (59%), Gaps = 27/389 (6%)

Query: 1   MSPATYFYPNNV-----VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP 55
           +S A  F PN+V     ++ ++    GEY M+ SIGTPP+ ++  I DTGSDL+WVQC P
Sbjct: 66  ISRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISIGTPPI-EVLVIADTGSDLIWVQCQP 124

Query: 56  CVQCYKQVKPIYNPASSSSYKELSCQSEQCHLL--DTVSCSSQ---QLCNYTYGYADSSL 110
           C +CYKQ  PI+NP  SS+Y+ + C++  C+ L  D  +CS+    + C Y+Y Y D S 
Sbjct: 125 CQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSF 184

Query: 111 TKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
           T G LATER   G++NN    + FGCG++N G F+E   G+VGLG   LSL SQ+ +++ 
Sbjct: 185 TMGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKID 244

Query: 171 ANKFSYCLVPFHTDSSIT-SKMYFGNGSEVSGGGV-VSTSLVSKEDKTYYFVTLEGISVG 228
            NKFSYCLVP    S+ +  K+ FG+ S +SG    VST LVSKE +T+Y++TLE ISVG
Sbjct: 245 -NKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVG 303

Query: 229 NLSNSSKLIPYYNSS--GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
           N     + + Y NS   G + KGN+ ID+G   T L    YN+LE  +  A++     DP
Sbjct: 304 N-----ERLAYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDP 358

Query: 287 RLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG 345
                +C++     GI  PI+T HF   A V L   +TF     E + CF M P +G + 
Sbjct: 359 NGIFSICFR--DKIGIELPIITVHFT-DADVELKPINTFAKAE-EDLLCFTMIPSNG-IA 413

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           IFGN AQ +  +GYD D   VSF PTDC+
Sbjct: 414 IFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 157/306 (51%), Positives = 194/306 (63%), Gaps = 18/306 (5%)

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNF--FDNVVFG 135
           SC S  CH LDT  CS ++ CNYTYGY D+SLTKGVLA +  TF  N+         +FG
Sbjct: 20  SCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFG 79

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CGHNNTG FN++EMGL+GLG    SL SQI    G  KFS CLVPF TD  I+S+M FG 
Sbjct: 80  CGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGK 139

Query: 196 GSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
           GS+V G GVV+T LV +E D T YFVTL GISV +         Y   +  I KGNM +D
Sbjct: 140 GSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDT--------YLPMNSTIEKGNMLVD 191

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
           +G PP +LP+  Y+R+  +V+N + L     DP LG QLCY+T +     P LT HF+ G
Sbjct: 192 SGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLK-GPTLTYHFE-G 249

Query: 314 AKVPLIHTSTFIPPPVE--GVFCFAMQP-IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           A + L    TFIPP  E  GVFC A+    + + G++GNFAQS+  IG+D D Q+VSFK 
Sbjct: 250 ANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKA 309

Query: 371 TDCTKQ 376
           TDCTKQ
Sbjct: 310 TDCTKQ 315


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 163/371 (43%), Positives = 221/371 (59%), Gaps = 11/371 (2%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N +QSNV +  G Y+M  S+GTPP+  + GI DTGSDL+W QCLPC  CYKQV+P+++P 
Sbjct: 81  NDIQSNVISGGGSYLMNISLGTPPV-SMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPK 139

Query: 71  SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
            S +YK L C ++ C  L    SC     C  +Y Y D S T+  L++E  T G++    
Sbjct: 140 KSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDP 199

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F  + FGCGH+N G FNE + GL+GLG   LSL  Q+ S++G  +FSYCLVP  +DS+
Sbjct: 200 ASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDST 258

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GA 245
            +SK+ FG  + VSG G VST L+     T+Y++TLEG+S+G+   + K      SS  A
Sbjct: 259 ASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAA 318

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
             + N+ ID+G   TLLP+DFY  +E  +   I      DPR    LCY       I P 
Sbjct: 319 AEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI-PT 377

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +TAHF  GA V L   +TF+    E + CF+M P   ++ IFGN +Q +  +GYD  +  
Sbjct: 378 ITAHFI-GADVQLPPLNTFVQAQ-EDLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNK 434

Query: 366 VSFKPTDCTKQ 376
           VSFKPTDCTKQ
Sbjct: 435 VSFKPTDCTKQ 445


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 160/375 (42%), Positives = 217/375 (57%), Gaps = 18/375 (4%)

Query: 7   FYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
           F   N  ++ V +A GEY++ +S+GTP L  ++GI+DTGSD++W+QC PC +CY+Q  PI
Sbjct: 72  FVSPNSPETTVISALGEYLISYSVGTPSL-QVFGILDTGSDIIWLQCQPCKKCYEQTTPI 130

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           ++ + S +YK L C S  C  +    CSS++ C Y+  Y D S + G L+ E +T G++N
Sbjct: 131 FDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTN 190

Query: 127 NF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
                F   V GCG  N     E   G+VGLGR  +SL +Q+    G  KFSYCLVP   
Sbjct: 191 GSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGG-KFSYCLVP--G 247

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            S+ +SK+ FGN + VSG G VST L SK    +YF+TLE  SVG   N  +    + S 
Sbjct: 248 LSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGR--NRIE----FGSP 301

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TP-SMAG 301
           G+  KGN+ ID+G   T LP   Y++LE  V   + L   +DP     LCYK TP  +  
Sbjct: 302 GSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDA 361

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P++TAHF  GA V L   +TF+    + V CFA QP +    +FGN AQ +L +GYD 
Sbjct: 362 SVPVITAHFS-GADVTLNAINTFV-QVADDVVCFAFQPTETG-AVFGNLAQQNLLVGYDL 418

Query: 362 DSQMVSFKPTDCTKQ 376
               VSFK TDCTKQ
Sbjct: 419 QMNTVSFKHTDCTKQ 433


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/367 (40%), Positives = 218/367 (59%), Gaps = 24/367 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS++   +GEY+M  SIGTPP+ D  GI DTGSDL W QCLPC++CY+Q++PI+NP  S
Sbjct: 81  LQSSIGPGSGEYLMSVSIGTPPV-DYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKS 139

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  + C ++ CH +D   C  Q +C+Y+Y Y D + +KG L  E+IT G+S+      
Sbjct: 140 TSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSV---KS 196

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKM 191
           V GCGH ++G F     G++GLG  +LSL SQ+    G + +FSYCL    + ++   K+
Sbjct: 197 VIGCGHASSGGFGFAS-GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKI 253

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG  + VSG GVVST L+SK   TYY++TLE IS+GN  + +             +GN+
Sbjct: 254 NFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMA----------FAKQGNV 303

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA---PILTA 308
            ID+G   T+LPK+ Y+ +   +   +K    +DP     LC+     A  +   P++TA
Sbjct: 304 IIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITA 363

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQMV 366
           HF GGA V L+  +TF     + V C  ++      + GI GN AQ++  IGYD +++ +
Sbjct: 364 HFSGGANVNLLPINTF-RKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRL 422

Query: 367 SFKPTDC 373
           SFKPT C
Sbjct: 423 SFKPTVC 429


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 150/365 (41%), Positives = 211/365 (57%), Gaps = 32/365 (8%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           QS V++  GEY+M +SIGTPP   ++G VDTGSDL+W+QC PC QCY Q+ PI++P+ SS
Sbjct: 78  QSTVNSDKGEYLMSYSIGTPPF-KVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSS 136

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
           SY+ + C S+ CH + T SC                  +G L+ E +T  ++  +   F 
Sbjct: 137 SYQNIPCLSDTCHSMRTTSCD----------------VRGYLSVETLTLDSTTGYSVSFP 180

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
             + GCG+ NTG F+    G+VGLG   +SL SQ+ + +G  KFSYCL P+  +S  TSK
Sbjct: 181 KTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGG-KFSYCLGPWLPNS--TSK 237

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG+ + V G G ++T +V K+ ++ Y++TLE  SVGN     KLI +   +   ++GN
Sbjct: 238 LNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGN-----KLIEFGGPTYGGNEGN 292

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
           + ID+G   T LP D Y R E  V   I L   +DP    +LCY        AP++TAHF
Sbjct: 293 ILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHGFEAPLITAHF 352

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             GA + L + STFI    +G+ C A  P      IFGN AQ +L +GY+     V+FKP
Sbjct: 353 K-GADIKLYYISTFIKVS-DGIACLAFIP--SQTAIFGNVAQQNLLVGYNLVQNTVTFKP 408

Query: 371 TDCTK 375
            DCTK
Sbjct: 409 VDCTK 413


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 167/373 (44%), Positives = 225/373 (60%), Gaps = 16/373 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  +S V  + GEY+M +S+GTPP   I GIVDTGSD++W+QC PC  CY Q  PI++P+
Sbjct: 81  NTAESTVIASQGEYLMSYSVGTPPF-QILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPS 139

Query: 71  SSSSYKELSCQSEQCHLLDT-VSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
            S +YK L C S  C  + +  SCSS    C YT  Y D+S ++G L+ E +T G+++  
Sbjct: 140 QSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGS 199

Query: 129 ---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
              F   V GCGHNN G F     G+VGLG   +SL SQ+ S +G  KFSYCL P  + S
Sbjct: 200 SVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGG-KFSYCLAPLFSQS 258

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           + +SK+ FG+ + VSG G VST +V K    +YF+TLE  SVG   ++       +   +
Sbjct: 259 NSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVG---DNRIEFGSSSFESS 315

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-AP 304
             +GN+ ID+G   T+LP+D Y  LE  V +AI+L   +DP    +LCY+T S   +  P
Sbjct: 316 GGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVP 375

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDFDS 363
           ++TAHF  GA V L   STFI    EGV CFA +     +G IFGN AQ +L +GYD   
Sbjct: 376 VITAHFK-GADVELNPISTFIEVD-EGVVCFAFR--SSKIGPIFGNLAQQNLLVGYDLVK 431

Query: 364 QMVSFKPTDCTKQ 376
           Q VSFKPTDCT++
Sbjct: 432 QTVSFKPTDCTQE 444


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  258 bits (658), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 170/383 (44%), Positives = 231/383 (60%), Gaps = 21/383 (5%)

Query: 5   TYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
           ++    N  +S V  + GEY+M +S+GTPP  +I G+VDTGS + W+QC  C  CY+Q  
Sbjct: 78  SFVASTNTAESTVKASQGEYLMSYSVGTPPF-EILGVVDTGSGITWMQCQRCEDCYEQTT 136

Query: 65  PIYNPASSSSYKELSCQSEQCH-LLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITF 122
           PI++P+ S +YK L C S  C  ++ T SCSS ++ C YT  Y D S ++G L+ E +T 
Sbjct: 137 PIFDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTL 196

Query: 123 GNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
           G++N     F N V GCGHNN G F     G+VGLG   +SL SQ+ S +G  KFSYCL 
Sbjct: 197 GSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGG-KFSYCLA 255

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIP 238
           P  + S+ +SK+ FG+ + VSG G VST LVSK   + +Y++TLE  SVG+     K I 
Sbjct: 256 PMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGD-----KRIE 310

Query: 239 YY----NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
           +     +S  +  +GN+ ID+G   TLLP++ Y+ LE  V +AI+     DP     LCY
Sbjct: 311 FVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCY 370

Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQS 353
           + TPS     P++TAHF  GA V L   STF+    EGV CFA    +  V IFGN AQ 
Sbjct: 371 QTTPSGQLDVPVITAHFK-GADVELNPISTFV-QVAEGVVCFAFHSSEV-VSIFGNLAQL 427

Query: 354 DLFIGYDFDSQMVSFKPTDCTKQ 376
           +L +GYD   Q VSFKPTDCT++
Sbjct: 428 NLLVGYDLMEQTVSFKPTDCTQE 450


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 155/373 (41%), Positives = 218/373 (58%), Gaps = 20/373 (5%)

Query: 10  NNVVQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           +N V+S V+  + G+Y+M +S+GTPP   +YGIVDT SD++WVQC  C  CY    P+++
Sbjct: 73  SNAVESPVTLLDDGDYLMSYSLGTPPF-PVYGIVDTASDIIWVQCQLCETCYNDTSPMFD 131

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           P+ S +YK L C S  C  +   SCSS  +++C +T  Y D S ++G L  E +T G+ N
Sbjct: 132 PSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYN 191

Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
           +    F   V GC  N    F  + +G+VGLG   +SL  Q+ S + + KFSYCL P   
Sbjct: 192 DPFVHFPRTVIGCIRNTNVSF--DSIGIVGLGGGPVSLVPQLSSSI-SKKFSYCLAPI-- 246

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            S  +SK+ FG+ + VSG G VST +V K+ K +Y++TLE  SVGN    +++    +SS
Sbjct: 247 -SDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGN----NRIEFRSSSS 301

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
            +  KGN+ ID+G   T+LP D Y++LE  V + +KL   +DP     LCYK+       
Sbjct: 302 RSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDV 361

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P++TAHF  GA V L   +TFI      V C A         IFGN AQ +  +GYD   
Sbjct: 362 PVITAHF-SGADVKLNALNTFIVAS-HRVVCLAFLSSQSG-AIFGNLAQQNFLVGYDLQR 418

Query: 364 QMVSFKPTDCTKQ 376
           ++VSFKPTDCTKQ
Sbjct: 419 KIVSFKPTDCTKQ 431


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 161/384 (41%), Positives = 223/384 (58%), Gaps = 25/384 (6%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+++  + GEY+M  SIGTPP   I  I DTGSDL W+Q  PC QCY Q  PI++P++S+
Sbjct: 70  QTDLLPSGGEYMMNLSIGTPPF-PILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNST 128

Query: 74  SYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           ++ +L C +  C+ LD    SC+    C YTY Y D S T G LA++ +T GN++    N
Sbjct: 129 TFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRN 188

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH-------TD 184
           V FGCG  N G F+E   G+VGLG   LS  SQ+   +G  KFSYCL+P         +D
Sbjct: 189 VAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIG-KKFSYCLLPLENEISSQPSD 247

Query: 185 SSITSKMYFGNG---SEVSGGGVV--STSLVSKEDKTYYFVTLEGISVGN-----LSNSS 234
           S  TS++ FG+    S  S  GVV  +T LV+KE  TYY++T+E I+VG       S+SS
Sbjct: 248 SPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSS 307

Query: 235 KLIPYYN-SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QL 292
           K   Y + S  ++ +GN+ ID+G   T L ++FY  LE  +   IK+    D +     L
Sbjct: 308 KTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSL 367

Query: 293 CYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           C+K+       P++  HF GGA V L   +TF+    EG+ CF M P + DVGI+GN AQ
Sbjct: 368 CFKSGKEEVELPLMKVHFRGGADVELKPVNTFVRAE-EGLVCFTMLPTN-DVGIYGNLAQ 425

Query: 353 SDLFIGYDFDSQMVSFKPTDCTKQ 376
            +  +GYD   + VSF P DC+KQ
Sbjct: 426 MNFVVGYDLGKRTVSFLPADCSKQ 449


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 159/386 (41%), Positives = 212/386 (54%), Gaps = 42/386 (10%)

Query: 1   MSPATYFYPN---NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV 57
           ++ A +FY     N  QS V   +GEY+M +S+GTPP   +YGI DTGSD++W+QC PC 
Sbjct: 61  INRANHFYKTALTNTPQSTVIPDHGEYLMTYSVGTPPF-KLYGIADTGSDIVWLQCEPCK 119

Query: 58  QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
           +CY Q  P + P+ SS+YK + C S+ C        S QQ               G L+ 
Sbjct: 120 ECYNQTTPKFKPSKSSTYKNIPCSSDLCK-------SGQQ---------------GNLSV 157

Query: 118 ERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           + +T  +S      F   V GCG +NT  F     G+VGLG    SL +Q+ S + A KF
Sbjct: 158 DTLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDA-KF 216

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSS 234
           SYCL+P   +S+ TSK+ FG+ + VSG GVVST +V K+   +Y++TLE  SVGN     
Sbjct: 217 SYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGN----- 271

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
           K I +  SS    +GN+ ID+G   T++P D YN LE  V   +KL    DP     LCY
Sbjct: 272 KRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCY 331

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP----IDGD-VGIFGN 349
              S     PI+T HF  GA V L   STF+    +G+ C A       I  D V IFGN
Sbjct: 332 SVTSDGYDFPIITTHFK-GADVKLHPISTFV-DVADGIVCLAFATTSAFIPSDVVSIFGN 389

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCTK 375
            AQ +L +GYD   ++VSFKPTDC+K
Sbjct: 390 LAQQNLLVGYDLQQKIVSFKPTDCSK 415


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 147/369 (39%), Positives = 215/369 (58%), Gaps = 26/369 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q+ ++  +GEY+M  SIGTPP+ D  G+ DTGSDLMW QCLPC++CYKQ +PI++P  S
Sbjct: 81  LQAPLTPGSGEYLMSVSIGTPPV-DYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKS 139

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  + C S+ C  +D   C +Q +C+Y+Y Y D + TKG L  E+IT G+S+      
Sbjct: 140 TSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV---KS 196

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKM 191
           V GCGH +         G++GLG  +LSL SQ+    G + +FSYCL    + ++   K+
Sbjct: 197 VIGCGHESG-GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKI 253

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG  + VSG GVVST L+SK   TYY+VTLE IS+GN  + +          +  +GN+
Sbjct: 254 NFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMA----------SAKQGNV 303

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY----KTPSMAGIAPILT 307
            ID+G   + LPK+ Y+ +   +   +K    +DP     LC+       + +GI PI+T
Sbjct: 304 IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGI-PIIT 362

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQM 365
           A F GGA V L+  +TF       V C  + P     + GI GN A ++  IGYD +++ 
Sbjct: 363 AQFSGGANVNLLPVNTF-QKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 421

Query: 366 VSFKPTDCT 374
           +SFKPT CT
Sbjct: 422 LSFKPTVCT 430


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 149/360 (41%), Positives = 210/360 (58%), Gaps = 21/360 (5%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY++ +S+GTPP   +YG +DTGS+++W+QC PC  C+ Q  PI+NP+ SSSYK + C 
Sbjct: 87  GEYLISYSVGTPPF-KVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCT 145

Query: 82  SEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGN---SNNFFDNVV 133
           S  C   DT    +SCS+   +C Y+  Y   + ++G L+ + +T  +   S+  F N+V
Sbjct: 146 SSTCK--DTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIV 203

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
            GCGH N    N    G+VG+GR  +SL  Q+ S    +KFSYCL+P+++DS+ +SK+ F
Sbjct: 204 IGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIF 263

Query: 194 GNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           G    VSG  VVST +V     + YYF+TLE  SVGN       I Y   S A S  N+ 
Sbjct: 264 GEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGN-----NRIEYGERSNA-STQNIL 317

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG 312
           ID+G P T+LP  F ++L   V   +KL   + P     LCY T       P +TAHF+ 
Sbjct: 318 IDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHFN- 376

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           GA V L    TF P   +G+ CF     +G + IFGN AQ++L I YD + +++SFKPTD
Sbjct: 377 GADVKLNSNGTFFPFE-DGIMCFGFISSNG-LEIFGNIAQNNLLIDYDLEKEIISFKPTD 434


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 154/366 (42%), Positives = 207/366 (56%), Gaps = 23/366 (6%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           A   YVM +SIGTPP   +YG+VDTGSD +W QC PC  C  Q  PI+NP+ SS+YK + 
Sbjct: 86  AGSYYVMSYSIGTPPF-QLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIR 144

Query: 80  CQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNN----FFDNVV 133
           C S  C   +   CSS  ++ C Y   Y D S ++G ++ + +T  NSN+     F  +V
Sbjct: 145 CSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTL-NSNDGSPISFPKIV 203

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
            GCGH N+        G++G GR   S+ SQ+ S +G  KFSYCL    + ++I+SK+YF
Sbjct: 204 IGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGG-KFSYCLASLFSKANISSKLYF 262

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN----LSNSSKLIPYYNSSGAISKG 249
           G+ + VSG GVVST L+       YF  LE  SVG+    L +SS LIP        ++G
Sbjct: 263 GDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSS-LIP-------DNEG 314

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAH 309
           N  ID+G+  T LP D Y++LE  V + +KL   +DP     LCYKT       PI+TAH
Sbjct: 315 NAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPIITAH 374

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F  GA V L   +TFI    E V CFA         ++GN AQ +  +GYD    ++SFK
Sbjct: 375 FR-GADVKLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFK 432

Query: 370 PTDCTK 375
           PT+CTK
Sbjct: 433 PTNCTK 438


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 152/378 (40%), Positives = 216/378 (57%), Gaps = 21/378 (5%)

Query: 7   FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
           F PN V    VS   G+ Y++ F IGTPP   +YG++DT +D +W QC PC  C+    P
Sbjct: 71  FPPNKVPNIVVSPFMGDGYIISFLIGTPPF-QLYGVMDTANDNIWFQCNPCKPCFNTTSP 129

Query: 66  IYNPASSSSYKELSCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFG 123
           +++P+ SS+YK + C S +C  ++   CSS  +++C Y++ Y   + ++G L+ + +T  
Sbjct: 130 MFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTL- 188

Query: 124 NSNN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
           NSNN     F N+V GCGH N G       G +GLGR  LS  SQ+ S +G  KFSYCLV
Sbjct: 189 NSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGG-KFSYCLV 247

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
           P  ++  I+ K++FG+ S VSG G VST + + E    Y  TL  +SVG+      +I +
Sbjct: 248 PLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGE--IGYSTTLNALSVGD-----HIIKF 300

Query: 240 YNSSGAISK-GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
            NS+      GN  ID+G   T+LP++ Y+RLE  V + +KL   + P    +LCYK   
Sbjct: 301 ENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATL 360

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFI 357
                PI+TAHF+ GA V L   +TF P   E V CFA   +    G I GN AQ +  +
Sbjct: 361 KNLDVPIITAHFN-GADVHLNSLNTFYPIDHE-VVCFAFVSVGNFPGTIIGNIAQQNFLV 418

Query: 358 GYDFDSQMVSFKPTDCTK 375
           G+D    ++SFKPTDCTK
Sbjct: 419 GFDLQKNIISFKPTDCTK 436


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 141/352 (40%), Positives = 205/352 (58%), Gaps = 26/352 (7%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
           IGTPP+ D  GI DTGSDL W QCLPC++CY+Q++PI+NP  S+S+  + C ++ CH +D
Sbjct: 86  IGTPPV-DYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD 144

Query: 90  TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
              C  Q +C+Y+Y Y D + +KG L  E+IT G+S+      V GCGH ++G F     
Sbjct: 145 DGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSS---VKSVIGCGHASSGGFGFAS- 200

Query: 150 GLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS 208
           G++GLG  +LSL SQ+    G + +FSYCL    + ++   K+ FG  + VSG GVVST 
Sbjct: 201 GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKINFGQNAVVSGPGVVSTP 258

Query: 209 LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
           L+SK   TYY++TLE IS+GN  + +             +GN+ ID+G   + LPK+ Y+
Sbjct: 259 LISKNTVTYYYITLEAISIGNERHMA----------FAKQGNVIIDSGTTLSFLPKELYD 308

Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCY----KTPSMAGIAPILTAHFDGGAKVPLIHTSTF 324
            +   +   +K    +DP     LC+       + +GI PI+TA F GGA V L+  +TF
Sbjct: 309 GVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGI-PIITAQFSGGANVNLLPVNTF 367

Query: 325 IPPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                  V C  + P     + GI GN A ++  IGYD +++ +SFKPT CT
Sbjct: 368 -QKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 155/370 (41%), Positives = 216/370 (58%), Gaps = 21/370 (5%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N  Q++++   GEY+M  S+GTPP   I  + DTGS+L+W QC PC  CY QV P+++P 
Sbjct: 81  NSPQTDITPCGGEYLMNLSLGTPPS-PIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPK 139

Query: 71  SSSSYKELSCQSEQCHLLDT-VSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           +SS+YK++SC S QC  L+   SCS++ + C+Y   YAD S T G  A + +T G+++N 
Sbjct: 140 ASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNR 199

Query: 129 ---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
                N++ GCG NN   F     G+VGLG   +SL  Q+   +   KFSYCLVP   ++
Sbjct: 200 PVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDG-KFSYCLVP---EN 255

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             TSK+ FG  + VSG G VST LV K   T+Y++TL+ ISVG     SK +   +S+  
Sbjct: 256 DQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVG-----SKNMQTPDSN-- 308

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
             KGNM ID+G   TLLP  +Y  +E  V + I     +D R+GS LCY   +   I P+
Sbjct: 309 -IKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLNI-PV 366

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +T HF+ GA V L   ++F     E + C A        GI+GN AQ +  +GYD  S+ 
Sbjct: 367 ITMHFE-GADVKLYPYNSFF-KVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKT 424

Query: 366 VSFKPTDCTK 375
           +SFKPTDC K
Sbjct: 425 MSFKPTDCAK 434


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  243 bits (621), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 146/370 (39%), Positives = 209/370 (56%), Gaps = 18/370 (4%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           +S V +  G+Y+M +S+GTPP+   YGIVDTGSD++W+QC PC QCY Q  P +NP+ SS
Sbjct: 77  ESTVISYEGDYIMSYSVGTPPIKS-YGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSS 135

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
           SYK +SC S+ C  +   SC+ ++ C Y+  Y + S ++G L+ E +T  ++      F 
Sbjct: 136 SYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFP 195

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD----SS 186
             V GCG NN G F     G+VGLG    SL +Q+   +G  KFSYCLV         S 
Sbjct: 196 KTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGG-KFSYCLVRMSITLKNMSM 254

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            +SK+ FG+ + VSG  V+ST +V K+   +Y++T+E  SVG+     K + +  SS  +
Sbjct: 255 GSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGD-----KRVEFAGSSKGV 309

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PI 305
            +GN+ ID+    T +P D Y +L   + + + L    DP     LCY   S      P 
Sbjct: 310 EEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPY 369

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +TAHF  GA + L  T+TF+      V CFA  P +G   IFG+F+Q D  +GYD   + 
Sbjct: 370 MTAHFK-GADILLYATNTFV-EVARDVLCFAFAPSNGG-AIFGSFSQQDFMVGYDLQQKT 426

Query: 366 VSFKPTDCTK 375
           VSFK  DCT+
Sbjct: 427 VSFKSVDCTE 436


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 148/364 (40%), Positives = 206/364 (56%), Gaps = 12/364 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S +   +GE++M   IGTPP+ ++  I DTGSDL W QCLPC +C+ Q +PI+NP  S
Sbjct: 79  IRSPIIPDSGEFLMSIFIGTPPV-NVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRS 137

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSY+++SC S+ C  L++  C    Q C+Y Y Y D S T G LA+++IT G+       
Sbjct: 138 SSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFK--LPK 195

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN-KFSYCLVPFHTDSSITSK 190
            V GCGH N G F     G++GLG   LSL SQ+ +  G   +FSYCL  F ++++IT  
Sbjct: 196 TVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGT 255

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG  + VSG  VVST LV +   T+YF+TLE ISVG      +       S   + GN
Sbjct: 256 ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGK----KRFKAANGISAMTNHGN 311

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAH 309
           + ID+G   TLLP+  Y  +   +   IK     DP    +LCY    +  +  PI+TAH
Sbjct: 312 IIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAH 371

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F GGA V L+  +TF  P  + V C    P    V IFGN AQ +  +GYD  ++ +SF+
Sbjct: 372 FAGGADVKLLPVNTF-APVADNVTCLTFAPAT-QVAIFGNLAQINFEVGYDLGNKRLSFE 429

Query: 370 PTDC 373
           P  C
Sbjct: 430 PKLC 433


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 202/358 (56%), Gaps = 38/358 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS +  + GEY+M   IGTPP+  +  IVDTGSDL W QC PC  CYKQV P+++P +S
Sbjct: 81  IQSRIVPSAGEYLMNLYIGTPPV-PVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNS 139

Query: 73  SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
           S+Y++ SC +  C  L    SCS ++ C + Y YAD S T G LA+E +T  ++      
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVS 199

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F    FGCGH++ G+F+++  G+VGLG   LSL SQ+ S +    FSYCL+P  TDSSI+
Sbjct: 200 FPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING-LFSYCLLPVSTDSSIS 258

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY--YNSSGAI 246
           S++ FG    VSG G VST L                           +PY  Y+    +
Sbjct: 259 SRINFGASGRVSGYGTVSTPL--------------------------RLPYKGYSKKTEV 292

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
            +GN+ +D+G   T LP++FY++LE+ V N+IK    +DP     LCY T +    API+
Sbjct: 293 EEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEIN-APII 351

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           TAHF   A V L   +TF+    E + CF + P   D+G+ GN AQ +  +G+D   +
Sbjct: 352 TAHFK-DANVELQPLNTFMRMQ-EDLVCFTVAPTS-DIGVLGNLAQVNFLVGFDLRKK 406



 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 85/155 (54%), Gaps = 10/155 (6%)

Query: 227 VGNLSNSSKLIPY-------YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
           +GNL+  + L+ +       ++    + +GN+ +D+G   T LP +FY +LEE V ++IK
Sbjct: 389 LGNLAQVNFLVGFDLRKKRGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIK 448

Query: 280 LTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP 339
               +DP   S LCY T      API+TAHF   A V L   +TF+    E + CF + P
Sbjct: 449 GKRVRDPNGISSLCYNTTVDQIDAPIITAHFK-DANVELQPWNTFLRMQ-EDLVCFTVLP 506

Query: 340 IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
              D+GI GN AQ +  +G+D   + VSFK  DCT
Sbjct: 507 TS-DIGILGNLAQVNFLVGFDLRKKRVSFKAADCT 540


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  240 bits (612), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 136/371 (36%), Positives = 211/371 (56%), Gaps = 18/371 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N V++ +    GEY+MK S+GTPP   I  + DTGSD++W QC+PC  CY+Q  P++NP+
Sbjct: 72  NTVEAPIYNNRGEYLMKLSVGTPPF-PIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPS 130

Query: 71  SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
            S++Y+++SC S  C    +  SCS +  C Y+  Y D+S ++G  A + +T G+++   
Sbjct: 131 KSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F     GCGH+N G F+ N  G+VGLG    SL  Q+ S +G  KFSYCL P   D  
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGG-KFSYCLTPIGNDDG 249

Query: 187 ITSKMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            ++K+ FG+ + VSG G VST + +S + K++Y + L+ +SVG  +       +Y+++ +
Sbjct: 250 GSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNT------FYSTANS 303

Query: 246 I--SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
           I   K N+ ID+G   TLLP D Y+   + + N+I L    DP    + C++T +     
Sbjct: 304 ILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKV 363

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFD 362
           P +  HF+ GA + L   +  I    + V C A     D D+ I+GN AQ +  +GYD  
Sbjct: 364 PFIAMHFE-GANLRLQRENVLIRVS-DNVICLAFAGAQDNDISIYGNIAQINFLVGYDVT 421

Query: 363 SQMVSFKPTDC 373
           +  +SFKP +C
Sbjct: 422 NMSLSFKPMNC 432


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 136/371 (36%), Positives = 210/371 (56%), Gaps = 18/371 (4%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N V++ +    GEY+MK S+GTPP   I  + DTGSD++W QC PC  CY+Q  P++NP+
Sbjct: 72  NTVEAPIYNNRGEYLMKLSVGTPPF-PIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPS 130

Query: 71  SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
            S++Y+++SC S  C    +  SCS +  C Y+  Y D+S ++G  A + +T G+++   
Sbjct: 131 KSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F     GCGH+N G F+ N  G+VGLG    SL  Q+ S +G  KFSYCL P   D  
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGG-KFSYCLTPIGNDDG 249

Query: 187 ITSKMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            ++K+ FG+ + VSG G VST + +S + K++Y + L+ +SVG  +       +Y+++ +
Sbjct: 250 GSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNT------FYSTANS 303

Query: 246 I--SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
           I   K N+ ID+G   TLLP D Y+   + + N+I L    DP    + C++T +     
Sbjct: 304 ILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKV 363

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFD 362
           P +  HF+ GA + L   +  I    + V C A     D D+ I+GN AQ +  +GYD  
Sbjct: 364 PFIAMHFE-GANLRLQRENVLIRVS-DNVICLAFAGAQDNDISIYGNIAQINFLVGYDVT 421

Query: 363 SQMVSFKPTDC 373
           +  +SFKP +C
Sbjct: 422 NMSLSFKPMNC 432


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 123/230 (53%), Positives = 160/230 (69%), Gaps = 6/230 (2%)

Query: 7   FYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
           F+  N +QS VS  + +Y+M+ SIGTPP+  IY   DTGSDL+W+QC+PC  CYKQ+ P+
Sbjct: 42  FFNRNTIQSPVSANHYDYLMELSIGTPPV-KIYAQADTGSDLIWLQCIPCTNCYKQLNPM 100

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNS 125
           ++  SSS++  ++C SE C  L + SCS  Q+ C Y Y Y D S T+GVLA E +T  ++
Sbjct: 101 FDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTST 160

Query: 126 NN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
                 F  V+FGCGHNN G FN+ EMG++GLGR  LSL SQI S LG N FS CLVPF+
Sbjct: 161 TGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFN 220

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLS 231
           T+ SI+S M FG GSEV G GVVST LVSK   +++YFVTL GISV +++
Sbjct: 221 TNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISVEDIN 270


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 158/384 (41%), Positives = 219/384 (57%), Gaps = 35/384 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS +  A+GE+ M  +IGTPP+  ++ I DTGSDL WVQC PC QCYK+  PI++   S
Sbjct: 74  LQSGLIGADGEFFMSITIGTPPM-KVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKS 132

Query: 73  SSYKELSCQSEQCHLLDTVS--C-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
           S+YK   C S  CH L +    C  S+ +C Y Y Y D S +KG +ATE I+  +++   
Sbjct: 133 STYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSP 192

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F   VFGCG+NN G F+E   G++GLG   LSL SQ+ S + + KFSYCL      ++
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI-SKKFSYCLSHKSATTN 251

Query: 187 ITSKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            TS +  G  S  S      GV+ST LV KE +TYY++TLE ISVG      K IPY  S
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGK-----KKIPYTGS 306

Query: 243 S-----GAI---SKGNMFIDTGAPPTLLPKDFYNR----LEEQVRNAIKLTPYQDPRLGS 290
           S     G I   + GN+ ID+G   TLL   F+++    +EE V  A +++   DP+   
Sbjct: 307 SYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVS---DPQGLL 363

Query: 291 QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
             C+K+ S     P +T HF  GA V L   + F+    E + C +M P   +V I+GNF
Sbjct: 364 SHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKVS-EDMVCLSMVPTT-EVAIYGNF 420

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
           AQ D  +GYD +++ VSF+  DC+
Sbjct: 421 AQMDFLVGYDLETRTVSFQRMDCS 444


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 157/384 (40%), Positives = 217/384 (56%), Gaps = 35/384 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS +  A+GE+ M  +IGTPP+  ++ I DTGSDL WVQC PC QCYK+  PI++   S
Sbjct: 74  LQSGLIGADGEFFMSITIGTPPI-KVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKS 132

Query: 73  SSYKELSCQSEQCHLLDTVS--C-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
           S+YK   C S  C  L +    C  S  +C Y Y Y D S +KG +ATE ++  +++   
Sbjct: 133 STYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSP 192

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F   VFGCG+NN G F+E   G++GLG   LSL SQ+ S + + KFSYCL      ++
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI-SKKFSYCLSHKSATTN 251

Query: 187 ITSKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            TS +  G  S  S      GVVST LV KE  TYY++TLE ISVG      K IPY  S
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK-----KKIPYTGS 306

Query: 243 S------GAISK--GNMFIDTGAPPTLLPKDFYNR----LEEQVRNAIKLTPYQDPRLGS 290
           S      G +S+  GN+ ID+G   TLL   F+++    +EE V  A +++   DP+   
Sbjct: 307 SYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVS---DPQGLL 363

Query: 291 QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
             C+K+ S     P +T HF  GA V L   + F+    E + C +M P   +V I+GNF
Sbjct: 364 SHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLS-EDMVCLSMVPTT-EVAIYGNF 420

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
           AQ D  +GYD +++ VSF+  DC+
Sbjct: 421 AQMDFLVGYDLETRTVSFQHMDCS 444


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 150/381 (39%), Positives = 208/381 (54%), Gaps = 51/381 (13%)

Query: 7   FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
           F PN +    +S+  G  YVM +SIGTPP   +Y ++DTG+D +W QC PC  C  Q  P
Sbjct: 72  FSPNKIQDVPLSSFMGAGYVMSYSIGTPPF-QLYSLIDTGNDNIWFQCKPCKPCLNQTSP 130

Query: 66  IYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
           +++P+ SS+YK + C S  C   D              G+         L  + +T  NS
Sbjct: 131 MFHPSKSSTYKTIPCTSPICKNAD--------------GH--------YLGVDTLTL-NS 167

Query: 126 NN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
           NN     F N+V GCGH N G       G +GL R  LS  SQ+ S +G  KFSYCLVP 
Sbjct: 168 NNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGG-KFSYCLVPL 226

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
            +  +++SK++FG+ S VSG G VST +   +++  YFV+LE  SVG+      +I   N
Sbjct: 227 FSKENVSSKLHFGDKSTVSGLGTVSTPI---KEENGYFVSLEAFSVGD-----HIIKLEN 278

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           S    ++GN  ID+G   T+LPKD Y+RLE  V + +KL   +DP     LCY+T S   
Sbjct: 279 SD---NRGNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTL 335

Query: 302 IAPIL--TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG----DVGIFGNFAQSDL 355
           +  +L  TAHF  G++V L   +TF P   E V CFA   + G     + IFGN  Q + 
Sbjct: 336 LTKVLIITAHFS-GSEVHLNALNTFYPITDE-VICFAF--VSGGNFSSLAIFGNVVQQNF 391

Query: 356 FIGYDFDSQMVSFKPTDCTKQ 376
            +G+D + + +SFKPTDCTK 
Sbjct: 392 LVGFDLNKKTISFKPTDCTKH 412


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  223 bits (569), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 142/377 (37%), Positives = 210/377 (55%), Gaps = 28/377 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N + +S +    GEY+M+F IG+PP+ +   +VDTGS L+W+QC PC  C+ Q  P++ P
Sbjct: 75  NKLPESLLIPDKGEYLMRFYIGSPPV-ERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEP 133

Query: 70  ASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
             SS+YK  +C S+ C LL      C     C Y   Y D S + G+L TE ++FG++  
Sbjct: 134 LKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGG 193

Query: 128 F----FDNVVFGCG-HNNTGVFNENE-MGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
                F N +FGCG  NN  ++  N+ MG+ GLG   LSL SQ+ +Q+G +KFSYCL+P+
Sbjct: 194 AQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG-HKFSYCLLPY 252

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYY 240
             DS+ TSK+ FG+ + ++  GVVST L+ K    TYYF+ LE +++G      K++   
Sbjct: 253 --DSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQ-----KVV--- 302

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
             S   + GN+ ID+G P T L   FYN     ++  + +   QD     + C+  P+ A
Sbjct: 303 --STGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCF--PNRA 358

Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIG 358
            +A P +   F  GA V L   +  IP     + C A+ P  G  + +FG+ AQ D  + 
Sbjct: 359 NLAIPDIAFQFT-GASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVE 417

Query: 359 YDFDSQMVSFKPTDCTK 375
           YD + + VSF PTDC K
Sbjct: 418 YDLEGKKVSFAPTDCAK 434


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 135/359 (37%), Positives = 198/359 (55%), Gaps = 15/359 (4%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY+++ S+GTPP   I  + DTGSD++W QC PC  CY+Q  P+++P+ S++YK ++C 
Sbjct: 81  GEYLVEISVGTPPF-SIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACS 139

Query: 82  SEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCG 137
           S  C +  D  SCS    C Y+  Y D S ++G LA + +T  +++     F   V GCG
Sbjct: 140 SPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCG 199

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-ITSKMYFGNG 196
           H+N G FN N  G+VGLGR   SL +Q+    G  KFSYCL+P  T S+  ++K+ FG+ 
Sbjct: 200 HDNAGTFNANVSGIVGLGRGPASLVTQLGPATGG-KFSYCLIPIGTGSTNDSTKLNFGSN 258

Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           + VSG G VST + S    KT+Y + LE +SVG+    +K      +S    + N+ ID+
Sbjct: 259 ANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGD----TKFNFPEGASKLGGESNIIIDS 314

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP    N     +  ++ L   QDP      C+ T +     P +T HF+ GA 
Sbjct: 315 GTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPPVTMHFE-GAD 373

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           VPL   + F+    +   C A     D ++ I+GN AQS+  +GYD  +  VSF+P  C
Sbjct: 374 VPLQRENLFVRLS-DDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 141/368 (38%), Positives = 196/368 (53%), Gaps = 24/368 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+++V   +GEY+M  SIGTP       I+DTGSDL+W QC PC QC+ Q  PI+NP  S
Sbjct: 84  VETSVYAGDGEYLMNLSIGTPAQ-PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGS 142

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  L C S+ C  L + +CS+   C YTYGY D S T+G + TE +TFG+ +    N+
Sbjct: 143 SSFSTLPCSSQLCQALSSPTCSN-NFCQYTYGYGDGSETQGSMGTETLTFGSVS--IPNI 199

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG NN G    N  GLVG+GR  LSL     SQL   KFSYC+ P    SS  S + 
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLP----SQLDVTKFSYCMTPI--GSSTPSNLL 253

Query: 193 FGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK--- 248
            G+  + V+ G   +T + S +  T+Y++TL G+SVG     S  +P   S+ A++    
Sbjct: 254 LGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG-----STRLPIDPSAFALNSNNG 308

Query: 249 -GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPI 305
            G + ID+G   T    + Y  + ++  + I L        G  LC++TPS       P 
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPT 368

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
              HFDGG  + L   + FI P   G+ C AM      + IFGN  Q ++ + YD  + +
Sbjct: 369 FVMHFDGG-DLELPSENYFISPS-NGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSV 426

Query: 366 VSFKPTDC 373
           VSF    C
Sbjct: 427 VSFASAQC 434


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 144/369 (39%), Positives = 196/369 (53%), Gaps = 27/369 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +++ V   +GEY+M  +IGTP    +  I+DTGSDL+W QC PC QC+ Q  PI+NP  S
Sbjct: 85  IETPVYAGSGEYLMNVAIGTPAS-SLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDS 143

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  L C+S+ C  L + SC +   C YTYGY D S T+G +ATE  TF  S+    N+
Sbjct: 144 SSFSTLPCESQYCQDLPSESCYND--CQYTYGYGDGSSTQGYMATETFTFETSS--VPNI 199

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG +N G    N  GL+G+G   LSL     SQLG  +FSYC+    + S  T  + 
Sbjct: 200 AFGCGEDNQGFGQGNGAGLIGMGWGPLSLP----SQLGVGQFSYCMTSSGSSSPSTLAL- 254

Query: 193 FGNGSEVSG--GGVVSTSLV-SKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAIS 247
              GS  SG   G  ST+L+ S  + TYY++TL+GI+VG  NL   S      +      
Sbjct: 255 ---GSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDD----G 307

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--API 305
            G M ID+G   T LP+D YN + +   + I L+P  +   G   C++ PS       P 
Sbjct: 308 TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPE 367

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           ++  FDGG  V  +     +  P EGV C AM       + IFGN  Q +  + YD  + 
Sbjct: 368 ISMQFDGG--VLNLGEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNL 425

Query: 365 MVSFKPTDC 373
            VSF PT C
Sbjct: 426 AVSFVPTQC 434


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 140/367 (38%), Positives = 195/367 (53%), Gaps = 22/367 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +++ V   +GEY+M  +IGTP       I+DTGSDL+W QC PC QC+ Q  PI+NP  S
Sbjct: 85  IETPVYAGDGEYLMNVAIGTPDS-SFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDS 143

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  L C+S+ C  L + +C++ + C YTYGY D S T+G +ATE  TF  S+    N+
Sbjct: 144 SSFSTLPCESQYCQDLPSETCNNNE-CQYTYGYGDGSTTQGYMATETFTFETSS--VPNI 200

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG +N G    N  GL+G+G   LSL     SQLG  +FSYC+  +   SS  S + 
Sbjct: 201 AFGCGEDNQGFGQGNGAGLIGMGWGPLSLP----SQLGVGQFSYCMTSY--GSSSPSTLA 254

Query: 193 FGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKG 249
            G+ +     G  ST+L+ S  + TYY++TL+GI+VG  NL   S      +       G
Sbjct: 255 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDD----GTG 310

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--APILT 307
            M ID+G   T LP+D YN + +   + I L    +   G   C++ PS       P ++
Sbjct: 311 GMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEIS 370

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGYDFDSQMV 366
             FDGG  V  +     +  P EGV C AM       + IFGN  Q +  + YD  +  V
Sbjct: 371 MQFDGG--VLNLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAV 428

Query: 367 SFKPTDC 373
           SF PT C
Sbjct: 429 SFVPTQC 435


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 206/383 (53%), Gaps = 32/383 (8%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P    +  V+ + GEY+M  +IGTPPL     +VDTGSDL+W QC PCV C  Q  P + 
Sbjct: 77  PITAARILVAASQGEYLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFR 135

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
           PA S++Y+ + C+S  C  L   +C  + +C Y Y Y D + T GVLA+E  TFG +N+ 
Sbjct: 136 PARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSS 195

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
                +V FGCG+ N+G    N  G+VGLGR  LSL    +SQLG ++FSYCL  F +  
Sbjct: 196 KVMVSDVAFGCGNINSGQL-ANSSGMVGLGRGPLSL----VSQLGPSRFSYCLTSFLSPE 250

Query: 186 SITSKMYFG-----NGSEVSGGG--VVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLI 237
              S++ FG     NG+  S  G  V ST LV      + YF++L+GIS+G      K +
Sbjct: 251 P--SRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQ-----KRL 303

Query: 238 PYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK-LTPYQDPRLGSQLC 293
           P      AI+    G +FID+G   T L +D Y+ +  ++ + ++ L P  D  +G + C
Sbjct: 304 PIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETC 363

Query: 294 Y---KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
           +     PS+A   P +  HFDGGA + +   +  +     G  C AM    GD  I GN+
Sbjct: 364 FPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATIIGNY 422

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            Q ++ I YD  + ++SF P  C
Sbjct: 423 QQQNMHILYDIANSLLSFVPAPC 445


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 206/383 (53%), Gaps = 32/383 (8%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P    +  V+ + GEY+M  +IGTPPL     +VDTGSDL+W QC PCV C  Q  P + 
Sbjct: 77  PITAARILVAASQGEYLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFR 135

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
           PA S++Y+ + C+S  C  L   +C  + +C Y Y Y D + T GVLA+E  TFG +N+ 
Sbjct: 136 PARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSS 195

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
                +V FGCG+ N+G    N  G+VGLGR  LSL    +SQLG ++FSYCL  F +  
Sbjct: 196 KVMVSDVAFGCGNINSGQL-ANSSGMVGLGRGPLSL----VSQLGPSRFSYCLTSFLSPE 250

Query: 186 SITSKMYFG-----NGSEVSGGG--VVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLI 237
              S++ FG     NG+  S  G  V ST LV      + YF++L+GIS+G      K +
Sbjct: 251 P--SRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQ-----KRL 303

Query: 238 PYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK-LTPYQDPRLGSQLC 293
           P      AI+    G +FID+G   T L +D Y+ +  ++ + ++ L P  D  +G + C
Sbjct: 304 PIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETC 363

Query: 294 Y---KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
           +     PS+A   P +  HFDGGA + +   +  +     G  C AM    GD  I GN+
Sbjct: 364 FPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATIIGNY 422

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            Q ++ I YD  + ++SF P  C
Sbjct: 423 QQQNMHILYDIANSLLSFVPAPC 445


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  218 bits (554), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 149/382 (39%), Positives = 204/382 (53%), Gaps = 33/382 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS + +  GEY M  SIGTPP      I DTGSDL WVQC PC QCYKQ  P+++   S
Sbjct: 74  LQSGLISNGGEYFMSISIGTPPS-KFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKS 132

Query: 73  SSYKELSCQSEQCHLLD--TVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
           S+YK  SC S  C+ L      C  S+  C Y Y Y D S TKG +ATE I+  +S+   
Sbjct: 133 STYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSP 192

Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F    FGCG+NN G F E   G++GLG   LSL SQ+ S +G  KFSYCL      ++
Sbjct: 193 VSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIG-KKFSYCLSHTSATTN 251

Query: 187 ITSKMYFGNGSEVS----GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            TS +  G  S  S       +++T L+ K+ +TYYF+TLE I+VG        +PY   
Sbjct: 252 GTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTK-----LPYTGG 306

Query: 243 SG------AISKGNMFIDTGAPPTLLPKDFYNR----LEEQVRNAIKLTPYQDPRLGSQL 292
            G      +   GN+ ID+G   TLL   FY+     +EE V  A +++   DP+     
Sbjct: 307 GGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVS---DPQGILTH 363

Query: 293 CYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           C+K+       P +T HF  GA V L   ++F+    E + C +M P   +V I+GN  Q
Sbjct: 364 CFKSGDKEIGLPTITMHFT-GADVKLSPINSFVKLS-EDIVCLSMIPTT-EVAIYGNMVQ 420

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            D  +GYD +++ VSF+  DC+
Sbjct: 421 MDFLVGYDLETKTVSFQRMDCS 442


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  217 bits (552), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 136/366 (37%), Positives = 193/366 (52%), Gaps = 20/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V++ V   +GEY+M  SIGTP       I+DTGSDL+W QC PC QC+ Q  PI+NP  S
Sbjct: 84  VETPVYAGDGEYLMNLSIGTPAQ-PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGS 142

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  L C S+ C  L + +CS+   C YTYGY D S T+G + TE +TFG+ +    N+
Sbjct: 143 SSFSTLPCSSQLCQALQSPTCSNNS-CQYTYGYGDGSETQGSMGTETLTFGSVS--IPNI 199

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG NN G    N  GLVG+GR  LSL     SQL   KFSYC+ P    SS +S + 
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLP----SQLDVTKFSYCMTPI--GSSTSSTLL 253

Query: 193 FGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKG 249
            G+  + V+ G   +T + S +  T+Y++TL G+SVG+  L     +    +++G    G
Sbjct: 254 LGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT---G 310

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPILT 307
            + ID+G   T    + Y  + +   + + L+       G  LC++ PS       P   
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFV 370

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HFDGG  V  + +  +   P  G+ C AM      + IFGN  Q +L + YD  + +VS
Sbjct: 371 MHFDGGDLV--LPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428

Query: 368 FKPTDC 373
           F    C
Sbjct: 429 FLFAQC 434


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 136/366 (37%), Positives = 193/366 (52%), Gaps = 20/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V++ V   +GEY+M  SIGTP       I+DTGSDL+W QC PC QC+ Q  PI+NP  S
Sbjct: 84  VETPVYAGDGEYLMNLSIGTPAQ-PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGS 142

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  L C S+ C  L + +CS+   C YTYGY D S T+G + TE +TFG+ +    N+
Sbjct: 143 SSFSTLPCSSQLCQALQSPTCSNNS-CQYTYGYGDGSETQGSMGTETLTFGSVS--IPNI 199

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG NN G    N  GLVG+GR  LSL     SQL   KFSYC+ P    SS +S + 
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLP----SQLDVTKFSYCMTPI--GSSNSSTLL 253

Query: 193 FGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKG 249
            G+  + V+ G   +T + S +  T+Y++TL G+SVG+  L     +    +++G    G
Sbjct: 254 LGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT---G 310

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPILT 307
            + ID+G   T    + Y  + +   + + L+       G  LC++ PS       P   
Sbjct: 311 GIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFV 370

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HFDGG  V  + +  +   P  G+ C AM      + IFGN  Q +L + YD  + +VS
Sbjct: 371 MHFDGGDLV--LPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428

Query: 368 FKPTDC 373
           F    C
Sbjct: 429 FLSAQC 434


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 153/394 (38%), Positives = 211/394 (53%), Gaps = 33/394 (8%)

Query: 1   MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY 60
           +S +  F     +QS + +  GEY M  SIGTPP   ++ I DTGSDL WVQC PC QCY
Sbjct: 62  ISRSRRFTTKTDLQSGLISNGGEYFMSISIGTPPS-KVFAIADTGSDLTWVQCKPCQQCY 120

Query: 61  KQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--C-SSQQLCNYTYGYADSSLTKGVLAT 117
           KQ  P+++   SS+YK  SC S+ C  L      C  S+ +C Y Y Y D+S TKG +AT
Sbjct: 121 KQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVAT 180

Query: 118 ERI---TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           E I   +   S+  F   VFGCG+NN G F E   G++GLG   LSL SQ+ S +G  KF
Sbjct: 181 ETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIG-KKF 239

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNL 230
           SYCL      ++ TS +  G  S  S        ++T L+ K+ +TYYF+TLE ++VG  
Sbjct: 240 SYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKT 299

Query: 231 SNSSKLIPY----YNSSGAISK--GNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKL 280
                 +PY    Y  +G  SK  GN+ ID+G   TLL   FY+     +EE V  A ++
Sbjct: 300 K-----LPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV 354

Query: 281 TPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI 340
           +   DP+     C+K+       P +T HF   A V L   + F+    E   C +M P 
Sbjct: 355 S---DPQGLLTHCFKSGDKEIGLPAITMHFT-NADVKLSPINAFVKLN-EDTVCLSMIPT 409

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             +V I+GN  Q D  +GYD +++ VSF+  DC+
Sbjct: 410 T-EVAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 140/377 (37%), Positives = 194/377 (51%), Gaps = 30/377 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q  V   NGE++M  SIGTP L     IVDTGSDL+W QC PCV+C+ Q  P+++P+SS
Sbjct: 107 LQVPVHAGNGEFLMDMSIGTPALA-YAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSS 165

Query: 73  SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           S+Y  L C S  C  L T +C S+ + C YTY Y D+S T+GVLA E  T   +      
Sbjct: 166 STYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK--LPG 223

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG  N G       GLVGLGR  LSL    +SQLG  KFSYCL     D +  S +
Sbjct: 224 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGLGKFSYCLTSL--DDTSKSPL 277

Query: 192 YFGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             G+ + +     S   + +T L+    + ++Y+VTL+ ++VG     S  IP   S+ A
Sbjct: 278 LLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVG-----STRIPLPGSAFA 332

Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
           +     G + +D+G   T L    Y  L++     +KL       +G  LC+K P+ +G+
Sbjct: 333 VQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPA-SGV 391

Query: 303 ----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
                P L  HFDGGA + L   +  +     G  C  +    G + I GNF Q ++   
Sbjct: 392 DDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRG-LSIIGNFQQQNIQFV 450

Query: 359 YDFDSQMVSFKPTDCTK 375
           YD D   +SF P  C K
Sbjct: 451 YDVDKDTLSFAPVQCAK 467


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 138/370 (37%), Positives = 197/370 (53%), Gaps = 27/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V++ V   NGE++MK +IGTP   + Y  I+DTGSDL+W QC PC  C+ Q  PI++P  
Sbjct: 86  VEAPVHAGNGEFLMKLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKK 143

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSS+ +L C S+ C  L   SCS    C Y Y Y D S T+GVLATE   FG+++     
Sbjct: 144 SSSFSKLPCSSDLCAALPISSCSDG--CEYLYSYGDYSSTQGVLATETFAFGDAS--VSK 199

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           + FGCG +N G       GLVGLGR  LSL    +SQLG  KFSYCL        I+S +
Sbjct: 200 IGFGCGEDNDGSGFSQGAGLVGLGRGPLSL----ISQLGEPKFSYCLTSMDDSKGISSLL 255

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS--- 247
               GSE +    ++T L+    + ++Y+++LEGISVG+      L+P   S+ +I    
Sbjct: 256 V---GSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGD-----TLLPIEKSTFSIQNDG 307

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--API 305
            G + ID+G   T L    +  L+++  + +KL   +    G  LC+  P  A     P 
Sbjct: 308 SGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQ 367

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           L  HF+ GA + L   +  I     GV C  M    G + IFGNF Q ++ + +D + + 
Sbjct: 368 LVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425

Query: 366 VSFKPTDCTK 375
           +SF P  C +
Sbjct: 426 ISFAPAQCNQ 435


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 141/363 (38%), Positives = 193/363 (53%), Gaps = 24/363 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+M+F IGTPP+ + + I DTGSDL+WVQC PC +C  Q  P+++P  SS++K + C S
Sbjct: 91  EYLMRFYIGTPPV-ERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDS 149

Query: 83  EQCHLL--DTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNF--FDNVVFGCG 137
           + C LL     +C  +   C Y Y Y D +L  G+L  E I FG+ NN   F  + FGC 
Sbjct: 150 QPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCT 209

Query: 138 HNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
            +N    +E++  MGLVGLG   LSL SQ+  Q+G  KFSYC  P  ++S  TSKM FGN
Sbjct: 210 FSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIG-RKFSYCFPPLSSNS--TSKMRFGN 266

Query: 196 GSEVSG-GGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            + V    GVVST L+ K    +YY++ LEG+S+GN            +S + + GN+ I
Sbjct: 267 DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKK--------VKTSESQTDGNILI 318

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
           D+G   T+L + FYN+    V+    +   + P L    C++        P +   F  G
Sbjct: 319 DSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDVVFLFT-G 377

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           AKV  +  S         + C    P  D D  IFGN AQ    + YD    MVSF P D
Sbjct: 378 AKV-RVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPAD 436

Query: 373 CTK 375
           C K
Sbjct: 437 CAK 439


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 143/378 (37%), Positives = 209/378 (55%), Gaps = 29/378 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N + QS +   NGEY+M+F IGTPP+ +     DTGSDL+WVQC PC  C+ Q  P++ P
Sbjct: 76  NKLPQSVLILHNGEYLMRFYIGTPPV-ERLATADTGSDLIWVQCSPCASCFPQSTPLFQP 134

Query: 70  ASSSSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADS-SLTKGVLATERITF---- 122
             SS++   +C+S+ C LL  +   C     C YTY Y D  S ++G+L+TE + F    
Sbjct: 135 LKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQG 194

Query: 123 GNSNNFFDNVVFGCG-HNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           G     F N  FGCG +NN  VF   ++ G++GLG   LSL SQI  Q+G +KFSYCL+P
Sbjct: 195 GVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIG-HKFSYCLLP 253

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPY 239
             + S  TSK+ FGN S ++G GVVST ++ K    TYYF+ LE ++V       K +P 
Sbjct: 254 LGSTS--TSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQ-----KTVP- 305

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
              +G+ + GN+ ID+G   T L + FY      ++ ++ +   QD       C+     
Sbjct: 306 ---TGS-TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYRDN 361

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFI 357
             + P +   F  GA+V L   + F+        C  + P  + G + IFG+F+Q D  +
Sbjct: 362 F-VFPEIAFQFT-GARVSLKPANLFVMTEDRNTVCLMIAPSSVSG-ISIFGSFSQIDFQV 418

Query: 358 GYDFDSQMVSFKPTDCTK 375
            YD + + VSF+PTDC+K
Sbjct: 419 EYDLEGKKVSFQPTDCSK 436


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 139/376 (36%), Positives = 187/376 (49%), Gaps = 26/376 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q  V   NGE++M  +IGTP L     IVDTGSDL+W QC PCV C+KQ  P+++P+SS
Sbjct: 89  LQVPVHAGNGEFLMDVAIGTPAL-SYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 147

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  + C S  C  L T +C+S   C YTY Y D+S T+GVLA+E  T G        V
Sbjct: 148 STYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGV 207

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G       GLVGLGR  LSL    +SQLG +KFSYCL     D    S + 
Sbjct: 208 AFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSLD-DGDGKSPLL 262

Query: 193 FGNGSEVSGGG-----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            G  +           V +T LV    + ++Y+V+L G++VG     S  I    S+ AI
Sbjct: 263 LGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVG-----STRITLPASAFAI 317

Query: 247 SK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI- 302
                G + +D+G   T L    Y  L++     + L       +G  LC++ P+  G+ 
Sbjct: 318 QDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPA-KGVD 376

Query: 303 ---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P L  HFDGGA + L   +  +     G  C  + P  G + I GNF Q +    Y
Sbjct: 377 EVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRG-LSIIGNFQQQNFQFVY 435

Query: 360 DFDSQMVSFKPTDCTK 375
           D     +SF P  C K
Sbjct: 436 DVAGDTLSFAPVQCNK 451


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 140/362 (38%), Positives = 194/362 (53%), Gaps = 20/362 (5%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +GEY+M+F IGTPP+ +   I DT SDL+WVQC PC  C+ Q  P++ P  SS++  LSC
Sbjct: 87  HGEYLMRFYIGTPPV-ERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSC 145

Query: 81  QSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            S+ C   +   C     LC YT  Y D S TKGVL TE I FG+    F   +FGCG N
Sbjct: 146 DSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSN 205

Query: 140 NTGV--FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           N  +   +    G+VGLG   LSL SQ+  Q+G +KFSYCL+PF + S+I  K+ FGN +
Sbjct: 206 NDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIG-HKFSYCLLPFTSTSTI--KLKFGNDT 262

Query: 198 EVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
            ++G GVVST L +     +YYF+ L GI++G      +   + N       GN+ ID G
Sbjct: 263 TITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTN-------GNIIIDLG 315

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAK 315
              T L  +FY+     +R A+ ++  +D  +     +  P+ A I  P +   F  GAK
Sbjct: 316 TVLTYLEVNFYHNFVTLLREALGISETKD-DIPYPFDFCFPNQANITFPKIVFQFT-GAK 373

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           V L   + F       + C A+ P        +FGN AQ D  + YD   + VSF P DC
Sbjct: 374 VFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADC 433

Query: 374 TK 375
           +K
Sbjct: 434 SK 435


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 138/378 (36%), Positives = 203/378 (53%), Gaps = 19/378 (5%)

Query: 4   ATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
           A+     + +++ +   NGEY+M+ +IGTPP+     ++DTGSDL+W QC PC QCYKQ 
Sbjct: 88  ASTLDSEDQLEAPIHAGNGEYLMELAIGTPPV-SYPAVLDTGSDLIWTQCKPCTQCYKQP 146

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
            PI++P  SSS+ ++SC S  C  + + +CS    C Y Y Y D S+T+GVLATE  TFG
Sbjct: 147 TPIFDPKKSSSFSKVSCGSSLCSAVPSSTCSDG--CEYVYSYGDYSMTQGVLATETFTFG 204

Query: 124 NSNNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
            S N     N+ FGCG +N G   E   GLVGLGR  LSL    +SQL   +FSYCL P 
Sbjct: 205 KSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSL----VSQLKEPRFSYCLTPM 260

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPY 239
             D +  S +  G+  +V     V T+ + K     ++Y+++LEGISVG+   S +   +
Sbjct: 261 --DDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTF 318

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
               G    G + ID+G   T + +  +  L+++  +  KL   +    G  LC+  PS 
Sbjct: 319 --EVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSG 376

Query: 300 AGIA--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
           +     P +  HF GG  + L   +  I     GV C AM    G + IFGN  Q ++ +
Sbjct: 377 STQVEIPKIVFHFKGG-DLELPAENYMIGDSNLGVACLAMGASSG-MSIFGNVQQQNILV 434

Query: 358 GYDFDSQMVSFKPTDCTK 375
            +D + + +SF PT C +
Sbjct: 435 NHDLEKETISFVPTSCDQ 452


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 137/369 (37%), Positives = 199/369 (53%), Gaps = 19/369 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +++ +   NGEY+++ +IGTPP+     ++DTGSDL+W QC PC +CYKQ  PI++P  S
Sbjct: 97  LEAPIHAGNGEYLIELAIGTPPV-SYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKS 155

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD-- 130
           SS+ ++SC S  C  L + +CS    C Y Y Y D S+T+GVLATE  TFG S N     
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSDG--CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVH 213

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           N+ FGCG +N G   E   GLVGLGR  LSL SQ+  Q    +FSYCL P   D +  S 
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQ----RFSYCLTPI--DDTKESV 267

Query: 191 MYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
           +  G+  +V     V T+ + K     ++Y+++LE ISVG+   S +   +    G    
Sbjct: 268 LLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTF--EVGDDGN 325

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA--PIL 306
           G + ID+G   T + +  Y  L+++  +  KL   +    G  LC+  PS +     P L
Sbjct: 326 GGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKL 385

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
             HF GG  + L   +  I     GV C AM    G + IFGN  Q ++ + +D + + +
Sbjct: 386 VFHFKGG-DLELPAENYMIGDSNLGVACLAMGASSG-MSIFGNVQQQNILVNHDLEKETI 443

Query: 367 SFKPTDCTK 375
           SF PT C +
Sbjct: 444 SFVPTSCDQ 452


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  207 bits (528), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 140/376 (37%), Positives = 191/376 (50%), Gaps = 29/376 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q  V   NGE++M  SIGTP L     IVDTGSDL+W QC PCV C+KQ  P+++P+SS
Sbjct: 84  LQVPVHAGNGEFLMDVSIGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 142

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  + C S  C  L T  C+S   C YTY Y DSS T+GVLATE  T   S      V
Sbjct: 143 STYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGV 200

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           VFGCG  N G       GLVGLGR  LSL    +SQLG +KFSYCL     D +  S + 
Sbjct: 201 VFGCGDTNEGDGFSQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLL 254

Query: 193 FGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            G+ + +     +   V +T L+    + ++Y+V+L+ I+VG     S  I   +S+ A+
Sbjct: 255 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAV 309

Query: 247 SK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI- 302
                G + +D+G   T L    Y  L++     + L       +G  LC++ P+  G+ 
Sbjct: 310 QDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPA-KGVD 368

Query: 303 ---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P L  HFDGGA + L   +  +     G  C  +    G + I GNF Q +    Y
Sbjct: 369 QVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVY 427

Query: 360 DFDSQMVSFKPTDCTK 375
           D     +SF P  C K
Sbjct: 428 DVGHDTLSFAPVQCNK 443


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 140/376 (37%), Positives = 191/376 (50%), Gaps = 29/376 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q  V   NGE++M  SIGTP L     IVDTGSDL+W QC PCV C+KQ  P+++P+SS
Sbjct: 94  LQVPVHAGNGEFLMDVSIGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 152

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  + C S  C  L T  C+S   C YTY Y DSS T+GVLATE  T   S      V
Sbjct: 153 STYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGV 210

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           VFGCG  N G       GLVGLGR  LSL    +SQLG +KFSYCL     D +  S + 
Sbjct: 211 VFGCGDTNEGDGFSQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLL 264

Query: 193 FGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            G+ + +     +   V +T L+    + ++Y+V+L+ I+VG     S  I   +S+ A+
Sbjct: 265 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAV 319

Query: 247 SK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI- 302
                G + +D+G   T L    Y  L++     + L       +G  LC++ P+  G+ 
Sbjct: 320 QDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPA-KGVD 378

Query: 303 ---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P L  HFDGGA + L   +  +     G  C  +    G + I GNF Q +    Y
Sbjct: 379 QVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVY 437

Query: 360 DFDSQMVSFKPTDCTK 375
           D     +SF P  C K
Sbjct: 438 DVGHDTLSFAPVQCNK 453


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 139/372 (37%), Positives = 189/372 (50%), Gaps = 29/372 (7%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V   NGE++M  SIGTP L     IVDTGSDL+W QC PCV C+KQ  P+++P+SSS+Y 
Sbjct: 67  VHAGNGEFLMDVSIGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYA 125

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            + C S  C  L T  C+S   C YTY Y DSS T+GVLATE  T   S      VVFGC
Sbjct: 126 TVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGVVFGC 183

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G  N G       GLVGLGR  LSL    +SQLG +KFSYCL     D +  S +  G+ 
Sbjct: 184 GDTNEGDGFSQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLLLGSL 237

Query: 197 SEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
           + +     +   V +T L+    + ++Y+V+L+ I+VG     S  I   +S+ A+    
Sbjct: 238 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAVQDDG 292

Query: 249 -GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI----A 303
            G + +D+G   T L    Y  L++     + L       +G  LC++ P+  G+     
Sbjct: 293 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPA-KGVDQVEV 351

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P L  HFDGGA + L   +  +     G  C  +    G + I GNF Q +    YD   
Sbjct: 352 PRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGH 410

Query: 364 QMVSFKPTDCTK 375
             +SF P  C K
Sbjct: 411 DTLSFAPVQCNK 422


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 131/359 (36%), Positives = 193/359 (53%), Gaps = 38/359 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+MK  IGTPP  +I  ++DTGS+ +W QCLPCV CY Q  PI++P+ SS++KE+ C +
Sbjct: 64  EYLMKLQIGTPPF-EIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 122

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHN 139
                           C Y   Y   S TKG L TE +T  +++         + GCG N
Sbjct: 123 H------------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRN 170

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N+G F     G+VGL R   SL +Q+  +      SYC          TSK+ FG  + V
Sbjct: 171 NSG-FKPGFAGVVGLDRGPKSLITQMGGEY-PGLMSYCFA-----GKGTSKINFGANAIV 223

Query: 200 SGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           +G GVVST++  K  K  +Y++ L+ +SVGN    +   P++       KGN+ ID+G+ 
Sbjct: 224 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH-----ALKGNIVIDSGST 278

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPL 318
            T  P+ + N + + V   +  T  + PR    LCY + ++  I P++T HF GGA + L
Sbjct: 279 LTYFPESYCNLVRKAVEQVV--TAVRFPR-SDILCYYSKTI-DIFPVITMHFSGGADLVL 334

Query: 319 IHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
              + ++     GVFC A+    PI+    IFGN AQ++  +GYD  S +VSFKPT+C+
Sbjct: 335 DKYNMYVASNTGGVFCLAIICNSPIEE--AIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 133/370 (35%), Positives = 193/370 (52%), Gaps = 27/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V++ V   NGE++M  +IGTP   + Y  I+DTGSDL+W QC PC  C+ Q  PI++P  
Sbjct: 86  VEAPVHAGNGEFLMNLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSS+ +L C S+ C  L   SCS    C Y Y Y D S T+GVLATE  TFG+++     
Sbjct: 144 SSSFSKLPCSSDLCVALPISSCSDG--CEYRYSYGDHSSTQGVLATETFTFGDAS--VSK 199

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           + FGCG +N G       GLVGLGR  LSL    +SQLG  KFSYCL        I++ +
Sbjct: 200 IGFGCGEDNRGRAYSQGAGLVGLGRGPLSL----ISQLGVPKFSYCLTSIDDSKGISTLL 255

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS--- 247
               GSE +    + T L+    + ++Y+++LEGISVG+      L+P   S+ +I    
Sbjct: 256 V---GSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGD-----TLLPIEKSTFSIQDDG 307

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPI 305
            G + ID+G   T L  + +  L+++  + +KL          +LC+  P        P 
Sbjct: 308 SGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQ 367

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           L  HF+ G  + L   +  I      V C  M    G + IFGNF Q ++ + +D + + 
Sbjct: 368 LVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425

Query: 366 VSFKPTDCTK 375
           +SF P  C +
Sbjct: 426 ISFAPAQCNQ 435


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 131/359 (36%), Positives = 193/359 (53%), Gaps = 38/359 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+MK  IGTPP  +I  ++DTGS+ +W QCLPCV CY Q  PI++P+ SS++KE+ C +
Sbjct: 58  EYLMKLQIGTPPF-EIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 116

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHN 139
                           C Y   Y   S TKG L TE +T  +++         + GCG N
Sbjct: 117 H------------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRN 164

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N+G F     G+VGL R   SL +Q+  +      SYC          TSK+ FG  + V
Sbjct: 165 NSG-FKPGFAGVVGLDRGPKSLITQMGGEY-PGLMSYCFA-----GKGTSKINFGANAIV 217

Query: 200 SGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           +G GVVST++  K  K  +Y++ L+ +SVGN    +   P++       KGN+ ID+G+ 
Sbjct: 218 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH-----ALKGNIVIDSGST 272

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPL 318
            T  P+ + N + + V   +  T  + PR    LCY + ++  I P++T HF GGA + L
Sbjct: 273 LTYFPESYCNLVRKAVEQVV--TAVRFPR-SDILCYYSKTI-DIFPVITMHFSGGADLVL 328

Query: 319 IHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
              + ++     GVFC A+    PI+    IFGN AQ++  +GYD  S +VSFKPT+C+
Sbjct: 329 DKYNMYVASNTGGVFCLAIICNSPIEE--AIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 128/361 (35%), Positives = 193/361 (53%), Gaps = 37/361 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N  Y+MK  +GTPP  +I  ++DTGS++ W QCLPCV CYKQ  PI++P+ SS++KE   
Sbjct: 377 NSVYLMKLQVGTPPF-EIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKE--- 432

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCG 137
             ++CH            C Y   Y D + TKG LAT+ +T  +++         + GCG
Sbjct: 433 --KRCH---------DHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCG 481

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            NN+  F  +  G VGL    LSL +Q+  +      SYC        + TSK+ FG  +
Sbjct: 482 RNNSW-FRPSFEGFVGLNWGPLSLITQMGGEY-PGLMSYCFA-----GNGTSKINFGTNA 534

Query: 198 EVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
            V GGGVVST++ V+     +Y++ L+ +SVG+    +   P++       +GN+ ID+G
Sbjct: 535 IVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFH-----ALEGNIVIDSG 589

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
              T  P+ + N + + V + +   P  DP     LCY + +   I P++T HF GGA +
Sbjct: 590 TTLTYFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYS-NTTEIFPVITMHFSGGADL 648

Query: 317 PLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            L   + F+     G+FC A+    P      IFGN AQ++  +GYD  S +VSFKPT+C
Sbjct: 649 VLDKYNMFMESYSGGLFCLAIICNNPTQE--AIFGNRAQNNFLVGYDSSSLLVSFKPTNC 706

Query: 374 T 374
           +
Sbjct: 707 S 707



 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 168/346 (48%), Gaps = 59/346 (17%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+MK  IGTPP  ++  ++DTGS+L+W QCLPC+ CY Q  PI++P+ SS++KE  C +
Sbjct: 64  EYLMKLQIGTPPF-EVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT 122

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHN 139
                           C Y   Y D S T+G LATE +T  +++         + GC  N
Sbjct: 123 P------------DHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRN 170

Query: 140 NTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
           N+G  F  +  G+VGL R  LSL SQ+                              G  
Sbjct: 171 NSGSGFRPSSSGIVGLSRGSLSLISQM------------------------------GGA 200

Query: 199 VSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
             G GVVST++ +K  K   Y++ L+ +SVG+    +   P++        GN+ ID+G 
Sbjct: 201 YPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFH-----ALNGNIVIDSGT 255

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVP 317
           P T  P  + N + + V   +      DP     LCY + ++  I P++T HF GGA + 
Sbjct: 256 PLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIE-IFPVITVHFSGGADLV 314

Query: 318 LIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYD 360
           L   + ++     GVFC A+    P    V IFGN AQ++  +GYD
Sbjct: 315 LDKYNMYMELNRGGVFCLAIICNNPT--QVAIFGNRAQNNFLVGYD 358


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 133/370 (35%), Positives = 192/370 (51%), Gaps = 27/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V++ V   NGE++M  +IGTP   + Y  I+DTGSDL+W QC PC  C+ Q  PI++P  
Sbjct: 86  VEAPVHAGNGEFLMNLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSS+ +L C S+ C  L   SCS    C Y Y Y D S T+GVLATE  TFG+++     
Sbjct: 144 SSSFSKLPCSSDLCVALPISSCSDG--CEYRYSYGDHSSTQGVLATETFTFGDAS--VSK 199

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           + FGCG +N G       GLVGLGR  LSL    +SQLG  KFSYCL        I++ +
Sbjct: 200 IGFGCGEDNRGRAYSQGAGLVGLGRGPLSL----ISQLGVPKFSYCLTSIDDSKGISTLL 255

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS--- 247
               GSE +    + T L+    + ++Y+++LEGISVG+      L+P   S+ +I    
Sbjct: 256 V---GSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGD-----TLLPIEKSTFSIQDDG 307

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPI 305
            G + ID+G   T L    +  L+++  + +KL          +LC+  P        P 
Sbjct: 308 SGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQ 367

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           L  HF+ G  + L   +  I      V C  M    G + IFGNF Q ++ + +D + + 
Sbjct: 368 LVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425

Query: 366 VSFKPTDCTK 375
           +SF P  C +
Sbjct: 426 ISFAPAQCNQ 435


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 139/369 (37%), Positives = 186/369 (50%), Gaps = 79/369 (21%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N +QSNV +  G Y+M  S+GTPP+  + GI DTGSDL+W QCLPC  CYKQV+P+++P 
Sbjct: 16  NDIQSNVISGGGSYLMNISLGTPPV-SMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPK 74

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
            S +YK L                                  G L++E  T G++     
Sbjct: 75  KSKTYKTL----------------------------------GYLSSETFTIGSTEGDPA 100

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F  + FGCGH+N G FNE + GL+GLG   LSL  Q+ S++G  +FSYCLVP  +DS+ 
Sbjct: 101 SFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDSTA 159

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           +SK+ FG  + VSG G                                     +S  A  
Sbjct: 160 SSKINFGKSAVVSGSGT------------------------------------SSPAAAE 183

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
           + N+ ID+G   TLLP+DFY  +E  +   I      DPR    LCY       I P +T
Sbjct: 184 ESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI-PTIT 242

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
           AHF  GA V L   +TF+    E + CF+M P   ++ IFGN +Q +  +GYD  +  VS
Sbjct: 243 AHFI-GADVQLPPLNTFVQAQ-EDLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNKVS 299

Query: 368 FKPTDCTKQ 376
           FKPTDCTKQ
Sbjct: 300 FKPTDCTKQ 308


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 200/381 (52%), Gaps = 30/381 (7%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N +++     +GE++M+ SIG P +     IVDTGSDL+W QC PC +C+ Q  PI++P 
Sbjct: 95  NNIKAPTHGGSGEFLMELSIGNPAV-KYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPE 153

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
            SSSY ++ C S  C+ L   +C+  +  C Y Y Y D S T+G+LATE  TF + N+  
Sbjct: 154 KSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENS-I 212

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
             + FGCG  N G       GLVGLGR  LSL    +SQL   KFSYCL     DS  +S
Sbjct: 213 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSL----ISQLKETKFSYCLTSIE-DSEASS 267

Query: 190 KMYFGN---------GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPY 239
            ++ G+         G+ + G    + SL+   D+ ++Y++ L+GI+VG     +K +  
Sbjct: 268 SLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVG-----AKRLSV 322

Query: 240 YNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT 296
             S+  +S+   G M ID+G   T L +  +  L+E+  + + L        G  LC+K 
Sbjct: 323 EKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKL 382

Query: 297 PSMAG--IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
           P+ A     P L  HF  GA + L   +  +     GV C AM   +G + IFGN  Q +
Sbjct: 383 PNAAKNIAVPKLIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNG-MSIFGNVQQQN 440

Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
             + +D + + V+F PT+C K
Sbjct: 441 FNVLHDLEKETVTFVPTECGK 461


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 133/381 (34%), Positives = 199/381 (52%), Gaps = 30/381 (7%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           N +++     +GE++M+ SIG P +     IVDTGSDL+W QC PC +C+ Q  PI++P 
Sbjct: 94  NNIKAPTHGGSGEFLMELSIGNPAV-KYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPE 152

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
            SSSY ++ C S  C+ L   +C+  +  C Y Y Y D S T+G+LATE  TF + N+  
Sbjct: 153 KSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS-I 211

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
             + FGCG  N G       GLVGLGR  LSL    +SQL   KFSYCL     DS  +S
Sbjct: 212 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSL----ISQLKETKFSYCLTSIE-DSEASS 266

Query: 190 KMYFGN---------GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPY 239
            ++ G+         G+ + G    + SL+   D+ ++Y++ L+GI+VG     +K +  
Sbjct: 267 SLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVG-----AKRLSV 321

Query: 240 YNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT 296
             S+  +++   G M ID+G   T L +  +  L+E+  + + L        G  LC+K 
Sbjct: 322 EKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKL 381

Query: 297 PSMAG--IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
           P  A     P +  HF  GA + L   +  +     GV C AM   +G + IFGN  Q +
Sbjct: 382 PDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNG-MSIFGNVQQQN 439

Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
             + +D + + VSF PT+C K
Sbjct: 440 FNVLHDLEKETVSFVPTECGK 460


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 133/362 (36%), Positives = 197/362 (54%), Gaps = 17/362 (4%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
           S   GE+++   +GTPP   +  I+DTGSDL W+Q  PC  C++Q  PI++P+ SS+Y +
Sbjct: 19  SAGYGEFLVPIYLGTPPQKAVV-IIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNK 77

Query: 78  LSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
           ++C S  C  LL T +CS+   C Y YGY D S+T+G  + E IT  ++    + V FG 
Sbjct: 78  IACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAG--EEVKFGA 135

Query: 137 GHNNTGVFNEN-EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
              NTG F +    G++GLG+  +S+ SQ+ S LG NKFSYCLV + +  S TS MYFG+
Sbjct: 136 SVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLG-NKFSYCLVDWLSAGSETSTMYFGD 194

Query: 196 GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFI 253
            + V  G V  T +V   D  TYY++ ++GISV G+L +  + +   +S G+   G   I
Sbjct: 195 AA-VPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGS---GGTII 250

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
           D+G   T L ++ +N L     + ++  P      G  LC+ T      + P +T H D 
Sbjct: 251 DSGTTITYLQQEVFNALVAAYTSQVRY-PTTTSATGLDLCFNTRGTGSPVFPAMTIHLD- 308

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           G  + L   +TFI      + C A    +D  + IFGN  Q +  I YD D+  + F P 
Sbjct: 309 GVHLELPTANTFISLETN-IICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPA 367

Query: 372 DC 373
           DC
Sbjct: 368 DC 369


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 138/387 (35%), Positives = 194/387 (50%), Gaps = 34/387 (8%)

Query: 8   YPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
           YP +V    +S V++  G+YV   S+GTP  +    I DTGSDL+W+QC PC  C+ Q  
Sbjct: 21  YPPSVSTDYESPVASGGGDYVTTISLGTPAKV-FSVIADTGSDLIWIQCKPCQACFNQKD 79

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
           PI++P  SSSY  +SC    C  L   SCS    C+Y+YGY D S T+G L++E +T  +
Sbjct: 80  PIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPD--CDYSYGYGDGSGTRGTLSSETVTLTS 137

Query: 125 SNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
           +        N+ FGCGH N G FN+   GLVGLGR  LS  SQ L  L  +KFSYCLVP+
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDAS-GLVGLGRGNLSFVSQ-LGDLFGHKFSYCLVPW 195

Query: 182 HTDSSITSKMYFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
               S TS M+FG+ S     G       +  + +   +++Y+V L+ IS+   +  +  
Sbjct: 196 RDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI---AGRALR 252

Query: 237 IPYYNSSGAIS-----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
           IP    +G+        G M  D+G   TLLP   Y  +   +R+ I          G  
Sbjct: 253 IP----AGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLD 308

Query: 292 LCYKT----PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGI 346
           LCY       S     P +  HF+ GA   L   + FI     G + C AM   + D+GI
Sbjct: 309 LCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGI 367

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +GN  Q +  + YD  S  + + P+ C
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 131/366 (35%), Positives = 188/366 (51%), Gaps = 16/366 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY ++  IG+P  L  Y ++DTGSD+ W+QC PC  CYKQ   +++P +S
Sbjct: 3   VTSGLAFGSGEYFVRVGIGSPTKLQ-YLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRAS 61

Query: 73  SSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS++ LSC + QC LLD  +C+S    C Y   Y D S T G LA++  +F  S      
Sbjct: 62  SSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASD--SFSVSRGRTSP 119

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           VVFGCGH+N G+F      L+GLG  +LS      SQL + KFSYCLV        +S +
Sbjct: 120 VVFGCGHDNEGLFVGAAG-LLGLGAGKLSFP----SQLSSRKFSYCLVSRDNGVRASSAL 174

Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISK 248
            FG+ +  +      T L+      T+Y+  L GIS+G   LS  S      +S+G   +
Sbjct: 175 LFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTG---R 231

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILT 307
           G + ID+G   T LP   Y  + +  R+A +  P          CY   ++  +  P ++
Sbjct: 232 GGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVS 291

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF+GGA V L  ++  +P    G FCFA      D+ I GN  Q  + +  D DS  V 
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351

Query: 368 FKPTDC 373
           F P  C
Sbjct: 352 FAPRQC 357


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 137/387 (35%), Positives = 194/387 (50%), Gaps = 34/387 (8%)

Query: 8   YPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
           YP +V    +S V++  G+YV   S+GTP  +    I DTGSDL+W+QC PC  C+ Q  
Sbjct: 21  YPPSVSTDYESPVASGGGDYVTTISLGTPAKV-FSVIADTGSDLIWIQCKPCQACFNQKD 79

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
           PI++P  SSSY  +SC    C  L   SCS    C+Y+YGY D S T+G L++E +T  +
Sbjct: 80  PIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPN--CDYSYGYGDGSGTRGTLSSETVTLTS 137

Query: 125 SNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
           +        N+ FGCGH N G FN+   GLVGLGR  LS  SQ L  L  +KFSYCLVP+
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDAS-GLVGLGRGNLSFVSQ-LGDLFGHKFSYCLVPW 195

Query: 182 HTDSSITSKMYFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
               S TS M+FG+ S     G       +  + +   +++Y+V L+ IS+   +  +  
Sbjct: 196 RDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI---AGRALR 252

Query: 237 IPYYNSSGAIS-----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
           IP    +G+        G M  D+G   TLLP   Y  +   +R+ +          G  
Sbjct: 253 IP----AGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLD 308

Query: 292 LCYKT----PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGI 346
           LCY       S     P +  HF+ GA   L   + FI     G + C AM   + D+GI
Sbjct: 309 LCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGI 367

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +GN  Q +  + YD  S  + + P+ C
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 141/376 (37%), Positives = 205/376 (54%), Gaps = 27/376 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           NN+ +S +   NGEY+M   IGTPP+ +   I DTGSDL+WVQC PC  C+ Q  P++ P
Sbjct: 78  NNLPESLLIPENGEYLMTLYIGTPPV-ERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEP 136

Query: 70  ASSSSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
             SS++K  +C S+ C  +      C     C Y+Y Y D S T GV+ TE ++FG++ +
Sbjct: 137 LKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGD 196

Query: 128 F----FDNVVFGCGHNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
                F + +FGCG  N   F+ ++   GLVGLG   LSL SQ+  Q+G  KFSYCL+PF
Sbjct: 197 AQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGY-KFSYCLLPF 255

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYY 240
            ++S  TSK+ FG+ + V+  GVVST L+ K    ++YF+ LE +++G      K++P  
Sbjct: 256 SSNS--TSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQ-----KVVP-- 306

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
                 + GN+ ID+G   T L + FYN     ++  + +   QD     + C+    M 
Sbjct: 307 ---TGRTDGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPYRDMT 363

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGY 359
              P++   F  GA V L   +  I      + C A+ P     + IFGN AQ D  + Y
Sbjct: 364 --IPVIAFQFT-GASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVY 420

Query: 360 DFDSQMVSFKPTDCTK 375
           D + + VSF PTDCTK
Sbjct: 421 DLEGKKVSFAPTDCTK 436


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/372 (35%), Positives = 185/372 (49%), Gaps = 33/372 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY  +  +GTP   D+Y ++DTGSD+ W+QC PC  CY+Q  P++NP SS
Sbjct: 151 VVSGASQGSGEYFSRIGVGTPAK-DMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSS 209

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+YK L+C + QC LL+T +C S + C Y   Y D S T G LAT+ +TFGNS    +NV
Sbjct: 210 STYKSLTCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGK-INNV 267

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F                    I +Q+ A  FSYCLV    DS  +S + 
Sbjct: 268 ALGCGHDNEGLFTGAAG-----LLGLGGGVLSITNQMKATSFSYCLV--DRDSGKSSSLD 320

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAISK 248
           F N  ++ GG   +  L +K+  T+Y+V L G SVG       ++P      ++SG+   
Sbjct: 321 F-NSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVG---GEKVVLPDAIFDVDASGS--- 373

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTPSMAGI 302
           G + +D G   T L    YN L +     +KLT   + + GS        CY   S++ +
Sbjct: 374 GGVILDCGTAVTRLQTQAYNSLRDAF---LKLT--VNLKKGSSSISLFDTCYDFSSLSTV 428

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P +  HF GG  + L   +  IP    G FCFA  P    + I GN  Q    I YD 
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488

Query: 362 DSQMVSFKPTDC 373
              ++      C
Sbjct: 489 SKNVIGLSGNKC 500


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 131/366 (35%), Positives = 188/366 (51%), Gaps = 16/366 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY ++  IG+P  L  Y ++DTGSD+ W+QC PC  CYKQ   +++P +S
Sbjct: 3   VTSGLAFGSGEYFVRVGIGSPTKLQ-YLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRAS 61

Query: 73  SSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS++ LSC + QC LLD  +C+S    C Y   Y D S T G LA++  +F  S      
Sbjct: 62  SSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASD--SFLVSRGRTSP 119

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           VVFGCGH+N G+F      L+GLG  +LS      SQL + KFSYCLV        +S +
Sbjct: 120 VVFGCGHDNEGLFVGAAG-LLGLGAGKLSFP----SQLSSRKFSYCLVSRDNGVRASSAL 174

Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISK 248
            FG+ +  +      T L+      T+Y+  L GIS+G   LS  S      +S+G   +
Sbjct: 175 LFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTG---R 231

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILT 307
           G + ID+G   T LP   Y  + +  R+A +  P          CY   ++  +  P ++
Sbjct: 232 GGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVS 291

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF+GGA V L  ++  +P    G FCFA      D+ I GN  Q  + +  D DS  V 
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351

Query: 368 FKPTDC 373
           F P  C
Sbjct: 352 FAPRQC 357


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 139/370 (37%), Positives = 198/370 (53%), Gaps = 21/370 (5%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
             V++ V   NGE++MK +IGTP L     I+DTGSDL W QC PC  CY Q  PIY+P+
Sbjct: 102 KAVEAPVYAGNGEFLMKMAIGTPSL-SFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPS 160

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
            SS+Y ++ C S  C  L   SCS    C Y Y Y D S T+G+L+ E  T   ++    
Sbjct: 161 QSSTYSKVPCSSSMCQALPMYSCSGAN-CEYLYSYGDQSSTQGILSYESFTL--TSQSLP 217

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           ++ FGCG  N G       GLVG GR  LSL SQ+   LG NKFSYCLV      S TS 
Sbjct: 218 HIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLG-NKFSYCLVSITDSPSKTSP 276

Query: 191 MYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
           ++ G  + ++   V ST LV S+   T+Y+++LEGISVG      +L+   + +  +   
Sbjct: 277 LFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGG-----QLLDIADGTFDLQLD 331

Query: 248 -KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA--P 304
             G + ID+G   T L +  Y+ +++ V ++I L       +G  LC++  S +  +  P
Sbjct: 332 GTGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFP 391

Query: 305 ILTAHFDGGA-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
            +T HF+G    +P      +I     G+ C AM P +G + IFGN  Q +  I YD + 
Sbjct: 392 TITFHFEGADFNLP---KENYIYTDSSGIACLAMLPSNG-MSIFGNIQQQNYQILYDNER 447

Query: 364 QMVSFKPTDC 373
            ++SF PT C
Sbjct: 448 NVLSFAPTVC 457


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 132/363 (36%), Positives = 182/363 (50%), Gaps = 18/363 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S  S  +GEY  +  IG P    +Y ++DTGSD+ W+QC PC  CY Q  PI+ PASS
Sbjct: 133 IISGTSQGSGEYFSRVGIGKPSS-PVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASS 191

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +SY  LSC ++QC  LD   C +   C Y   Y D S T G   TE IT G+++   DNV
Sbjct: 192 TSYSPLSCDTKQCQSLDVSECRNNT-CLYEVSYGDGSYTVGDFVTETITLGSAS--VDNV 248

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F      L+GLG  +LS  SQI     A+ FSYCLV   +DS+ T +  
Sbjct: 249 AIGCGHNNEGLFIGAAG-LLGLGGGKLSFPSQI----NASSFSYCLVDRDSDSASTLEF- 302

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
               S +    + +  L ++E  T+Y+V + G+SV G L +  + +   + SG    G +
Sbjct: 303 ---NSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESG---NGGI 356

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            ID+G   T L    YN L +      K  P          CY       +  P +T H 
Sbjct: 357 IIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHL 416

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GG  +PL  T+  IP   +G FCFA  P    + I GN  Q    +G+D  + +V F+P
Sbjct: 417 AGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEP 476

Query: 371 TDC 373
             C
Sbjct: 477 RQC 479


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 131/372 (35%), Positives = 185/372 (49%), Gaps = 33/372 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY  +  +GTP   ++Y ++DTGSD+ W+QC PC  CY+Q  P++NP SS
Sbjct: 151 VVSGASQGSGEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSS 209

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+YK L+C + QC LL+T +C S + C Y   Y D S T G LAT+ +TFGNS    +NV
Sbjct: 210 STYKSLTCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGK-INNV 267

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F                    I +Q+ A  FSYCLV    DS  +S + 
Sbjct: 268 ALGCGHDNEGLFTGAAG-----LLGLGGGVLSITNQMKATSFSYCLV--DRDSGKSSSLD 320

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAISK 248
           F N  ++ GG   +  L +K+  T+Y+V L G SVG       ++P      ++SG+   
Sbjct: 321 F-NSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVG---GEKVVLPDAIFDVDASGS--- 373

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTPSMAGI 302
           G + +D G   T L    YN L +     +KLT   + + GS        CY   S++ +
Sbjct: 374 GGVILDCGTAVTRLQTQAYNSLRDAF---LKLT--VNLKKGSSSISLFDTCYDFSSLSTV 428

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P +  HF GG  + L   +  IP    G FCFA  P    + I GN  Q    I YD 
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488

Query: 362 DSQMVSFKPTDC 373
              ++      C
Sbjct: 489 SKNVIGLSGNKC 500


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 125/359 (34%), Positives = 186/359 (51%), Gaps = 39/359 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+MK  +GTPP  +I   +DTGSDL+W QC+PC  CY Q  PI++P++SS++KE  C   
Sbjct: 61  YLMKLQVGTPPF-EIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGN 119

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
            CH              Y   YAD++ +KG LATE +T  +++           GCGHN+
Sbjct: 120 SCH--------------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS 165

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +  F     G+VGL     SL +Q+  +      SYC       S  TSK+ FG  + V+
Sbjct: 166 SW-FKPTFSGMVGLSWGPSSLITQMGGEY-PGLMSYCFA-----SQGTSKINFGTNAIVA 218

Query: 201 GGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
           G GVVST++ ++      Y++ L+ +SVG+    +    ++       +GN+ ID+G   
Sbjct: 219 GDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH-----ALEGNIIIDSGTTL 273

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLI 319
           T  P  + N + E V + +      DP     LCY T ++  I P++T HF GGA + L 
Sbjct: 274 TYFPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTI-DIFPVITMHFSGGADLVLD 332

Query: 320 HTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             + +I     G FC A+     P D    IFGN AQ++  +GYD  S +VSF PT+C+
Sbjct: 333 KYNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 140/370 (37%), Positives = 202/370 (54%), Gaps = 28/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           +++ V   NGE++MK +IGTPP  + Y  I+DTGSDL+W QC PC QC+ Q  PI++P  
Sbjct: 86  IEAPVLPGNGEFLMKLAIGTPP--ETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKK 143

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSS+ +LSC S+ C  L   SC++   C Y Y Y D S T+G+LA+E +TFG ++    N
Sbjct: 144 SSSFSKLSCSSQLCEALPQSSCNNG--CEYLYSYGDYSSTQGILASETLTFGKAS--VPN 199

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG +N G       GLVGLGR  LSL    +SQL   KFSYCL     D + TS +
Sbjct: 200 VAFGCGADNEGSGFSQGAGLVGLGRGPLSL----VSQLKEPKFSYCLTT--VDDTKTSTL 253

Query: 192 YFGNGSEV--SGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS- 247
             G+ + V  S   + +T L+ S    ++Y+++LEGISVG+       +P   S+ ++  
Sbjct: 254 LMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTR-----LPIKKSTFSLQD 308

Query: 248 --KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IA 303
              G + ID+G   T L +  +N + ++    I L        G  +C+  PS +     
Sbjct: 309 DGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEV 368

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P L  HFD GA + L   +  I     GV C AM    G + IFGN  Q ++ + +D + 
Sbjct: 369 PKLVFHFD-GADLELPAENYMIGDSSMGVACLAMGSSSG-MSIFGNVQQQNMLVLHDLEK 426

Query: 364 QMVSFKPTDC 373
           + +SF PT C
Sbjct: 427 ETLSFLPTQC 436


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 141/373 (37%), Positives = 199/373 (53%), Gaps = 28/373 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           N  + S V + NGE++M  +IGTPP  + Y  I+DTGSDL+W QC PC QC+ Q  PI++
Sbjct: 86  NAEINSPVLSGNGEFLMNLAIGTPP--ETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFD 143

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           P  SSS+ +LSC S+ C  L   SCS    C Y Y Y D S T+G +ATE  TFG  +  
Sbjct: 144 PKKSSSFSKLSCSSQLCKALPQSSCSDS--CEYLYTYGDYSSTQGTMATETFTFGKVS-- 199

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             NV FGCG +N G       GLVGLGR  LSL    +SQL   KFSYCL     D + T
Sbjct: 200 IPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSL----VSQLKEAKFSYCLTSI--DDTKT 253

Query: 189 SKMYFGNGSEVSG--GGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           S +  G+ + V+G    + +T L+      ++Y+++LEGISVG        +P   S+  
Sbjct: 254 STLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTR-----LPIKESTFQ 308

Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG- 301
           +     G + ID+G   T L +  ++ ++++  + + L        G +LCY  PS    
Sbjct: 309 LQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSE 368

Query: 302 -IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
              P L  HF  GA + L   +  I     GV C AM    G + IFGN  Q ++F+ +D
Sbjct: 369 LEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAMGS-SGGMSIFGNVQQQNMFVSHD 426

Query: 361 FDSQMVSFKPTDC 373
            + + +SF PT+C
Sbjct: 427 LEKETLSFLPTNC 439


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 130/370 (35%), Positives = 184/370 (49%), Gaps = 29/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S VS  +GEY  +  +GTP   ++Y ++DTGSD+ W+QC PC  CY+Q  P++NP SS
Sbjct: 151 VVSGVSQGSGEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSS 209

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+YK L+C + QC LL+T +C S + C Y   Y D S T G LAT+ +TFGNS    D V
Sbjct: 210 STYKSLTCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKIND-V 267

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F                 A  I +Q+ A  FSYCLV    DS  +S + 
Sbjct: 268 ALGCGHDNEGLFTGAAG-----LLGLGGGALSITNQMKATSFSYCLV--DRDSGKSSSLD 320

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAISK 248
           F N  ++  G   +  L +++  T+Y+V L G SVG       ++P      ++SG+   
Sbjct: 321 F-NSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVG---GQKVMMPDAIFDVDASGS--- 373

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-A 303
           G + +D G   T L    YN L +     +KLT        S      CY   S++ +  
Sbjct: 374 GGVILDCGTAVTRLQTQAYNSLRDAF---LKLTTNLKKGTSSISLFDTCYDFSSLSSVKV 430

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P +  HF GG  + L   +  IP    G FCFA  P    + I GN  Q    I YD  +
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLAN 490

Query: 364 QMVSFKPTDC 373
           +++      C
Sbjct: 491 KIIGLSGNKC 500


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 132/381 (34%), Positives = 199/381 (52%), Gaps = 32/381 (8%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P    +  V+ ++GEY++  +IGTPPL     I+DTGSDL+W QC PC+ C  Q  P ++
Sbjct: 74  PITAARVLVTASSGEYLVDLAIGTPPLY-YTAIMDTGSDLIWTQCAPCLLCADQPTPYFD 132

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
              S++Y+ L C+S +C  L + SC  +++C Y Y Y D++ T GVLA E  TFG +N+ 
Sbjct: 133 VKKSATYRALPCRSSRCASLSSPSC-FKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
                N+ FGCG  N G    N  G+VG GR  LSL    +SQLG ++FSYCL  +   S
Sbjct: 192 KVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGPLSL----VSQLGPSRFSYCLTSYL--S 244

Query: 186 SITSKMYFG-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
           +  S++YFG     + +  S G  V ++  +++      YF++L+ IS+G     +KL+P
Sbjct: 245 ATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLG-----TKLLP 299

Query: 239 YYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
                 AI+    G + ID+G   T L +D Y  +   + +AI L    D  +G   C++
Sbjct: 300 IDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQ 359

Query: 296 TPSMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
            P    +    P L  HFD  A + L+  +  +     G  C  M P  G   I GN+ Q
Sbjct: 360 WPPPPNVTVTVPDLVFHFD-SANMTLLPENYMLIASTTGYLCLVMAPT-GVGTIIGNYQQ 417

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
            +L + YD  +  +SF P  C
Sbjct: 418 QNLHLLYDIGNSFLSFVPAPC 438


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 139/375 (37%), Positives = 205/375 (54%), Gaps = 28/375 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           N+ + + V   NGE++MK +IGTPP  + Y  I+DTGSDL+W QC PC QC+ Q  PI++
Sbjct: 83  NSEIDAPVLPGNGEFLMKLAIGTPP--ETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFD 140

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           P  SSS+ +LSC S+ C  L   +CS    C Y YGY D S T+G+LA+E +TFG  +  
Sbjct: 141 PKKSSSFSKLSCSSKLCEALPQSTCSDG--CEYLYGYGDYSSTQGMLASETLTFGKVS-- 196

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
              V FGCG +N G       GLVGLGR  LSL    +SQL   KFSYCL     D +  
Sbjct: 197 VPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSL----VSQLKEPKFSYCLT--SVDDTKA 250

Query: 189 SKMYFGNGSEV--SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           S +  G+ + V  S   + +T L+    + ++Y+++LEGISVG+ S     +P   S+ +
Sbjct: 251 STLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTS-----LPIKKSTFS 305

Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG- 301
           + +   G + ID+G   T L +  ++ + ++  + I L        G ++C+  PS +  
Sbjct: 306 LQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTD 365

Query: 302 -IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
              P L  HFD GA + L   +  I     GV C AM    G + IFGN  Q ++ + +D
Sbjct: 366 IEVPKLVFHFD-GADLELPAENYMIADASMGVACLAMGSSSG-MSIFGNIQQQNMLVLHD 423

Query: 361 FDSQMVSFKPTDCTK 375
            + + +SF PT C +
Sbjct: 424 LEKETLSFLPTQCDE 438


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 135/390 (34%), Positives = 202/390 (51%), Gaps = 34/390 (8%)

Query: 1   MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY 60
           +SPA    P    +  V+ ++GEY++  +IGTPPL     I+DTGSDL+W QC PC+ C 
Sbjct: 66  VSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLY-YTAIMDTGSDLIWTQCAPCLLCA 124

Query: 61  KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
            Q  P ++   S++Y+ L C+S +C  L + SC  +++C Y Y Y D++ T GVLA E  
Sbjct: 125 AQPTPYFDVKRSATYRALPCRSSRCAALSSPSC-FKKMCVYQYYYGDTASTAGVLANETF 183

Query: 121 TFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           TFG +++      N+ FGCG  N G    N  G+VG GR  LSL    +SQLG ++FSYC
Sbjct: 184 TFGAASSTKVRAANISFGCGSLNAGEL-ANSSGMVGFGRGPLSL----VSQLGPSRFSYC 238

Query: 178 LVPFHTDSSITSKMYFG-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNL 230
           L  +   S   S++YFG     N +  S G  V ++  +++      YF++++GIS+G  
Sbjct: 239 LTSYL--SPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLG-- 294

Query: 231 SNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR 287
              +K +P      AI+    G + ID+G   T L +D Y  +   + + I L    D  
Sbjct: 295 ---TKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTD 351

Query: 288 LGSQLCYKTPSMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV 344
           +G   C++ P    +    P    HFD GA + L   +  +     G  C AM P    V
Sbjct: 352 IGLDTCFQWPPPPNVTVTVPDFVFHFD-GANMTLPPENYMLIASTTGYLCLAMAPT--SV 408

Query: 345 G-IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G I GN+ Q +L + YD  +  +SF P  C
Sbjct: 409 GTIIGNYQQQNLHLLYDIANSFLSFVPAPC 438


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 130/366 (35%), Positives = 190/366 (51%), Gaps = 30/366 (8%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
           M+ SIG P +     IVDTGSDL+W QC PC +C+ Q  PI++P  SSSY ++ C S  C
Sbjct: 1   MELSIGNPAV-KYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59

Query: 86  HLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVF 144
           + L   +C+  +  C Y Y Y D S T+G+LATE  TF + N+    + FGCG  N G  
Sbjct: 60  NALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS-ISGIGFGCGVENEGDG 118

Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN--------- 195
                GLVGLGR  LSL    +SQL   KFSYCL     DS  +S ++ G+         
Sbjct: 119 FSQGSGLVGLGRGPLSL----ISQLKETKFSYCLTSIE-DSEASSSLFIGSLASGIVNKT 173

Query: 196 GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
           G+ + G    + SL+   D+ ++Y++ L+GI+VG     +K +    S+  +++   G M
Sbjct: 174 GASLDGEVTKTMSLLRNPDQPSFYYLELQGITVG-----AKRLSVEKSTFELAEDGTGGM 228

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPILTAH 309
            ID+G   T L +  +  L+E+  + + L        G  LC+K P  A     P +  H
Sbjct: 229 IIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFH 288

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F  GA + L   +  +     GV C AM   +G + IFGN  Q +  + +D + + VSF 
Sbjct: 289 FK-GADLELPGENYMVADSSTGVLCLAMGSSNG-MSIFGNVQQQNFNVLHDLEKETVSFV 346

Query: 370 PTDCTK 375
           PT+C K
Sbjct: 347 PTECGK 352


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 174/374 (46%), Gaps = 34/374 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ-VKPIYNPASSSSYKELSCQ 81
           EY+M  S+GTPP   +   +DTGSDL+W QC PC+ C++Q   P+ +PA+SS++  L C 
Sbjct: 89  EYLMHVSVGTPPR-PVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCD 147

Query: 82  SEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF----FDNVV 133
           +  C  L   SC  +      C Y Y Y D SLT G LAT+  TFG  +N        V 
Sbjct: 148 APLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVT 207

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSS------ 186
           FGCGH N G+F  NE G+ G GR R SL     SQL    FSYC    F T SS      
Sbjct: 208 FGCGHINKGIFQANETGIAGFGRGRWSLP----SQLNVTSFSYCFTSMFDTKSSSVVTLG 263

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             +            G V +T L+    + + YFV L GISVG    +   +P       
Sbjct: 264 AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVG---GARVAVPESR---- 316

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA----G 301
             + +  ID+GA  T LP+D Y  ++ +  + + L           LC+  P  A     
Sbjct: 317 -LRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRP 375

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P LT H DGGA   L   +         V C  +    G+  + GN+ Q +  + YD 
Sbjct: 376 AVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDL 435

Query: 362 DSQMVSFKPTDCTK 375
           ++ ++SF P  C K
Sbjct: 436 ENDVLSFAPARCDK 449


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 189/362 (52%), Gaps = 41/362 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+MK  +GTPP  +I   +DTGSD++W QC+PC  CY Q  PI++P+ SS+++E  C   
Sbjct: 421 YLMKLQVGTPPF-EIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNGN 479

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
            CH              Y   YAD + +KG+LATE +T  +++           GCG +N
Sbjct: 480 SCH--------------YEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDN 525

Query: 141 TGV----FNENEMGLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           T +    F  +  G+VGL    LSL SQ+ L   G    SYC          TSK+ FG 
Sbjct: 526 TNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGL--ISYCF-----SGQGTSKINFGT 578

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
            + V+G G V+  +  K+D  +Y++ L+ +SV +      LI    +      GN+FID+
Sbjct: 579 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVED-----NLIATLGTPFHAEDGNIFIDS 633

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ--LCYKTPSMAGIAPILTAHFDGG 313
           G   T  P  + N + E V   +  T  + P +GS   LCY + ++  I P++T HF GG
Sbjct: 634 GTTLTYFPMSYCNLVREAVEQVV--TAVKVPDMGSDNLLCYYSDTI-DIFPVITMHFSGG 690

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           A + L   + ++     G+FC A+   D  +  +FGN AQ++  +GYD  S ++SF PT+
Sbjct: 691 ADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTN 750

Query: 373 CT 374
           C+
Sbjct: 751 CS 752



 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 177/354 (50%), Gaps = 41/354 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+MK  +GTPP  +I   +DTGSDL+W QC+PC  CY Q  PI++P+ SS++ E  C  +
Sbjct: 82  YLMKLQVGTPPF-EIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGK 140

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
            CH              Y   Y D++ +KG+LATE +T  +++           GCG +N
Sbjct: 141 SCH--------------YEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHN 186

Query: 141 TGV----FNENEMGLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           T +    F  +  G+VGL     SL SQ+ L   G    SYC          TSK+ FG 
Sbjct: 187 TDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGL--ISYCF-----SGQGTSKINFGT 239

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
            + V+G G V+  +  K+D  +Y++ L+ +SV +    +   P++        GN+ ID+
Sbjct: 240 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFH-----AEDGNIVIDS 294

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G+  T  P  + N + + V   +      DP     LCY + ++  I P++T HF GGA 
Sbjct: 295 GSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSETI-DIFPVITMHFSGGAD 353

Query: 316 VPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           + L   + ++     G+FC A+    P      IFGN AQ++  +GYD  S ++
Sbjct: 354 LVLDKYNMYMESNSGGLFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLL 405


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 143/373 (38%), Positives = 198/373 (53%), Gaps = 36/373 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELS 79
           GEY+M  SIGTPPL     I DTGSDL+W QC PC   QC+ Q  P+YNPASS+++  L 
Sbjct: 90  GEYLMTLSIGTPPL-SYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLP 148

Query: 80  CQSEQCHLLDTVSCSSQQ-----LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDN 131
           C S        ++  +       + N TYG   +  T GV  +E  TFG++         
Sbjct: 149 CNSSLSMCAGVLAGKAPPPGCACMYNQTYG---TGWTAGVQGSETFTFGSAAADQARVPG 205

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           + FGC + ++  +N    GLVGLGR  LSL    +SQLGA +FSYCL PF  D++ TS +
Sbjct: 206 IAFGCSNASSSDWN-GSAGLVGLGRGSLSL----VSQLGAGRFSYCLTPFQ-DTNSTSTL 259

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
             G  + ++G GV ST  V+   K    TYY++ L GIS+G  + +  + P   S  A  
Sbjct: 260 LLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLG--AKALSISPDAFSLKADG 317

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYK--TPSMAGIA 303
            G + ID+G   T L    Y ++   V++ + L P  D     G  LCY   TP+ A  A
Sbjct: 318 TGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTL-PAIDGSDSTGLDLCYALPTPTSAPPA 376

Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGYDF 361
            P +T HFDG A + L   S  I     GV+C AM+   DG +  FGN+ Q ++ I YD 
Sbjct: 377 MPSMTLHFDG-ADMVLPADSYMISG--SGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDV 433

Query: 362 DSQMVSFKPTDCT 374
            ++M+SF P  C+
Sbjct: 434 RNEMLSFAPAKCS 446


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 136/374 (36%), Positives = 193/374 (51%), Gaps = 28/374 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V++ V   NGE++MK +IG+PP      I+DTGSDL+W QC PC QC+ Q  PI++P  S
Sbjct: 100 VKAPVVAGNGEFLMKLAIGSPPR-SFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQS 158

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
           SS+ ++SC SE C  L T +CSS   C Y Y Y DSS T+GVLA E  TFG+S       
Sbjct: 159 SSFYKISCSSELCGALPTSTCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISI 217

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
             + FGCG++N G       GLVGLGR  LSL SQ+  Q    KF+YCL     D S  S
Sbjct: 218 PGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ----KFAYCLTAI--DDSKPS 271

Query: 190 KMYFGNGSEV----SGGGVVSTSLVSKEDK-TYYFVTLEGISVG--NLSNSSKLIPYYNS 242
            +  G+ + +    S   + +T L+    + ++Y+++L+GISVG   LS        ++ 
Sbjct: 272 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 331

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAG 301
                 G + ID+G   T +    +  L+ +    + L P  D   G   LC+  P+   
Sbjct: 332 ----GSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNL-PVDDSGTGGLDLCFNLPAGTN 386

Query: 302 I--APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P LT HF  GA + L   +  I     G+ C A+    G + IFGN  Q +  + +
Sbjct: 387 QVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRG-MSIFGNLQQQNFMVVH 444

Query: 360 DFDSQMVSFKPTDC 373
           D   + +SF PT C
Sbjct: 445 DLQEETLSFLPTQC 458


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 126/368 (34%), Positives = 188/368 (51%), Gaps = 23/368 (6%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
           +V  +   Y++  +IGTPPL  +  ++DTGSDL+W QC  PC +C+ Q  P+Y PA S++
Sbjct: 84  SVHASTATYLVDIAIGTPPL-PLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSAT 142

Query: 75  YKELSCQSEQCHLLDT--VSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           Y  +SC+S  C  L +    CS     C Y + Y D + T GVLATE  T G S+     
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRG 201

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG  N G   +N  GLVG+GR  LSL    +SQLG  +FSYC  PF  +++  S +
Sbjct: 202 VAFGCGTENLGS-TDNSSGLVGMGRGPLSL----VSQLGVTRFSYCFTPF--NATAASPL 254

Query: 192 YFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           + G+ + +S        V S S  ++   +YY+++LEGI+VG+      + P       +
Sbjct: 255 FLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGD--TLLPIDPAVFRLTPM 312

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
             G + ID+G   T L +  +  L   + + ++L       LG  LC+   S   +  P 
Sbjct: 313 GDGGVIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPR 372

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           L  HFD GA + L   S  +     GV C  M    G + + G+  Q +  I YD +  +
Sbjct: 373 LVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGI 430

Query: 366 VSFKPTDC 373
           +SF+P  C
Sbjct: 431 LSFEPAKC 438


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 131/363 (36%), Positives = 189/363 (52%), Gaps = 26/363 (7%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKEL 78
           NG Y+M+  IGTP + +   I DTGSDL WVQC PC   +C+ Q  P+Y+P +SS++  L
Sbjct: 93  NGNYLMRIYIGTPSV-ERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLL 151

Query: 79  SCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-VVFG 135
            C S+ C  L      CS    C Y Y Y D+S + G L+++ I        +++ + FG
Sbjct: 152 PCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFG 211

Query: 136 CGHNN--TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           CG  N  T   +    G+VGLG   LSL SQ+  ++G +KFSYCL+PF ++S+  SK+ F
Sbjct: 212 CGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIG-HKFSYCLLPFSSNSN--SKLKF 268

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G  + V G GVVST L+ K D  +Y++ LEGI+VG  +  +            + GN+ I
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKT----------GQTDGNIII 318

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
           D+G+  T L + FYN     V+  + +   Q        C+         P +  HF GG
Sbjct: 319 DSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGG 378

Query: 314 AKV-PLIHTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
             V   ++T   I    + + C  + P   D + IFGN  Q D  +GYD     VSF PT
Sbjct: 379 DVVLKPMNTLVLIE---DNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPT 435

Query: 372 DCT 374
           DC+
Sbjct: 436 DCS 438


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 135/379 (35%), Positives = 185/379 (48%), Gaps = 32/379 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPA 70
           V+S + T + EY+M  ++GTPP   +  I DTGSDL+WV C              +++P+
Sbjct: 89  VESKIITRSFEYLMYVNVGTPPA-QMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPS 147

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-- 128
            S++Y  LSCQS  C  L   SC +   C Y Y Y D S T GVL+TE  +F  +     
Sbjct: 148 RSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGE 207

Query: 129 ----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFHT 183
                  V FGC   + G F  +  GLVGLG   LSL SQ+  +   A +FSYCLVP + 
Sbjct: 208 GQVRVPRVSFGCSTGSAGSFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYA 265

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            ++ +S + FG  + VS  G  ST LV  E  +YY V LE ++V     +S      NSS
Sbjct: 266 AANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASA-----NSS 320

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA--- 300
                  + +D+G   T L       L  ++   I+L   Q P    QLCY     +   
Sbjct: 321 ------RIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAE 374

Query: 301 --GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQSDLF 356
             GI P +T  F GGA V L   +TF     EG  C  + P+     V I GN AQ +  
Sbjct: 375 DFGI-PDVTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFH 432

Query: 357 IGYDFDSQMVSFKPTDCTK 375
           +GYD D++ V+F   DCT+
Sbjct: 433 VGYDLDARTVTFAAVDCTR 451


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 185/359 (51%), Gaps = 39/359 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+MK  +GTPP  +I   +DTGSDL+W QC+PC  CY Q  PI++P++SS++KE  C   
Sbjct: 61  YLMKLQVGTPPF-EIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGN 119

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
            CH              Y   YAD++ +KG LATE +T  +++           GCGHN+
Sbjct: 120 SCH--------------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS 165

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +  F     G+VGL     SL +Q+  +      SYC       S  TSK+ FG  + V+
Sbjct: 166 SW-FKPTFSGMVGLSWGPSSLITQMGGEY-PGLMSYCFA-----SQGTSKINFGTNAIVA 218

Query: 201 GGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
           G GVVST++ ++      Y++ L+ +SVG+    +    ++       +GN+ ID+G   
Sbjct: 219 GDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH-----ALEGNIIIDSGTTL 273

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLI 319
           T  P  + N + E V + +      DP     LCY T ++  I P++T HF GGA + L 
Sbjct: 274 TYFPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTI-DIFPVITMHFSGGADLVLD 332

Query: 320 HTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             + +I     G FC A+     P D    IFGN AQ++  +GYD  S +V F PT+C+
Sbjct: 333 KYNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 126/368 (34%), Positives = 188/368 (51%), Gaps = 23/368 (6%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
           +V  +   Y++  +IGTPPL  +  ++DTGSDL+W QC  PC +C+ Q  P+Y PA S++
Sbjct: 84  SVHASTATYLVDIAIGTPPL-PLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSAT 142

Query: 75  YKELSCQSEQCHLLDT--VSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           Y  +SC+S  C  L +    CS     C Y + Y D + T GVLATE  T G S+     
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRG 201

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG  N G   +N  GLVG+GR  LSL    +SQLG  +FSYC  PF  +++  S +
Sbjct: 202 VAFGCGTENLGS-TDNSSGLVGMGRGPLSL----VSQLGVTRFSYCFTPF--NATAASPL 254

Query: 192 YFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           + G+ + +S        V S S  ++   +YY+++LEGI+VG+      + P       +
Sbjct: 255 FLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGD--TLLPIDPAVFRLTPM 312

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
             G + ID+G   T L +  +  L   + + ++L       LG  LC+   S   +  P 
Sbjct: 313 GDGGVIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPR 372

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           L  HFD GA + L   S  +     GV C  M    G + + G+  Q +  I YD +  +
Sbjct: 373 LVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGI 430

Query: 366 VSFKPTDC 373
           +SF+P  C
Sbjct: 431 LSFEPAKC 438


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 140/378 (37%), Positives = 195/378 (51%), Gaps = 45/378 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPPL     + DTGSDL+W QC PC  QC++Q  P+YNPASS+++  L C
Sbjct: 110 GEYLMTLAIGTPPL-PYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPC 168

Query: 81  QS--EQCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
            S    C      +        + N TYG   +  T GV  +E  TFG+S         V
Sbjct: 169 NSSLSMCAGALAGAAPPPGCACMYNQTYG---TGWTAGVQGSETFTFGSSAADQARVPGV 225

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC + ++  +N    GLVGLGR  LSL    +SQLGA +FSYCL PF  D++ TS + 
Sbjct: 226 AFGCSNASSSDWN-GSAGLVGLGRGSLSL----VSQLGAGRFSYCLTPFQ-DTNSTSTLL 279

Query: 193 FGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS- 247
            G  + ++G GV ST  V+   +    TYY++ L GIS+G     +K +P   S GA S 
Sbjct: 280 LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLG-----AKALPI--SPGAFSL 332

Query: 248 ----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMA- 300
                G + ID+G   T L    Y ++   V++ +   P  D     G  LC+  P+   
Sbjct: 333 KPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTS 392

Query: 301 ---GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLF 356
               + P +T HFD GA + L   S  I     GV+C AM+   DG +  FGN+ Q ++ 
Sbjct: 393 APPAVLPSMTLHFD-GADMVLPADSYMISG--SGVWCLAMRNQTDGAMSTFGNYQQQNMH 449

Query: 357 IGYDFDSQMVSFKPTDCT 374
           I YD   + +SF P  C+
Sbjct: 450 ILYDVREETLSFAPAKCS 467


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 140/362 (38%), Positives = 197/362 (54%), Gaps = 27/362 (7%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
            +GEY+++ +IGTP  L +  I+DTGSDL+W +C PC  C      IY+P+SSS+Y ++ 
Sbjct: 38  GSGEYLIQMAIGTPA-LSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVL 94

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           CQS  C      SC++   C Y Y Y D S T G+L+ E  TF  S+    N+ FGCGH+
Sbjct: 95  CQSSLCQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDE--TFSISSQSLPNITFGCGHD 152

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N G   +   GLVG GR  LSL SQ+   +G NKFSYCLV   TDSS TS ++ GN + +
Sbjct: 153 NQGF--DKVGGLVGFGRGSLSLVSQLGPSMG-NKFSYCLVS-RTDSSKTSPLFIGNTASL 208

Query: 200 SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY----YNSSGAISKGNMFIDT 255
               V ST LV      +Y+++LEGISVG     S  IP       S G+   G + ID+
Sbjct: 209 EATTVGSTPLVQSSSTNHYYLSLEGISVG---GQSLAIPTGTFDIQSDGS---GGLIIDS 262

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGA 314
           G   T L +  Y+ ++E + ++I L P  D +L   LC+     +    P +T HF  GA
Sbjct: 263 GTTLTFLQQTAYDAVKEAMVSSINL-PQADGQL--DLCFNQQGSSNPGFPSMTFHFK-GA 318

Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
              +   +   P     + C AM P +   G++ IFGN  Q +  I YD ++ ++SF PT
Sbjct: 319 DYDVPKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPT 378

Query: 372 DC 373
            C
Sbjct: 379 AC 380


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  194 bits (492), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 136/374 (36%), Positives = 193/374 (51%), Gaps = 28/374 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V++ V   NGE++MK +IG+PP      I+DTGSDL+W QC PC QC+ Q  PI++P  S
Sbjct: 355 VKAPVVAGNGEFLMKLAIGSPPR-SFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQS 413

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
           SS+ ++SC SE C  L T +CSS   C Y Y Y DSS T+GVLA E  TFG+S       
Sbjct: 414 SSFYKISCSSELCGALPTSTCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISI 472

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
             + FGCG++N G       GLVGLGR  LSL SQ+  Q    KF+YCL     D S  S
Sbjct: 473 PGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ----KFAYCLTAI--DDSKPS 526

Query: 190 KMYFGNGSEV----SGGGVVSTSLVSKEDK-TYYFVTLEGISVG--NLSNSSKLIPYYNS 242
            +  G+ + +    S   + +T L+    + ++Y+++L+GISVG   LS        ++ 
Sbjct: 527 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 586

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAG 301
                 G + ID+G   T +    +  L+ +    + L P  D   G   LC+  P+   
Sbjct: 587 ----GSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNL-PVDDSGTGGLDLCFNLPAGTN 641

Query: 302 I--APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P LT HF  GA + L   +  I     G+ C A+    G + IFGN  Q +  + +
Sbjct: 642 QVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRG-MSIFGNLQQQNFMVVH 699

Query: 360 DFDSQMVSFKPTDC 373
           D   + +SF PT C
Sbjct: 700 DLQEETLSFLPTQC 713


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 125/362 (34%), Positives = 184/362 (50%), Gaps = 37/362 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N  Y+MK  +GTPP  +I  I+DTGS++ W QCLPCV CY+Q  PI++P+ SS++KE  C
Sbjct: 62  NSVYLMKLQVGTPPF-EIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRC 120

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCG 137
                             C Y   Y D + T G LATE IT  +++         + GCG
Sbjct: 121 DGHS--------------CPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCG 166

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           HNN+  F  +  G+VGL     SL +Q+  +      SYC          TSK+ FG  +
Sbjct: 167 HNNSW-FKPSFSGMVGLNWGPSSLITQMGGEY-PGLMSYCF-----SGQGTSKINFGANA 219

Query: 198 EVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
            V+G GVVST++     K  +Y++ L+ +SVGN       I    ++    +GN+ ID+G
Sbjct: 220 IVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGN-----TRIETMGTTFHALEGNIVIDSG 274

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
              T  P  + N + + V + +      DP     LCY + ++  I P++T HF GG  +
Sbjct: 275 TTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTI-DIFPVITMHFSGGVDL 333

Query: 317 PLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            L   + ++     GVFC A+    P      IFGN AQ++  +GYD  S +VSF PT+C
Sbjct: 334 VLDKYNMYMESNNGGVFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391

Query: 374 TK 375
           + 
Sbjct: 392 SA 393


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 129/374 (34%), Positives = 175/374 (46%), Gaps = 25/374 (6%)

Query: 7   FYPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
           F P ++   + S  S  +GEY  +  IG PP    Y I+DTGSD+ WVQC PC  CY+Q 
Sbjct: 129 FKPEDLQSPIISGTSQGSGEYFSRVGIGKPPS-QAYLILDTGSDVNWVQCAPCADCYQQA 187

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
            PI+ PASS+S+  LSC + QC  LD   C +   C Y   Y D S T G   TE IT G
Sbjct: 188 DPIFEPASSASFSTLSCNTRQCRSLDVSECRNDT-CLYEVSYGDGSYTVGDFVTETITLG 246

Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
           ++    DNV  GCGHNN G+F                 +    SQ+ A  FSYCLV    
Sbjct: 247 SAP--VDNVAIGCGHNNEGLFVGAAG-----LLGLGGGSLSFPSQINATSFSYCLV--DR 297

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
           DS   S + F   S +    V +  L +    T+Y+V L G+SVG      +L+    S+
Sbjct: 298 DSESASTLEF--NSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGG-----ELVSIPESA 350

Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
             I +   G + +D+G   T L  D YN L +      +  P  +       CY   S  
Sbjct: 351 FQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKG 410

Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            +  P ++ HF  G ++PL   +  +P   EG FCFA  P    + I GN  Q    + Y
Sbjct: 411 NVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVY 470

Query: 360 DFDSQMVSFKPTDC 373
           D  + +V F P  C
Sbjct: 471 DLVNHLVGFVPNKC 484


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 133/377 (35%), Positives = 191/377 (50%), Gaps = 31/377 (8%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            +Q  V   NGE++M  SIGTP +     I+DTGSDL+W QC PCV+C+ Q  P+++P+S
Sbjct: 90  ALQVPVHAGNGEFLMDMSIGTPAVA-YAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSS 148

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS+Y  L C S  C  L +  C+S + C YTY Y DSS T+GVLA E  T   +     +
Sbjct: 149 SSTYAALPCSSTLCSDLPSSKCTSAK-CGYTYTYGDSSSTQGVLAAETFTLAKTK--LPD 205

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG  N G       GLVGLGR  LSL    +SQLG NKFSYCL     D +  S +
Sbjct: 206 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGLNKFSYCLTSL--DDTSKSPL 259

Query: 192 YFGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             G+ + +     +   V +T L+    + ++Y+V L+G++VG     S  I   +S+ A
Sbjct: 260 LLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVG-----STHITLPSSAFA 314

Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
           +     G + +D+G   T L    Y  L++     +KL       +G   C++ P+ +G+
Sbjct: 315 VQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPA-SGV 373

Query: 303 ----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
                P L  H D GA + L   +  +     G  C  +    G + I GNF Q ++   
Sbjct: 374 DQVEVPKLVFHLD-GADLDLPAENYMVLDSGSGALCLTVMGSRG-LSIIGNFQQQNIQFV 431

Query: 359 YDFDSQMVSFKPTDCTK 375
           YD     +SF P  C K
Sbjct: 432 YDVGENTLSFAPVQCAK 448


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 133/359 (37%), Positives = 181/359 (50%), Gaps = 29/359 (8%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
           IGTP L     IVDTGSDL+W QC PCV C+KQ  P+++P+SSS+Y  + C S  C  L 
Sbjct: 173 IGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231

Query: 90  TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
           T  C+S   C YTY Y DSS T+GVLATE  T   S      VVFGCG  N G       
Sbjct: 232 TSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGVVFGCGDTNEGDGFSQGA 289

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV-----SGGGV 204
           GLVGLGR  LSL    +SQLG +KFSYCL     D +  S +  G+ + +     +   V
Sbjct: 290 GLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLLLGSLAGISEASAAASSV 343

Query: 205 VSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPT 260
            +T L+    + ++Y+V+L+ I+VG     S  I   +S+ A+     G + +D+G   T
Sbjct: 344 QTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAVQDDGTGGVIVDSGTSIT 398

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI----APILTAHFDGGAKV 316
            L    Y  L++     + L       +G  LC++ P+  G+     P L  HFDGGA +
Sbjct: 399 YLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAK-GVDQVEVPRLVFHFDGGADL 457

Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            L   +  +     G  C  +    G + I GNF Q +    YD     +SF P  C K
Sbjct: 458 DLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNK 515


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/368 (34%), Positives = 183/368 (49%), Gaps = 28/368 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S  S  +GEY  +  IG PP   +Y ++DTGSD+ WVQC PC +CY+Q  PI+ P SS
Sbjct: 140 IVSGASQGSGEYFSRVGIGRPPS-PVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSS 198

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  LSC++EQC  LD   C +   C Y   Y D S T G   TE +T G+++    N+
Sbjct: 199 ASFTSLSCETEQCKSLDVSECRNGT-CLYEVSYGDGSYTVGDFVTETVTLGSTS--LGNI 255

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F      L          +    SQL A+ FSYCLV   +DS  TS + 
Sbjct: 256 AIGCGHNNEGLFIGAAGLL-----GLGGGSLSFPSQLNASSFSYCLVDRDSDS--TSTLD 308

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           F   S ++   V +    +    T++++ L G+SVG       ++P   +S  +S+   G
Sbjct: 309 F--NSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGG-----AVLPIPETSFQMSEDGNG 361

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-API 305
            + +D+G   T L    YN L +     +K T       G  L   CY   S + +  P 
Sbjct: 362 GIIVDSGTAVTRLQTTVYNVLRDAF---VKSTHDLQTARGVALFDTCYDLSSKSRVEVPT 418

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           ++ HF  G ++PL   +  IP   EG FCFA  P D  + I GN  Q    +G+D  + +
Sbjct: 419 VSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSL 478

Query: 366 VSFKPTDC 373
           V F P  C
Sbjct: 479 VGFSPNKC 486


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 132/381 (34%), Positives = 189/381 (49%), Gaps = 32/381 (8%)

Query: 5   TYFYPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK 61
           T F P ++   V S  S  +GEY  +  +GTP   ++Y ++DTGSD+ W+QCLPC +CY+
Sbjct: 142 TRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAK-EMYVVLDTGSDVNWIQCLPCSECYQ 200

Query: 62  QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
           Q  PI++P SSS++K L+C   +C  LD  +C S + C Y   Y D S T G  AT+ +T
Sbjct: 201 QSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNK-CLYQVSYGDGSFTVGNYATDTVT 259

Query: 122 FGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
           FG S    D V  GCGH+N G+F                 A  + +Q+ A  FSYCLV  
Sbjct: 260 FGESGKVND-VALGCGHDNEGLFTGAAG-----LLGLGGGALSMTNQIKAKSFSYCLV-- 311

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPY 239
             DS+ +S + F N  ++  G   +  L + +  T+Y+V L G SVG   +S  S L   
Sbjct: 312 DRDSAKSSSLDF-NSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFE- 369

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLC 293
            ++SGA   G + +D G   T L    YN L +     +KLT   D + G+        C
Sbjct: 370 VDASGA---GGVILDCGTAVTRLQTQAYNSLRDAF---VKLT--TDFKKGTSPISLFDTC 421

Query: 294 YKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           Y   S++ +  P +T HF GG  + L   +  IP    G FCFA  P    + I GN  Q
Sbjct: 422 YDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQ 481

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
               I YD  + ++      C
Sbjct: 482 QGTRITYDLANNLIGLSANKC 502


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 138/371 (37%), Positives = 191/371 (51%), Gaps = 22/371 (5%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +G Y M+  +G+PP      IVDTGSDL+W+QC PC QCY Q  PIY+P++SS++ + SC
Sbjct: 1   SGAYTMEIELGSPPK-KFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSC 59

Query: 81  QSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---GNSNNFFDNVVFGC 136
            +  C  L    CSS  + C Y Y Y DSS T+G  A E +T    G S+  F N  FGC
Sbjct: 60  STSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGC 119

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G  N+G F     G+VGLG+ ++SL++Q+ S +  NKFSYCLV F  DSS TS + FG+ 
Sbjct: 120 GRLNSGSFG-GAAGIVGLGQGKISLSTQLGSAIN-NKFSYCLVDFDDDSSKTSPLIFGS- 176

Query: 197 SEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVG--NLSNSSKLIPYYNSSGA-------- 245
           S  +G G +ST ++    + TYYFV LEGISVG   LS +++ I + +            
Sbjct: 177 SASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRAL 236

Query: 246 -ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA- 303
            ++ G    D+G   TLL    Y++++    +++ L        G  LCY          
Sbjct: 237 EVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKF 296

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
           P LT  F G    P       I    E V C AM       +GI GN  Q +  + YD  
Sbjct: 297 PALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRG 356

Query: 363 SQMVSFKPTDC 373
           +  +S  P  C
Sbjct: 357 TSTISMSPAQC 367


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 131/365 (35%), Positives = 177/365 (48%), Gaps = 17/365 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY  +  +G P   D   ++DTGSD+ W+QC PC  CY+Q  PIYNPA S
Sbjct: 134 VVSGMDQGSGEYFSRIGVGAPRR-DQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALS 192

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SSYK + CQ+  C  LD   CS    C Y   Y D S T+G  ATE +T G +     NV
Sbjct: 193 SSYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAP--LQNV 250

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L+GLG   LS  SQ+  + G   FSYCLV    DS  +S + 
Sbjct: 251 AIGCGHDNEGLFVGAAG-LLGLGGGSLSFPSQLTDENG-KIFSYCLV--DRDSESSSTLQ 306

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KG 249
           FG  + V  G V++  L +    T+Y+V+L GISVG      K++   +S   I     G
Sbjct: 307 FGRAA-VPNGAVLAPMLKNSRLDTFYYVSLSGISVGG-----KMLSISDSVFGIDASGNG 360

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
            + +D+G   T L    Y+ L +  R   K  P  D       CY   S   +  P +  
Sbjct: 361 GVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVF 420

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF GG  + L   +  +P    G FCFA  P    + I GN  Q  + + +D  +  V F
Sbjct: 421 HFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGF 480

Query: 369 KPTDC 373
               C
Sbjct: 481 AVNKC 485


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 185/383 (48%), Gaps = 42/383 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP----IYN 68
           V+S + T + EY+M  ++GTPP   +  I DTGSDL+WV C                ++ 
Sbjct: 92  VESKIITRSFEYLMYVNVGTPPT-QLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQ 150

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
           P  SS+Y +LSCQS  C  L   SC +   C Y Y Y D S T GVL+TE  +F    G 
Sbjct: 151 PTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGK 210

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-----NKFSYCLV 179
                  V FGC   + G F  +  GLVGLG    SL    +SQLGA      K SYCL+
Sbjct: 211 GQVRVPRVNFGCSTASAGTFRSD--GLVGLGAGAFSL----VSQLGATTHIDRKLSYCLI 264

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
           P + D++ +S + FG+ + VS  G  ST LV  +  +YY V LE ++VG      + +  
Sbjct: 265 PSY-DANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGG-----QEVAT 318

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
           ++S        + +D+G   T L       L  ++   IKL   Q P    QLCY     
Sbjct: 319 HDS-------RIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGK 371

Query: 300 A-----GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQ 352
           +     GI P +T  F GGA V L   +TF     EG  C  + P+     V I GN AQ
Sbjct: 372 SETDNFGI-PDVTLRFGGGAAVTLRPENTF-SLLQEGTLCLVLVPVSESQPVSILGNIAQ 429

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
            +  +GYD D++ V+F   DC +
Sbjct: 430 QNFHVGYDLDARTVTFAAADCAR 452


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 136/362 (37%), Positives = 178/362 (49%), Gaps = 15/362 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY ++  IG P     Y ++DTGSD+ W+QC PC  CY+QV PI++PASS
Sbjct: 149 VTSGTSQGSGEYFLRVGIGRPSKT-FYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASS 207

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  L CQ+ QC  LD  +C +   C Y   Y D S T G  ATE ++FGNS +  D V
Sbjct: 208 SSFSRLGCQTPQCRNLDVFACRNDS-CLYQVSYGDGSYTVGDFATETVSFGNSGS-VDKV 265

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F     GL+GLG   LSL SQI     A+ FSYCLV  + DS  +S + 
Sbjct: 266 AIGCGHDNEGLF-VGAAGLIGLGGGPLSLTSQI----KASSFSYCLV--NRDSVDSSTLE 318

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           F N ++ S          SK D T+Y+V + G+SVG       + P         KG + 
Sbjct: 319 F-NSAKPSDSVTAPIFKNSKVD-TFYYVGITGMSVGG--EKLAIPPSIFEVDGSGKGGII 374

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
           +D G   T L    YN L +      K  P          CY   S   +  P +   FD
Sbjct: 375 VDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFD 434

Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           GG  +PL  ++  IP    G FC A  P    + I GN  Q    + YD  +  VSF   
Sbjct: 435 GGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSR 494

Query: 372 DC 373
            C
Sbjct: 495 KC 496


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 126/368 (34%), Positives = 182/368 (49%), Gaps = 28/368 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S  S  +GEY  +  IG PP   +Y ++DTGSD+ WVQC PC +CY+Q  P + P SS
Sbjct: 140 IVSGASQGSGEYFSRVGIGRPPS-PVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSS 198

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  LSC++EQC  LD   C +   C Y   Y D S T G   TE +T G+++    N+
Sbjct: 199 ASFTSLSCETEQCKSLDVSECRNGT-CLYEVSYGDGSYTVGDFVTETVTLGSTS--LGNI 255

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F      L          +    SQL A+ FSYCLV   +DS  TS + 
Sbjct: 256 AIGCGHNNEGLFIGAAGLL-----GLGGGSLSFPSQLNASSFSYCLVDRDSDS--TSTLD 308

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           F   S ++   V +    +    T++++ L G+SVG       ++P   +S  +S+   G
Sbjct: 309 F--NSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGG-----AVLPIPETSFQMSEDGNG 361

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-API 305
            + +D+G   T L    YN L +     +K T       G  L   CY   S + +  P 
Sbjct: 362 GIIVDSGTAVTRLQTTVYNVLRDAF---VKSTHDLQTARGVALFDTCYDLSSKSRVEVPT 418

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           ++ HF  G ++PL   +  IP   EG FCFA  P D  + I GN  Q    +G+D  + +
Sbjct: 419 VSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSL 478

Query: 366 VSFKPTDC 373
           V F P  C
Sbjct: 479 VGFSPNKC 486


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 138/376 (36%), Positives = 192/376 (51%), Gaps = 40/376 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPPL     + DTGSDL+W QC PC  QC++Q  P+YNPASS+++  L C
Sbjct: 112 GEYLMTLAIGTPPL-PYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPC 170

Query: 81  QS--EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNVVFG 135
            S    C      +          Y    +  T GV  +E  TFG+S         V FG
Sbjct: 171 NSSLSMCAGALAGAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFG 230

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           C + ++  +N    GLVGLGR  LSL    +SQLGA +FSYCL PF  D++ TS +  G 
Sbjct: 231 CSNASSSDWN-GSAGLVGLGRGSLSL----VSQLGAGRFSYCLTPFQ-DTNSTSTLLLGP 284

Query: 196 GSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---- 247
            + ++G GV ST  V+   +    TYY++ L GIS+G     +K +P   S GA S    
Sbjct: 285 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLG-----AKALPI--SPGAFSLKPD 337

Query: 248 -KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT-PYQD--PRLGSQLCYKTPSMA--- 300
             G + ID+G   T L    Y ++   V++ +  T P  D     G  LC+  P+     
Sbjct: 338 GTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAP 397

Query: 301 -GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIG 358
             + P +T HFD GA + L   S  I     GV+C AM+   DG +  FGN+ Q ++ I 
Sbjct: 398 PAVLPSMTLHFD-GADMVLPADSYMISG--SGVWCLAMRNQTDGAMSTFGNYQQQNMHIL 454

Query: 359 YDFDSQMVSFKPTDCT 374
           YD   + +SF P  C+
Sbjct: 455 YDVREETLSFAPAKCS 470


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/363 (33%), Positives = 172/363 (47%), Gaps = 21/363 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  S  +GEY  +  IG+PP   +Y +VDTGSD+ WVQC PC  CY+Q  PI+ P+ SSS
Sbjct: 146 SGASQGSGEYFSRVGIGSPPK-HVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSS 204

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y  L+C++ QC  LD   C +   C Y   Y D S T G  ATE IT   S +  +NV  
Sbjct: 205 YAPLTCETHQCKSLDVSECRNDS-CLYEVSYGDGSYTVGDFATETITLDGSAS-LNNVAI 262

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCGH+N G+F      L          +    SQ+ A+ FSYCLV   TDS+ T +    
Sbjct: 263 GCGHDNEGLFVGAAGLL-----GLGGGSLSFPSQINASSFSYCLVNRDTDSASTLEF--- 314

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
             S +    V +  L + +  T+Y++ + GI VG      +++    SS  + +   G +
Sbjct: 315 -NSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGG-----QMLSIPRSSFEVDESGNGGI 368

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            +D+G   T L  D YN L +      +  P          CY   S + +  P ++ HF
Sbjct: 369 IVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHF 428

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             G  + L   +  IP    G FCFA  P    + I GN  Q    + YD  + +V F P
Sbjct: 429 PDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSP 488

Query: 371 TDC 373
             C
Sbjct: 489 NGC 491


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 135/376 (35%), Positives = 196/376 (52%), Gaps = 36/376 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPPL     I DTGSDL+W QC PC  QC++Q  P+YNP+SS+++  L C
Sbjct: 90  GEYLMALAIGTPPL-PYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 148

Query: 81  QSEQ--CHLLDTVSCSSQQ---LCNY--TYGYADSSLTKGVLATERITFGNS---NNFFD 130
            S    C      + ++      C Y  TYG   +S+ +G   +E  TFG++   +    
Sbjct: 149 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQG---SETFTFGSTPAGHARVP 205

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            + FGC   ++G    +  GLVGLGR RLSL    +SQLG  KFSYCL P+  D++ TS 
Sbjct: 206 GIAFGCSTASSGFNASSASGLVGLGRGRLSL----VSQLGVPKFSYCLTPYQ-DTNSTST 260

Query: 191 MYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           +  G  + ++G  GV ST  V+        T+Y++ L GIS+G  + S  + P   S  A
Sbjct: 261 LLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS--IPPDAFSLNA 318

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMAGIA 303
              G + ID+G   TLL    Y ++   V + + L P  D     G  LC+  PS     
Sbjct: 319 DGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAP 377

Query: 304 PI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGY 359
           P    +T HF+G   V  +   +++     G++C AMQ   DG+V I GN+ Q ++ I Y
Sbjct: 378 PAMPSMTLHFNGADMV--LPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILY 435

Query: 360 DFDSQMVSFKPTDCTK 375
           D   + +SF P  C+ 
Sbjct: 436 DIGQETLSFAPAKCSA 451


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 192/379 (50%), Gaps = 34/379 (8%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
           +V  +   Y++ F+IGTPPL  +  ++DTGSDL+W QC  PC +C+ Q  P+Y PA S +
Sbjct: 92  SVHASTATYLVDFAIGTPPLA-LSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVT 150

Query: 75  YKELSCQSEQCHLLDTVSCSSQQL------------CNYTYGYADSSLTKGVLATERITF 122
           Y  +SC S  C  L ++  SS+              C Y Y Y D S T GVLATE  TF
Sbjct: 151 YANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF 210

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
           G      D + FGCG +N G   +N  GLVG+GR  LSL    +SQLG  KFSYC  PF+
Sbjct: 211 GAGTTVHD-LAFGCGTDNLG-GTDNSSGLVGMGRGPLSL----VSQLGVTKFSYCFTPFN 264

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIP 238
            D++ +S ++ G+ + +S     ST  V         +YY+++LEGI+VG+      + P
Sbjct: 265 -DTTTSSPLFLGSSASLS-PAAKSTPFVPSPSGPRRSSYYYLSLEGITVGD--TLLPIDP 320

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
                 A  +G + ID+G   T L +  +  L   V   + L       LG  +C+  P 
Sbjct: 321 AVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQ 380

Query: 299 MAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
             G      P L  HFD GA + L  +S  +   V GV C  +    G + + G+  Q +
Sbjct: 381 GRGPEAVDVPRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIVSARG-MSVLGSMQQQN 438

Query: 355 LFIGYDFDSQMVSFKPTDC 373
           + + YD    ++SF+P +C
Sbjct: 439 MHVRYDVGRDVLSFEPANC 457


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 135/376 (35%), Positives = 196/376 (52%), Gaps = 36/376 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPPL     I DTGSDL+W QC PC  QC++Q  P+YNP+SS+++  L C
Sbjct: 30  GEYLMALAIGTPPL-PYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 88

Query: 81  QSEQ--CHLLDTVSCSSQQ---LCNY--TYGYADSSLTKGVLATERITFGNS---NNFFD 130
            S    C      + ++      C Y  TYG   +S+ +G   +E  TFG++   +    
Sbjct: 89  NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQG---SETFTFGSTPAGHARVP 145

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            + FGC   ++G    +  GLVGLGR RLSL    +SQLG  KFSYCL P+  D++ TS 
Sbjct: 146 GIAFGCSTASSGFNASSASGLVGLGRGRLSL----VSQLGVPKFSYCLTPYQ-DTNSTST 200

Query: 191 MYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           +  G  + ++G  GV ST  V+        T+Y++ L GIS+G  + S  + P   S  A
Sbjct: 201 LLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS--IPPDAFSLNA 258

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMAGIA 303
              G + ID+G   TLL    Y ++   V + + L P  D     G  LC+  PS     
Sbjct: 259 DGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAP 317

Query: 304 PI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGY 359
           P    +T HF+G   V  +   +++     G++C AMQ   DG+V I GN+ Q ++ I Y
Sbjct: 318 PAMPSMTLHFNGADMV--LPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILY 375

Query: 360 DFDSQMVSFKPTDCTK 375
           D   + +SF P  C+ 
Sbjct: 376 DIGQETLSFAPAKCSA 391


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 136/380 (35%), Positives = 197/380 (51%), Gaps = 44/380 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPPL     I DTGSDL+W QC PC  QC++Q  P+YNP+SS+++  L C
Sbjct: 88  GEYLMALAIGTPPL-PYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 146

Query: 81  QSEQ--CHLLDTVSCSSQQ---LCNY--TYGYADSSLTKGVLATERITFGNS---NNFFD 130
            S    C      + ++      C Y  TYG   +S+ +G   +E  TFG++    +   
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQG---SETFTFGSTPAGQSRVP 203

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            + FGC   ++G    +  GLVGLGR RLSL    +SQLG  KFSYCL P+  D++ TS 
Sbjct: 204 GIAFGCSTASSGFNASSASGLVGLGRGRLSL----VSQLGVPKFSYCLTPYQ-DTNSTST 258

Query: 191 MYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIP----YYN 241
           +  G  + ++G  GV ST  V+        T+Y++ L GIS+G  + S   IP      N
Sbjct: 259 LLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS---IPPDAFLLN 315

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSM 299
           + G    G + ID+G   TLL    Y ++   V + + L P  D     G  LC+  PS 
Sbjct: 316 ADG---TGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSAATGLDLCFMLPSS 371

Query: 300 AGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDL 355
               P    +T HF+G   V  +   +++     G++C AMQ   DG+V I GN+ Q ++
Sbjct: 372 TSAPPAMPSMTLHFNGADMV--LPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNM 429

Query: 356 FIGYDFDSQMVSFKPTDCTK 375
            I YD   + +SF P  C+ 
Sbjct: 430 HILYDIGQETLSFAPAKCSA 449


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 135/368 (36%), Positives = 191/368 (51%), Gaps = 21/368 (5%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           + + ++ V++ NGEY++  S G PP      IVDTGSDL WVQCLPC  CY+ +   ++P
Sbjct: 76  DQLFETPVASGNGEYLIDISYGNPPQKST-AIVDTGSDLNWVQCLPCKSCYETLSAKFDP 134

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           + S+SYK L C S  C  L   SC++   C Y Y Y D S T G L+T+ +T G      
Sbjct: 135 SKSASYKTLGCGSNFCQDLPFQSCAAS--CQYDYMYGDGSSTSGALSTDDVTIGTGK--I 190

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            NV FGCG++N G F      LVGLG+  LSL SQ L      KFSYCLVP    S+ TS
Sbjct: 191 PNVAFGCGNSNLGTFAGAGG-LVGLGKGPLSLVSQ-LGGTATKKFSYCLVPL--GSTKTS 246

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAI 246
            +Y G+ S ++GG   +  L +    T+Y+  L+GISV       K + Y  ++    A 
Sbjct: 247 PLYIGD-STLAGGVAYTPMLTNNNYPTFYYAELQGISV-----EGKAVNYPANTFDIAAT 300

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPI 305
            +G + +D+G   T L  D +N +   ++ A+          G + C+ T  +A    P 
Sbjct: 301 GRGGLILDSGTTLTYLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPT 360

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +  HF+ GA V L   +TFI    EG  C AM    G   IFGN  Q +  I +D  ++ 
Sbjct: 361 VVFHFN-GADVALAPDNTFIALDFEGTTCLAMASSTG-FSIFGNIQQLNHVIVHDLVNKR 418

Query: 366 VSFKPTDC 373
           + FK  +C
Sbjct: 419 IGFKSANC 426


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 120/375 (32%), Positives = 192/375 (51%), Gaps = 25/375 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY+++ S+G+PP  + Y +VD+GSD+MWVQC PC++CY Q  P+++PA+S
Sbjct: 160 VVSGLDEGSGEYLVRVSVGSPPT-EQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATS 218

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL--CNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           +++  +SC S  C +L T +C   +L  C Y   YAD S TKG LA E +T G +    +
Sbjct: 219 ATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGTA--VE 276

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            VV GCGH N G+F     GL+GLG   +SL  Q+  ++G   FSYCL       S  + 
Sbjct: 277 GVVIGCGHRNRGLF-VGAAGLMGLGWGPMSLVGQLGGEVG-GAFSYCLASRGGYGSGAAD 334

Query: 191 -----MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
                +  G    V  G V    + +    ++Y+V L GI VG+     + +P       
Sbjct: 335 DDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGD-----ERLPLQAGLFQ 389

Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSM 299
           +++   G++ +DTG   T LP++ Y  L +    A+     +   + S +   CY     
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449

Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
           A +  P ++  FDG A++ L   +  +   + G++C A  P    + I GN  Q+ + I 
Sbjct: 450 ASVRVPTVSFCFDGDARLILAARNVLLEVDM-GIYCLAFAPSSSGLSIMGNTQQAGIQIT 508

Query: 359 YDFDSQMVSFKPTDC 373
            D  +  + F P +C
Sbjct: 509 VDSANGYIGFGPANC 523


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 129/366 (35%), Positives = 178/366 (48%), Gaps = 19/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY  +  IG+P    +Y ++DTGSD+ WVQC PC  CY+Q  P+++P+ S
Sbjct: 155 VVSGVGQGSGEYFSRVGIGSP-ARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 213

Query: 73  SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           +SY  +SC S++C  LDT +C ++   C Y   Y D S T G  ATE +T G+S     N
Sbjct: 214 ASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP-VGN 272

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+ LG   LS  SQI     A+ FSYCLV    DS   S +
Sbjct: 273 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SASTFSYCLV--DRDSPAASTL 325

Query: 192 YFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISK 248
            FG+G+  +  G V+  LV S    T+Y+V L GISVG   LS  +       +SG+   
Sbjct: 326 QFGDGA--AEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGS--- 380

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
           G + +D+G   T L    Y  L +         P          CY       +  P ++
Sbjct: 381 GGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVS 440

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
             F+GG  + L   +  IP    G +C A  P +  V I GN  Q    + +D     V 
Sbjct: 441 LRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVG 500

Query: 368 FKPTDC 373
           F P  C
Sbjct: 501 FTPNKC 506


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 125/376 (33%), Positives = 181/376 (48%), Gaps = 43/376 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+++ ++GTP    +   +DTGSDL+W QC PC  C+ Q  P+ +PA+SS+Y  L C +
Sbjct: 83  EYLVRLAVGTP-RRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGA 141

Query: 83  EQCHLLDTVSCSSQQL-----CNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNV 132
            +C  L   SC  + L     C Y Y Y D SLT G +AT+R TFG+S           +
Sbjct: 142 ARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRL 201

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSIT--- 188
            FGCGH N GVF  NE G+ G GR R SL     SQL    FSYC    F + SS+    
Sbjct: 202 TFGCGHLNKGVFQSNETGIAGFGRGRWSLP----SQLNVTSFSYCFTSMFESKSSLVTLG 257

Query: 189 ---SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKL-IPYYNSS 243
              + +Y    S    G V +T ++    + + YF++L+GISVG     ++L +P     
Sbjct: 258 GSPAALY----SHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGK----TRLPVPETKFR 309

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA--- 300
             I      ID+GA  T LP++ Y  ++ +    + L P         LC+  P  A   
Sbjct: 310 STI------IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWR 363

Query: 301 -GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
               P LT H + GA   L  ++         V C  +    G+  + GNF Q +  + Y
Sbjct: 364 RPAVPSLTLHLE-GADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVY 422

Query: 360 DFDSQMVSFKPTDCTK 375
           D ++  +SF P  C +
Sbjct: 423 DLENDRLSFAPARCDR 438


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 121/361 (33%), Positives = 187/361 (51%), Gaps = 36/361 (9%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+M+  +GTPP  +I   +DTGSDL+W QC+PC  CY Q  PI++P+ SS++KE  C   
Sbjct: 61  YLMRLQLGTPPF-EIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRCHGN 119

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
            C               Y   YAD S + G+LATE +T  +++           GCG NN
Sbjct: 120 SCP--------------YEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNN 165

Query: 141 TGV----FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           + +    +  +  G+VGL     SL SQ+   +     SYC       S  TSK+ FG  
Sbjct: 166 SNLMTPGYAASSSGIVGLNMGPSSLISQMDLPI-PGLISYCF-----SSQGTSKINFGTN 219

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           + V+G G V+  +  K+D+ +Y++ L+ +SVG+    +   P++        GN+FID+G
Sbjct: 220 AVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFH-----AQDGNIFIDSG 274

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
              T LP  + N + E V  ++       DP   + LCY   +M  I P++T HF GGA 
Sbjct: 275 TTYTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTME-IFPVITLHFAGGAD 333

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   + ++     G FC A+  +D  +  IFGN A ++L +GYD  + ++SF PT+C+
Sbjct: 334 LVLDKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393

Query: 375 K 375
            
Sbjct: 394 A 394


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 128/374 (34%), Positives = 184/374 (49%), Gaps = 37/374 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +  ++DTGSDL+W QC PC  C  Q  P++ PA+SSSY  + C  
Sbjct: 102 EYLIDLAIGTPPQ-PVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSG 160

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV--FGCGHNN 140
           + C+ +   SC     C Y Y Y D + T GV ATER TF +S+    +V   FGCG  N
Sbjct: 161 QLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMN 220

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN----- 195
            G  N N  G+VG GR  LSL    +SQL   +FSYCL P+   S+  S + FG+     
Sbjct: 221 VGSLN-NGSGIVGFGRDPLSL----VSQLSIRRFSYCLTPY--TSTRKSTLMFGSLSDGV 273

Query: 196 --GSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
             G + + G V +T L+ S+++ T+Y+V   G++VG       L  +         G + 
Sbjct: 274 FEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDG--SGGVI 331

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPYQDPRLGSQLCYKTPSMAGI-------- 302
           +D+G   TL P      +    R  ++L  T    P  G  +C+ TP  AG         
Sbjct: 332 VDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDG--VCFATPMAAGGRRASAATV 389

Query: 303 --APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGY 359
              P +  HF  GA + L   +  +  P  G  C  +    GD G   GNF Q D+ + Y
Sbjct: 390 VSVPRMAFHFQ-GADLELPRRNYVLDDPRRGSLCILLAD-SGDSGATIGNFVQQDMRVLY 447

Query: 360 DFDSQMVSFKPTDC 373
           D +++ +SF P  C
Sbjct: 448 DLEAETLSFAPAQC 461


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 124/379 (32%), Positives = 178/379 (46%), Gaps = 43/379 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  ++GTPP   +   +DTGSDL+W QC PC  C+ Q  P+ +PA+SS+Y  L C +
Sbjct: 91  EYLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGA 149

Query: 83  EQCHLLDTVSC---------SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-- 131
            +C  L   SC         +  + C Y Y Y D S+T G +AT+R TFG  N   D+  
Sbjct: 150 PRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRL 209

Query: 132 ----VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSS 186
               + FGCGH N GVF  NE G+ G GR R SL     SQL    FSYC    F + SS
Sbjct: 210 PTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLP----SQLNVTTFSYCFTSMFESKSS 265

Query: 187 I-------TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
           +        + + + + + +SG    +  L +    + YF++L+GISVG    +   +P 
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVG---KTRLAVPE 322

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
                 I      ID+GA  T LP+  Y  ++ +    + L P       +  LC+  P 
Sbjct: 323 AKLRSTI------IDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPV 376

Query: 299 MAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
            A       P LT H D GA   L   +         V C  +    GD  + GNF Q +
Sbjct: 377 TALWRRPPVPSLTLHLD-GADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQN 435

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             + YD ++  +SF P  C
Sbjct: 436 THVVYDLENDWLSFAPARC 454


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 124/363 (34%), Positives = 189/363 (52%), Gaps = 23/363 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+M+ +IG PP+     + DTGSDL W QC PC  C+ Q  P+Y+P++SS++  L C S
Sbjct: 70  EYLMELAIGKPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSS 128

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCGHNN 140
             C  + + +C+   LC Y Y Y D + + G+L TE +T G S+       V FGCG +N
Sbjct: 129 ATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDN 188

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G  + N  G VGLGR  LSL    L+QLG  KFSYCL  F  +S++ S    G  +E++
Sbjct: 189 GGD-SLNSTGTVGLGRGTLSL----LAQLGVGKFSYCLTDFF-NSALDSPFLLGTLAELA 242

Query: 201 GG--GVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KGNMFID 254
            G   V ST L+ S ++ + YFV+L+GIS+G++      +P  N +  +     G M +D
Sbjct: 243 PGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVR-----LPIPNGTFDLRGDGTGGMIVD 297

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-MAGIAPILTAHFDGG 313
           +G   T+L +  +  +  +V   +   P     L +  C+  P+      P L  HF GG
Sbjct: 298 SGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CFPAPAGEPPYMPDLVLHFAGG 356

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           A + L   +       +  FC  +     +   + GNF Q ++ + +D     +SF PTD
Sbjct: 357 ADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTD 416

Query: 373 CTK 375
           C+K
Sbjct: 417 CSK 419


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 127/368 (34%), Positives = 193/368 (52%), Gaps = 28/368 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+M+ +IGTPP+     + DTGSDL W QC PC  C+ Q  P+Y+P++SS++  + C S
Sbjct: 76  EYLMELAIGTPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 134

Query: 83  EQC-HLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNS----NNFFDNVVFGC 136
             C  +L + +CS+   LC Y Y Y+D + + G+L TE +T G+S         +V FGC
Sbjct: 135 ATCLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGC 194

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G +N G  + N  G VGLGR  LSL    L+QLG  KFSYCL  F  +S++ S    G  
Sbjct: 195 GTDNGGD-SLNSTGTVGLGRGTLSL----LAQLGVGKFSYCLTDFF-NSTLDSPFLLGTL 248

Query: 197 SEVS--GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAISKGN 250
           +E++   G V ST L+ S  + + Y V+L+GI++G++      +P  N +    A S G 
Sbjct: 249 AELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVR-----LPIPNKTFDLHANSTGG 303

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS---MAGIAPILT 307
           M +D+G   ++LP+  +  + + V   +   P     L S  C+  P+        P L 
Sbjct: 304 MVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSP-CFPAPAGERQLPFMPDLV 362

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF GGA + L   +       +  FC  +        + GNF Q ++ + +D     +S
Sbjct: 363 LHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLS 422

Query: 368 FKPTDCTK 375
           F PTDC+K
Sbjct: 423 FLPTDCSK 430


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 142/393 (36%), Positives = 193/393 (49%), Gaps = 48/393 (12%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ--VKPIYNPA 70
           VQ+ +    G Y M  S+GTPPL D   IVDTGS+L+W QC PC +C+ +    P+  PA
Sbjct: 80  VQAQLENGAGAYNMNISLGTPPL-DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPA 138

Query: 71  SSSSYKELSCQSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
            SS++  L C    C  L T S    C++   C Y Y Y  S  T G LATE +T G+  
Sbjct: 139 RSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYG-SGYTAGYLATETLTVGDGT 197

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F  V FGC   N GV  +N  G+VGLGR  LSL    +SQL   +FSYCL     D  
Sbjct: 198 --FPKVAFGCSTEN-GV--DNSSGIVGLGRGPLSL----VSQLAVGRFSYCLRSDMADGG 248

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
             S + FG+ ++++ G VV ++ + K    +  T+Y+V L GI+V      S  +P   S
Sbjct: 249 -ASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAV-----DSTELPVTGS 302

Query: 243 SGAISK----GNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCY 294
           +   ++    G   +D+G   T L KD Y  +++    Q+ N  + TP         LCY
Sbjct: 303 TFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY 362

Query: 295 KTPSMAG-----IAPILTAHFDGGAK--VPLIHTSTFIPPPVEG---VFCFAMQPIDGD- 343
           K PS  G       P L   F GGAK  VP+ +    +    +G   V C  + P   D 
Sbjct: 363 K-PSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421

Query: 344 -VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            + I GN  Q D+ + YD D  M SF P DC K
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 133/370 (35%), Positives = 183/370 (49%), Gaps = 27/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY  +  IG+P   ++Y ++DTGSD+ WVQC PC  CY+Q  P+++P+ S
Sbjct: 158 VVSGVGQGSGEYFSRVGIGSP-ARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 216

Query: 73  SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           +SY  +SC S +C  LDT +C ++   C Y   Y D S T G  ATE +T G+S     N
Sbjct: 217 ASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP-VTN 275

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+ LG   LS  SQI     A+ FSYCLV    DS   S +
Sbjct: 276 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SASTFSYCLV--DRDSPAASTL 328

Query: 192 YFG-NGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAIS 247
            FG +G+E      V+  LV S    T+Y+V L GISVG   LS  S       +SG+  
Sbjct: 329 QFGADGAEAD---TVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGS-- 383

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-A 303
            G + +D+G   T L    Y  L +     ++ TP      G  L   CY       +  
Sbjct: 384 -GGVIVDSGTAVTRLQSSAYAALRDAF---VRGTPSLPRTSGVSLFDTCYDLSDRTSVEV 439

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P ++  F+GG  + L   +  IP    G +C A  P +  V I GN  Q    + +D   
Sbjct: 440 PAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 499

Query: 364 QMVSFKPTDC 373
            +V F P  C
Sbjct: 500 GVVGFTPNKC 509


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 119/378 (31%), Positives = 174/378 (46%), Gaps = 17/378 (4%)

Query: 3   PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
           P  +F   + V S +   +GEY ++  IG+PP  + Y +VD+GSD++WVQC PC++CY Q
Sbjct: 104 PTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPT-EQYLVVDSGSDVIWVQCKPCLECYAQ 162

Query: 63  VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
             P+++PASS+++  +SC S  C  L T  C     C Y   Y D S TKG LA E +T 
Sbjct: 163 ADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL 222

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
           G +    + V  GCGH N G+F     GL+GLG   +SL  Q L       FSYCL    
Sbjct: 223 GGTA--VEGVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLASRG 278

Query: 183 TDSS----ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKL 236
              S        +  G    V  G V    + + +  ++Y+V + GI VG+  L     L
Sbjct: 279 GSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGL 338

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT 296
                  G    G + +DTG   T LP++ Y  L +    A+   P          CY  
Sbjct: 339 FQLTEDGG----GGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDL 394

Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
                +  P ++ +FDG A + L   +  +     G++C A  P    + I GN  Q  +
Sbjct: 395 SGYTSVRVPTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQEGI 453

Query: 356 FIGYDFDSQMVSFKPTDC 373
            I  D  +  + F P  C
Sbjct: 454 QITVDSANGYIGFGPATC 471


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 131/366 (35%), Positives = 185/366 (50%), Gaps = 23/366 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY  +  +G P     Y ++DTGSD+ W+QC PC  CY+Q  PI+ PA+S
Sbjct: 148 VSSGTSQGSGEYFTRVGVGNPAK-SYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAAS 206

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SSY  L+C S+QC+ L   SC + Q C Y   Y D S T G   TE ++FG S    +++
Sbjct: 207 SSYSPLTCDSQQCNSLQMSSCRNGQ-CRYQVNYGDGSFTFGDFVTETMSFGGSGT-VNSI 264

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F     GL+GLG   LSL     SQL A  FSYCLV  + DS+ +S + 
Sbjct: 265 ALGCGHDNEGLF-VGAAGLLGLGGGPLSLT----SQLKATSFSYCLV--NRDSAASSTLD 317

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
           F N + V G  V++  L S +  T+Y+V L G+SV G L    + +   + SG    G +
Sbjct: 318 F-NSAPV-GDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSG---DGGV 372

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-APILT 307
            +D G   T L  + YN L +     + ++ +     G  L   CY     + +  P ++
Sbjct: 373 IVDCGTAITRLQSEAYNSLRDSF---VSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVS 429

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HFDGG    L   +  IP    G +CFA  P    + I GN  Q    + +D  +  V 
Sbjct: 430 FHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVG 489

Query: 368 FKPTDC 373
           F    C
Sbjct: 490 FSTNKC 495


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 128/370 (34%), Positives = 191/370 (51%), Gaps = 30/370 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY  +  +G P     Y ++DTGSD+ W+QC PC  CY+Q  PI++P +S
Sbjct: 150 VTSGTSQGSGEYFTRVGVGNPAR-QFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTAS 208

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  ++CQS+QC  L+  SC S Q C Y   Y D S T G  ATE ++FGNS +   NV
Sbjct: 209 STYAPVTCQSQQCSSLEMSSCRSGQ-CLYQVNYGDGSYTFGDFATESVSFGNSGS-VKNV 266

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F     GL+GLG   LSL     +QL A  FSYCLV  + DS+ +S + 
Sbjct: 267 ALGCGHDNEGLF-VGAAGLLGLGGGPLSLT----NQLKATSFSYCLV--NRDSAGSSTLD 319

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           F N +++    V +  + +++  T+Y+V L G+SVG      +++    S+  + +   G
Sbjct: 320 F-NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGG-----QMVSIPESTFRLDESGNG 373

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYKTPSMAGI-A 303
            + +D G   T L    YN L +     +++T  Q+ +L S +     CY     A +  
Sbjct: 374 GIIVDCGTAITRLQTQAYNPLRDAF---VRMT--QNLKLTSAVALFDTCYDLSGQASVRV 428

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P ++ HF  G    L   +  IP    G +CFA  P    + I GN  Q    + +D  +
Sbjct: 429 PTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLAN 488

Query: 364 QMVSFKPTDC 373
             + F P  C
Sbjct: 489 NRMGFSPNKC 498


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 181/366 (49%), Gaps = 17/366 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S ++  +GEY  +  +GTPP    Y ++DTGSD+MW+QCLPC +CY Q  P++NPA+S
Sbjct: 142 IISGLAQGSGEYFTRLGVGTPPRY-TYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAAS 200

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y+++ C +  C  LD   C +++ C Y   Y D S T G  +TE +TF         V
Sbjct: 201 STYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTF--RGQVIRRV 258

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L+GLGR  LS  SQ  +Q  + +FSYCLV   + S   S + 
Sbjct: 259 ALGCGHDNEGLFIGAAG-LLGLGRGSLSFPSQTGAQF-SKRFSYCLVD-RSASGTASSLI 315

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  + +    + +  L + +  T+Y+V L GISVG    +S     +    A   G + 
Sbjct: 316 FGKAA-IPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMD-ATGNGGVI 373

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILT 307
           ID+G   T L    Y+ +    R+A ++        G       CY    +  +  P L 
Sbjct: 374 IDSGTSVTRLVDSAYSTM----RDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLV 429

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF GGA + L  T+  IP      FCFA     G + I GN  Q    + +D  +  V 
Sbjct: 430 FHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVG 489

Query: 368 FKPTDC 373
           FK   C
Sbjct: 490 FKAGSC 495


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 128/370 (34%), Positives = 191/370 (51%), Gaps = 30/370 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY  +  +G P     Y ++DTGSD+ W+QC PC  CY+Q  PI++P +S
Sbjct: 9   VTSGTSQGSGEYFTRVGVGNPAR-QFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTAS 67

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  ++CQS+QC  L+  SC S Q C Y   Y D S T G  ATE ++FGNS +   NV
Sbjct: 68  STYAPVTCQSQQCSSLEMSSCRSGQ-CLYQVNYGDGSYTFGDFATESVSFGNSGS-VKNV 125

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F     GL+GLG   LSL     +QL A  FSYCLV  + DS+ +S + 
Sbjct: 126 ALGCGHDNEGLF-VGAAGLLGLGGGPLSLT----NQLKATSFSYCLV--NRDSAGSSTLD 178

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           F N +++    V +  + +++  T+Y+V L G+SVG      +++    S+  + +   G
Sbjct: 179 F-NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGG-----QMVSIPESTFRLDESGNG 232

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYKTPSMAGI-A 303
            + +D G   T L    YN L +     +++T  Q+ +L S +     CY     A +  
Sbjct: 233 GIIVDCGTAITRLQTQAYNPLRDAF---VRMT--QNLKLTSAVALFDTCYDLSGQASVRV 287

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P ++ HF  G    L   +  IP    G +CFA  P    + I GN  Q    + +D  +
Sbjct: 288 PTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLAN 347

Query: 364 QMVSFKPTDC 373
             + F P  C
Sbjct: 348 NRMGFSPNKC 357


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 134/377 (35%), Positives = 193/377 (51%), Gaps = 38/377 (10%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYK 76
           +T  GE++M  +IGTPPL     I DTGSDL+W QC PC  QC++Q  P+YNP+SS+++ 
Sbjct: 79  TTVPGEFLMTLAIGTPPL-PFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFS 137

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF----FDNV 132
            L C S         +C    + N TYG   + + +G   TE  TFG+S          +
Sbjct: 138 ALPCNSSLGLCAPACAC----MYNMTYGSGWTYVFQG---TETFTFGSSTPADQVRVPGI 190

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC + ++G    +  GLVGLGR  LSL    +SQLGA KFSYCL P+  D++ TS + 
Sbjct: 191 AFGCSNASSGFNASSASGLVGLGRGSLSL----VSQLGAPKFSYCLTPYQ-DTNSTSTLL 245

Query: 193 FGNGSEVSGGGVV-STSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            G  + ++  GVV ST  V+     YY++ L GIS+G  + +  + P   S  A   G +
Sbjct: 246 LGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLG--TTALPIPPNAFSLKADGTGGL 303

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMAGI---APIL 306
            ID+G   T+L    Y ++   V + + L P  D     G  LC++ PS        P +
Sbjct: 304 IIDSGTTITMLGNTAYQQVRAAVLSLVTL-PTTDGSAATGLDLCFELPSSTSAPPSMPSM 362

Query: 307 TAHFDGGAKVPLIHTSTFI-----PPPVEGVFCFAMQ-PIDGD---VGIFGNFAQSDLFI 357
           T HFDG   V  +    ++     P     ++C AMQ   D D   V I GN+ Q ++ I
Sbjct: 363 TLHFDGADMV--LPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHI 420

Query: 358 GYDFDSQMVSFKPTDCT 374
            YD   + +SF P  C+
Sbjct: 421 LYDVGKETLSFAPAKCS 437


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 131/378 (34%), Positives = 192/378 (50%), Gaps = 39/378 (10%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           ++  +GEY  K  +GTP    +  ++DTGSD++W+QC PC +CY+Q   +++P  S SY 
Sbjct: 133 LAQGSGEYFTKIGVGTPATPALM-VLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYN 191

Query: 77  ELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + C +  C  LD+  C   +  C Y   Y D S+T G  ATE +TF         V  G
Sbjct: 192 AVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGAR-VARVALG 250

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK---MY 192
           CGH+N G+F      L+GLGR  LS  +QI  + G   FSYCLV   + ++  S+   + 
Sbjct: 251 CGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYG-RSFSYCLVDRTSSANTASRSSTVT 308

Query: 193 FGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVG-----NLSNSS-KLIPYYNS 242
           FG+G+    G  V++S      +   +T+Y+V L GISVG      ++NS  +L P   S
Sbjct: 309 FGSGAV---GSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDP---S 362

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKT 296
           SG   +G + +D+G   T L +  Y+ L +  R A   ++L+P      G  L   CY  
Sbjct: 363 SG---RGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPG-----GFSLFDTCYDL 414

Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
                +  P ++ HF GGA+  L   +  IP   +G FCFA    DG V I GN  Q   
Sbjct: 415 SGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGF 474

Query: 356 FIGYDFDSQMVSFKPTDC 373
            + +D D Q V+F P  C
Sbjct: 475 RVVFDGDGQRVAFTPKGC 492


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 189/365 (51%), Gaps = 25/365 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY+M+ +IGTPP+     + DTGSDL W QC PC  C+ Q  P+Y+P++SS++  + C S
Sbjct: 65  EYLMELAIGTPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 123

Query: 83  EQC-HLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNS----NNFFDNVVFGC 136
             C     + +CS+    C Y Y Y+D + + G+L TE +T G+S         +V FGC
Sbjct: 124 ATCLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGC 183

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G +N G  + N  G VGLGR  LSL    L+QLG  KFSYCL  F  +S++ S  + G  
Sbjct: 184 GTDNGGD-SLNSTGTVGLGRGTLSL----LAQLGVGKFSYCLTDFF-NSTMDSPFFLGTL 237

Query: 197 SEVS-GGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAISKGN 250
           +E++ G G V ++  L S  + + YFV L+GIS+G++      +P  N +    A   G 
Sbjct: 238 AELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVR-----LPIPNGTFDLRADGNGG 292

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
           M +D+G   T+L K  +  + ++V   +   P     L S  C+ +P      P L  HF
Sbjct: 293 MMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-CFPSPDGEPFMPDLVLHF 351

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GGA + L   +       +  FC  +          GNF Q ++ + +D     +SF P
Sbjct: 352 AGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLP 411

Query: 371 TDCTK 375
           TDC+K
Sbjct: 412 TDCSK 416


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 134/383 (34%), Positives = 193/383 (50%), Gaps = 39/383 (10%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSS 73
           + +S   GEY+M  +IGTPP+     I DTGSDL+W QC PC  QC++Q  P+YNP+SS+
Sbjct: 77  TQISPTAGEYLMTLAIGTPPV-SYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSST 135

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQ-----LCNYTYGYADSSLTKGVLATERITFGNSN-- 126
           ++  L C S        ++ ++       + N TYG   +S+ +G   +E  TFG+S   
Sbjct: 136 TFAVLPCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSGWTSVYQG---SETFTFGSSTPA 192

Query: 127 --NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
                  + FGC + + G    +  GLVGLGR  LSL    +SQLG  KFSYCL P+  D
Sbjct: 193 NQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSL----VSQLGVPKFSYCLTPYQ-D 247

Query: 185 SSITSKMYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIPY 239
           ++ TS +  G  + ++  GGV ST  V+        TYY++ L GIS+G  + S   IP 
Sbjct: 248 TNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALS---IPT 304

Query: 240 YN-SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD---PRLGSQLCYK 295
              S  A   G   ID+G   TLL    Y ++   V + + L P  D      G  LC++
Sbjct: 305 TALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATGLDLCFE 363

Query: 296 TPSMAGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFA 351
            PS     P    +T HFDG   V    +   +      ++C AMQ   DG V I GN+ 
Sbjct: 364 LPSSTSAPPTMPSMTLHFDGADMVLPADSYMMLD---SNLWCLAMQNQTDGGVSILGNYQ 420

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
           Q ++ I YD   + ++F P  C+
Sbjct: 421 QQNMHILYDVGQETLTFAPAKCS 443


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 136/384 (35%), Positives = 190/384 (49%), Gaps = 35/384 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q  V   NGE++M  S+GTP L     IVDTGSDL+W QC PCV+C+ Q  P+++PA+S
Sbjct: 105 LQVPVHAGNGEFLMDLSVGTPAL-PYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAAS 163

Query: 73  SSYKELSCQSEQCHLL-------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
           S+Y  L C S  C  L        + S S+   C YTY Y D+S T+GVLATE  TF  +
Sbjct: 164 STYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATE--TFTLA 221

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
                 V FGCG  N G       GLVGLGR  LSL    +SQLG ++FSYCL     D+
Sbjct: 222 RQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGIDRFSYCLTSLD-DA 276

Query: 186 SITSKMYFGNGSEVSGGGVV----STSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYY 240
           +  S +  G+ + +S         +T LV    + ++Y+V+L G++VG     S  +   
Sbjct: 277 AGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVG-----STRLALP 331

Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
           +S+ AI     G + +D+G   T L    Y  L +     + L       +G  LC++ P
Sbjct: 332 SSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGP 391

Query: 298 SMA------GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFA 351
           + A         P L  HFDGGA + L   +  +     G  C  +    G + I GNF 
Sbjct: 392 AGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRG-LSIIGNFQ 450

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
           Q +    YD     +SF P +C K
Sbjct: 451 QQNFQFVYDVAGDTLSFAPAECNK 474


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 127/373 (34%), Positives = 183/373 (49%), Gaps = 21/373 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  K  +GTP    +  ++DTGSD++W+QC PC +CY Q   +++P  S
Sbjct: 131 VVSGLAQGSGEYFTKIGVGTPATPALM-VLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRS 189

Query: 73  SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            SY  + C +  C  LD+  C   ++ C Y   Y D S+T G  ATE +TF         
Sbjct: 190 RSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGAR-VAR 248

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD---SSIT 188
           +  GCGH+N G+F      L+GLGR  LS  +QI  + G   FSYCLV   +    +S +
Sbjct: 249 IALGCGHDNEGLFVAAAG-LLGLGRGSLSFPAQISRRYG-RSFSYCLVDRTSSANPASHS 306

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           S + FG+G+  S      T +V     +T+Y+V L GISVG    S           +  
Sbjct: 307 STVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSG 366

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAG 301
           +G + +D+G   T L +  Y+ L +  R A   ++L+P      G  L   CY       
Sbjct: 367 RGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPG-----GFSLFDTCYDLSGRKV 421

Query: 302 I-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
           +  P ++ HF GGA+  L   +  IP   +G FCFA    DG V I GN  Q    + +D
Sbjct: 422 VKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 481

Query: 361 FDSQMVSFKPTDC 373
            D Q V F P  C
Sbjct: 482 GDGQRVGFVPKGC 494


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 193/381 (50%), Gaps = 38/381 (9%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           + + +   EY+M+ +IGTPP+     + DTGSDL W QC PC  C+ Q  PIY+ A SSS
Sbjct: 84  ARLRSGQAEYLMELAIGTPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSS 142

Query: 75  YKELSCQSEQC-HLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNFFDN 131
           +  + C S  C  +  + +C +S   C Y Y Y D + + GVL TE +TF G        
Sbjct: 143 FSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGG 202

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           + FGCG +N G+ + N  G VGLGR  LSL    ++QLG  KFSYCL  F  ++S+ S +
Sbjct: 203 IAFGCGVDNGGL-SYNSTGTVGLGRGSLSL----VAQLGVGKFSYCLTDFF-NTSLGSPV 256

Query: 192 YFGNGSEVS----GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            FG  +E++    G  V ST LV S    T+Y+V+LEGIS+G+       +P  N +  +
Sbjct: 257 LFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGD-----ARLPIPNGTFDL 311

Query: 247 S---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-------KT 296
                G M +D+G   T L +  +  + + V   ++        L S  C+       + 
Sbjct: 312 RDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPVVNASSLDSP-CFPAATGEQQL 370

Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF--AMQPIDGDVGIFGNFAQSD 354
           P+M    P +  HF GGA + L   +       E  FC   A  P   DV I GNF Q +
Sbjct: 371 PAM----PDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSP-SADVSILGNFQQQN 425

Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
           + + +D     +SF PTDC K
Sbjct: 426 IQMLFDITVGQLSFMPTDCGK 446


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 141/393 (35%), Positives = 192/393 (48%), Gaps = 48/393 (12%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ--VKPIYNPA 70
           VQ+ +    G Y M  S+GTPPL D   IVDTGS+L+W QC PC +C+ +    P+  PA
Sbjct: 80  VQAQLENGAGAYNMNISLGTPPL-DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPA 138

Query: 71  SSSSYKELSCQSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
            SS++  L C    C  L T S    C++   C Y Y Y  S  T G LATE +T G+  
Sbjct: 139 RSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYG-SGYTAGYLATETLTVGDGT 197

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             F  V FGC   N GV  +N  G+VGLGR  LSL    +SQL   +FSYCL     D  
Sbjct: 198 --FPKVAFGCSTEN-GV--DNSSGIVGLGRGPLSL----VSQLAVGRFSYCLRSDMADGG 248

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
             S + FG+ ++++   VV ++ + K    +  T+Y+V L GI+V      S  +P   S
Sbjct: 249 -ASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAV-----DSTELPVTGS 302

Query: 243 SGAISK----GNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCY 294
           +   ++    G   +D+G   T L KD Y  +++    Q+ N  + TP         LCY
Sbjct: 303 TFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY 362

Query: 295 KTPSMAG-----IAPILTAHFDGGAK--VPLIHTSTFIPPPVEG---VFCFAMQPIDGD- 343
           K PS  G       P L   F GGAK  VP+ +    +    +G   V C  + P   D 
Sbjct: 363 K-PSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421

Query: 344 -VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            + I GN  Q D+ + YD D  M SF P DC K
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  181 bits (459), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 182/382 (47%), Gaps = 35/382 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  K  +GTP    +  ++DTGSD++WVQC PC +CY+Q  P+++P  S
Sbjct: 118 VVSGLAQGSGEYFTKIGVGTPATQALM-VLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRS 176

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSY  + C +  C  LD+  C  ++  C Y   Y D S+T G   TE +TF         
Sbjct: 177 SSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGAR-VAR 235

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD------- 184
           V  GCGH+N G+F      L+GLGR  LS  +QI  + G   FSYCLV   +        
Sbjct: 236 VALGCGHDNEGLFVAAAG-LLGLGRGGLSFPTQISRRYG-RSFSYCLVDRTSSGAGAAPG 293

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
           S  +S + FG GS  +     +  + +   +T+Y+V L GISVG                
Sbjct: 294 SHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDP 353

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA----IKLTP---------YQDPRLGSQ 291
           +  +G + +D+G   T L +  Y+ L +  R A    ++L+P         Y    LG +
Sbjct: 354 STGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYD---LGGR 410

Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFA 351
              K P+       ++ HF GGA+  L   +  IP    G FCFA    DG V I GN  
Sbjct: 411 RVVKVPT-------VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQ 463

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
           Q    + +D D Q V F P  C
Sbjct: 464 QQGFRVVFDGDGQRVGFAPKGC 485


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 133/364 (36%), Positives = 188/364 (51%), Gaps = 30/364 (8%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +GEY+M+FS+GTP + +   I DTGSDL W+QC PC  CY Q  P+++P  SS+Y ++ C
Sbjct: 85  HGEYLMRFSLGTPSV-ERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPC 143

Query: 81  QSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF-----GNSNNFFDNVV 133
           +S+ C L   +   C S + C Y + Y   S T G L  + I+F     G     F   V
Sbjct: 144 ESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV 203

Query: 134 FGCG--HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           FGC    N T   +    G VGLG   LSLASQ+  Q+G +KFSYC+VPF + S  T K+
Sbjct: 204 FGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIG-HKFSYCMVPFSSTS--TGKL 260

Query: 192 YFGNGSEVSGGGVVSTS-LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
            F  GS      VVST  +++    +YY + LEGI+VG      K++     +G I  GN
Sbjct: 261 KF--GSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQ----KKVL-----TGQIG-GN 308

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
           + ID+    T L +  Y      V+ AI +   +D     + C + P+     P    HF
Sbjct: 309 IIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLNF-PEFVFHF 367

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             GA V L   + FI      + C  + P  G + IFGN+AQ +  + YD   + VSF P
Sbjct: 368 T-GADVVLGPKNMFIALD-NNLVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEKKVSFAP 424

Query: 371 TDCT 374
           T+C+
Sbjct: 425 TNCS 428


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 129/373 (34%), Positives = 183/373 (49%), Gaps = 37/373 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PCV C+ Q  P ++ + SS+   L C+S
Sbjct: 34  EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCES 92

Query: 83  EQCHLLDTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
            QC L  TV+       + Q C Y   Y D+S+T G+LA ++ TF  +      V FGCG
Sbjct: 93  TQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTF-VAGTSLPGVTFGCG 151

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKM 191
            NNTGVFN NE G+ G GR  LSL     SQL    FS+C       +P      + + +
Sbjct: 152 LNNTGVFNSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL 207

Query: 192 YFGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
            F NG     G V +T L+    ++ + T Y+++L+GI+VG     S  +P   S+ A++
Sbjct: 208 -FSNGQ----GAVQTTPLIQYAKNEANPTLYYLSLKGITVG-----STRLPVPESAFALT 257

Query: 248 KGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-AP 304
            G     ID+G   T LP   Y  + ++    IKL        G   C+  PS A    P
Sbjct: 258 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVP 317

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            L  HF+G           F  P   G  + C A+   D +  I GNF Q ++ + YD  
Sbjct: 318 KLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHVLYDLQ 376

Query: 363 SQMVSFKPTDCTK 375
           + M+SF    C K
Sbjct: 377 NNMLSFVAAQCDK 389


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 134/367 (36%), Positives = 184/367 (50%), Gaps = 25/367 (6%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           VS  +GEYV++ S+GTPP      IVDTGSDL WVQC PC +C++Q  P++ P +SSSY 
Sbjct: 1   VSAGSGEYVLQISLGTPPQ-QFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYS 59

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
             SC    C  L   +CS +  C Y+Y Y D S T+G  A E +T   S      + FGC
Sbjct: 60  NASCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST--LARIGFGC 117

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           GHN  G F   + GL+GLG+  LSL SQ+ S    + FSYCLV   T  +  S + FGN 
Sbjct: 118 GHNQEGTFAGAD-GLIGLGQGPLSLPSQLNSSF-THIFSYCLVDQSTTGTF-SPITFGNA 174

Query: 197 SEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMF 252
           +E S      T L+  ED  +YY+V +E ISVGN     + +P   S+  I     G + 
Sbjct: 175 AENSRASF--TPLLQNEDNPSYYYVGVESISVGN-----RRVPTPPSAFRIDANGVGGVI 227

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR-LGSQLCYKTPSMAGIA---PILTA 308
           +D+G   T      +  +  ++R  I   P  DP   G  LCY   S++  +   P +T 
Sbjct: 228 LDSGTTITYWRLAAFIPILAELRRQISY-PEADPTPYGLNLCYDISSVSASSLTLPSMTV 286

Query: 309 HFDG-GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
           H      ++P+ +    +    E V C AM   D    I GN  Q +  I  D  +  V 
Sbjct: 287 HLTNVDFEIPVSNLWVLVDNFGETV-CTAMSTSD-QFSIIGNVQQQNNLIVTDVANSRVG 344

Query: 368 FKPTDCT 374
           F  TDC+
Sbjct: 345 FLATDCS 351


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 183/374 (48%), Gaps = 22/374 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  K  +GTP +     ++DTGSD++W+QC PC +CY Q   +++P +S
Sbjct: 136 VVSGLAQGSGEYFTKIGVGTP-VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194

Query: 73  SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            SY  + C +  C  LD+  C   ++ C Y   Y D S+T G  ATE +TF  S      
Sbjct: 195 HSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-SGARVPR 253

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSI 187
           V  GCGH+N G+F      L+GLGR  LS  SQI  + G   FSYCLV       + +S 
Sbjct: 254 VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFG-RSFSYCLVDRTSSSASATSR 311

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           +S + FG+G+         T +V     +T+Y+V L GISVG        +       + 
Sbjct: 312 SSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST 371

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMA 300
            +G + +D+G   T L +  Y  L +  R A   ++L+P      G  L   CY    + 
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPG-----GFSLFDTCYDLSGLK 426

Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            +  P ++ HF GGA+  L   +  IP    G FCFA    DG V I GN  Q    + +
Sbjct: 427 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 486

Query: 360 DFDSQMVSFKPTDC 373
           D D Q + F P  C
Sbjct: 487 DGDGQRLGFVPKGC 500


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 136/365 (37%), Positives = 180/365 (49%), Gaps = 29/365 (7%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V++ NGEY++  S G+PP      IVDTGSDL+W QCLPC  C      I++P  SS+Y 
Sbjct: 73  VASGNGEYLIDISFGSPPQ-KASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYD 131

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            +SC S  C  L   SC++   C Y Y Y D S T G L+TE +T         NV FGC
Sbjct: 132 TVSCASNFCSSLPFQSCTTS--CKYDYMYGDGSSTSGALSTETVT--VGTGTIPNVAFGC 187

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           GH N G F     G+VGLG+  LSL SQ  S + + KFSYCLVP    S+ TS M  G+ 
Sbjct: 188 GHTNLGSF-AGAAGIVGLGQGPLSLISQA-SSITSKKFSYCLVPL--GSTKTSPMLIGD- 242

Query: 197 SEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPY---YNSSGAISKGNMF 252
              + GGV  T+L++   + T+Y+  L GISV     S K + Y     S  A  +G   
Sbjct: 243 -SAAAGGVAYTALLTNTANPTFYYADLTGISV-----SGKAVTYPVGTFSIDASGQGGFI 296

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTA 308
           +D+G   T L    +N L   ++  +          G   C+ T   AG+A    P +T 
Sbjct: 297 LDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFST---AGVANPTYPTMTF 353

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  GA   L   + F+     G  C AM    G   I GN  Q +  I +D  +Q V F
Sbjct: 354 HFK-GADYELPPENVFVALDTGGSICLAMAASTG-FSIMGNIQQQNHLIVHDLVNQRVGF 411

Query: 369 KPTDC 373
           K  +C
Sbjct: 412 KEANC 416


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 115/365 (31%), Positives = 173/365 (47%), Gaps = 20/365 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  IG+PP  + Y +VD+GSD++WVQC PC++CY Q  P+++PA+S
Sbjct: 116 VVSGLDEGSGEYFVRVGIGSPPT-EQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATS 174

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +++  + C S  C  L T  C     C+Y   Y D S TKG LA E +T G +    + V
Sbjct: 175 ATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTA--VEGV 232

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F     GL+GLG   +SL  Q L       FSYCL      S     + 
Sbjct: 233 AIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLA-----SRGAGSLV 285

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
            G    V  G V    + + +  ++Y+V L GI VG+     + +P       +++   G
Sbjct: 286 LGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGD-----ERLPLQEDLFQLTEDGAG 340

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
            + +DTG   T LP++ Y  L +    A+   P          CY       +  P ++ 
Sbjct: 341 GVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSF 400

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           +FDG A + L   +  +     G++C A  P      I GN  Q  + I  D  +  + F
Sbjct: 401 YFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGF 459

Query: 369 KPTDC 373
            PT C
Sbjct: 460 GPTTC 464


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 176/362 (48%), Gaps = 10/362 (2%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY  +  IGTP   + Y ++DTGSD++W+QC PC +CY Q  PI+NP+SS
Sbjct: 143 VVSGMEQGSGEYFTRIGIGTP-TREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSS 201

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            S+  + C S  C  LD   C     C Y   Y D S T G  ATE +TFG ++    NV
Sbjct: 202 VSFSTVGCDSAVCSQLDANDCHGGG-CLYEVSYGDGSYTVGSYATETLTFGTTS--IQNV 258

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L     + LS  +Q+ +Q G   FSYCLV   ++SS T  + 
Sbjct: 259 AIGCGHDNVGLFVGAAGLLGLGAGS-LSFPAQLGTQTG-RAFSYCLVDRDSESSGT--LE 314

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  S V  G + +  + +    T+Y++++  ISVG +   S     +       +G + 
Sbjct: 315 FGPES-VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGII 373

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFD 311
           ID+G   T L    Y+ L +      +  P  D       CY   ++  ++ P +  HF 
Sbjct: 374 IDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFS 433

Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
            GA   L   +  IP    G FCFA  P D ++ I GN  Q  + + +D  + +V F   
Sbjct: 434 NGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAID 493

Query: 372 DC 373
            C
Sbjct: 494 QC 495


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 129/365 (35%), Positives = 180/365 (49%), Gaps = 22/365 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY ++  IG PP    Y ++DTGSD+ W+QC PC +CY+Q  PI++P SS
Sbjct: 138 VVSGTSQGSGEYFLRVGIGKPPS-QAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSS 196

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +SY  + C + QC  LD   C +   C Y   Y D S T G  ATE +T G +    +NV
Sbjct: 197 NSYSPIRCDAPQCKSLDLSECRNGT-CLYEVSYGDGSYTVGEFATETVTLGTAA--VENV 253

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F     GL+GLG  +LS  +Q+     A  FSYCLV  + DS   S + 
Sbjct: 254 AIGCGHNNEGLF-VGAAGLLGLGGGKLSFPAQV----NATSFSYCLV--NRDSDAVSTLE 306

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS---SGAISKG 249
           F   S +    V +    + E  T+Y++ L+GISVG      + +P   S     AI  G
Sbjct: 307 F--NSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGG-----EALPIPESIFEVDAIGGG 359

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
            + ID+G   T L  + Y+ L +      K  P  +       CY   S   +  P ++ 
Sbjct: 360 GIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSF 419

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  G ++PL   +  IP    G FCFA  P    + I GN  Q    +G+D  + +V F
Sbjct: 420 HFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGF 479

Query: 369 KPTDC 373
               C
Sbjct: 480 SADSC 484


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 171/365 (46%), Gaps = 25/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  S  +GEY  +  +G P     Y ++DTGSD+ W+QC PC  CY+Q  PI++P SSSS
Sbjct: 146 SGTSQGSGEYFSRVGVGQPAK-PFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSS 204

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           +  L C+S+QC  L+T  C + + C Y   Y D S T G   TE +TFGNS    ++V  
Sbjct: 205 FASLPCESQQCQALETSGCRASK-CLYQVSYGDGSFTVGEFVTETLTFGNS-GMINDVAV 262

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCGH+N G+F  +                 + SQ+ A+ FSYCLV    D   +S     
Sbjct: 263 GCGHDNEGLFVGSAG-----LLGLGGGPLSLTSQMKASSFSYCLV----DRDSSSSSDLE 313

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMF 252
             S      V +  L S +  T+Y+V L G+SVG   LS    L    +S      G + 
Sbjct: 314 FNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDS----GYGGII 369

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PILTA 308
           +D+G   T L    YN L +     +  TPY     G  L   CY   S + +  P ++ 
Sbjct: 370 VDSGTAITRLQTQAYNTLRDAF---VSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSF 426

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
            F GG  + L   +  IP    G FCFA  P    + I GN  Q    + YD  + +V F
Sbjct: 427 EFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGF 486

Query: 369 KPTDC 373
            P  C
Sbjct: 487 SPHKC 491


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 132/376 (35%), Positives = 188/376 (50%), Gaps = 31/376 (8%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+ +    G Y M  S+GTP LL    + DTGSDL+W QC PC +C++Q  P + PASSS
Sbjct: 76  QALLENGVGGYNMNISVGTP-LLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSS 134

Query: 74  SYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           ++ +L C S  C  L +++   +   C Y Y Y  S  T G LATE +  G+++  F +V
Sbjct: 135 TFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKVGDAS--FPSV 191

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC   N GV N    G+ GLGR  LSL    + QLG  +FSYCL      ++  S + 
Sbjct: 192 AFGCSTEN-GVGNSTS-GIAGLGRGALSL----IPQLGVGRFSYCLR--SGSAAGASPIL 243

Query: 193 FGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
           FG+ + ++ G V ST  V+      +YY+V L GI+VG        +P   S+   ++  
Sbjct: 244 FGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETD-----LPVTTSTFGFTQNG 298

Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIA-P 304
             G   +D+G   T L KD Y  +++   +        +   G  LC+K T    GIA P
Sbjct: 299 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVP 358

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD--VGIFGNFAQSDLFIGY 359
            L   FDGGA+  +      +    +G   V C  M P  GD  + + GN  Q D+ + Y
Sbjct: 359 SLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLY 418

Query: 360 DFDSQMVSFKPTDCTK 375
           D D  + SF P DC K
Sbjct: 419 DLDGGIFSFSPADCAK 434


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 124/403 (30%), Positives = 193/403 (47%), Gaps = 56/403 (13%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V ++ V +A GEY++K  +GTP        +DT SDL+W QC PCV+CYKQ+ P++NP +
Sbjct: 76  VAEAPVLSAGGEYLVKLGLGTPQHC-FTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVA 134

Query: 72  SSSYKELSCQSEQCHLLDTVSCSS------QQLCNYTYGYADSSLTKGVLATERITFGNS 125
           S+SY  + C S+ C  LDT  C+       +  C YTY Y  ++ T+G+LA +R+  G  
Sbjct: 135 STSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIG-- 192

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
           ++ F  VVFGC  ++ G       G+VGLGR  LSL    +SQL   +F YCL P  + S
Sbjct: 193 DDVFRGVVFGCSSSSVGGPPPQVSGVVGLGRGALSL----VSQLSVRRFMYCLPPPVSRS 248

Query: 186 SITSKMYFGNGS-----EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
           +   ++  G  +       S   VV  S  S+   +YY++ L+GIS+G+ + S +     
Sbjct: 249 A--GRLVLGADAAATVRNASERVVVPMSTGSRY-PSYYYLNLDGISIGDRAMSFR---SR 302

Query: 241 NSSGAISKGN--------------------------MFIDTGAPPTLLPKDFYNRLEEQV 274
           N   A + G                           M ID  +  T L +  Y  + + +
Sbjct: 303 NRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDL 362

Query: 275 RNAIKLTPYQDPRLGSQLCYKTPS---MAGI-APILTAHFDGGAKVPLIHTSTFIPPPVE 330
              I+L       LG  LC+  P    M+ + AP ++  F+ G  + L     F+     
Sbjct: 363 EEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFE-GVWLRLDKEQMFVEDRAS 421

Query: 331 GVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G+ C  +   DG V I GN+ Q ++ + Y+     ++F  T C
Sbjct: 422 GMMCLMVGKTDG-VSILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 122/367 (33%), Positives = 174/367 (47%), Gaps = 22/367 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
            Q  +S   G YV+   +GTP   D+  + DTGSDL WVQC PC  CY+Q  P+++PA S
Sbjct: 135 AQRGISLGTGNYVVSMGLGTP-ARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARS 193

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  + C S +C  LD+ SCS  + C Y   Y D S T G LA + +T   S +     
Sbjct: 194 STYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQS-DVLPGF 252

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           VFGCG  +TG+F   + GLVGLGR ++SL+SQ  S+ GA  FSYCL      SS ++  Y
Sbjct: 253 VFGCGEQDTGLFGRAD-GLVGLGREKVSLSSQAASKYGAG-FSYCL-----PSSPSAAGY 305

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
              G         +      +  ++Y+V L G+ V G     S ++  ++++G +     
Sbjct: 306 LSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIV--FSAAGTV----- 358

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTA 308
            ID+G   T LP   Y  L      ++    Y+     S L  CY       +  P +  
Sbjct: 359 -IDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVAL 417

Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GGA V L  +   ++    +    FA      D GI GN  Q  L + YD   Q + 
Sbjct: 418 VFAGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIG 477

Query: 368 FKPTDCT 374
           F    C+
Sbjct: 478 FGANGCS 484


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 171/365 (46%), Gaps = 25/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  S  +GEY  +  +G P     Y ++DTGSD+ W+QC PC  CY+Q  PI++P SSSS
Sbjct: 146 SGTSQGSGEYFSRVGVGQPAK-PFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSS 204

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           +  L C+S+QC  L+T  C + + C Y   Y D S T G    E +TFGNS    +NV  
Sbjct: 205 FASLPCESQQCQALETSGCRASK-CLYQVSYGDGSFTVGEFVIETLTFGNS-GMINNVAV 262

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCGH+N G+F  +              +  + SQ+ A+ FSYCLV    D   +S     
Sbjct: 263 GCGHDNEGLFVGSAG-----LLGLGGGSLSLTSQMKASSFSYCLV----DRDSSSSSDLE 313

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMF 252
             S      V +  L S +  T+Y+V L G+SVG   LS    L    +S      G + 
Sbjct: 314 FNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDS----GYGGII 369

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PILTA 308
           +D+G   T L    YN L +     +  TPY     G  L   CY   S + +  P ++ 
Sbjct: 370 VDSGTAITRLQTQAYNTLRDAF---VSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSF 426

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
            F GG  + L   +  IP    G FCFA  P    + I GN  Q    + YD  + +V F
Sbjct: 427 EFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGF 486

Query: 369 KPTDC 373
            P  C
Sbjct: 487 SPHKC 491


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 179/362 (49%), Gaps = 10/362 (2%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTP + + Y ++DTGSD++W+QC PC +CY QV PI+NP+ S
Sbjct: 186 VVSGMAQGSGEYFTRIGVGTP-MREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLS 244

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  L C S  C  LD  +C     C Y   Y D S T G  ATE +TFG ++    NV
Sbjct: 245 ASFSTLGCNSAVCSYLDAYNCHGGG-CLYKVSYGDGSYTIGSFATEMLTFGTTS--VRNV 301

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L+GLG   LS  SQ+ +Q G   FSYCLV   ++SS T  + 
Sbjct: 302 AIGCGHDNAGLFVGAAG-LLGLGAGLLSFPSQLGTQTG-RAFSYCLVDRFSESSGT--LE 357

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  S V  G +++  L +    T+Y+V L  ISVG     S     +       +G   
Sbjct: 358 FGPES-VPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFI 416

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
           +D+G   T L    Y+ + +      +  P  +       CY    +  +  P +  HF 
Sbjct: 417 VDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHFS 476

Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
            GA + L   +  IP    G FCFA  P   D+ I GN  Q  + + +D  + +V F   
Sbjct: 477 NGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALR 536

Query: 372 DC 373
            C
Sbjct: 537 QC 538


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 128/365 (35%), Positives = 180/365 (49%), Gaps = 22/365 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  S  +GEY ++  IG PP    Y ++DTGSD+ W+QC PC +CY+Q  PI++P SS
Sbjct: 138 VVSGTSQGSGEYFLRVGIGKPPS-QAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISS 196

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +SY  + C   QC  LD   C +   C Y   Y D S T G  ATE +T G++    +NV
Sbjct: 197 NSYSPIRCDEPQCKSLDLSECRNGT-CLYEVSYGDGSYTVGEFATETVTLGSAA--VENV 253

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F     GL+GLG  +LS  +Q+     A  FSYCLV  + DS   S + 
Sbjct: 254 AIGCGHNNEGLF-VGAAGLLGLGGGKLSFPAQV----NATSFSYCLV--NRDSDAVSTLE 306

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAISKG 249
           F   S +      +  + + E  T+Y++ L+GISVG      + +P   SS    AI  G
Sbjct: 307 F--NSPLPRNAATAPLMRNPELDTFYYLGLKGISVGG-----EALPIPESSFEVDAIGGG 359

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTA 308
            + ID+G   T L  + Y+ L +      K  P  +       CY   S   +  P ++ 
Sbjct: 360 GIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSF 419

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
            F  G ++PL   +  IP    G FCFA  P    + I GN  Q    +G+D  + +V F
Sbjct: 420 RFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGF 479

Query: 369 KPTDC 373
               C
Sbjct: 480 SVDSC 484


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 137/389 (35%), Positives = 189/389 (48%), Gaps = 54/389 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--------QCYKQVKPIYNPASSS 73
           GEY+M  SIGTPPL     I DTGSDL+W QC PC         QC+KQ   +YNP+SS+
Sbjct: 85  GEYIMTLSIGTPPL-SYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSST 143

Query: 74  SYKELSCQS--EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---- 127
           ++  L C S    C  +   S      C Y   Y  +  T GV + E  TFG+S+     
Sbjct: 144 TFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYG-TGWTAGVQSVETFTFGSSSTPPAV 202

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
              N+ FGC + ++  +N    GLVGLGR  +SL    +SQLGA  FSYCL PF  D++ 
Sbjct: 203 RVPNIAFGCSNASSNDWN-GSAGLVGLGRGSMSL----VSQLGAGAFSYCLTPFQ-DANS 256

Query: 188 TSKMYFGNGSEVS---GGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYY 240
           TS +  G  +  +    G V ST  V+   K    TYY++ L GISVG    +  + P  
Sbjct: 257 TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGE--TALAIPPDA 314

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN----AIKLTPYQDPRLGSQLCY-- 294
            S  A   G + ID+G   T L    Y ++   VR+     + L    D   G  LC+  
Sbjct: 315 FSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFAL 374

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQ-PIDGDVGI 346
           K  +     P +T HF+GGA + L         PVE       GV+C AM+    G + +
Sbjct: 375 KASTPPPAMPSMTLHFEGGADMVL---------PVENYMILGSGVWCLAMRNQTVGAMSM 425

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            GN+ Q ++ + YD   + +SF P  C+ 
Sbjct: 426 VGNYQQQNIHVLYDVRKETLSFAPAVCSS 454


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 134/383 (34%), Positives = 190/383 (49%), Gaps = 37/383 (9%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+ +  + G Y M  SIGTPP+     + DTGS L+W QC PC +C  +  P + PASSS
Sbjct: 80  QTLLDNSAGAYNMNLSIGTPPV-TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSS 138

Query: 74  SYKELSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           ++ +L C S  C  L +  ++C++   C Y Y Y     T G LATE +  G ++  F  
Sbjct: 139 TFSKLPCASSLCQFLTSPYLTCNATG-CVYYYPYG-MGFTAGYLATETLHVGGAS--FPG 194

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI-TSK 190
           V FGC   N GV N +  G+VGLGR+ LSL SQ+    G  +FSYCL    +D+    S 
Sbjct: 195 VAFGCSTEN-GVGNSSS-GIVGLGRSPLSLVSQV----GVGRFSYCL---RSDADAGDSP 245

Query: 191 MYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVG--NLSNSSKLIPYYNSSGA 245
           + FG+ ++V+GG V ST L+   +    +YY+V L GI+VG  +L  +S    +   +GA
Sbjct: 246 ILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGA 305

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
              G   +D+G   T L K+ Y  ++     Q+  A   T     R G  LC+   +  G
Sbjct: 306 GLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGG 365

Query: 302 IA----PILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAMQPIDG--DVGIFGNF 350
            +    P L   F GGA+  +   S      V+      V C  + P      + I GN 
Sbjct: 366 GSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNV 425

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            Q DL + YD D  M SF P DC
Sbjct: 426 MQMDLHVLYDLDGGMFSFAPADC 448


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 172/356 (48%), Gaps = 12/356 (3%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
            +GEY  +  IGTP   + Y ++DTGSD++W+QC PC +CY Q  PI+NP+SS S+  + 
Sbjct: 4   GSGEYFTRIGIGTP-TREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVG 62

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C S  C  LD   C     C Y   Y D S T G  ATE +TFG ++    NV  GCGH+
Sbjct: 63  CDSAVCSQLDANDCHGGG-CLYEVSYGDGSYTVGSYATETLTFGTTS--IQNVAIGCGHD 119

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N G+F      L     + LS  +Q+ +Q G   FSYCLV   ++SS T +     G E 
Sbjct: 120 NVGLFVGAAGLLGLGAGS-LSFPAQLGTQTG-RAFSYCLVDRDSESSGTLEF----GPES 173

Query: 200 SGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
              G + T LV+     T+Y++++  ISVG +   S     +       +G + ID+G  
Sbjct: 174 VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTA 233

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
            T L    Y+ L +      +  P  D       CY   ++  ++ P +  HF  GA   
Sbjct: 234 VTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFI 293

Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           L   +  IP    G FCFA  P D ++ I GN  Q  + + +D  + +V F    C
Sbjct: 294 LPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 130/373 (34%), Positives = 186/373 (49%), Gaps = 38/373 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPPL     I DTGSDL+W QC PC  QC+KQ    YNP+SS+++  L C
Sbjct: 86  GEYIMTLAIGTPPL-SYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPC 144

Query: 81  QS--EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNVVFG 135
            S    C  L   S      C Y   Y  +  T G+ + E  TFG++         + FG
Sbjct: 145 NSSVSMCAALAGPSPPPGCSCMYNQTYG-TGWTAGIQSVETFTFGSTPADQTRVPGIAFG 203

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           C + ++  +N    GLVGLGR  +SL    +SQLGA  FSYCL PF  D++ TS +  G 
Sbjct: 204 CSNASSDDWN-GSAGLVGLGRGSMSL----VSQLGAGMFSYCLTPFQ-DANSTSTLLLGP 257

Query: 196 GSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            + ++G GV++T  V+   K    TYY++ L GIS+G  + S  + P   +      G +
Sbjct: 258 SAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALS--IPPNAFALRTDGTGGL 315

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKT-------PSMAGI 302
            ID+G   T L    Y ++   + + + L P  D     G  LC+         PSM   
Sbjct: 316 IIDSGTTITSLVDAAYQQVRAAIESLVTL-PVADGSDSTGLDLCFALTSETSTPPSM--- 371

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGYDF 361
            P +T HFDG   V  +     +     GV+C AM+    G +  FGN+ Q ++ + YD 
Sbjct: 372 -PSMTFHFDGADMVLPVDNYMILG---SGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDI 427

Query: 362 DSQMVSFKPTDCT 374
             + +SF P  C+
Sbjct: 428 HEETLSFAPAKCS 440


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 196/383 (51%), Gaps = 35/383 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V +++ + +G Y+MK  IGTPP  +I+  +DTGS+++W+ C+ C  C+ Q   I+NP +S
Sbjct: 87  VHASIFSGDGNYLMKLLIGTPPT-EIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLAS 145

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADS-SLTKGVLATERITFGNSNNF--- 128
           S+Y++  C S QC    + SC S  +C Y+       +   G +A + +T  +S+     
Sbjct: 146 STYQDAPCDSYQCETTSS-SCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFP 204

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
                F CG++    F    +G++GLGR  LSL S+ L  L   KFSYCL  ++  S   
Sbjct: 205 LPYSDFVCGNSIYKTF--AGVGVIGLGRGALSLTSK-LYHLSDGKFSYCLADYY--SKQP 259

Query: 189 SKMYFGNGSEVSGGG--VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           SK+ FG  S +S     VVST+L        Y+VTLEGISVG      + + Y +   A 
Sbjct: 260 SKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVG---EKRQDLYYVDDPFAP 316

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI- 305
             GNM ID+G   TLLPKDFY+ L   V  AI   P   P   S+  +   +   ++P  
Sbjct: 317 PVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPH-NSRFPFSMDNTLKLSPCF 375

Query: 306 ----------LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQ 352
                     +T HF   A V L   ++FI    E V CFA    QP  G   ++G++ Q
Sbjct: 376 WYYPELKFPKITIHF-TDADVELSDDNSFI-RVAEDVVCFAFAATQP--GQSTVYGSWQQ 431

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
            +  +GYD     VSFK TDC+K
Sbjct: 432 MNFILGYDLKRGTVSFKRTDCSK 454


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/379 (33%), Positives = 190/379 (50%), Gaps = 41/379 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +  I+DTGSDL W QC PCV C++Q  P +NP+ S ++  L C  
Sbjct: 110 EYLVHMAIGTPPQ-PVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDL 168

Query: 83  EQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFD-----NVV 133
             C  L   SC  Q     +C Y Y YAD S+T G L ++  +F ++++        ++ 
Sbjct: 169 RICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLT 228

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  N G+F  NE G+ G  R  LS+     +QL  + FSYC        S  S ++ 
Sbjct: 229 FGCGLFNNGIFVSNETGIAGFSRGALSMP----AQLKVDNFSYCFTAI--TGSEPSPVFL 282

Query: 194 GNG----SEVSGGG---VVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
           G      S+ +GGG   V ST+L+   S + K YY ++L+G++VG     +  +P   S 
Sbjct: 283 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY-ISLKGVTVG-----TTRLPIPESV 336

Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
            A+ +   G   +D+G   T+LP+  YN + +      KLT +      SQLC+  P  A
Sbjct: 337 FALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGA 396

Query: 301 G-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDLF 356
               P L  HF+ GA + L   +        G   + C A+   + D+ + GNF Q ++ 
Sbjct: 397 KPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMH 454

Query: 357 IGYDFDSQMVSFKPTDCTK 375
           + YD  + M+SF P  C K
Sbjct: 455 VLYDLANDMLSFVPARCNK 473


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/379 (33%), Positives = 190/379 (50%), Gaps = 41/379 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +  I+DTGSDL W QC PCV C++Q  P +NP+ S ++  L C  
Sbjct: 110 EYLVHMAIGTPPQ-PVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDL 168

Query: 83  EQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFD-----NVV 133
             C  L   SC  Q     +C Y Y YAD S+T G L ++  +F ++++        ++ 
Sbjct: 169 RICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLT 228

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  N G+F  NE G+ G  R  LS+     +QL  + FSYC        S  S ++ 
Sbjct: 229 FGCGLFNNGIFVSNETGIAGFSRGALSMP----AQLKVDNFSYCFTAI--TGSEPSPVFL 282

Query: 194 GNG----SEVSGGG---VVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
           G      S+ +GGG   V ST+L+   S + K YY ++L+G++VG     +  +P   S 
Sbjct: 283 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY-ISLKGVTVG-----TTRLPIPESV 336

Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
            A+ +   G   +D+G   T+LP+  YN + +      KLT +      SQLC+  P  A
Sbjct: 337 FALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGA 396

Query: 301 G-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDLF 356
               P L  HF+ GA + L   +        G   + C A+   + D+ + GNF Q ++ 
Sbjct: 397 KPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMH 454

Query: 357 IGYDFDSQMVSFKPTDCTK 375
           + YD  + M+SF P  C K
Sbjct: 455 VLYDLANDMLSFVPARCNK 473


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 122/403 (30%), Positives = 178/403 (44%), Gaps = 52/403 (12%)

Query: 13  VQSNVSTANG-------EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ-VK 64
           V++ V TA         EY++  S+GTPP   +   +DTGSDL+W QC PC+ C+ Q   
Sbjct: 76  VRARVRTAGAGGGIVTNEYLVHLSVGTPPR-PVALTLDTGSDLVWTQCAPCLNCFDQGAI 134

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSS------QQLCNYTYGYADSSLTKGVLATE 118
           P+ +PA+SS++  + C +  C  L   SC        ++ C Y Y Y D S+T G LA++
Sbjct: 135 PVLDPAASSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASD 194

Query: 119 RITFGNSNNF------FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
           R TFG  +N          + FGCGH N G+F  NE G+ G GR R SL     SQLG  
Sbjct: 195 RFTFGPGDNADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLP----SQLGVT 250

Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLS 231
            FSYC       +S    +          G V ST L+    + + YF++L+ I+VG   
Sbjct: 251 SFSYCFTSMFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVG--- 307

Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
             +  IP       + + +  ID+GA  T LP+D Y  ++ +    + L           
Sbjct: 308 --ATRIPIPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALD 365

Query: 292 LCYKTPSMAG------------------IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVF 333
           LC+  PS A                     P L  H  GGA   L   +         V 
Sbjct: 366 LCFALPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVM 425

Query: 334 CFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           C  +    G      + GN+ Q +  + YD ++ ++SF P  C
Sbjct: 426 CLVLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/379 (33%), Positives = 190/379 (50%), Gaps = 41/379 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +  I+DTGSDL W QC PCV C++Q  P +NP+ S ++  L C  
Sbjct: 84  EYLVHMAIGTPPQ-PVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDL 142

Query: 83  EQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFD-----NVV 133
             C  L   SC  Q     +C Y Y YAD S+T G L ++  +F ++++        ++ 
Sbjct: 143 RICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLT 202

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  N G+F  NE G+ G  R  LS+     +QL  + FSYC        S  S ++ 
Sbjct: 203 FGCGLFNNGIFVSNETGIAGFSRGALSMP----AQLKVDNFSYCFTAI--TGSEPSPVFL 256

Query: 194 GNG----SEVSGGG---VVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
           G      S+ +GGG   V ST+L+   S + K YY ++L+G++VG     +  +P   S 
Sbjct: 257 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY-ISLKGVTVG-----TTRLPIPESV 310

Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
            A+ +   G   +D+G   T+LP+  YN + +      KLT +      SQLC+  P  A
Sbjct: 311 FALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGA 370

Query: 301 G-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDLF 356
               P L  HF+ GA + L   +        G   + C A+   + D+ + GNF Q ++ 
Sbjct: 371 KPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMH 428

Query: 357 IGYDFDSQMVSFKPTDCTK 375
           + YD  + M+SF P  C K
Sbjct: 429 VLYDLANDMLSFVPARCNK 447


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 128/368 (34%), Positives = 175/368 (47%), Gaps = 24/368 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q       G Y++    GTP    +  I+DTGSD+ W+QC PC  CY QV PI+ P  S
Sbjct: 127 LQPGSKVGTGNYIVTAGFGTPAKNSLL-IIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQS 185

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SSYK LSC S  C  L T++      C Y   Y D S ++G  + E +T G+ +  F + 
Sbjct: 186 SSYKHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS--FPSF 243

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCGH NTG+F +   GL+GLGRT LS  SQ  S+ G  +FSYCL  F + +S T    
Sbjct: 244 AFGCGHTNTGLF-KGSAGLLGLGRTALSFPSQTKSKYGG-QFSYCLPDFVSSTS-TGSFS 300

Query: 193 FGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            G GS  +    V   LVS  +  ++YFV L GISVG    S   IP       + +G  
Sbjct: 301 VGQGSIPATATFV--PLVSNSNYPSFYFVGLNGISVGGERLS---IP----PAVLGRGGT 351

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
            +D+G   T L    Y+ L+   R+  +  P   P      CY   S + +  P +T HF
Sbjct: 352 IVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHF 411

Query: 311 DGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
              A V +           +G      F  A Q I  +  I GNF Q  + + +D  +  
Sbjct: 412 QNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTN--IIGNFQQQRMRVAFDTGAGR 469

Query: 366 VSFKPTDC 373
           + F P  C
Sbjct: 470 IGFAPGSC 477


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 176/362 (48%), Gaps = 21/362 (5%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PC  C+ Q  P Y+ + SS++   SC S
Sbjct: 90  EYLLHLAIGTPPQ-PVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDS 148

Query: 83  EQCHLLDTVS-CSSQ--QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            QC L  +V+ C +Q  Q C ++Y Y D S T G L  E ++F  +      VVFGCG N
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSF-VAGASVPGVVFGCGLN 207

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           NTG+F  NE G+ G GR  LSL     SQL    FS+C          T           
Sbjct: 208 NTGIFRSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYK 263

Query: 200 SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDT 255
           +G G V T+ + K     T+Y+++L+GI+VG     S  +P   S+ A+  G     ID+
Sbjct: 264 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVG-----STRLPVPESAFALKNGTGGTIIDS 318

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM--AGIAPILTAHFDGG 313
           G   T LP   Y  + ++    +KL        G  LC+  P +  A   P L  HF+ G
Sbjct: 319 GTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-G 377

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A + L   +        G     +  I+G++ I GNF Q ++ + YD  +  +SF    C
Sbjct: 378 ATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437

Query: 374 TK 375
            K
Sbjct: 438 DK 439


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 180/348 (51%), Gaps = 31/348 (8%)

Query: 42  VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNY 101
           +DTGSDL+W QC PC+ C  Q  P ++   S++Y+ L C+S +C  L + SC  +++C Y
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC-FKKMCVY 59

Query: 102 TYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTR 158
            Y Y D++ T GVLA E  TFG +N+      N+ FGCG  N G    N  G+VG GR  
Sbjct: 60  QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGP 118

Query: 159 LSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG-----NGSEVSGGGVVSTS--LVS 211
           LSL    +SQLG ++FSYCL  +   S+  S++YFG     + +  S G  V ++  +++
Sbjct: 119 LSL----VSQLGPSRFSYCLTSYL--SATPSRLYFGVYANLSSTNTSSGSPVQSTPFVIN 172

Query: 212 KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYN 268
                 YF++L+ IS+G     +KL+P      AI+    G + ID+G   T L +D Y 
Sbjct: 173 PALPNMYFLSLKAISLG-----TKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYE 227

Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI---APILTAHFDGGAKVPLIHTSTFI 325
            +   + +AI L    D  +G   C++ P    +    P L  HFD  A + L+  +  +
Sbjct: 228 AVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD-SANMTLLPENYML 286

Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                G  C  M P  G   I GN+ Q +L + YD  +  +SF P  C
Sbjct: 287 IASTTGYLCLVMAPT-GVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 123/378 (32%), Positives = 179/378 (47%), Gaps = 25/378 (6%)

Query: 7   FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
           + P ++V      V   +GEY ++  +G+PP  D Y +VD+GSD++WVQC PC QCY Q 
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
            P+++PA+SSS+  +SC S  C  L    C        C+Y+  Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           T G +      V  GCGH N+G+F     GL+GLG   +SL  Q+    G   FSYCL  
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAG-GVFSYCLAS 284

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                +    +  G    V  G V    + + +  ++Y+V L GI VG      + +P  
Sbjct: 285 RGAGGA--GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGG-----ERLPLQ 337

Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
           +S   +++   G + +DTG   T LP++ Y  L      A+   P          CY   
Sbjct: 338 DSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLS 397

Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDL 355
             A +  P ++ +FD GA + L   +  +   V G VFC A  P    + I GN  Q  +
Sbjct: 398 GYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQEGI 455

Query: 356 FIGYDFDSQMVSFKPTDC 373
            I  D  +  V F P  C
Sbjct: 456 QITVDSANGYVGFGPNTC 473


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 127/363 (34%), Positives = 180/363 (49%), Gaps = 12/363 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC +CY Q  P+++P  S
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPPKY-VYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKS 194

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            S+  +SC+S  C  LD+  C+S+Q C Y   Y D S T G  +TE +TF  +      V
Sbjct: 195 GSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR--VPKV 252

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L+GLGR RLS  +Q   + G  KFSYCLV   + SS  S + 
Sbjct: 253 ALGCGHDNEGLFVGAAG-LLGLGRGRLSFPTQTGLRFG-RKFSYCLVD-RSASSKPSSVV 309

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  S VS   V +  + + +  T+Y++ L GISVG    +      +    A   G + 
Sbjct: 310 FGQ-SAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTA-GNGGVI 367

Query: 253 IDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
           ID+G   T L +  Y  L +  R  A  L    D  L    C+       +  P +  HF
Sbjct: 368 IDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSL-FDTCFDLSGKTEVKVPTVVMHF 426

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             GA V L  T+  IP    GVFCFA       + I GN  Q    + +D  +  + F  
Sbjct: 427 R-GADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAA 485

Query: 371 TDC 373
             C
Sbjct: 486 RGC 488


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 129/377 (34%), Positives = 186/377 (49%), Gaps = 32/377 (8%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+ +    G Y M  S+GTP LL    + DTGSDL+W QC PC +C++Q  P + PASSS
Sbjct: 76  QALLENGVGGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSS 134

Query: 74  SYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           ++ +L C S  C  L +++   +   C Y Y Y  S  T G LATE +  G+++  F +V
Sbjct: 135 TFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKVGDAS--FPSV 191

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC   N GV N    G+ GLGR  LSL    + QLG  +FSYCL      ++  S + 
Sbjct: 192 AFGCSTEN-GVGNSTS-GIAGLGRGALSL----IPQLGVGRFSYCLR--SGSAAGASPIL 243

Query: 193 FGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
           FG+ + ++ G V ST  V+      +YY+V L GI+VG        +P   S+   ++  
Sbjct: 244 FGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETD-----LPVTTSTFGFTQNG 298

Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG---IA 303
             G   +D+G   T L KD Y  +++   +        +   G  LC+K+    G     
Sbjct: 299 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAV 358

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD--VGIFGNFAQSDLFIG 358
           P L   FDGGA+  +      +    +G   V C  M P  GD  + + GN  Q D+ + 
Sbjct: 359 PSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLL 418

Query: 359 YDFDSQMVSFKPTDCTK 375
           YD D  + SF P DC K
Sbjct: 419 YDLDGGIFSFAPADCAK 435


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 126/375 (33%), Positives = 175/375 (46%), Gaps = 28/375 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
            QS +    G Y++   +GTP   D+  I DTGSDL W QC PCV+ CY Q +PI++P++
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPST 201

Query: 72  SSSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S +Y  +SC S  C  L + +     CSS   C Y   Y DSS T G  A +++T    N
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSN-CVYGIQYGDSSFTIGFFAKDKLTL-TQN 259

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           + FD  +FGCG NN G+F +   GL+GLGR  LS+  Q   + G   FSYCL    T   
Sbjct: 260 DVFDGFMFGCGQNNKGLFGKTA-GLIGLGRDPLSIVQQTAQKFG-KYFSYCL---PTSRG 314

Query: 187 ITSKMYFGNGSEVSGG-----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
               + FGNG+ V        G+  T   S +   YYF+ + GISVG  + S   + + N
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN 374

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           +          ID+G   T LP   Y  L+   +  +   P          CY   +   
Sbjct: 375 A-------GTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTS 427

Query: 302 IA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           I+ P ++ +F+G A V L      I      V   FA    D  +GIFGN  Q  L + Y
Sbjct: 428 ISIPKISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVY 487

Query: 360 DFDSQMVSFKPTDCT 374
           D     + F    C+
Sbjct: 488 DVAGGQLGFGYKGCS 502


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 133/377 (35%), Positives = 187/377 (49%), Gaps = 45/377 (11%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           + GEY+M   IG+PP      ++DTGSDL+W QC PC+ C +Q  P + PA S+SY  L 
Sbjct: 84  SEGEYLMDVGIGSPPRY-FSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLP 142

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCG 137
           C S  C+ L +  C  Q  C Y   Y DS+ + GVLA E  TFG ++       V FGCG
Sbjct: 143 CSSAMCNALYSPLC-FQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCG 201

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG--- 194
           + N G    N  G+VG GR  LSL    +SQLG+ +FSYCL  F   S  TS++YFG   
Sbjct: 202 NMNAGTL-FNGSGMVGFGRGALSL----VSQLGSPRFSYCLTSFM--SPATSRLYFGAYA 254

Query: 195 --NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
             N +  S  G V ++  +V+    T YF+ + GISV     +  L+P   S  AI++  
Sbjct: 255 TLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISV-----AGDLLPIDPSVFAINETD 309

Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTP--- 297
             G + ID+G   T L +  Y  ++      + L     PR  +        C+K P   
Sbjct: 310 GTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGL-----PRANATPSDTFDTCFKWPPPP 364

Query: 298 -SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
             M  + P +  HFD GA + L   +  +     G  C AM P D D  I G+F   +  
Sbjct: 365 RRMVTL-PEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSD-DGSIIGSFQHQNFH 421

Query: 357 IGYDFDSQMVSFKPTDC 373
           + YD ++ ++SF P  C
Sbjct: 422 MLYDLENSLLSFVPAPC 438


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 125/374 (33%), Positives = 179/374 (47%), Gaps = 22/374 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  K  +GTP    +  ++DTGSD++W+QC PC +CY Q  P+++P  S
Sbjct: 129 VVSGLAQGSGEYFTKIGVGTPSTPALM-VLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRS 187

Query: 73  SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSY  + C +  C  LD+  C   ++ C Y   Y D S+T G  ATE +TF         
Sbjct: 188 SSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGAR-VAR 246

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+GLGR  LS  +QI  + G   FSYCLV   + SS  +  
Sbjct: 247 VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYG-KSFSYCLVDRTSSSSSGAAS 304

Query: 192 YFGNGSEVSGGGVVSTS-----LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
              + +   G    S +     + +   +T+Y+V L GISVG                + 
Sbjct: 305 RSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPST 364

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMA 300
            +G + +D+G   T L +  Y+ L +  R A   ++L+P      G  L   CY      
Sbjct: 365 GRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPG-----GFSLFDTCYDLGGRK 419

Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            +  P ++ HF GGA+  L   +  IP    G FCFA    DG V I GN  Q    + +
Sbjct: 420 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 479

Query: 360 DFDSQMVSFKPTDC 373
           D D Q V F P  C
Sbjct: 480 DGDGQRVGFAPKGC 493


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 133/377 (35%), Positives = 187/377 (49%), Gaps = 45/377 (11%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           + GEY+M   IG+PP      ++DTGSDL+W QC PC+ C +Q  P + PA S+SY  L 
Sbjct: 81  SEGEYLMDVGIGSPPRY-FSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLP 139

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCG 137
           C S  C+ L +  C  Q  C Y   Y DS+ + GVLA E  TFG ++       V FGCG
Sbjct: 140 CSSAMCNALYSPLC-FQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCG 198

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG--- 194
           + N G    N  G+VG GR  LSL    +SQLG+ +FSYCL  F   S  TS++YFG   
Sbjct: 199 NMNAGTL-FNGSGMVGFGRGALSL----VSQLGSPRFSYCLTSFM--SPATSRLYFGAYA 251

Query: 195 --NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
             N +  S  G V ++  +V+    T YF+ + GISV     +  L+P   S  AI++  
Sbjct: 252 TLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISV-----AGDLLPIDPSVFAINETD 306

Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTP--- 297
             G + ID+G   T L +  Y  ++      + L     PR  +        C+K P   
Sbjct: 307 GTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGL-----PRANATPSDTFDTCFKWPPPP 361

Query: 298 -SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
             M  + P +  HFD GA + L   +  +     G  C AM P D D  I G+F   +  
Sbjct: 362 RRMVTL-PEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSD-DGSIIGSFQHQNFH 418

Query: 357 IGYDFDSQMVSFKPTDC 373
           + YD ++ ++SF P  C
Sbjct: 419 MLYDLENSLLSFVPAPC 435


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 166/366 (45%), Gaps = 20/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP  + Y ++D+GSD++WVQC PC QCY Q  P++NPA S
Sbjct: 123 VVSGMEQGSGEYFVRIGVGSPPR-NQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADS 181

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SSY  +SC S  C  +D   C   + C Y   Y D S TKG LA E +TFG +     NV
Sbjct: 182 SSYAGVSCASTVCSHVDNAGCHEGR-CRYEVSYGDGSYTKGTLALETLTFGRT--LIRNV 238

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F     GL+GLG   +S   Q+  Q G   FSYCLV     SS    + 
Sbjct: 239 AIGCGHHNQGMF-VGAAGLLGLGSGPMSFVGQLGGQAGGT-FSYCLVSRGIQSS--GLLQ 294

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  +   G   V      +    YY         G     S+ +   +  G    G + 
Sbjct: 295 FGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELG---DGGVV 351

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILT 307
           +DTG   T LP   Y    E  R+A        PR         CY       +  P ++
Sbjct: 352 MDTGTAVTRLPTAAY----EAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVS 407

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            +F GG  + L   +  IP    G FCFA  P    + I GN  Q  + I  D  +  V 
Sbjct: 408 FYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVG 467

Query: 368 FKPTDC 373
           F P  C
Sbjct: 468 FGPNVC 473


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  174 bits (441), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 122/378 (32%), Positives = 178/378 (47%), Gaps = 25/378 (6%)

Query: 7   FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
           + P ++V      V   +GEY ++  +G+PP  D Y +VD+GSD++WVQC PC QCY Q 
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
            P+++PA+SSS+  +SC S  C  L    C        C+Y+  Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           T G +      V  GCGH N+G+F     GL+GLG   +SL  Q+    G   FSYCL  
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLIGQLGGAAG-GVFSYCLAS 284

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                +    +  G    V  G V    + + +  ++Y+V L GI VG      + +P  
Sbjct: 285 RGAGGA--GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGG-----ERLPLQ 337

Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
           +    +++   G + +DTG   T LP++ Y  L      A+   P          CY   
Sbjct: 338 DGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLS 397

Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDL 355
             A +  P ++ +FD GA + L   +  +   V G VFC A  P    + I GN  Q  +
Sbjct: 398 GYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQEGI 455

Query: 356 FIGYDFDSQMVSFKPTDC 373
            I  D  +  V F P  C
Sbjct: 456 QITVDSANGYVGFGPNTC 473


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 175/362 (48%), Gaps = 21/362 (5%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGS L+W QC PC  C+ Q  P Y+ + SS++   SC S
Sbjct: 90  EYLLHLAIGTPPQ-PVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDS 148

Query: 83  EQCHLLDTVS-CSSQ--QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            QC L  +V+ C +Q  Q C Y+Y Y D S T G L  E ++F  +      VVFGCG N
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF-VAGASVPGVVFGCGLN 207

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           NTG+F  NE G+ G GR  LSL     SQL    FS+C          T           
Sbjct: 208 NTGIFRSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYK 263

Query: 200 SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDT 255
           +G G V T+ + K     T+Y+++L+GI+VG     S  +P   S+ A+  G     ID+
Sbjct: 264 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVG-----STRLPVPESAFALKNGTGGTIIDS 318

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM--AGIAPILTAHFDGG 313
           G   T LP   Y  + ++    +KL        G  LC+  P +  A   P L  HF+ G
Sbjct: 319 GTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-G 377

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A + L   +        G     +  I+G++ I GNF Q ++ + YD  +  +SF    C
Sbjct: 378 ATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437

Query: 374 TK 375
            K
Sbjct: 438 DK 439


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 127/380 (33%), Positives = 181/380 (47%), Gaps = 38/380 (10%)

Query: 7   FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
           + P ++V      V   +GEY ++  +G+PP  D Y +VD+GSD++WVQC PC QCY Q 
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
            P+++PA+SSS+  +SC S  C  L    C        C+Y+  Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           T G +      V  GCGH N+G+F     GL+GLG   +SL  Q+    G   FSYCL  
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAG-GVFSYCL-- 282

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIP 238
                   S+   G GS V G     T  V +  +  ++Y+V L GI VG      + +P
Sbjct: 283 -------ASRGAGGAGSLVLG----RTEAVPRGRRASSFYYVGLTGIGVGG-----ERLP 326

Query: 239 YYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
             +S   +++   G + +DTG   T LP++ Y  L      A+   P          CY 
Sbjct: 327 LQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYD 386

Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQS 353
               A +  P ++ +FD GA + L   +  +   V G VFC A  P    + I GN  Q 
Sbjct: 387 LSGYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQE 444

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
            + I  D  +  V F P  C
Sbjct: 445 GIQITVDSANGYVGFGPNTC 464


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 174/370 (47%), Gaps = 20/370 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S ++   GEY     +GTP   D+Y +VDTGSD+ W+QC PC  CYKQ   ++NP+SSSS
Sbjct: 7   SGLAFGTGEYFAVVGVGTP-RRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSS 65

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI----TFGNSNNFFD 130
           +K L C S  C  LD + C S + C Y   Y D S T G L T+ +     FG       
Sbjct: 66  FKVLDCSSSLCLNLDVMGCLSNK-CLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLT 124

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           N+  GCGH+N G F     G++GLGR  LS  +  L     N FSYCL    +D +  S 
Sbjct: 125 NIPLGCGHDNEGTFG-TAAGILGLGRGPLSFPNN-LDASTRNIFSYCLPDRESDPNHKST 182

Query: 191 MYFGNGS---EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGA 245
           + FG+ +     +G       L +    TYY+V + GISVG   L+N    +   +S G 
Sbjct: 183 LVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHG- 241

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA-IKLTPYQDPRLGSQLCYKTPSMAGIA- 303
              G    D+G   T L    Y  + +  R A + LT   D ++    CY    M  I+ 
Sbjct: 242 --NGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKI-FDTCYDFTGMNSISV 298

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P +T HF G   + L  ++  +P     +FCFA     G   + GN  Q    + YD   
Sbjct: 299 PTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGP-SVIGNVQQQSFRVIYDNVH 357

Query: 364 QMVSFKPTDC 373
           + +   P  C
Sbjct: 358 KQIGLLPDQC 367


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 175/362 (48%), Gaps = 21/362 (5%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGS L+W QC PC  C+ Q  P Y+ + SS++   SC S
Sbjct: 34  EYLLHLAIGTPPQ-PVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDS 92

Query: 83  EQCHLLDTVS-CSSQ--QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            QC L  +V+ C +Q  Q C Y+Y Y D S T G L  E ++F  +      VVFGCG N
Sbjct: 93  TQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF-VAGASVPGVVFGCGLN 151

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           NTG+F  NE G+ G GR  LSL     SQL    FS+C          T           
Sbjct: 152 NTGIFRSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYK 207

Query: 200 SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDT 255
           +G G V T+ + K     T+Y+++L+GI+VG     S  +P   S+ A+  G     ID+
Sbjct: 208 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVG-----STRLPVPESAFALKNGTGGTIIDS 262

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM--AGIAPILTAHFDGG 313
           G   T LP   Y  + ++    +KL        G  LC+  P +  A   P L  HF+ G
Sbjct: 263 GTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-G 321

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A + L   +        G     +  I+G++ I GNF Q ++ + YD  +  +SF    C
Sbjct: 322 ATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381

Query: 374 TK 375
            K
Sbjct: 382 DK 383


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 121/372 (32%), Positives = 181/372 (48%), Gaps = 22/372 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +  A+GEY     +GTPP   +  ++DTGSD++W+QC PCV CY+Q+ P+Y+P  S
Sbjct: 88  VISGLPFASGEYFASVGVGTPPTPALL-VIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGS 146

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y +  C   QC    T   ++   C Y   Y D+S T G LAT+R+ F N  +   NV
Sbjct: 147 STYAQTPCSPPQCRNPQTCDGTTGG-CGYRIVYGDASSTSGNLATDRLVFSNDTS-VGNV 204

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F  +  GL+G+ R   S A+Q+    G   F+YCL       S +S + 
Sbjct: 205 TLGCGHDNEGLFG-SAAGLLGVARGNNSFATQVADSYG-RYFAYCLGDRTRSGSSSSYLV 262

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG-----NLSNSS-KLIPYYNSSGAI 246
           FG  +      V +    +    + Y+V + G SVG       SN+S  L P      A 
Sbjct: 263 FGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDP------AT 316

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGS--QLCYKTPSMA-GI 302
            +G + +D+G   T   +D Y  L +     A K+   +  R  S    CY    +A   
Sbjct: 317 GRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVAD 376

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDF 361
           AP +  HF GGA V L   +  +P       CFA++    D + + GN  Q    + +D 
Sbjct: 377 APGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDV 436

Query: 362 DSQMVSFKPTDC 373
           +++ V F+P  C
Sbjct: 437 ENERVGFEPNGC 448


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 168/368 (45%), Gaps = 19/368 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY  +  IG+P    +Y ++DTGSD+ W+QC PC  CY Q  P+++PA S
Sbjct: 185 VVSGVGQGSGEYFSRIGIGSP-ARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALS 243

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL-----CNYTYGYADSSLTKGVLATERITF-GNSN 126
           SSY  + C S  C  LD  +C +        C Y   Y D S T G  ATE +T  G+ +
Sbjct: 244 SSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGS 303

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
               +V  GCGH+N G+F      L+ LG   LS  SQI     A +FSYCLV    DS 
Sbjct: 304 AAVHDVAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SATEFSYCLV--DRDSP 356

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
             S + FG     +   V +  + S    T+Y+V L GISVG     S + P   +    
Sbjct: 357 SASTLQFGASDSST---VTAPLMRSPRSNTFYYVALNGISVGG-ETLSDIPPAAFAMDEQ 412

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
             G + +D+G   T L    Y+ L +      +  P          CY     + +  P 
Sbjct: 413 GSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPA 472

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           ++  F+GG ++ L   +  IP    G +C A     G V I GN  Q  + + +D     
Sbjct: 473 VSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNT 532

Query: 366 VSFKPTDC 373
           V F P  C
Sbjct: 533 VGFSPNKC 540


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 126/375 (33%), Positives = 175/375 (46%), Gaps = 28/375 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
            QS +    G Y++   +GTP   D+  I DTGSDL W QC PCV+ CY Q +PI++P++
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSA 201

Query: 72  SSSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S +Y  +SC S  C  L + +     CSS   C Y   Y DSS T G  A + +T    N
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSN-CVYGIQYGDSSFTVGFFAKDTLTL-TQN 259

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           + FD  +FGCG NN G+F +   GL+GLGR  LS+  Q   + G   FSYCL    T   
Sbjct: 260 DVFDGFMFGCGQNNRGLFGKTA-GLIGLGRDPLSIVQQTAQKFG-KYFSYCL---PTSRG 314

Query: 187 ITSKMYFGNG-----SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
               + FGNG     S+    G+  T   S +  T+YF+ + GISVG  + S   + + N
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           +          ID+G   T LP   Y  L+   +  +   P          CY   +   
Sbjct: 375 A-------GTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTS 427

Query: 302 IA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           I+ P ++ +F+G A V L      I      V   FA    D  +GIFGN  Q  L + Y
Sbjct: 428 ISIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVY 487

Query: 360 DFDSQMVSFKPTDCT 374
           D     + F    C+
Sbjct: 488 DVAGGQLGFGYKGCS 502


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 127/373 (34%), Positives = 177/373 (47%), Gaps = 36/373 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PC  C+ Q  P ++P++SS+    SC S
Sbjct: 34  EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 92

Query: 83  EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C  L   SC S      Q C YTY Y D S+T G L  ++ TF  +      V FGCG
Sbjct: 93  TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 152

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKM 191
             N GVF  NE G+ G GR  LSL     SQL    FS+C       +P      + + +
Sbjct: 153 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL 208

Query: 192 YFGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
            F NG     G V +T L+    ++ + T Y+++L+GI+VG     S  +P   S+ A++
Sbjct: 209 -FSNGQ----GAVQTTPLIQYAKNEANPTLYYLSLKGITVG-----STRLPVPESAFALT 258

Query: 248 KGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-AP 304
            G     ID+G   T LP   Y  + ++    IKL        G   C+  PS A    P
Sbjct: 259 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVP 318

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            L  HF+G           F  P   G  + C A+   D +  I GNF Q ++ + YD  
Sbjct: 319 KLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHVLYDLQ 377

Query: 363 SQMVSFKPTDCTK 375
           + M+SF    C K
Sbjct: 378 NNMLSFVAAQCDK 390


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 119/363 (32%), Positives = 171/363 (47%), Gaps = 16/363 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY  +  +G+P    +Y ++DTGSD+ WVQC PC  CY+Q  P+++P+ S
Sbjct: 152 VVSGVGLGSGEYFSRVGVGSP-ARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 210

Query: 73  SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           +SY  ++C + +CH LD  +C +S   C Y   Y D S T G  ATE +T G+S     +
Sbjct: 211 TSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAP-VSS 269

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+ LG   LS  SQI     A  FSYCLV    DS  +S +
Sbjct: 270 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SATTFSYCLV--DRDSPSSSTL 322

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG+ ++     V +  + S    T+Y+V L GISVG    S  + P   +      G +
Sbjct: 323 QFGDAADAE---VTAPLIRSPRTSTFYYVGLSGISVGGQILS--IPPSAFAMDGTGAGGV 377

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            +D+G   T L    Y  L +      +  P          CY       +  P ++  F
Sbjct: 378 IVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRF 437

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GG ++ L   +  IP    G +C A  P +  V I GN  Q    + +D     V F  
Sbjct: 438 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTS 497

Query: 371 TDC 373
             C
Sbjct: 498 NKC 500


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 131/376 (34%), Positives = 190/376 (50%), Gaps = 36/376 (9%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V  + GEY+M   IGTPP      I+DTGSDL+W QC PC+ C  Q  P ++PA S SY 
Sbjct: 82  VLASEGEYLMSMGIGTPPRY-YSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYA 140

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--NVVF 134
           +L C S  C+ L    C  + +C Y Y Y DS+ T GVL+ E  TFG ++       + F
Sbjct: 141 KLPCNSPMCNALYYPLC-YRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAF 199

Query: 135 GCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           GCG+ N G +FN +  G+VG GR  LSL    +SQLG+ +FSYCL  F   S + S++YF
Sbjct: 200 GCGNLNAGSLFNGS--GMVGFGRGPLSL----VSQLGSPRFSYCLTSFM--SPVPSRLYF 251

Query: 194 G-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           G     N +  S G  V ++  +V+    T Y++ + GISVG      +L+P   S  AI
Sbjct: 252 GAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGG-----ELLPIDPSVFAI 306

Query: 247 SK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CY---KTP 297
           +     G + ID+G+  T L +  Y+ + +   + + L       L   L  C+     P
Sbjct: 307 NDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPP 366

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
                 P L  HF+ GA + L   +  +     G  C A+   D D  I G+F   +  +
Sbjct: 367 RKIVTMPELAFHFE-GANMELPLENYMLIDGDTGNLCLAIAASD-DGSIIGSFQHQNFHV 424

Query: 358 GYDFDSQMVSFKPTDC 373
            YD ++ ++SF P  C
Sbjct: 425 LYDNENSLLSFTPATC 440


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 178/344 (51%), Gaps = 42/344 (12%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
           S   G+Y+M+FSIG PPLL I+  VDTGSDLMWV+C PC  C     P+Y+PA S S  +
Sbjct: 81  SQKGGKYIMQFSIGEPPLL-IWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGK 139

Query: 78  LSCQSEQCHLLDTVSCSSQQ------LC--NYTYGYADSSLTKGVLATERITFGNSNNFF 129
           L C S+ C  L      S Q      LC  +Y YG++    T+GVL TE  TFG+     
Sbjct: 140 LPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGD-GYVA 198

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           +NV FG      G       GLVGLGR  LSL    +SQLGA +F+YCL     D ++ S
Sbjct: 199 NNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSL----VSQLGAGRFAYCLA---ADPNVYS 251

Query: 190 KMYFGN--GSEVSGGGVVSTSLVS--KEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            + FG+    + S G V ST LV+  K D+ T+Y+V L+GISVG        +P  + + 
Sbjct: 252 TILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGG-----SRLPIKDGTF 306

Query: 245 AIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           AI+    G +F D+GA  T L    Y  + + + + I+   Y     G   C+   +   
Sbjct: 307 AINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYD---AGDDTCFVAANQQA 363

Query: 302 IA--PILTAHFDGGAKVPL-----IHTSTFIPPPVEGVFCFAMQ 338
           +A  P L  HFD GA + L     + TST    P E + C A++
Sbjct: 364 VAQMPPLVLHFDDGADMSLNGRNYLKTST--KGPSEVLVCMAIK 405


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 189/384 (49%), Gaps = 30/384 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V +    ++GEY++ F+IGTP    +   +DTGSDL+W QC PC  C+ Q  P+++P+ S
Sbjct: 76  VTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVS 135

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNN- 127
           S+++ ++C    C     +S S+  L    C Y   Y D S+T G +  +  TF + N  
Sbjct: 136 STFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGE 195

Query: 128 -----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF- 181
                    + FGCG  NTGVF  NE G+ G GR  LSL     SQL   +FSYCL    
Sbjct: 196 GAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLP----SQLRVGRFSYCLTSHD 251

Query: 182 HTDSSITSKMYFG---NGSEV-SGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKL 236
            T+S+ TS ++ G   NG    S G   ST ++ S    T+Y+++LEGI+VG        
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTR----- 306

Query: 237 IPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQL 292
           +P  +S  A+ K   G   ID+G   T  P   + +L+ +    + L  Y +   +G+ L
Sbjct: 307 LPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLL 366

Query: 293 CYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-GVFCFAMQPIDGDVGIFGNFA 351
           C++ P      P+    F   +    +    +IP   + GV C  +   + D+ + GNF 
Sbjct: 367 CFQRPKGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQ 426

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
           Q ++ I YD ++  + F    C K
Sbjct: 427 QQNMHIVYDVENSKLLFASAQCDK 450


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 123/374 (32%), Positives = 182/374 (48%), Gaps = 40/374 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  ++GTPP   +  ++DTGSDL+W QC PC  C  Q  PI++P +SSSY+ + C  
Sbjct: 103 EYLVDLAVGTPPQ-PVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAG 161

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GNSNNFFDNVVFGC 136
           E C+ +   SC     C Y Y Y D + T+GV ATER TF      G +      + FGC
Sbjct: 162 ELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGC 221

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN- 195
           G  N G  N N  G+VG GR  LSL    +SQL   +FSYCL P+   S   S + FG+ 
Sbjct: 222 GTMNKGSLN-NGSGIVGFGRAPLSL----VSQLAIRRFSYCLTPYA--SGRKSTLLFGSL 274

Query: 196 --GSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVG----NLSNSSKLIPYYNSSGAIS 247
             G   +    V T+  L S+++ T+Y+V   G++VG     +  S+  +    S GAI 
Sbjct: 275 RGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAI- 333

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ--LCY-----KTPSMA 300
                +D+G   TL P      +    R+ ++L    +   G    +C+     + P  A
Sbjct: 334 -----VDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPA 388

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGY 359
            + P +  H   GA + L   +  +    +G  C  +    GD G   GNF Q D+ + Y
Sbjct: 389 -VVPRMVFHLQ-GADLDLPRRNYVLDDQRKGNLCLLLAD-SGDSGTTIGNFVQQDMRVLY 445

Query: 360 DFDSQMVSFKPTDC 373
           D ++  +SF P  C
Sbjct: 446 DLEADTLSFAPAQC 459


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 34/371 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+  ++GTPP   I  ++DTGSDL+W QC  C  C +Q  P+++P  SSSY+ + C  
Sbjct: 97  EYVLDLAVGTPPQ-PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV--FGCGHNN 140
           + C  +   SC     C Y Y Y D + T G  ATER TF +S+    +V   FGCG  N
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMN 215

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV- 199
            G  N N  G+VG GR  LSL    +SQL   +FSYCL P+   SS  S + FG+ ++V 
Sbjct: 216 VGSLN-NASGIVGFGRDPLSL----VSQLSIRRFSYCLTPYA--SSRKSTLQFGSLADVG 268

Query: 200 ---SGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KGNM 251
                 G V T+  L S ++ T+Y+V   G++VG     ++ +    S+ A+     G +
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVG-----ARRLRIPASAFALRPDGSGGV 323

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-------KTPSMAG--I 302
            ID+G   TL P      +    R+ ++L           +C+           MA    
Sbjct: 324 IIDSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVA 383

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            P +  HF  GA + L   +  +     G  C  +     D    GNF Q D+ + YD +
Sbjct: 384 VPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLE 442

Query: 363 SQMVSFKPTDC 373
            + +SF P +C
Sbjct: 443 RETLSFAPVEC 453


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 173/366 (47%), Gaps = 22/366 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY  +  +G P    +Y ++DTGSD+ W+QC PC  CY Q  P+Y+P+ S
Sbjct: 152 VVSGVGQGSGEYFSRVGVGRP-ARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVS 210

Query: 73  SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           +SY  + C S +C  LD  +C +S   C Y   Y D S T G  ATE +T G+S     N
Sbjct: 211 TSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAP-VSN 269

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+ LG   LS  SQI     A  FSYCLV    DS  +S +
Sbjct: 270 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SATTFSYCLV--DRDSPSSSTL 322

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---K 248
            FG+  + +   V +  + S    T+Y+V L GISVG  + S   IP  +S+ A+     
Sbjct: 323 QFGDSEQPA---VTAPLIRSPRTNTFYYVALSGISVGGEALS---IP--SSAFAMDDAGS 374

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
           G + +D+G   T L    Y  L E      +  P          CY     + +  P + 
Sbjct: 375 GGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVA 434

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
             F+GG ++ L   +  IP    G +C A     G V I GN  Q  + + +D     V 
Sbjct: 435 LWFEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVG 494

Query: 368 FKPTDC 373
           F    C
Sbjct: 495 FTADKC 500


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 117/363 (32%), Positives = 170/363 (46%), Gaps = 16/363 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY  +  +G+P    +Y ++DTGSD+ WVQC PC  CY+Q  P+++P+ S
Sbjct: 156 VVSGVGLGSGEYFSRVGVGSP-ARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214

Query: 73  SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           +SY  ++C + +CH LD  +C +S   C Y   Y D S T G  ATE +T G+S     +
Sbjct: 215 TSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAP-VSS 273

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L   G   LS  SQI     A  FSYCLV    DS  +S +
Sbjct: 274 VAIGCGHDNEGLFVGAAGLLALGGGP-LSFPSQI----SATTFSYCLV--DRDSPSSSTL 326

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG+ ++     V +  + S    T+Y+V L G+SVG    S  + P   +  +   G +
Sbjct: 327 QFGDAADAE---VTAPLIRSPRTSTFYYVGLSGLSVGGQILS--IPPSAFAMDSTGAGGV 381

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            +D+G   T L    Y  L +      +  P          CY       +  P ++  F
Sbjct: 382 IVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRF 441

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GG ++ L   +  IP    G +C A  P +  V I GN  Q    + +D     V F  
Sbjct: 442 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTT 501

Query: 371 TDC 373
             C
Sbjct: 502 NKC 504


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 189/381 (49%), Gaps = 26/381 (6%)

Query: 2   SPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK 61
           +P T  + ++VV S +S  +GEY  +  +GTP    +Y ++DTGSD++W+QC PC +CY 
Sbjct: 121 APRTGGFSSSVV-SGLSQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRCYS 178

Query: 62  QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERI 120
           Q  PI++P  S +Y  + C S  C  LD+  C++++  C Y   Y D S T G  +TE +
Sbjct: 179 QSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL 238

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           TF    N    V  GCGH+N G+F      L+GLG+ +LS   Q   +    KFSYCLV 
Sbjct: 239 TF--RRNRVKGVALGCGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFN-QKFSYCLVD 294

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
             + SS  S + FGN + VS     +  L + +  T+Y+V L GISVG        +P  
Sbjct: 295 -RSASSKPSSVVFGNAA-VSRIARFTPLLSNPKLDTFYYVELLGISVGGTR-----VPGV 347

Query: 241 NSS----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLC 293
            +S      I  G + ID+G   T L +  Y  + +  R    A+K  P  D  L    C
Sbjct: 348 AASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAP--DFSL-FDTC 404

Query: 294 YKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           +   +M  +  P +  HF  GA V L  T+  IP    G FCFA     G + I GN  Q
Sbjct: 405 FDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQ 463

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
               + YD  S  V F P  C
Sbjct: 464 QGFRVVYDLASSRVGFAPGGC 484


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/302 (36%), Positives = 167/302 (55%), Gaps = 27/302 (8%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P    +  V+ ++GEY++  +IGTPPL     I+DTGSDL+W QC PC+ C  Q  P ++
Sbjct: 74  PITAARVLVTASSGEYLVDLAIGTPPLY-YTAIMDTGSDLIWTQCAPCLLCADQPTPYFD 132

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
              S++Y+ L C+S +C  L + SC  +++C Y Y Y D++ T GVLA E  TFG +N+ 
Sbjct: 133 VKKSATYRALPCRSSRCASLSSPSC-FKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191

Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
                N+ FGCG  N G    N  G+VG GR  LSL    +SQLG ++FSYCL  +   S
Sbjct: 192 KVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGPLSL----VSQLGPSRFSYCLTSYL--S 244

Query: 186 SITSKMYFG-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
           +  S++YFG     + +  S G  V ++  +++      YF++L+ IS+G     +KL+P
Sbjct: 245 ATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLG-----TKLLP 299

Query: 239 YYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
                 AI+    G + ID+G   T L +D Y  +   + +AI LT   D  +G   C++
Sbjct: 300 IDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQ 359

Query: 296 TP 297
            P
Sbjct: 360 WP 361


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 178/369 (48%), Gaps = 30/369 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  +  +GEY  +  +G P     Y ++DTGSD+ W+QC PC  CY+Q  PI++P +S
Sbjct: 146 VSSGTAQGSGEYFSRVGVGQPSK-PFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTAS 204

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SSY  L+C ++QC  L+  +C + + C Y   Y D S T G   TE ++FG  +   + V
Sbjct: 205 SSYNPLTCDAQQCQDLEMSACRNGK-CLYQVSYGDGSFTVGEYVTETVSFGAGS--VNRV 261

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F  +   L             + SQ+ A  FSYCLV    DS  +S + 
Sbjct: 262 AIGCGHDNEGLFVGSAGLL-----GLGGGPLSLTSQIKATSFSYCLV--DRDSGKSSTLE 314

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           F   S   G  VV+  L +++  T+Y+V L G+SVG      +++     + A+ +   G
Sbjct: 315 F--NSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGG-----EIVTVPPETFAVDQSGAG 367

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL-TPYQDPRLGSQL---CYKTPSMAGI-AP 304
            + +D+G   T L    YN     VR+A K  T    P  G  L   CY   S+  +  P
Sbjct: 368 GVIVDSGTAITRLRTQAYN----SVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVP 423

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            ++ HF G     L   +  IP    G +CFA  P    + I GN  Q    + +D  + 
Sbjct: 424 TVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANS 483

Query: 365 MVSFKPTDC 373
           +V F P  C
Sbjct: 484 LVGFSPNKC 492


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 130/371 (35%), Positives = 186/371 (50%), Gaps = 28/371 (7%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V  ++GEY+M+  IGTP       I+DTGSDL+W QC PC+ C  Q  P ++PA+SS+Y+
Sbjct: 85  VLASDGEYLMEMGIGTPARF-YSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYR 143

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVF 134
            L C +  C+ L    C  Q+ C Y Y Y DS+ T GVLA E  TFG ++       + F
Sbjct: 144 SLGCSAPACNALYYPLC-YQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISF 202

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCG+ N G    N  G+VG GR  LSL    +SQLG+ +FSYCL  F   S + S++YFG
Sbjct: 203 GCGNLNAGSL-ANGSGMVGFGRGSLSL----VSQLGSPRFSYCLTSFL--SPVRSRLYFG 255

Query: 195 NGSEV---SGGGVVSTS-LVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISK 248
             + +   +   V ST  +++    T YF+ + GISVG   L     ++   ++ G    
Sbjct: 256 AYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG---T 312

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT-PYQDPRLGSQL--CYKT---PSMAGI 302
           G   ID+G   T L +  Y  + E     +  T P  D    S L  C++    P  +  
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT 372

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            P L  HFD GA   L   +  +  P  G  C AM     D  I G++   +  + YD +
Sbjct: 373 LPQLVLHFD-GADWELPLQNYMLVDPSTGGLCLAMA-TSSDGSIIGSYQHQNFNVLYDLE 430

Query: 363 SQMVSFKPTDC 373
           + ++SF P  C
Sbjct: 431 NSLLSFVPAPC 441


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 164/338 (48%), Gaps = 18/338 (5%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSC-SSQQLC 99
           ++DTGSD+ WVQC PC  CY+Q  P+++P+ S+SY  +SC S++C  LDT +C ++   C
Sbjct: 2   VLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGAC 61

Query: 100 NYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
            Y   Y D S T G  ATE +T G+S     NV  GCGH+N G+F      L+ LG   L
Sbjct: 62  LYEVAYGDGSYTVGDFATETLTLGDSTP-VGNVAIGCGHDNEGLFVGAAG-LLALGGGPL 119

Query: 160 SLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYY 218
           S  SQI     A+ FSYCLV    DS   S + FG+G+  +  G V+  LV S    T+Y
Sbjct: 120 SFPSQI----SASTFSYCLV--DRDSPAASTLQFGDGAAEA--GTVTAPLVRSPRTSTFY 171

Query: 219 FVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
           +V L GISVG   LS  +       +SG+   G + +D+G   T L    Y  L +    
Sbjct: 172 YVALSGISVGGQPLSIPASAFAMDATSGS---GGVIVDSGTAVTRLQSAAYAALRDAFVQ 228

Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
                P          CY       +  P ++  F+GG  + L   +  IP    G +C 
Sbjct: 229 GAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCL 288

Query: 336 AMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A  P +  V I GN  Q    + +D     V F P  C
Sbjct: 289 AFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 125/365 (34%), Positives = 173/365 (47%), Gaps = 16/365 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC  CY Q  P++NP  S
Sbjct: 31  VISGLAQGSGEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKS 89

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            S+ ++ C++  C  L++  C+ +Q C Y   Y D S T G   TE +TF  +    + V
Sbjct: 90  GSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK--VEQV 147

Query: 133 VFGCGHNNTGVF--NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
             GCGH+N G+F      +GL   G +  S A +  +Q    KFSYCLV   + SS  S 
Sbjct: 148 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ----KFSYCLVD-RSASSKPSS 202

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FGN S VS     +  L +    T+Y+V L GISVG    S     ++        G 
Sbjct: 203 VVFGN-SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLD-RTGNGG 260

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
           + ID G   T L K  Y  L +  R  A  L    +  L    CY       +  P +  
Sbjct: 261 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLSGKTTVKVPTVVL 319

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  GA V L  ++  IP    G FCFA       + I GN  Q    + YD  S  V F
Sbjct: 320 HFR-GADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGF 378

Query: 369 KPTDC 373
            P  C
Sbjct: 379 SPRGC 383


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 175/369 (47%), Gaps = 26/369 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP    Y ++D+GSD++WVQC PC QCY Q  P+++PA S
Sbjct: 129 VISGMEQGSGEYFVRIGVGSPPRSQ-YMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADS 187

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  +SC S  C  L+   C + + C Y   Y D S TKG LA E +TFG +     +V
Sbjct: 188 ASFTGVSCSSSVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTFGRT--MVRSV 244

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +S   Q+  Q G   FSYCLV   TDSS    + 
Sbjct: 245 AIGCGHRNRGMFVGAAGLLGLGGGS-MSFVGQLGGQTGG-AFSYCLVSRGTDSS--GSLV 300

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           FG  +  +G   V   + +    ++Y++ L G+ VG +      +P       +++   G
Sbjct: 301 FGREALPAGAAWVPL-VRNPRAPSFYYIGLAGLGVGGIR-----VPISEEVFRLTELGDG 354

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-AP 304
            + +DTG   T LP   Y    +  R+A        PR         CY       +  P
Sbjct: 355 GVVMDTGTAVTRLPTLAY----QAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVP 410

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            ++ +F GG  + L   +  IP    G FCFA  P    + I GN  Q  + I +D  + 
Sbjct: 411 TVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANG 470

Query: 365 MVSFKPTDC 373
            V F P  C
Sbjct: 471 YVGFGPNIC 479


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/377 (33%), Positives = 181/377 (48%), Gaps = 22/377 (5%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N  V S +   +GEY ++  +GTP    ++ +VDTGSDL W+QC PC  CYKQ  PI++P
Sbjct: 115 NGPVTSGLLYGSGEYFVRLGVGTPAR-SLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDP 173

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNS 125
            +SSS++ + C S  C  L+  SCS  +     C+Y   Y D S + G  +++  T G  
Sbjct: 174 RNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTG 233

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQIL----SQLGANKFSYCLVPF 181
           +    +V FGCG +N G+      GL+GLG  +LS  SQI     +   AN FSYCLV  
Sbjct: 234 SKAM-SVAFGCGFDNEGL-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
               + +S       + +     +S  L + +  T+Y+  + G+SVG        +P   
Sbjct: 292 SNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQ-----LPISL 346

Query: 242 SSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTP 297
            S  +S+   G + ID+G   T  P   Y  + +  RNA    P   PR      CY   
Sbjct: 347 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLP-SAPRYSLFDTCYNFS 405

Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
             A +  P L  HF+ GA + L  T+  IP    G FC A  P   ++GI GN  Q    
Sbjct: 406 GKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFR 465

Query: 357 IGYDFDSQMVSFKPTDC 373
           IG+D     ++F P  C
Sbjct: 466 IGFDLQKSHLAFAPQQC 482


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 137/383 (35%), Positives = 195/383 (50%), Gaps = 45/383 (11%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           QS ++ + G Y++K S+GTPP  +I  + D   DL W+ C  C  C K     + P+ SS
Sbjct: 87  QSELNFSKGNYLIKISVGTPPA-EILALADITGDLTWLPCKTCQDCTKDGFTFF-PSESS 144

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYG----YADSSLTKGVLATERITFGNSNN-- 127
           +Y   +C+S QC + +   C ++ +C Y  G       S   KG++A + I+F +S+   
Sbjct: 145 TYTSAACESYQCQITNGAVCQTK-MCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQA 203

Query: 128 -FFDNVVFGCGHNNTGVFNENEMG--LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
             + N  F CG   T + N + +G  +VGLGR   S+ SQ +  L    FS CLVP+ + 
Sbjct: 204 LSYPNTNFICG---TFIDNWHYIGAGIVGLGRGLFSMTSQ-MKHLINGTFSQCLVPYSSK 259

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNS 242
            S  SK+ FG    VSG GVVST +    +   YF+ LE +SVG   ++N+    P    
Sbjct: 260 QS--SKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAP---- 313

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP--YQDPRLGSQLCYKTPSMA 300
                K N++ID     T LP DFY  +E +VR AI LTP  Y + R  S LCYK+ S  
Sbjct: 314 -----KSNIYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLS-LCYKSESDH 367

Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV--------GIFGNFA 351
              AP +T HF   A V L   +TF+      V CFA   +DG           ++G++ 
Sbjct: 368 DFDAPPITMHFT-NADVQLSPLNTFVRMDWN-VVCFAF--LDGTFNATKRITHAVYGSWQ 423

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
           Q +  +GYD  S  VSFK  DCT
Sbjct: 424 QMNFIVGYDLKSSTVSFKQADCT 446


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 34/371 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+  ++GTPP   I  ++DTGSDL+W QC  C  C +Q  P+++P  SSSY+ + C  
Sbjct: 97  EYVLDLAVGTPPQ-PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV--FGCGHNN 140
           + C  +   SC     C Y Y Y D + T G  ATER TF +S+    +V   FGCG  N
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMN 215

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV- 199
            G  N N  G+VG GR  LSL    +SQL   +FSYCL P+   SS  S + FG+ ++V 
Sbjct: 216 VGSLN-NASGIVGFGRDPLSL----VSQLSIRRFSYCLTPYA--SSRKSTLQFGSLADVG 268

Query: 200 ---SGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KGNM 251
                 G V T+  L S ++ T+Y+V   G++VG     ++ +    S+ A+     G +
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVG-----ARRLRIPASAFALRPDGSGGV 323

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-------KTPSMAG--I 302
            ID+G   TL P      +    R+ ++L           +C+           MA    
Sbjct: 324 IIDSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVA 383

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            P +  HF  GA + L   +  +     G  C  +     D    GNF Q D+ + YD +
Sbjct: 384 VPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLE 442

Query: 363 SQMVSFKPTDC 373
            + +SF P +C
Sbjct: 443 RETLSFAPVEC 453


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 173/364 (47%), Gaps = 24/364 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  +  +GEY  +  IG P   ++Y ++DTGSD+ W+QC PC  CY Q +PI+ P+SSSS
Sbjct: 139 SGTTQGSGEYFTRVGIGKPAR-EVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSS 197

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y+ LSC + QC+ L+   C +   C Y   Y D S T G  ATE +T G++     NV  
Sbjct: 198 YEPLSCDTPQCNALEVSECRNAT-CLYEVSYGDGSYTVGDFATETLTIGST--LVQNVAV 254

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCGH+N G+F                    + SQL    FSYCLV   +DS+ T      
Sbjct: 255 GCGHSNEGLFVGAAG-----LLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVDF--- 306

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
            G+ +S   VV+  L + +  T+Y++ L GISVG      +L+    SS  + +   G +
Sbjct: 307 -GTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGG-----ELLQIPQSSFEMDESGSGGI 360

Query: 252 FIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAH 309
            ID+G   T L  + YN L +  V+  + L       +    CY   +   +  P +  H
Sbjct: 361 IIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAM-FDTCYNLSAKTTVEVPTVAFH 419

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F GG  + L   +  IP    G FC A  P    + I GN  Q    + +D  + ++ F 
Sbjct: 420 FPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFS 479

Query: 370 PTDC 373
              C
Sbjct: 480 SNKC 483


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 127/367 (34%), Positives = 177/367 (48%), Gaps = 19/367 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY  +  +GTP    +Y ++DTGSD++W+QC PC +CY Q  PI++P  S
Sbjct: 131 VVSGLSQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            +Y  + C S  C  LD+  C++++  C Y   Y D S T G  +TE +TF    N    
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF--RRNRVKG 247

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+GLG+ +LS   Q   +    KFSYCLV   + SS  S +
Sbjct: 248 VALGCGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFN-QKFSYCLVD-RSASSKPSSV 304

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
            FGN + VS     +  L + +  T+Y+V L GISVG        +P   +S      I 
Sbjct: 305 VFGNAA-VSRIARFTPLLSNPKLDTFYYVGLLGISVGGTR-----VPGVTASLFKLDQIG 358

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
            G + ID+G   T L +  Y  + +  R   K             C+   +M  +  P +
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTV 418

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
             HF  GA V L  T+  IP    G FCFA     G + I GN  Q    + YD  S  V
Sbjct: 419 VLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 477

Query: 367 SFKPTDC 373
            F P  C
Sbjct: 478 GFAPGGC 484


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 125/365 (34%), Positives = 173/365 (47%), Gaps = 16/365 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC  CY Q  P++NP  S
Sbjct: 118 VISGLAQGSGEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKS 176

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            S+ ++ C++  C  L++  C+ +Q C Y   Y D S T G   TE +TF  +    + V
Sbjct: 177 GSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK--VEQV 234

Query: 133 VFGCGHNNTGVF--NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
             GCGH+N G+F      +GL   G +  S A +  +Q    KFSYCLV   + SS  S 
Sbjct: 235 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ----KFSYCLVD-RSASSKPSS 289

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FGN S VS     +  L +    T+Y+V L GISVG    S     ++        G 
Sbjct: 290 VVFGN-SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLD-RTGNGG 347

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
           + ID G   T L K  Y  L +  R  A  L    +  L    CY       +  P +  
Sbjct: 348 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLSGKTTVKVPTVVL 406

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  GA V L  ++  IP    G FCFA       + I GN  Q    + YD  S  V F
Sbjct: 407 HFR-GADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGF 465

Query: 369 KPTDC 373
            P  C
Sbjct: 466 SPRGC 470


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 127/364 (34%), Positives = 177/364 (48%), Gaps = 11/364 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC +CY Q  PI+NP  S
Sbjct: 99  VVSGLSQGSGEYFTRLGVGTPPRY-LYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKS 157

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+  + C S  C  LD+  CS+++  C Y   Y D S T G  ATE +TF    N    
Sbjct: 158 KSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTF--RGNKIAK 215

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+GLGR RLS  SQ   +   +KFSYCLV   + SS  S M
Sbjct: 216 VALGCGHHNEGLFVGAAG-LLGLGRGRLSFPSQTGIRFN-HKFSYCLVD-RSASSKPSSM 272

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG+ + +S     +  + + +  T+Y+V L GISVG +     + P      +   G +
Sbjct: 273 VFGDAA-ISRLARFTPLIRNPKLDTFYYVGLIGISVGGV-RVRGVSPSLFKLDSAGNGGV 330

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            ID+G   T L +  Y  L +  R   +             CY     + +  P +  HF
Sbjct: 331 IIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHF 390

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             GA + L  T+  IP    G FCFA       + I GN  Q    + YD     + F P
Sbjct: 391 R-GADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAP 449

Query: 371 TDCT 374
             CT
Sbjct: 450 RGCT 453


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 176/368 (47%), Gaps = 24/368 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY ++  +G+PP    Y ++D+GSD++WVQC PC QCY Q  P+++PA S
Sbjct: 32  VVSGMNQGSGEYFVRIGLGSPPRSQ-YMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADS 90

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  +SC S  C  ++   C+S + C Y   Y D S TKG LA E +TFG +     NV
Sbjct: 91  ASFMGVSCSSAVCDRVENAGCNSGR-CRYEVSYGDGSYTKGTLALETLTFGRT--VVRNV 147

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L   G + +S   Q+  Q G N FSYCLV   T+    +  +
Sbjct: 148 AIGCGHSNRGMFVGAAGLLGLGGGS-MSFMGQLSGQTG-NAFSYCLVSRGTN----TNGF 201

Query: 193 FGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLS-NSSKLIPYYNSSGAISKGN 250
              GSE    G     LV      ++Y++ L G+ VG+     S+ +   N  G+   G 
Sbjct: 202 LEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGS---GG 258

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-API 305
           + +DTG   T  P   Y    E  RNA        PR         CY       +  P 
Sbjct: 259 VVMDTGTAVTRFPTVAY----EAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPT 314

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           ++ +F GG  + +   +  IP    G FCFA  P    + I GN  Q  + I  D  ++ 
Sbjct: 315 VSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEF 374

Query: 366 VSFKPTDC 373
           V F P  C
Sbjct: 375 VGFGPNIC 382


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 132/374 (35%), Positives = 186/374 (49%), Gaps = 33/374 (8%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V  ++GEY+M+  IGTP       I+DTGSDL+W QC PC+ C  Q  P ++PA S++Y+
Sbjct: 83  VLASDGEYLMEMGIGTPTRY-YSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYR 141

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF--FDNVVF 134
            L C S  C+ L    C  Q++C Y Y Y DS+ T GVLA E  TFG +        + F
Sbjct: 142 SLGCASPACNALYYPLC-YQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF 200

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCG+ N G+   N  G+VG GR  LSL    +SQLG+ +FSYCL  F   S + S++YFG
Sbjct: 201 GCGNLNAGLL-ANGSGMVGFGRGSLSL----VSQLGSPRFSYCLTSFL--SPVPSRLYFG 253

Query: 195 -----NGSEVSGGGVVSTS-LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
                N +  S   V ST  +V+    T YF+ + GISVG       L+P   +  AI+ 
Sbjct: 254 VYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG-----YLLPIDPAVFAIND 308

Query: 249 ----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK---TPSM 299
               G   ID+G   T L +  Y+ +     + I L P  +    S L  C++    P  
Sbjct: 309 TDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQ 367

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           +   P L  HFDG      +     + P   G  C AM     D  I G++   +  + Y
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLY 426

Query: 360 DFDSQMVSFKPTDC 373
           D ++ ++SF P  C
Sbjct: 427 DLENSLMSFVPAPC 440


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/363 (33%), Positives = 181/363 (49%), Gaps = 25/363 (6%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
            + TAN  Y++   +GTP   D+  + DTGSDL WVQC PC  CYKQ  P+++P+ S++Y
Sbjct: 182 RLGTAN--YIVSVGLGTP-RRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTY 238

Query: 76  KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
             + C +++C  LD+ +CSS + C Y   Y D S T G LA + +T G S++     VFG
Sbjct: 239 SAVPCGAQEC--LDSGTCSSGK-CRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFG 295

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG ++TG+F   + GL GLGR R+SLASQ  ++ GA  FSYCL      SS  ++ Y   
Sbjct: 296 CGDDDTGLFGRAD-GLFGLGRDRVSLASQAAARYGAG-FSYCL-----PSSWRAEGYLSL 348

Query: 196 GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISKGNMFI 253
           GS  +      T++V++ D  ++Y++ L GI V     + ++ P  + + G +      I
Sbjct: 349 GSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAG--RTVRVAPAVFKAPGTV------I 400

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
           D+G   T LP   Y+ L       ++             CY       +  P +   FDG
Sbjct: 401 DSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDG 460

Query: 313 GAKVPL-IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           GA + L      ++    +    FA    D  VGI GN  Q    + YD  +Q + F   
Sbjct: 461 GATLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAK 520

Query: 372 DCT 374
            C+
Sbjct: 521 GCS 523


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 125/377 (33%), Positives = 181/377 (48%), Gaps = 22/377 (5%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N  V S +   +GEY ++  +GTP    ++ +VDTGSDL W+QC PC  CYKQ  PI++P
Sbjct: 40  NGPVTSGLLYGSGEYFVRLGLGTPAR-SLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDP 98

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNS 125
            +SSS++ + C S  C  L+  SCS  +     C+Y   Y D S + G  +++  T G  
Sbjct: 99  RNSSSFQRIPCLSPLCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTG 158

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQIL----SQLGANKFSYCLVPF 181
           +    +V FGCG +N G+      GL+GLG  +LS  SQI     +   AN FSYCLV  
Sbjct: 159 SKAM-SVAFGCGFDNEGL-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 216

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
               + +S       + +     +S  L + +  T+Y+  + G+SVG        +P   
Sbjct: 217 SNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQ-----LPISL 271

Query: 242 SSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTP 297
            S  +S+   G + ID+G   T  P   Y  + +  RNA    P   PR      CY   
Sbjct: 272 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLP-SAPRYSLFDTCYNFS 330

Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
             A +  P L  HF+ GA + L  T+  IP    G FC A  P   ++GI GN  Q    
Sbjct: 331 GKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFR 390

Query: 357 IGYDFDSQMVSFKPTDC 373
           IG+D     ++F P  C
Sbjct: 391 IGFDLQKSHLAFAPQQC 407


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 122/367 (33%), Positives = 175/367 (47%), Gaps = 16/367 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S +S  +GEY  +  IG P     Y  +DTGSD+ W+QC PC  CY QV PIY+P++S
Sbjct: 1   ISSGLSLGSGEYFARMGIGNP-QRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNS 59

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDN 131
           SSY+ + C S  C  LD  +C     C+Y   Y DSS + G L  E    G NS+    N
Sbjct: 60  SSYRRVYCGSALCQALDYSACQGMG-CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRN 118

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD-SSITSK 190
           + FGCGH+N+G+F      L+G+G   LS  SQI + +G   FSYCLV  ++   S +S 
Sbjct: 119 IAFGCGHSNSGLFRGEAG-LLGMGGGTLSFFSQIAASIGP-AFSYCLVDRYSQLQSRSSP 176

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG  + +      +  L +    T+Y+  L GISVG       + P   +      G 
Sbjct: 177 LIFGR-TAIPFAARFTPLLKNPRINTFYYAVLTGISVGG--TPLPIPPAQFALTGNGTGG 233

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PIL 306
             +D+G   T +    Y  L +  R A +  P   P  G  L   C+    +  +  P L
Sbjct: 234 AILDSGTSVTRVVPPAYAVLRDAYRAASRNLP---PAPGVYLLDTCFNFQGLPTVQIPSL 290

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
             HFD G  + L   +  IP    G FC A  P    + + GN  Q    IG+D    ++
Sbjct: 291 VLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLI 350

Query: 367 SFKPTDC 373
           +  P +C
Sbjct: 351 AIAPREC 357


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 122/378 (32%), Positives = 183/378 (48%), Gaps = 49/378 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+  +IGTPP   +  ++DTGSDL+W QC PC  C  Q  P++ P  S+SY+ + C  
Sbjct: 101 EYVVDLAIGTPPQ-PVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAG 159

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV-----FGCG 137
           + C  +    C     C Y Y Y D ++T GV ATER TF +S    D ++     FGCG
Sbjct: 160 QLCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGG--DRLMTVPLGFGCG 217

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N G  N N  G+VG GR  LSL    +SQL   +FSYCL  +   S   S + FG+  
Sbjct: 218 SMNVGSLN-NGSGIVGFGRNPLSL----VSQLSIRRFSYCLTSY--GSGRKSTLLFGS-- 268

Query: 198 EVSGG------GVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
            +SGG      G V T+  L S ++ T+Y+V L G++VG     ++ +    S+ A+   
Sbjct: 269 -LSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVG-----ARRLRIPESAFALRPD 322

Query: 248 -KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAGIA 303
             G + +D+G   TLLP      +    R  ++L P+    +P  G  +C+  P+    +
Sbjct: 323 GSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRL-PFANGGNPEDG--VCFLVPAAWRRS 379

Query: 304 --------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
                   P +  HF   A + L   +  +    +G  C  +     D    GN  Q D+
Sbjct: 380 SSTSQVPVPRMVFHFQ-DADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDM 438

Query: 356 FIGYDFDSQMVSFKPTDC 373
            + YD +++ +SF P  C
Sbjct: 439 RVLYDLEAETLSFAPAQC 456


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 24/345 (6%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT------VSCS 94
           IVDTGSDL WVQC PC +CY Q  P++NP++S SY+ + C S  C  L +      V  S
Sbjct: 149 IVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGS 208

Query: 95  SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
           +   CNY   Y D S T+G L TE +  GNS    +N +FGCG NN G+F     GLVGL
Sbjct: 209 NPPSCNYVVNYGDGSYTRGELGTEHLDLGNSTA-VNNFIFGCGRNNQGLFG-GASGLVGL 266

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE 213
           GR+ LSL SQ  +  G   FSYCL    T++S  S +  GN S       +S T ++   
Sbjct: 267 GRSSLSLISQTSAMFGG-VFSYCLPITETEAS-GSLVMGGNSSVYKNTTPISYTRMIPNP 324

Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
              +YF+ L GI+VG+++  +   P +   G      M ID+G   T LP   Y  L+++
Sbjct: 325 QLPFYFLNLTGITVGSVAVQA---PSFGKDG------MMIDSGTVITRLPPSIYQALKDE 375

Query: 274 VRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV 332
                   P     +    C+       +  P +  HF+G A++ +  T  F     +  
Sbjct: 376 FVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDAS 435

Query: 333 -FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             C A+  +  + +VGI GN+ Q +  + YD    M+ F    CT
Sbjct: 436 QVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEACT 480


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 124/378 (32%), Positives = 176/378 (46%), Gaps = 47/378 (12%)

Query: 7   FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
           + P ++V      V   +GEY ++  +G+PP  D Y +VD+GSD++WVQC PC QCY Q 
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
            P+++PA+SSS+  +SC S  C  L    C        C+Y+  Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           T G +      V  GCGH N+G+F     GL+GLG   +SL  Q+    G   FSYCL  
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAG-GVFSYCL-- 282

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                   S+   G GS  S               ++Y+V L GI VG      + +P  
Sbjct: 283 -------ASRGAGGAGSLAS---------------SFYYVGLTGIGVGG-----ERLPLQ 315

Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
           +S   +++   G + +DTG   T LP++ Y  L      A+   P          CY   
Sbjct: 316 DSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLS 375

Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDL 355
             A +  P ++ +FD GA + L   +  +   V G VFC A  P    + I GN  Q  +
Sbjct: 376 GYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQEGI 433

Query: 356 FIGYDFDSQMVSFKPTDC 373
            I  D  +  V F P  C
Sbjct: 434 QITVDSANGYVGFGPNTC 451


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 121/375 (32%), Positives = 179/375 (47%), Gaps = 41/375 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+  +IGTPP   +  ++DTGSDL+W QC PC  C  Q  P++ P  S+SY+ + C  
Sbjct: 95  EYVVDLAIGTPPQ-PVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAG 153

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV-----FGCG 137
             C  +   SC     C Y Y Y D ++T GV ATER TF +S             FGCG
Sbjct: 154 TLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCG 213

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N G  N N  G+VG GR  LSL    +SQL   +FSYCL  +   S   S + FG+ S
Sbjct: 214 SVNVGSLN-NGSGIVGFGRNPLSL----VSQLSIRRFSYCLTSYA--SRRQSTLLFGSLS 266

Query: 198 E-VSG---GGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KG 249
           + V G   G V +T L+ S ++ T+Y+V   G++VG     ++ +    S+ A+     G
Sbjct: 267 DGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVG-----ARRLRIPESAFALRPDGSG 321

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAGIA--- 303
            + +D+G   TLLP      +    R  ++L P+    +P  G  +C+  P+    +   
Sbjct: 322 GVIVDSGTALTLLPAAVLAEVVRAFRQQLRL-PFANGGNPEDG--VCFLVPAAWRRSSST 378

Query: 304 -----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
                P +  HF  GA + L   +  +     G  C  +     D    GN  Q D+ + 
Sbjct: 379 SQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVL 437

Query: 359 YDFDSQMVSFKPTDC 373
           YD +++ +S  P  C
Sbjct: 438 YDLEAETLSIAPARC 452


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 174/366 (47%), Gaps = 28/366 (7%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  +  +GEY  +  IG P   ++Y ++DTGSD+ W+QC PC  CY Q +PI+ P+SSSS
Sbjct: 142 SGTTQGSGEYFTRVGIGNPAR-EVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSS 200

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y+ LSC + QC+ L+   C +   C Y   Y D S T G  ATE +T G++     NV  
Sbjct: 201 YEPLSCDTPQCNALEVSECRNAT-CLYEVSYGDGSYTVGDFATETLTIGST--LVQNVAV 257

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCGH+N G+F      L             + SQL    FSYCLV   +DS+ T +    
Sbjct: 258 GCGHSNEGLFVGAAGLL-----GLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVEF--- 309

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
            G+ +    VV+  L + +  T+Y++ L GISVG      +L+    SS  + +   G +
Sbjct: 310 -GTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGG-----ELLQIPQSSFEMDESGSGGI 363

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-APILT 307
            ID+G   T L    YN L +     +K T   +   G  +   CY   +   I  P + 
Sbjct: 364 IIDSGTAVTRLQTGIYNSLRDSF---LKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVA 420

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF GG  + L   +  IP    G FC A  P    + I GN  Q    + +D  + ++ 
Sbjct: 421 FHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIG 480

Query: 368 FKPTDC 373
           F    C
Sbjct: 481 FSSNKC 486


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 131/387 (33%), Positives = 188/387 (48%), Gaps = 38/387 (9%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYN 68
           + ++++      G Y M  S+GTPPL     I+DTGSDL W QC PC   C+ Q  P+Y+
Sbjct: 82  HGLLEALAENGAGAYHMILSVGTPPLA-FPAIIDTGSDLTWTQCAPCTTACFAQPTPLYD 140

Query: 69  PASSSSYKELSCQSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITF---- 122
           PA SS++ +L C S  C  L +   +C++   C Y Y YA    T G LA + +      
Sbjct: 141 PARSSTFSKLPCASPLCQALPSAFRACNATG-CVYDYRYA-VGFTAGYLAADTLAIGDGD 198

Query: 123 --GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
             G++++ F  V FGC   N G   +   G+VGLGR+ LSL    LSQ+G  +FSYCL  
Sbjct: 199 GDGDASSSFAGVAFGCSTANGGDM-DGASGIVGLGRSALSL----LSQIGVGRFSYCL-- 251

Query: 181 FHTDSSI-TSKMYFGNGSEVSGGGVVSTSLV-----SKEDKTYYFVTLEGISVGNLSNSS 234
             +D+    S + FG  + V+G  V ST+L+     ++    YY+V L GI+VG     S
Sbjct: 252 -RSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVG-----S 305

Query: 235 KLIPYYNSS---GAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLG 289
             +P  +S+    A   G + +D+G   T L +  Y  L +    + A  LT     +  
Sbjct: 306 TDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFD 365

Query: 290 SQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFG 348
             LC++  +     P L   F GGA+  +   S F      G V C  + P  G V + G
Sbjct: 366 FDLCFEAGAADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIG 424

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           N  Q DL + YD D    SF P DC  
Sbjct: 425 NVMQMDLHVLYDLDGATFSFAPADCAS 451


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 122/367 (33%), Positives = 176/367 (47%), Gaps = 16/367 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY  +  IG+P     Y  +DTGSD+ W+QC PC  CY QV PIY+P++S
Sbjct: 34  VSSGLSLGSGEYFARMGIGSPQR-SYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNS 92

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDN 131
           SSY+ + C S  C  LD  +C     C+Y   Y DSS + G L  E    G NS+    N
Sbjct: 93  SSYRRVYCGSALCQALDYSACQGMG-CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRN 151

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD-SSITSK 190
           + FGCGH+N+G+F      L+G+G   LS  SQI + +G   FSYCLV  ++   S +S 
Sbjct: 152 IAFGCGHSNSGLFRGEAG-LLGMGGGTLSFFSQIAASIGP-AFSYCLVDRYSQLQSRSSP 209

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG  + +      +  L +    T+Y+  L GISVG    +  + P   +      G 
Sbjct: 210 LIFGR-TAIPFAARFTPLLKNPRIDTFYYAILTGISVGG--TALPIPPAQFALTGNGTGG 266

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PIL 306
             +D+G   T +    Y  L +  R A +  P   P  G  L   C+    +  +  P L
Sbjct: 267 AILDSGTSVTRVVPAAYAVLRDAYRAASRNLP---PAPGVYLLDTCFNFQGLPTVQIPSL 323

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
             HFD    + L   +  IP    G FC A  P    + + GN  Q    IG+D    ++
Sbjct: 324 VLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLI 383

Query: 367 SFKPTDC 373
           +  P +C
Sbjct: 384 AIAPREC 390


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 175/381 (45%), Gaps = 51/381 (13%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC--------LPCVQCYKQVKP--IYNPASS 72
           EY+M  +IGTPP   +  I DTGSDL+W+ C        L   +      P   ++P+ S
Sbjct: 99  EYLMAVNIGTPPT-RMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKS 157

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS------- 125
           ++++ + C S  C  L   SC +   C Y+Y Y D S T GVL+TE  TF ++       
Sbjct: 158 TTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDG 217

Query: 126 -NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN-----KFSYCLV 179
                 NV FGC     G    + +               ++SQLGA+     +FSYCLV
Sbjct: 218 TTTRVANVNFGCSTTFVGSSVGDGL------VGLGGGDLSLVSQLGADTSLGRRFSYCLV 271

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
           P+   +S  S + FG  + V+  G V+T L+  + K YY V L  + VGN          
Sbjct: 272 PYSVKAS--SALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGN---------- 319

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY----- 294
             +  A  +  + +D+G   T LP+   + L +++   IKL P Q P     LC+     
Sbjct: 320 -KTFEAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGV 378

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQ 352
           +   +A + P +T    GGA V L   +TF+    EG  C A+  +       I GN AQ
Sbjct: 379 REGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQ-EGTLCLAVSAMSEQFPASIIGNIAQ 437

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
            ++ +GYD D   V+F P  C
Sbjct: 438 QNMHVGYDLDKGTVTFAPAAC 458


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 131/374 (35%), Positives = 189/374 (50%), Gaps = 38/374 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            + +  SIGTPP      I+DTGSDL+W QC        + KP+Y+PA SSS+    C  
Sbjct: 88  HHTLTVSIGTPPQPRTL-ILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDG 146

Query: 83  EQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C     +T +CS  + C YTY Y  S+ TKG LA+E  TFG       ++ FGCG   
Sbjct: 147 RLCETGSFNTKNCSRNK-CIYTYNYG-SATTKGELASETFTFGEHRRVSVSLDFGCGKLT 204

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G       G++G+   RLSL    +SQL   +FSYCL PF  D + TS ++FG  +++S
Sbjct: 205 SGSL-PGASGILGISPDRLSL----VSQLQIPRFSYCLTPF-LDRNTTSHIFFGAMADLS 258

Query: 201 G----GGVVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
                G + +TSLV+  D +  YY+V L GISVG     +K +    SS AI +   G  
Sbjct: 259 KYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVG-----TKRLNVPVSSFAIGRDGSGGT 313

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPSMAGIA------ 303
           F+D+G    +LP      L+E +  A+KL        G   +LC++ P   G A      
Sbjct: 314 FVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQ 373

Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDF 361
            P L  HFDGGA + L+   +++     G  C  +    G  G I GN+ Q ++ + +D 
Sbjct: 374 VPPLVYHFDGGAAM-LLRRDSYMVEVSAGRMCLVIS--SGARGAIIGNYQQQNMHVLFDV 430

Query: 362 DSQMVSFKPTDCTK 375
           ++   SF PT C +
Sbjct: 431 ENHEFSFAPTQCNQ 444


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 132/374 (35%), Positives = 185/374 (49%), Gaps = 33/374 (8%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V  ++GEY+M+  IGTP       I+DTGSDL+W QC PC+ C  Q  P ++PA S++Y+
Sbjct: 83  VLASDGEYLMEMGIGTPTRY-YSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYR 141

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF--FDNVVF 134
            L C S  C+ L    C  Q++C Y Y Y DS+ T GVLA E  TFG +        + F
Sbjct: 142 SLGCASPACNALYYPLC-YQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF 200

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCG+ N G    N  G+VG GR  LSL    +SQLG+ +FSYCL  F   S + S++YFG
Sbjct: 201 GCGNLNAGSL-ANGSGMVGFGRGSLSL----VSQLGSPRFSYCLTSFL--SPVPSRLYFG 253

Query: 195 -----NGSEVSGGGVVSTS-LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
                N +  S   V ST  +V+    T YF+ + GISVG       L+P   +  AI+ 
Sbjct: 254 VYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG-----YLLPIDPAVFAIND 308

Query: 249 ----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK---TPSM 299
               G   ID+G   T L +  Y+ +     + I L P  +    S L  C++    P  
Sbjct: 309 TDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQ 367

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           +   P L  HFDG      +     + P   G  C AM     D  I G++   +  + Y
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLY 426

Query: 360 DFDSQMVSFKPTDC 373
           D ++ ++SF P  C
Sbjct: 427 DLENSLMSFVPAPC 440


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 125/387 (32%), Positives = 187/387 (48%), Gaps = 26/387 (6%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S  S   GEY +   +GTPP   ++ I+DTGSDL W+QC PC  C++Q    Y P  
Sbjct: 159 TLESGASLGTGEYFLDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKD 217

Query: 72  SSSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATE----RITF 122
           SS+Y+ +SC   +C L+ +      C ++ Q C Y Y YAD S T G  A+E     +T+
Sbjct: 218 SSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTW 277

Query: 123 GNSNNFFDNVV---FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            N    F  VV   FGCGH N G F     GL+GLGR  +S  SQI S  G + FSYCL 
Sbjct: 278 PNGKEKFKQVVDVMFGCGHWNKGFFY-GASGLLGLGRGPISFPSQIQSIYG-HSFSYCLT 335

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE---DKTYYFVTLEGISV-GNLSNSS 234
              +++S++SK+ FG   E+     ++ T+L++ E   D+T+Y++ ++ I V G + + S
Sbjct: 336 DLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDIS 395

Query: 235 KLIPYYNSSGAISKGNM--FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
           +   +++S GA +       ID+G+  T  P   Y+ ++E     IKL            
Sbjct: 396 EQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSP 455

Query: 293 CYKTPS--MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFG 348
           CY      M    P    HF  G        + F     + V C A+   P    + I G
Sbjct: 456 CYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIG 515

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           N  Q +  I YD     + + P  C +
Sbjct: 516 NLLQQNFHILYDVKRSRLGYSPRRCAE 542


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 130/373 (34%), Positives = 186/373 (49%), Gaps = 30/373 (8%)

Query: 14  QSNVSTANGEYVMKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           QS V   NGEY+M  ++G+PP   D+  IVDTGSDL WVQCLPC  CY+Q  P ++P+ S
Sbjct: 29  QSPVKAGNGEYLMTLTLGSPPQSFDV--IVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKS 86

Query: 73  SSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN--SNNF 128
            S+++ +C    C++  L   +C++  +C Y Y Y D S T G LA E I+  N      
Sbjct: 87  RSFRKAACTDNLCNVSALPLKACAA-NVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQS 145

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N  FGCG  N G F     GLVGLG+  LSL SQ LS   ANKFSYCLV  ++ S+  
Sbjct: 146 VPNFAFGCGTQNLGTF-AGAAGLVGLGQGPLSLNSQ-LSHTFANKFSYCLVSLNSLSA-- 201

Query: 189 SKMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGA 245
           S + F  GS  +   +  TS+ V+    TYY+V L  I VG   L+ +  +     S+G 
Sbjct: 202 SPLTF--GSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTG- 258

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-- 303
             +G   ID+G   T+L    Y+ +     + +          G  LC+   ++AG++  
Sbjct: 259 --RGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCF---NIAGVSNP 313

Query: 304 --PILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
             P +   F  GA   +   + F+         C AM    G   I GN  Q +  + YD
Sbjct: 314 SVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAMGGSQG-FSIIGNIQQQNHLVVYD 371

Query: 361 FDSQMVSFKPTDC 373
            +++ + F   DC
Sbjct: 372 LEAKKIGFATADC 384


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 174/368 (47%), Gaps = 21/368 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY  +  +GTP    +Y ++DTGSD++W+QC PC +CY Q  PI++P  S
Sbjct: 131 VVSGLSQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            +Y  + C S  C  LD+  C++++  C Y   Y D S T G  +TE +TF    N    
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF--RRNRVKG 247

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L      +LS   Q   +    KFSYCLV   + SS  S +
Sbjct: 248 VALGCGHDNEGLFVGAAGLLGLGK-GKLSFPGQTGHRFN-QKFSYCLVD-RSASSKPSSV 304

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
            FGN + VS     +  L + +  T+Y+V L GISVG        +P   +S      I 
Sbjct: 305 VFGNAA-VSRIARFTPLLSNPKLDTFYYVGLLGISVGGTR-----VPGVTASLFKLDQIG 358

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-API 305
            G + ID+G   T L +  Y  + +  R   K T  + P       C+   +M  +  P 
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAK-TLKRAPNFSLFDTCFDLSNMNEVKVPT 417

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +  HF   A V L  T+  IP    G FCFA     G + I GN  Q    + YD  S  
Sbjct: 418 VVLHFR-RADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSR 476

Query: 366 VSFKPTDC 373
           V F P  C
Sbjct: 477 VGFAPGGC 484


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 171/363 (47%), Gaps = 14/363 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY ++  +G+PP  + Y ++D+GSD++WVQC PC QCY Q  P+++PA S
Sbjct: 131 VVSGMNQGSGEYFIRIGVGSPPR-EQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADS 189

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  + C S  C  ++   C +   C Y   Y D S TKG LA E +TFG +     NV
Sbjct: 190 ASFMGVPCSSSVCERIENAGCHAGG-CRYEVMYGDGSYTKGTLALETLTFGRT--VVRNV 246

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +SL  Q+  Q G   FSYCLV   TDS+    + 
Sbjct: 247 AIGCGHRNRGMFVGAAGLLGLGGGS-MSLVGQLGGQTGG-AFSYCLVSRGTDSA--GSLE 302

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS-SKLIPYYNSSGAISKGNM 251
           FG G+   G   +   + +    ++Y++ L G+ VG +    S+ +   N  G    G +
Sbjct: 303 FGRGAMPVGAAWIPL-IRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMG---NGGV 358

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            +DTG   T +P   Y    +         P          CY       +  P ++ +F
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYF 418

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GG  + L   +  IP    G FCFA       + I GN  Q  + I +D  +  V F P
Sbjct: 419 AGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGP 478

Query: 371 TDC 373
             C
Sbjct: 479 NVC 481


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 120/396 (30%), Positives = 183/396 (46%), Gaps = 46/396 (11%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V ++ +  A GEY++K  IGTPP       +DT SDL+W QC PC  CY QV P++NP  
Sbjct: 77  VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135

Query: 72  SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           SS+Y  L C S+ C  LD   C     + C YTY Y+ ++ T+G LA +++  G   + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193

Query: 130 DNVVFGCGHNNT-GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             V FGC  ++T G       G+VGLGR  LSL    +SQL   +F+YCL P    S I 
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL----VSQLSVRRFAYCLPP--PASRIP 247

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYN---- 241
            K+  G  ++ +       ++  + D    +YY++ L+G+ +G+ + S            
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATAT 307

Query: 242 ------------SSGAISKGN-----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
                       ++ A++ G+     M ID  +  T L    Y+ L   +   I+L    
Sbjct: 308 ATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGT 367

Query: 285 DPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
              LG  LC+  P   G+A      P +   FD G  + L     F      G+ C  + 
Sbjct: 368 GSSLGLDLCFILPD--GVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLMVG 424

Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             + G V I GNF Q ++ + Y+     V+F  + C
Sbjct: 425 RAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 117/366 (31%), Positives = 170/366 (46%), Gaps = 20/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP    Y ++D+GSD++WVQC PC QCY Q  P+++PA S
Sbjct: 32  VVSGMDQGSGEYFVRIGVGSPPRSQ-YMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADS 90

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  +SC S  C  +D   C+S + C Y   Y D S TKG LA E +T G +     NV
Sbjct: 91  ASFMGVSCSSAVCDQVDNAGCNSGR-CRYEVSYGDGSSTKGTLALETLTLGRT--VVQNV 147

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +S   Q LS+   N FSYCLV   T+    S  +
Sbjct: 148 AIGCGHMNQGMFVGAAGLLGLGGGS-MSFVGQ-LSRERGNAFSYCLVSRVTN----SNGF 201

Query: 193 FGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK--- 248
              GSE    G     L+      +YY++ L G+ VG++      +P       +++   
Sbjct: 202 LEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMK-----VPISEDIFELTELGN 256

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
           G + +DTG   T  P   Y    +   +     P          CY       +  P ++
Sbjct: 257 GGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVS 316

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            +F GG  + L   +  IP    G FCFA  P    + I GN  Q  + I  D  ++ V 
Sbjct: 317 FYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVG 376

Query: 368 FKPTDC 373
           F P  C
Sbjct: 377 FGPNVC 382


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 120/396 (30%), Positives = 183/396 (46%), Gaps = 46/396 (11%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V ++ +  A GEY++K  IGTPP       +DT SDL+W QC PC  CY QV P++NP  
Sbjct: 77  VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135

Query: 72  SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           SS+Y  L C S+ C  LD   C     + C YTY Y+ ++ T+G LA +++  G   + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193

Query: 130 DNVVFGCGHNNT-GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             V FGC  ++T G       G+VGLGR  LSL    +SQL   +F+YCL P    S I 
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL----VSQLSVRRFAYCLPP--PASRIP 247

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYN---- 241
            K+  G  ++ +       ++  + D    +YY++ L+G+ +G+ + S            
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATAT 307

Query: 242 ------------SSGAISKGN-----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
                       ++ A++ G+     M ID  +  T L    Y+ L   +   I+L    
Sbjct: 308 ATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGT 367

Query: 285 DPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
              LG  LC+  P   G+A      P +   FD G  + L     F      G+ C  + 
Sbjct: 368 GSSLGLDLCFILPD--GVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLMVG 424

Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             + G V I GNF Q ++ + Y+     V+F  + C
Sbjct: 425 RAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 130/394 (32%), Positives = 194/394 (49%), Gaps = 44/394 (11%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           +N   + + +   EY+M+ +IGTPP+     + DTGSDL W QC PC  C+ Q  PIY+ 
Sbjct: 81  SNAGPARLRSGQAEYLMELAIGTPPV-PFVALADTGSDLTWTQCKPCKLCFPQDTPIYDT 139

Query: 70  ASSSSYKELSCQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
           A+S+S+  + C S  C  +       + ++   C Y Y Y D + + GVL TE +TF  S
Sbjct: 140 AASASFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGS 199

Query: 126 NN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
           +            V FGCG +N G+ + N  G VGLGR  LSL    ++QLG  KFSYCL
Sbjct: 200 SPGAPGPGVSVGGVAFGCGVDNGGL-SYNSTGTVGLGRGSLSL----VAQLGVGKFSYCL 254

Query: 179 VPFHTDSSITSKMYFGNGSE------VSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLS 231
             F  ++S+ S + FG+ +E      + G  V ST LV    + + Y+V+LEGIS+G+  
Sbjct: 255 TDFF-NTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGD-- 311

Query: 232 NSSKLIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
                +P  N +  +     G M +D+G   T+L +  +  +   V   +         L
Sbjct: 312 ---ARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSL 368

Query: 289 GSQLCYKTPSMAGI-----APILTAHFDGGAKVPLIHTSTFIPPPVE-GVFCFAMQPIDG 342
            S  C+  P+ AG       P +  HF GGA + L H   ++    E   FC  +     
Sbjct: 369 DSP-CF--PATAGEQQLPDMPDMLLHFAGGADMRL-HRDNYMSFNQESSSFCLNIAGAPS 424

Query: 343 DVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
             G I GNF Q ++ + +D     +SF PTDC+K
Sbjct: 425 AYGSILGNFQQQNIQMLFDITVGQLSFVPTDCSK 458


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 131/383 (34%), Positives = 201/383 (52%), Gaps = 35/383 (9%)

Query: 10  NNVVQSNVSTA-------NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
           N+++ ++++ A       NG+++MK SIG PP  ++   V TGSDL+W+ CL    C   
Sbjct: 77  NDLISNSITAAEFPSILDNGDFLMKISIGIPPT-ELLVNVATGSDLVWIPCLSFKPCTHN 135

Query: 63  VK-PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYG-YADSSLTKGVLATERI 120
                ++P  SS+YK + C S +C + +  +C     C Y+       S   G LA + +
Sbjct: 136 CDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSD-CFYSCDPRHQDSCPDGDLAMDTL 194

Query: 121 TFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           T  ++        N  F CG+   G +    +G++GLG   LSL ++I S L   KFS+C
Sbjct: 195 TLNSTTGKSFMLPNTGFICGNRIGGDYPG--VGILGLGHGSLSLLNRI-SHLIDGKFSHC 251

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           +VP+ ++   TSK+ FG+ + VSG  + ST L        Y ++  GISVGN S S+  I
Sbjct: 252 IVPYSSNQ--TSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGI 309

Query: 238 --PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP-YQDPRLGSQLCY 294
              YY +   +  G MF       T  P+ FY++LE  VR AI+  P Y DP    +LCY
Sbjct: 310 GSDYYMNGLGMDSGTMF-------TYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCY 362

Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQ 352
           + +P  +   P +T HF+GG+ V L  +++FI    E + C A      +   +FG + Q
Sbjct: 363 RYSPDFS--PPTITMHFEGGS-VELSSSNSFIRM-TEDIVCLAFATSSSEQDAVFGYWQQ 418

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
           ++L IGYD D+  +SF  TDCTK
Sbjct: 419 TNLLIGYDLDAGFLSFLKTDCTK 441


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 124/380 (32%), Positives = 189/380 (49%), Gaps = 30/380 (7%)

Query: 7   FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
           F  + +  + V+   G+ +++ FS+G PP+  + GI DTGSDL+WVQC PC  C++Q  P
Sbjct: 73  FITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCRPCADCFRQSTP 131

Query: 66  IYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
           I++P+ SS+Y +LS  S  C        +    C Y   YAD S + G LATE I F  S
Sbjct: 132 IFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETS 191

Query: 126 NN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
           +       +VVFGCGH+N G F+  + G++GL     S    I+S+LG+ +FSYC+    
Sbjct: 192 DQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQS----IVSRLGS-RFSYCIGDLF 246

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
                 +++  G+G ++ G      +        +Y+VTLEGISVG       + P    
Sbjct: 247 DPHYTHNQLVLGDGVKMEGSSTPFHTF-----NGFYYVTLEGISVG--ETRLDINPEVFQ 299

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYK--- 295
                +G + +D+G   T L KD +    N ++  VR   +   Y+   +   LCYK   
Sbjct: 300 RTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYR--TIPGWLCYKGRV 357

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVG-IFGNFAQS 353
              + G  P L  HF  GA + L   S F+    + VFC A ++    ++G + G  AQ 
Sbjct: 358 NEDLRGF-PELAFHFAEGADLVLDANSLFVQKN-QDVFCLAVLESNLKNIGSVIGIMAQQ 415

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
              + YD   + V F+ TDC
Sbjct: 416 HYNVAYDLIGKRVYFQRTDC 435


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 125/375 (33%), Positives = 189/375 (50%), Gaps = 31/375 (8%)

Query: 13  VQSN-VSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           +Q+N V+   G+ +++ FS+G PP+  + GI DTGSDL+WVQC PC  C++Q  PI++P+
Sbjct: 46  IQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCRPCADCFRQSTPIFDPS 104

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
            SS+Y +LS  S  C        +    C Y   YAD S + G LATE I F  S+    
Sbjct: 105 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV 164

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
              +VVFGCGH+N G F+  + G++GL     S    I+S+LG+ +FSYC+         
Sbjct: 165 TVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQS----IVSRLGS-RFSYCIGDLFDPHYT 219

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
            +++  G+G ++ G      +        +Y+VTLEGISVG       + P         
Sbjct: 220 HNQLVLGDGVKMEGSSTPFHTF-----NGFYYVTLEGISVG--ETRLDINPEVFQRTESG 272

Query: 248 KGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYK---TPSMA 300
           +G + +D+G   T L KD +    N ++  VR   +   Y+   +   LCYK      + 
Sbjct: 273 QGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYR--TIPGWLCYKGRVNEDLR 330

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVG-IFGNFAQSDLFIG 358
           G  P L  HF  GA + L   S F+    + VFC A+   +  ++G + G  AQ    + 
Sbjct: 331 GF-PELAFHFAEGADLVLDANSLFVQKN-QDVFCLAVLESNLKNIGSVIGIMAQQHYNVA 388

Query: 359 YDFDSQMVSFKPTDC 373
           YD   + V F+ TDC
Sbjct: 389 YDLIGKRVYFQRTDC 403


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  168 bits (425), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 124/380 (32%), Positives = 189/380 (49%), Gaps = 30/380 (7%)

Query: 7   FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
           F  + +  + V+   G+ +++ FS+G PP+  + GI DTGSDL+WVQC PC  C++Q  P
Sbjct: 41  FITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCRPCADCFRQSTP 99

Query: 66  IYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
           I++P+ SS+Y +LS  S  C        +    C Y   YAD S + G LATE I F  S
Sbjct: 100 IFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETS 159

Query: 126 NN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
           +       +VVFGCGH+N G F+  + G++GL     S    I+S+LG+ +FSYC+    
Sbjct: 160 DQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQS----IVSRLGS-RFSYCIGDLF 214

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
                 +++  G+G ++ G      +        +Y+VTLEGISVG       + P    
Sbjct: 215 DPHYTHNQLVLGDGVKMEGSSTPFHTF-----NGFYYVTLEGISVG--ETRLDINPEVFQ 267

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYK--- 295
                +G + +D+G   T L KD +    N ++  VR   +   Y+   +   LCYK   
Sbjct: 268 RTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYR--TIPGWLCYKGRV 325

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVG-IFGNFAQS 353
              + G  P L  HF  GA + L   S F+    + VFC A+   +  ++G + G  AQ 
Sbjct: 326 NEDLRGF-PELAFHFAEGADLVLDANSLFVQKN-QDVFCLAVLESNLKNIGSVIGIMAQQ 383

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
              + YD   + V F+ TDC
Sbjct: 384 HYNVAYDLIGKRVYFQRTDC 403


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 116/346 (33%), Positives = 165/346 (47%), Gaps = 18/346 (5%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ-LC 99
           ++DTGSD++WVQC PC +CY+Q  P+++P  SSSY  + C +  C  LD+  C  ++  C
Sbjct: 2   VLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGAC 61

Query: 100 NYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
            Y   Y D S+T G   TE +TF         V  GCGH+N G+F      L+GLGR  L
Sbjct: 62  MYQVAYGDGSVTAGDFVTETLTFAGGAR-VARVALGCGHDNEGLFVAAAG-LLGLGRGGL 119

Query: 160 SLASQILSQLGANKFSYCLVPFHTD-------SSITSKMYFGNGSEVSGGGVVSTSLVSK 212
           S  +QI  + G   FSYCLV   +        S  +S + FG GS  +     +  + + 
Sbjct: 120 SFPTQISRRYG-RSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNP 178

Query: 213 EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
             +T+Y+V L GISVG                +  +G + +D+G   T L +  Y+ L +
Sbjct: 179 RMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRD 238

Query: 273 QVRNA----IKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPP 327
             R A    ++L+P          CY       +  P ++ HF GGA+  L   +  IP 
Sbjct: 239 AFRAAAAGGLRLSPGGFSLF--DTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPV 296

Query: 328 PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              G FCFA    DG V I GN  Q    + +D D Q V F P  C
Sbjct: 297 DSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 125/367 (34%), Positives = 176/367 (47%), Gaps = 19/367 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC +CY Q  P+++P  S
Sbjct: 115 VISGLAQGSGEYFTRIGVGTPPRY-VYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKS 173

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+  ++C+S  CH LD+  C++Q Q C Y   Y D S T G  +TE +TF  +      
Sbjct: 174 RSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR--VAR 231

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+GLGR RLS  SQ   +   +KFSYCLV   + SS  S M
Sbjct: 232 VALGCGHDNEGLFVGAAG-LLGLGRGRLSFPSQTGRRFN-HKFSYCLVD-RSASSKPSSM 288

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
            FG+ S VS     +  + + +  T+Y+V L GISVG        +P   +S        
Sbjct: 289 VFGD-SAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTR-----VPGITASLFKLDQTG 342

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
            G + ID+G   T L +  Y    +  R                 C+       +  P +
Sbjct: 343 NGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTV 402

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
             HF  GA V L  ++  IP    G FC A     G + I GN  Q    + YD     V
Sbjct: 403 VLHFR-GADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRV 461

Query: 367 SFKPTDC 373
            F P  C
Sbjct: 462 GFAPHGC 468


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 168/364 (46%), Gaps = 22/364 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PC  C+ Q  P ++P++SS+    SC S
Sbjct: 81  EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 139

Query: 83  EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C  L   SC S      Q C YTY Y D S+T G L  ++ TF  +      V FGCG
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 199

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N GVF  NE G+ G GR  LSL     SQL    FS+C    +     T  +      
Sbjct: 200 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVNGLKPSTVLLDLPADL 255

Query: 198 EVSGGGVV-STSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFI 253
             SG G V ST L+    + T+Y+++L+GI+VG     S  +P   S  A+  G     I
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVG-----STRLPVPESEFALKNGTGGTII 310

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
           D+G   T LP   Y  + +     +KL            C   P  A    P L  HF+G
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 313 GA-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
               +P  +    +      + C A+    G+V   GNF Q ++ + YD  +  +SF P 
Sbjct: 371 ATMDLPRENYVFEVEDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 429

Query: 372 DCTK 375
            C K
Sbjct: 430 QCDK 433


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 129/366 (35%), Positives = 185/366 (50%), Gaps = 17/366 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTP    +Y ++DTGSD++W+QC PC++CY Q  P+++P  S
Sbjct: 134 VISGLAQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKS 192

Query: 73  SSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+  + C S  C  LD   CS+ +Q+C Y   Y D S T G  +TE +TF  +      
Sbjct: 193 RSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTR--VGR 250

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           VV GCGH+N G+F      L+GLGR RLS  SQI  +  + KFSYCL    + SS  S +
Sbjct: 251 VVLGCGHDNEGLFVGAAG-LLGLGRGRLSFPSQIGRRFNS-KFSYCLGD-RSASSRPSSI 307

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKG 249
            FG+ S +S     +  L + +  T+Y+V L GISVG   +S  S  +   +S+G    G
Sbjct: 308 VFGD-SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG---NG 363

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
            + ID+G   T L +  Y  L +  +  A  L    +  L    C+       +  P + 
Sbjct: 364 GVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSL-FDTCFDLSGKTEVKVPTVV 422

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF  GA VPL  ++  IP    G FCFA       + I GN  Q    + YD  +  V 
Sbjct: 423 LHFR-GADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVG 481

Query: 368 FKPTDC 373
           F P  C
Sbjct: 482 FAPRGC 487


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 122/369 (33%), Positives = 177/369 (47%), Gaps = 18/369 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY ++ S+GTPP   +Y ++DTGSD++W+QC PCV CY Q   I++P  S
Sbjct: 47  VVSGLSLGSGEYFIRISVGTPPRR-MYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKS 105

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN----F 128
           S+Y  L C + QC  LD  +C + + C Y   Y D S T G   T+ ++  +++      
Sbjct: 106 STYSTLGCSTRQCLNLDIGTCQANK-CLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVV 164

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            + +  GCGH+N G F     GL+GLG+  LS  +Q+  Q G  +FSYCL    TDS+  
Sbjct: 165 LNKIPLGCGHDNEGYF-VGAAGLLGLGKGPLSFPNQVDPQNGG-RFSYCLTDRETDSTEG 222

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
           S + FG  +    G   +    +    T+Y++ + GISVG    +     +   S  +  
Sbjct: 223 SSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDS--LGN 280

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-AP 304
           G + ID+G   T L    Y  L +  R     T    P  G  L   CY    +A +  P
Sbjct: 281 GGVIIDSGTSVTRLQNAAYASLRDAFRAG---TSDLAPTAGFSLFDTCYDLSGLASVDVP 337

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +T HF GG  + L  ++  IP      FC A     G   I GN  Q    + YD    
Sbjct: 338 TVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHN 396

Query: 365 MVSFKPTDC 373
            V F P+ C
Sbjct: 397 QVGFVPSQC 405


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 115/384 (29%), Positives = 185/384 (48%), Gaps = 34/384 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY    ++G PP   +  ++DTGSDL+W+QC+PC  CY+QV P+Y+P SS
Sbjct: 77  VMSGVPFDSGEYFAVINVGDPPTRALV-VIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSS 135

Query: 73  SSYKELSCQSEQCH-LLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           S+++ + C S +C  +L    C ++   C Y   Y D S + G LAT+R+ F +  +   
Sbjct: 136 STHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH- 194

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL-VPFHTDSSITS 189
           NV  GCGH+N G+  E+  GL+G+GR +LS  +Q+    G + FSYCL        + +S
Sbjct: 195 NVTLGCGHDNVGLL-ESAAGLLGVGRGQLSFPTQLAPAYG-HVFSYCLGDRLSRAQNGSS 252

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
            + FG   E             +    YY V + G SVG      ++  + N+S A++  
Sbjct: 253 YLVFGRTPEPPSTAFTPLRTNPRRPSLYY-VDMVGFSVGG----ERVTGFSNASLALNPA 307

Query: 248 --KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYK----- 295
             +G + +D+G   +   +D Y  + +   +          +L ++      CY      
Sbjct: 308 TGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMR-KLATKFSVFDACYDLRGNG 366

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNF 350
            P+ A   P +  HF GGA + L   +  I  PV+G      FC  +Q  D  + + GN 
Sbjct: 367 APAAAVRVPSIVLHFAGGADMALPQANYLI--PVQGGDRRTYFCLGLQAADDGLNVLGNV 424

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q    + +D +   + F P  C+
Sbjct: 425 QQQGFGLVFDVERGRIGFTPNGCS 448


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 123/368 (33%), Positives = 173/368 (47%), Gaps = 23/368 (6%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           +S  +   G YV+   +GTP   D+  I DTGSDL W QC PC + CY Q +PI+NP+ S
Sbjct: 128 KSGSTIGTGNYVVTVGLGTPKR-DLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKS 186

Query: 73  SSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
           +SY  +SC S  C  L     ++ SCS+   C Y   Y D S + G  A +++    S +
Sbjct: 187 TSYTNISCSSPTCDELKSGTGNSPSCSAST-CVYGIQYGDQSYSVGFFAQDKLAL-TSTD 244

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F+N +FGCG NN G+F     GL+GLGR  LSL SQ   + G   FSYCL    + SS 
Sbjct: 245 VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLVSQTAQKYG-KLFSYCL---PSTSSS 299

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           T  + FG+G   S     + SLV+ +  ++YF+ L  ISVG    S+       S+   S
Sbjct: 300 TGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLST-------SASVFS 352

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
                ID+G   + LP   Y+ L    +  +   P   P      CY       +  P +
Sbjct: 353 TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKI 412

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
             +F  GA++ L  +  F    +  V   FA      D+ I GN  Q    + YD     
Sbjct: 413 NLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGR 472

Query: 366 VSFKPTDC 373
           + F P  C
Sbjct: 473 IGFAPGGC 480


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 174/367 (47%), Gaps = 14/367 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY ++ S+GTPP   +Y ++DTGSD++W+QC PCV CY Q   +++P  S
Sbjct: 26  VISGLSLGSGEYFIRVSVGTPPR-GMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKS 84

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI----TFGNSNNF 128
           S+Y  L C S QC  LD   C   + C Y   Y D S + G  AT+ +    T G     
Sbjct: 85  STYSTLGCNSRQCLNLDVGGCVGNK-CLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV 143

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            + +  GCGH+N G F     GL+GLG+  LS  +QI S+ G  +FSYCL    TDS+  
Sbjct: 144 LNKIPLGCGHDNEGYF-VGAAGLLGLGKGPLSFPNQINSENGG-RFSYCLTGRDTDSTER 201

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAIS 247
           S + FG+ +    G   +    +    T+Y++ + GISVG    S   IP       ++ 
Sbjct: 202 SSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVG---GSILTIPTSAFQLDSLG 258

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
            G + ID+G   T L    Y  L E  R                 CY    ++ +  P +
Sbjct: 259 NGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTV 318

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           T HF GGA + L  ++  +P      FC A     G   I GN  Q    + YD     V
Sbjct: 319 TLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQV 377

Query: 367 SFKPTDC 373
            F P+ C
Sbjct: 378 GFVPSQC 384


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 121/363 (33%), Positives = 169/363 (46%), Gaps = 20/363 (5%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PC  C+ Q  P ++P++SS+    SC S
Sbjct: 81  EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 139

Query: 83  EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C  L   SC S      Q C YTY Y D S+T G L  ++ TF  +      V FGCG
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 199

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N GVF  NE G+ G GR  LSL     SQL    FS+C    +     T  +      
Sbjct: 200 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVNGLKPSTVLLDLPADL 255

Query: 198 EVSGGGVV-STSLVSK-EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNMFID 254
             SG G V ST L+    + T+Y+++L+GI+VG    S++L +P    +     G   ID
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVG----STRLPVPESEFTLKNGTGGTIID 311

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDGG 313
           +G   T LP   Y  + +     +KL            C   P  A    P L  HF+G 
Sbjct: 312 SGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA 371

Query: 314 A-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
              +P  +    +      + C A+    G+V   GNF Q ++ + YD  +  +SF P  
Sbjct: 372 TMDLPRENYVFEVEDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 373 CTK 375
           C K
Sbjct: 431 CDK 433


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 129/384 (33%), Positives = 191/384 (49%), Gaps = 28/384 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           ++ V+S      GEY M   +G PP      I+DTGSDL W+QC PC  C+ Q  P+++P
Sbjct: 73  DSTVESGAELGAGEYFMDVFVGNPPR-HFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDP 131

Query: 70  ASSSSYKELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
           + S+S+K + C +  C L+      D  S +S + C Y Y Y DSS T G LA E ++  
Sbjct: 132 SQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVS 191

Query: 124 NSNN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            S++       ++V GCGH+N G+  +   GL+GLG+  LS  SQ+ S      FSYCLV
Sbjct: 192 LSDHPSSLEIRDMVIGCGHSNKGL-FQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV 250

Query: 180 PFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKED--KTYYFVTLEGISVGNLSNSSK 235
               + S++S + FG G  +S     +  T  V   +  +T+Y++ ++GI +       +
Sbjct: 251 DRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQ-----E 305

Query: 236 LIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
           L+P      AI+    G   ID+G   T L +D Y  +E      I   P  DP     +
Sbjct: 306 LLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISY-PRADPFDILGI 364

Query: 293 CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPP-PVEGVFCFAMQPIDGDVGIFGNF 350
           CY     A +  P L+  F  GA++ L   + FI P P E   C A+ P DG + I GNF
Sbjct: 365 CYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MSIIGNF 423

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q ++   YD     + F  TDC+
Sbjct: 424 QQQNIHFLYDVQHARLGFANTDCS 447


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 186/383 (48%), Gaps = 33/383 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY     +GTP    +  ++DTGSDL+W+QC PC +CY Q   +++P  S
Sbjct: 75  VFSGIPFESGEYFALVGVGTPSTKAML-VIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
           S+Y+ + C S QC  L    C S       C Y   Y D S + G LAT+++ F N + +
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN-DTY 192

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            +NV  GCG +N G+F ++  GL+G+GR ++S+++Q+    G + F YCL    + S+ +
Sbjct: 193 VNNVTLGCGRDNEGLF-DSAAGLLGVGRGKISISTQVAPAYG-SVFEYCLGDRTSRSTRS 250

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI- 246
           S + FG   E        T+L+S   + + Y+V + G SVG      ++  + N+S A+ 
Sbjct: 251 SYLVFGRTPEPP--STAFTALLSNPRRPSLYYVDMAGFSVGG----ERVTGFSNASLALD 304

Query: 247 ---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSM- 299
               +G + +D+G   +   +D Y  L +      +    +       +   CY      
Sbjct: 305 TATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRP 364

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--------VFCFAMQPIDGDVGIFGNFA 351
           A  AP++  HF GGA + L   + F+  PV+G          C   +  D  + + GN  
Sbjct: 365 AASAPLIVLHFAGGADMALPPENYFL--PVDGGRRRAASYRRCLGFEAADDGLSVIGNVQ 422

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
           Q    + +D + + + F P  CT
Sbjct: 423 QQGFRVVFDVEKERIGFAPKGCT 445


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 125/366 (34%), Positives = 175/366 (47%), Gaps = 18/366 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY  +  +GTP   + Y ++DTGSD+ W+QC PC +CY Q  PI+NP+ S
Sbjct: 146 VVSGMEQGSGEYFTRIGVGTP-TREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYS 204

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  + C S  C  LD   C S   C Y   Y D S + G  ATE +TFG ++    NV
Sbjct: 205 ASFSTVGCDSAVCSQLDAYDCHSGG-CLYEASYGDGSYSTGSFATETLTFGTTS--VANV 261

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L+GLG   LS  +QI +Q G + FSYCLV   +DSS    + 
Sbjct: 262 AIGCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTG-HTFSYCLVDRESDSS--GPLQ 317

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  S V  G + +    +    T+Y++++  ISVG     S     +        G   
Sbjct: 318 FGPKS-VPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFI 376

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILT 307
           ID+G   T L    Y+     VR+A      Q PR  +      CY    +  ++ P + 
Sbjct: 377 IDSGTVVTRLVTSAYD----AVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVG 432

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HF  GA + L   +  IP    G FCFA  P    V I GN  Q  + + +D  + +V 
Sbjct: 433 FHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVG 492

Query: 368 FKPTDC 373
           F    C
Sbjct: 493 FAFDQC 498


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 125/394 (31%), Positives = 179/394 (45%), Gaps = 66/394 (16%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-----------LPCVQCYK 61
           V S V + + EY+M  ++G+PP   +  I DTGSDL+WV+C            P  Q   
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPR-SMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145

Query: 62  QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
                ++P+ SS+Y  +SCQ++ C  L   +C     C Y Y Y D S T GVL+TE  T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200

Query: 122 FGNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG---- 170
           F +  +           V FGC     G F  + +            A  +++QLG    
Sbjct: 201 FDDGGSGRSPRQVRVGGVKFGCSTATAGSFPADGL------VGLGGGAVSLVTQLGGATS 254

Query: 171 -ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
              +FSYCLVP   ++S  S + FG  ++V+  G  ST LV+ +  TYY V L+ + VGN
Sbjct: 255 LGRRFSYCLVPHSVNAS--SALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGN 312

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
            + +S            +   + +D+G   T L       + +++   I L P Q P   
Sbjct: 313 KTVASA-----------ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGL 361

Query: 290 SQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM------QP 339
            QLCY        A    P LT  F GGA V L   + F+    EG  C A+      QP
Sbjct: 362 LQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-EGTLCLAIVATTEQQP 420

Query: 340 IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
               V I GN AQ ++ +GYD D+  V+F   DC
Sbjct: 421 ----VSILGNLAQQNIHVGYDLDAGTVTFAGADC 450


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/364 (32%), Positives = 182/364 (50%), Gaps = 20/364 (5%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY--KQVKPIYNPASSSSYKE 77
             GEY+M+ SIGTPP L I  ++DTGSDL+W++C  C  C      + I+   +SSSYK+
Sbjct: 1   GEGEYMMELSIGTPPQL-IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKK 59

Query: 78  LSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
           L C S  C  + +  +    ++ C Y Y Y D S T G + ++RI+F       +  +FF
Sbjct: 60  LPCNSTHCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           D  +FGCG    G +N  + GL+GLG+   SL  Q+  +LG  KFSYCLV + +  S  S
Sbjct: 120 DGFLFGCGRKLKGDWNFTQ-GLIGLGQKSHSLIQQLGDKLGY-KFSYCLVSYDSPPSAKS 177

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNS--SKLIPYYNSSGA 245
            ++ G+ + + G  VVST ++  +  D+T Y+V L+ I+VG +      K   +  S G 
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGP 237

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
                  ID+G   TLL    Y  + + +   + L P      G  LC+ +        P
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVIL-PTLGNSAGLDLCFNSSGDTSYGFP 296

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +T +F    ++ L   + F     + V C +M    GD+ I GN  Q +  I YD  + 
Sbjct: 297 SVTFYFANQVQLVLPFENIFQVTSRD-VVCLSMDSSGGDLSIIGNMQQQNFHILYDLVAS 355

Query: 365 MVSF 368
            +SF
Sbjct: 356 QISF 359


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 122/369 (33%), Positives = 182/369 (49%), Gaps = 24/369 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S +S  +GEY +   +GTPP   +  + DTGSD++W+QCLPC  CY Q  P++NP+ S
Sbjct: 70  LRSGLSDGSGEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFS 128

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+++ ++C S  C  L    C   Q C Y   Y D S T G  +TE ++FG  +N  ++V
Sbjct: 129 STFQSITCGSSLCQQLLIRGCRRNQ-CLYQVSYGDGSFTVGEFSTETLSFG--SNAVNSV 185

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F      L+GLG+  LS  SQ+  QL  + FSYCL    +  S+   + 
Sbjct: 186 AIGCGHNNQGLFTGAAG-LLGLGKGLLSFPSQV-GQLYGSVFSYCLPTRESTGSV--PLI 241

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN-- 250
           FGN   V+     +T L + +  T+Y+V + GI VG  S S   IP  + S   S GN  
Sbjct: 242 FGN-QAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVS---IPAGSLSLDSSTGNGG 297

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-----QLCYKTPSMAGIA-P 304
           + +D+G   T L    YN + +  R  +      D ++ S       CY     + I  P
Sbjct: 298 VILDSGTAVTRLVTSAYNPMRDAFRAGMP----SDAKMTSGFSLFDTCYDLSGRSSIMLP 353

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            ++  F+GGA + L   +  +P    G +C A  P   +  I GN  Q    + +D    
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGN 413

Query: 365 MVSFKPTDC 373
            V      C
Sbjct: 414 RVGIGANQC 422


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 133/379 (35%), Positives = 187/379 (49%), Gaps = 31/379 (8%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           + VV S +S  +GEY M+  +GTP   ++Y ++DTGSD++W+QC PC  CY Q  P++NP
Sbjct: 122 SGVVISGLSQGSGEYFMRLGVGTPAT-NMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNP 180

Query: 70  ASSSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSN 126
           A S ++  + C S  C  LD  S C S++   C Y   Y D S T G  +TE +TF  + 
Sbjct: 181 AKSKTFATVPCGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGAR 240

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP---FHT 183
              D+V  GCGH+N G+F      L+GLGR  LS  SQ  ++    KFSYCLV      +
Sbjct: 241 --VDHVALGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNG-KFSYCLVDRTSSGS 296

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            S   S + FGNG+ V    V +  L + +  T+Y++ L GISVG        +P  + S
Sbjct: 297 SSKPPSTIVFGNGA-VPKTAVFTPLLTNPKLDTFYYLQLLGISVGG-----SRVPGVSES 350

Query: 244 ----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYK 295
                A   G + ID+G   T L +  Y  L    R+A +L   +  R  S      C+ 
Sbjct: 351 QFKLDATGNGGVIIDSGTSVTRLTQSAYVAL----RDAFRLGATRLKRAPSYSLFDTCFD 406

Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
              M  +  P +  HF GG +V L  ++  IP   +G FCFA     G + I GN  Q  
Sbjct: 407 LSGMTTVKVPTVVFHFTGG-EVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQG 465

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             + YD     V F    C
Sbjct: 466 FRVAYDLVGSRVGFLSRAC 484


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 125/369 (33%), Positives = 178/369 (48%), Gaps = 26/369 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPA 70
             S  +   G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PA
Sbjct: 169 ASSGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPA 226

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
            SS+Y  +SC +  C  LDT  CS    C Y   Y D S + G  A + +T  +S +   
Sbjct: 227 RSSTYANISCAAPACSDLDTRGCSGGN-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVK 284

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
              FGCG  N G+F E   GL+GLGR + SL  Q   + G   F++CL      SS T  
Sbjct: 285 GFRFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSSGTGY 339

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--YYNSSGAISK 248
           + FG GS  + G  ++T +++    T+Y+V + GI VG    S   IP   + ++G I  
Sbjct: 340 LDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLS---IPQSVFTTAGTI-- 394

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PI 305
               +D+G   T LP   Y+ L     +A+    Y+     S L  CY    M+ +A P 
Sbjct: 395 ----VDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPT 450

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           ++  F GGA++ +  +       V  V   FA     GDVGI GN       + YD   +
Sbjct: 451 VSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 510

Query: 365 MVSFKPTDC 373
           +V F P  C
Sbjct: 511 VVGFSPGAC 519


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 128/384 (33%), Positives = 191/384 (49%), Gaps = 28/384 (7%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           ++ V+S      GEY M   +G PP   +  I+DTGSDL W+QC PC  C+ Q  P+++P
Sbjct: 157 DSTVESGAELGAGEYFMDVFVGNPPRHFLL-IIDTGSDLTWLQCKPCKACFDQSGPVFDP 215

Query: 70  ASSSSYKELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
           + S+S+K + C +  C L+      D  S +S + C Y Y Y DSS T G LA E ++  
Sbjct: 216 SQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVS 275

Query: 124 NSNN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            S++       ++V GCGH+N G+  +   GL+GLG+  LS  SQ+ S      FSYCLV
Sbjct: 276 LSDHPSSLEIRDMVIGCGHSNKGL-FQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV 334

Query: 180 PFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKED--KTYYFVTLEGISVGNLSNSSK 235
               + S++S + FG G  +S     +  T  V   +  +T+Y++ ++GI +       +
Sbjct: 335 DRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKI-----DQE 389

Query: 236 LIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
           L+P      AI+    G   ID+G   T L +D Y  +E      I   P  DP     +
Sbjct: 390 LLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISY-PRADPFDILGI 448

Query: 293 CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPP-PVEGVFCFAMQPIDGDVGIFGNF 350
           CY       +  P L+  F  GA++ L   + FI P P E   C A+ P DG + I GNF
Sbjct: 449 CYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MSIIGNF 507

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q ++   YD     + F  TDC+
Sbjct: 508 QQQNIHFLYDVQHARLGFANTDCS 531


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 174/369 (47%), Gaps = 29/369 (7%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +GEY+ K ++GTP +  +  + DT SDL W+QC PC +CY Q  P+++P  S+SY+E+S 
Sbjct: 135 SGEYIAKIAVGTPGVEALLAL-DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSF 193

Query: 81  QSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            +  C  L       + +  C YT GY D S T G    E +TF         +  GCGH
Sbjct: 194 NAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVR-LPRISIGCGH 252

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD-SSITSKMYFGNGS 197
           +N G+F     G++GLGR  +S  +QI        FSYCLV F +   S++S + FG G+
Sbjct: 253 DNKGLFGAPAAGILGLGRGLMSFPNQIDHN---GTFSYCLVDFLSGPGSLSSTLTFGAGA 309

Query: 198 EVSGGGVVST-SLVSKEDKTYYFVTLEGISVGNL------SNSSKLIPYYNSSGAISKGN 250
             +   V  T ++++    T+Y+V L GISVG +          +L PY        +G 
Sbjct: 310 VDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPY------TGRGG 363

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRN-AIKL--TPYQDPRLGSQLCYKTPSMAGI--API 305
           + +D+G   T L +  Y    +  R  A+ L       P      CY T    G+   P 
Sbjct: 364 VIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCY-TVGGRGMKKVPT 422

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQ 364
           ++ HF G  +V L   +  IP    G  CFA     D  V I GN  Q    I YD   +
Sbjct: 423 VSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIGGR 482

Query: 365 MVSFKPTDC 373
            V F P  C
Sbjct: 483 -VGFAPNSC 490


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 128/383 (33%), Positives = 188/383 (49%), Gaps = 26/383 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S  S   GEY +   +GTPP   ++ I+DTGSDL W+QC PC  C++Q  P YNP  S
Sbjct: 159 LESGASLGTGEYFIDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNES 217

Query: 73  SSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATE----RITFG 123
           SSY+ +SC   +C L+ +      C ++ Q C Y Y YAD S T G  A E     +T+ 
Sbjct: 218 SSYRNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWP 277

Query: 124 NSNNFFDNVV---FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           N    F +VV   FGCGH N G F+    GL+GLGR  LS  SQ+ S  G + FSYCL  
Sbjct: 278 NGKEKFKHVVDVMFGCGHWNKGFFH-GAGGLLGLGRGPLSFPSQLQSIYG-HSFSYCLTD 335

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE---DKTYYFVTLEGISV-GNLSNSSK 235
             +++S++SK+ FG   E+     ++ T L++ E   D T+Y++ ++ I V G + +  +
Sbjct: 336 LFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPE 395

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
              +++S G    G   ID+G+  T  P   Y+ ++E     IKL            CY 
Sbjct: 396 KTWHWSSEGV---GGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYN 452

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQ 352
              +M    P    HF  GA       + F     + V C A+   P    + I GN  Q
Sbjct: 453 VSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQ 512

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
            +  I YD     + + P  C +
Sbjct: 513 QNFHILYDVKRSRLGYSPRRCAE 535


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 185/383 (48%), Gaps = 33/383 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY     +GTP    +  ++DTGSDL+W+QC PC +CY Q   +++P  S
Sbjct: 75  VFSGIPFESGEYFALVGVGTPSTKAML-VIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
           S+Y+ + C S QC  L    C S       C Y   Y D S + G LAT+++ F N + +
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFAN-DTY 192

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            +NV  GCG +N G+F ++  GL+G+ R ++S+++Q+    G + F YCL    + S+ +
Sbjct: 193 VNNVTLGCGRDNEGLF-DSAAGLLGVARGKISISTQVAPAYG-SVFEYCLGDRTSRSTRS 250

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI- 246
           S + FG   E        T+L+S   + + Y+V + G SVG      ++  + N+S A+ 
Sbjct: 251 SYLVFGRTPEPP--STAFTALLSNPRRPSLYYVDMAGFSVGG----ERVTGFSNASLALD 304

Query: 247 ---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSM- 299
               +G + +D+G   +   +D Y  L +      +    +       +   CY      
Sbjct: 305 TATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRP 364

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--------VFCFAMQPIDGDVGIFGNFA 351
           A  AP++  HF GGA + L   + F+  PV+G          C   +  D  + + GN  
Sbjct: 365 AASAPLIVLHFAGGADMALPPENYFL--PVDGGRRRAASYRRCLGFEAADDGLSVIGNVQ 422

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
           Q    + +D + + + F P  CT
Sbjct: 423 QQGFRVVFDVEKERIGFAPKGCT 445


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 182/369 (49%), Gaps = 24/369 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S +S  +GEY +   +GTPP   +  + DTGSD++W+QCLPC  CY Q  P++NP+ S
Sbjct: 70  LRSGLSDGSGEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFS 128

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+++ ++C S  C  L    C   Q C Y   Y D S T G  +TE ++FG  +N  ++V
Sbjct: 129 STFQSITCGSSLCQQLLIRGCRRNQ-CLYQVSYGDGSFTVGEFSTETLSFG--SNAVNSV 185

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGHNN G+F      L+GLG+  LS  SQ+  QL  + FSYCL    +  S+   + 
Sbjct: 186 AIGCGHNNQGLFTGAAG-LLGLGKGLLSFPSQV-GQLYGSVFSYCLPTRESTGSV--PLI 241

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN-- 250
           FGN   V+     +T L + +  T+Y+V + GI VG  S +   IP  + S   S GN  
Sbjct: 242 FGN-QAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVN---IPAGSLSLDSSTGNGG 297

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-----QLCYKTPSMAGIA-P 304
           + +D+G   T L    YN + +  R  +      D ++ S       CY     + I  P
Sbjct: 298 VILDSGTAVTRLVTSAYNPMRDAFRAGMP----SDAKMTSGFSLFDTCYDLSGRSSIMLP 353

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            ++  F+GGA + L   +  +P    G +C A  P   +  I GN  Q    + +D    
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGN 413

Query: 365 MVSFKPTDC 373
            V      C
Sbjct: 414 RVGIGANQC 422


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 171/360 (47%), Gaps = 12/360 (3%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S ++  +G+Y  +  +GTP    +Y + DTGSD+ W+QC PC +CY+Q  PI+NP+ SSS
Sbjct: 5   SGIAGGSGDYFARIGVGTP-ARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSS 63

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           +K L+C S  C  L    CS +  C Y   Y D S T G  +TE ++FG   +   +V  
Sbjct: 64  FKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFG--EHAVRSVAM 121

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCG NN G+F+     L+GLGR  LS  SQ  +   A+ FSYCL     +S+I + + FG
Sbjct: 122 GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSY-ASVFSYCLP--RRESAIAASLVFG 177

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
             S V      +  L ++   TYY+V L  I V    +   + P   + G+   G + +D
Sbjct: 178 P-SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAG--SPVNIPPDAFAMGSRGTGGVIVD 234

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
           +G   + L    Y  L +  R+ +         L    CY   SM     P +   FDGG
Sbjct: 235 SGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL-FDTCYDLSSMKTATLPAVVLDFDGG 293

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A +PL      +    EG +C A  P +    I GN  Q    I  D   + +   P  C
Sbjct: 294 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 117/372 (31%), Positives = 179/372 (48%), Gaps = 27/372 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
            +  V      Y++   +GTP   D+  + DTGSDL WVQC PC  CY+Q  P+++P+ S
Sbjct: 127 ARRGVPLGTANYIVSVGLGTPKR-DLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQS 185

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-----NSNN 127
           ++Y  + C +++C  LD+ SCSS + C Y   Y D S T G LA + +T G     +S++
Sbjct: 186 TTYSAVPCGAQECRRLDSGSCSSGK-CRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSD 244

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
                VFGCG ++TG+F + + GL GLGR R+SLASQ  ++ GA  FSYCL      SS 
Sbjct: 245 QLQEFVFGCGDDDTGLFGKAD-GLFGLGRDRVSLASQAAAKYGAG-FSYCL-----PSSS 297

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAI 246
           T++ Y   GS        +  +   +  ++Y++ L GI V     + ++ P  + + G +
Sbjct: 298 TAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAG--RTVRVSPAVFRTPGTV 355

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA- 303
                 ID+G   T LP   Y  L       ++   Y+     S L  CY       +  
Sbjct: 356 ------IDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQI 409

Query: 304 PILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
           P +   FDGGA + L      ++    +    FA    D  + I GN  Q    + YD  
Sbjct: 410 PSVALLFDGGATLNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVA 469

Query: 363 SQMVSFKPTDCT 374
           +Q + F    C+
Sbjct: 470 NQKIGFGAKGCS 481


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 171/360 (47%), Gaps = 12/360 (3%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S ++  +G+Y  +  +GTP    +Y + DTGSD+ W+QC PC +CY+Q  PI+NP+ SSS
Sbjct: 72  SGIAGGSGDYFARIGVGTP-ARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSS 130

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           +K L+C S  C  L    CS +  C Y   Y D S T G  +TE ++FG   +   +V  
Sbjct: 131 FKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFG--EHAVRSVAM 188

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCG NN G+F+     L+GLGR  LS  SQ  +   A+ FSYCL     +S+I + + FG
Sbjct: 189 GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSY-ASVFSYCLP--RRESAIAASLVFG 244

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
             S V      +  L ++   TYY+V L  I V    +   + P   + G+   G + +D
Sbjct: 245 P-SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAG--SPVNIPPDAFAMGSRGTGGVIVD 301

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
           +G   + L    Y  L +  R+ +         L    CY   SM     P +   FDGG
Sbjct: 302 SGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL-FDTCYDLSSMKTATLPAVVLDFDGG 360

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A +PL      +    EG +C A  P +    I GN  Q    I  D   + +   P  C
Sbjct: 361 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 181/364 (49%), Gaps = 20/364 (5%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY--KQVKPIYNPASSSSYKE 77
             GEY+M+ SIGTPP L I  ++DTGSDL+W++C  C  C      + I+   +SSSYK+
Sbjct: 1   GEGEYMMELSIGTPPQL-IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKK 59

Query: 78  LSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
           L C S  C  + +  +    ++ C Y Y Y D S T G + ++RI+F       +  +FF
Sbjct: 60  LPCNSTHCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           D  +FGC     G +N  + GL+GLG+   SL  Q+  +LG  KFSYCLV + +  S  S
Sbjct: 120 DGFLFGCARKLKGDWNFTQ-GLIGLGQKSHSLIQQLGDKLGY-KFSYCLVSYDSPPSAKS 177

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNS--SKLIPYYNSSGA 245
            ++ G+ + + G  VVST ++  +  D+T Y+V L+ I++G +      K   +  S G 
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGP 237

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
                  ID+G   TLL    Y  + + +   + L P      G  LC+ +        P
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVIL-PTLGNSAGLDLCFNSSGDTSYGFP 296

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +T +F    ++ L   + F     + V C +M    GD+ I GN  Q +  I YD  + 
Sbjct: 297 SVTFYFANQVQLVLPFENIFQVTSRD-VVCLSMDSSGGDLSIIGNMQQQNFHILYDLVAS 355

Query: 365 MVSF 368
            +SF
Sbjct: 356 QISF 359


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 174/379 (45%), Gaps = 42/379 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY++K  IGTP        +DT SDL+W+QC PCV CY+Q+ PI+NP  SSSY  + C 
Sbjct: 86  GEYLVKLGIGTPQHY-FSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCS 144

Query: 82  SEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           S+ C  LD   C     Q C Y Y Y+ +++T G LA +++  G   N F  VV GC  +
Sbjct: 145 SDTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVG--GNVFHAVVLGCSDS 202

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           + G       GLVGL R  LSL    LSQL   +F YCL P    S    K+  G G+  
Sbjct: 203 SVGGPPPQASGLVGLARGPLSL----LSQLSVRRFMYCLPPPM--SRTPGKLVLGAGAGA 256

Query: 200 SGGGVVSTSLV-----SKEDKTYYFVTLEGISVGNLSNSSKLIP-------------YYN 241
                VS  +      S    +YY++  +G++VG+ +  +   P               +
Sbjct: 257 DAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGD 316

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYKTPSM 299
                +   M +D  +  + L    Y+ L + +   I+L P   P  RLG  LC+  P  
Sbjct: 317 GGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRL-PRATPSTRLGLDLCFILPEG 375

Query: 300 AGI----APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSD 354
            GI     P ++  FD G  + L     F+    +G + C  +    G V I GN+ Q +
Sbjct: 376 VGIDRVYVPTVSMSFD-GRWLELERDRLFLE---DGRMMCLMIGRTSG-VSILGNYQQQN 430

Query: 355 LFIGYDFDSQMVSFKPTDC 373
           + + Y+     ++F    C
Sbjct: 431 MHVLYNLRRGKITFAKASC 449


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 181/368 (49%), Gaps = 21/368 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTP    ++ ++DTGSD++W+QC PC +CY Q  P++NP  S
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPARY-VFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKS 194

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+  + C S  C  LD+  CS+++ +C Y   Y D S T G  +TE +TF  +      
Sbjct: 195 RSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTR--VGR 252

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V  GCGH+N G+F      L+GLGR RLS  SQI  +  + KFSYCLV   + SS  S M
Sbjct: 253 VALGCGHDNEGLFIGAAG-LLGLGRGRLSFPSQIGRRF-SRKFSYCLVD-RSASSKPSYM 309

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
            FG+ S +S     +  + + +  T+Y+V L G+SVG        +P   +S     +  
Sbjct: 310 VFGD-SAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTR-----VPGITASLFKLDSTG 363

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
            G + ID+G   T L +  Y  L +  R  A  L    +  L    C+       +  P 
Sbjct: 364 NGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL-FDTCFDLSGKTEVKVPT 422

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +  HF  GA V L  ++  IP    G FCFA       + I GN  Q    + YD  +  
Sbjct: 423 VVLHFR-GADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASR 481

Query: 366 VSFKPTDC 373
           V F P  C
Sbjct: 482 VGFAPRGC 489


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 172/368 (46%), Gaps = 17/368 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +S  +GEY ++  +G+PP  + Y +VD+GSD++W+QC PC +CY+Q  P+++PA+S
Sbjct: 122 VVSGISEGSGEYFVRVGVGSPPT-EQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAAS 180

Query: 73  SSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           +S+  + C S  C  L   +  C+    C Y   Y D S T+GVLA E +TFG+S     
Sbjct: 181 ASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTP-VQ 239

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            V  GCGH N G+F     GL+GLG   +SL  Q L       FSYCL     D+   S 
Sbjct: 240 GVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLASRGADAGAGS- 296

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
           + FG    +  G V    L + +  ++Y+V L       L    + +P  +    +++  
Sbjct: 297 LVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLT-----GLGVGGERLPLQDGLFDLTEDG 351

Query: 249 -GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG-SQLCYKTPSMAGI-API 305
            G + +DTG   T LP D Y  L +   + I     + P +     CY     A +  P 
Sbjct: 352 GGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPT 411

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +  +F        +     +     GV+C A       + I GN  Q  + I  D  +  
Sbjct: 412 VALYFGRDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGY 471

Query: 366 VSFKPTDC 373
           V F P+ C
Sbjct: 472 VGFGPSTC 479


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 184/382 (48%), Gaps = 49/382 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPP      I DTGSDL+W QC PC  +C+KQ  P+YNP+SS +++ L C
Sbjct: 90  GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPC 148

Query: 81  QSE--QCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
            S    C     ++ ++      C Y   Y  +  T G+  +E  TFG+S         +
Sbjct: 149 SSALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGI 207

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC + ++  +N +   +             ++SQL A  FSYCL PF  D+   S + 
Sbjct: 208 AFGCSNASSDDWNGSAGLV-----GLGRGGLSLVSQLAAGMFSYCLTPFQ-DTKSKSTLL 261

Query: 193 FG---NGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            G     + ++G GV ST  V    K    TYY++ L GISVG  + +  + P   +  A
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVG--AAALPIPPGAFALRA 319

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR--LGSQLCYKTPSMA--- 300
              G + ID+G   T L    Y R+   VR+ +KL P  D     G  LC+  PS +   
Sbjct: 320 DGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPP 378

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQP-IDGDVGIFGNFAQ 352
              P +T HF GGA + L         PVE       G++C AM+   DG++   GN+ Q
Sbjct: 379 ATLPSMTLHFGGGADMVL---------PVENYMILDGGMWCLAMRSQTDGELSTLGNYQQ 429

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            +L I YD   + +SF P  C+
Sbjct: 430 QNLHILYDVQKETLSFAPAKCS 451


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 119/359 (33%), Positives = 173/359 (48%), Gaps = 26/359 (7%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    I  +VDTGS L W+QC PC V C++Q  P+++P +SSSY  +SC
Sbjct: 135 GNYVTRMGLGTPAKPYIM-VVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSC 193

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + QC+ L T      +CSS  +C Y   Y DSS + G L+ + ++FG  +N   N  +G
Sbjct: 194 STPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFG--SNSVPNFYYG 251

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F  +  GL+GL R +LSL  Q+   LG + FSYCL    +   ++   Y  N
Sbjct: 252 CGQDNEGLFGRSA-GLMGLARNKLSLLYQLAPTLGYS-FSYCLPSSSSSGYLSIGSY--N 307

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
             + S   +VS++L    D + YF+ L G++V          P   SS   S     ID+
Sbjct: 308 PGQYSYTPMVSSTL----DDSLYFIKLSGMTVAG-------KPLAVSSSEYSSLPTIIDS 356

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   Y+ L + V  A+K T   D       C+   + +   P ++  F GGA 
Sbjct: 357 GTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCFVGQASSLRVPAVSMAFSGGAA 416

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  +        C A  P      I GN  Q    + YD  S  + F    CT
Sbjct: 417 LKLSAQNLLVDVD-SSTTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSNRIGFAAGGCT 473


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 115/360 (31%), Positives = 179/360 (49%), Gaps = 28/360 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           ++   SIG PP+  +  ++DTGSDL W+QCLPC +CY Q  P ++P+ SS+Y+  SC+S 
Sbjct: 88  FLANISIGDPPVPQLL-LIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESA 145

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNN 140
              +           C Y   Y D S T+G+LA E++TF  S+       N+VFGCG +N
Sbjct: 146 PHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDN 205

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G    +  G++GLG    S+ ++       +KFSYC       +   + +  GNG+ + 
Sbjct: 206 SGFTQYS--GVLGLGPGTFSIVTRNF----GSKFSYCFGSLIDPTYPHNFLILGNGARIE 259

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           G     T L   +D+  Y++ L+ IS+G   L     +   Y      SKG   IDTG  
Sbjct: 260 GD---PTPLQIFQDR--YYLDLQAISLGEKLLDIEPGIFQRYR-----SKGGTVIDTGCS 309

Query: 259 PTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGA 314
           PT+L ++ Y  L E++   +   L   +D    +  CY+      +   P++T HF GGA
Sbjct: 310 PTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGA 369

Query: 315 KVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           ++ L   S F+       FC AM      D+ + G  AQ +  +GY+  +  V F+ TDC
Sbjct: 370 ELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 118/362 (32%), Positives = 162/362 (44%), Gaps = 12/362 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S     +GEY ++  +G+PP    Y ++D+GSD++WVQC PC +CY+Q  P+++PA S
Sbjct: 126 VVSGTEQGSGEYFVRIGVGSPPRSQ-YVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGS 184

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           ++Y  +SC S  C  LD   C+  + C Y   Y D S T+G LA E +TFG       N+
Sbjct: 185 ATYAGISCDSSVCDRLDNAGCNDGR-CRYEVSYGDGSYTRGTLALETLTFGRV--LIRNI 241

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L+GLG   +S   Q+  Q G   FSYCLV   T+S  T  + 
Sbjct: 242 AIGCGHMNRGMFIGAAG-LLGLGGGAMSFVGQLGGQTGG-AFSYCLVSRGTES--TGTLE 297

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG G+   G   V      +    YY         G      + I      G    G + 
Sbjct: 298 FGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLG---YGGVV 354

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
           +DTG   T LP   Y    +         P  D       CY       +  P ++ +F 
Sbjct: 355 MDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFS 414

Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           GG  + L   +  IP   EG FCFA       + I GN  Q  + I  D  +  V F PT
Sbjct: 415 GGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPT 474

Query: 372 DC 373
            C
Sbjct: 475 IC 476


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 129/372 (34%), Positives = 173/372 (46%), Gaps = 28/372 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +QS  +   G Y++    GTP    +  I+DTGSDL W+QC PC  CY QV  I+ P  S
Sbjct: 126 LQSGTTVGTGNYIVTAGFGTPAKNSLL-IIDTGSDLTWIQCKPCADCYSQVDAIFEPKQS 184

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
           SSYK L C S  C  L T   +        C Y   Y D S ++G  + E +T G+ +  
Sbjct: 185 SSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDS-- 242

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F N  FGCGH NTG+F +   GL+GLG+  LS  SQ  S+ G  +F+YCL P    S+ T
Sbjct: 243 FQNFAFGCGHTNTGLF-KGSSGLLGLGQNSLSFPSQSKSKYGG-QFAYCL-PDFGSSTST 299

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
                G GS  +    V T LVS     T+YFV L GISVG    S   IP       + 
Sbjct: 300 GSFSVGKGSIPA--SAVFTPLVSNFMYPTFYFVGLNGISVGGDRLS---IP----PAVLG 350

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
           +G+  +D+G   T L    YN L+   R+  +  P   P      CY     + +  P +
Sbjct: 351 RGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTI 410

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
           T HF   A V +      +P    G      F  A Q +DG   I GNF Q  + + +D 
Sbjct: 411 TFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQ-MDG-FNIIGNFQQQRMRVAFDT 468

Query: 362 DSQMVSFKPTDC 373
            +  + F    C
Sbjct: 469 GAGRIGFASGSC 480


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 182/382 (47%), Gaps = 41/382 (10%)

Query: 20  ANG----EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           ANG    EY++  +IGTPP   +  I+DTGSDL+W QC PC  C+ +     +P++SS++
Sbjct: 407 ANGVPDTEYLVHLAIGTPPQ-PVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTF 465

Query: 76  KELSCQSEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITF----GNSNN 127
             L C S  C  L   SC       Q C Y Y YAD S+T G L  E  TF    G    
Sbjct: 466 DVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQA 525

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
              ++ FGCG  N G+F  NE G+ G GR  LSL     SQL  + FS+C        S 
Sbjct: 526 TVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLP----SQLKVDNFSHCFTAI--TGSE 579

Query: 188 TSKMYFGNGSEV---SGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            S +  G  + +   + G V ST LV        Y+++L+GI+VG     S  +P   S+
Sbjct: 580 PSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVG-----STRLPIPEST 634

Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLC--YKT 296
            A+ +   G   ID+G   T LP+D Y  + +     ++L P  +      S+LC  +  
Sbjct: 635 FALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRL-PVDNATSSSLSRLCFSFSV 693

Query: 297 PSMAG-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQS 353
           P  A    P L  HF+ GA + L   +        G  V C A+   D D+ I GN+ Q 
Sbjct: 694 PRRAKPDVPKLVLHFE-GATLDLPRENYMFEFEDAGGSVTCLAINAGD-DLTIIGNYQQQ 751

Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
           +L + YD    M+SF P  C +
Sbjct: 752 NLHVLYDLVRNMLSFVPAQCNR 773


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 183/382 (47%), Gaps = 49/382 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPP      I DTGSDL+W QC PC  +C+KQ  P+YNP+SS +++ L C
Sbjct: 90  GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPC 148

Query: 81  QSE--QCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
            S    C     ++ ++      C Y   Y  +  T G+  +E  TFG+S         +
Sbjct: 149 SSALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGI 207

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC + ++  +N +   +             ++SQL A  FSYCL PF  D+   S + 
Sbjct: 208 AFGCSNASSDDWNGSAGLV-----GLGRGGLSLVSQLAAGMFSYCLTPFQ-DTKSKSTLL 261

Query: 193 FG---NGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            G     + ++G GV ST  V    K    TYY++ L GISVG    +  + P   +  A
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVG--PAALPIPPGAFALRA 319

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR--LGSQLCYKTPSMA--- 300
              G + ID+G   T L    Y R+   VR+ +KL P  D     G  LC+  PS +   
Sbjct: 320 DGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPP 378

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQP-IDGDVGIFGNFAQ 352
              P +T HF GGA + L         PVE       G++C AM+   DG++   GN+ Q
Sbjct: 379 ATLPSMTLHFGGGADMVL---------PVENYMILDGGMWCLAMRSQTDGELSTLGNYQQ 429

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            +L I YD   + +SF P  C+
Sbjct: 430 QNLHILYDVQKETLSFAPAKCS 451


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 183/382 (47%), Gaps = 49/382 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           GEY+M  +IGTPP      I DTGSDL+W QC PC  +C+KQ  P+YNP+SS +++ L C
Sbjct: 95  GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPC 153

Query: 81  QSE--QCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
            S    C     ++ ++      C Y   Y  +  T G+  +E  TFG+S         +
Sbjct: 154 SSALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGI 212

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC + ++  +N +   +             ++SQL A  FSYCL PF  D+   S + 
Sbjct: 213 AFGCSNASSDDWNGSAGLV-----GLGRGGLSLVSQLAAGMFSYCLTPFQ-DTKSKSTLL 266

Query: 193 FG---NGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            G     + ++G GV ST  V    K    TYY++ L GISVG    +  + P   +  A
Sbjct: 267 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVG--PAALPIPPGAFALRA 324

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR--LGSQLCYKTPSMA--- 300
              G + ID+G   T L    Y R+   VR+ +KL P  D     G  LC+  PS +   
Sbjct: 325 DGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPP 383

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQP-IDGDVGIFGNFAQ 352
              P +T HF GGA + L         PVE       G++C AM+   DG++   GN+ Q
Sbjct: 384 ATLPSMTLHFGGGADMVL---------PVENYMILDGGMWCLAMRSQTDGELSTLGNYQQ 434

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            +L I YD   + +SF P  C+
Sbjct: 435 QNLHILYDVQKETLSFAPAKCS 456


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 174/365 (47%), Gaps = 22/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           S  +   G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PA S
Sbjct: 170 SGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARS 227

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC +  C  LDT  CS    C Y   Y D S + G  A + +T  +S +     
Sbjct: 228 STYANVSCAAPACFDLDTRGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 285

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G+F E   GL+GLGR + SL  Q   + G   F++CL      SS T  + 
Sbjct: 286 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSSGTGYLD 340

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG GS  + G  ++T +++    T+Y+V + GI VG      +L+    S    +     
Sbjct: 341 FGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGG-----QLLSIPQS--VFATAGTI 393

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
           +D+G   T LP   Y+ L     +A+    Y+     S L  CY    M+ +A P ++  
Sbjct: 394 VDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 453

Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F GGA + +  +       V  V   FA     GDVGI GN       + YD   ++V F
Sbjct: 454 FQGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 513

Query: 369 KPTDC 373
            P  C
Sbjct: 514 SPGAC 518


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 118/374 (31%), Positives = 181/374 (48%), Gaps = 31/374 (8%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +GEY+ K ++GTP +  +  + DT SDL W+QC PC +CY Q  P+++P  S+SY E++ 
Sbjct: 131 SGEYMAKIAVGTPAVQALLAL-DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNY 189

Query: 81  QSEQCHLLDTVSC--SSQQLCNYTYGYAD----SSLTKGVLATERITF-GNSNNFFDNVV 133
            +  C  L       + +  C YT  Y D    +S + G L  E +TF G     + ++ 
Sbjct: 190 DAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSI- 248

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD-SSITSKM 191
            GCGH+N G+F     G++GLGR ++S+  QI + LG N  FSYCLV F +   S +S +
Sbjct: 249 -GCGHDNKGLFGAPAAGILGLGRGQISIPHQI-AFLGYNASFSYCLVDFISGPGSPSSTL 306

Query: 192 YFGNGS-EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNL------SNSSKLIPYYNSSG 244
            FG G+ + S     + +++++   T+Y+V L G+SVG +          +L PY     
Sbjct: 307 TFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPY----- 361

Query: 245 AISKGNMFIDTGAPPTLLPKDFY---NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
              +G + +D+G   T L +  Y            ++       P      CY     AG
Sbjct: 362 -TGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAG 420

Query: 302 I-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGY 359
           +  P ++ HF GG +V L   +  IP    G  CFA     D  V + GN  Q    + Y
Sbjct: 421 VKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVY 480

Query: 360 DFDSQMVSFKPTDC 373
           D   Q V F P +C
Sbjct: 481 DLAGQRVGFAPNNC 494


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 114/366 (31%), Positives = 171/366 (46%), Gaps = 20/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP  + Y ++D+GSD++WVQC PC +CY+Q  P+++PA S
Sbjct: 132 VISGMEAGSGEYFVRIGVGSPPR-NQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADS 190

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  +SC S+ C  L+   C++ + C Y   Y D S TKG LA E +T G       +V
Sbjct: 191 SSFAGVSCGSDVCDRLENTGCNAGR-CRYEVSYGDGSYTKGTLALETLTVGQV--MIRDV 247

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +S   Q+  Q G   FSYCLV   T S  T  + 
Sbjct: 248 AIGCGHTNQGMFIGAAGLLGLGGGS-MSFIGQLGGQTGG-AFSYCLVSRGTGS--TGALE 303

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS----SKLIPYYNSSGAISK 248
           FG G+   G   +S  + +    ++Y++ L GI VG +  S    +  +  Y ++G +  
Sbjct: 304 FGRGALPVGATWISL-IRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVV-- 360

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
               +DTG   T  P   Y    +         P          CY       +  P ++
Sbjct: 361 ----MDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVS 416

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            +F  G  + L   +  IP    G FC A  P    + I GN  Q  + I +D  +  V 
Sbjct: 417 FYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 476

Query: 368 FKPTDC 373
           F P  C
Sbjct: 477 FGPNIC 482


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 174/372 (46%), Gaps = 30/372 (8%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYN 68
           VV      +  EY+ +  +G P  L  Y + DTGSD+ W+QC PC     CYKQ  PI++
Sbjct: 136 VVSGQSKGSGAEYLAQIGVGQPVKL-FYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFD 194

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           P SSSSY  LSC S+QC LLD  +C+S   C Y   Y D S T G LATE ++FGNSN+ 
Sbjct: 195 PKSSSSYSPLSCNSQQCKLLDKANCNSDT-CIYQVHYGDGSFTTGELATETLSFGNSNS- 252

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N+  GCGH+N G+F                 A  + SQL A+ FSYCLV   +DSS T
Sbjct: 253 IPNLPIGCGHDNEGLFAGGAG-----LIGLGGGAISLSSQLKASSFSYCLVNLDSDSSST 307

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
            +      S +    + S  + +    +Y +V + GISVG      K +P   +   I +
Sbjct: 308 LEF----NSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG-----KTLPISPTRFEIDE 358

Query: 249 ---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI 302
              G + +D+G   + LP D Y  L E     +KLT    P  G  +   CY     + +
Sbjct: 359 SGLGGIIVDSGTIISRLPSDVYESLREAF---VKLTSSLSPAPGISVFDTCYNFSGQSNV 415

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P +      G  + L   +  I     G +C A       + I G+F Q  + + YD 
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475

Query: 362 DSQMVSFKPTDC 373
            + +V F    C
Sbjct: 476 TNSLVGFSTNKC 487


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 170/372 (45%), Gaps = 22/372 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S +   +GEY  +  +GTP    +  ++DTGSD++W+QC PC  CY Q   +++P  S S
Sbjct: 119 SGLPQGSGEYFAQVGVGTPATTALM-VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRS 177

Query: 75  YKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           Y  + C +  C  LD+  C  ++  C Y   Y D S+T G  A+E +TF         V 
Sbjct: 178 YAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR-VQRVA 236

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSITS 189
            GCGH+N G+F      L      RLS  SQI    G   FSYCLV         S+ +S
Sbjct: 237 IGCGHDNEGLFIAASGLLGLGR-GRLSFPSQIARSFG-RSFSYCLVDRTSSVRPSSTRSS 294

Query: 190 KMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
            + FG G+  +  G   T +  +    T+Y+V L G SVG              +    +
Sbjct: 295 TVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 354

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI 302
           G + +D+G   T L +  Y  + +  R A   ++++P      G  L   CY       +
Sbjct: 355 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVV 409

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P ++ H  GGA V L   +  IP    G FCFAM   DG V I GN  Q    + +D 
Sbjct: 410 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 469

Query: 362 DSQMVSFKPTDC 373
           D+Q V F P  C
Sbjct: 470 DAQRVGFVPKSC 481


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 135/374 (36%), Positives = 185/374 (49%), Gaps = 36/374 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
           VQS  S  +G+Y +   +GTP   +   I DTGSDL W QC PC + CYKQ +P  +P  
Sbjct: 122 VQSGASIGSGDYAVTVGLGTPKK-EFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTK 180

Query: 72  SSSYKELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           S+SYK +SC S  C LLDT    SCSS   C Y   Y D S + G  ATE +T  +S+N 
Sbjct: 181 STSYKNISCSSAFCKLLDTEGGESCSSPT-CLYQVQYGDGSYSIGFFATETLTL-SSSNV 238

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F N +FGCG  N+G+F     GL+GLGRT+LSL SQ  +Q     FSYCL      +S +
Sbjct: 239 FKNFLFGCGQQNSGLF-RGAAGLLGLGRTKLSLPSQT-AQKYKKLFSYCL-----PASSS 291

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKT--YYFVTLEGISVG--NLSNSSKLIPYYNSSG 244
           SK Y   G +VS    V  + +S++ K+  +Y + +  +SVG   LS  + +   +++SG
Sbjct: 292 SKGYLSFGGQVS--KTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASI---FSTSG 346

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA- 303
            +      ID+G   T LP   Y+ L    +  +   P  D       CY       I  
Sbjct: 347 TV------IDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKI 400

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           P +   F GG ++ +  +   I  PV G+      FA    D    IFGN  Q    + Y
Sbjct: 401 PKVGVSFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVY 458

Query: 360 DFDSQMVSFKPTDC 373
           D     V F P+ C
Sbjct: 459 DDAKGRVGFAPSGC 472


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 170/372 (45%), Gaps = 22/372 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S +   +GEY  +  +GTP    +  ++DTGSD++W+QC PC  CY Q   +++P  S S
Sbjct: 113 SGLPQGSGEYFAQVGVGTPATTALM-VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRS 171

Query: 75  YKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           Y  + C +  C  LD+  C  ++  C Y   Y D S+T G  A+E +TF         V 
Sbjct: 172 YAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR-VQRVA 230

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSITS 189
            GCGH+N G+F      L      RLS  SQI    G   FSYCLV         S+ +S
Sbjct: 231 IGCGHDNEGLFIAASGLLGLGR-GRLSFPSQIARSFG-RSFSYCLVDRTSSVRPSSTRSS 288

Query: 190 KMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
            + FG G+  +  G   T +  +    T+Y+V L G SVG              +    +
Sbjct: 289 TVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 348

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI 302
           G + +D+G   T L +  Y  + +  R A   ++++P      G  L   CY       +
Sbjct: 349 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVV 403

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P ++ H  GGA V L   +  IP    G FCFAM   DG V I GN  Q    + +D 
Sbjct: 404 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 463

Query: 362 DSQMVSFKPTDC 373
           D+Q V F P  C
Sbjct: 464 DAQRVGFVPKSC 475


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 167/366 (45%), Gaps = 20/366 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP  + Y ++D+GSD++WVQC PC QCY Q  P++NPA S
Sbjct: 125 VVSGMEQGSGEYFVRIGVGSPPR-NQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADS 183

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           SS+  +SC S  C  +D  +C   + C Y   Y D S TKG LA E ITFG +     NV
Sbjct: 184 SSFSGVSCASTVCSHVDNAACHEGR-CRYEVSYGDGSYTKGTLALETITFGRT--LIRNV 240

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH+N G+F      L+GLG   +S   Q+  Q G   FSYCLV    +SS    + 
Sbjct: 241 AIGCGHHNQGMFVGAAG-LLGLGGGPMSFVGQLGGQTGG-AFSYCLVSRGIESS--GLLE 296

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG  +   G   V      +    YY         G   + S+ +   +  G    G + 
Sbjct: 297 FGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELG---DGGVV 353

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILT 307
           +DTG   T LP   Y    E  R+         PR         CY       +  P ++
Sbjct: 354 MDTGTAVTRLPTVAY----EAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVS 409

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            +F GG  + L   +  IP    G FCFA  P    + I GN  Q  + I  D  +  V 
Sbjct: 410 FYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVG 469

Query: 368 FKPTDC 373
           F P  C
Sbjct: 470 FGPNVC 475


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 115/356 (32%), Positives = 171/356 (48%), Gaps = 19/356 (5%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y     +GTP   D+   +DTGSD  W+QC PC  CY+Q + +++P+ SS+Y +++C S 
Sbjct: 134 YFTSLRLGTP-ATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSR 192

Query: 84  QCHLLDTV---SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
           +C  L +    +CSS + C Y   YAD S T G LA + +T  +  +     VFGCGHNN
Sbjct: 193 ECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTL-SPTDAVPGFVFGCGHNN 251

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G F E + GL+GLGR + SL+SQ+ ++ GA  FSYCL    +  S T  + F   +  +
Sbjct: 252 AGSFGEID-GLLGLGRGKASLSSQVAARYGAG-FSYCLP---SSPSATGYLSFSGAAAAA 306

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
                 T +V+ +  ++Y++ L GI+V     + K+ P   ++ A       ID+G   +
Sbjct: 307 PTNAQFTEMVAGQHPSFYYLNLTGITVAG--RAIKVPPSVFATAA----GTIIDSGTAFS 360

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLI 319
            LP   Y  L   VR+A+              CY       +  P +   F  GA V L 
Sbjct: 361 CLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLH 420

Query: 320 HTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            +            C A    P D  +G+ GN  Q  L + YD D+Q V F    C
Sbjct: 421 PSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 121/382 (31%), Positives = 179/382 (46%), Gaps = 35/382 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  +  +G+Y + F +GTPP      IVD+GSDL+WVQC PC+QCY Q  P+Y P++S
Sbjct: 54  VVSGSTLGSGQYFVDFFLGTPPQ-KFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNS 112

Query: 73  SSYKELSCQSEQCHLLDTVS---CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNN 127
           S++  + C S +C L+       C       C Y Y YAD+SL+KGV A E  T  +   
Sbjct: 113 STFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVR- 171

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
             D V FGCG +N G F     G++GLG+  LS  SQ+    G NKF+YCLV +   +S+
Sbjct: 172 -IDKVAFGCGRDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYG-NKFAYCLVNYLDPTSV 228

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGN----LSNSSKLIPYYNS 242
           +S + FG+    +   +  T +VS   + T Y+V +E + VG     +S+S+  + +  +
Sbjct: 229 SSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGN 288

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-----P 297
            G+I      +    PP    ++     ++ VR      P      G  LC        P
Sbjct: 289 GGSIFDSGTTVTYWLPPAY--RNILAAFDKNVR-----YPRAASVQGLDLCVDVTGVDQP 341

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIF---GNFAQSD 354
           S      +L     GG  V       +       V C AM  +   VG F   GN  Q +
Sbjct: 342 SFPSFTIVL-----GGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQN 396

Query: 355 LFIGYDFDSQMVSFKPTDCTKQ 376
             + YD +   + F P  C+  
Sbjct: 397 FLVQYDREENRIGFAPAKCSSH 418


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 118/372 (31%), Positives = 170/372 (45%), Gaps = 22/372 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S +   +GEY  +  +GTP    +  ++DTGSD++W+QC PC  CY Q   +++P  S S
Sbjct: 113 SGLPQGSGEYFAQVGVGTPATTALM-VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRS 171

Query: 75  YKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           Y  + C +  C  LD+  C  ++  C Y   Y D S+T G  A+E +TF         V 
Sbjct: 172 YAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR-VQRVA 230

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSITS 189
            GCGH+N G+F      L      RLS  +QI    G   FSYCLV         S+ +S
Sbjct: 231 IGCGHDNEGLFIAASGLLGLGR-GRLSFPTQIARSFG-RSFSYCLVDRTSSVRPSSTRSS 288

Query: 190 KMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
            + FG G+  +  G   T +  +    T+Y+V L G SVG              +    +
Sbjct: 289 TVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 348

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI 302
           G + +D+G   T L +  Y  + +  R A   ++++P      G  L   CY       +
Sbjct: 349 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVV 403

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P ++ H  GGA V L   +  IP    G FCFAM   DG V I GN  Q    + +D 
Sbjct: 404 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 463

Query: 362 DSQMVSFKPTDC 373
           D+Q V F P  C
Sbjct: 464 DAQRVGFVPKSC 475


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 174/372 (46%), Gaps = 30/372 (8%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYN 68
           VV      +  EY+ +  +G P  L  Y + DTGSD+ W+QC PC     CYKQ  PI++
Sbjct: 136 VVSGQSKGSGAEYLAQIGVGQPVKL-FYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFD 194

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           P SSSSY  LSC S+QC LLD  +C+S   C Y   Y D S T G LATE ++FGNSN+ 
Sbjct: 195 PKSSSSYSPLSCNSQQCKLLDKANCNSDT-CIYQVHYGDGSFTTGELATETLSFGNSNS- 252

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N+  GCGH+N G+F                 A  + SQL A+ FSYCLV   +DSS T
Sbjct: 253 IPNLPIGCGHDNEGLFAGGAG-----LIGLGGGAISLSSQLKASSFSYCLVNLDSDSSST 307

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
            +      S +    + S  + +    +Y +V + GISVG      K +P   +   I +
Sbjct: 308 LEF----NSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG-----KTLPISPTRFEIDE 358

Query: 249 ---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI 302
              G + +D+G   + LP D Y  L E     +KLT    P  G  +   CY     + +
Sbjct: 359 SGLGGIIVDSGTIISRLPSDVYESLREAF---VKLTSSLSPAPGISVFDTCYNFSGQSNV 415

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P +      G  + L   +  I     G +C A       + I G+F Q  + + YD 
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475

Query: 362 DSQMVSFKPTDC 373
            + +V F    C
Sbjct: 476 TNSIVGFSTNKC 487


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 177/381 (46%), Gaps = 44/381 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY++K   GTP        +DT SDL+W+QC PCV CY+Q+ P++NP  SSSY  + C 
Sbjct: 90  GEYLVKLGTGTPQHF-FSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCT 148

Query: 82  SEQCHLLDTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           S+ C  LD   C       C YTY Y+   +TKG LA +++  G   + F  VVFGC  +
Sbjct: 149 SDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIG--GDVFHAVVFGCSDS 206

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           + G       GLVGLGR  LSL    +SQL  ++F YCL P  + +S   K+  G G++ 
Sbjct: 207 SVGGPAAQASGLVGLGRGPLSL----VSQLSVHRFMYCLPPPMSRTS--GKLVLGAGADA 260

Query: 200 ---SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG------- 249
                  V  T   S    +YY++ L+G++VG+ +  +        SG    G       
Sbjct: 261 VRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGG 320

Query: 250 ----------NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYKTP 297
                      M +D  +  + L    Y+ L + +   I+L P   P  RLG  LC+  P
Sbjct: 321 IVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRL-PRATPSLRLGLDLCFILP 379

Query: 298 SMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQ 352
              G+     P ++  FD G  + L     F+    +G + C  +    G V I GNF  
Sbjct: 380 EGVGMDRVYVPTVSLSFD-GRWLELDRDRLFV---TDGRMMCLMIGRTSG-VSILGNFQL 434

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
            ++ + ++     ++F    C
Sbjct: 435 QNMRVLFNLRRGKITFAKASC 455


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 149/298 (50%), Gaps = 37/298 (12%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           A  EY++  ++GTPP   +   +DTGSDL+W QC PC  C+ Q  P+ +PA+SS+Y  L 
Sbjct: 82  ATNEYLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALP 140

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--------FFDN 131
           C + +C  L   SC  +  C Y Y Y D S+T G +AT+R TFG++              
Sbjct: 141 CGAPRCRALPFTSCGGRS-CVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRR 199

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSIT-- 188
           + FGCGH N GVF  NE G+ G GR R SL     SQL A  FSYC    F + SSI   
Sbjct: 200 LTFGCGHFNKGVFQSNETGIAGFGRGRWSLP----SQLNATSFSYCFTSMFDSKSSIVTL 255

Query: 189 ----SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKL-IPYYNS 242
               + +Y    S    G V +T L     + + YF++L+GISVG     ++L +P    
Sbjct: 256 GGAPAALY----SHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGK----TRLPVPETKF 307

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
              I      ID+GA  T LP++ Y  ++ +    + L P         +C+  P  A
Sbjct: 308 RSTI------IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSA 359


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 123/380 (32%), Positives = 182/380 (47%), Gaps = 31/380 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S  +  +G+Y + F +GTPP      IVD+GSDL+WVQC PC QCY Q  P+Y P++S
Sbjct: 53  VVSGSTLGSGQYFVDFFLGTPPQ-KFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNS 111

Query: 73  SSYKELSCQSEQCHLLDTVS---CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNN 127
           S++  + C S  C L+       C  +    C Y Y YAD+S +KGV A E  T      
Sbjct: 112 STFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGVR- 170

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
             D V FGCG +N G F     G++GLG+  LS  SQ+    G NKF+YCLV +   +S+
Sbjct: 171 -IDKVAFGCGSDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYG-NKFAYCLVNYLDPTSV 227

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           +S + FG+    +   +  T +VS  +  T Y+V +E ++VG      K +P  +S+  I
Sbjct: 228 SSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGG-----KSLPISDSAWEI 282

Query: 247 S---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
                G    D+G   T      Y+ +     + +   P  +   G  LC +   + G+ 
Sbjct: 283 DLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHY-PRAESVQGLDLCVE---LTGVD 338

Query: 304 ----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIF---GNFAQSDLF 356
               P  T  FD GA       + F+      V C AM  +   +G F   GN  Q + F
Sbjct: 339 QPSFPSFTIEFDDGAVFQPEAENYFV-DVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFF 397

Query: 357 IGYDFDSQMVSFKPTDCTKQ 376
           + YD +  ++ F P  C+  
Sbjct: 398 VQYDREENLIGFAPAKCSSH 417


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 130/379 (34%), Positives = 184/379 (48%), Gaps = 31/379 (8%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           +  V S +S  +GEY M+  +GTP   ++Y ++DTGSD++W+QC PC  CY Q   I++P
Sbjct: 124 SGAVISGLSQGSGEYFMRLGVGTPAT-NVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDP 182

Query: 70  ASSSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSN 126
             S ++  + C S  C  LD  S C +++   C Y   Y D S T+G  +TE +TF  + 
Sbjct: 183 KKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR 242

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP---FHT 183
              D+V  GCGH+N G+F      L+GLGR  LS  SQ  S+    KFSYCLV      +
Sbjct: 243 --VDHVPLGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKSRYNG-KFSYCLVDRTSSGS 298

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            S   S + FGN + V    V +  L + +  T+Y++ L GISVG        +P  + S
Sbjct: 299 SSKPPSTIVFGNDA-VPKTSVFTPLLTNPKLDTFYYLQLLGISVGG-----SRVPGVSES 352

Query: 244 ----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYK 295
                A   G + ID+G   T L +  Y  L    R+A +L   +  R  S      C+ 
Sbjct: 353 QFKLDATGNGGVIIDSGTSVTRLTQSAYVAL----RDAFRLGATKLKRAPSYSLFDTCFD 408

Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
              M  +  P +  HF GG +V L  ++  IP   EG FCFA     G + I GN  Q  
Sbjct: 409 LSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 467

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             + YD     V F    C
Sbjct: 468 FRVAYDLVGSRVGFLSRAC 486


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/379 (34%), Positives = 184/379 (48%), Gaps = 31/379 (8%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           +  V S +S  +GEY M+  +GTP   ++Y ++DTGSD++W+QC PC  CY Q   I++P
Sbjct: 121 SGAVISGLSQGSGEYFMRLGVGTPAT-NVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDP 179

Query: 70  ASSSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSN 126
             S ++  + C S  C  LD  S C +++   C Y   Y D S T+G  +TE +TF  + 
Sbjct: 180 KKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR 239

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP---FHT 183
              D+V  GCGH+N G+F      L+GLGR  LS  SQ  ++    KFSYCLV      +
Sbjct: 240 --VDHVPLGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNG-KFSYCLVDRTSSGS 295

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            S   S + FGN + V    V +  L + +  T+Y++ L GISVG        +P  + S
Sbjct: 296 SSKPPSTIVFGNAA-VPKTSVFTPLLTNPKLDTFYYLQLLGISVGG-----SRVPGVSES 349

Query: 244 ----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYK 295
                A   G + ID+G   T L +  Y  L    R+A +L   +  R  S      C+ 
Sbjct: 350 QFKLDATGNGGVIIDSGTSVTRLTQPAYVAL----RDAFRLGATKLKRAPSYSLFDTCFD 405

Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
              M  +  P +  HF GG +V L  ++  IP   EG FCFA     G + I GN  Q  
Sbjct: 406 LSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 464

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             + YD     V F    C
Sbjct: 465 FRVAYDLVGSRVGFLSRAC 483


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 131/369 (35%), Positives = 174/369 (47%), Gaps = 42/369 (11%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
           + TAN  YV+    GTP       I DTGS++ W+QC PCV  CY Q +P+++P  SS+Y
Sbjct: 11  IGTAN--YVITVGFGTPKKNQTV-IFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTY 67

Query: 76  KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           + +SC S  C  L +  CS    C Y   Y D S T G LATE  T   + N F+N +FG
Sbjct: 68  RNISCTSAACTGLSSRGCSGST-CVYGVTYGDGSSTVGFLATETFTLA-AGNVFNNFIFG 125

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG NN G+F     GL+GLGR+  SL SQ+ + LG N FSYCL    + SS T  +  GN
Sbjct: 126 CGQNNQGLF-TGAAGLIGLGRSPYSLNSQLATSLG-NIFSYCL---PSTSSATGYLNIGN 180

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFI 253
                G    +  L +    T YF+ L GISVG   L+ SS +   + S G I      I
Sbjct: 181 PLRTPG---YTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTV---FQSVGTI------I 228

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHF 310
           D+G   T LP   Y  L    R A  +T Y      S L  CY       +  P +  H+
Sbjct: 229 DSGTVITRLPPTAYGALRTAFRAA--MTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHY 286

Query: 311 DG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            G      GA V  + +S+ +         FA       +GI GN  Q  + + YD   +
Sbjct: 287 TGLDVTIPGAGVFYVISSSQV------CLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALK 340

Query: 365 MVSFKPTDC 373
            + F    C
Sbjct: 341 RIGFAAGAC 349


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 170/359 (47%), Gaps = 24/359 (6%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
            G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PA SS+Y  +
Sbjct: 176 TGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANV 233

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
           SC +  C  LDT  CS    C Y   Y D S + G  A + +T  +S +      FGCG 
Sbjct: 234 SCAAPACSDLDTRGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCGE 291

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
            N G+F E   GL+GLGR + SL  Q   + G   F++CL      S+ T  + FG GS 
Sbjct: 292 RNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLDFGAGSP 346

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
            +   + +T ++     T+Y+V L GI VG      +L+  Y      +     +D+G  
Sbjct: 347 AA--RLTTTPMLVDNGPTFYYVGLTGIRVGG-----RLL--YIPQSVFATAGTIVDSGTV 397

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
            T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  F GGA+
Sbjct: 398 ITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGAR 457

Query: 316 VPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + +  +          V   FA     GDVGI GN       + YD   ++VSF P  C
Sbjct: 458 LDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 121/360 (33%), Positives = 177/360 (49%), Gaps = 28/360 (7%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    +  +VDTGS L W+QC PC V C++Q  P++NP SSS+Y  + C
Sbjct: 120 GNYVTRMGLGTPATQYVM-VVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGC 178

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            ++QC       L+  +CSS  +C Y   Y DSS + G L+ + ++FG+++    N  +G
Sbjct: 179 SAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--LPNFYYG 236

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F  +  GL+GL R +LSL  Q+   LG + F+YCL    + SS    +   N
Sbjct: 237 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLGYS-FTYCLP--SSSSSGYLSLGSYN 292

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFID 254
             + S   +VS+SL    D + YF+ L G++V GN        P   SS A S     ID
Sbjct: 293 PGQYSYTPMVSSSL----DDSLYFIKLSGMTVAGN--------PLSVSSSAYSSLPTIID 340

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGA 314
           +G   T LP   Y+ L + V  A+K T           C+K  +    AP +T  F GGA
Sbjct: 341 SGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGA 400

Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            + L   +  +    +   C A  P      I GN  Q    + YD  S  + F    C+
Sbjct: 401 ALKLSAQNLLVDVD-DSTTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 458


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 160/345 (46%), Gaps = 25/345 (7%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT------VSCS 94
           IVDTGSDL WVQC PC +CY Q  P++NP+ S SY+ + C S  C  L        V  S
Sbjct: 80  IVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGS 139

Query: 95  SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
           +   CNY   Y D S T G +  E +  GN+    +N +FGCG  N G+F     GLVGL
Sbjct: 140 NPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT--VNNFIFGCGRKNQGLFG-GASGLVGL 196

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE 213
           GRT LSL SQI    G   FSYCL     ++S  S +  GN S       +S T ++   
Sbjct: 197 GRTDLSLISQISPMFGG-VFSYCLPTTEAEAS-GSLVMGGNSSVYKNTTPISYTRMIHNP 254

Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
              +YF+ L GI+VG +   +          +  K  M ID+G   + LP   Y  L+ +
Sbjct: 255 LLPFYFLNLTGITVGGVEVQAP---------SFGKDRMIIDSGTVISRLPPSIYQALKAE 305

Query: 274 VRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV 332
                   P     +    C+       +  P +  +F+G A++ +  T  F     +  
Sbjct: 306 FVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDAS 365

Query: 333 -FCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             C A+   P + +VGI GN+ Q +  I YD    M+ F    C+
Sbjct: 366 QVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 126/367 (34%), Positives = 187/367 (50%), Gaps = 33/367 (8%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPI---YNPASSSSYKE 77
           GEY+M F+IG P    + G +DT + L+WVQC  C  QC  + + +   +  + S +Y+ 
Sbjct: 73  GEYLMSFNIGNPSS-QVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEM 131

Query: 78  LSCQSEQCHLLDTV-SC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV--- 132
             C S  C+ L    +C SS + C Y   Y D+  T G+L+++   F  S+    +V   
Sbjct: 132 EPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFL 191

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC         ++  G VGL +T LSL    +SQLG  KFSYCLVPF+   S TSKMY
Sbjct: 192 NFGCSEAPLTGDEQSYTGNVGLNQTPLSL----ISQLGIKKFSYCLVPFNNLGS-TSKMY 246

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI--SKGN 250
           FG+    SGG    T L+      YY V + GIS+GN        P+++    +   +  
Sbjct: 247 FGSLPVTSGG---QTPLLYPNSDAYY-VKVLGISIGNDE------PHFDGVFDVYEVRDG 296

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYKTPSMAGIA--PIL 306
             IDTG   + L  D ++ L  +    +K  P +  DP+   +LC++  +   +   P +
Sbjct: 297 WIIDTGITYSSLETDAFDSLLAKFL-TLKDFPQRKDDPKERFELCFELQNANDLESFPDV 355

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           T HFDG A + L   STF+    +G+FC A+      V I GNF   +  +GYD ++Q++
Sbjct: 356 TVHFDG-ADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVI 414

Query: 367 SFKPTDC 373
           SF P DC
Sbjct: 415 SFAPVDC 421


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 165/367 (44%), Gaps = 24/367 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            Q  +S   G YV+   +GTP     Y ++ DTGSDL WVQC PC  CY+Q  P+++P+ 
Sbjct: 138 AQRGISLGTGNYVVSVGLGTP--AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSL 195

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS+Y  ++C + +C  LD   CSS   C Y   Y D S T G L  + +T   S+     
Sbjct: 196 SSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT-LPG 254

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
            VFGCG  N G+F + + GL GLGR ++SL SQ     G   F+YCL      SS + + 
Sbjct: 255 FVFGCGDQNAGLFGQVD-GLFGLGREKVSLPSQGAPSYGPG-FTYCL-----PSSSSGRG 307

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
           Y   G          T+L      ++Y++ L GI VG     +  IP    +   + G  
Sbjct: 308 YLSLGGAPPANAQF-TALADGATPSFYYIDLVGIKVG---GRAIRIPATAFA---AAGGT 360

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK-TPSMAGIAPILTA 308
            ID+G   T LP   Y  L      A  +  Y+     S L  CY  T       P +  
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAF--ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVEL 418

Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GGA V L  T   ++    +    FA    D  + I GN  Q    + YD  +Q + 
Sbjct: 419 AFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIG 478

Query: 368 FKPTDCT 374
           F    C+
Sbjct: 479 FGAKGCS 485


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 119/362 (32%), Positives = 171/362 (47%), Gaps = 30/362 (8%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    I  +VDTGS L W+QC PC V C++Q  P+++P +SSSY  +SC
Sbjct: 115 GNYVTRMGLGTPAKPYIM-VVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSC 173

Query: 81  QSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S QC  L T +     CS   +C Y   Y DSS + G L+ + ++FG   N   N  +G
Sbjct: 174 SSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFG--ANSVPNFYYG 231

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F  +  GL+GL R +LSL  Q+   LG + FSYCL       S +S  Y   
Sbjct: 232 CGQDNEGLFGRSA-GLMGLARNKLSLLYQLAPTLGYS-FSYCL------PSTSSSGYLSI 283

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           GS   GG   +  + +  D + YF++L G++V          P   SS   +     ID+
Sbjct: 284 GSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGK-------PLAVSSSEYTSLPTIIDS 336

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK-TPSMAGIAPILTAHFDGG 313
           G   T LP   Y  L + V  A+K +  +         C++   S     P ++  F GG
Sbjct: 337 GTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGG 396

Query: 314 AKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           A + L   +  +   V+G   C A  P      I GN  Q    + YD  S  + F    
Sbjct: 397 ATLKLSAGNLLV--DVDGATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSNRIGFAAAG 453

Query: 373 CT 374
           C+
Sbjct: 454 CS 455


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 119/381 (31%), Positives = 181/381 (47%), Gaps = 33/381 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V   +GEY++   +GTPP      I+DTGSDL W+QC PC+ C++Q  PI++PA+S
Sbjct: 138 VESGVPVGSGEYLVDVYLGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAAS 196

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCN--------YTYGYADSSLTKGVLATERITFG- 123
            SY+ ++C  ++C L+   + S+ + C         Y Y Y D S T G LA E  T   
Sbjct: 197 ISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNL 256

Query: 124 --NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
             +     D V FGCGH N G+F+     L       LS ASQ+    G + FSYCLV  
Sbjct: 257 TQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGR-GPLSFASQLRGVYGGHAFSYCLV-- 313

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPY 239
              S+  SK+ FG+   +     ++ T+     D  T+Y++ L+ I VG  + +      
Sbjct: 314 EHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNI----- 368

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKT 296
             SS  +S G   ID+G   +  P+  Y  + +   +  +++P     LG  +   CY  
Sbjct: 369 --SSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFID--RMSPSYPLILGFPVLSPCYNV 424

Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQS 353
                +  P L+  F  GA       + FI    EG+ C A+   P  G + I GN+ Q 
Sbjct: 425 SGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSG-MSIIGNYQQQ 483

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           +  + YD +   + F P  C 
Sbjct: 484 NFHVLYDLEHNRLGFAPRRCA 504


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 120/377 (31%), Positives = 176/377 (46%), Gaps = 48/377 (12%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+ +    G Y M  S+GTP LL    + DTGSDL+W QC PC +C++Q  P + PASSS
Sbjct: 76  QALLENGVGGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSS 134

Query: 74  SYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           ++ +L C S  C  L +++   +   C Y Y Y  S  T G LATE +  G+++  F +V
Sbjct: 135 TFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKVGDAS--FPSV 191

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGC   N            GLG+            LG  +FSYCL      ++  S + 
Sbjct: 192 AFGCSTEN------------GLGQL----------DLGVGRFSYCLR--SGSAAGASPIL 227

Query: 193 FGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
           FG+ + ++ G V ST  V+      +YY+V L GI+VG        +P   S+   ++  
Sbjct: 228 FGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETD-----LPVTTSTFGFTQNG 282

Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG---IA 303
             G   +D+G   T L KD Y  +++   +        +   G  LC+K+    G     
Sbjct: 283 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAV 342

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD--VGIFGNFAQSDLFIG 358
           P L   FDGGA+  +      +    +G   V C  M P  GD  + + GN  Q D+ + 
Sbjct: 343 PSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLL 402

Query: 359 YDFDSQMVSFKPTDCTK 375
           YD D  + SF P DC K
Sbjct: 403 YDLDGGIFSFAPADCAK 419


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 113/360 (31%), Positives = 177/360 (49%), Gaps = 28/360 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           ++   SIG PP+  +  ++DTGSDL W+ CLPC +CY Q  P ++P+ SS+Y+  SC S 
Sbjct: 78  FLANISIGNPPVPQLL-LIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSA 135

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNN 140
              +           C Y   Y D S T+G+LA E++TF  S++      N+VFGCG +N
Sbjct: 136 PHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN 195

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G    +  G++GLG    S+ ++       +KFSYC       +   + +  GNG+++ 
Sbjct: 196 SGFTKYS--GVLGLGPGTFSIVTRNF----GSKFSYCFGSLTNPTYPHNILILGNGAKIE 249

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           G     T L   +D+  Y++ L+ IS G   L         Y      S+G   IDTG  
Sbjct: 250 GD---PTPLQIFQDR--YYLDLQAISFGEKLLDIEPGTFQRYR-----SQGGTVIDTGCS 299

Query: 259 PTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGA 314
           PT+L ++ Y  L E++   +   L   +D    +  CY+      +   P++T HF GGA
Sbjct: 300 PTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGA 359

Query: 315 KVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           ++ L   S F+       FC AM      D+ + G  AQ +  +GY+  +  V F+ TDC
Sbjct: 360 ELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 129/372 (34%), Positives = 182/372 (48%), Gaps = 25/372 (6%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V+ A GEY+    +GTP  +    IVDTGSDL WVQC PC +CY Q   ++ P +S+S+ 
Sbjct: 6   VAAARGEYLATVRLGTPERV-FSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFT 64

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVV 133
           +L+C S  C+ L    C +Q  C Y Y Y D SLT G    + IT    N       N  
Sbjct: 65  KLACGSALCNGLPFPMC-NQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFA 123

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCGH+N G F   + G++GLG+  LS  SQ+ S     KFSYCLV +    + TS + F
Sbjct: 124 FGCGHDNEGSFAGAD-GILGLGQGPLSFHSQLKSVYNG-KFSYCLVDWLAPPTQTSPLLF 181

Query: 194 GNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVG-NLSNSSKLIPYYNSSGAISKGNM 251
           G+ +      V    +++     TYY+V L GISVG NL N S  +   +S G    G +
Sbjct: 182 GDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGG--AGTI 239

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-------AP 304
           F D+G   T L +  Y    ++V  A+  +     R    +      ++G         P
Sbjct: 240 F-DSGTTVTQLAEAAY----KEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTVP 294

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +T HF+GG  V L  ++ FI       +CFAM     DV I G+  Q +  + YD   +
Sbjct: 295 AMTFHFEGGDMV-LPPSNYFIYLESSQSYCFAMTS-SPDVNIIGSVQQQNFQVYYDTAGR 352

Query: 365 MVSFKPTDCTKQ 376
            + F P DC  +
Sbjct: 353 KLGFVPKDCVGR 364


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 172/361 (47%), Gaps = 15/361 (4%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           +S  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC +CY Q   I++P+ S S+ 
Sbjct: 123 LSQGSGEYFTRLGVGTPPKY-LYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFA 181

Query: 77  ELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + C S  C  LD+  CS    LC Y   Y D S T G  +TE +TF  +      V  G
Sbjct: 182 GIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRAA--VPRVAIG 239

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CGH+N G+F      L+GLGR  LS  +Q  ++   NKFSYCL    T S+  S + FG+
Sbjct: 240 CGHDNEGLFVGAAG-LLGLGRGGLSFPTQTGTRFN-NKFSYCLTD-RTASAKPSSIVFGD 296

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY--NSSGAISKGNMFI 253
            S VS     +  + + +  T+Y+V L GISVG          ++  +S+G    G + I
Sbjct: 297 -SAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTG---NGGVII 352

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDG 312
           D+G   T L +  Y  L +  R                 CY    ++ +  P +  HF  
Sbjct: 353 DSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFR- 411

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           GA V L   +  +P    G FCFA       + I GN  Q    + +D     V F P  
Sbjct: 412 GADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRG 471

Query: 373 C 373
           C
Sbjct: 472 C 472


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/360 (33%), Positives = 170/360 (47%), Gaps = 25/360 (6%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKE 77
             G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PASSS+Y  
Sbjct: 176 GTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYAN 233

Query: 78  LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           +SC +  C  LD   CS    C Y   Y D S + G  A + +T  +S +      FGCG
Sbjct: 234 VSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCG 291

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N G+F E   GL+GLGR + SL  Q   + G   F++CL P    S+ T  + FG GS
Sbjct: 292 ERNDGLFGE-AAGLLGLGRGKTSLPVQTYGKYG-GVFAHCLPP---RSTGTGYLDFGAGS 346

Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
             +     +T +++    T+Y+V + GI VG      +L+P   S    +     +D+G 
Sbjct: 347 PPA---TTTTPMLTGNGPTFYYVGMTGIRVGG-----RLLPIAPS--VFAAAGTIVDSGT 396

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGA 314
             T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  F GGA
Sbjct: 397 VITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGA 456

Query: 315 KVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            + +  +          V   FA     GDVGI GN       + YD   ++V F P  C
Sbjct: 457 ALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 165/367 (44%), Gaps = 24/367 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            Q  +S   G YV+   +GTP     Y ++ DTGSDL WVQC PC  CY+Q  P+++P+ 
Sbjct: 138 AQRGISLGTGNYVVSVGLGTP--AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSL 195

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS+Y  ++C + +C  LD   CSS   C Y   Y D S T G L  + +T   S+     
Sbjct: 196 SSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT-LPG 254

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
            VFGCG  N G+F + + GL GLGR ++SL SQ     G   F+YCL      SS + + 
Sbjct: 255 FVFGCGDQNAGLFGQVD-GLFGLGREKVSLPSQGAPSYGPG-FTYCL-----PSSSSGRG 307

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
           Y   G          T+L      ++Y++ L GI VG     +  IP    +   + G  
Sbjct: 308 YLSLGGAPPANAQF-TALADGATPSFYYIDLVGIKVG---GRAIRIPATAFA---AAGGT 360

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK-TPSMAGIAPILTA 308
            ID+G   T LP   Y  L      A  +  Y+     S L  CY  T       P +  
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAF--ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVEL 418

Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GGA V L  T   ++    +    FA    D  + I GN  Q    + YD  +Q + 
Sbjct: 419 AFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIG 478

Query: 368 FKPTDCT 374
           F    C+
Sbjct: 479 FGAKGCS 485


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 123/380 (32%), Positives = 192/380 (50%), Gaps = 43/380 (11%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           + + +   EY+M+ +IGTPP+  I  + DTGSDL W QC PC  C+ Q  PIY+  +SSS
Sbjct: 74  ARLRSGQAEYLMELAIGTPPVPFI-ALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSS 132

Query: 75  YKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           +  L C S  C  + +  CS+    C Y Y Y D + +        I+ G        + 
Sbjct: 133 FSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAYSP---ECAGISVG-------GIA 182

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG +N G+ + N  G VGLGR  LSL    ++QLG  KFSYCL  F  ++S++S ++F
Sbjct: 183 FGCGVDNGGL-SYNSTGTVGLGRGSLSL----VAQLGVGKFSYCLTDFF-NTSLSSPVFF 236

Query: 194 GNGSEVSGGG-------VVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           G+ +E++          V ST LV S  + + Y+V+LEGIS+G+       +P  N +  
Sbjct: 237 GSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGD-----ARLPIPNGTFD 291

Query: 246 IS----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           ++     G M +D+G   T+L +  +  + + V   +   P  +     + C+  P+ AG
Sbjct: 292 LNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLG-QPVVNASSLDRPCFPAPA-AG 349

Query: 302 I-----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDL 355
           +      P +  HF GGA + L   +       E  FC  +   +   G + GNF Q ++
Sbjct: 350 VQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNI 409

Query: 356 FIGYDFDSQMVSFKPTDCTK 375
            + +D     +SF PTDC+K
Sbjct: 410 QMLFDITVGQLSFMPTDCSK 429


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 176/368 (47%), Gaps = 23/368 (6%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           +S  +  +G YV+   +G+P   D+  I DTGSDL W QC PCV  CY+Q + I++P++S
Sbjct: 137 KSASTLGSGNYVVTVGLGSPKR-DLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS 195

Query: 73  SSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
            SY  +SC S  C  L++ +     CSS   C Y   Y D S + G  A E+++   S +
Sbjct: 196 LSYSNVSCDSPSCEKLESATGNSPGCSSST-CLYGIRYGDGSYSIGFFAREKLSL-TSTD 253

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F+N  FGCG NN G+F     GL+GL R  LSL SQ   + G   FSYCL    + SS 
Sbjct: 254 VFNNFQFGCGQNNRGLFG-GTAGLLGLARNPLSLVSQTAQKYG-KVFSYCL---PSSSSS 308

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           T  + FG+G   S     + S V+ +  ++YF+ + GISVG      + +P   S    S
Sbjct: 309 TGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGE-----RKLPIPKS--VFS 361

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
                ID+G   + LP   Y+ +++  R  +   P          CY       +  P +
Sbjct: 362 TAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKI 421

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
             +F GGA++ L          V  V   FA    D +V I GN  Q  + + YD     
Sbjct: 422 ILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGR 481

Query: 366 VSFKPTDC 373
           V F P+ C
Sbjct: 482 VGFAPSGC 489


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 164/370 (44%), Gaps = 47/370 (12%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP    Y ++D+GSD++WVQC PC QCY Q  P+++PA S
Sbjct: 190 VISGMEQGSGEYFVRIGVGSPPRSQ-YMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADS 248

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  +SC S  C  L+   C + + C Y   Y D S TKG LA E +TFG +     +V
Sbjct: 249 ASFTGVSCSSSVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTFGRT--MVRSV 305

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +S   Q+  Q G   FSYCLV             
Sbjct: 306 AIGCGHRNRGMFVGAAGLLGLGGGS-MSFVGQLGGQTGG-AFSYCLV------------- 350

Query: 193 FGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK--- 248
                           LV      ++Y++ L G+ VG +      +P       +++   
Sbjct: 351 ----------SAAWVPLVRNPRAPSFYYIGLAGLGVGGIR-----VPISEEVFRLTELGD 395

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-A 303
           G + +DTG   T LP   Y    +  R+A        PR         CY       +  
Sbjct: 396 GGVVMDTGTAVTRLPTLAY----QAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRV 451

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P ++ +F GG  + L   +  IP    G FCFA  P    + I GN  Q  + I +D  +
Sbjct: 452 PTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGAN 511

Query: 364 QMVSFKPTDC 373
             V F P  C
Sbjct: 512 GYVGFGPNIC 521


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 118/361 (32%), Positives = 165/361 (45%), Gaps = 30/361 (8%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP       +VDTGS L W+QC PCV  C++QV P+Y+P +SS+Y  + C
Sbjct: 132 GNYVTELGLGTP-ATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPC 190

Query: 81  QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + QC  L        +CS + +C Y   Y DSS + G L+ + ++FG+ +  + N  +G
Sbjct: 191 SASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS--YPNFYYG 248

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F  +  GL+GL R +LSL  Q+   LG   FSYCL       +  S  Y   
Sbjct: 249 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLG-YSFSYCL------PTPASTGYLSI 300

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           G   SG    +    S  D + YFVTL G+SVG         P   S    S     ID+
Sbjct: 301 GPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGG-------SPLAVSPAEYSSLPTIIDS 353

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIAPILTAHFDGG 313
           G   T LP   Y  L + V  A  +   Q     S L  C++  +     P +   F GG
Sbjct: 354 GTVITRLPTAVYTALSKAV--AAAMVGVQSAPAFSILDTCFQGQASQLRVPAVAMAFAGG 411

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A + L   +  I    +   C A  P D    I GN  Q    + YD     + F    C
Sbjct: 412 ATLKLATQNVLIDVD-DSTTCLAFAPTDSTT-IIGNTQQQTFSVVYDVAQSRIGFAAGGC 469

Query: 374 T 374
           +
Sbjct: 470 S 470


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 115/408 (28%), Positives = 190/408 (46%), Gaps = 52/408 (12%)

Query: 1   MSPATYFYPNNVVQSNVSTANGE-----------YVMKFSIGTPPLLDIYGIVDTGSDLM 49
           +S A + Y  N +   + ++N +           +++ FS+G PP+  +  I+DTGS L+
Sbjct: 62  ISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQL-TIMDTGSSLL 120

Query: 50  WVQCLPCVQCY--KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYAD 107
           W+QC PC  C     + P++NPA SS++ E SC    C       C S   C Y   Y  
Sbjct: 121 WIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYIS 180

Query: 108 SSLTKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQ 164
            + +KGVLA ER+TF   N        + FGCG+ N      +  G++GLG    SLA Q
Sbjct: 181 GTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQ 240

Query: 165 ILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEG 224
           +      +KFSYC+      +   +++  G  +++ G     T +  + + + Y++ LEG
Sbjct: 241 L-----GSKFSYCIGDLANKNYGYNQLVLGEDADILGD---PTPIEFETENSIYYMNLEG 292

Query: 225 ISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
           ISVG+   + + + +        +  + +D+G   T L    Y  L  ++++ +      
Sbjct: 293 ISVGDTQLNIEPVVFKRRG---PRTGVILDSGTLYTWLADIAYRELYNEIKSIL------ 343

Query: 285 DPRL-----GSQLCYK---TPSMAGIAPILTAHFDGGAKVPLIHTSTFIP---PPVEGVF 333
           DP+L        LCY    +  + G  P++T HF GGA++ +  TS F P   P    VF
Sbjct: 344 DPKLERFWFRDFLCYHGRVSEELIGF-PVVTFHFAGGAELAMEATSMFYPLSEPNTFNVF 402

Query: 334 CFAMQPIDGDVGIFGNF------AQSDLFIGYDFDSQMVSFKPTDCTK 375
           C +++P     G +  F      AQ    IGYD   + +  +  DC +
Sbjct: 403 CMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCVQ 450


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 174/370 (47%), Gaps = 28/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNP 69
           V S  S   GEY  +  +G P +   + + DTGSD+ W+QC PC     CYKQ+ PI++P
Sbjct: 173 VTSGASQGAGEYFARIGVGQP-VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
            SSSSY  LSC SEQCHLLD  +C +   C Y   Y D S T G LATE  +F +SN+  
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEAACDANS-CIYEVEYGDGSFTVGELATETFSFRHSNS-I 289

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            N+  GCGH+N G+F   +             A  + SQL A  FSYCLV   ++SS T 
Sbjct: 290 PNLPIGCGHDNEGLFVGADG-----LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTL 344

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
                N  + S      TS + K D+  T+ +V + G+SVG      K +P  +SS  I 
Sbjct: 345 DF---NADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGG-----KPLPISSSSFEID 393

Query: 248 K---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-A 303
           +   G + +D+G   T +P D Y+ L +      K  P          CY   S + +  
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P +     G   + L   +  I     G FC A  P    + I GN  Q  + + YD  +
Sbjct: 454 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513

Query: 364 QMVSFKPTDC 373
            +V F    C
Sbjct: 514 SLVGFSTDKC 523


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 126/380 (33%), Positives = 182/380 (47%), Gaps = 23/380 (6%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S VS  +GEY M   +GTPP      I+DTGSDL W+QC+PC  C++Q  P Y+P  
Sbjct: 183 TLESGVSLGSGEYFMDVFVGTPPK-HFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKD 241

Query: 72  SSSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSN 126
           SSS+K ++C   +C L+ +      C  + Q C Y Y Y DSS T G  A E  T   + 
Sbjct: 242 SSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTT 301

Query: 127 -------NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
                     +NV+FGCGH N G+F+     L+GLGR  LS A+Q+ S  G + FSYCLV
Sbjct: 302 PEGKPELKIVENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFATQLQSLYG-HSFSYCLV 359

Query: 180 PFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISVGNLSNSSK 235
             +++SS++SK+ FG   E +S   +  TS V  ++    T+Y+V ++ I VG      K
Sbjct: 360 DRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGG--EVLK 417

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
           +        A   G   ID+G   T   +  Y  ++E     IK  P  +     + CY 
Sbjct: 418 IPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYN 477

Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQS 353
              +  +  P     F  GA       + FI    E V C A+       + I GN+ Q 
Sbjct: 478 VSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQ 537

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
           +  I YD     + + P  C
Sbjct: 538 NFHILYDLKKSRLGYAPMKC 557


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/350 (29%), Positives = 167/350 (47%), Gaps = 40/350 (11%)

Query: 38  IYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS-Q 96
           ++ ++DTGSD+ W+QC PC QCYKQ   ++ PA S++YK L C S  C  L + S S   
Sbjct: 1   MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNNTGVFNENEMGLVG 153
             CNY   Y D S T+G  A E +T  + +       N  FGCGH N G+FN    GL+G
Sbjct: 61  SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFN-GAAGLMG 119

Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE 213
           LG++ +   +Q     G   FSYCL P  + +  +  ++FG  + +      +  + S  
Sbjct: 120 LGKSSIGFPAQTSVAFG-KVFSYCL-PSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSS 177

Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
             + YFV++ GI+VG+     +L+P            + +D+G   +   +  Y RL + 
Sbjct: 178 GPSQYFVSMTGINVGD-----ELLPI--------SATVMVDSGTVISRFEQSAYERLRDA 224

Query: 274 -------VRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPL--IHTST 323
                  ++ A+ + P+         C++  ++  I  P++T HF   A++ L  +H   
Sbjct: 225 FTQILPGLQTAVSVAPFDT-------CFRVSTVDDINIPLITLHFRDDAELRLSPVH--- 274

Query: 324 FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            + P  +GV CFA  P      + GNF Q +L   YD     +     +C
Sbjct: 275 ILYPVDDGVMCFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 160/354 (45%), Gaps = 26/354 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P       ++D+GSD+ WVQC PC+QC+ QV P+++P+ SS+Y   SC S
Sbjct: 130 EYLITVRLGSPAKTQTV-LIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSS 188

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  D   CSS   C Y   YAD S T G  +++ +  G  +N   N  FGC H  
Sbjct: 189 AACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLALG--SNTISNFQFGCSHVE 246

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G FN+   GL+GLG    SLASQ     G   FSYCL P  + S     +  G G+   
Sbjct: 247 SG-FNDLTDGLMGLGGGAPSLASQTAGTFG-TAFSYCLPPTPSSSGF---LTLGAGTS-- 299

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
            G V +  L S    T+Y V LE I VG    S   IP      ++    M +D+G   T
Sbjct: 300 -GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLS---IPT-----SVFSAGMVMDSGTIIT 350

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLI 319
            LP+  Y+ L    +  +K      PR     C+     + +  P +   F GGA V L 
Sbjct: 351 RLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLD 410

Query: 320 HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                +         FA    D   GI GN  Q    + YD     V FK   C
Sbjct: 411 ANGIIL----GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 121/372 (32%), Positives = 173/372 (46%), Gaps = 27/372 (7%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPI 66
           P+    S  + + G YV+   +GTP     Y +V DTGSD  WVQC PCV +CYKQ +P+
Sbjct: 148 PSLPATSGRAVSTGNYVVTVGLGTP--ASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPL 205

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           ++PA SS+Y  +SC    C  LDT  C+    C Y   Y D S T G  A + +T   ++
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGH-CLYAVQYGDGSYTVGFFAQDTLTI--AH 262

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           +      FGCG  N G+F +   GL+GLGR + SL  Q  ++ G   F+YCL    T   
Sbjct: 263 DAIKGFRFGCGEKNNGLFGKT-AGLMGLGRGKTSLTVQAYNKYG-GAFAYCLPALTTG-- 318

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            T  + FG GS  +G     T +++ + +T+Y+V + GI VG      + +P   S    
Sbjct: 319 -TGYLDFGPGS--AGNNARLTPMLTDKGQTFYYVGMTGIRVGG-----QQVPVAES--VF 368

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA- 303
           S     +D+G   T LP   Y  L       +    Y+     S L  CY    ++ +  
Sbjct: 369 STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVEL 428

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDF 361
           P ++  F GGA +  +  S  +    E   C  FA    D  V I GN  Q    + YD 
Sbjct: 429 PTVSLVFQGGACLD-VDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDL 487

Query: 362 DSQMVSFKPTDC 373
             + V F P  C
Sbjct: 488 GKKTVGFAPGSC 499


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 125/359 (34%), Positives = 172/359 (47%), Gaps = 27/359 (7%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELS 79
           +G YV+    GTP       + DTGSD+ W+QC PC V+CY Q +P+++P+ SS+Y+ +S
Sbjct: 13  SGNYVITVGFGTPTRTQTV-VFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVS 71

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C    C  L T  CSS   C Y   Y D S T G LA +      +   F N +FGCG N
Sbjct: 72  CTEPACVGLSTRGCSSST-CLYGVFYGDGSSTIGFLAMDTFMLTPAQK-FKNFIFGCGQN 129

Query: 140 NTGVFNENEMGLVGLGRTRL-SLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
           NTG+F +   GLVGLGR+   SL SQ+   LG N FSYCL    + SS T  +  GN   
Sbjct: 130 NTGLF-QGTAGLVGLGRSSTYSLNSQVAPSLG-NVFSYCL---PSTSSATGYLNIGNPQN 184

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTG 256
             G    +  L      T YF+ L GISVG   LS SS +   + S G I      ID+G
Sbjct: 185 TPG---YTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTV---FQSVGTI------IDSG 232

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIAPILTAHFDG-GA 314
              T LP   Y+ L+  VR A+              CY  + + + + P++  HF G   
Sbjct: 233 TVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDV 292

Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           ++P      F+    +    FA       +GI GN  Q  + + YD + + + F    C
Sbjct: 293 RIPATGVF-FVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 130/363 (35%), Positives = 165/363 (45%), Gaps = 40/363 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
           +YV+  S+GTP +      VDTGSD+ WVQC PC    CY Q  P+++P  SSSY  + C
Sbjct: 141 QYVVTVSLGTPAVAQTL-EVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 199

Query: 81  QSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            +  C  L   S  CS  Q C Y   Y D S T GV +++ +T   SN      +FGCGH
Sbjct: 200 AAASCSQLALYSNGCSGGQ-CGYVVSYGDGSTTTGVYSSDTLTLTGSNA-LKGFLFGCGH 257

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G+F   + GL+GLGR   SL SQ  S  G   FSYCL P        S  Y   G  
Sbjct: 258 AQQGLFAGVD-GLLGLGRQGQSLVSQASSTYG-GVFSYCLPPTQ-----NSVGYISLGGP 310

Query: 199 VSGGGVVSTSLVSKE-DKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDT 255
            S  G  +T L++   D TYY V L GISVG   LS  + +     +SGA+      +DT
Sbjct: 311 SSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF----ASGAV------VDT 360

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILTAHF 310
           G   T LP   Y+ L    R A  + PY  P   +      CY       +  P ++  F
Sbjct: 361 GTVVTRLPPTAYSALRSAFRAA--MAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAF 418

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GGA + L  TS  +     G   FA    D    I GN  Q    +   FD   V F P
Sbjct: 419 GGGAAMDL-GTSGIL---TSGCLAFAPTGGDSQASILGNVQQRSFEV--RFDGSTVGFMP 472

Query: 371 TDC 373
             C
Sbjct: 473 ASC 475


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 172/365 (47%), Gaps = 22/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           S  +   G YV+   +GTP     Y +V DTGSD  WVQC PCV  CYKQ + +++PA S
Sbjct: 173 SGRALGTGNYVVTIGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARS 230

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC +  C  L T  CS    C Y+  Y D S + G  A + +T  +S +     
Sbjct: 231 STYANVSCAAPACSDLYTRGCSGGH-CLYSVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 288

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G+F E   GL+GLGR + SL  Q   + G   F++CL      SS T  + 
Sbjct: 289 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSSGTGYLD 343

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG GS  + G   +T +++    T+Y+V + GI VG      +L+    S    S     
Sbjct: 344 FGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGG-----QLLSIPQS--VFSTAGTI 396

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
           +D+G   T LP   Y+ L     +A+    Y+     S L  CY    M+ +A P ++  
Sbjct: 397 VDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLL 456

Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F GGA + +  +       +  V   FA    D DVGI GN       + YD   + V F
Sbjct: 457 FQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGF 516

Query: 369 KPTDC 373
            P  C
Sbjct: 517 SPGAC 521


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 168/360 (46%), Gaps = 13/360 (3%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           ++  +GEY  +  +GTP    +Y ++DTGSD++W+QC PC +CY Q   +++P  S +Y 
Sbjct: 111 LAQGSGEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYA 169

Query: 77  ELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + C +  C  LD+  CS++ ++C Y   Y D S T G  +TE +TF    N    V  G
Sbjct: 170 GIPCGAPLCRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF--RRNRVTRVALG 227

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CGH+N G+F      L      RLS   Q   +   +KFSYCLV   + S+  S + FG+
Sbjct: 228 CGHDNEGLFTGAAGLLGLGR-GRLSFPVQTGRRFN-HKFSYCLVD-RSASAKPSSVIFGD 284

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
            S VS     +  + + +  T+Y++ L GISVG           +    A   G + ID+
Sbjct: 285 -SAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAA-GNGGVIIDS 342

Query: 256 GAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGG 313
           G   T L +  Y  L +  R  A  L    +  L    C+    +  +  P +  HF  G
Sbjct: 343 GTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSL-FDTCFDLSGLTEVKVPTVVLHFR-G 400

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A V L  T+  IP    G FCFA       + I GN  Q    I YD     V F P  C
Sbjct: 401 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 167/359 (46%), Gaps = 25/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP       +VDTGS L W+QC PCV  C++QV P+++P +SS+Y  + C
Sbjct: 132 GNYVTQLGLGTPST-SYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRC 190

Query: 81  QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + QC  L        +CS+  +C Y   Y DSS + G L+T+ ++FG++   + +  +G
Sbjct: 191 SASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR--YPSFYYG 248

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F  +  GL+GL R +LSL  Q+   LG + FSYCL P    +   S   +  
Sbjct: 249 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLGYS-FSYCL-PTAASTGYLSIGPYNT 305

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           G   S   + S+SL    D + YF+TL G+SVG         P   S    S     ID+
Sbjct: 306 GHYYSYTPMASSSL----DASLYFITLSGMSVGG-------SPLAVSPSEYSSLPTIIDS 354

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   +  L + V  A+              C++  +     P +   F GGA 
Sbjct: 355 GTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGAS 414

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  I    +   C A  P D    I GN  Q    + YD     + F    C+
Sbjct: 415 MKLTTRNVLIDVD-DSTTCLAFAPTD-STAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 121/379 (31%), Positives = 178/379 (46%), Gaps = 29/379 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPC--VQCYKQVKPIYNP 69
           V + V  A  +Y+ ++ IG PP      ++DTGS+L+W QC   C    C KQ  P YN 
Sbjct: 73  VSAPVHLATRQYIAEYLIGDPPQ-RAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNL 131

Query: 70  ASSSSYKELSC--QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
           + SS++  + C   ++ C       C     C +   Y   S+  G L TE  TF +   
Sbjct: 132 SRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYGAGSVF-GSLGTEAFTFQSGAA 190

Query: 128 FFDNVVFGC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
               + FGC        G  N    GL+GLGR RLSL    +SQ GA KFSYCL P+  +
Sbjct: 191 ---KLGFGCVSLTRITKGALN-GASGLIGLGRGRLSL----VSQTGATKFSYCLTPYLRN 242

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSL---VSKED---KTYYFVTLEGISVG--NLSNSSKL 236
              +S ++ G  + +SGGG   TS+    S ED    T+Y++ L GISVG   L   S  
Sbjct: 243 HGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAA 302

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR-LGSQLCYK 295
                 +     G + IDTG+P T L +  Y+ L ++V   +  +  Q P   G  LC  
Sbjct: 303 FELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCVA 362

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
              +  + P+L  HF GGA +  +   ++  P  +   C  ++   G   + GNF Q D+
Sbjct: 363 RQDVDKVVPVLVFHFGGGADMA-VSAGSYWGPVDKSTACMLIEE-GGYETVIGNFQQQDV 420

Query: 356 FIGYDFDSQMVSFKPTDCT 374
            + YD     +SF+  DC+
Sbjct: 421 HLLYDIGKGELSFQTADCS 439


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 168/365 (46%), Gaps = 28/365 (7%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
            +G Y++   +GTP   D+  I DTGSDL W QC PCV+ CY Q +PI+NP+ S+SY  +
Sbjct: 128 GSGNYIVTVGLGTPKN-DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 186

Query: 79  SCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           SC S  C  L +      SCS+   C Y   Y D S + G LA E+ T  NS + FD V 
Sbjct: 187 SCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNS-DVFDGVY 244

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMY 192
           FGCG NN G+F     GL+GLGR +LS  SQ  +    NK FSYCL    + +S T  + 
Sbjct: 245 FGCGENNQGLFT-GVAGLLGLGRDKLSFPSQTATAY--NKIFSYCL---PSSASYTGHLT 298

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNM 251
           FG+            S ++ +  ++Y + +  I+VG      KL IP    S   S    
Sbjct: 299 FGSAGISRSVKFTPISTIT-DGTSFYGLNIVAITVGG----QKLPIP----STVFSTPGA 349

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
            ID+G   T LP   Y  L    +  +   P          C+       +  P +   F
Sbjct: 350 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 409

Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
            GGA V L     F    +  V   FA    D +  IFGN  Q  L + YD     V F 
Sbjct: 410 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 469

Query: 370 PTDCT 374
           P  C+
Sbjct: 470 PNGCS 474


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 131/363 (36%), Positives = 168/363 (46%), Gaps = 40/363 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
           +YV+  S+GTP +      VDTGSD+ WVQC PC    CY Q  P+++P  SSSY  + C
Sbjct: 130 QYVVTVSLGTPAVAQTL-EVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 188

Query: 81  QSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            +  C  L   S  CS  Q C Y   Y D S T GV +++ +T   SN      +FGCGH
Sbjct: 189 AAASCSQLALYSNGCSGGQ-CGYVVSYGDGSTTTGVYSSDTLTLTGSNA-LKGFLFGCGH 246

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G+F   + GL+GLGR   SL SQ  S  G   FSYCL P  T +S+    Y   G  
Sbjct: 247 AQQGLFAGVD-GLLGLGRQGQSLVSQASSTYG-GVFSYCLPP--TQNSVG---YISLGGP 299

Query: 199 VSGGGVVSTSLVSKE-DKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDT 255
            S  G  +T L++   D TYY V L GISVG   LS  + +     +SGA+      +DT
Sbjct: 300 SSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF----ASGAV------VDT 349

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILTAHF 310
           G   T LP   Y+ L    R A  + PY  P   +      CY       +  P ++  F
Sbjct: 350 GTVVTRLPPTAYSALRSAFRAA--MAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAF 407

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GGA + L  TS  +     G   FA    D    I GN  Q    +   FD   V F P
Sbjct: 408 GGGAAMDL-GTSGIL---TSGCLAFAPTGGDSQASILGNVQQRSFEV--RFDGSTVGFMP 461

Query: 371 TDC 373
             C
Sbjct: 462 ASC 464


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 179/395 (45%), Gaps = 48/395 (12%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNP 69
           V + V +   EY+M   +GTPP+  +  I DTGSDL+WV+C           P    + P
Sbjct: 99  VVAEVVSRQFEYLMAIEVGTPPV-RVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVP 157

Query: 70  ASSSSYKELSCQSEQCHLLDTV-SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           ++SS+Y  + C ++ C  L +  SCS    C Y Y Y D S   G L+TE  TF    + 
Sbjct: 158 SASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADS 217

Query: 129 --------------------FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI--L 166
                                  + FGC    TG F  + +  +G     +SLASQ+   
Sbjct: 218 SKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGG--GPVSLASQLGAT 275

Query: 167 SQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGIS 226
           + LG  KFSYCL P+  +++ +S + FG+ + VS  G  ST L++ E +TYY + L+ I+
Sbjct: 276 TSLG-RKFSYCLAPY-ANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSIN 333

Query: 227 VGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
           V      +            ++ ++ +D+G   T L       L + +   IKL   + P
Sbjct: 334 VAGTKRPT----------TAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESP 383

Query: 287 RLGSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG 342
                LCY    + G      P +T    GG +V L   +TF+    EGV C A+     
Sbjct: 384 EKILDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQ-EGVLCLALVATSE 442

Query: 343 --DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
              V I GN AQ +L +GYD +   V+F   DC K
Sbjct: 443 RQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCAK 477


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 170/367 (46%), Gaps = 31/367 (8%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           ++T    ++++  +G PP    Y I D  +D  W+QC PC++CY Q   I++P+ SSSY 
Sbjct: 180 ITTGTSNFLVQIGVGGPPQ-KFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYT 238

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            LSC+++ C+LL   SCS    C Y   Y D + T+GVL  E ++F  S+ + D V  GC
Sbjct: 239 LLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSF-ESSGWVDRVSLGC 297

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
            + N G F  ++ G  GLGR  LS  S+I     A+  SYCLV    D   +S + F   
Sbjct: 298 SNKNQGPFVGSD-GTFGLGRGSLSFPSRI----NASSMSYCLVE-SKDGYSSSTLEF--N 349

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLSNSSKLI-PYYNSSGAISKGNM 251
           S    G V +  L + + +  Y+V L+GI VG    ++ NS+  I PY N       G M
Sbjct: 350 SPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGN-------GGM 402

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL----CYKTPSMAGIA-PIL 306
            + + +  T+L  D YN     VR+A         RL + L    CY   S   +  PIL
Sbjct: 403 IVSSSSLITMLENDTYNV----VRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPIL 458

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
               + G    L   S        G FCFA  P  G   I G   Q    + +D  +  V
Sbjct: 459 EFEVNDGKSWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518

Query: 367 SFKPTDC 373
                 C
Sbjct: 519 YLHTLCC 525


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 121/372 (32%), Positives = 172/372 (46%), Gaps = 27/372 (7%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPI 66
           P+    S  + + G YV+   +GTP     Y +V DTGSD  WVQC PCV +CYKQ  P+
Sbjct: 148 PSLPATSGRAVSTGNYVVTVGLGTP--ASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPL 205

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           ++PA SS+Y  +SC    C  LDT  C+    C Y   Y D S T G  A + +T   ++
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGH-CLYAVQYGDGSYTVGFFAQDTLTI--AH 262

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           +      FGCG  N G+F +   GL+GLGR + SL  Q  ++ G   F+YCL    T   
Sbjct: 263 DAIKGFRFGCGEKNNGLFGKTA-GLMGLGRGKTSLTVQAYNKYG-GAFAYCLPALTTG-- 318

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            T  + FG GS  +G     T +++ + +T+Y+V + GI VG      + +P   S    
Sbjct: 319 -TGYLDFGPGS--AGNNARLTPMLTDKGQTFYYVGMTGIRVGG-----QQVPVAES--VF 368

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA- 303
           S     +D+G   T LP   Y  L       +    Y+     S L  CY    ++ +  
Sbjct: 369 STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVEL 428

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDF 361
           P ++  F GGA +  +  S  +    E   C  FA    D  V I GN  Q    + YD 
Sbjct: 429 PTVSLVFQGGACLD-VDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDL 487

Query: 362 DSQMVSFKPTDC 373
             + V F P  C
Sbjct: 488 GKKTVGFAPGSC 499


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 168/359 (46%), Gaps = 25/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP       +VDTGS L W+QC PCV  C++QV P+++P +SS+Y  + C
Sbjct: 132 GNYVTQLGLGTPST-SYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRC 190

Query: 81  QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + QC  L        +CS+  +C Y   Y DSS + G L+T+ ++FG+++  + +  +G
Sbjct: 191 SASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS--YPSFYYG 248

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F  +  GL+GL R +LSL  Q+   LG + FSYCL P    +   S   +  
Sbjct: 249 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLGYS-FSYCL-PTAASTGYLSIGPYNT 305

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           G   S   + S+SL    D + YF+TL G+SVG         P   S    S     ID+
Sbjct: 306 GHYYSYTPMASSSL----DASLYFITLSGMSVGG-------SPLAVSPSEYSSLPTIIDS 354

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   +  L + V  A+              C++  +     P +   F GGA 
Sbjct: 355 GTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGAS 414

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  I    +   C A  P D    I GN  Q    + YD     + F    C+
Sbjct: 415 MKLTTRNVLIDVD-DSTTCLAFAPTD-STAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 118/347 (34%), Positives = 162/347 (46%), Gaps = 37/347 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ 97
           ++DTGSD+ W+QCLPC     CY+Q+ PI++P  SSSY  +SC SEQC LLD   C+   
Sbjct: 13  VLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEAGCNVNS 72

Query: 98  LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRT 157
            C Y   Y D S T G LATE +TF +SN+   N+  GCGH+N G+F   +         
Sbjct: 73  -CIYKVEYGDGSFTIGELATETLTFVHSNS-IPNISIGCGHDNEGLFVGADG-----LIG 125

Query: 158 RLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY 217
               A  I SQL A+ FSYCLV    DS   S + F            S SL+S   K  
Sbjct: 126 LGGGAISISSQLKASSFSYCLV--DIDSPSFSTLDFNTDPP-------SDSLISPLVKND 176

Query: 218 YFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEE-- 272
            F +   + V  +S   K +P  +S   I +   G + +D+G   T LP D Y  L E  
Sbjct: 177 RFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAF 236

Query: 273 -----QVRNAIKLTPYQDP-RLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIP 326
                 +  A +++P+     L SQ   + P++A I P       G   + L   +  I 
Sbjct: 237 LGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILP-------GENSLQLPAKNCLIQ 289

Query: 327 PPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
               G FC A       + I GNF Q  + + YD  + +V F    C
Sbjct: 290 VDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 168/365 (46%), Gaps = 28/365 (7%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
            +G Y++   +GTP   D+  I DTGSDL W QC PCV+ CY Q +PI+NP+ S+SY  +
Sbjct: 100 GSGNYIVTVGLGTPKN-DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 158

Query: 79  SCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           SC S  C  L +      SCS+   C Y   Y D S + G LA E+ T  NS + FD V 
Sbjct: 159 SCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNS-DVFDGVY 216

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMY 192
           FGCG NN G+F     GL+GLGR +LS  SQ  +    NK FSYCL    + +S T  + 
Sbjct: 217 FGCGENNQGLFT-GVAGLLGLGRDKLSFPSQTATAY--NKIFSYCL---PSSASYTGHLT 270

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNM 251
           FG+            S ++ +  ++Y + +  I+VG      KL IP    S   S    
Sbjct: 271 FGSAGISRSVKFTPISTIT-DGTSFYGLNIVAITVGG----QKLPIP----STVFSTPGA 321

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
            ID+G   T LP   Y  L    +  +   P          C+       +  P +   F
Sbjct: 322 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 381

Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
            GGA V L     F    +  V   FA    D +  IFGN  Q  L + YD     V F 
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 441

Query: 370 PTDCT 374
           P  C+
Sbjct: 442 PNGCS 446


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 110/385 (28%), Positives = 178/385 (46%), Gaps = 34/385 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S V   +GEY     +G PP   +  ++DTGSDL+W+QCLPC +CY+QV P+Y+P +S
Sbjct: 81  VMSGVPFDSGEYFAVIGVGDPPTHALV-VIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNS 139

Query: 73  SSYKELSCQSEQCH-LLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
            +++ + C S QC  +L    C ++   C Y   Y D S + G LAT+ +   +      
Sbjct: 140 KTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRVH- 198

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           NV  GCGH+N G+   +  GL+G GR +LS  +Q+    G + FSYCL    + +  +S 
Sbjct: 199 NVTLGCGHDNEGLL-ASAAGLLGAGRGQLSFPTQLAPAYG-HVFSYCLGDRMSRARNSSS 256

Query: 191 -MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
            + FG   E+            +    YY V + G SVG      ++  + N+S A++  
Sbjct: 257 YLVFGRTPELPSTAFTPLRTNPRRPSLYY-VDMVGFSVGG----ERVAGFSNASLALNPA 311

Query: 248 --KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYKT---- 296
             +G + +D+G   +   +D Y  + +   +       +  RL ++      CY      
Sbjct: 312 TGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMR--RLRNKFSVFDTCYDVHGNG 369

Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNFA 351
           P      P +  HF   A + L   +  I  PV G      FC  +Q  D  + + GN  
Sbjct: 370 PGTGVRVPSIVLHFAAAADMALPQANYLI--PVVGGDRRTYFCLGLQAADDGLNVLGNVQ 427

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTKQ 376
           Q    + +D +   + F P  C+ +
Sbjct: 428 QQGFGVVFDVERGRIGFTPNGCSGE 452


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 172/370 (46%), Gaps = 28/370 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNP 69
           V S  S   GEY  +  +G P +   + + DTGSD+ W+QC PC     CYKQ+ PI++P
Sbjct: 173 VTSGASQGAGEYFARIGVGQP-VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
            SSSSY  LSC SEQCHLLD  +C +   C Y   Y D S T G LATE  +F +SN+  
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEAACDANS-CIYEVEYGDGSFTVGELATETFSFRHSNS-I 289

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            N+  GCGH+N G+F                 A  + SQL A  FSYCLV   ++SS T 
Sbjct: 290 PNLPIGCGHDNEGLFVGAAG-----LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTL 344

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
                N  + S      TS + K D+  T+ +V + G+SVG      K +P  +SS  I 
Sbjct: 345 DF---NADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGG-----KPLPISSSSFEID 393

Query: 248 K---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-A 303
           +   G + +D+G   T +P D Y+ L +      K  P          CY   S + +  
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P +     G   + L   +        G FC A  P    + I GN  Q  + + YD  +
Sbjct: 454 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513

Query: 364 QMVSFKPTDC 373
            +V F    C
Sbjct: 514 SLVGFSTDKC 523


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 169/360 (46%), Gaps = 25/360 (6%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKE 77
             G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PASSS+Y  
Sbjct: 179 GTGNYVVTVGLGTPA--SRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYAN 236

Query: 78  LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           +SC +  C  LD   CS    C Y   Y D S + G  A + +T  +S +      FGCG
Sbjct: 237 VSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCG 294

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N G+F E   GL+GLGR + SL  Q   + G   F++CL      S+ T  + FG GS
Sbjct: 295 ERNDGLFGE-AAGLLGLGRGKTSLPVQTYGKYG-GVFAHCLP---ARSTGTGYLDFGAGS 349

Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
             +     +T +++    T+Y+V + GI VG      +L+P   S    +     +D+G 
Sbjct: 350 PPA---TTTTPMLTGNGPTFYYVGMTGIRVGG-----RLLPIAPS--VFAAAGTIVDSGT 399

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGA 314
             T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  F GGA
Sbjct: 400 VITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGA 459

Query: 315 KVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            + +  +          V   FA     GDVGI GN       + YD   ++V F P  C
Sbjct: 460 ALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 125/371 (33%), Positives = 181/371 (48%), Gaps = 33/371 (8%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY+    +GTP  +    IVDTGSDL WVQC PC  CY Q   ++ P +S+S+ +L+C 
Sbjct: 1   GEYLATVRLGTPERV-FSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACG 59

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVVFGCGH 138
           +E C+ L    C +Q  C Y Y Y D SL+ G    + IT    N       N  FGCGH
Sbjct: 60  TELCNGLPYPMC-NQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGH 118

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
           +N G F   + G++GLG+  LS  SQ L  +   KFSYCLV +    + TS + FG+ + 
Sbjct: 119 DNEGSFAGAD-GILGLGQGPLSFPSQ-LKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAV 176

Query: 199 VSGGGVVSTSLVSKED-KTYYFVTLEGISVG----NLSNSSKLIPYYNSSGAISKGNMFI 253
            +  GV   SL++     TYY+V L GISVG    N+S+++  I     +G I       
Sbjct: 177 PTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTI------F 230

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIA-------PI 305
           D+G   T L  + +  +   +  +    P + D   G  LC     + G A       P 
Sbjct: 231 DSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLC-----LGGFAEGQLPTVPS 285

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
           +T HF+GG  + L  ++ FI       +CF+M     DV I G+  Q +  + YD   + 
Sbjct: 286 MTFHFEGG-DMELPPSNYFIFLESSQSYCFSMVS-SPDVTIIGSIQQQNFQVYYDTVGRK 343

Query: 366 VSFKPTDCTKQ 376
           + F P  C  +
Sbjct: 344 IGFVPKSCVGR 354


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 114/363 (31%), Positives = 169/363 (46%), Gaps = 19/363 (5%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           ++  +GEY  +  +GTP    +Y ++DTGSD++W+QC PC +CY Q  P+++P  S +Y 
Sbjct: 122 LAQGSGEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYA 180

Query: 77  ELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + C +  C  LD+  C+++ ++C Y   Y D S T G  +TE +TF  +      V  G
Sbjct: 181 GIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR--VTRVALG 238

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CGH+N G+F      L      RLS   Q   +    KFSYCLV   + S+  S + FG+
Sbjct: 239 CGHDNEGLFIGAAGLLGLGR-GRLSFPVQTGRRFN-QKFSYCLVD-RSASAKPSSVVFGD 295

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
            S VS     +  + + +  T+Y++ L GISVG  S    L        A   G + ID+
Sbjct: 296 -SAVSRTARFTPLIKNPKLDTFYYLELLGISVGG-SPVRGLSASLFRLDAAGNGGVIIDS 353

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILTAHF 310
           G   T L +  Y  L    R+A ++      R         C+    +  +  P +  HF
Sbjct: 354 GTSVTRLTRPAYIAL----RDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHF 409

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             GA V L  T+  IP    G FCFA       + I GN  Q    + +D     V F P
Sbjct: 410 R-GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAP 468

Query: 371 TDC 373
             C
Sbjct: 469 RGC 471


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 119/359 (33%), Positives = 169/359 (47%), Gaps = 25/359 (6%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKEL 78
            G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PASSS+Y  +
Sbjct: 176 TGNYVVTVGLGTPA--SRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANV 233

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
           SC +  C  LD   CS    C Y   Y D S + G  A + +T  +S +      FGCG 
Sbjct: 234 SCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCGE 291

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
            N G+F E   GL+GLGR + SL  Q   + G   F++CL      S+ T  + FG GS 
Sbjct: 292 RNDGLFGE-AAGLLGLGRGKTSLPVQTYGKYG-GVFAHCLP---ARSTGTGYLDFGAGSP 346

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
            +     +T +++    T+Y+V + GI VG      +L+P   S    +     +D+G  
Sbjct: 347 PA---TTTTPMLTGNGPTFYYVGMTGIRVGG-----RLLPIAPS--VFAAAGTIVDSGTV 396

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
            T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  F GGA 
Sbjct: 397 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAA 456

Query: 316 VPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + +  +          V   FA     GDVGI GN       + YD   ++V F P  C
Sbjct: 457 LDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 178/372 (47%), Gaps = 37/372 (9%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
           ST    +++ FS+G P    +  I+DTGS+++WV+C PC +C +Q  P+ +P+ SS+Y  
Sbjct: 93  STYEPLFLVNFSMGQPATPQL-AIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYAS 151

Query: 78  LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVVF 134
           L C +  CH   +  C+    C Y   YA    + GVLATE++ F +S+   N   +VVF
Sbjct: 152 LPCTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVF 211

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GC H N    +    G+ GLG+   S  +++      +KFSYCL          +++ FG
Sbjct: 212 GCSHENGDYKDRRFTGVFGLGKGITSFVTRM-----GSKFSYCLGNIADPHYGYNQLVFG 266

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN---M 251
             +   G    ST L  K    +Y+VTLEGISVG            +S+    KGN    
Sbjct: 267 EKANFEG---YSTPL--KVVNGHYYVTLEGISVGEKRLD------IDSTAFSMKGNEKSA 315

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYK-TPSMAGIA-PILT 307
            ID+G   T L +  +  L+ +VR  +   L P+     GS  CYK T S   I  P++T
Sbjct: 316 LIDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWR---GSFACYKGTVSQDLIGFPVVT 372

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG------DVGIFGNFAQSDLFIGYDF 361
            HF GGA + L   S F     + + C A++             + G  AQ    + YD 
Sbjct: 373 FHFSGGADLDLDTESMFYQATPD-ILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDL 431

Query: 362 DSQMVSFKPTDC 373
           +S  + F+  DC
Sbjct: 432 NSNKLFFQRIDC 443


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 173/375 (46%), Gaps = 23/375 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  K  +GTP +     ++DTGSD++W+QC PC +CY Q   +++P +S
Sbjct: 136 VVSGLAQGSGEYFTKIGVGTP-VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194

Query: 73  SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            SY  + C +  C  LD+  C   ++ C Y   Y D S+T G  ATE +TF  S      
Sbjct: 195 HSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-SGARVPR 253

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSI 187
           V  GCGH+N G+F      L+GLGR  LS  SQI  + G   FSYCLV       + +S 
Sbjct: 254 VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFG-RSFSYCLVDRTSSSASATSR 311

Query: 188 TSKMYFGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           +S + FG+G+  + G  V+       +D         G      +   +         + 
Sbjct: 312 SSTVTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPDPST 371

Query: 247 SKGNMFIDTGAP-PTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSM 299
            +G + +D+G P P             + R A   ++L+P      G  L   CY    +
Sbjct: 372 GRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPG-----GFSLFDTCYDLSGL 426

Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
             +  P ++ HF GGA+  L   +  IP    G FCFA    DG V I GN  Q    + 
Sbjct: 427 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVV 486

Query: 359 YDFDSQMVSFKPTDC 373
           +D D Q + F P  C
Sbjct: 487 FDGDGQRLGFVPKGC 501


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 125/380 (32%), Positives = 184/380 (48%), Gaps = 24/380 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S VS  +GEY M   IGTPP      I+DTGSDL W+QC+PC+ C++Q  P Y+P  S
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPK-HYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKES 239

Query: 73  SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITFG---- 123
           SS++ ++C   +C L+ +      C  + Q C Y Y Y DSS T G  A E  T      
Sbjct: 240 SSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTP 299

Query: 124 ---NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
              +     +NV+FGCGH N G+F+    GL+GLGR  LS ASQ+ S  G + FSYCLV 
Sbjct: 300 NGKSEQKHVENVMFGCGHWNRGLFH-GAAGLLGLGRGPLSFASQLQSIYG-HSFSYCLVD 357

Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSSK 235
            ++D+S++SK+ FG   E +S   +  TS V  E+    T+Y+V ++ I V G +    +
Sbjct: 358 RNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPE 417

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
              + +  G    G   ID+G   T   +  Y  ++E     IK     +     + CY 
Sbjct: 418 ETWHLSKEGG---GGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYN 474

Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
              +  +  P     F  GA       + FI    + V    +      + I GN+ Q +
Sbjct: 475 VSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQN 534

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
             I YD     + + P  CT
Sbjct: 535 FHILYDMKKSRLGYAPMKCT 554


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 166/350 (47%), Gaps = 33/350 (9%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
           IVDTGSDL WVQC PC  CY Q  P+Y+P+ SSSYK + C S  C  L   + +S     
Sbjct: 152 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGG 211

Query: 96  -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
                +  C Y   Y D S T+G LA+E I  G++    +N+VFGCG NN G+F     G
Sbjct: 212 FNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTK--LENLVFGCGRNNKGLFG-GASG 268

Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
           L+GLGR+ +SL SQ L       FSYCL      +S T  + FGN   V  +   V  T 
Sbjct: 269 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGASGT--LSFGNDFSVYKNSTSVFYTP 325

Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
           LV   + +++Y + L G S+G +   +          +  +G + ID+G   T LP   Y
Sbjct: 326 LVQNPQLRSFYILNLTGASIGGVELKTL---------SFGRG-ILIDSGTVITRLPPSIY 375

Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
             ++ +        P          C+   S   I+ P +   F+G A++ +  T  F  
Sbjct: 376 KAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYF 435

Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             P   + C A+  +  + +VGI GN+ Q +  + YD   + +     +C
Sbjct: 436 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 128/387 (33%), Positives = 192/387 (49%), Gaps = 37/387 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S VS  +GEY M   IGTPP      I+DTGSDL W+QC+PC  C+ Q  P Y+P  S
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPR-HFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKES 239

Query: 73  SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
           SS+K + C   +CHL+ +      C ++ Q C Y Y Y DSS T G  A E  T      
Sbjct: 240 SSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSP 299

Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
            G S     +NV+FGCGH N G+F+     L+GLGR  LS +SQ+ S  G + FSYCLV 
Sbjct: 300 AGKSEFKRVENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLVD 357

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISV-GNLSNSSK 235
            ++D++++SK+ FG   ++     V+ TSLV+ ++    T+Y+V ++ I V G +    +
Sbjct: 358 RNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPE 417

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRN--AIKLTPYQDPRLG 289
              + +  GA   G   +D+G   +   +  Y  +++    +V+    IK  P  DP   
Sbjct: 418 ETWHLSPEGA---GGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDP--- 471

Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
              CY    +  +  P     F+ GA       + FI    E + C A+       + I 
Sbjct: 472 ---CYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSII 528

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           GN+ Q +  I YD     + + P  C 
Sbjct: 529 GNYQQQNFHILYDTKKSRLGYAPMKCA 555


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 165/361 (45%), Gaps = 26/361 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQS 82
           YV+   +GTP   D+  + DTGSDL W QC PC   CYKQ   I++P+ SSSY  ++C S
Sbjct: 46  YVVVVGLGTPKR-DLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTS 104

Query: 83  EQCHLLDT------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
             C  L +       S S+   C Y   Y D+S + G L+ ER+T   + +  D+ +FGC
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTI-TATDIVDDFLFGC 163

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGN 195
           G +N G+FN    GL+GLGR  +S+  Q  S    NK FSYCL      SS    + FG 
Sbjct: 164 GQDNEGLFN-GSAGLMGLGRHPISIVQQTSSNY--NKIFSYCL---PATSSSLGHLTFG- 216

Query: 196 GSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
            S  +   ++ T L +   D ++Y + +  ISVG        +P  +SS   S G   ID
Sbjct: 217 ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTK-----LPAVSSS-TFSAGGSIID 270

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGG 313
           +G   T L    Y  L    R  ++  P  +       CY       I+ P +   F GG
Sbjct: 271 SGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGG 330

Query: 314 AKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
             V L H         + V   FA    D D+ +FGN  Q  L + YD     + F    
Sbjct: 331 VTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAG 390

Query: 373 C 373
           C
Sbjct: 391 C 391


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 175/373 (46%), Gaps = 43/373 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY--KQVKPIYNPASSSSYKELSCQ 81
           + + FS+G PP+   + I+DTGS L+W+QC PC  C     + P++NPA SS++ E SC 
Sbjct: 68  FFVNFSVGQPPVPQ-FTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCD 126

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGH 138
              C       CSS + C Y   Y   + +KGVLA ER+TF   N        + FGCGH
Sbjct: 127 DRFCRYAPNGHCSSNK-CVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGH 185

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
            N         G++GLG    SLA Q+      +KFSYC+      +   +++  G  ++
Sbjct: 186 ENGEQLESEFTGILGLGAKPTSLAVQL-----GSKFSYCIGDLANKNYGYNQLVLGEDAD 240

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           + G     T +  + +   Y++ LEGISVG+   + + + +       S+  + +DTG  
Sbjct: 241 ILGD---PTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRG---SRTGVILDTGTL 294

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRL-----GSQLCYK---TPSMAGIAPILTAHF 310
            T L    Y  L  ++++ +      DP+L        LCY       + G  P++T HF
Sbjct: 295 YTWLADIAYRELYNEIKSIL------DPKLERFWFRDFLCYHGRVNEELIGF-PVVTFHF 347

Query: 311 DGGAKVPLIHTSTFIP----PPVEGVFCFAMQPIDGDVGIFGNF------AQSDLFIGYD 360
            GGA++ +  TS F P         VFC +++P     G + +F      AQ    I YD
Sbjct: 348 AGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYD 407

Query: 361 FDSQMVSFKPTDC 373
              + +  +  DC
Sbjct: 408 LKERNIYLQRIDC 420


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 168/365 (46%), Gaps = 28/365 (7%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
            +G Y++   +GTP   D+  I DTGSDL W QC PCV+ CY Q +PI+NP+ S+SY  +
Sbjct: 129 GSGNYIVTVGLGTPKN-DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 187

Query: 79  SCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           SC S  C  L +      SCS+   C Y   Y D S + G LA ++ T   S++ FD V 
Sbjct: 188 SCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKDKFTL-TSSDVFDGVY 245

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMY 192
           FGCG NN G+F     GL+GLGR +LS  SQ  +    NK FSYCL    + +S T  + 
Sbjct: 246 FGCGENNQGLFT-GVAGLLGLGRDKLSFPSQTATAY--NKIFSYCL---PSSASYTGHLT 299

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNM 251
           FG+            S ++ +  ++Y + +  I+VG      KL IP    S   S    
Sbjct: 300 FGSAGISRSVKFTPISTIT-DGTSFYGLNIVAITVGG----QKLPIP----STVFSTPGA 350

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
            ID+G   T LP   Y  L    +  +   P          C+       +  P +   F
Sbjct: 351 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 410

Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
            GGA V L     F    +  V   FA    D +  IFGN  Q  L + YD     V F 
Sbjct: 411 SGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 470

Query: 370 PTDCT 374
           P  C+
Sbjct: 471 PNGCS 475


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/369 (31%), Positives = 167/369 (45%), Gaps = 38/369 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S     +GEY ++  IG+P +   Y ++D+GSD++W+QC PC QCY Q  PI+NPA+S
Sbjct: 118 VVSGTEEGSGEYFVRIGIGSPAIYQ-YMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATS 176

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  ++C S  C+ LD      +  C Y   Y D S TKG LA E IT G +     + 
Sbjct: 177 ASFIGVACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT--VIQDT 234

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F     GL+GLG   +S   Q+ +Q G   F YCLV             
Sbjct: 235 AIGCGHWNEGMF-VGAAGLLGLGGGPMSFVGQLGAQTGG-AFGYCLV------------- 279

Query: 193 FGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSN--SSKLIPYYNSSGAISKG 249
               S     G +   L+      ++Y+V+L G++VG +    S ++    +    I  G
Sbjct: 280 ----SRAMPVGAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTD----IGTG 331

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-AP 304
            + +DTG   T LP   YN      R+A        PR         CY       +  P
Sbjct: 332 GVVMDTGTAITRLPTVAYNAF----RDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVP 387

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            ++ +F GG  +     +  IP    G FCFA  P    + I GN  Q  + +  D  + 
Sbjct: 388 TVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNG 447

Query: 365 MVSFKPTDC 373
            V F P  C
Sbjct: 448 FVGFGPNVC 456


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 167/363 (46%), Gaps = 42/363 (11%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
           +V  +   Y++  +IGTPPL  +  ++DTGSDL+W QC  PC +C+ Q  P+Y PA S++
Sbjct: 84  SVHASTATYLVDIAIGTPPL-PLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSAT 142

Query: 75  YKELSCQSEQCHLLDT--VSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           Y  +SC+S  C  L +    CS     C Y + Y D + T GVLATE  T G S+     
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRG 201

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG  N G   +N  GLVG+GR  LSL    +SQLG  +      P  +  +  +  
Sbjct: 202 VAFGCGTENLGS-TDNSSGLVGMGRGPLSL----VSQLGVTR------PRRSCRARAAAR 250

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
             G  +  S                     LEGI+VG+      + P       +  G +
Sbjct: 251 GGGAPTTTS--------------------PLEGITVGD--TLLPIDPAVFRLTPMGDGGV 288

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            ID+G   T L +  +  L   + + ++L       LG  LC+   S   +  P L  HF
Sbjct: 289 IIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHF 348

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           DG A + L   S  +     GV C  M    G + + G+  Q +  I YD +  ++SF+P
Sbjct: 349 DG-ADMELRRESYVVEDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGILSFEP 406

Query: 371 TDC 373
             C
Sbjct: 407 AKC 409


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 120/381 (31%), Positives = 186/381 (48%), Gaps = 28/381 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  +GEY+++  +GTPP      I+DTGSDL W+QC PC+ C+ Q  P+++P +S
Sbjct: 139 VESGVAVGSGEYLVEVYVGTPPR-RFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMAS 197

Query: 73  SSYKELSCQSEQCHLLD------TVSCSSQQLCNYTYGYADSSLTKGVLATERITF---G 123
           +SY+ ++C   +C L+       T   S    C Y Y Y D S T G LA E  T     
Sbjct: 198 TSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 257

Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
           +S+   D VV GCGH N G+F+     L+GLGR  LS ASQ+ +  G + FSYCLV    
Sbjct: 258 SSSRRVDGVVLGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYG-HAFSYCLV--DH 313

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLV--SKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
            S++ SK+ FG+ + +     ++ +    S  + T+Y+V L+GI VG      +++   +
Sbjct: 314 GSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGG-----EMLDIPS 368

Query: 242 SSGAISK----GNMFIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKT 296
           ++  +SK    G   ID+G   +  P+  Y  + +  V    K  P          CY  
Sbjct: 369 NTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNV 428

Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIFGNFAQSD 354
             +  +  P  +  F  GA       + FI    EG+ C A +      + I GN+ Q +
Sbjct: 429 SGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQN 488

Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
             + YD     + F P  C +
Sbjct: 489 FHVLYDLHHNRLGFAPRRCAE 509


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 130/366 (35%), Positives = 164/366 (44%), Gaps = 33/366 (9%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSS 73
           N+ T N  YV+  S+GTP +      VDTGSDL WVQC PC    CY Q  P+++PA SS
Sbjct: 134 NIGTLN--YVVTVSLGTPGVAQTL-EVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSS 190

Query: 74  SYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SY  + C    C  L     SCS+ Q C Y   Y D S T GV +++ +T  + N+    
Sbjct: 191 SYAAVPCGGPVCGGLGIYASSCSAAQ-CGYVVSYGDGSKTTGVYSSDTLTL-SPNDAVRG 248

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGCGH  +G F  N+ GL+GLGR   SL  Q     G   FSYCL    T  S T  +
Sbjct: 249 FFFGCGHAQSG-FTGND-GLLGLGREEASLVEQTAGTYG-GVFSYCL---PTRPSTTGYL 302

Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
             G  S  +  G  +T L+S  +  TYY V L GISVG    S   +P      ++  G 
Sbjct: 303 TLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLS---VP-----SSVFAGG 354

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGS-QLCYKTPSMAGIA-PILT 307
             +DTG   T LP   Y  L    R+ +    Y   P  G    CY       +  P + 
Sbjct: 355 TVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVA 414

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
             F GGA V L            G   FA    DG + I GN  Q    +    D   V 
Sbjct: 415 LTFSGGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVG 468

Query: 368 FKPTDC 373
           FKP+ C
Sbjct: 469 FKPSSC 474


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/372 (31%), Positives = 169/372 (45%), Gaps = 35/372 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++ F IGTP    +   VDTGSD++W QC PC  C+ Q  P ++ ++S +   + C  
Sbjct: 91  EYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTD 150

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF---GNSNNFFDNVVFGCGHN 139
             C  L   +C     C Y   Y D+S+T G LA +  TF   G       ++VFGCG  
Sbjct: 151 PICRALRPHACFLGG-CTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQY 209

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           NTG F+ NE G+ G GR  LSL      QLG + FSYC   F T     S   F  G+  
Sbjct: 210 NTGNFHSNETGIAGFGRGPLSLP----RQLGVSSFSYC---FTTIFESKSTPVFLGGAPA 262

Query: 200 SG------GGVVSTSLVSKEDKTYYFVTLEGISVGN----LSNSSKLIPYYNSSGAISKG 249
            G      G ++ST  +    + YY+++L+GI+VG     +  S+ ++    S G I   
Sbjct: 263 DGLRAHATGPILSTPFLPNHPE-YYYLSLKGITVGKTRLAVPESAFVVKADGSGGTI--- 318

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPYQDPRLGSQLCYKTPSMAGIA---- 303
              ID+G   T  P+  +  L E     + L  T Y D    +  C+ T S+   +    
Sbjct: 319 ---IDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPV 375

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P +T H + GA   L   +     P     C  +   D D  + GNF Q ++ I +D   
Sbjct: 376 PKMTLHLE-GADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAG 434

Query: 364 QMVSFKPTDCTK 375
             +  +P  C K
Sbjct: 435 NKLVIEPAQCDK 446


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 176/374 (47%), Gaps = 28/374 (7%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPI 66
           P+    S  +   G YV+   +GTP     Y +V DTGSD  WVQC PCV  CYKQ + +
Sbjct: 146 PSLPASSGSALGTGNYVVTIGLGTP--AGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKL 203

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           ++PA SS+Y  +SC +  C  L    CS    C Y   Y D S + G  A + +T  +S 
Sbjct: 204 FDPARSSTYANISCAAPACSDLYIKGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSY 261

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           +      FGCG  N G++ E   GL+GLGR + SL  Q   + G   F++C   F   SS
Sbjct: 262 DAIKGFRFGCGERNEGLYGE-AAGLLGLGRGKTSLPVQAYDKYG-GVFAHC---FPARSS 316

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--YYNSSG 244
            T  + FG GS  +    ++T ++     T+Y+V L GI VG    S   IP   + +SG
Sbjct: 317 GTGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLS---IPQSVFTTSG 373

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI 302
            I      +D+G   T LP   Y+ L     +A+    Y+     S L  CY    M+ +
Sbjct: 374 TI------VDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEV 427

Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGY 359
           A P ++  F GGA +  +H S  I        C  FA    D DVGI GN       + Y
Sbjct: 428 AIPTVSLLFQGGASLD-VHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVY 486

Query: 360 DFDSQMVSFKPTDC 373
           D   ++V F P  C
Sbjct: 487 DIGKKVVGFCPGAC 500


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 117/365 (32%), Positives = 172/365 (47%), Gaps = 22/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           S  +   G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PA S
Sbjct: 171 SGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARS 228

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC +  C  L+   CS    C Y   Y D S + G  A + +T  +S +     
Sbjct: 229 STYANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 286

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G+F E   GL+GLGR + SL  Q   + G   F++CL      S+ T  + 
Sbjct: 287 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLD 341

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG GS  +    ++T ++++   T+Y+V + GI VG      +L+    S    +     
Sbjct: 342 FGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGG-----QLLSIPQS--VFATAGTI 394

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
           +D+G   T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  
Sbjct: 395 VDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 454

Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F GGA++ +  +          V   FA     GDVGI GN       + YD   ++V F
Sbjct: 455 FQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514

Query: 369 KPTDC 373
            P  C
Sbjct: 515 YPGAC 519


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 173/352 (49%), Gaps = 28/352 (7%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSCQSEQCH-- 86
           +GTP    +  +VDTGS L W+QC PC V C++Q  P++NP SSS+Y  + C ++QC   
Sbjct: 3   LGTPATQYVM-VVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDL 61

Query: 87  ---LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGV 143
               L+  +CSS  +C Y   Y DSS + G L+ + ++FG+++    N  +GCG +N G+
Sbjct: 62  PSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--LPNFYYGCGQDNEGL 119

Query: 144 FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
           F  +  GL+GL R +LSL  Q+   LG + F+YCL    + SS    +   N  + S   
Sbjct: 120 FGRSA-GLIGLARNKLSLLYQLAPSLGYS-FTYCLP--SSSSSGYLSLGSYNPGQYSYTP 175

Query: 204 VVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
           +VS+SL    D + YF+ L G++V GN        P   SS A S     ID+G   T L
Sbjct: 176 MVSSSL----DDSLYFIKLSGMTVAGN--------PLSVSSSAYSSLPTIIDSGTVITRL 223

Query: 263 PKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTS 322
           P   Y+ L + V  A+K T           C+K  +    AP +T  F GGA + L   +
Sbjct: 224 PTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQN 283

Query: 323 TFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             +    +   C A  P      I GN  Q    + YD  S  + F    C+
Sbjct: 284 LLVDVD-DSTTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 126/386 (32%), Positives = 189/386 (48%), Gaps = 37/386 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S VS  +GEY +   +GTPP      I+DTGSDL W+QC+PC +C++Q  P Y+P  S
Sbjct: 170 LESGVSLGSGEYFIDVFVGTPPK-HFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQS 228

Query: 73  SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSN- 126
           SSY+ + C   +CHL+ +      C ++ Q C Y Y Y DSS T G  A E  T   +  
Sbjct: 229 SSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMS 288

Query: 127 ------NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
                    +NV+FGCGH N G+F+     L+GLGR  LS +SQ+ S  G + FSYCLV 
Sbjct: 289 SGKPELRRVENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLVD 346

Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISVG----NLSN 232
            ++D++++SK+ FG   + +S   +  T+LV+ ++    T+Y+V ++ I VG    N+  
Sbjct: 347 RNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPE 406

Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
               I    S G I      ID+G   +   +  Y  ++E     +K  P        + 
Sbjct: 407 EKWQIATDGSGGTI------IDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEP 460

Query: 293 CYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
           CY   ++ G+     P     F  GA       + FI      V C A+       + I 
Sbjct: 461 CY---NVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSII 517

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           GN+ Q +  I YD     + F PT C
Sbjct: 518 GNYQQQNFHILYDTKKSRLGFAPTKC 543


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 117/380 (30%), Positives = 182/380 (47%), Gaps = 31/380 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPA 70
           V + V  A  +Y+ ++ +G PP      ++DTGS L+W QC  C++  C +Q  P +N +
Sbjct: 75  VSAPVHWATRQYIAEYMVGDPPQ-RAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNAS 133

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           SS S+  + CQ + C       C+    C +   Y    +  G L T+  TF +      
Sbjct: 134 SSGSFAPVPCQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQSGGA--- 189

Query: 131 NVVFGCGHNNTGVFNE---NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            + FGC         +      GL+GLGR RLSLASQ     GA +FSYCL P+  ++  
Sbjct: 190 TLAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQT----GAKRFSYCLTPYFHNNGA 245

Query: 188 TSKMYFGNGSEVSGGG--VVSTSLV-SKED---KTYYFVTLEGISVG--NLSNSSKLIPY 239
           +S ++ G  + +SGGG  V+S + V S +D    T+Y++ L GI+VG   L+  S     
Sbjct: 246 SSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDL 305

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKLTPYQDPRLGSQLCYK 295
                   +G + ID+G+P T L +D Y      L  Q+  ++   P +D   G  LC  
Sbjct: 306 QEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDG-GMALCVA 364

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSD 354
              +  + P L  HF GGA + L     +  P  +   C A+  + G +  I GNF Q +
Sbjct: 365 RGDLDRVVPTLVLHFSGGADMAL-PPENYWAPLEKSTACMAI--VRGYLQSIIGNFQQQN 421

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
           + I +D     +SF+  DC+
Sbjct: 422 MHILFDVGGGRLSFQNADCS 441


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 172/370 (46%), Gaps = 24/370 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPAS 71
           + S  S   G Y+ +  +GTP    +  +VD+GS L W+QC PC V C+ Q  P+Y+P +
Sbjct: 97  LASGASVGVGNYITRLGLGTPTTTYVM-VVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRA 155

Query: 72  SSSYKELSCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           SS+Y  + C + QC  L        SCS   +C Y   Y D S + G L+ + ++  +S 
Sbjct: 156 SSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSG 215

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           + F    +GCG +N G+F     GL+GL R +LSL SQ+   +G N F+YCL P    +S
Sbjct: 216 S-FPGFYYGCGQDNVGLFGR-AAGLIGLARNKLSLLSQLAPSVG-NSFAYCL-PTSAAAS 271

Query: 187 ITSKMYFGNGSEVSGGGVVS-TSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
               + FG+ S+    G  S TS+VS   D + YFV+L G+SV      S L    +  G
Sbjct: 272 -AGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAG----SPLAVPSSEYG 326

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
           ++      ID+G   T LP   Y  L + V  A+         +  Q C+K        P
Sbjct: 327 SLPT---IIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-LQTCFKGQVAKLPVP 382

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +   F GGA + L   +  +    E   C A  P D    I GN  Q    + YD    
Sbjct: 383 AVNMAFAGGATLRLTPGNVLV-DVNETTTCLAFAPTD-STAIIGNTQQQTFSVVYDVKGS 440

Query: 365 MVSFKPTDCT 374
            + F    C+
Sbjct: 441 RIGFAAGGCS 450


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  150 bits (380), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 162/366 (44%), Gaps = 34/366 (9%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSS 73
           ++ T+N  YV+  S+GTP +      VDTGSDL WVQC PC    CY+Q  P+++PA SS
Sbjct: 131 DIGTSN--YVVTASLGTPGMAQTL-EVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSS 187

Query: 74  SYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SY  + C    C  L     +CS+ Q C Y   Y D S T GV +++ +T   +N     
Sbjct: 188 SYAAVPCGRSACAGLGIYASACSAAQ-CGYVVSYGDGSNTTGVYSSDTLTLA-ANATVQG 245

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
            +FGCGH  +G       GL+G GR + SL  Q     G   FSYCL    T SS T  +
Sbjct: 246 FLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYG-GVFSYCL---PTKSSTTGYL 301

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
             G  S V+ G   +  L S    TYY V L GISVG         P    + A + G +
Sbjct: 302 TLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQ-------PLSVPASAFAAGTV 354

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI----LT 307
            +DTG   T LP   Y  L    R+ +   P   P      CY   S AG   +    + 
Sbjct: 355 -VDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCY---SFAGYGTVNLTSVA 410

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
             F  GA + L            G   FA    DG + I GN  Q    +    D   V 
Sbjct: 411 LTFSSGATMTLGADGIM----SFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVG 464

Query: 368 FKPTDC 373
           F+P+ C
Sbjct: 465 FRPSSC 470


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 117/392 (29%), Positives = 180/392 (45%), Gaps = 45/392 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPAS 71
           V S  S+ +G+Y +   IGTPP   +  + DTGSDL+WV+C PC  C ++     +    
Sbjct: 75  VISGASSGSGQYFVSLRIGTPPQTLLL-VADTGSDLIWVKCSPCRNCSHRSPGSAFFARH 133

Query: 72  SSSYKELSCQSEQCHLLDTVS---CSSQQL---CNYTYGYADSSLTKGVLATERITFGNS 125
           S++Y  + C S QC L+       C+  +L   C Y Y YADSS T G  + E +T   S
Sbjct: 134 STTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTS 193

Query: 126 N---NFFDNVVFGCGHNN-----TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
                  + + FGCG        TG   E   G++GLGR  +S +SQ+  + G+ KFSYC
Sbjct: 194 TGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGS-KFSYC 252

Query: 178 LVPFHTDSSITSKMYFGNGSE--VSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNS 233
           L+ +      TS +  G      VS  G++S +  L++    T+Y++ ++G+ V    N 
Sbjct: 253 LMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYV----NG 308

Query: 234 SKLI--PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
            KL   P   S   +  G   ID+G   T + +  Y  + +  +  +KL    +P  G  
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFD 368

Query: 292 LCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPI--D 341
           LC     +   A P ++ +  GG        S F PPP        + + C A+QP+  D
Sbjct: 369 LCMNVSGVTRPALPRMSFNLAGG--------SVFSPPPRNYFIETGDQIKCLAVQPVSQD 420

Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G   + GN  Q    + +D D   + F    C
Sbjct: 421 GGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 159/365 (43%), Gaps = 18/365 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY ++  +G+PP  D Y ++D+GSD++WVQC PC  CYKQ  P+++PA S
Sbjct: 121 VVSGMDQGSGEYFVRIGVGSPPR-DQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 179

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            SY  +SC S  C  ++   C S   C Y   Y D S TKG LA E +TF  +     NV
Sbjct: 180 GSYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTF--AKTVVRNV 236

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +S   Q+  Q G   F YCLV   TDS  T  + 
Sbjct: 237 AMGCGHRNRGMFIGAAGLLGIGGGS-MSFVGQLSGQTGG-AFGYCLVSRGTDS--TGSLV 292

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           FG  +   G   V      +    YY           L      IP  +    +++   G
Sbjct: 293 FGREALPVGASWVPLVRNPRAPSFYYVGLKG------LGVGGVRIPLPDGVFDLTETGDG 346

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
            + +DTG   T LP   Y    +  ++     P          CY       +  P ++ 
Sbjct: 347 GVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 406

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           +F  G  + L   +  +P    G +CFA       + I GN  Q  + + +D  +  V F
Sbjct: 407 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 466

Query: 369 KPTDC 373
            P  C
Sbjct: 467 GPNVC 471


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 152/313 (48%), Gaps = 36/313 (11%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V ++ +  A GEY++K  IGTPP       +DT SDL+W QC PC  CY QV P++NP  
Sbjct: 77  VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135

Query: 72  SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           SS+Y  L C S+ C  LD   C     + C YTY Y+ ++ T+G LA +++  G   + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193

Query: 130 DNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             V FGC  ++TG     +  G+VGLGR  LSL    +SQL   +F+YCL P    S I 
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL----VSQLSVRRFAYCLPP--PASRIP 247

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNS------------ 233
            K+  G  ++ +       ++  + D    +YY++ L+G+ +G+ + S            
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATAT 307

Query: 234 ----SKLIPYYNSSGAISKGN-----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
               +       ++ A++ G+     M ID  +  T L    Y+ L   +   I+L    
Sbjct: 308 ATATAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGT 367

Query: 285 DPRLGSQLCYKTP 297
              LG  LC+  P
Sbjct: 368 GSSLGLDLCFILP 380


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 168/359 (46%), Gaps = 22/359 (6%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
            G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++PA SS+   +
Sbjct: 183 TGNYVVTIGLGTP--AGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANI 240

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
           SC +  C  L T  CS    C Y   Y D S + G  A + +T  +S +      FGCG 
Sbjct: 241 SCAAPACSDLYTKGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAIKGFRFGCGE 298

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
            N G+F E   GL+GLGR + SL  Q   + G   F++C   F   SS T  + FG GS 
Sbjct: 299 RNEGLFGE-AAGLLGLGRGKTSLPVQAYDKYG-GVFAHC---FPARSSGTGYLDFGPGSS 353

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
            +    ++T ++     T+Y+V L GI VG      KL+    S    +     +D+G  
Sbjct: 354 PAVSTKLTTPMLVDNGLTFYYVGLTGIRVGG-----KLLSIPPS--VFTTAGTIVDSGTV 406

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
            T LP   Y+ L     +AI    Y+     S L  CY    M+ +A P ++  F GGA 
Sbjct: 407 ITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGAS 466

Query: 316 VPLIHTSTFIPPPV-EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + +  +       V +    FA    D DVGI GN       + YD   ++V F P  C
Sbjct: 467 LDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 126/393 (32%), Positives = 194/393 (49%), Gaps = 46/393 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  + EY+M   +GTPP      I+DTGSDL W+QC PC+ C++Q  P+++PA+S
Sbjct: 135 VESGVAVGSAEYLMDVYVGTPPRR-FQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 193

Query: 73  SSYKELSCQSEQC-HLLDTVSCSS-------QQLCNYTYGYADSSLTKGVLATERITFG- 123
           SSY+ L+C   +C H+    + +        +  C Y Y Y D S + G LA E  T   
Sbjct: 194 SSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNL 253

Query: 124 ---NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
               +++  D VVFGCGH N G+F+     L+GLGR  LS ASQ+ +  G + FSYCLV 
Sbjct: 254 TAPGASSRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGGHTFSYCLVD 312

Query: 181 FHTDSSITSKMYFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVG----NLS 231
             +D  + SK+ FG    ++          + +  S    T+Y+V L G+ VG    N+S
Sbjct: 313 HGSD--VASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNIS 370

Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQD-PRL 288
           + +     +++S   S G + ID+G   +   +  Y  +      R +    P  D P L
Sbjct: 371 SDT-----WDASEGGSGGTI-IDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVL 424

Query: 289 GSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDG 342
               CY   +++G+     P L+  F  GA       + FI    +G+ C A+   P  G
Sbjct: 425 SP--CY---NVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 479

Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            + I GNF Q +  + YD  +  + F P  C +
Sbjct: 480 -MSIIGNFQQQNFHVAYDLHNNRLGFAPRRCAE 511


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 128/382 (33%), Positives = 193/382 (50%), Gaps = 29/382 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S V+  +GEY M   IGTPP      I+DTGSDL W+QC+PC  C++Q  P Y+P  S
Sbjct: 79  LESGVTLGSGEYFMDVFIGTPPK-HYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKES 137

Query: 73  SSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
           SS++ + C   +CHL+ +    + C ++ Q C Y Y Y DSS T G  ATE  T      
Sbjct: 138 SSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSP 197

Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
            G S     +NV+FGCGH N G+F+    GL+GLGR  LS +SQ+ S  G + FSYCLV 
Sbjct: 198 TGKSEFKRVENVMFGCGHWNRGLFH-GASGLLGLGRGPLSFSSQLQSLYG-HSFSYCLVD 255

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISV-GNLSNSSK 235
            ++D++++SK+ FG   ++     ++ T+LV  ++    T+Y+V ++ I V G + N  +
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPE 315

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP-YQD-PRLGSQLC 293
                 S G    G   +D+G   +   +  Y  +++     +K  P  QD P L    C
Sbjct: 316 STWNMTSDGV---GGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDP--C 370

Query: 294 YKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFA 351
           Y    +  I  P     F  GA       + FI    E V C A+       + I GN+ 
Sbjct: 371 YNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQ 430

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
           Q +  + YD     + + P +C
Sbjct: 431 QQNFHVLYDTKKSRLGYAPMNC 452


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 128/386 (33%), Positives = 199/386 (51%), Gaps = 35/386 (9%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S ++  +GEY M   +G+PP      I+DTGSDL W+QCLPC  C++Q    Y+P +
Sbjct: 158 TLESGMTLGSGEYFMDVLVGSPPK-HFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKA 216

Query: 72  SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
           S+SYK ++C  ++C+L+ +    + C S  Q C Y Y Y DSS T G  A E  T     
Sbjct: 217 SASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 276

Query: 123 -GNSNNFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            G S+  +  +N++FGCGH N G+F+     L+GLGR  LS +SQ+ S  G + FSYCLV
Sbjct: 277 NGGSSELYNVENMMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLV 334

Query: 180 PFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSS 234
             ++D++++SK+ FG   + +S   +  TS V+ ++    T+Y+V ++ I V G + N  
Sbjct: 335 DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIP 394

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQD-PRLG 289
           +     +S GA   G   ID+G   +   +  Y    N++ E+ +   K   Y+D P L 
Sbjct: 395 EETWNISSDGA---GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILD 449

Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
              C+    +  +  P L   F  GA       ++FI    E + C AM         I 
Sbjct: 450 P--CFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLN-EDLVCLAMLGTPKSAFSII 506

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           GN+ Q +  I YD     + + PT C
Sbjct: 507 GNYQQQNFHILYDTKRSRLGYAPTKC 532


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 116/356 (32%), Positives = 172/356 (48%), Gaps = 29/356 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           +Y++   IGTP   ++  I DTGS L+W QC PC  CY +V P+++P  S+S+K L C S
Sbjct: 131 DYIVNVGIGTPKK-EMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSS 188

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
           + C  +    CSS + C Y   Y D+S + G LATE I+F +    F N++ GC    +G
Sbjct: 189 KLCQSIRQ-GCSSPK-CTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSG 246

Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG 202
             +  E G++GL R+ +SLASQ  + +    FSYC +P    S+         G    GG
Sbjct: 247 E-SLGESGIMGLNRSPISLASQT-ANIYDKLFSYC-IPSTPGST---------GHLTFGG 294

Query: 203 GVVSTSLVSKEDKTY----YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
            V +    S   KT     Y + + GISVG      KL+       +  K    ID+GA 
Sbjct: 295 KVPNDVRFSPVSKTAPSSDYDIKMTGISVGG----RKLL----IDASAFKIASTIDSGAV 346

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
            T LP   Y+ L    R  +K  P  D       CY   + + +A P ++  F+GG ++ 
Sbjct: 347 LTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMD 406

Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +  +      P   V+C A   +D +V IFGNF Q    + +D   + + F P  C
Sbjct: 407 IDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 119/387 (30%), Positives = 188/387 (48%), Gaps = 39/387 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  +GEY++   +GTPP      I+DTGSDL W+QC PC+ C++Q  P+++PA+S
Sbjct: 138 VESGVAVGSGEYLIDVYVGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 196

Query: 73  SSYKELSCQSEQCHLL----DTVSCS--SQQLCNYTYGYADSSLTKGVLATERITFG--- 123
           SSY+ ++C  ++C L+       +C   ++  C Y Y Y D S T G LA E  T     
Sbjct: 197 SSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 256

Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
             ++   D VVFGCGH N G+F+     L       LS ASQ+ +  G + FSYCLV   
Sbjct: 257 PGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGR-GPLSFASQLRAVYG-HTFSYCLVEHG 314

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSL---VSKEDKTYYFVTLEGISVG----NLSNSSK 235
           +D+   SK+ FG    V     +  +     S    T+Y+V L+G+ VG    N+S+ + 
Sbjct: 315 SDAG--SKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTW 372

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-KLTPYQDPRLGSQLCY 294
            +      G    G   ID+G   +   +  Y  + +   + + +L P          CY
Sbjct: 373 DV------GKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCY 426

Query: 295 KTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFG 348
              +++G+     P L+  F  GA       + F+    +G+ C A++  P  G + I G
Sbjct: 427 ---NVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTG-MSIIG 482

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           NF Q +  + YD  +  + F P  C +
Sbjct: 483 NFQQQNFHVVYDLQNNRLGFAPRRCAE 509


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 170/377 (45%), Gaps = 43/377 (11%)

Query: 24  YVMKFSIGTP---PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           YV   S+G     P  ++  IVDTGSDL WVQC PC  CY Q  P+++PA S++Y  + C
Sbjct: 144 YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRC 203

Query: 81  QSEQCHLLDTV--------SCSS----QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
            +  C   D++        SC S     + C Y   Y D S ++GVLAT+ +  G ++  
Sbjct: 204 NASACA--DSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS-- 259

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
               VFGCG +N G+F     GL+GLGRT LSL SQ  S+ G   FSYCL P  T    +
Sbjct: 260 LGGFVFGCGLSNRGLFG-GTAGLMGLGRTELSLVSQTASRYG-GVFSYCL-PAATSGDAS 316

Query: 189 SKMYFGNGSEVSGG-----GVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNS 242
             +  G G + +        V  T +++   +  +YF+ + G +VG  + +++       
Sbjct: 317 GSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ------- 369

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMA 300
              +   N+ ID+G   T L    Y  +  +         Y      S L  CY      
Sbjct: 370 --GLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHD 427

Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLF 356
            +  P+LT   +GGA V +           +G   C AM  +  + +  I GN+ Q +  
Sbjct: 428 EVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKR 487

Query: 357 IGYDFDSQMVSFKPTDC 373
           + YD     + F   DC
Sbjct: 488 VVYDTLGSRLGFADEDC 504


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 173/374 (46%), Gaps = 29/374 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S V      Y++   IG     ++  IVDTGSDL WVQC PC  CY Q  P++NP+ S
Sbjct: 56  LSSGVRLQTLNYIVTVEIGGR---NMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGS 112

Query: 73  SSYKELSCQSEQCHLLD------TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
            SY+ + C S  C  L        V  S+   CNY   Y D S T+G L  E++  G ++
Sbjct: 113 PSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTH 172

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
               N +FGCG NN G+F     GL+GLG++ LSL SQ  S +    FSYCL     D+S
Sbjct: 173 --VSNFIFGCGRNNKGLFG-GASGLMGLGKSDLSLVSQT-SAIFEGVFSYCLPTTAADAS 228

Query: 187 ITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
             S +  GN S       +S +  + + +  T+YF+ L GIS+G ++  +   P Y  SG
Sbjct: 229 -GSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQA---PNYRQSG 284

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-A 303
                 + ID+G   T LP   Y  L+ +        P   P      C+       +  
Sbjct: 285 ------ILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDI 338

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYD 360
           P +   F+G A++ +  T  F     +    C A+  +  D ++ I GN+ Q +  + Y+
Sbjct: 339 PTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYN 398

Query: 361 FDSQMVSFKPTDCT 374
                + F    C+
Sbjct: 399 TKESKLGFAAEACS 412


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 167/368 (45%), Gaps = 28/368 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S +    G YVM  S+GTP       I DTGSDL+WVQ  PC  C      I++P  S
Sbjct: 44  VESPLHPDGGGYVMDISVGTPGK-RFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQS 100

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
           S+++E+ C S+ C  L          C+Y+Y Y  S  T+G  A + I+ G +++    F
Sbjct: 101 STFREMDCSSQLCAELPGSCEPGSSTCSYSYEYG-SGETEGEFARDTISLGTTSDGSQKF 159

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            +   GCG  N+G    +  GLVGLG+  +SL SQ+ + +  +KFSYCLV  ++ S  +S
Sbjct: 160 PSFAVGCGMVNSGFDGVD--GLVGLGQGPVSLTSQLSAAI-DSKFSYCLVDINSQSE-SS 215

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
            + FG  + + G G+ ST +    D   TYY +T+ GI+V   +               S
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG-------------S 262

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
            G   ID+G   T +P   Y R+  ++ + + L       +G  LCY   S      P L
Sbjct: 263 PGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGYDFDSQM 365
           T    G    P       +        C AM    G  V I GN  Q    I YD  S  
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSE 382

Query: 366 VSFKPTDC 373
           +SF    C
Sbjct: 383 LSFVQAKC 390


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 110/365 (30%), Positives = 159/365 (43%), Gaps = 18/365 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S +   +GEY ++  +G+PP  D Y ++D+GSD++WVQC PC  CYKQ  P+++PA S
Sbjct: 120 IVSGMDQGSGEYFVRIGVGSPPR-DQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 178

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            SY  +SC S  C  ++   C S   C Y   Y D S TKG LA E +TF  +     NV
Sbjct: 179 GSYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTF--AKTVVRNV 235

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L   G + +S   Q+  Q G   F YCLV   TDS  T  + 
Sbjct: 236 AMGCGHRNRGMFIGAAGLLGIGGGS-MSFVGQLSGQTGG-AFGYCLVSRGTDS--TGSLV 291

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           FG  +   G   V      +    YY           L      IP  +    +++   G
Sbjct: 292 FGREALPVGASWVPLVRNPRAPSFYYVGLKG------LGVGGVRIPLPDGVFDLTETGDG 345

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
            + +DTG   T LP   Y    +  ++     P          CY       +  P ++ 
Sbjct: 346 GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 405

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           +F  G  + L   +  +P    G +CFA       + I GN  Q  + + +D  +  V F
Sbjct: 406 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 465

Query: 369 KPTDC 373
            P  C
Sbjct: 466 GPNVC 470


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/301 (35%), Positives = 141/301 (46%), Gaps = 20/301 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PC  C+ Q  P ++P++SS+    SC S
Sbjct: 81  EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 139

Query: 83  EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C  L   SC S      Q C YTY Y D S+T G L  ++ TF  +      V FGCG
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 199

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             N GVF  NE G+ G GR  LSL     SQL    FS+C    +     T  +      
Sbjct: 200 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVNGLKPSTVLLDLPADL 255

Query: 198 EVSGGGVV-STSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFI 253
             SG G V ST L+    + T+Y+++L+GI+VG     S  +P   S  A+  G     I
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVG-----STRLPVPESEFALKNGTGGTII 310

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
           D+G   T LP   Y  + +     +KL            C   P  A    P L  HF+G
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 313 G 313
            
Sbjct: 371 A 371


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 115/380 (30%), Positives = 176/380 (46%), Gaps = 37/380 (9%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +G+Y+ K ++GTP +  +  + DT SDL W+QC PC +CY Q  P+++P  S+SY E++ 
Sbjct: 138 SGDYIAKIAVGTPAVEALLAL-DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNY 196

Query: 81  QSEQCHLLDTVSC--SSQQLCNYTYGYAD------SSLTKGVLATERITF-GNSNNFFDN 131
            +  C  L       + +  C YT  Y D      +S + G L  E +TF G     + +
Sbjct: 197 DAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLS 256

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD-SSITS 189
           +  GCGH+N G+F     G++GL R ++S+  QI + LG N  FSYCLV F +   S +S
Sbjct: 257 I--GCGHDNKGLFGAPAAGILGLSRGQISIPHQI-AFLGYNASFSYCLVDFISGPGSPSS 313

Query: 190 KMYFGNGS-EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNL------SNSSKLIPYYNS 242
            + FG G+ + S     + +++++   T+Y+V L G+SVG +          +L PY   
Sbjct: 314 TLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGH 373

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYN---RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
            G I      +D+G   T L +  Y             +       P      CY     
Sbjct: 374 GGVI------LDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGR 427

Query: 300 AGI-----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQS 353
           AG+      P ++ HF GG ++ L   +  I     G  CFA     D  V + GN  Q 
Sbjct: 428 AGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQ 487

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
              + YD   Q V F P  C
Sbjct: 488 GFRVVYDIGGQRVGFAPNSC 507


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 118/387 (30%), Positives = 176/387 (45%), Gaps = 38/387 (9%)

Query: 15  SNVSTANGEYVMKFSIGTP----PLLDIYGIVDTGSDLMWVQCLP---CVQCYKQVKPIY 67
           S  S  +G+Y ++  +GTP    PL     IVDTGSDL W+QC P            P Y
Sbjct: 50  SGSSIGSGQYFVELRVGTPAKKFPL-----IVDTGSDLTWIQCNPPNTTANSSSPPAPWY 104

Query: 68  NPASSSSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           + +SSSSY+E+ C  ++C  L      + S +S   C+YTYGY+D S T G+LA E I+ 
Sbjct: 105 DKSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISM 164

Query: 123 ----------GNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
                     GN         NV  GC   + G       G++GLG+  +SLA+Q     
Sbjct: 165 KSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTA 224

Query: 170 GANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG 228
               FSYCLV +   S+ +S +  G         +  T +V     +++Y+V + G++V 
Sbjct: 225 LGGIFSYCLVDYLRGSNASSFLVMG---RTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVD 281

Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
                      +   G  +KG +F D+G   + L +  Y+++   +  +I L   Q+   
Sbjct: 282 GKPVDGIASSDWGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE 340

Query: 289 GSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG--I 346
           G +LCY    M    P L   F GGA + L   + ++    E V C A+Q +    G  I
Sbjct: 341 GFELCYNVTRMEKGMPKLGVEFQGGAVMELPW-NNYMVLVAENVQCVALQKVTTTNGSNI 399

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            GN  Q D  I YD     + FK + C
Sbjct: 400 LGNLLQQDHHIEYDLAKARIGFKWSPC 426


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 173/374 (46%), Gaps = 36/374 (9%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
            + V   +    G Y M+FSIGTPP   +  + DTGSDL+W +C             Y+P
Sbjct: 86  TDTVPLRMDGGGGAYDMEFSIGTPPQ-KLTALADTGSDLIWTKCDAGGGAAWGGSSSYHP 144

Query: 70  ASSSSYKELSCQSEQCHLLDTVS----CSSQQLCNYTYGYA---DSSLTKGVLATERITF 122
            +SS++  L C    C  L + S     +    C+Y Y Y    D   T+G L +E  T 
Sbjct: 145 NASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTL 204

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
           G   +    V FGC     G + E   GLVGLGR  LSL    +SQL A  F YCL    
Sbjct: 205 GG--DAVPGVGFGCTTALEGDYGEGA-GLVGLGRGPLSL----VSQLDAGTFMYCLT--- 254

Query: 183 TDSSITSKMYFGNGSEV--SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
            D+S  S + FG  + +  +G GV ST L++    T+Y V L  I++G+ + +       
Sbjct: 255 ADASKASPLLFGALATMTGAGAGVQSTGLLAS--TTFYAVNLRSITIGSATTAGV----- 307

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKTPSM 299
                   G +  D+G   T L +  Y   +   +     LTP +  R G + CY+ P  
Sbjct: 308 -----GGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEG-RYGFEACYEKPDS 361

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           A + P +  HFDGGA + L   + ++    +GV C+ +Q     + I GN  Q +  + +
Sbjct: 362 ARLIPAMVLHFDGGADMAL-PVANYVVEVDDGVVCWVVQR-SPSLSIIGNIMQMNYLVLH 419

Query: 360 DFDSQMVSFKPTDC 373
           D    ++SF+P +C
Sbjct: 420 DVRKSVLSFQPANC 433


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 166/367 (45%), Gaps = 31/367 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV+   IGTP +     ++DTGSDL WVQC PC   +CY Q  P+++P+SSSSY  + C
Sbjct: 90  EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPC 148

Query: 81  QSEQCHLLDT---------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+ C  L           VS  +  LC Y   Y + + T GV +TE +T         +
Sbjct: 149 DSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTL-KPGVVVAD 207

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGCG +  G + + + GL+GLG    SL SQ  SQ G   FSYCL P    +   +  
Sbjct: 208 FGFGCGDHQHGPYEKFD-GLLGLGGAPESLVSQTSSQFG-GPFSYCLPPTSGGAGFLTLG 265

Query: 192 YFGNGSEVSGGGVVSTSLVSK--EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
              N S  +    +S + + +     T+Y VTL GISVG         P      A S G
Sbjct: 266 APPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGG-------APLAIPPSAFSSG 318

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
            M ID+G   T LP   Y  L    R+A+       P  G  L  CY     A +  P +
Sbjct: 319 -MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTI 377

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           +  F GGA + L   +  +   V+G   FA    D  +GI GN  Q    + YD     V
Sbjct: 378 SLTFSGGATIDLAAPAGVL---VDGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTV 434

Query: 367 SFKPTDC 373
            F+   C
Sbjct: 435 GFRAGAC 441


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 176/359 (49%), Gaps = 25/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    +  +VDTGS L W+QC PC V C++Q  P++NP SSSSY  +SC
Sbjct: 119 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSC 177

Query: 81  QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            + QC  L T      +CS+  +C Y   Y DSS + G L+ + ++FG+++    N  +G
Sbjct: 178 SAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 235

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F ++  GL+GL R +LSL  Q+   +G + FSYCL P  + SS    +   N
Sbjct: 236 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCL-PTSSSSSGYLSIGSYN 292

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
             + S   +  +SL    D + YF+ + GI+V          P   S+ A S     ID+
Sbjct: 293 PGQYSYTPMAKSSL----DDSLYFIKMTGITVAGK-------PLSVSASAYSSLPTIIDS 341

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP D Y+ L + V  A+K TP          C++  +     P ++  F GGA 
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVPQVSMAFAGGAA 401

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L  T+  +        C A  P      I GN  Q    + YD  +  + F    C+
Sbjct: 402 LKLKATNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 125/369 (33%), Positives = 175/369 (47%), Gaps = 33/369 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
            +S ++  +G Y++   IGTP   D+  + DTGSDL W QC PC+  CY Q +P +NP+S
Sbjct: 121 AKSGITLGSGNYIVTIGIGTPKH-DLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSS 179

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS+Y+ +SC S  C   D  SCS+   C Y+ GY D S T+G LA E+ T  NS +  ++
Sbjct: 180 SSTYQNVSCSSPMCE--DAESCSASN-CVYSIGYGDKSFTQGFLAKEKFTLTNS-DVLED 235

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG NN G+F+     L          A    +    N FSYCL  F ++S  T  +
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTY--NNIFSYCLPSFTSNS--TGHL 291

Query: 192 YFGNG--SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISK 248
            FG+   SE     V  T + S      Y + + GISVG+      + P  +++ GAI  
Sbjct: 292 TFGSAGISE----SVKFTPISSFPSAFNYGIDIIGISVGD--KELAITPNSFSTEGAI-- 343

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-P 304
               ID+G   T LP   Y  L    +   K++ Y+    G  L   CY    +  +  P
Sbjct: 344 ----IDSGTVFTRLPTKVYAELRSVFKE--KMSSYKSTS-GYGLFDTCYDFTGLDTVTYP 396

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +   F GG  V L  +   +P  +  V C A    D    IFGN  Q+ L + YD    
Sbjct: 397 TIAFSFAGGTVVELDGSGISLPIKISQV-CLAFAGNDDLPAIFGNVQQTTLDVVYDVAGG 455

Query: 365 MVSFKPTDC 373
            V F P  C
Sbjct: 456 RVGFAPNGC 464


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 126/376 (33%), Positives = 169/376 (44%), Gaps = 70/376 (18%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           Q+ +  + G Y M  SIGTPP+     + DTGS L+W QC PC +C  +  P + PASSS
Sbjct: 80  QTLLDNSAGAYNMNLSIGTPPV-TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSS 138

Query: 74  SYKELSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           ++ +L C S  C  L +   +C++   C Y Y Y     T G LATE +  G ++  F  
Sbjct: 139 TFSKLPCASSLCQFLTSPYRTCNATG-CVYYYPYG-MGFTAGYLATETLHVGGAS--FPG 194

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGC   N GV N +  G+VGLGR+ LSL SQ+    G  +FSYCL   + D+   S +
Sbjct: 195 VTFGCSTEN-GVGNSSS-GIVGLGRSPLSLVSQV----GVARFSYCLRS-NADAG-DSPI 246

Query: 192 YFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
            FG+ ++V+GG V ST L+   +    +YY+V L GI+VG                    
Sbjct: 247 LFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVG-------------------- 286

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK----TPSMAGIAP 304
                      T LP    N           LT     R G  LC+             P
Sbjct: 287 ----------ATDLPMAMAN-----------LTTVNGTRFGFDLCFDATAAGGGGGVPVP 325

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAMQPIDG--DVGIFGNFAQSDLFI 357
            L   F GGA+  +   S F    V+      V C  + P      + I GN  Q DL +
Sbjct: 326 TLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHV 385

Query: 358 GYDFDSQMVSFKPTDC 373
            YD D  M SF P DC
Sbjct: 386 LYDLDGGMFSFAPADC 401


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 166/368 (45%), Gaps = 28/368 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S +    G YVM  S+GTP       I DTGSDL+WVQ  PC  C      I++P  S
Sbjct: 44  VESPLHPDGGGYVMDISVGTPGK-RFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQS 100

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS---NNFF 129
           S+++E+ C S+ C  L          C+Y+Y Y  S  T+G  A + I+ G +   +  F
Sbjct: 101 STFREMDCSSQLCTELPGSCEPGSSACSYSYEYG-SGETEGEFARDTISLGTTSGGSQKF 159

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            +   GCG  N+G   +   GLVGLG+  +SL SQ+ + +  +KFSYCLV  ++ S  +S
Sbjct: 160 PSFAVGCGMVNSGF--DGVDGLVGLGQGPVSLTSQLSAAI-DSKFSYCLVDINSQSE-SS 215

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
            + FG  + + G G+ ST +    D   TYY +T+ GI+V   +               S
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG-------------S 262

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
            G   ID+G   T +P   Y R+  ++ + + L       +G  LCY   S      P L
Sbjct: 263 PGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGYDFDSQM 365
           T    G    P       +        C AM    G  V I GN  Q    I YD  S  
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSE 382

Query: 366 VSFKPTDC 373
           +SF    C
Sbjct: 383 LSFVQAKC 390


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 166/367 (45%), Gaps = 31/367 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV+   IGTP +     ++DTGSDL WVQC PC   +CY Q  P+++P+SSSSY  + C
Sbjct: 170 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPC 228

Query: 81  QSEQCHLLDT---------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+ C  L           VS  +  LC Y   Y + + T GV +TE +T         +
Sbjct: 229 DSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTL-KPGVVVAD 287

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGCG +  G + + + GL+GLG    SL SQ  SQ G   FSYCL P    +   +  
Sbjct: 288 FGFGCGDHQHGPYEKFD-GLLGLGGAPESLVSQTSSQFG-GPFSYCLPPTSGGAGFLTLG 345

Query: 192 YFGNGSEVSGGGVVSTSLVSK--EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
              N S  +    +S + + +     T+Y VTL GISVG         P      A S G
Sbjct: 346 APPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGG-------APLAIPPSAFSSG 398

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
            M ID+G   T LP   Y  L    R+A+       P  G  L  CY     A +  P +
Sbjct: 399 -MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTI 457

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           +  F GGA + L   +  +   V+G   FA    D  +GI GN  Q    + YD     V
Sbjct: 458 SLTFSGGATIDLAAPAGVL---VDGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTV 514

Query: 367 SFKPTDC 373
            F+   C
Sbjct: 515 GFRAGAC 521


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 176/375 (46%), Gaps = 36/375 (9%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           NV      +++  SIG+PP+  +  + DT SDL+W+QC PC+ CY Q  PI++P+ S ++
Sbjct: 77  NVPIIPQAFLVNISIGSPPVTQLLHM-DTASDLLWLQCRPCINCYAQSLPIFDPSRSYTH 135

Query: 76  KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-----NSNNFFD 130
           +  SC++ Q  +      +  + C Y+  Y D + +KG+LA E + F      +S+    
Sbjct: 136 RNESCRTSQYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALH 195

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           +VVFGCGH+N G       G++GLG    SL  +        KFSYC       S   + 
Sbjct: 196 DVVFGCGHDNYGE-PLVGTGILGLGYGEFSLVHRF-----GTKFSYCFGSLDDPSYPHNV 249

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAI 246
           +  G+     G  ++  +   +    +Y+VT+E ISV  +     ++P     +N +   
Sbjct: 250 LVLGD----DGANILGDTTPLEIYNGFYYVTIEAISVDGI-----ILPIDPWVFNRNHQT 300

Query: 247 SKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
             G   IDTG   T L ++ Y    N++E+               +    CY       +
Sbjct: 301 GLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDL 360

Query: 303 A----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
                PI+T HF  GA++ L   S F+      VFC A+ P  G++   G  AQ    IG
Sbjct: 361 VESGFPIVTFHFSDGAELSLDVKSVFMKLS-PNVFCLAVTP--GNMNSIGATAQQSYNIG 417

Query: 359 YDFDSQMVSFKPTDC 373
           YD +++ +SF+  DC
Sbjct: 418 YDLEAKKISFERIDC 432


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 178/359 (49%), Gaps = 36/359 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-KPIYNPASSSSYKELSCQS 82
           +++ FS+G PP+  +  I+DTGS L+W+QC PC  C +Q+  P+++P+ SS+Y  LSC++
Sbjct: 102 FLVNFSMGQPPVPQL-AIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKN 160

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVVFGCGHN 139
             C    +  C S   C Y   Y +   + GV+ATE++ FG+S+   N  +NV+FGC H 
Sbjct: 161 IICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHR 220

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N    +    G+ GLG    S  + +++Q+G+ KFSYC+          +++    G  +
Sbjct: 221 NGNYKDRRFTGVFGLG----SGITSVVNQMGS-KFSYCIGNIADPDYSYNQLVLSEGVNM 275

Query: 200 SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-GNMFIDTGAP 258
            G    ST L   +   +Y V LEGISVG     ++L+   ++     K   + ID+G  
Sbjct: 276 EG---YSTPLDVVDG--HYQVILEGISVGE----TRLVIDPSAFKRTEKQRRVIIDSGTA 326

Query: 259 PTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGA 314
           PT L ++ Y  LE +VRN +   LTP+      S LCYK      +   P +T HF  GA
Sbjct: 327 PTWLAENEYRALEREVRNLLDRFLTPFMRE---SFLCYKGKVGQDLVGFPAVTFHFAEGA 383

Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              L+  +      V G           D  + G  AQ    + YD +   + F+  DC
Sbjct: 384 D--LVVDTEMRQASVYG-------KDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 113/360 (31%), Positives = 176/360 (48%), Gaps = 39/360 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N EY+M   + TPP+  +  + DTGS L+W++C        ++   + PASSS Y  L C
Sbjct: 73  NFEYLMALDVSTPPV-RMLALADTGSSLVWLKC--------KLPAAHTPASSS-YARLPC 122

Query: 81  QSEQCHLL-DTVSC----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            +  C  L D  SC    S   +C Y Y +AD S T G +  +  TF    +F      G
Sbjct: 123 DAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTRLDF------G 176

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFG 194
           C     G+   ++ GLVGL    +SL SQ+ ++   A+KFSYCLVP+ +  +++S + FG
Sbjct: 177 CATRTEGLSVPDD-GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSLNFG 235

Query: 195 NGSEVSGG-GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           + + VS   G  +T LV+  +K++Y + L+ I V     + K +P   ++       + +
Sbjct: 236 SHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKV-----AGKPVPLQTTT-----TKLIV 285

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY----KTPSMAGIA-PILTA 308
           D+G   T LPK   + L   +  AIKL   + P     +CY    + P   G + P +T 
Sbjct: 286 DSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIPDVTL 345

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
              GG +V L   +TF+        C A+        I GN AQ +L +G+D + + VSF
Sbjct: 346 VLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLERRTVSF 405


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 114/362 (31%), Positives = 168/362 (46%), Gaps = 30/362 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           E+V+    GTP     Y ++ DTGSD+ W+QCLPC   CYKQ  PI++P  S++Y  + C
Sbjct: 134 EFVVTVGFGTP--AQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPC 191

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
              QC   D   CS+   C Y   Y D S + GVL+ E ++   S        FGCG  N
Sbjct: 192 GHPQCAAADGSKCSNGT-CLYKVEYGDGSSSAGVLSHETLSL-TSTRALPGFAFGCGQTN 249

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G F + + GL+GLGR +LSL+SQ  +  G   FSYCL    +D++    +  G  +  S
Sbjct: 250 LGDFGDVD-GLIGLGRGQLSLSSQAAASFGGT-FSYCL---PSDNTTHGYLTIGPTTPAS 304

Query: 201 GGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
              V  T++V K+D  ++YFV L  I +G       ++P   +    +    F+D+G   
Sbjct: 305 NDDVQYTAMVQKQDYPSFYFVELVSIDIGGY-----ILPVPPT--LFTDDGTFLDSGTIL 357

Query: 260 TLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
           T LP + Y  L ++ +  +   K  P  DP      CY     + I  P ++  F  G+ 
Sbjct: 358 TYLPPEAYTALRDRFKFTMTQYKPAPAYDPF---DTCYDFTGQSAIFIPAVSFKFSDGSV 414

Query: 316 VPLIHTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
             L      I P    P  G   F  +P      I GN  Q +  + YD  ++ + F   
Sbjct: 415 FDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASA 474

Query: 372 DC 373
            C
Sbjct: 475 SC 476


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 117/372 (31%), Positives = 173/372 (46%), Gaps = 33/372 (8%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           N+ T N  Y++   +G     ++  I+DTGSDL WVQC PC+ CY Q  P++NP++SSSY
Sbjct: 127 NLETLN--YIVTIGLGNQ---NMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSY 181

Query: 76  KELSCQSEQCHLL-----DTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
             L C S  C  L     +T +C S     CN+T  Y D S T G L  E ++FG  +  
Sbjct: 182 NSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGIS-- 239

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N VFGCG NN G+F     G++GLGR+ LS+ SQ  +  G   FSYCL    TDS  +
Sbjct: 240 VSNFVFGCGRNNKGLFG-GVSGIMGLGRSNLSMISQTNTTFGG-VFSYCLPT--TDSGAS 295

Query: 189 SKMYFGNGSEVSGG--GVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             +  GN S +      +  TS+VS  +   +Y + L GI VG ++             +
Sbjct: 296 GSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDT---------S 346

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
              G + ID+G   T L    YN L+ +        P          C+    +  ++ P
Sbjct: 347 FGNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIP 406

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFD 362
            L+ HF+    + +        P      C A+  +  + D+ I GN+ Q +  + YD  
Sbjct: 407 TLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAK 466

Query: 363 SQMVSFKPTDCT 374
              + F   DC+
Sbjct: 467 QSKIGFAREDCS 478


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 121/388 (31%), Positives = 189/388 (48%), Gaps = 40/388 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  +GEY++   +GTPP      I+DTGSDL W+QC PC+ C++Q  P+++PA+S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 199

Query: 73  SSYKELSCQSEQCHLL----DTVSCSSQQL--CNYTYGYADSSLTKGVLATERITFG--- 123
            SY+ ++C   +C L+       +C       C Y Y Y D S T G LA E  T     
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
             ++   D+VVFGCGH+N G+F+     L+GLGR  LS ASQ+ +  G + FSYCLV   
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYG-HAFSYCLV--D 315

Query: 183 TDSSITSKMYFGNGSEVSGGGVVS----TSLVSKEDKTYYFVTLEGISVG----NLSNSS 234
             SS+ SK+ FG+   + G   ++        +    T+Y+V L+G+ VG    N+S S+
Sbjct: 316 HGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPST 375

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPT--LLPKDFYNRLEEQ---VRNAIKLTPYQDPRLG 289
             +    S G I      +   A P   ++ + F  R+++    V +   L+P       
Sbjct: 376 WDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP------- 428

Query: 290 SQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIF 347
              CY    +  +  P  +  F  GA       + F+    +G+ C A +      + I 
Sbjct: 429 ---CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           GNF Q +  + YD  +  + F P  C +
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRCAE 513


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 175/387 (45%), Gaps = 38/387 (9%)

Query: 15  SNVSTANGEYVMKFSIGTP----PLLDIYGIVDTGSDLMWVQCLP---CVQCYKQVKPIY 67
           S  S  +G+Y ++  +GTP    PL     I+DTGSDL W+QC P            P Y
Sbjct: 18  SGSSIGSGQYFVELRVGTPAKKFPL-----IIDTGSDLTWIQCNPPNTTANSSSPPAPWY 72

Query: 68  NPASSSSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           + +SSSSY+E+ C  ++C  L      + S  S   C+YTYGY+D S T G+LA E I+ 
Sbjct: 73  DKSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISM 132

Query: 123 ----------GNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
                     GN         NV  GC   + G       G++GLG+  +SLA+Q     
Sbjct: 133 KSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTA 192

Query: 170 GANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG 228
               FSYCLV +   S+ +S +  G         +  T +V     +++Y+V + G++V 
Sbjct: 193 LGGIFSYCLVDYLRGSNASSFLVMG---RTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVD 249

Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
                      +   G  +KG +F D+G   + L +  Y+++   +  +I L   Q+   
Sbjct: 250 GKPVDGIASSDWGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE 308

Query: 289 GSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG--I 346
           G +LCY    M    P L   F GGA + L   + ++    E V C A+Q +    G  I
Sbjct: 309 GFELCYNVTRMEKGMPKLGVEFQGGAVMEL-PWNNYMVLVAENVQCVALQKVTTTNGSNI 367

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            GN  Q D  I YD     + FK + C
Sbjct: 368 LGNLLQQDHHIEYDLAKARIGFKWSPC 394


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 121/388 (31%), Positives = 189/388 (48%), Gaps = 40/388 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  +GEY++   +GTPP      I+DTGSDL W+QC PC+ C++Q  P+++PA+S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATS 199

Query: 73  SSYKELSCQSEQCHLL----DTVSCSSQQL--CNYTYGYADSSLTKGVLATERITFG--- 123
            SY+ ++C   +C L+       +C       C Y Y Y D S T G LA E  T     
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
             ++   D+VVFGCGH+N G+F+     L+GLGR  LS ASQ+ +  G + FSYCLV   
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYG-HAFSYCLV--D 315

Query: 183 TDSSITSKMYFGNGSEVSGGGVVS----TSLVSKEDKTYYFVTLEGISVG----NLSNSS 234
             SS+ SK+ FG+   + G   ++        +    T+Y+V L+G+ VG    N+S S+
Sbjct: 316 HGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPST 375

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPT--LLPKDFYNRLEEQ---VRNAIKLTPYQDPRLG 289
             +    S G I      +   A P   ++ + F  R+++    V +   L+P       
Sbjct: 376 WDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP------- 428

Query: 290 SQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIF 347
              CY    +  +  P  +  F  GA       + F+    +G+ C A +      + I 
Sbjct: 429 ---CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           GNF Q +  + YD  +  + F P  C +
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRCAE 513


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 166/366 (45%), Gaps = 25/366 (6%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
           + +AN  Y +   +GTP   D+  + DTGSDL W QC PC   CYKQ   I++P+ SSSY
Sbjct: 131 IGSAN--YFVVVGLGTPKR-DLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSY 187

Query: 76  KELSCQSEQCHLLDTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
             ++C S  C  L +        SS   C Y   Y D S + G L+ ER+T   + +  D
Sbjct: 188 INITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTI-TATDIVD 246

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           + +FGCG +N G+F+    GL+GLGR  +S   Q  S +    FSYCL    + SS    
Sbjct: 247 DFLFGCGQDNEGLFS-GSAGLIGLGRHPISFVQQT-SSIYNKIFSYCL---PSTSSSLGH 301

Query: 191 MYFGNGSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
           + FG  S  +   +  T L +   D T+Y + + GISVG        +P  +SS   S G
Sbjct: 302 LTFG-ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTK-----LPAVSSS-TFSAG 354

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTA 308
              ID+G   T L    Y  L    R  ++  P  +       CY       I+ P +  
Sbjct: 355 GSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDF 414

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GG  V L      I    + V   FA    D D+ IFGN  Q  L + YD +   + 
Sbjct: 415 EFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIG 474

Query: 368 FKPTDC 373
           F    C
Sbjct: 475 FGAAGC 480


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 123/393 (31%), Positives = 190/393 (48%), Gaps = 46/393 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  +GEY+M   +GTPP      I+DTGSDL W+QC PC+ C+ QV P+++PA+S
Sbjct: 140 VESGVAVGSGEYLMDVYVGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAAS 198

Query: 73  SSYKELSCQSEQCHLL----DTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFG--- 123
           SSY+ ++C  ++C L+       +C    +  C Y Y Y D S T G LA E  T     
Sbjct: 199 SSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 258

Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
             ++   D+VVFGCGH N G+F+     L       LS ASQ+ +  G + FSYCLV   
Sbjct: 259 PGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGR-GPLSFASQLRAVYG-HTFSYCLVDHG 316

Query: 183 TDSSITSKMYFGNGSEVSGGGV------VSTSLVSKEDKTYYFVTLEGISVG----NLSN 232
           +D  + SK+ FG    ++           + +  S    T+Y+V L+G+ VG    N+S+
Sbjct: 317 SD--VASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISS 374

Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRL 288
            +    +    G    G   ID+G   +   +  Y  + +    ++  +  L P   P L
Sbjct: 375 DT----WGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIP-DFPVL 429

Query: 289 GSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDG 342
               CY   +++G+     P L+  F  GA       + FI    +G+ C A+   P  G
Sbjct: 430 SP--CY---NVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484

Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            + I GNF Q +  + YD  +  + F P  C +
Sbjct: 485 -MSIIGNFQQQNFHVVYDLKNNRLGFAPRRCAE 516


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 176/372 (47%), Gaps = 40/372 (10%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           NV      +++  SIG+PP+  +  + DT SDL+W+QCLPC+ CY Q  PI++P+ S ++
Sbjct: 77  NVPIIPQAFLVNISIGSPPITQLLHM-DTASDLLWIQCLPCINCYAQSLPIFDPSRSYTH 135

Query: 76  KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-----NSNNFFD 130
           +  +C++ Q  +      ++ + C Y+  Y D + +KG+LA E + F      +S+    
Sbjct: 136 RNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALH 195

Query: 131 NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           +VVFGCGH+N G   E  +  G++GLG    SL  +        KFSYC       S   
Sbjct: 196 DVVFGCGHDNYG---EPLVGTGILGLGYGEFSLVHRF-----GKKFSYCFGSLDDPSYPH 247

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSG 244
           + +  G+     G  ++  +   +    +Y+VT+E ISV  +     ++P     +N + 
Sbjct: 248 NVLVLGD----DGANILGDTTPLEIHNGFYYVTIEAISVDGI-----ILPIDPRVFNRNH 298

Query: 245 AISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
               G   IDTG   T L ++ Y    NR+E+               +    CY      
Sbjct: 299 QTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFER 358

Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
            +     PI+T HF  GA++ L   S F+      VFC A+ P  G++   G  AQ    
Sbjct: 359 DLVESGFPIVTFHFSEGAELSLDVKSLFMKLS-PNVFCLAVTP--GNLNSIGATAQQSYN 415

Query: 357 IGYDFDSQMVSF 368
           IGYD ++  VSF
Sbjct: 416 IGYDLEAMEVSF 427


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/347 (31%), Positives = 163/347 (46%), Gaps = 26/347 (7%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN 100
           IVDT S+L WVQC PC  C+ Q +P+++P+SS SY  + C S  C  L   +  S Q C+
Sbjct: 127 IVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACD 186

Query: 101 -------YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
                  YT  Y D S ++GVLA +R++    +      VFGCG +N G F     GL+G
Sbjct: 187 DQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED--IQGFVFGCGTSNQGPFGGTS-GLMG 243

Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTSLVS 211
           LGR++LSL SQ + Q G   FSYCL P  + SS    +  G+ + V  +   +V T++VS
Sbjct: 244 LGRSQLSLISQTMDQFG-GVFSYCLPPKESGSS--GSLVLGDDASVYRNSTPIVYTAMVS 300

Query: 212 KE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRL 270
                 +Y   L GI+VG     S   P +++ G    G   +D+G   T L    Y  +
Sbjct: 301 DPLQGPFYLANLTGITVGGEDVQS---PGFSAGGG---GKAIVDSGTIITSLVPSVYAAV 354

Query: 271 EEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTST-FIPPP 328
             +  + +   P   P      C+    +  +  P L   FDGGA+V +      ++   
Sbjct: 355 RAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTG 414

Query: 329 VEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                C A+  +    D  I GN+ Q +L + +D     + F    C
Sbjct: 415 DASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQIGFAQETC 461


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/380 (31%), Positives = 175/380 (46%), Gaps = 28/380 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S   T +GEY+ K ++GTP +  +  + DTGSD+ W+QC PC +CY Q  P+++P  S
Sbjct: 123 VVSRAPTTSGEYMAKIAVGTPAVEALLAM-DTGSDITWLQCQPCRRCYPQSGPVFDPRHS 181

Query: 73  SSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYA-DSSLTKGVLATERITFGNSNNFF 129
           +SY+E+   +  C  L       + +  C Y  GY  D S T G    E +TF       
Sbjct: 182 TSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGGVQ-V 240

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN--KFSYCLVPFHTDS-- 185
            ++  GCGH+N G+F     G++GLGR ++S  SQI + LG N   FSYCL  F   S  
Sbjct: 241 PHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQI-AALGYNVTSFSYCLADFFLSSPG 299

Query: 186 -SITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNL------SNSSKLI 237
            S++S +  G+G+         T  V   +  T+Y+V L G+SVG +       +  KL 
Sbjct: 300 RSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLD 359

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCY 294
           PY        +G + +D+G   T L +  Y    +  R A   +       P      CY
Sbjct: 360 PY------TGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCY 413

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQS 353
                A   P ++ HF GG ++ L   +  IP    G  CFA     D  V I GN  Q 
Sbjct: 414 TMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQ 473

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
              + Y+     V F P  C
Sbjct: 474 GFRVVYNIGGGRVGFAPNSC 493


>gi|224143825|ref|XP_002325088.1| predicted protein [Populus trichocarpa]
 gi|222866522|gb|EEF03653.1| predicted protein [Populus trichocarpa]
          Length = 241

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/292 (39%), Positives = 153/292 (52%), Gaps = 58/292 (19%)

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           SSS+Y  ++  S +CHLLDTVS   + +         SS +KG    +R++         
Sbjct: 7   SSSTYTTINRHSNKCHLLDTVSLPRKTI-------TLSSSSKG----QRVSV-------P 48

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           ++VFGCGHNNTG FNE+EMG VG G    SL S+I S  G  KF++CLVPFH+  +I+SK
Sbjct: 49  DIVFGCGHNNTG-FNEHEMGSVGRGGRPSSLTSRIGSYSGNIKFTHCLVPFHSTLNISSK 107

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           MY G+GSE+ G GVVST LV K+++ YY+V LEGISV       K +  Y+SSG ISK  
Sbjct: 108 MYSGDGSEIIGKGVVSTPLVRKKNRAYYYVALEGISV-----RGKFLT-YSSSGTISK-- 159

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
           +  + GA               +V     ++P QD       C+         P+  A F
Sbjct: 160 VHFEGGA---------------RVPTTTFISPKQD-----VFCFAMTITDAFMPV--ACF 197

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
             G    L+    F P        +AM       GIFGNFAQS+  IG+D D
Sbjct: 198 ARGLGSFLL----FRPRT-----SYAMSESMLAGGIFGNFAQSNFRIGFDLD 240


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 128/386 (33%), Positives = 197/386 (51%), Gaps = 35/386 (9%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S ++  +GEY M   +G+PP      I+DTGSDL W+QCLPC  C++Q    Y+P +
Sbjct: 143 TLESGMTLGSGEYFMDVLVGSPPK-HFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKA 201

Query: 72  SSSYKELSCQSEQCHLLDTVS----CSS-QQLCNYTYGYADSSLTKGVLATERITF---- 122
           S+SYK ++C   +C+L+        C S  Q C Y Y Y DSS T G  A E  T     
Sbjct: 202 SASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 261

Query: 123 -GNSNNFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            G S+  +  +N++FGCGH N G+F+     L+GLGR  LS +SQ+ S  G + FSYCLV
Sbjct: 262 SGGSSELYNVENMMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLV 319

Query: 180 PFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSS 234
             ++D++++SK+ FG   + +S   +  TS V++++    T+Y+V ++ I V G + N  
Sbjct: 320 DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIP 379

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQD-PRLG 289
           +     +S GA   G   ID+G   +   +  Y    N++ E+ +   K   Y+D P L 
Sbjct: 380 EETWNISSDGA---GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILD 434

Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
              C+    +  I  P L   F  GA       ++FI    E + C A+         I 
Sbjct: 435 P--CFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLN-EDLVCLAILGTPKSAFSII 491

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           GN+ Q +  I YD     + + PT C
Sbjct: 492 GNYQQQNFHILYDTKRSRLGYAPTKC 517


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/371 (31%), Positives = 175/371 (47%), Gaps = 30/371 (8%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASS 72
           S +S  +  YVMKFSIG+P + D Y I D+GS L+W+QC    C  CY+Q  P++NP+ S
Sbjct: 92  SRMSYTDKAYVMKFSIGSPAV-DTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKS 150

Query: 73  SSYKELSCQSEQCHLL---DTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
            +Y +  C + +C +    +   C    Q+C Y   Y D S T+GV++T+  TF    + 
Sbjct: 151 VTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISG 210

Query: 129 FDN----VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
           F N    ++FGCG+NN+   +    GLVGL   + SL    + Q+  ++FSYC V   T+
Sbjct: 211 FGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKASL----VGQMDVDQFSYC-VSIDTE 265

Query: 185 SSITSKM--YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSS--KLIPYY 240
            ++   M   FG  + +SG    ST LV   D  Y F  ++GI V           +  Y
Sbjct: 266 QNLKGSMEIRFGLAASISGH---STQLVPNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKY 322

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCYKTPSM 299
              G   +G + +DTG   T L     + L + +   I + P +D    G +LCY +   
Sbjct: 323 TEGG---QGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSGFELCYFSDDF 379

Query: 300 AGIA-PILTAHF-DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
            G   P +   F D        +T     P      C AM   +G + I G     D+ I
Sbjct: 380 LGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNG-MSIIGMHQLRDIKI 438

Query: 358 GYDFDSQMVSF 368
           GYD    +VSF
Sbjct: 439 GYDLHHNIVSF 449


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 124/385 (32%), Positives = 175/385 (45%), Gaps = 42/385 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYN 68
           V + V  A  +Y+  + IG+PP      ++DTGSDL+W QC    LP   C KQ  P YN
Sbjct: 75  VSAQVHRATRQYIASYLIGSPPQ-RTEALIDTGSDLIWTQCATTCLP-KSCAKQGLPYYN 132

Query: 69  PASSSSYKELSCQSEQ--CHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
            + SS++  + C  +   C       C     C +   Y    +  G L TE   F +  
Sbjct: 133 LSQSSTFVPVPCADKAGFCAANGVHLCGLDGSCTFIASYGAGRVI-GSLGTESFAFESGT 191

Query: 127 NFFDNVVFGC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
               ++ FGC       +G  N+   GL+GLGR RLSL SQI    GA +FSYCL P+  
Sbjct: 192 T---SLAFGCVSLTRITSGALNDAS-GLIGLGRGRLSLVSQI----GATRFSYCLTPYFH 243

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYY 240
            S  +S ++ G  + + GGG     + S +D    T+Y++ LEGI+VG        +P  
Sbjct: 244 SSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTR-----LPAV 298

Query: 241 NSS--------GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLG 289
           NS+             G + IDTG+P T L    Y  L+E+V   +    L P  +   G
Sbjct: 299 NSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDS-G 357

Query: 290 SQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGN 349
            +LC        + P L  HF GGA + +   S +   PV+      M    G   I GN
Sbjct: 358 LELCVAREGFQKVVPALVFHFGGGADMAVPAASYW--APVDKAAACMMILEGGYDSIIGN 415

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
           F Q D+ + YD      SF+  DCT
Sbjct: 416 FQQQDMHLLYDLRRGRFSFQTADCT 440


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 123/373 (32%), Positives = 186/373 (49%), Gaps = 45/373 (12%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
            + +  G Y M FS+GTPP   +  + DTGSDL+W +C  C +C  +    Y P  SSS+
Sbjct: 73  QMDSGGGAYDMTFSMGTPPQ-TLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSF 131

Query: 76  KELSCQSEQCHLLDTVSCSS-------QQLCNYTYGYADSS----LTKGVLATERITFGN 124
            +L C S  C  L++ S ++         +C+Y Y Y  SS     T+G + +E  T G 
Sbjct: 132 SKLPCSSALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLG- 190

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
            ++    + FGC    +     +  GLVGLGR +LSL  Q+  ++GA  FSYCL    +D
Sbjct: 191 -SDAVQGIGFGC-TTMSEGGYGSGSGLVGLGRGKLSLVRQL--KVGA--FSYCLT---SD 241

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            S +S + FG G+ ++G GV ST LV+ +  T+Y V L+ IS+G             + G
Sbjct: 242 PSTSSPLLFGAGA-LTGPGVQSTPLVNLKTSTFYTVNLDSISIGAA----------KTPG 290

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLE----EQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
               G +F D+G   T L +  Y   E     Q  N  ++ P  D   G ++C++T S  
Sbjct: 291 TGRHGIIF-DSGTTLTFLAEPAYTLAEAGLLSQTTNLTRV-PGTD---GYEVCFQT-SGG 344

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
            + P +  HFDGG     + T  +     + V C+ +Q    ++ I GN  Q D  I YD
Sbjct: 345 AVFPSMVLHFDGGDMA--LKTENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQMDYHIRYD 402

Query: 361 FDSQMVSFKPTDC 373
            D  ++SF+PT+C
Sbjct: 403 LDKSVLSFQPTNC 415


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 175/374 (46%), Gaps = 28/374 (7%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSS 74
           V  A  +YV ++ IG PP      ++DTGSDL+W QC  C++  C +Q  P YN ++SS+
Sbjct: 83  VRWATLQYVAEYLIGDPPQ-RAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASST 141

Query: 75  YKELSCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +  + C +  C   D +   C     C+   GY  + +  G L TE   F +       +
Sbjct: 142 FAPVPCAARICAANDDIIHFCDLAAGCSVIAGYG-AGVVAGTLGTEAFAFQSGTA---EL 197

Query: 133 VFGCGHNNTGVFN--ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            FGC      V        GL+GLGR RLSL    +SQ GA KFSYCL P+  ++  T  
Sbjct: 198 AFGCVTFTRIVQGALHGASGLIGLGRGRLSL----VSQTGATKFSYCLTPYFHNNGATGH 253

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVG--NLSNSSKLIPYYNSSGAI 246
           ++ G  + + G G V T+   K  K   +Y++ L G++VG   L   + +      +  +
Sbjct: 254 LFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGL 313

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
             G + ID+G+P T L  D Y+ L  ++    N   + P  D   G+ LC     +  + 
Sbjct: 314 FSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGA-LCVARRDVGRVV 372

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYD 360
           P +  HF GGA + +   S +   PV+           G      + GN+ Q ++ + YD
Sbjct: 373 PAVVFHFRGGADMAVPAESYW--APVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYD 430

Query: 361 FDSQMVSFKPTDCT 374
             +   SF+P DC+
Sbjct: 431 LANGDFSFQPADCS 444


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 120/396 (30%), Positives = 173/396 (43%), Gaps = 52/396 (13%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPAS 71
           V S  ++ +G+Y +   IG PP   +  I DTGSDL+WV+C  C  C +     ++ P  
Sbjct: 73  VVSGAASGSGQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRH 131

Query: 72  SSSYKELSCQSEQCHLLDTVS----CSSQQL---CNYTYGYADSSLTKGVLATERITFGN 124
           SS++    C    C L+        C+  ++   C+Y YGYAD SLT G+ A E  +   
Sbjct: 132 SSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKT 191

Query: 125 SNN---FFDNVVFGCGHNNTG------VFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
           S+       +V FGCG   +G       FN    G++GLGR  +S ASQ+  + G NKFS
Sbjct: 192 SSGKEARLKSVAFGCGFRISGQSVSGTSFN-GANGVMGLGRGPISFASQLGRRFG-NKFS 249

Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSK 235
           YCL+ +      TS +  GNG +       +  L +    T+Y+V L+ + V    N +K
Sbjct: 250 YCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFV----NGAK 305

Query: 236 L-----IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
           L     I   + SG    G   +D+G     L +  Y  +   VR  +KL        G 
Sbjct: 306 LRIDPSIWEIDDSG---NGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGF 362

Query: 291 QLCYKTPSMA---GIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPI 340
            LC     +     I P L   F GGA         F+PPP        E + C A+Q +
Sbjct: 363 DLCVNVSGVTKPEKILPRLKFEFSGGA--------VFVPPPRNYFIETEEQIQCLAIQSV 414

Query: 341 DGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           D  VG  + GN  Q      +D D   + F    C 
Sbjct: 415 DPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 117/370 (31%), Positives = 176/370 (47%), Gaps = 32/370 (8%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           A+  + +   +GTPP      I+D GSDL+W QC       KQ++P+++ A SSS+  L 
Sbjct: 103 AHQGHSLTVGVGTPPQPSKV-ILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLP 161

Query: 80  CQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           C S+ C        +C+ ++ C Y   Y   + T GVLATE  TFG  +    N+ FGCG
Sbjct: 162 CDSKLCEAGTFTNKTCTDRK-CAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCG 219

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
               G   E   G++GL    LS+    L QL   KFSYCL PF      TS + FG  +
Sbjct: 220 KLANGTIAEAS-GILGLSPGPLSM----LKQLAITKFSYCLTPFADRK--TSPVMFGAMA 272

Query: 198 EV----SGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KG 249
           ++    + G V +  L+    +  YY+V + G+SVG     SK +     + AI     G
Sbjct: 273 DLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVG-----SKRLDVPQETLAIKPDGTG 327

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP---SMAGI-API 305
              +D+      L +  +  L++ V   IKL           +C++ P   SM G+  P 
Sbjct: 328 GTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPP 387

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDS 363
           L  HFDG A++ L   + F   P  G+ C A+   P +G   + GN  Q ++ + YD  +
Sbjct: 388 LVLHFDGDAEMSLPRDNYF-QEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGN 446

Query: 364 QMVSFKPTDC 373
           +  S+ PT C
Sbjct: 447 RKFSYAPTKC 456


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 177/373 (47%), Gaps = 33/373 (8%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           N+ T N  Y++   +G+    ++  I+DTGSDL WVQC PC+ CY Q  PI+ P++SSSY
Sbjct: 59  NLQTLN--YIVTMGLGSK---NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSY 113

Query: 76  KELSCQSEQCHLL-----DTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           + +SC S  C  L     +T +C S     CNY   Y D S T G L  E ++FG  +  
Sbjct: 114 QSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVS-- 171

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             + VFGCG NN G+F     GL+GLGR+ LSL SQ  +  G   FSYCL    T++  +
Sbjct: 172 VSDFVFGCGRNNKGLFG-GVSGLMGLGRSYLSLVSQTNATFGG-VFSYCLPT--TEAGSS 227

Query: 189 SKMYFGNGSEV--SGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             +  GN S V  +   +  T ++S  +   +Y + L GI VG ++  + L        +
Sbjct: 228 GSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPL--------S 279

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
              G + ID+G   T LP   Y  L+ +        P          C+       ++ P
Sbjct: 280 FGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIP 339

Query: 305 ILTAHFDGGAKVPLIHTSTF-IPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDF 361
            ++  F+G A++ +  T TF +        C A+  +    D  I GN+ Q +  + YD 
Sbjct: 340 TISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDT 399

Query: 362 DSQMVSFKPTDCT 374
               V F    C+
Sbjct: 400 KQSKVGFAEEPCS 412


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 117/369 (31%), Positives = 174/369 (47%), Gaps = 19/369 (5%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S  + ++  Y++K   GTPP    Y ++DTGS++ W+ C PC  C  + +P + P+ S
Sbjct: 113 LASGQAISSSNYIIKLGFGTPPQ-SFYTVLDTGSNIAWIPCNPCSGCSSKQQP-FEPSKS 170

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           S+Y  L+C S+QC LL   + S   + C+ T  Y D S    +L++E ++ G+     +N
Sbjct: 171 STYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQ--VEN 228

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
            VFGC +   G+       LVG GR  LS  SQ  + L  + FSYCL P    S+ T  +
Sbjct: 229 FVFGCSNAARGLIQRTP-SLVGFGRNPLSFVSQT-ATLYDSTFSYCL-PSLFSSAFTGSL 285

Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
             G    +S  G+  T L+S     ++Y+V L GISVG    S   IP    S   S G 
Sbjct: 286 LLGK-EALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVS---IPAGTLSLDESTGR 341

Query: 251 -MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAH 309
              ID+G   T L +  YN + +  R+ +       P      CY  PS     P++T H
Sbjct: 342 GTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDVEFPLITLH 401

Query: 310 FDGGAKVPLIHTSTFIPPPVEG-VFC--FAMQPIDGD--VGIFGNFAQSDLFIGYDFDSQ 364
           FD    + L   +   P   +G V C  F + P  GD  +  FGN+ Q  L I +D    
Sbjct: 402 FDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAES 461

Query: 365 MVSFKPTDC 373
            +     +C
Sbjct: 462 RLGIASENC 470


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 124/387 (32%), Positives = 190/387 (49%), Gaps = 38/387 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S VS  +GEY M   +GTPP      I+DTGSDL W+QC+PC+ C++Q  P Y+P  S
Sbjct: 186 LESGVSLGSGEYFMDVFVGTPPK-HFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDS 244

Query: 73  SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
           SS++ +SC   +C L+        C ++ Q C Y Y Y D S T G  A E  T      
Sbjct: 245 SSFRNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTP 304

Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
            G S     +NV+FGCGH N G+F+     L+GLG+  LS ASQ+ S  G + FSYCLV 
Sbjct: 305 NGTSELKHVENVMFGCGHWNRGLFHGAAG-LLGLGKGPLSFASQMQSLYGQS-FSYCLVD 362

Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKED---KTYYFVTLEGISVGN-LSNSSK 235
            ++++S++SK+ FG   E +S   +  TS    +D    T+Y+V ++ + V + +    +
Sbjct: 363 RNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPE 422

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
              + +S GA   G   ID+G   T   +  Y  ++E     IK     +     + CY 
Sbjct: 423 ETWHLSSEGA---GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCY- 478

Query: 296 TPSMAGIAPILTAHF------DGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIF 347
             +++GI  +    F      +     P+ +   +I P    V C A+   P    + I 
Sbjct: 479 --NVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPE---VVCLAILGNPRSA-LSII 532

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           GN+ Q +  I YD     + + P  C 
Sbjct: 533 GNYQQQNFHILYDMKKSRLGYAPMKCA 559


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 163/355 (45%), Gaps = 34/355 (9%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD---------TV 91
           IVDT S+L WVQC PC  C+ Q  P+++P+SS SY  + C S  C  L            
Sbjct: 167 IVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226

Query: 92  SCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNEN 147
           +C  Q      C+YT  Y D S ++GVLA +R++   +    D  VFGCG +N G     
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL--AGEVIDGFVFGCGTSNQGPPFGG 284

Query: 148 EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVV 205
             GL+GLGR++LSL SQ + Q G   FSYCL    +DSS    +  G+ S V  +   +V
Sbjct: 285 TSGLMGLGRSQLSLVSQTMDQFGG-VFSYCLPLKESDSS--GSLVIGDDSSVYRNSTPIV 341

Query: 206 STSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK 264
             S+VS      +YFV L GI+VG      + +     S     G   ID+G   T L  
Sbjct: 342 YASMVSDPLQGPFYFVNLTGITVGG-----QEVESSGFSSGGGGGKAIIDSGTVITSLVP 396

Query: 265 DFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILTAHFDGGAKVPLIHTS 322
             YN ++ +  +     P Q P       C+    +  +  P L   FDGG +V +    
Sbjct: 397 SIYNAVKAEFLSQFAEYP-QAPGFSILDTCFNMTGLREVQVPSLKLVFDGGVEVEVDSGG 455

Query: 323 T--FIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              F+      V C AM P+    +  I GN+ Q +L + +D     V F    C
Sbjct: 456 VLYFVSSDSSQV-CLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 125/384 (32%), Positives = 187/384 (48%), Gaps = 32/384 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S VS  +GEY M   +GTPP      I+DTGSDL W+QC+PC+ C++Q  P Y+P  S
Sbjct: 184 LESGVSLGSGEYFMDVFVGTPPK-HFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDS 242

Query: 73  SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
           SS++ +SC   +C L+ +      C ++ Q C Y Y Y D S T G  A E  T      
Sbjct: 243 SSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTP 302

Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
            G S     +NV+FGCGH N G+F+     L+GLG+  LS ASQ+ S  G + FSYCLV 
Sbjct: 303 NGKSELKHVENVMFGCGHWNRGLFHGAAG-LLGLGKGPLSFASQMQSLYGQS-FSYCLVD 360

Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKED---KTYYFVTLEGISVGN-LSNSSK 235
            ++++S++SK+ FG   E +S   +  TS    +D    T+Y+V +  + V + +    +
Sbjct: 361 RNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPE 420

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
              + +S GA   G   ID+G   T   +  Y  ++E     IK     +     + CY 
Sbjct: 421 ETWHLSSEGA---GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYN 477

Query: 296 TPSMAGIA-PILTAHFDGGA--KVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNF 350
              +  +  P     F  GA    P+ +    I P    V C A+   P    + I GN+
Sbjct: 478 VSGIEKMELPDFGILFADGAVWNFPVENYFIQIDP---DVVCLAILGNPRSA-LSIIGNY 533

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q +  I YD     + + P  C 
Sbjct: 534 QQQNFHILYDMKKSRLGYAPMKCA 557


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 121/363 (33%), Positives = 179/363 (49%), Gaps = 34/363 (9%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
            + +  G Y M FSIGTPP  ++  + DTGSDL+W +C  C +C  Q  P Y P  SSS+
Sbjct: 74  QLDSGGGAYDMTFSIGTPPQ-ELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSF 132

Query: 76  KELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSS----LTKGVLATERITFGNSNNFFD 130
            +L C    C  L +  CS+    C+Y Y Y  +S     T+G L +E  T G  ++   
Sbjct: 133 SKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLG--SDAVP 190

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            + FGC    +     +  GLVGLGR  LSL    +SQL    FSYCL    +D++ TS 
Sbjct: 191 GIGFGC-TTMSEGGYGSGSGLVGLGRGPLSL----VSQLNVGAFSYCLT---SDAAKTSP 242

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG+G+ ++G GV ST L+ +    YY V LE IS+G  +          ++G  S G 
Sbjct: 243 LLFGSGA-LTGAGVQSTPLL-RTSTYYYTVNLESISIGAAT----------TAGTGSSGI 290

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
           +F D+G     L +  Y   +E V +          R G ++C++T     + P +  HF
Sbjct: 291 IF-DSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQTS--GAVFPSMVLHF 347

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           DGG     + T  +     + V C+ +Q     + I GN  Q +  I YD +  M+SF+P
Sbjct: 348 DGGDMD--LPTENYFGAVDDSVSCWIVQK-SPSLSIVGNIMQMNYHIRYDVEKSMLSFQP 404

Query: 371 TDC 373
            +C
Sbjct: 405 ANC 407


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 115/381 (30%), Positives = 171/381 (44%), Gaps = 58/381 (15%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
           +S  +G Y +K  +G+PP      I+DTGS L W+QC PCV  C+ QV P++ P++S++Y
Sbjct: 113 LSIGSGNYYLKLGLGSPPKYYTM-ILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTY 171

Query: 76  KELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           + L C S +C LL   +     C++  +C YT  Y D+S + G L+ + +T   S     
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQT-LP 230

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           +  +GCG +N G+F +   G+VGL R +LS+ +Q+  + G   FSYCL P  T       
Sbjct: 231 SFTYGCGQDNEGLFGK-AAGIVGLARDKLSMLAQLSPKYG-YAFSYCL-PTSTS------ 281

Query: 191 MYFGNGSEVSGGGVVSTSLVS------------KEDKTYYFVTLEGISVGNLSNSSKLIP 238
                    SGGG +S   +S             ++ + YF+ L  I+V           
Sbjct: 282 ---------SGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAG--------- 323

Query: 239 YYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCY 294
                G  + G      ID+G   T LP   Y  L E     +     Q P       C+
Sbjct: 324 --RPVGVAAAGYQVPTIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCF 381

Query: 295 K--TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           K    SM+G AP +   F GGA + L   +  I    +G+ C A       + I GN  Q
Sbjct: 382 KGSLKSMSG-APEIRMIFQGGADLSLRAPNILIEAD-KGIACLAFAS-SNQIAIIGNHQQ 438

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
               I YD  +  + F P  C
Sbjct: 439 QTYNIAYDVSASKIGFAPGGC 459


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 120/396 (30%), Positives = 172/396 (43%), Gaps = 52/396 (13%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPAS 71
           V S  S+ +G+Y +   IG PP   +  I DTGSDL+WV+C  C  C +     ++ P  
Sbjct: 72  VVSGASSGSGQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRH 130

Query: 72  SSSYKELSCQSEQCHLL----DTVSCSSQQL---CNYTYGYADSSLTKGVLATERITFGN 124
           SS++    C    C L+        C+  ++   C Y YGYAD SLT G+ A E  +   
Sbjct: 131 SSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKT 190

Query: 125 SNNF---FDNVVFGCGHNNTG------VFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
           S+       +V FGCG   +G       FN    G++GLGR  +S ASQ+  + G NKFS
Sbjct: 191 SSGKEAKLKSVAFGCGFRISGQSVSGTSFN-GANGVMGLGRGPISFASQLGRRFG-NKFS 248

Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSK 235
           YCL+ +      TS +  G+G +       +  L +    T+Y+V L+ + V    N +K
Sbjct: 249 YCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFV----NGAK 304

Query: 236 L-----IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
           L     I   + SG    G   +D+G     L    Y  +   V+  IKL    +   G 
Sbjct: 305 LRIDPSIWEIDDSG---NGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGF 361

Query: 291 QLCYKTPSMA---GIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPI 340
            LC     +     I P L   F GGA         F+PPP        E + C A+Q +
Sbjct: 362 DLCVNVSGVTKPEKILPRLKFEFSGGA--------VFVPPPRNYFIETEEQIQCLAIQSV 413

Query: 341 DGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           D  VG  + GN  Q      +D D   + F    C 
Sbjct: 414 DPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 178/384 (46%), Gaps = 35/384 (9%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSS 74
           V  A  +Y+ ++ IG PP      I+DTGS+L+W QC  C    C+ Q    Y+P+ S +
Sbjct: 64  VHWAESQYIAEYLIGDPPQ-QAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRT 122

Query: 75  YKELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDNV 132
            + ++C    C L     C+   + C     Y  + +  GVL TE  TF   S N   ++
Sbjct: 123 ARPVACNDTACALGSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQPQSENV--SL 179

Query: 133 VFGC--GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            FGC      T    +   G++GLGR  LSL    +SQLG NKFSYCL P+ + S+ TS+
Sbjct: 180 AFGCIAATRLTPGSLDGASGIIGLGRGNLSL----VSQLGDNKFSYCLTPYFSQSTNTSR 235

Query: 191 MYFGNGSEVSGGGVVSTSL--VSKED----KTYYFVTLEGISVGN--LSNSSKLIPYYNS 242
           ++ G  + +S GG  +TS+  +   D     T+Y++ L GI+VG+  L+           
Sbjct: 236 LFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQV 295

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDPRLGSQLCYKTP--S 298
           +  +  G + ID+G+P T L    Y  L +++   +   + P      G  LC       
Sbjct: 296 ATGLWAGTL-IDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGD 354

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--------DVGIFGNF 350
           +  + P L  HF  G     +    +  P  +   C  +    G        +  I GN+
Sbjct: 355 VGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNY 414

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q D+ + YD +  M+SF+P DC+
Sbjct: 415 MQQDMHLLYDLEKGMLSFQPADCS 438


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 120/381 (31%), Positives = 177/381 (46%), Gaps = 43/381 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV+   IGTPP  +   + DTGSDL WVQCLPC    CY Q +P+++P+ SS+Y ++ C
Sbjct: 121 EYVVTIGIGTPPR-NFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPC 179

Query: 81  QSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FDNVVFG 135
            + +CH+  +    C +   C Y+  Y D S T G LA E  T    +        VVFG
Sbjct: 180 SAPECHIGGVQQTRCGATS-CEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFG 238

Query: 136 CGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQL--GANKFSYCLVPFHTDSSITSK 190
           C H    VFN+  M   GL+GLGR   S+ SQ    +  G   FSYCL P     S T  
Sbjct: 239 CSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPP---RGSSTGY 295

Query: 191 MYFGNGSEVSG---GGVVSTSLVS--KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           +  G G+         +  T L++   + ++ Y V L G+SV   + ++  IP    + A
Sbjct: 296 LTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSV---NGAAVDIP----ASA 348

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGI 302
            S G + ID+G   T +P   Y  L ++ R    + K+ P    +L    CY       +
Sbjct: 349 FSLGAV-IDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKL-LDTCYDVTGQDVV 406

Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEG-------VFCFAMQPID-GDVGIFGNFAQS 353
            AP +   F GGA++ +  +   +  P E        + C A  P +   + I GN  Q 
Sbjct: 407 TAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQR 466

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
              + +D D   + F P  C+
Sbjct: 467 AYNVVFDVDGGRIGFGPNGCS 487


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 175/383 (45%), Gaps = 38/383 (9%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           A  EY +   +GTP  +++  I+DTGSD+ W+QC+PC  C   ++P +NP  SSS+ +L 
Sbjct: 134 AGLEYYVPLQLGTP-AVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLP 192

Query: 80  CQSEQC-HLLDTVS--CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD----- 130
           C S  C ++   V   CS S + C ++  Y D SL+ G+LA E I  GN+ NF D     
Sbjct: 193 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVK 251

Query: 131 --NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N+  GC   +         GL+G+ R  +S  SQ+ S+  A KFS+C        + +
Sbjct: 252 LSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRY-ARKFSHCFPDKIAHLNSS 310

Query: 189 SKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
             ++FG    +S       +V    V      YY+V L GISV    + S+L P  + + 
Sbjct: 311 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISV----DESRL-PLSHKNF 365

Query: 245 AISK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-- 298
            I K    G   ID+G   T L K  +  +  +           D   G   CY   S  
Sbjct: 366 DIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGT 425

Query: 299 ---MAGIAPILTAHFDGGAKVPLIHTSTFIP---PPVEGVFCFAMQPIDGDV--GIFGNF 350
               + I P +T HF GG  V L   S  IP      +   C A Q + GD+   I GN+
Sbjct: 426 AALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNY 484

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            Q +L++ YD +   +   P  C
Sbjct: 485 QQQNLWVEYDLEKLRLGIAPAQC 507


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 167/382 (43%), Gaps = 32/382 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPA 70
            +  +S   G YV+   +GTP   D+  + DTGSDL WVQC PC    CY Q  P++ P+
Sbjct: 74  AERGISVGTGNYVVSVGLGTP-ARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPS 132

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSS---QQLCNYTYGYADSSLTKGVLATERITFG---- 123
           SSS++  + C   +C      SCSS      C Y   Y D S T G L  + +T G    
Sbjct: 133 SSSTFSAVRCGEPECPRARQ-SCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPS 191

Query: 124 -----NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
                N++N     VFGCG NNTG+F + + GL GLGR ++SL+SQ   + G   FSYCL
Sbjct: 192 TNASENNSNKLPGFVFGCGENNTGLFGKAD-GLFGLGRGKVSLSSQAAGKYG-EGFSYCL 249

Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
               + S+    +  G  +        +  L      ++Y+V L GI V   +      P
Sbjct: 250 P--SSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRP 307

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGS-QLCYKT 296
                 A+    + +D+G   T L    Y+ L     +A+    Y+  PRL     CY  
Sbjct: 308 ------ALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDF 361

Query: 297 PSMAGIA---PILTAHFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
            + A      P +   F GGA + +  +   ++    +    FA        GI GN  Q
Sbjct: 362 TAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQ 421

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
             + + YD   Q + F    C+
Sbjct: 422 RTVAVVYDVGRQKIGFAAKGCS 443


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 117/365 (32%), Positives = 172/365 (47%), Gaps = 22/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           S  +   G YV+   +GTP  +  Y +V DTGSD  WVQC PCV  CY+Q + +++PA S
Sbjct: 171 SGRALGTGNYVVTVGLGTP--VSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARS 228

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC +  C  L+   CS    C Y   Y D S + G  A + +T  +S +     
Sbjct: 229 STYANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 286

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G+F E   GL+GLGR + SL  Q   + G   F++CL      S+ T  + 
Sbjct: 287 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLD 341

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG GS  +    ++T +++    T+Y+V + GI VG      +L+    S    +     
Sbjct: 342 FGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGG-----QLLSIPQS--VFATAGTI 394

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
           +D+G   T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  
Sbjct: 395 VDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 454

Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F GGA++ +  +          V   FA     GDVGI GN       + YD   ++V F
Sbjct: 455 FQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514

Query: 369 KPTDC 373
            P  C
Sbjct: 515 YPGAC 519


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 159/349 (45%), Gaps = 24/349 (6%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
           +G Y +   +GTP   D+  I DTGSDL W QC PC + CYKQ   I++P+ S+SY  ++
Sbjct: 143 SGNYFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNIT 201

Query: 80  CQSEQCHLLDTVS-----CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           C S  C  L T +     CS S + C Y   Y DSS + G  + ER+T   + +  DN +
Sbjct: 202 CTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTV-TATDVVDNFL 260

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG NN G+F     GL+GLGR  +S   Q  ++     FSYCL    + SS T  + F
Sbjct: 261 FGCGQNNQGLFG-GSAGLIGLGRHPISFVQQTAAKY-RKIFSYCL---PSTSSSTGHLSF 315

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G  +          S +S+   ++Y + +  I+VG +      +P   SS   S G   I
Sbjct: 316 GPAATGRYLKYTPFSTISR-GSSFYGLDITAIAVGGVK-----LPV--SSSTFSTGGAII 367

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
           D+G   T LP   Y  L    R  +   P          CY        + P +   F G
Sbjct: 368 DSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEFSFAG 427

Query: 313 GAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
           G  V L      F+    +    FA    D DV I+GN  Q  + + YD
Sbjct: 428 GVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYD 476


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/370 (30%), Positives = 166/370 (44%), Gaps = 34/370 (9%)

Query: 24  YVMKFSIGT----PPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           YV   ++G      P  ++  IVDTGSDL WVQC PC  CY Q  P+++PA S++Y  + 
Sbjct: 185 YVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVR 244

Query: 80  CQSEQCHL-LDTV-----SC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           C +  C   L        SC    + C Y   Y D S ++GVLAT+ +  G ++   D  
Sbjct: 245 CNASACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS--LDGF 302

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           VFGCG +N G+F     GL+GLGRT LSL SQ   + G   FSYCL P  T    +  + 
Sbjct: 303 VFGCGLSNRGLFG-GTAGLMGLGRTELSLVSQTALRYG-GVFSYCL-PATTSGDASGSLS 359

Query: 193 FGN--GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
            G    S  +   V  T +++   +  +YF+ + G +VG  + +++          +   
Sbjct: 360 LGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ---------GLGAS 410

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
           N+ ID+G   T L    Y  +  +         Y      S L  CY       +  P+L
Sbjct: 411 NVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLL 470

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDS 363
           T   +GGA+V +           +G   C AM  +  +    I GN+ Q +  + YD   
Sbjct: 471 TLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVG 530

Query: 364 QMVSFKPTDC 373
             + F   DC
Sbjct: 531 SRLGFADEDC 540


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 170/363 (46%), Gaps = 32/363 (8%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP-CVQCYKQVKPIYNPASSSSYKELSCQS 82
           YV+  +IGTPP   +  I+D G +L+W QC   C +C+KQ  P+++  +SS+++   C +
Sbjct: 51  YVVNLTIGTPPQ-PVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGA 109

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
             C  + T SC+        Y  + S   T G + T+ +  G +      + FGC   + 
Sbjct: 110 AVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAAT--ARLAFGCAVASE 167

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G VGLGRT LSLA+Q    + A  FSYCL P   D+  +S ++ G  ++++G
Sbjct: 168 MDTMWGSSGSVGLGRTNLSLAAQ----MNATAFSYCLAP--PDTGKSSALFLGASAKLAG 221

Query: 202 GG--VVSTSLVSKEDKTY------YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            G    +T  V      +      Y + LE I  GN   ++  +P        S   + +
Sbjct: 222 AGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGN---ATIAMPQ-------SGNTIMV 271

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
            T  P T L    Y  L + V +A+   P   P     LC+   S +G AP L   F GG
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGG 331

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           A++  +  S+++        C A+   P  G V I G+  Q ++ + +D D + +SF+P 
Sbjct: 332 AEM-TVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPA 390

Query: 372 DCT 374
           DC+
Sbjct: 391 DCS 393


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 173/365 (47%), Gaps = 36/365 (9%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV  F+IGTPP      ++D   +L+W QC  C +C++Q  P+++P +S++Y+   C + 
Sbjct: 51  YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 84  QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
            C  +  D+ +CS   +C Y     ++  T G + T+    G +     ++ FGC   + 
Sbjct: 110 LCESIPSDSRNCSG-NVCAY-QASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G+VGLGRT  SL    ++Q G   FSYCL P   D+   S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGKNSALFLGSSAKLAG 218

Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           GG   ST  V+      +   YY V LEG+  G+      +IP   S   +      +DT
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGD-----AMIPLPPSGSTV-----LLDT 268

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
            +P + L    Y  +++ V  A+   P   P     LC+     +G AP L   F GGA 
Sbjct: 269 FSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAA 328

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           +  +  S ++     G  C AM          ++ + G+  Q ++   +D D + +SF+P
Sbjct: 329 M-TVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387

Query: 371 TDCTK 375
            DCTK
Sbjct: 388 ADCTK 392


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 109/363 (30%), Positives = 178/363 (49%), Gaps = 28/363 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           +++  SIG+PP+  +  +VDTGS L+WVQCLPC+ C++Q    ++P  S S+K L C   
Sbjct: 104 FLVNLSIGSPPVTQLV-VVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFP 162

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
             + ++   C+      Y   Y     ++G+LA E + F   +       N+ FGCGH N
Sbjct: 163 GYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMN 222

Query: 141 TGVFNENEM-GLVGLGR-TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
               N++   G+ GLG    +++A+Q+      NKFSYC+   +      + +  G GS 
Sbjct: 223 IKTNNDDAYNGVFGLGAYPHITMATQL-----GNKFSYCIGDINNPLYTHNHLVLGQGSY 277

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           + G    ST L  +    +Y+VTL+ ISVG  S + K+ P      +   G + ID+G  
Sbjct: 278 IEGD---STPL--QIHFGHYYVTLQSISVG--SKTLKIDPNAFKISSDGSGGVLIDSGMT 330

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYK---TPSMAGIAPILTAHFDGG 313
            T L    +  L +++ + +K    + P  R    LC+K   +  + G  P +T HF GG
Sbjct: 331 YTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGF-PAVTFHFAGG 389

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           A + L   S F     +  FC A+ P + +   + + G  AQ +  +G+D +   V F+ 
Sbjct: 390 ADLVLESGSLFRQHGGD-RFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRR 448

Query: 371 TDC 373
            DC
Sbjct: 449 IDC 451


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 166/367 (45%), Gaps = 34/367 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV+   IGTP +  I  ++DTGSDL WVQC PC   +CY Q  P+++P+SSSSY  + C
Sbjct: 117 EYVVTLGIGTPAVQQIV-LIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPC 175

Query: 81  QSEQCHLLDTVS----CSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S+ C  L   +    C+S    LC Y   Y + + T GV +TE +T         +  F
Sbjct: 176 DSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTL-KPGVVVADFGF 234

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCG +  G + + + GL+GLG    SL SQ  SQ G   FSYCL P    +     +  G
Sbjct: 235 GCGDHQHGPYEKFD-GLLGLGGAPESLVSQTSSQFG-GPFSYCLPPTSGGAGF---LALG 289

Query: 195 NGSEVSGGGVVSTSLVSKEDK-----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
             +  S     +  L +   +     T+Y VTL GISVG         P      A S G
Sbjct: 290 APNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGG-------APLAVPPSAFSSG 342

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
            M ID+G   T LP   Y  L    R+A+       P  G+ L  CY       +  P +
Sbjct: 343 -MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTI 401

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
              F GGA + L   +  +   V+G   FA    D  +GI GN  Q    + YD     V
Sbjct: 402 ALTFSGGATIDLATPAGVL---VDGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTV 458

Query: 367 SFKPTDC 373
            F+   C
Sbjct: 459 GFRAGAC 465


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 162/370 (43%), Gaps = 19/370 (5%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           VS  +GEY+++  IG+PPL + + + DTGSD++WVQC PC  CY Q  P+++PA+S+S+ 
Sbjct: 116 VSHGSGEYLVRVGIGSPPL-EQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFS 174

Query: 77  ELSCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            + C S  C        +        C Y   Y D S T GVLA E +T  +       V
Sbjct: 175 PVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL-DGGTEVQGV 233

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F E   GL+GLG   +SL  Q L       FSYCL  +++     S   
Sbjct: 234 AMGCGHENRGLFAE-AAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLAGYYSGEGSGSGSL 291

Query: 193 FGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
                + +  G V   LV   D  ++Y+V + G+ V       +L       G    G +
Sbjct: 292 VLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAG--ERLQLQDGLFDLGDDGGGGV 349

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILTAH 309
            +DTG   T LP + Y  L      A +    + P +     CY     A +  P +  +
Sbjct: 350 VMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVALY 409

Query: 310 FDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           F G       A + L   +  +P    G +C A   +     I GN  Q  + I  D  S
Sbjct: 410 FGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSAS 469

Query: 364 QMVSFKPTDC 373
             V F P  C
Sbjct: 470 GYVGFGPATC 479


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 119/362 (32%), Positives = 171/362 (47%), Gaps = 33/362 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV+    GTP +  +  ++DTGSD+ WVQC PC   +CY Q  P+++P+ SS+Y  ++C
Sbjct: 130 EYVVTLGFGTPSVPQVL-LMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIAC 188

Query: 81  QSEQCHLLDTV---SCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            ++ C  L       C+S    C Y+  YAD S ++GV + E +T        D   FGC
Sbjct: 189 NTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLAPGITVED-FHFGC 247

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G +  G  ++ + GL+GLG   +SL  Q  S  G   FSYCL   ++++     +  G+ 
Sbjct: 248 GRDQRGPSDKYD-GLLGLGGAPVSLVVQTSSVYGG-AFSYCLPALNSEAGF---LVLGSP 302

Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
              +    V T +       T+Y VT+ GISVG         P +    A  +G M ID+
Sbjct: 303 PSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGK-------PLHIPQSAF-RGGMIIDS 354

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGA 314
           G   T LP+  YN LE  +R A+K  P   P      CY     + I  P +   F GGA
Sbjct: 355 GTVDTELPETAYNALEAALRKALKAYPLV-PSDDFDTCYNFTGYSNITVPRVAFTFSGGA 413

Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQ---PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
            + L      +P  +    C A Q   P DG +GI GN  Q  L + YD     V F+  
Sbjct: 414 TIDLD-----VPNGILVNDCLAFQESGPDDG-LGIIGNVNQRTLEVLYDAGRGNVGFRAG 467

Query: 372 DC 373
            C
Sbjct: 468 AC 469


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 154/356 (43%), Gaps = 24/356 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+   IG+P +     + DTGSD+ WVQC PC QC+ +V  +++P++SS+Y   SC S
Sbjct: 130 EYVITVGIGSPAVTQTMSM-DTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSS 188

Query: 83  EQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
             C  L        CSS Q C Y   Y D S T G  +++ +T G  +N      FGC  
Sbjct: 189 AACVQLSQSQQGNGCSSSQ-CQYIVSYVDGSSTTGTYSSDTLTLG--SNAIKGFQFGCSQ 245

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
           + +G F++   GL+GLG    SL SQ     G   FSYCL P     +  S  +   G+ 
Sbjct: 246 SESGGFSDQTDGLMGLGGDAQSLVSQTAGTFG-KAFSYCLPP-----TPGSSGFLTLGAA 299

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
              G V +  L S +  TYY V LE I VG            N   ++      +D+G  
Sbjct: 300 SRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQ--------LNIPTSVFSAGSVMDSGTV 351

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
            T LP   Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V 
Sbjct: 352 ITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVN 411

Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           L      +         FA    D  +G  GN  Q    + YD     V F+   C
Sbjct: 412 LDFNGIMLELD-NWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/362 (29%), Positives = 167/362 (46%), Gaps = 26/362 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y++   +G+    ++  IVDTGSDL WVQC PC  CY Q  P++ P++S SY+ + C S 
Sbjct: 122 YIVTMGLGSQ---NMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNST 178

Query: 84  QCHLLDTVSC----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            C  L+  +C    S+   C+Y   Y D S T G L  E++ FG  +    N VFGCG N
Sbjct: 179 TCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGIS--VSNFVFGCGRN 236

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N G+F     GL+GLGR+ LS+ SQ  +  G   FSYCL P    +  +  +  GN S V
Sbjct: 237 NKGLFG-GASGLMGLGRSELSMISQTNATFGG-VFSYCL-PSTDQAGASGSLVMGNQSGV 293

Query: 200 SGGG---VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
                    +  L + +   +Y + L GI VG +S        +  + +   G + +D+G
Sbjct: 294 FKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVS-------LHVQASSFGNGGVILDSG 346

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
              + L    Y  L+ +        P          C+       +  P ++ +F+G A+
Sbjct: 347 TVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAE 406

Query: 316 VPLIHTSTF-IPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           + +  T  F +        C A+  +  + ++GI GN+ Q +  + YD     V F    
Sbjct: 407 LNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEP 466

Query: 373 CT 374
           CT
Sbjct: 467 CT 468


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 123/369 (33%), Positives = 173/369 (46%), Gaps = 33/369 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
            +S ++  +G Y++   IGTP   D+  + DTGSDL W QC PC+  CY Q +P +NP+S
Sbjct: 121 AKSGITLGSGNYIVTIGIGTPKH-DLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSS 179

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SS+Y+ +SC S  C   D  SCS+   C Y+  Y D S T+G LA E+ T  NS +  ++
Sbjct: 180 SSTYQNVSCSSPMCE--DAESCSASN-CVYSIVYGDKSFTQGFLAKEKFTLTNS-DVLED 235

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGCG NN G+F+     L          A    +    N FSYCL  F ++S  T  +
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTY--NNIFSYCLPSFTSNS--TGHL 291

Query: 192 YFGNG--SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISK 248
            FG+   SE     V  T + S      Y + + GISVG+      + P  +++ GAI  
Sbjct: 292 TFGSAGISE----SVKFTPISSFPSAFNYGIDIIGISVGD--KELAITPNSFSTEGAI-- 343

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-P 304
               ID+G   T LP   Y  L    +   K++ Y+    G  L   CY    +  +  P
Sbjct: 344 ----IDSGTVFTRLPTKVYAELRSVFKE--KMSSYKSTS-GYGLFDTCYDFTGLDTVTYP 396

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +   F G   V L  +   +P  +  V C A    D    IFGN  Q+ L + YD    
Sbjct: 397 TIAFSFAGSTVVELDGSGISLPIKISQV-CLAFAGNDDLPAIFGNVQQTTLDVVYDVAGG 455

Query: 365 MVSFKPTDC 373
            V F P  C
Sbjct: 456 RVGFAPNGC 464


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 173/365 (47%), Gaps = 36/365 (9%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV  F+IGTPP      ++D   +L+W QC  C +C++Q  P+++P +S++Y+   C + 
Sbjct: 51  YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 84  QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
            C  +  D+ +CS   +C Y     ++  T G + T+    G +     ++ FGC   + 
Sbjct: 110 LCESIPSDSRNCSG-NVCAY-QASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G+VGLGRT  SL    ++Q G   FSYCL P   D+   S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGRNSALFLGSSAKLAG 218

Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           GG   ST  V+      +   YY V LEG+  G+      +IP   S   +      +DT
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGD-----AMIPLPPSGSTV-----LLDT 268

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
            +P + L    Y  +++ V  A+   P   P     LC+     +G AP L   F GGA 
Sbjct: 269 FSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAA 328

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           +  +  + ++     G  C AM          ++ + G+  Q ++   +D D + +SF+P
Sbjct: 329 M-TVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387

Query: 371 TDCTK 375
            DCTK
Sbjct: 388 ADCTK 392


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 126/378 (33%), Positives = 171/378 (45%), Gaps = 38/378 (10%)

Query: 11  NVVQSNVSTAN--GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIY 67
           N +++ V T +  G Y +   +GTP   D   + DTGSDL W QC PC   C+ Q    +
Sbjct: 117 NEMKTRVPTTHFGGGYAVTVGLGTPKK-DFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKF 175

Query: 68  NPASSSSYKELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
           +P  S+SYK LSC SE C  +   S   CSS   C Y   Y  +  T G LATE +T   
Sbjct: 176 DPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYG-TGYTVGFLATETLTITP 234

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
           S + F+N V GCG  N G F+    GL+GLGR+ ++L SQ  S    N FSYCL      
Sbjct: 235 S-DVFENFVIGCGERNGGRFS-GTAGLLGLGRSPVALPSQTSSTY-KNLFSYCL---PAS 288

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YY 240
           SS T  + FG G   +      TS + +     Y + + GISVG      + +P     +
Sbjct: 289 SSSTGHLSFGGGVSQAAKFTPITSKIPE----LYGLDVSGISVGG-----RKLPIDPSVF 339

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
            ++G I      ID+G   T LP   ++ L    +  +          G Q CY     A
Sbjct: 340 RTAGTI------IDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHA 393

Query: 301 G---IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDL 355
                 P ++  F+GG +V +  +  FI        C A +    D DV IFGN  Q   
Sbjct: 394 NDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTY 453

Query: 356 FIGYDFDSQMVSFKPTDC 373
            + YD    MV F P  C
Sbjct: 454 EVVYDVAKGMVGFAPGGC 471


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 118/209 (56%), Gaps = 14/209 (6%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
           +G P  L +YGI DTGS+L+W+QCLPC  CY Q  PI++PA S +Y+ +S  S  C+ + 
Sbjct: 63  LGVPSTL-VYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVR 121

Query: 90  TVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV---VFGCGHNNTGVFN 145
            +SC    + C Y + Y D + TKG L+T+   F +       V    FGC H+      
Sbjct: 122 RISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLK 181

Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
            ++ G+VGL R   SL    +SQL   KFSYC+V    D    S+MYFG+ + + GG   
Sbjct: 182 GHQAGVVGLNRHPNSL----VSQLKVKKFSYCMV-IPDDHGSGSRMYFGSRAVILGG--- 233

Query: 206 STSLVSKEDKTYYFVTLEGISVGNLSNSS 234
            T L+ K D ++YFVTL+GISVG     S
Sbjct: 234 KTPLL-KGDYSHYFVTLKGISVGEEKGRS 261



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 5/112 (4%)

Query: 57  VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYAD-SSLTKGV 114
            QC+ Q  PI++P+ SS+Y  +   +  C+     +C   ++ C Y   Y   S+ T+G 
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391

Query: 115 LATERITF-GNSNNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLAS 163
           ++ +   F  N  N  D  ++VFGC    TG F   E+G+VGL +  LSL S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 177/373 (47%), Gaps = 36/373 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
           + +++    G YV+   +GTP   D     DTGSDL W QC PC+  C+ Q +P ++P +
Sbjct: 129 IPASIVPTGGAYVVTVGLGTPKK-DFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTT 187

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNN 127
           S+SYK +SC SE C L+   +  +Q      C Y   Y  S  T G LATE +    S++
Sbjct: 188 STSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYG-SGYTIGFLATETLAIA-SSD 245

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F N +FGC   + G FN    GL+GLGR+ ++L SQ  ++   N FSYCL      +S 
Sbjct: 246 VFKNFLFGCSEESRGTFN-GTTGLLGLGRSPIALPSQTTNKY-KNLFSYCL-----PASP 298

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           +S  +   G EVS      ++ +S + K  Y +   GISV       + +P    +G+IS
Sbjct: 299 SSTGHLSFGVEVSQAA--KSTPISPKLKQLYGLNTVGISV-----RGRELPI---NGSIS 348

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG---IAP 304
           +    ID+G   T LP   Y+ L    R  +      +     Q CY   ++       P
Sbjct: 349 R--TIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIP 406

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
            ++  F+GG +V +  +   I  PV G+      FA    D D  IFGN+ Q    + YD
Sbjct: 407 GISIFFEGGVEVEIDVSGIMI--PVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYD 464

Query: 361 FDSQMVSFKPTDC 373
               MV F P  C
Sbjct: 465 VAKGMVGFAPKGC 477


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 119/362 (32%), Positives = 153/362 (42%), Gaps = 38/362 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           +YV+  S+GTP +      VDTGSD+ WVQC PC    C  Q   +++PA SS+Y  + C
Sbjct: 142 QYVVTVSLGTPGVSQTV-EVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200

Query: 81  QSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            ++ C  L      CS  Q C Y   Y D S T GV  ++ +     N      +FGCGH
Sbjct: 201 GADACSELRIYEAGCSGSQ-CGYVVSYGDGSNTTGVYGSDTLALAPGNT-VGTFLFGCGH 258

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G+F   + GL+ LGR  +SL SQ     G   FSYCL      S  ++  Y   G  
Sbjct: 259 AQAGMFAGID-GLLALGRQSMSLKSQAAGAYG-GVFSYCL-----PSKQSAAGYLTLGGP 311

Query: 199 VSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
            S  G  +T L++     T+Y V L GISVG    +     +         G   +DTG 
Sbjct: 312 TSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF--------AGGTVVDTGT 363

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA--PILTAHFD 311
             T LP   Y  L    R AI   PY  P   +      CY   S  G+   P +   F 
Sbjct: 364 VITRLPPTAYAALRSAFRGAIA--PYGYPSAPANGILDTCYDF-SRYGVVTLPTVALTFS 420

Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           GGA + L            G   FA    DGD  I GN  Q    +   FD   V F P 
Sbjct: 421 GGATLALEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRSFAV--RFDGSTVGFMPG 474

Query: 372 DC 373
            C
Sbjct: 475 AC 476


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 171/363 (47%), Gaps = 32/363 (8%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP-CVQCYKQVKPIYNPASSSSYKELSCQS 82
           YV+  +IGTPP   +  I+D G +L+W QC   C +C+KQ  P+++  +SS+++   C +
Sbjct: 51  YVVNLTIGTPPQ-PVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGA 109

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
             C  + T SC+        Y  + S   T G + T+ +  G +      + FGC   + 
Sbjct: 110 AVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAAT--ARLAFGCAVASE 167

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G VGLGRT LSLA+Q    + A  FSYCL P   D+  +S ++ G  ++++G
Sbjct: 168 MDTMWGSSGSVGLGRTNLSLAAQ----MNATAFSYCLAP--PDTGKSSALFLGASAKLAG 221

Query: 202 GG--------VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            G        V +++  +      Y + LE I  GN   ++  +P        S   + +
Sbjct: 222 AGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGN---ATIAMPQ-------SGNTITV 271

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
            T  P T L    Y  L + V +A+   P   P     LC+   S +G AP L   F GG
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGG 331

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           A++  +  S+++        C A+   P  G V I G+  Q ++ + +D D + +SF+P 
Sbjct: 332 AEM-TVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPA 390

Query: 372 DCT 374
           DC+
Sbjct: 391 DCS 393


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 115/349 (32%), Positives = 160/349 (45%), Gaps = 25/349 (7%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
           +G Y +   +GTP   D+  I DTGSDL W QC PC + CYKQ   I++P+ S+SY  ++
Sbjct: 142 SGNYFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNIT 200

Query: 80  CQSEQCHLLDTVS-----CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           C S  C  L T +     CS S + C Y   Y DSS + G  + ER++   + +  DN +
Sbjct: 201 CTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSV-TATDIVDNFL 259

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG NN G+F     GL+GLGR  +S   Q  + +    FSYCL      SS T ++ F
Sbjct: 260 FGCGQNNQGLFG-GSAGLIGLGRHPISFVQQT-AAVYRKIFSYCL---PATSSSTGRLSF 314

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G  +  S       S +S+   ++Y + + GISVG        +P   SS   S G   I
Sbjct: 315 GT-TTTSYVKYTPFSTISR-GSSFYGLDITGISVGGAK-----LPV--SSSTFSTGGAII 365

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
           D+G   T LP   Y  L    R  +   P          CY        + P +   F G
Sbjct: 366 DSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAG 425

Query: 313 GAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
           G  V L      ++    +    FA    D DV I+GN  Q  + + YD
Sbjct: 426 GVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYD 474


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 116/366 (31%), Positives = 166/366 (45%), Gaps = 27/366 (7%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSS 73
            V+   G YV+   +GTP   + + +V DTGSD  WVQC PCV  CY+Q +P+++P  S+
Sbjct: 88  GVALGTGNYVVPVRLGTP--AERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSA 145

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           +Y  +SC S  C  L    CS    C Y   Y D S T G  A + +T   + +   N  
Sbjct: 146 TYANISCSSSYCSDLYVSGCSGGH-CLYGIQYGDGSYTIGFYAQDTLTL--AYDTIKNFR 202

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  N G+F     GL+GLGR + SL  Q   + G   F+YCL      S+ T  +  
Sbjct: 203 FGCGEKNRGLFGR-AAGLLGLGRGKTSLPVQAYDKYG-GVFAYCL---PATSAGTGFLDL 257

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G G+  +   +  T ++     T+Y+V + GI VG       ++P   S    S     +
Sbjct: 258 GPGAPAANARL--TPMLVDRGPTFYYVGMTGIKVGG-----HVLPIPGS--VFSTAGTLV 308

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAG--IA-PILTA 308
           D+G   T LP   Y  L      A++   Y      S L  CY      G  IA P ++ 
Sbjct: 309 DSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSL 368

Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GGA + +  +   ++    +    FA    D DV I GN  Q    + YD   ++V 
Sbjct: 369 VFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVG 428

Query: 368 FKPTDC 373
           F P  C
Sbjct: 429 FAPGAC 434


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 175/372 (47%), Gaps = 33/372 (8%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           N+ T N  Y++   +G+    ++  I+DTGSDL WVQC PC+ CY Q  PI+ P++SSSY
Sbjct: 59  NLQTLN--YIVTMGLGST---NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSY 113

Query: 76  KELSCQSEQCHLL-----DTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           + +SC S  C  L     +T +C S    CNY   Y D S T G L  E+++FG  +   
Sbjct: 114 QSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVS--V 171

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            + VFGCG NN G+F     GL+GLGR+ LSL SQ  +  G   FSYCL    T+S  + 
Sbjct: 172 SDFVFGCGRNNKGLFG-GVSGLMGLGRSYLSLVSQTNATFGG-VFSYCLPT--TESGASG 227

Query: 190 KMYFGNGSEVSGGGVVST---SLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
            +  GN S V       T    L + +   +Y + L GI V  ++     +P + + G +
Sbjct: 228 SLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQ---VPSFGNGGVL 284

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PI 305
                 ID+G   T LP   Y  L+          P          C+       ++ P 
Sbjct: 285 ------IDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPT 338

Query: 306 LTAHFDGGAKVPLIHTSTF-IPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFD 362
           ++ HF+G A++ +  T TF +        C A+  +    D  I GN+ Q +  + YD  
Sbjct: 339 ISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTK 398

Query: 363 SQMVSFKPTDCT 374
              V F    C+
Sbjct: 399 QSKVGFAEESCS 410


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 174/377 (46%), Gaps = 31/377 (8%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  +  +G+Y + FS+GTP     + IVDTGSDL +VQC PC  CY+Q  P+Y P++SS+
Sbjct: 25  SGTTLGSGQYFVDFSLGTPEQ-KFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSST 83

Query: 75  YKELSCQSEQCHLLDT---VSCSS-------QQLCNYTYGYADSSLTKGVLATERITFGN 124
           +  + C S +C L+       CSS       Q  C+Y Y Y D+S T GV A E  T G 
Sbjct: 84  FTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGG 143

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
                ++V FGCG+ N G F  +  G++GLG+  LS  SQ       NKF+YCL  + + 
Sbjct: 144 IR--VNHVAFGCGNRNQGSF-VSAGGVLGLGQGALSFTSQAGYAF-ENKFAYCLTSYLSP 199

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
           +S+ S + FG+    +   +  T LVS   + + Y+V +  I  G     + LIP  +S+
Sbjct: 200 TSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFG---GETLLIP--DSA 254

Query: 244 GAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM- 299
             I     G    D+G   T      Y R+      ++          G  LC     + 
Sbjct: 255 WKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGID 314

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIP--PPVEGVFCFAMQPIDGD-VGIFGNFAQSDLF 356
             I P  T  FD GA       + FI   P ++   C AM     D   + GN  Q +  
Sbjct: 315 HPIYPSFTIEFDQGATYRPNQGNYFIEVSPNID---CLAMLESSSDGFNVIGNIIQQNYL 371

Query: 357 IGYDFDSQMVSFKPTDC 373
           + YD +   + F   +C
Sbjct: 372 VQYDREEHRIGFAHANC 388


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 118/383 (30%), Positives = 164/383 (42%), Gaps = 25/383 (6%)

Query: 4   ATYFYPNNVVQSNVSTANGEYVMKFSIGTP----PLLDIYGIVDTGSDLMWVQCLPCVQC 59
           AT   P N      +  +GEY+ K ++GTP       +     D GSD+ W+QC+PC +C
Sbjct: 105 ATPADPENGTVVTGAPTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRC 164

Query: 60  YKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL--CNYTYGYADSSLTKGVLAT 117
           Y Q  P+YN   SSS  ++ C +  C  L +     Q L  C Y   Y D S + G    
Sbjct: 165 YHQPGPVYNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGV 224

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           E +TF         V  GCG +N G+F     G++GLGR  LS  SQI  + G   FSYC
Sbjct: 225 ETLTFPPGVR-VPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYG-RSFSYC 282

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTS----LVSKEDKTYYFVTLEGISVGNLSNS 233
           L    T    +S + FG+G+  +       S    L +    T+Y+V L GISVG +   
Sbjct: 283 LAGQGTGGR-SSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVR 341

Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQL 292
                      +   G + +D+G   T L    Y    +  R  A+K   +  P  G   
Sbjct: 342 GVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSP--GGPF 399

Query: 293 -----CYKT--PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV-EGVFCFAMQPI-DGD 343
                CY +    +    P ++ HF GG +V L   +  IP    +G  CFA     D  
Sbjct: 400 AFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRG 459

Query: 344 VGIFGNFAQSDLFIGYDFDSQMV 366
           V I GN       + YD D Q V
Sbjct: 460 VSIIGNIQLQGFRVVYDVDGQRV 482


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 116/366 (31%), Positives = 166/366 (45%), Gaps = 27/366 (7%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSS 73
            V+   G YV+   +GTP   + + +V DTGSD  WVQC PCV  CY+Q +P+++P  S+
Sbjct: 153 GVALGTGNYVVPVRLGTP--AERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSA 210

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
           +Y  +SC S  C  L    CS    C Y   Y D S T G  A + +T   + +   N  
Sbjct: 211 TYANISCSSSYCSDLYVSGCSGGH-CLYGIQYGDGSYTIGFYAQDTLTL--AYDTIKNFR 267

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  N G+F     GL+GLGR + SL  Q   + G   F+YCL      S+ T  +  
Sbjct: 268 FGCGEKNRGLFGR-AAGLLGLGRGKTSLPVQAYDKYG-GVFAYCL---PATSAGTGFLDL 322

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G G+  +   +  T ++     T+Y+V + GI VG       ++P   S    S     +
Sbjct: 323 GPGAPAANARL--TPMLVDRGPTFYYVGMTGIKVGG-----HVLPIPGS--VFSTAGTLV 373

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAG--IA-PILTA 308
           D+G   T LP   Y  L      A++   Y      S L  CY      G  IA P ++ 
Sbjct: 374 DSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSL 433

Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GGA + +  +   ++    +    FA    D DV I GN  Q    + YD   ++V 
Sbjct: 434 VFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVG 493

Query: 368 FKPTDC 373
           F P  C
Sbjct: 494 FAPGAC 499


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 165/350 (47%), Gaps = 33/350 (9%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
           IVDTGSDL WVQC PC  CY Q  P+Y+P+ SSSYK + C S  C  L   + +S     
Sbjct: 149 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 208

Query: 96  -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
                +  C Y   Y D S T+G LA+E I  G++    +N VFGCG NN G+F  +   
Sbjct: 209 NNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK--LENFVFGCGRNNKGLFGGSSG- 265

Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
           L+GLGR+ +SL SQ L       FSYCL      +S    + FGN S V  +   V  T 
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 322

Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
           LV   + +++Y + L G S+G +   S          +  +G + ID+G   T LP   Y
Sbjct: 323 LVQNPQLRSFYILNLTGASIGGVELKSS---------SFGRG-ILIDSGTVITRLPPSIY 372

Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
             ++ +        P          C+   S   I+ PI+   F G A++ +  T  F  
Sbjct: 373 KAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYF 432

Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             P   + C A+  +  + +VGI GN+ Q +  + YD   + +     +C
Sbjct: 433 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 165/350 (47%), Gaps = 33/350 (9%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
           IVDTGSDL WVQC PC  CY Q  P+Y+P+ SSSYK + C S  C  L   + +S     
Sbjct: 101 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 160

Query: 96  -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
                +  C Y   Y D S T+G LA+E I  G++    +N VFGCG NN G+F  +   
Sbjct: 161 NNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK--LENFVFGCGRNNKGLFGGSSG- 217

Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
           L+GLGR+ +SL SQ L       FSYCL      +S    + FGN S V  +   V  T 
Sbjct: 218 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 274

Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
           LV   + +++Y + L G S+G +   S          +  +G + ID+G   T LP   Y
Sbjct: 275 LVQNPQLRSFYILNLTGASIGGVELKSS---------SFGRG-ILIDSGTVITRLPPSIY 324

Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
             ++ +        P          C+   S   I+ PI+   F G A++ +  T  F  
Sbjct: 325 KAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYF 384

Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             P   + C A+  +  + +VGI GN+ Q +  + YD   + +     +C
Sbjct: 385 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 165/350 (47%), Gaps = 33/350 (9%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
           IVDTGSDL WVQC PC  CY Q  P+Y+P+ SSSYK + C S  C  L   + +S     
Sbjct: 149 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 208

Query: 96  -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
                +  C Y   Y D S T+G LA+E I  G++    +N VFGCG NN G+F  +   
Sbjct: 209 NNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK--LENFVFGCGRNNKGLFGGSSG- 265

Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
           L+GLGR+ +SL SQ L       FSYCL      +S    + FGN S V  +   V  T 
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 322

Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
           LV   + +++Y + L G S+G +   S          +  +G + ID+G   T LP   Y
Sbjct: 323 LVQNPQLRSFYILNLTGASIGGVELKSS---------SFGRG-ILIDSGTVITRLPPSIY 372

Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
             ++ +        P          C+   S   I+ PI+   F G A++ +  T  F  
Sbjct: 373 KAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYF 432

Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             P   + C A+  +  + +VGI GN+ Q +  + YD   + +     +C
Sbjct: 433 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 118/383 (30%), Positives = 174/383 (45%), Gaps = 38/383 (9%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           A  EY +   +GTP  +++  I+DTGSD+ W+QC+PC  C   ++P +NP  SSS+ +L 
Sbjct: 135 AGLEYYVPLQVGTP-AVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLP 193

Query: 80  CQSEQC-HLLDTVS--CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD----- 130
           C S  C ++   V   CS S + C ++  Y D SL+ G+LA E I  GN+ NF D     
Sbjct: 194 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVK 252

Query: 131 --NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             N+  GC   +         GL+G+ R  +S  SQ+ S+  A KFS+C        + +
Sbjct: 253 LSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRY-ARKFSHCFPDKIAHLNSS 311

Query: 189 SKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
             ++FG    +S       +V    V      YY+V L GISV    + S+L P  + + 
Sbjct: 312 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISV----DESRL-PLSHKNF 366

Query: 245 AISK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-- 298
            I K    G   ID+G   T L K  +  +  +           D   G   CY   S  
Sbjct: 367 DIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGT 426

Query: 299 ---MAGIAPILTAHFDGGAKVPLIHTSTFIP---PPVEGVFCFAMQPIDGDV--GIFGNF 350
               + I P +T HF GG  V L   S  IP      +   C A   + GD+   I GN+
Sbjct: 427 AALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNY 485

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            Q +L++ YD +   +   P  C
Sbjct: 486 QQQNLWVEYDLEKLRLGIAPAQC 508


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 172/365 (47%), Gaps = 36/365 (9%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV  F+IGTPP      ++D   +L+W QC  C +C++Q  P+++P +S++Y+   C + 
Sbjct: 51  YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTP 109

Query: 84  QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
            C  +  D  +CS   +C Y     ++  T G + T+    G +     ++ FGC   + 
Sbjct: 110 LCESIPSDVRNCSG-NVCAYE-ASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G+VGLGRT  SL    ++Q G   FSYCL P   D+   S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGKNSALFLGSSAKLAG 218

Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           GG   ST  V+      +   YY V LEG+  G+      +IP   S   +      +DT
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGD-----AMIPLPPSGSTV-----LLDT 268

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
            +P + L    Y  +++ V  A+   P   P     LC+     +G AP L   F GGA 
Sbjct: 269 FSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAA 328

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           +  +  + ++     G  C AM          ++ + G+  Q ++   +D D + +SF+P
Sbjct: 329 M-TVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387

Query: 371 TDCTK 375
            DCTK
Sbjct: 388 ADCTK 392


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 114/363 (31%), Positives = 165/363 (45%), Gaps = 52/363 (14%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL-- 98
           IVDT S+L WVQC PC  C+ Q  P+++P+SS SY  + C S  C  L       QQL  
Sbjct: 157 IVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQ------QQLAT 210

Query: 99  ----------------CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
                           C+Y   Y D S ++GVLA +R++   +    D  VFGCG +N G
Sbjct: 211 GAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL--AGEVIDGFVFGCGTSNQG 268

Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--S 200
                  GL+GLGR++LSL SQ + Q G   FSYCL P   +S  +  +  G+      +
Sbjct: 269 PPFGGTSGLMGLGRSQLSLVSQTVDQFG-GVFSYCL-PLSRESDASGSLVLGDDPSAYRN 326

Query: 201 GGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
              VV TS+VS  D      +Y V L GI+VG             S+G  ++    +D+G
Sbjct: 327 STPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQE--------VESTGFSARA--IVDSG 376

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILTAHFDGGA 314
              T L    YN +  +  + +   P Q P       C+    +  +  P LT  FDGGA
Sbjct: 377 TVITSLVPSVYNAVRAEFMSQLAEYP-QAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGA 435

Query: 315 KVPLIHTST--FIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           +V +       F+      V C A+  +  + +  I GN+ Q +L + +D  +  V F  
Sbjct: 436 EVEVDSGGVLYFVSSDSSQV-CLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQ 494

Query: 371 TDC 373
             C
Sbjct: 495 ETC 497


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 173/375 (46%), Gaps = 38/375 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCL----PCVQCYKQVKPIYNPASSSSYKELS 79
           + +   IGTPP      IVDTGSDL+W QC       V       P+Y+P  SS++  L 
Sbjct: 91  HSLTVGIGTPPQPRKL-IVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLP 149

Query: 80  CQSEQCH--LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           C    C        +C+S+  C Y   Y  S+   GVLA+E  TFG        + FGCG
Sbjct: 150 CSDRLCQEGQFSFKNCTSKNRCVYEDVYG-SAAAVGVLASETFTFGARRAVSLRLGFGCG 208

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             + G       G++GL    LSL    ++QL   +FSYCL PF      TS + FG  +
Sbjct: 209 ALSAGSLI-GATGILGLSPESLSL----ITQLKIQRFSYCLTPFADKK--TSPLLFGAMA 261

Query: 198 EVSGGG----VVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
           ++S       + +T++VS   KT YY+V L GIS+G+     K +    +S A+     G
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGH-----KRLAVPAASLAMRPDGGG 316

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------ 303
              +D+G+    L +  +  ++E V + ++L          +LC+  P     A      
Sbjct: 317 GTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQ 376

Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGD-VGIFGNFAQSDLFIGYD 360
            P L  HFDGGA + L   + F   P  G+ C A+ +  DG  V I GN  Q ++ + +D
Sbjct: 377 VPPLVLHFDGGAAMVLPRDNYF-QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 435

Query: 361 FDSQMVSFKPTDCTK 375
                 SF PT C +
Sbjct: 436 VQHHKFSFAPTQCDQ 450


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 151/360 (41%), Gaps = 34/360 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           +YV+  S+GTP +      VDTGSD+ WVQC PC    C  Q   +++PA SS+Y  + C
Sbjct: 142 QYVVTVSLGTPGVSQTV-EVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200

Query: 81  QSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            ++ C  L      CS  Q C Y   Y D S T GV  ++ +     N      +FGCGH
Sbjct: 201 GADACSELRIYEAGCSGSQ-CGYVVSYGDGSNTTGVYGSDTLALAPGNT-VGTFLFGCGH 258

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G+F   + GL+ LGR  +SL SQ     G   FSYCL      S  ++  Y   G  
Sbjct: 259 AQAGMFAGID-GLLALGRQSMSLKSQAAGAYG-GVFSYCL-----PSKQSAAGYLTLGGP 311

Query: 199 VSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
            S  G  +T L++     T+Y V L GISVG    +     +         G   +DTG 
Sbjct: 312 SSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF--------AGGTVVDTGT 363

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA--PILTAHFDGG 313
             T LP   Y  L    R AI    Y        L  CY   S  G+   P +   F GG
Sbjct: 364 VITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDF-SRYGVVTLPTVALTFSGG 422

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A + L            G   FA    DGD  I GN  Q    +   FD   V F P  C
Sbjct: 423 ATLALEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRSFAV--RFDGSTVGFMPGAC 476


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 127/403 (31%), Positives = 191/403 (47%), Gaps = 58/403 (14%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V+S V+  +GEY+M   +GTPP      I+DTGSDL W+QC PC+ C++Q  P+++PA+S
Sbjct: 140 VESGVAVGSGEYLMDVYVGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 198

Query: 73  SSYKELSCQSEQC-HL----------LDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
           SSY+ ++C   +C H+            T     +  C Y Y Y D S T G LA E  T
Sbjct: 199 SSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFT 258

Query: 122 FG----NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
                  ++   D VVFGCGH N G+F+     L+GLGR  LS ASQ+ +  G + FSYC
Sbjct: 259 VNLTAPGASRRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYG-HTFSYC 316

Query: 178 LVPFHTDSSITSKMYFGNGSEVSG---------GGVVSTSLVSKEDKTYYFVTLEGISVG 228
           LV   +D  + SK+ FG   +                  S  S    T+Y+V L+G+ VG
Sbjct: 317 LVDHGSD--VGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVG 374

Query: 229 ----NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE----EQVRNAIKL 280
               N+S+ +  +      G    G   ID+G   +   +  Y  +     +++  +  L
Sbjct: 375 GELLNISSDTWDV------GKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPL 428

Query: 281 TPYQDPRLGSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFC 334
            P + P L    CY   +++G+     P L+  F  GA       + FI    +G  + C
Sbjct: 429 VP-EFPVLSP--CY---NVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMC 482

Query: 335 FAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            A+   P  G + I GNF Q +  + YD  +  + F P  C +
Sbjct: 483 LAVLGTPRTG-MSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 524


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 161/359 (44%), Gaps = 18/359 (5%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKEL 78
            + +Y +   +GTP   D+  I DTGS L W QC PC   CYKQ  PI++P+ SSSY  +
Sbjct: 136 GSADYYVVVGLGTPKR-DLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNI 194

Query: 79  SCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            C S  C    +  CSS     C Y   Y D+S+++G L+ ER+T   + +   + +FGC
Sbjct: 195 KCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTI-TATDIVHDFLFGC 253

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G +N G+F     GL+GL R  +S   Q  S +    FSYCL    T SS+    +  + 
Sbjct: 254 GQDNEGLF-RGTAGLMGLSRHPISFVQQT-SSIYNKIFSYCLPS--TPSSLGHLTFGASA 309

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           +  +       S +S E+ ++Y + + GISVG        +P  +SS   S G   ID+G
Sbjct: 310 ATNANLKYTPFSTISGEN-SFYGLDIVGISVGGTK-----LPAVSSS-TFSAGGSIIDSG 362

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAK 315
              T LP   Y  L    R  +   P          CY       I+ P +   F GG K
Sbjct: 363 TVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAGGVK 422

Query: 316 VPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           V L           + +   FA      D+ IFGN  Q  L + YD +   + F    C
Sbjct: 423 VELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 481


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 132/374 (35%), Positives = 177/374 (47%), Gaps = 33/374 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
           VQS  S   G+YV+   +GTP   +   I DTGSD+ W QC PCV+ CYKQ +P  NP++
Sbjct: 108 VQSGASIGAGDYVVTVGLGTPKK-EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST 166

Query: 72  SSSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S+SYK +SC S  C L+      + SCSS   C Y   Y D S + G  ATE +T  +S+
Sbjct: 167 STSYKNISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTL-SSS 224

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           N F N +FGCG  N         GL+GLGRT+L+L SQ  ++     FSYCL      +S
Sbjct: 225 NVFKNFLFGCGQQNN-GLFGGAAGLLGLGRTKLALPSQT-AKTYKKLFSYCL-----PAS 277

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            +SK Y   G +VS   V  T L +  D T +Y + + G+SVG    S           A
Sbjct: 278 SSSKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSI-------DESA 329

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
            S G + ID+G   T L    Y+ L    +N +   P          CY       +  P
Sbjct: 330 FSAGTV-IDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 388

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
            +   F GG ++ +  +   I  PV G+      FA    D D  IFGN  Q    + YD
Sbjct: 389 KVGVTFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 446

Query: 361 FDSQMVSFKPTDCT 374
                V F P  C+
Sbjct: 447 GAKGRVGFAPGGCS 460


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 132/374 (35%), Positives = 177/374 (47%), Gaps = 33/374 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
           VQS  S   G+YV+   +GTP   +   I DTGSD+ W QC PCV+ CYKQ +P  NP++
Sbjct: 60  VQSGASIGAGDYVVTVGLGTPKK-EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST 118

Query: 72  SSSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S+SYK +SC S  C L+      + SCSS   C Y   Y D S + G  ATE +T  +S+
Sbjct: 119 STSYKNISCSSALCKLVASGKKFSQSCSS-STCLYQVQYGDGSYSIGFFATETLTL-SSS 176

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           N F N +FGCG  N         GL+GLGRT+L+L SQ  ++     FSYCL      +S
Sbjct: 177 NVFKNFLFGCGQQNN-GLFGGAAGLLGLGRTKLALPSQT-AKTYKKLFSYCL-----PAS 229

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            +SK Y   G +VS   V  T L +  D T +Y + + G+SVG    S           A
Sbjct: 230 SSSKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSI-------DESA 281

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
            S G + ID+G   T L    Y+ L    +N +   P          CY       +  P
Sbjct: 282 FSAGTV-IDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 340

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
            +   F GG ++ +  +   I  PV G+      FA    D D  IFGN  Q    + YD
Sbjct: 341 KVGVTFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 398

Query: 361 FDSQMVSFKPTDCT 374
                V F P  C+
Sbjct: 399 GAKGRVGFAPGGCS 412


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 162/364 (44%), Gaps = 30/364 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV+   IGTP +     ++DTGSDL WVQC PC    CY Q  P+Y+P +SS+Y  + C
Sbjct: 126 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPC 184

Query: 81  QSEQCHLL-------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
            S+ C  L          + S   LC Y   Y +   T GV +TE +T     +  D   
Sbjct: 185 DSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVSVKD-FG 243

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG    G F+  +  L   G    SL SQ     G   FSYCL P ++ +   +    
Sbjct: 244 FGCGLVQQGTFDLFDGLLGLGGAPE-SLVSQTAETYGG-AFSYCLPPGNSTTGFLALGAP 301

Query: 194 GNGSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
            N ++ +  G + T L S  E  T+Y V L G+SVG       + P       +  G M 
Sbjct: 302 TNNNDTA--GFLFTPLHSLPEQATFYLVNLTGVSVGG--KPLDIPP------TVLSGGMI 351

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APILTAH 309
           ID+G   T LP   Y+ L    R A+   P   P     L  CY    +A +  P +   
Sbjct: 352 IDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALT 411

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           FDGGA + L   S  +   ++    FA    DGDVGI GN  Q    + YD     V F+
Sbjct: 412 FDGGATIDLDVPSGVL---IQDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGHVGFR 468

Query: 370 PTDC 373
           P  C
Sbjct: 469 PGAC 472


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 118/384 (30%), Positives = 174/384 (45%), Gaps = 39/384 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPA 70
            +  +S   G YV+   +GTP   D+  + DTGSDL WVQC PC    CYKQ  P++ P+
Sbjct: 143 AERGISVGTGNYVVSVGLGTP-ARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPS 201

Query: 71  SSSSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFG------ 123
            SS++  + C + +C    +   S     C Y   Y D S T+G L  + +T G      
Sbjct: 202 DSSTFSAVRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPAN 261

Query: 124 ---NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
               ++N     VFGCG NNTG+F + + GL GLGR ++SL+SQ   + G   FSYCL  
Sbjct: 262 ASAENDNKLPGFVFGCGENNTGLFGQAD-GLFGLGRGKVSLSSQAAGKFG-EGFSYCL-- 317

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIP 238
               SS ++  Y   G+ V        T ++++    ++Y+V L GI V     + + I 
Sbjct: 318 --PSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRV-----AGRAIR 370

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGS-QLCYKT 296
             +   A+    + +D+G   T L    Y  L     +A+    Y+  PRL     CY  
Sbjct: 371 VSSPRVALP---LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDF 427

Query: 297 PSMAGIA---PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNF 350
            + A      P +   F GGA + +  +       V    C A  P +GD    GI GN 
Sbjct: 428 TAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA-CLAFAP-NGDGRSAGILGNT 485

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q  L + YD   Q + F    C+
Sbjct: 486 QQRTLAVVYDVARQKIGFAAKGCS 509


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 179/376 (47%), Gaps = 41/376 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           +++  SIG+PP+  +  +VDTGS L+WVQCLPC+ C++Q    ++P  S S+K L C   
Sbjct: 104 FLVNLSIGSPPVTQLV-VVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFP 162

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG--NSNNFFD----------- 130
             + ++   C+      Y   Y     ++G+LA E + F   +    F            
Sbjct: 163 GYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQISKI 222

Query: 131 ---NVVFGCGHNNTGVFNENEM-GLVGLGRT-RLSLASQILSQLGANKFSYCLVPFHTDS 185
              N+ FGCGH N    N++   G+ GLG    +++A+Q+      NKFSYC+   +   
Sbjct: 223 KKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQL-----GNKFSYCIGDINNPL 277

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
              + +  G GS + G    ST L  +    +Y+VTL+ ISVG  S + K+ P      +
Sbjct: 278 YTHNHLVLGQGSYIEGD---STPL--QIHFGHYYVTLQSISVG--SKTLKIDPNAFKISS 330

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYK---TPSMA 300
              G + ID+G   T L    +  L +++ + +K    + P  R    LC+K   +  + 
Sbjct: 331 DGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLV 390

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFI 357
           G  P +T HF GGA + L   S F     +  FC A+ P + +   + + G  AQ +  +
Sbjct: 391 GF-PAVTFHFAGGADLVLESGSLFRQHGGDR-FCLAILPSNSELLNLSVIGILAQQNYNV 448

Query: 358 GYDFDSQMVSFKPTDC 373
           G+D +   V F+  DC
Sbjct: 449 GFDLEQMKVFFRRIDC 464


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 115/365 (31%), Positives = 170/365 (46%), Gaps = 22/365 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           S  +   G YV+   +GTP     Y +V DTGSD  WVQC PCV  CY+Q + +++P  S
Sbjct: 169 SGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRS 226

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC +  C  L+   CS    C Y   Y D S + G  A + +T  +S +     
Sbjct: 227 STYANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 284

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G+F E   GL+GLGR + SL  Q   + G   F++CL      S+ T  + 
Sbjct: 285 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLD 339

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           FG GS  +    ++T +++    T+Y++ + GI VG      +L+    S    +     
Sbjct: 340 FGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGG-----QLLSIPQS--VFATAGTI 392

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
           +D+G   T LP   Y+ L      A+    Y+     S L  CY    M+ +A P ++  
Sbjct: 393 VDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 452

Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F GGA++ +  +          V   FA     GDVGI GN       + YD   ++V F
Sbjct: 453 FQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 512

Query: 369 KPTDC 373
            P  C
Sbjct: 513 YPGVC 517


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 179/385 (46%), Gaps = 28/385 (7%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S VS  +GEY +   IG+PP      I+DTGSDL W+QC+PC  C++Q  P Y+P  
Sbjct: 184 TLESGVSLGSGEYFIDVFIGSPPK-HFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKD 242

Query: 72  SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
           S S++ ++C   +C L+ +      C  + Q C Y Y Y DSS T G  A E  T     
Sbjct: 243 SISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS 302

Query: 123 ---GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
              G S     +NV+FGCGH N G+F+     L       LS +SQ+ S  G + FSYCL
Sbjct: 303 STTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGR-GPLSFSSQLQSLYG-HSFSYCL 360

Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISVGNLSNSS 234
           V   +D+S++SK+ FG   ++     ++ TSL++ ++    T+Y++ ++ I VG      
Sbjct: 361 VDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVG----GE 416

Query: 235 KL-IPYYNSS-GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
           KL IP  N +  A   G   ID+G   +      Y  ++E     +K     +       
Sbjct: 417 KLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHP 476

Query: 293 CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNF 350
           CY       +  P     F  GA       + FI      + C AM       + I GN+
Sbjct: 477 CYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNY 536

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
            Q +  I YD  +  + + P  C +
Sbjct: 537 QQQNFHILYDTKNSRLGYAPMRCAE 561


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 118/384 (30%), Positives = 179/384 (46%), Gaps = 28/384 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S VS  +GEY +   IG+PP      I+DTGSDL W+QC+PC  C++Q  P Y+P  S
Sbjct: 185 LESGVSLGSGEYFIDVFIGSPPK-HFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDS 243

Query: 73  SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
            S++ ++C   +C L+ +      C  + Q C Y Y Y DSS T G  A E  T      
Sbjct: 244 ISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS 303

Query: 123 --GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
             G S     +NV+FGCGH N G+F+     L       LS +SQ+ S  G + FSYCLV
Sbjct: 304 TTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGR-GPLSFSSQLQSLYG-HSFSYCLV 361

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISVGNLSNSSK 235
              +D+S++SK+ FG   ++     ++ TSL++ ++    T+Y++ ++ I VG      K
Sbjct: 362 DRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGG----EK 417

Query: 236 L-IPYYNSS-GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLC 293
           L IP  N +  A   G   ID+G   +      Y  ++E     +K     +       C
Sbjct: 418 LQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPC 477

Query: 294 YKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFA 351
           Y       +  P     F  GA       + FI      + C AM       + I GN+ 
Sbjct: 478 YNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQ 537

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
           Q +  I YD  +  + + P  C +
Sbjct: 538 QQNFHILYDTKNSRLGYAPMRCAE 561


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 115/384 (29%), Positives = 176/384 (45%), Gaps = 47/384 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
           +Y+ ++ IG PP      I+DTGS+L+W QC  C   C++Q  P Y+P+ S + + + C 
Sbjct: 70  QYIAEYLIGDPPQ-RAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCN 128

Query: 82  SEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC---G 137
              C L     C S  + C    GY   ++  G LATE +TF +      ++VFGC    
Sbjct: 129 DAACALGSETQCLSDNKTCAVVTGYGAGNI-AGTLATENLTFQSETV---SLVFGCIVVT 184

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
             + G  N    G++GLGR +LSL SQ    LG  +FSYCL P+  D+   S M  G  +
Sbjct: 185 KLSPGSLN-GASGIIGLGRGKLSLPSQ----LGDTRFSYCLTPYFEDTIEPSHMVVGASA 239

Query: 198 EVSGGGVVSTSLVS-------KED--KTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAI 246
            +  G   ST + +        +D   T+Y++ L GI+ G   L+  S        +  +
Sbjct: 240 GLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGM 299

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS---QLCYKTPSMAGIA 303
             G  FID+GAP T L    Y  L  ++   +     Q P  G+    LC        + 
Sbjct: 300 WTGT-FIDSGAPLTSLVDVAYQALRAELARQLGAALVQ-PLAGTTGFDLCVALKDAERLV 357

Query: 304 PILTAHFDGGAKVPLIHTSTFIPP-----PVE-GVFCFAM-QPID------GDVGIFGNF 350
           P L  HF GG+      T   +PP     PV+    C  +   +D       +  + GN+
Sbjct: 358 PPLVLHFGGGSGT---GTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNY 414

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q ++ + YD    ++SF+P DC+
Sbjct: 415 MQQNMHVLYDLAGGVLSFQPADCS 438


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 132/374 (35%), Positives = 177/374 (47%), Gaps = 33/374 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
           VQS  S   G+YV+   +GTP   +   I DTGSD+ W QC PCV+ CYKQ +P  NP++
Sbjct: 120 VQSGASIGAGDYVVTVGLGTPKK-EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST 178

Query: 72  SSSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S+SYK +SC S  C L+      + SCSS   C Y   Y D S + G  ATE +T  +S+
Sbjct: 179 STSYKNISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTL-SSS 236

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           N F N +FGCG  N         GL+GLGRT+L+L SQ  ++     FSYCL      +S
Sbjct: 237 NVFKNFLFGCGQQNN-GLFGGAAGLLGLGRTKLALPSQT-AKTYKKLFSYCL-----PAS 289

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            +SK Y   G +VS   V  T L +  D T +Y + + G+SVG    S           A
Sbjct: 290 SSSKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSI-------DESA 341

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
            S G + ID+G   T L    Y+ L    +N +   P          CY       +  P
Sbjct: 342 FSAGTV-IDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 400

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
            +   F GG ++ +  +   I  PV G+      FA    D D  IFGN  Q    + YD
Sbjct: 401 KVGVTFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 458

Query: 361 FDSQMVSFKPTDCT 374
                V F P  C+
Sbjct: 459 GAKGRVGFAPGGCS 472


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 167/367 (45%), Gaps = 30/367 (8%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSY 75
            S A G YV +  +GTP    +  +VDTGS L W+QC PC V C++Q  P+++P +S +Y
Sbjct: 124 ASVAVGNYVTRLGLGTPATSYVM-VVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTY 182

Query: 76  KELSCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
             + C S +C  L        +CS   +C Y   Y DSS + G L+ + ++FG+ +  F 
Sbjct: 183 AAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFGSGS--FP 240

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
              +GCG +N G+F  +  GL+GL + +LSL  Q+   LG   FSYCL      +S  + 
Sbjct: 241 GFYYGCGQDNEGLFGRSA-GLIGLAKNKLSLLYQLAPSLG-YAFSYCL-----PTSSAAA 293

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--YYNSSGAISK 248
            Y   GS   G    +    S  D + YFVTL GISV   + +   +P   Y S   I  
Sbjct: 294 GYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISV---AGAPLAVPPSEYRSLPTI-- 348

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILT 307
               ID+G   T LP + Y  L   V  A+     + P       C++  +     P + 
Sbjct: 349 ----IDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAGLRVPRVD 404

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
             F GGA + L   +  I    +   C A  P  G   I GN  Q    + YD     + 
Sbjct: 405 MAFAGGATLALSPGNVLIDVD-DSTTCLAFAPT-GGTAIIGNTQQQTFSVVYDVAQSRIG 462

Query: 368 FKPTDCT 374
           F    C+
Sbjct: 463 FAAGGCS 469


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 179/379 (47%), Gaps = 61/379 (16%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           V  F+IGTPP      I+D   +L+W QC  C +C+KQ  P++ P +SS+++   C ++ 
Sbjct: 68  VANFTIGTPPQ-PASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDA 126

Query: 85  CHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGC----G 137
           C  + T +CSS  +C Y  G  +S L   T G++AT+    G +     ++ FGC    G
Sbjct: 127 CKSIPTSNCSS-NMCTY-EGTINSKLGGHTLGIVATDTFAIGTATA---SLGFGCVVASG 181

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            +  G       GL+GLGR      S ++SQ+   KFSYCL P   DS   S++  G+ +
Sbjct: 182 IDTMG----GPSGLIGLGRA----PSSLVSQMNITKFSYCLTPH--DSGKNSRLLLGSSA 231

Query: 198 EVSGGGVVSTSLVSK-----EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           +++GGG  +T+   K     +   YY + L+GI  G+ + +  L P  N+        + 
Sbjct: 232 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIA--LPPSGNT--------VL 281

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
           + T AP + L    Y  L+++V  A+   P   P     LC+    ++   AP L   F 
Sbjct: 282 VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQ 341

Query: 312 GGAKVPLIHTSTFIPPPV--------EGVFCFAM--------QPIDGDVGIFGNFAQSDL 355
            GA       +  +PPP         +G  C A+          +D ++ I G+  Q + 
Sbjct: 342 QGA------AALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENT 395

Query: 356 FIGYDFDSQMVSFKPTDCT 374
               D + + +SF+P DC+
Sbjct: 396 HFLLDLEKKTLSFEPADCS 414


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 175/371 (47%), Gaps = 32/371 (8%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYG-IVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSS 74
           +S  +G Y +K  +GTPP    Y  I+DTGS L W+QC PC V C+ Q  P+Y+P+ S +
Sbjct: 118 LSIGSGNYYVKLGLGTPP--KYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKT 175

Query: 75  YKELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           YK+LSC S +C  L      D +  +    C YT  Y D+S + G L+ + +T  +S   
Sbjct: 176 YKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT- 234

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
                +GCG +N G+F     G++GL R +LS+ +Q+ ++ G + FSYCL   ++ SS  
Sbjct: 235 LPQFTYGCGQDNQGLFGR-AAGIIGLARDKLSMLAQLSTKYG-HAFSYCLPTANSGSSGG 292

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             +  G+ S  S     +  L   ++ + YF+ L  I+V             + + A+ +
Sbjct: 293 GFLSIGSISPTSYK--FTPMLTDSKNPSLYFLRLTAITVSGRP--------LDLAAAMYR 342

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA--P 304
               ID+G   T LP   Y  L  Q    I  T Y      S L  C+K  S+  I+  P
Sbjct: 343 VPTLIDSGTVITRLPMSMYAAL-RQAFVKIMSTKYAKAPAYSILDTCFKG-SLKSISAVP 400

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFD 362
            +   F GGA + L   S  I    +G+ C A     G   + I GN  Q    I YD  
Sbjct: 401 EIKMIFQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 459

Query: 363 SQMVSFKPTDC 373
           +  + F P  C
Sbjct: 460 TSRIGFAPGSC 470


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 128/396 (32%), Positives = 199/396 (50%), Gaps = 53/396 (13%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S ++  +GEY M   +GTPP      I+DTGSDL W+QCLPC  C+ Q +  Y+P +
Sbjct: 150 TLESGMTLGSGEYFMDVLVGTPPK-HFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKT 208

Query: 72  SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
           S+S+K ++C   +C L+ +    V C S  Q C Y Y Y D S T G  A E  T     
Sbjct: 209 SASFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTT 268

Query: 123 --GNSNNF-FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
             G S+ +  +N++FGCGH N G+F+     L+GLGR  LS +SQ+ S  G + FSYCLV
Sbjct: 269 TEGRSSEYKVENMMFGCGHWNRGLFSGASG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLV 326

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKED---KTYYFVTLEGISVGNLSNSSK 235
             ++D++++SK+ FG   ++     ++ TS V+ ++   +T+Y++ ++ I VG     + 
Sbjct: 327 DRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVG---GEAL 383

Query: 236 LIPY--YNSS--GAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRN---AIKLTPYQ 284
            IP   +N S  GA   G   ID+G   +   +  Y    N+  E+++      +  P  
Sbjct: 384 DIPEETWNISPDGA---GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVL 440

Query: 285 DPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
           DP      C+   +++GI       P L   F  GA       ++FI    E + C A+ 
Sbjct: 441 DP------CF---NVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLS-EDLVCLAIL 490

Query: 339 PI-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                   I GN+ Q +  I YD     + F PT C
Sbjct: 491 GTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 155/374 (41%), Gaps = 55/374 (14%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY     +GTPP   +  ++DTGSD++W+QC PC QCY Q   +++P  S
Sbjct: 131 VVSGLAQGSGEYFASVGVGTPPTPALL-VLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRS 189

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
            SY  + C +  C  LD             C Y   Y D S+T G LATE + F      
Sbjct: 190 RSYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGAR- 248

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-- 186
              V  GCGH+N G+F      L      RLSL +Q   + G  +FSYC      D    
Sbjct: 249 VPRVAVGCGHDNEGLFVAAAGLLGLGR-GRLSLPTQTARRYG-RRFSYCFQGSDLDHRTI 306

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           I +      G+ V G G                             S +L P      + 
Sbjct: 307 IRTVHQHVGGARVRGVG---------------------------ERSLRLDP------ST 333

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMA 300
            +G + +D+G   T L +  Y  + E  R A   ++L P      G  L   CY      
Sbjct: 334 GRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPG-----GFSLFDTCYDLRGRR 388

Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            +  P ++ H  GGA+V L   +  IP    G FC A+   DG V I GN  Q    + +
Sbjct: 389 VVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVF 448

Query: 360 DFDSQMVSFKPTDC 373
           D D Q V+  P  C
Sbjct: 449 DGDRQRVALVPKSC 462


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 177/384 (46%), Gaps = 62/384 (16%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSCQ 81
           YV  F+IGTPP   + GIVD   +L+W QC  C    C+KQ  P+++P++S++Y+   C 
Sbjct: 62  YVANFTIGTPPQA-VSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGCGH 138
           S  C  + T +CS    C    GY   S+   T G+ +T+ I  GN+      + FGC  
Sbjct: 121 SPLCKSIPTRNCSGDGEC----GYEAPSMFGDTFGIASTDAIAIGNAEG---RLAFGCVV 173

Query: 139 NNTGVFN---ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
            + G  +   +   G VGLGRT  SL    + Q     FSYCL P        S ++ G 
Sbjct: 174 ASDGSIDGAMDGPSGFVGLGRTPWSL----VGQSNVTAFSYCLAPHGPGKK--SALFLGA 227

Query: 196 GSEVSGGGVVS--TSLVSKE--------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            ++++G G  +  T L+ +            YY V LEGI  G+++ ++       SSG 
Sbjct: 228 SAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAA------SSGG 281

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
            +   + ++T  P + LP   Y  LE+ V  A+      +P     LC++  +++G+ P 
Sbjct: 282 GAITILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGV-PD 340

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVE---------GVFCFA------MQPIDGDVGIFGNF 350
           L   F GGA        T   PP +         G  C +      +   D  V I G+ 
Sbjct: 341 LVFTFQGGA--------TLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSL 392

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q ++   +D + + +SF+P DC+
Sbjct: 393 LQENVHFLFDLEKETLSFEPADCS 416


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 167/369 (45%), Gaps = 33/369 (8%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQ 81
           YV   ++G     ++  IVDTGSDL WVQC PC    CY Q  P+++PA+S ++  + C 
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCG 239

Query: 82  SEQC--HLLDTV----SCS-----SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
           S  C   L D      SC+     S+Q C Y   Y D S ++GVLA + +  G +    D
Sbjct: 240 SPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTK-LD 298

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
             VFGCG +N G+F     GL+GLGRT LSL SQ  ++ G   FSYCL P  T S  T  
Sbjct: 299 GFVFGCGLSNRGLFG-GTAGLMGLGRTDLSLVSQTAARFG-GVFSYCL-PATTTS--TGS 353

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
           +  G G   S   +  T +++   +  +YF+        N++ ++       ++     G
Sbjct: 354 LSLGPGPSSSFPNMAYTRMIADPTQPPFYFI--------NITGAAVGGGAALTAPGFGAG 405

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
           N+ +D+G   T L    Y  +  +     +  P          CY       +  P+LT 
Sbjct: 406 NVLVDSGTVITRLAPSVYKAVRAEFARRFEY-PAAPGFSILDACYDLTGRDEVNVPLLTL 464

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQM 365
             +GGA+V +           +G   C AM   P +    I GN+ Q +  + YD     
Sbjct: 465 TLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSR 524

Query: 366 VSFKPTDCT 374
           + F   DCT
Sbjct: 525 LGFADEDCT 533


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 156/367 (42%), Gaps = 28/367 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
           EYV+   +G+PP      ++DTGSD+ WV+C PC  QC  QV P+++P+ SS+Y   SC 
Sbjct: 139 EYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCS 198

Query: 82  SEQCHLL----DTVSCSSQQLCNYTYGYADSSL-TKGVLATERITFGNSNN--FFDNVVF 134
           S  C  L    +   CSS   C Y   Y D S+ T G  +++ +  G+++N        F
Sbjct: 199 SAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRF 258

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GC H  TG+       +   G  + SL SQ     G   FSYCL P  + S     +  G
Sbjct: 259 GCSHAETGITGLTAGLMGLGGGAQ-SLVSQTAGTFGTTAFSYCLPPTPSSSGF---LTLG 314

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
                S G V +  L S +   +Y V LE I VG    S   IP       +    M +D
Sbjct: 315 AAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLS---IPT-----TVFSAGMIMD 366

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS---QLCYKTPSMAGIA-PILTAHF 310
           +G   T LP   Y+ L    +  +K  P      G      C+     + ++ P +   F
Sbjct: 367 SGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVF 426

Query: 311 D--GGAKVPLIHTSTFIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMV 366
              GGA V L  +   +      +FC A      DG  GI GN  Q    + YD     V
Sbjct: 427 SGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAV 486

Query: 367 SFKPTDC 373
            FK   C
Sbjct: 487 GFKAGAC 493


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 171/357 (47%), Gaps = 26/357 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y +   IGTPP L    I DT SDL W QC       KQV+P+++PA SSS+  ++C S+
Sbjct: 91  YTVTIGIGTPPQLHTL-IADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSK 149

Query: 84  QCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHNN 140
            C   +  T  CS++  C Y Y Y  S    GVLA E  T  ++N     +  FGCG   
Sbjct: 150 LCTEDNPGTKRCSNKT-CRYVYPYV-SVEAAGVLAYESFTLSDNNQHICMSFGFGCGALT 207

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G       G++G+    LS+    +SQL   KFSYCL P+ TD   +S ++FG  +++ 
Sbjct: 208 DGNL-LGASGILGMSPAILSM----VSQLAIPKFSYCLTPY-TDRK-SSPLFFGAWADL- 259

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
            G   +T  + K    YY+V L G+S+G     ++ +    ++ A+ +G   +D G    
Sbjct: 260 -GRYKTTGPIQKSLTFYYYVPLVGLSLG-----TRRLDVPAATFALKQGGTVVDLGCTVG 313

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKV 316
            L +  +  L+E V + + L          ++C+  PS   +     P L  +FDGGA +
Sbjct: 314 QLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADM 373

Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            L   + F   P  G+ C A+ P  G + I GN  Q +  + +D       F PT C
Sbjct: 374 VLPRDNYF-QEPTAGLMCLALVP-GGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 169/385 (43%), Gaps = 46/385 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   IGTP    +   +DTGSDL+W QC  C  C+ Q  P++  + S ++  + C  
Sbjct: 93  EYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSD 151

Query: 83  EQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF-----GNSNNFFDNVV 133
             C     L    C+++ + C Y YGY D S+T G +A +  TF      ++     N+ 
Sbjct: 152 PLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIR 211

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  N G+F  N+ G+ G G   LSL     SQL   +FSYC      +S ++  +  
Sbjct: 212 FGCGMMNYGLFTPNQSGIAGFGTGPLSLP----SQLKVRRFSYCFTAME-ESRVSPVILG 266

Query: 194 GNGSEVSG---GGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
           G    +     G + ST             + +YF++L G++VG        +P+  S+ 
Sbjct: 267 GEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETR-----LPFNASTF 321

Query: 245 AIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL---TPYQDPRLGSQLCYKTPS 298
           A+     G  FID+G   T  P+  +  L E     + L     Y DP   + LC+  P+
Sbjct: 322 ALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGYTDP--DNLLCFSVPA 379

Query: 299 --MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-----FCFAMQPIDGDVG-IFGNF 350
              A   P L  H + GA   L   +  +    +G       C  +       G I GNF
Sbjct: 380 KKKAPAVPKLILHLE-GADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNF 438

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
            Q ++ I YD +S  + F P  C K
Sbjct: 439 QQQNMHIVYDLESNKMVFAPARCDK 463


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 187/392 (47%), Gaps = 45/392 (11%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P    Q+ +   +G+Y M F IGTP    + G  DTGSDL+W +C  C +C  +  P Y 
Sbjct: 77  PGESAQTPLKKGSGDYAMSFGIGTP-ATGLSGEADTGSDLIWTKCGACARCSPRGSPSYY 135

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQL-------CNYTYGYADSS----LTKGVLAT 117
           P SSSS   ++C    C  L    CS+          C+Y Y Y ++      T+G+L T
Sbjct: 136 PTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMT 195

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           E  TFG+    F  + FGC   + G F     GLVGLGR +LSL    ++QL    F Y 
Sbjct: 196 ETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEAFGYR 250

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISVGN-- 229
           L    +D S  S + FG+ ++V+GG     +ST L++    +D  +Y+V L GISVG   
Sbjct: 251 L---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKL 307

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           +   S    +  S+GA   G +  D+G   T+LP   Y  + +++ + +    +Q P   
Sbjct: 308 VQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQKPPPA 361

Query: 290 SQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAMQPID 341
           +     +C+   S     P +  HFDGGA + L  T  ++P       E   C+++    
Sbjct: 362 ANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSVVKSS 420

Query: 342 GDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
             + I GN  Q D  + +D   +++M+   PT
Sbjct: 421 QALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 452


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 187/392 (47%), Gaps = 45/392 (11%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P    Q+ +   +G+Y M F IGTP    + G  DTGSDL+W +C  C +C  +  P Y 
Sbjct: 77  PGESAQTPLKKGSGDYAMSFGIGTP-ATGLSGEADTGSDLIWTKCGACARCSPRGSPSYY 135

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQL-------CNYTYGYADSS----LTKGVLAT 117
           P SSSS   ++C    C  L    CS+          C+Y Y Y ++      T+G+L T
Sbjct: 136 PTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMT 195

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           E  TFG+    F  + FGC   + G F     GLVGLGR +LSL    ++QL    F Y 
Sbjct: 196 ETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEAFGYR 250

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISVGN-- 229
           L    +D S  S + FG+ ++V+GG     +ST L++    +D  +Y+V L GISVG   
Sbjct: 251 L---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKL 307

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           +   S    +  S+GA   G +  D+G   T+LP   Y  + +++ + +    +Q P   
Sbjct: 308 VQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQKPPPA 361

Query: 290 SQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAMQPID 341
           +     +C+   S     P +  HFDGGA + L  T  ++P       E   C+++    
Sbjct: 362 ANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSVVKSS 420

Query: 342 GDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
             + I GN  Q D  + +D   +++M+   PT
Sbjct: 421 QALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 452


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 171/366 (46%), Gaps = 32/366 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV+   IGTP +     ++DTGSDL WVQC PC    CY Q  P+++P+ SS++  + C
Sbjct: 124 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPC 182

Query: 81  QSEQCHLLDTV----SCSSQQ-----LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
            S+ C  L        C++        C Y   Y + ++T+GV +TE +  G S+    +
Sbjct: 183 ASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALG-SSAVVKS 241

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGCG +  G +++ + GL+GLG    SL SQ  S  G   FSYCL P ++ +   + +
Sbjct: 242 FRFGCGSDQHGPYDKFD-GLLGLGGAPESLVSQTASVYG-GAFSYCLPPLNSGAGFLT-L 298

Query: 192 YFGNGSEVSGGGVVSTSL--VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
              N +  S  G V T +   S +  T+Y VTL GISVG  +     IP        +KG
Sbjct: 299 GAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALD---IP----PAVFAKG 351

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILT 307
           N+ +D+G   T +P   Y  L    R+A+   P   P   +   CY       +  P + 
Sbjct: 352 NI-VDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVA 410

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
             F GGA V L   S  +   VE    FA    DG  GI GN     + + YD     + 
Sbjct: 411 LTFVGGATVDLDVPSGVL---VEDCLAFA-DAGDGSFGIIGNVNTRTIEVLYDSGKGHLG 466

Query: 368 FKPTDC 373
           F+   C
Sbjct: 467 FRAGAC 472


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 117/390 (30%), Positives = 165/390 (42%), Gaps = 83/390 (21%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-----------LPCVQCYK 61
           V S V + + EY+M  ++G+PP   +  I DTGSDL+WV+C            P  Q   
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPR-SMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145

Query: 62  QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
                ++P+ SS+Y  +SCQ++ C  L   +C     C Y Y Y D S T GVL+TE  T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200

Query: 122 FGNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG---- 170
           F +              V FGC     G F  + +            A  +++QLG    
Sbjct: 201 FDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGL------VGLGGGAVSLVTQLGGATS 254

Query: 171 -ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
              +FSYCLVP   ++S  S + FG  ++V+  G  ST L                 VGN
Sbjct: 255 LGRRFSYCLVPHSVNAS--SALNFGALADVTEPGAASTPL-----------------VGN 295

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
            + +S            +   + +D+G   T L       + +++   I L P Q P   
Sbjct: 296 KTVASA-----------ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGL 344

Query: 290 SQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM------QP 339
            QLCY        A    P LT  F GGA V L   + F+    EG  C A+      QP
Sbjct: 345 LQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-EGTLCLAIVATTEQQP 403

Query: 340 IDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
               V I GN AQ ++ +GYD D+  V  K
Sbjct: 404 ----VSILGNLAQQNIHVGYDLDAGTVGNK 429



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/165 (29%), Positives = 71/165 (43%), Gaps = 23/165 (13%)

Query: 227 VGNLSNSSKLIPYYNSSGAI--------SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI 278
           +GNL+  +  + Y   +G +        +   + +D+G   T L       + +++   I
Sbjct: 407 LGNLAQQNIHVGYDLDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRI 466

Query: 279 KLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC 334
            L P Q P    QLCY        A    P LT  F GGA V L   + F+    EG  C
Sbjct: 467 TLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-EGTLC 525

Query: 335 FAM------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            A+      QP    V I GN AQ ++ +GYD D+  V+F   DC
Sbjct: 526 LAIVATTEQQP----VSILGNLAQQNIHVGYDLDAGTVTFAVADC 566


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 110/363 (30%), Positives = 152/363 (41%), Gaps = 22/363 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASS 72
           S  S    EYV+  +IGTP +  +  I DTGSD+ WVQC PC    C  Q   +++PA S
Sbjct: 120 SGYSLGTTEYVITVTIGTPAVTQVMSI-DTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMS 178

Query: 73  SSYKELSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           ++Y   SC S QC  L D  +   +  C Y   Y D S T G   ++ ++   S++   +
Sbjct: 179 ATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSL-TSSDAVKS 237

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGC H   G   E + GL+GLG    SL SQ  +  G   FSYCL P    SS    +
Sbjct: 238 FQFGCSHRAAGFVGELD-GLMGLGGDTESLVSQTAATYG-KAFSYCLPP--PSSSGGGFL 293

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
             G     S      T +V     T+Y V L+GI+V             N   ++  G  
Sbjct: 294 TLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGT--------MLNVPASVFSGAS 345

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
            +D+G   T LP   Y  L    +  +K  P   P      C+       I  P +T  F
Sbjct: 346 VVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTF 405

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             GA + L  +         G   F     DGD GI GN  Q    + +D   + + F+ 
Sbjct: 406 SRGAAMDLDISGILY----AGCLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRS 461

Query: 371 TDC 373
             C
Sbjct: 462 GAC 464


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 62/132 (46%), Positives = 89/132 (67%), Gaps = 4/132 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           +Q+ ++  +GEY+M  SIGTPP+ D  G+ DTGSDLMW QCLPC++CYKQ +PI++P  S
Sbjct: 81  LQAPLTPGSGEYLMSVSIGTPPV-DYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKS 139

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  + C S+ C  +D   C +Q +C+Y+Y Y D + TKG L  E+IT G+S+      
Sbjct: 140 TSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV---KS 196

Query: 133 VFGCGHNNTGVF 144
           V GCGH + G F
Sbjct: 197 VIGCGHESGGGF 208


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 162/369 (43%), Gaps = 38/369 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           +YV+    GTP +  +  ++DTGSDL WVQC PC    CY Q  P+++P++SS+Y  + C
Sbjct: 121 QYVVTLGFGTPAVPQVL-LIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPC 179

Query: 81  QSEQCHLLD--------TVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDN 131
            SE C  LD        T S S   LC Y   Y +   T GV +TE +T    +    +N
Sbjct: 180 GSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNN 239

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGCG    GVF+  +  L   G    SL SQ     G   FSYCL      +  ++  
Sbjct: 240 FSFGCGLVQKGVFDLFDGLLGLGGAPE-SLVSQTTGTYGG-AFSYCL-----PAGNSTAG 292

Query: 192 YFGNGSEVSGG----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           +   G+  +GG    G   T L   E  T+Y V L GISVG            +    + 
Sbjct: 293 FLALGAPATGGNNTAGFQFTPLQVVE-TTFYLVKLTGISVGGKQ--------LDIEPTVF 343

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK-TPSMAGIAP 304
            G M ID+G   T LP+  Y+ L    R+A+   P   P     L  CY  T +     P
Sbjct: 344 AGGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVP 403

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            +   F+GG  + L   S  +   ++G   F     DGD GI GN  Q    + YD    
Sbjct: 404 TVALTFEGGVTIDLDVPSGVL---LDGCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARG 460

Query: 365 MVSFKPTDC 373
            V F+   C
Sbjct: 461 HVGFRAGAC 469


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 169/370 (45%), Gaps = 23/370 (6%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYN 68
           N   +S +S   G YV+   +GTP       + DTGSD  WVQC PCV  CY+Q +P++ 
Sbjct: 151 NLPAKSGLSLNTGNYVVPIRLGTP-AARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFT 209

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           P  S++Y  +SC S  C  LDT  CS    C Y   Y D S T G  A + +T G   + 
Sbjct: 210 PTKSATYANISCTSSYCSDLDTRGCSGGH-CLYAVQYGDGSYTVGFYAQDTLTLG--YDT 266

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
             +  FGCG  N G+F +   GL+GLGR + S+  Q   +  +  F+YC+      SS T
Sbjct: 267 VKDFRFGCGEKNRGLFGK-AAGLMGLGRGKTSVPVQAYDKY-SGVFAYCI---PATSSGT 321

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             + FG G+  +    ++  LV     T+Y+V + GI VG    S   IP    +   S 
Sbjct: 322 GFLDFGPGAPAAANARLTPMLVD-NGPTFYYVGMTGIKVGGHLLS---IP----ATVFSD 373

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAG-IA-P 304
               +D+G   T LP   Y  L       ++   Y+     S L  CY      G IA P
Sbjct: 374 AGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALP 433

Query: 305 ILTAHFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
            ++  F GGA + +  +   ++    +    FA    D D+ I GN  Q    + YD   
Sbjct: 434 AVSLVFQGGACLDVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGK 493

Query: 364 QMVSFKPTDC 373
           ++V F P  C
Sbjct: 494 KVVGFAPGAC 503


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 119/380 (31%), Positives = 183/380 (48%), Gaps = 40/380 (10%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASS 72
           S +S  +  YVMKF+IG+PP+ + Y I DTGS+++W+QC    C  CYKQ  P++NP  S
Sbjct: 99  SRISIIDKVYVMKFNIGSPPV-ETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKS 157

Query: 73  SSYKELSCQSEQCH-----LLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S+Y    C   +C      L + + C SS Q+C Y   Y D S ++G ++T+ ITF    
Sbjct: 158 STYAIRLCGHRECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHI 217

Query: 127 NFFDN----VVFGCGHNNTGVFNEN-----EMGLVGLGRTRLSLASQILSQLGANKFSYC 177
             F N    + FGCG+NN+    ++       G+VGLG    SL    + QL   +FSYC
Sbjct: 218 AEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASL----VGQLTLGQFSYC 273

Query: 178 L-VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
           +  P     + T ++ FG  + +SG    ST+L +  +  Y F  ++GI V +     K 
Sbjct: 274 ISTPDVQKPNGTIEIRFGLAASISGH---STALANNLEGWYIFQNVDGIYVDD--TKVKG 328

Query: 237 IPYYN---SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ-- 291
            P +    + G I  G + +D+G   T L     + L  +++  I+L P       S   
Sbjct: 329 YPEWVFQFAEGGI--GGLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNSNYS 386

Query: 292 LCYKTPS-MAGIAPILTAHF--DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFG 348
           LCY   + +    P +   F  +  A  P    + +I    +  +C AM    G + I G
Sbjct: 387 LCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQ-YCLAMFGTSG-ISIIG 444

Query: 349 NFAQSDLFIGYDFDSQMVSF 368
            +   D+ IGYD    +VSF
Sbjct: 445 IYQHRDIKIGYDLKYNLVSF 464


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 178/385 (46%), Gaps = 57/385 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-------LPCVQCYKQVKPIYNPASSSSYK 76
           + +   IGTPP      IVDTGSDL+W QC              +Q +P+Y P  SSS+ 
Sbjct: 84  HSLTVGIGTPPQPRTL-IVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFA 142

Query: 77  ELSCQSEQCH--LLDTVSCSSQQLCNY--TYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            L C    C        +C+    C Y   YG A++    GVLA+E  TFG +      +
Sbjct: 143 YLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAEAG---GVLASETFTFGVNAKVSLPL 199

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  + G       GL+GL    +SL    +SQL   +FSYCL PF      TS + 
Sbjct: 200 GFGCGALSAGDL-VGASGLMGLSPGIMSL----VSQLSVPRFSYCLTPFAERK--TSPLL 252

Query: 193 FGNGSEV----SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
           FG  +++    + G V +TS++     +  YY+V L G+S+G    + +L     S G I
Sbjct: 253 FGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLG----TKRLDVPATSLGMI 308

Query: 247 S---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL-------TPYQDPRLGSQLCYKT 296
                G   +D+G+  + L +  +  +++ V  A++L         Y D     +LC+  
Sbjct: 309 KPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDD----YELCFAL 364

Query: 297 PSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFG 348
           P+  G+A      P L  HFDGGA + L   + F   P  G+ C A+   P    V I G
Sbjct: 365 PT--GVAMEAVKTPPLVLHFDGGAAMTLPRDNYF-QEPRAGLMCLAVGTSPDGFGVSIIG 421

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
           N  Q ++ + +D  +Q  SF PT C
Sbjct: 422 NVQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 162/358 (45%), Gaps = 23/358 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EY++    GTP +  +  ++DTGSD+ WVQC PC   +CY Q  P+++P+ SS+Y  ++C
Sbjct: 124 EYMVTLGFGTPSVPQVL-LMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIAC 182

Query: 81  QSEQCHLLD---TVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            ++ C+ L       C+S    C Y   Y D S T+GV + E ITF       D   FGC
Sbjct: 183 GADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAPGITVKD-FHFGC 241

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           GH+  G  ++ + GL+GLG    SL  Q  S  G   FSYCL   ++++   +     + 
Sbjct: 242 GHDQRGPSDKFD-GLLGLGGAPESLVVQTASVYGG-AFSYCLPALNSEAGFLALGVRPSA 299

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           +  +   V +       D T Y V + GISVG        IP      +  +G M ID+G
Sbjct: 300 ATNTSAFVFTPMWHLPMDATSYMVNMTGISVG---GKPLDIPR-----SAFRGGMLIDSG 351

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
              T LP+  YN L   +R A    P          CY     + +  P +   F GGA 
Sbjct: 352 TIVTELPETAYNALNAALRKAFAAYPMVASE-DFDTCYNFTGYSNVTVPRVALTFSGGAT 410

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + L   +  +   V+    F     D  +GI GN  Q  L + YD     V F+   C
Sbjct: 411 IDLDVPNGIL---VKDCLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 113/361 (31%), Positives = 157/361 (43%), Gaps = 33/361 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV+  S+GTP +  +  I DTGSD+ WVQC PC    C  Q   +++PA S++Y   SC
Sbjct: 129 EYVITVSLGTPAVTQVMSI-DTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSC 187

Query: 81  QSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            S QC  L  +   C +   C Y   Y D S T G   ++ +    S+    N  FGC H
Sbjct: 188 SSAQCAQLGGEGNGCLNSH-CQYIVKYVDHSNTTGTYGSDTLGLTTSDA-VKNFQFGCSH 245

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G   + + GL+GLG    SL SQ  +  G   FSYCL P  + SS    +  G    
Sbjct: 246 RANGFVGQLD-GLMGLGGDTESLVSQTAATYG-KAFSYCLPP--SSSSAGGFLTLG---- 297

Query: 199 VSGGGVVS-----TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            + GG  S     T LV     T+Y V L+ I+V      +KL    N   ++  G   +
Sbjct: 298 AAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAG----TKL----NVPASVFSGASVV 349

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDG 312
           D+G   T LP   Y  L    +  +K  P   P      C+    +  +  P++T  F  
Sbjct: 350 DSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSR 409

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           GA + L  +  F      G   F     DGD GI GN  Q    + +D     + F+P  
Sbjct: 410 GAVMDLDVSGIFY----AGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGA 465

Query: 373 C 373
           C
Sbjct: 466 C 466


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 162/350 (46%), Gaps = 37/350 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT--------VS 92
           IVDT S+L WVQC PC  C+ Q  P+++PASS SY  L C S  C  L            
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200

Query: 93  CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLV 152
              Q  C+YT  Y D S ++GVLA ++++   +    D  VFGCG +N G F     GL+
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSL--AGEVIDGFVFGCGTSNQGPFGGTS-GLM 257

Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTSLV 210
           GLGR++LSL SQ + Q G   FSYCL    ++SS    +  G+ + V  +   +V T++V
Sbjct: 258 GLGRSQLSLISQTMDQFGG-VFSYCLPLKESESS--GSLVLGDDTSVYRNSTPIVYTTMV 314

Query: 211 SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNR 269
           S   +  +YFV L GI++G     S            S G + +D+G   T L    YN 
Sbjct: 315 SDPVQGPFYFVNLTGITIGGQEVES------------SAGKVIVDSGTIITSLVPSVYNA 362

Query: 270 LEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTST--FI 325
           ++ +  +     P Q P       C+       +  P L   F+G  +V +  +    F+
Sbjct: 363 VKAEFLSQFAEYP-QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFV 421

Query: 326 PPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                 V C A+  +    +  I GN+ Q +L + +D     + F    C
Sbjct: 422 SSDSSQV-CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 176/370 (47%), Gaps = 41/370 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           ++M FSIG PP+  +  ++DTGS L WV C PC  C +Q  PI++P+ SS+Y  LSC   
Sbjct: 93  FLMNFSIGEPPIPQL-AVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCS-- 149

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV---VFGCGH-- 138
           +C+  D V+      C Y+  Y  S  ++G+ A E++T    +     V   +FGCG   
Sbjct: 150 ECNKCDVVNGE----CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKF 205

Query: 139 --NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
             ++ G   +   G+ GLG  R SL    L   G  KFSYC+      +   +++  G+ 
Sbjct: 206 SISSNGYPYQGINGVFGLGSGRFSL----LPSFG-KKFSYCIGNLRNTNYKFNRLVLGDK 260

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISKGNMFIDT 255
           + + G    ST+L        Y+V LE IS+G       + P  +  S   +   + ID+
Sbjct: 261 ANMQGD---STTL--NVINGLYYVNLEAISIG--GRKLDIDPTLFERSITDNNSGVIIDS 313

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIK---LTPYQDPRLGSQLCYK---TPSMAGIAPILTAH 309
           GA  T L K  +  L  +V N ++   +   QD      LCY    +  ++G  P++T H
Sbjct: 314 GADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGF-PLVTFH 372

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--GD----VGIFGNFAQSDLFIGYDFDS 363
           F  GA + L  TS FI    E  FC AM P +  GD        G  AQ +  +GYD + 
Sbjct: 373 FAEGAVLDLDVTSMFI-QTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNR 431

Query: 364 QMVSFKPTDC 373
             V F+  DC
Sbjct: 432 MRVYFQRIDC 441


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 162/350 (46%), Gaps = 37/350 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT--------VS 92
           IVDT S+L WVQC PC  C+ Q  P+++PASS SY  L C S  C  L            
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199

Query: 93  CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLV 152
              Q  C+YT  Y D S ++GVLA ++++   +    D  VFGCG +N G F     GL+
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSL--AGEVIDGFVFGCGTSNQGPFGGTS-GLM 256

Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTSLV 210
           GLGR++LSL SQ + Q G   FSYCL    ++SS    +  G+ + V  +   +V T++V
Sbjct: 257 GLGRSQLSLISQTMDQFGG-VFSYCLPLKESESS--GSLVLGDDTSVYRNSTPIVYTTMV 313

Query: 211 SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNR 269
           S   +  +YFV L GI++G     S            S G + +D+G   T L    YN 
Sbjct: 314 SDPVQGPFYFVNLTGITIGGQEVES------------SAGKVIVDSGTIITSLVPSVYNA 361

Query: 270 LEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTST--FI 325
           ++ +  +     P Q P       C+       +  P L   F+G  +V +  +    F+
Sbjct: 362 VKAEFLSQFAEYP-QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFV 420

Query: 326 PPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                 V C A+  +    +  I GN+ Q +L + +D     + F    C
Sbjct: 421 SSDSSQV-CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 120/374 (32%), Positives = 171/374 (45%), Gaps = 35/374 (9%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           ++ST N  YV    +GTP   ++   +DTGSD  WVQC PC  CY+Q  P+++P +SS+Y
Sbjct: 133 SLSTTN--YVASLRLGTP-ATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTY 189

Query: 76  KELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
             + C + +C  L         S  + + C Y   Y D S T G LA + +T   S +  
Sbjct: 190 SAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPS 249

Query: 130 --DNV---VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
             D V   VFGCGH+N G F E +  L+GLG  + SL SQ+ ++ GA  FSYCL      
Sbjct: 250 PADTVPGFVFGCGHSNAGTFGEVDG-LLGLGLGKASLPSQVAARYGA-AFSYCL-----P 302

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
           SS ++  Y   G   +      T +V+ +D T Y++ L GI V   +       +  ++G
Sbjct: 303 SSPSAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAG 362

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMA 300
            I      ID+G   + LP   Y  L    R+A+    Y+  R  S      CY      
Sbjct: 363 TI------IDSGTAFSRLPPSAYAALRSSFRSAMGRYRYK--RAPSSPIFDTCYDFTGHE 414

Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            +  P +   F  GA V L  +            C A  P + D+GI GN  Q  L + Y
Sbjct: 415 TVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVP-NHDLGILGNTQQRTLAVIY 473

Query: 360 DFDSQMVSFKPTDC 373
           D  SQ + F    C
Sbjct: 474 DVGSQRIGFGRKGC 487


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 177/379 (46%), Gaps = 37/379 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPAS 71
           ++S +S  +G Y +K  +G+P       IVDTGS   W+QC PC + C+ Q  P++NP++
Sbjct: 92  LKSGLSMGSGNYYVKMGLGSPTKYYTM-IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSA 150

Query: 72  SSSYKELSCQSEQCHL-----LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNS 125
           S +YK + C S QC       L+  +CS Q   C Y   Y DSS + G L+ + +T   S
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPS 210

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL-VPFHTD 184
                + V+GCG +N G+F   + G++GL    LS+ SQ+  + G N FSYCL   F T 
Sbjct: 211 QT-LSSFVYGCGQDNQGLFGRTD-GIIGLANNELSMLSQLSGKYG-NAFSYCLPTSFSTP 267

Query: 185 SSITSK-MYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG----NLSNSSKLIP 238
           +S     +  G  S         T L+    + + YF+ LE I+V      ++ SS  +P
Sbjct: 268 NSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP 327

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG-SQLCYKTP 297
                         ID+G   T LP   Y  L+      +     Q P +     C+K  
Sbjct: 328 ------------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKG- 374

Query: 298 SMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
           S+AGI   AP +   F GGA + L   ++ +     G+ C AM      + I GN+ Q  
Sbjct: 375 SLAGISEVAPDIRIIFKGGADLQLKGHNSLVELET-GITCLAMAG-SSSIAIIGNYQQQT 432

Query: 355 LFIGYDFDSQMVSFKPTDC 373
           + + YD  +  V F P  C
Sbjct: 433 VKVAYDVGNSRVGFAPGGC 451


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 172/380 (45%), Gaps = 42/380 (11%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           +N+ T N  YV    +G     +   +VDT S+L WVQC PC  C+ Q  P+++P+SS S
Sbjct: 113 ANLRTLN--YVATVGLGAA---EATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPS 167

Query: 75  YKELSCQSEQCHLLD------TVSCS----SQQLCNYTYGYADSSLTKGVLATERITFGN 124
           Y  + C S  C  L       T  C+     Q  C+Y   Y D S ++GVLA +++    
Sbjct: 168 YAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAG 227

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
            +   +  VFGCG +N G       GL+GLGR+ +SL SQ + Q G   FSYCL P   +
Sbjct: 228 QD--IEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGG-VFSYCL-PMR-E 282

Query: 185 SSITSKMYFGNGSEV--SGGGVVSTSLVSKE---DKTYYFVTLEGISVGNLSNSSKLIPY 239
           S  +  +  G+ S    +   +V T++VS        +YF+ L GI+VG     S   P+
Sbjct: 283 SGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVES---PW 339

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
           +      S G + ID+G   T L    YN +  +  + +   P Q P       C+    
Sbjct: 340 F------SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYP-QAPAFSILDTCFNLTG 392

Query: 299 MAGI-APILTAHFDGGAKVPLIHTST--FIPPPVEGVFCFAMQPIDG--DVGIFGNFAQS 353
           +  +  P L   F+G  +V +       F+      V C A+  +    D  I GN+ Q 
Sbjct: 393 LKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQV-CLALASLKSEYDTSIIGNYQQK 451

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
           +L + +D     + F    C
Sbjct: 452 NLRVIFDTLGSQIGFAQETC 471


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 177/379 (46%), Gaps = 37/379 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPAS 71
           ++S +S  +G Y +K  +G+P       IVDTGS   W+QC PC + C+ Q  P++NP++
Sbjct: 92  LKSGLSMGSGNYYVKMGLGSPTKYYTM-IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSA 150

Query: 72  SSSYKELSCQSEQCHL-----LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNS 125
           S +YK + C S QC       L+  +CS Q   C Y   Y DSS + G L+ + +T   S
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPS 210

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL-VPFHTD 184
                + V+GCG +N G+F   + G++GL    LS+ SQ+  + G N FSYCL   F T 
Sbjct: 211 QT-LSSFVYGCGQDNQGLFGRTD-GIIGLANNELSMLSQLSGKYG-NAFSYCLPTSFSTP 267

Query: 185 SSITSK-MYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG----NLSNSSKLIP 238
           +S     +  G  S         T L+    + + YF+ LE I+V      ++ SS  +P
Sbjct: 268 NSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP 327

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG-SQLCYKTP 297
                         ID+G   T LP   Y  L+      +     Q P +     C+K  
Sbjct: 328 ------------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKG- 374

Query: 298 SMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
           S+AGI   AP +   F GGA + L   ++ +     G+ C AM      + I GN+ Q  
Sbjct: 375 SLAGISEVAPDIRIIFKGGADLQLKGHNSLVELET-GITCLAMAG-SSSIAIIGNYQQQT 432

Query: 355 LFIGYDFDSQMVSFKPTDC 373
           + + YD  +  V F P  C
Sbjct: 433 VKVAYDVGNSRVGFAPGGC 451


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 115/356 (32%), Positives = 163/356 (45%), Gaps = 26/356 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P       ++DTGSD+ WVQC PC QC+ Q  P+++P+SSS+Y   SC S
Sbjct: 132 EYLITVRLGSPGKSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSS 190

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  +   CSS Q C YT  Y D S T G  +++ +  G  +N      FGC +  
Sbjct: 191 AACAQLGQEGNGCSSSQ-CQYTVTYGDGSSTTGTYSSDTLALG--SNAVRKFQFGCSNVE 247

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G FN+   GL+GLG    SL SQ     GA  FSYCL      SS +  +  G G+   
Sbjct: 248 SG-FNDQTDGLMGLGGGAQSLVSQTAGTFGA-AFSYCL---PATSSSSGFLTLGAGTS-- 300

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
            G V +  L S +  T+Y V ++ I VG    S   IP      ++      +D+G   T
Sbjct: 301 -GFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLS---IPT-----SVFSAGTIMDSGTVLT 351

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLI 319
            LP   Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V  I
Sbjct: 352 RLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVD-I 410

Query: 320 HTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            +   +      + C  FA    D  +GI GN  Q    + YD     V FK   C
Sbjct: 411 ASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 162/362 (44%), Gaps = 40/362 (11%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS-SSSYKELSCQSEQCHLL 88
           +GTPP   +   ++ G++L+W    P  +C++Q  P + P + S      SC S +    
Sbjct: 1   MGTPPN-PVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFW-- 57

Query: 89  DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENE 148
                   Q C YTY Y D S+T G L  ++ TF  +      V FGCG  N GVF  NE
Sbjct: 58  ------PNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNE 111

Query: 149 MGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKMYFGNGSEVSGG 202
            G+ G GR  LSL     SQL    FS+C       +P      + + + F NG     G
Sbjct: 112 TGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL-FSNGQ----G 162

Query: 203 GVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTG 256
            V +T L+    ++ + T Y+++L+GI+VG     S  +P   S+ A++ G     ID+G
Sbjct: 163 AVQTTPLIQYAKNEANPTLYYLSLKGITVG-----STRLPVPESAFALTNGTGGTIIDSG 217

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
              T LP   Y  + ++    IKL        G   C+  PS A    P L  HF+G   
Sbjct: 218 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATM 277

Query: 316 VPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                   F  P   G  + C A+   D +  I GNF Q ++ + YD  + M+SF    C
Sbjct: 278 DLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336

Query: 374 TK 375
            K
Sbjct: 337 DK 338


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 81/211 (38%), Positives = 111/211 (52%), Gaps = 12/211 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  S  +GEY  +  IG+PP   +Y +VDTGSD+ WVQC PC  CY+Q  PI+ P+ SSS
Sbjct: 44  SGASQGSGEYFSRVGIGSPPK-HVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSS 102

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y  L+C++ QC  LD   C +   C Y   Y D S T G  ATE IT   S +  +NV  
Sbjct: 103 YAPLTCETHQCKSLDVSECRNDS-CLYEVSYGDGSYTVGDFATETITLDGSAS-LNNVAI 160

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GCGH+N G+F                 +    SQ+ A+ FSYCLV   TDS+ T +    
Sbjct: 161 GCGHDNEGLFVGAAG-----LLGLGGGSLSFPSQINASSFSYCLVNRDTDSASTLEF--- 212

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI 225
             S +    V +  L + +  T+Y++ + GI
Sbjct: 213 -NSPIPSHSVTAPLLRNNQLDTFYYLGMTGI 242


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 161/364 (44%), Gaps = 43/364 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV++ S GTP +  +  ++DTGSD+ W+QC PC   QC+ Q  P+Y+P+ SS+Y  + C
Sbjct: 78  EYVVRVSFGTPAVPQVV-VIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPC 136

Query: 81  QSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            S+ C  L   +    C+S + C +   YAD + T G  + +++T         N  FGC
Sbjct: 137 ASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLA-PGAIVQNFYFGC 195

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK---MYF 193
           GH    V    + G++GLGR R SL ++         FSYCL       S++SK   +  
Sbjct: 196 GHGKHAVRGLFD-GVLGLGRLRESLGARY-----GGVFSYCL------PSVSSKPGFLAL 243

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G G   SG        V  +  T+  VTL GI+VG       L P   S      G M +
Sbjct: 244 GAGKNPSGFVFTPMGTVPGQ-PTFSTVTLAGINVGG--KKLDLRPSAFS------GGMIV 294

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAG-IAPILTAH 309
           D+G   T L    Y  L    R A+   +L P  D       CY        + P +   
Sbjct: 295 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALT 350

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F GGA + L   +  +   V G   FA    DG  G+ GN  Q    + +D  +    F+
Sbjct: 351 FTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFR 407

Query: 370 PTDC 373
              C
Sbjct: 408 AKAC 411


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 166/358 (46%), Gaps = 37/358 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVK----PIYNPASSSSYKELSCQSEQCH--LLDTVSCS 94
           IVDTGSDL+W QC          +    P+Y+P  SS++  L C    C        +C+
Sbjct: 29  IVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNCT 88

Query: 95  SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
           S+  C Y   Y  S+   GVLA+E  TFG        + FGCG  + G       G++GL
Sbjct: 89  SKNRCVYEDVYG-SAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLI-GATGILGL 146

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG----VVSTSLV 210
               LSL    ++QL   +FSYCL PF      TS + FG  +++S       + +T++V
Sbjct: 147 SPESLSL----ITQLKIQRFSYCLTPFADKK--TSPLLFGAMADLSRHKTTRPIQTTAIV 200

Query: 211 SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDF 266
           S   +T YY+V L GIS+G+     K +    +S A+     G   +D+G+    L +  
Sbjct: 201 SNPVETVYYYVPLVGISLGH-----KRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 255

Query: 267 YNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLI 319
           +  ++E V + ++L          +LC+  P     A       P L  HFDGGA + L 
Sbjct: 256 FEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLP 315

Query: 320 HTSTFIPPPVEGVFCFAM-QPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
             + F   P  G+ C A+ +  DG  V I GN  Q ++ + +D      SF PT C +
Sbjct: 316 RDNYF-QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQ 372


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 161/364 (44%), Gaps = 43/364 (11%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV++ S GTP +  +  ++DTGSD+ W+QC PC   QC+ Q  P+Y+P+ SS+Y  + C
Sbjct: 112 EYVVRVSFGTPAVPQVV-VIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPC 170

Query: 81  QSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            S+ C  L   +    C+S + C +   YAD + T G  + +++T         N  FGC
Sbjct: 171 ASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLA-PGAIVQNFYFGC 229

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK---MYF 193
           GH    V    + G++GLGR R SL ++         FSYCL       S++SK   +  
Sbjct: 230 GHGKHAVRGLFD-GVLGLGRLRESLGARY-----GGVFSYCL------PSVSSKPGFLAL 277

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           G G   SG        V  +  T+  VTL GI+VG       L P   S      G M +
Sbjct: 278 GAGKNPSGFVFTPMGTVPGQ-PTFSTVTLAGINVGG--KKLDLRPSAFS------GGMIV 328

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAG-IAPILTAH 309
           D+G   T L    Y  L    R A+   +L P  D       CY        + P +   
Sbjct: 329 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALT 384

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F GGA + L   +  +   V G   FA    DG  G+ GN  Q    + +D  +    F+
Sbjct: 385 FTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFR 441

Query: 370 PTDC 373
              C
Sbjct: 442 AKAC 445


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 110/364 (30%), Positives = 161/364 (44%), Gaps = 48/364 (13%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++  +IGTPP   +   +DTGSDL+W QC PC  C+ Q  P ++P++SS+    SC S
Sbjct: 88  EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 146

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
             C                  G   +SL +    +++ TF  +      V FGCG  N G
Sbjct: 147 TLCQ-----------------GLPVASLPR----SDKFTFVGAGASVPGVAFGCGLFNNG 185

Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKMYFGNG 196
           VF  NE G+ G GR  LSL     SQL    FS+C       +P      + + + F NG
Sbjct: 186 VFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL-FSNG 240

Query: 197 SEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFI 253
                G V +T L+    + T+Y+++L+GI+VG     S  +P   S  A+  G     I
Sbjct: 241 Q----GAVQTTPLIQNPANPTFYYLSLKGITVG-----STRLPVPESEFALKNGTGGTII 291

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
           D+G   T LP   Y  + +     +KL            C   P  A    P L  HF+G
Sbjct: 292 DSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 351

Query: 313 GA-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
               +P  +    +      + C A+    G+V   GNF Q ++ + YD  +  +SF P 
Sbjct: 352 ATMDLPRENYVFEVEDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 410

Query: 372 DCTK 375
            C K
Sbjct: 411 QCDK 414


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 103/352 (29%), Positives = 160/352 (45%), Gaps = 43/352 (12%)

Query: 51  VQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ--LCNYTYGYADS 108
           +QC PCV CY+Q+ P++NP  SSSY  + C S+ C  LD   C       C YTY Y+  
Sbjct: 1   MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60

Query: 109 SLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ 168
            +TKG LA +++  G   + F  VVFGC  ++ G       GLVGLGR  LSL    +SQ
Sbjct: 61  GVTKGTLAIDKLAIG--GDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSL----VSQ 114

Query: 169 LGANKFSYCLVPFHTDSSITSKMYFGNGSEV---SGGGVVSTSLVSKEDKTYYFVTLEGI 225
           L  ++F YCL P  + +S   K+  G G++        V  T   S    +YY++ L+G+
Sbjct: 115 LSVHRFMYCLPPPMSRTS--GKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGL 172

Query: 226 SVGNLSNSSK-----------------LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
           +VG+ +  +                         +G  +   M +D  +  + L    Y+
Sbjct: 173 AVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYD 232

Query: 269 RLEEQVRNAIKLTPYQDP--RLGSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTS 322
            L + +   I+L P   P  RLG  LC+  P   G+     P ++  FD G  + L    
Sbjct: 233 ELADDLEEEIRL-PRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFD-GRWLELDRDR 290

Query: 323 TFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            F+    +G + C  +    G V I GNF   ++ + ++     ++F    C
Sbjct: 291 LFV---TDGRMMCLMIGRTSG-VSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 114/372 (30%), Positives = 165/372 (44%), Gaps = 29/372 (7%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
            +S  +  +G Y++   +GTP    +  I DTGSDL W QC PC + CY Q  P++ P+ 
Sbjct: 120 AKSGATIGSGNYIVSVGLGTPKKY-LSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQ 178

Query: 72  SSSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
           S++Y  +SC S  C  L++ +     CS+ + C Y   Y D S + G  A E +T   S 
Sbjct: 179 STTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTL-TST 237

Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           +  +N +FGCG NN G+F  +  GL+GLG+ ++S+  Q   + G   FSYCL      SS
Sbjct: 238 DVIENFLFGCGQNNRGLFG-SAAGLIGLGQDKISIVKQTAQKYG-QVFSYCL---PKTSS 292

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            T  + FG G        +  + ++K      +Y V + G+ VG        IP   SS 
Sbjct: 293 STGYLTFGGGGGGG---ALKYTPITKAHGVANFYGVDIVGMKVGGTQ-----IPI--SSS 342

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA 303
             S     ID+G   T LP D Y+ L+      +   P + P L     CY     + I 
Sbjct: 343 VFSTSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYP-KAPELSILDTCYDLSKYSTIQ 401

Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
            P +   F GG ++ L             V   FA       V I GN  Q  L + YD 
Sbjct: 402 IPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDV 461

Query: 362 DSQMVSFKPTDC 373
               + F    C
Sbjct: 462 GGGKIGFGYNGC 473


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 164/372 (44%), Gaps = 36/372 (9%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N EY++  SIG P    +   +DTGSD++W QC PC +C+ Q  P ++ A+S++ + ++C
Sbjct: 89  NSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVAC 148

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDNVVFGC 136
               C+      C     C Y  GY D SL+ G    +  TF    G       ++ FGC
Sbjct: 149 SDPLCNAHSEHGCFLHG-CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGC 207

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           G  N G F + E G+ G GR  LSL     SQL   +FSYC        S  S ++ G  
Sbjct: 208 GMYNAGRFLQTETGIAGFGRGPLSLP----SQLKVRQFSYCFTTRFEAKS--SPVFLGGA 261

Query: 197 SEVSG---GGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISK 248
            ++     G ++ST  V       D ++Y ++ +G++VG     ++L +P   + G+   
Sbjct: 262 GDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGK----TRLPVPEIKADGS--- 314

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
           G  FID+G   T  P   + +L+     Q    +  T  +D    S    KT +M    P
Sbjct: 315 GATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDICFSWDGKKTAAM----P 370

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDS 363
            L  H + GA   L   +        G  C A+      D  + GNF Q +  I YD  +
Sbjct: 371 KLVFHLE-GADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAA 429

Query: 364 QMVSFKPTDCTK 375
             +   P  C K
Sbjct: 430 GKLLLVPAQCDK 441


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 176/377 (46%), Gaps = 46/377 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
            YV  F+IGTPP   + GIVD   +L+W QC  C    C+KQ  P+++P++S++Y+   C
Sbjct: 61  HYVANFTIGTPPQA-VSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQC 119

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGCG 137
            S  C  + T +CS    C    GY   S+   T G+ +T+ I  GN+      + FGC 
Sbjct: 120 GSPLCKSIPTRNCSGDGEC----GYEAPSMFGDTFGIASTDAIAIGNAEG---RLAFGCV 172

Query: 138 HNNTGVFN---ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
             + G  +   +   G VGLGRT  SL    + Q     FSYCL   H      S ++ G
Sbjct: 173 VASDGSIDGAMDGPSGFVGLGRTPWSL----VGQSNVTAFSYCLA-LHGPGK-KSALFLG 226

Query: 195 NGSEVSGGGVVS--TSLVSKEDKT--------YYFVTLEGISVGNLSNSSKLIPYYNSSG 244
             ++++G G  +  T L+ +            YY V LEGI  G+++ ++       SSG
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAA------SSG 280

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
             +   + ++T  P + LP   Y  LE+ V  A+      +P     LC++  +++G+ P
Sbjct: 281 GGAITVLQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGV-P 339

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPP-VEGVFCFA------MQPIDGDVGIFGNFAQSDLFI 357
            L   F GGA +    +   +      G  C +      +   D  V I G+  Q ++  
Sbjct: 340 DLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHF 399

Query: 358 GYDFDSQMVSFKPTDCT 374
            +D + + +SF+P DC+
Sbjct: 400 LFDLEKETLSFEPADCS 416


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 120/395 (30%), Positives = 188/395 (47%), Gaps = 51/395 (12%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S ++  +GEY M   +GTPP      I+DTGSDL W+QCLPC  C+ Q    Y+P +
Sbjct: 148 TLESGMTLGSGEYFMDVLVGTPPK-HFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKT 206

Query: 72  SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
           S+S+K ++C   +C L+ +    V C S  Q C Y Y Y D S T G  A E  T     
Sbjct: 207 SASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTT 266

Query: 123 ---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
              G+S     N++FGCGH N G+F+     L    R  LS +SQ+ S  G + FSYCLV
Sbjct: 267 TEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLG-RGPLSFSSQLQSLYG-HSFSYCLV 324

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKED---KTYYFVTLEGISVGNLSNSSK 235
             +++++++SK+ FG   ++     ++ TS V+ ++   +T+Y++ ++ I VG      K
Sbjct: 325 DRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVG-----GK 379

Query: 236 LIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFY----NRLEEQVRN---AIKLTPYQD 285
            +     +  IS    G   ID+G   +   +  Y    N+  E+++      +  P  D
Sbjct: 380 ALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLD 439

Query: 286 PRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP 339
           P      C+   +++GI       P L   F  G        ++FI    E + C A+  
Sbjct: 440 P------CF---NVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLS-EDLVCLAILG 489

Query: 340 I-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                  I GN+ Q +  I YD     + F PT C
Sbjct: 490 TPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    +  +VDTGS L W+QC PCV  C++Q  P++NP +SSSY  +SC
Sbjct: 127 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSC 185

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            ++QC  L T      SCS+  +C Y   Y DSS + G L+ + ++FG+++    N  +G
Sbjct: 186 SAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 243

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F ++  GL+GL R +LSL  Q+   +G + FSYCL    + SS    +   N
Sbjct: 244 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 301

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
             + S   + S+SL    D + YF+ + GI V          P   SS A S     ID+
Sbjct: 302 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 350

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   Y+ L + V  A+K TP          C++  +     P +T  F GGA 
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 410

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  +        C A  P      I GN  Q    + YD  +  + F    C+
Sbjct: 411 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    +  +VDTGS L W+QC PCV  C++Q  P++NP +SSSY  +SC
Sbjct: 127 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSC 185

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            ++QC  L T      SCS+  +C Y   Y DSS + G L+ + ++FG+++    N  +G
Sbjct: 186 SAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 243

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F ++  GL+GL R +LSL  Q+   +G + FSYCL    + SS    +   N
Sbjct: 244 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 301

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
             + S   + S+SL    D + YF+ + GI V          P   SS A S     ID+
Sbjct: 302 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 350

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   Y+ L + V  A+K TP          C++  +     P +T  F GGA 
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 410

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  +        C A  P      I GN  Q    + YD  +  + F    C+
Sbjct: 411 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 159/357 (44%), Gaps = 25/357 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
           EYV+   +GTP +      +DTGSD+ WVQC PC    C+ Q   +++PA SS+Y+ +SC
Sbjct: 126 EYVISVGLGTPAVTQTV-TIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSC 184

Query: 81  QSEQCHLLDTVS--CSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
            + +C  L+     C +    C Y   Y D S T G  + + +T   +++      FGC 
Sbjct: 185 AAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCS 244

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           H  +G F++   GL+GLG    SL SQ  +  G N FSYCL P    +S +S      G 
Sbjct: 245 HLESG-FSDQTDGLMGLGGGAQSLVSQTAAAYG-NSFSYCLPP----TSGSSGFLTLGGG 298

Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
             + G V +  L SK+  T+Y   L+ I+VG       L P   ++G++      +D+G 
Sbjct: 299 GGASGFVTTRMLRSKQIPTFYGARLQDIAVGG--KQLGLSPSVFAAGSV------VDSGT 350

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKV 316
             T LP   Y+ L    +  +K       R     C+       I+ P +   F GGA +
Sbjct: 351 IITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAI 410

Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            L                FA    DG  GI GN  Q    + YD  S  + F+   C
Sbjct: 411 DLDPNGIMY----GNCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 120/365 (32%), Positives = 158/365 (43%), Gaps = 38/365 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV+   +GTP +  +  ++DTGSDL WVQC PC    CY Q  P+++P+ SS+Y  + C
Sbjct: 119 EYVVTVGLGTPAVSQVL-LIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPC 177

Query: 81  QSEQCHLLD--------TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            ++ C  L         T        C Y   Y D S T GV + E +T        D  
Sbjct: 178 NTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKD-F 236

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCGH+  G  N+   GL+GLG    SL  Q  S  G   FSYCL P   D +     +
Sbjct: 237 HFGCGHDQDGP-NDKYDGLLGLGGAPESLVVQTSSVYGG-AFSYCL-PAANDQA----GF 289

Query: 193 FGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
              G+ V+   G V T +V +E +T+Y V + GI+VG         P      A S G M
Sbjct: 290 LALGAPVNDASGFVFTPMV-REQQTFYVVNMTGITVGGE-------PIDVPPSAFS-GGM 340

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
            ID+G   T L    Y  L+   R A+   P   P      CY     + +  P +   F
Sbjct: 341 IIDSGTVVTELQHTAYAALQAAFRKAMAAYPLL-PNGELDTCYNFTGHSNVTVPRVALTF 399

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
            GGA V L      +P  +    C A Q    D   GI GN  Q  L + YD     V F
Sbjct: 400 SGGATVDLD-----VPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGF 454

Query: 369 KPTDC 373
               C
Sbjct: 455 GADAC 459


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 161/358 (44%), Gaps = 27/358 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+   IG+P +     + DTGSD+ WVQC PC QC+ +V  +++P+SSS+Y   SC S
Sbjct: 121 EYVITVGIGSPAVTQTMSM-DTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSS 179

Query: 83  EQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
             C  L        C S Q C Y   Y DSS T G  +++ +T G+S     +  FGC  
Sbjct: 180 APCAQLSQSQEGNGCMSSQ-CQYIVNYGDSSSTTGTYSSDTLTLGSSA--MTDFQFGCSQ 236

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
           + +G FN+   GL+GLG    SLASQ     G   FSYCL P    S     +  G GS 
Sbjct: 237 SESGGFNDQTDGLMGLGGGAQSLASQTAGTFG-TAFSYCLPPTSGSSGF---LTLGTGSS 292

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
              G V +  L S +  TYY V LE I VG+           N   ++      +D+G  
Sbjct: 293 ---GFVKTPMLRSTQIPTYYVVLLESIKVGS--------QQLNLPTSVFSAGSLMDSGTI 341

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
            T LP   Y+ L    +  ++  P   P      C+     + I+ P +T  F GGA V 
Sbjct: 342 ITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVD 401

Query: 318 LIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           L      +      + C A  P   D  +GI GN  Q    + YD     V FK   C
Sbjct: 402 LAFDGIMLEIS-SSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P       ++DTGSD+ WVQC PC QC+ Q  P+++P+SSS+Y   SC S
Sbjct: 127 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 185

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  +   CSS   C Y   Y D S T G  +++ +  G+S     +  FGC +  
Sbjct: 186 AACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VKSFQFGCSNVE 243

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G FN+   GL+GLG    SL SQ    LG   FSYCL P  + S     +  G      
Sbjct: 244 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 298

Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
             G V T ++ S +  T+Y V L+ I VG    S   IP      ++      +D+G   
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 350

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
           T LP   Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V L
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 410

Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +   +         FA    D  +GI GN  Q    + YD    +V F+   C
Sbjct: 411 DASGIIL----SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    +  +VDTGS L W+QC PCV  C++Q  P++NP +SSSY  +SC
Sbjct: 125 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSC 183

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            ++QC  L T      SCS+  +C Y   Y DSS + G L+ + ++FG+++    N  +G
Sbjct: 184 SAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 241

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F ++  GL+GL R +LSL  Q+   +G + FSYCL    + SS    +   N
Sbjct: 242 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 299

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
             + S   + S+SL    D + YF+ + GI V          P   SS A S     ID+
Sbjct: 300 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 348

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   Y+ L + V  A+K TP          C++  +     P +T  F GGA 
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 408

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  +        C A  P      I GN  Q    + YD  +  + F    C+
Sbjct: 409 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 162/367 (44%), Gaps = 33/367 (8%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
           +S +S   G Y++   +G+P   D+  I DTGSDL W +C             ++P  S+
Sbjct: 124 KSGMSLGTGNYIVSIGLGSPKK-DLMLIFDTGSDLTWARC--------SAAETFDPTKST 174

Query: 74  SYKELSCQSEQCHLLDTV----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           SY  +SC +  C  + +     S  +   C Y   Y D S + G L  ER+T G S + F
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIG-STDIF 233

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           +N  FGCG +  G+F +   GL+GLGR +LS+ SQ   +     FSYCL      SS T 
Sbjct: 234 NNFYFGCGQDVDGLFGK-AAGLLGLGRDKLSVVSQTAPKYN-QLFSYCL----PSSSSTG 287

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
            + FG+    S       + +S    ++Y + L GI+VG    +  L   ++++G I   
Sbjct: 288 FLSFGSSQSKS----AKFTPLSSGPSSFYNLDLTGITVGGQKLAIPL-SVFSTAGTI--- 339

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
              ID+G   T LP   Y+ L    R A+   P   P      CY       I  P +  
Sbjct: 340 ---IDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVI 396

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            F GG  V +     F+   ++ V   FA      D  IFGN  Q +  + YD     V 
Sbjct: 397 SFSGGVDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVG 456

Query: 368 FKPTDCT 374
           F P  C+
Sbjct: 457 FAPASCS 463


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 172/398 (43%), Gaps = 59/398 (14%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPASSS 73
           S  ST +G+Y +   +GTPP   +  + DTGSDL+WV+C  C  C +      + P  SS
Sbjct: 79  SGASTGSGQYFVDIRLGTPPQ-SLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSS 137

Query: 74  SYKELSCQSEQCHLLDTVS---CSSQQL---CNYTYGYADSSLTKGVLATERITFGN--- 124
           S+    C    C LL       C+  +L   C + Y YAD SL+ G  + E  T  +   
Sbjct: 138 SFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197

Query: 125 SNNFFDNVVFGCGHNNTGV------FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
           S      + FGCG   +G       FN    G++GLGR  +S +SQ+  + G NKFSYCL
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFN-GARGVMGLGRGSISFSSQLGRRFG-NKFSYCL 255

Query: 179 VPFHTDSSITSKMYFGNGSE----VSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNS 233
           + +      TS +  G G       +   +  T L ++    T+Y++T+  I++  +   
Sbjct: 256 MDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVK-- 313

Query: 234 SKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
              +P   +   I +   G   +D+G   T L K  Y  + + VR  +KL    +   G 
Sbjct: 314 ---LPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGF 370

Query: 291 QLCY------KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAM 337
            LC       + PS+    P L     GGA         F PPP        EGV C A+
Sbjct: 371 DLCVNASGESRRPSL----PRLRFRLGGGA--------VFAPPPRNYFLETEEGVMCLAI 418

Query: 338 QPIDGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + ++   G  + GN  Q    + +D +   + F    C
Sbjct: 419 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 164/363 (45%), Gaps = 31/363 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           E+V+    GTP     Y ++ DTGSD+ W+QCLPC   CYKQ  PI++P  S++Y  + C
Sbjct: 119 EFVVTVGFGTP--AQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPC 176

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
              QC       CSS   C Y   Y D S T GVL+ E ++   S        FGCG  N
Sbjct: 177 GHPQCAAAGG-KCSSNGTCLYKVQYGDGSSTAGVLSHETLSL-TSARALPGFAFGCGETN 234

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G F + + GL+GLGR +LSL+SQ  +      FSYCL  ++T       +  G  +  S
Sbjct: 235 LGDFGDVD-GLIGLGRGQLSLSSQAAASF-GAAFSYCLPSYNTSHGY---LTIGTTTPAS 289

Query: 201 GG-GVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           G  GV  T+++ K+D  ++YFV L  I VG        I         ++    +D+G  
Sbjct: 290 GSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPI-------LFTRDGTLLDSGTV 342

Query: 259 PTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGA 314
            T LP + Y  L ++ +  +   K  P  DP      CY       I  P+++  F  G+
Sbjct: 343 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDPF---DTCYDFAGQNAIFMPLVSFKFSDGS 399

Query: 315 KVPLIHTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
              L      I P    P  G   F  +P      I GN  Q +  + YD  ++ + F  
Sbjct: 400 SFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVS 459

Query: 371 TDC 373
             C
Sbjct: 460 GSC 462


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 158/357 (44%), Gaps = 25/357 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
           EYV+   +GTP +      +DTGSD+ WVQC PC    CY Q   +++PA SS+Y+ +SC
Sbjct: 126 EYVISVGLGTPAVTQTV-TIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSC 184

Query: 81  QSEQCHLLDTV--SCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
            + +C  L+     C +    C Y   Y D S T G  + + +T   +++      FGC 
Sbjct: 185 AAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCS 244

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           H  +G F++   GL+GLG    SL SQ  +  G N FSYCL P    +S +S      G 
Sbjct: 245 HVESG-FSDQTDGLMGLGGGAQSLVSQTAAAYG-NSFSYCLPP----TSGSSGFLTLGGG 298

Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
               G V +  L S++  T+Y   L+ I+VG       L P   ++G++      +D+G 
Sbjct: 299 GGVSGFVTTRMLRSRQIPTFYGARLQDIAVGG--KQLGLSPSVFAAGSV------VDSGT 350

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKV 316
             T LP   Y+ L    +  +K       R     C+       I+ P +   F GGA +
Sbjct: 351 IITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAI 410

Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            L                FA    DG  GI GN  Q    + YD  S  + F+   C
Sbjct: 411 DLDPNGIMY----GNCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 118/365 (32%), Positives = 170/365 (46%), Gaps = 25/365 (6%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSS 74
           ++ TAN  YV+   +GTPP      + DTGSD  WVQC PCV  CYKQ   +++PA SS+
Sbjct: 157 SLGTAN--YVVPIGLGTPPSRFTV-VFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSST 213

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y  +SC    C  LD   C++   C Y   Y D S T G  A +  T   + +      F
Sbjct: 214 YANVSCADPACADLDASGCNAGH-CLYGIQYGDGSYTVGFFAKD--TLAVAQDAIKGFKF 270

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF- 193
           GCG  N G+F +   GL+GLGR   S+  Q   + G + FSYCL      S+ T  + F 
Sbjct: 271 GCGEKNRGLFGQT-AGLLGLGRGPTSITVQAYEKYGGS-FSYCL---PASSAATGYLEFG 325

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
                 SG    +T +++ +  T+Y+V L GI VG     +     +++SG +      +
Sbjct: 326 PLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTL------V 379

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHF 310
           D+G   T LP   Y  L      A+  + Y+     S L  CY    ++ ++ P ++  F
Sbjct: 380 DSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVF 439

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
            GGA + L   S  +    +   C  FA    D  VGI GN  Q    + YD   ++V F
Sbjct: 440 QGGACLDL-DASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGF 498

Query: 369 KPTDC 373
            P  C
Sbjct: 499 APGAC 503


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           G YV +  +GTP    +  +VDTGS L W+QC PCV  C++Q  P++NP +SSSY  +SC
Sbjct: 125 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSC 183

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            ++QC  L T      SCS+  +C Y   Y DSS + G L+ + ++FG+++    N  +G
Sbjct: 184 SAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 241

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           CG +N G+F ++  GL+GL R +LSL  Q+   +G + FSYCL    + SS    +   N
Sbjct: 242 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 299

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
             + S   + S+SL    D + YF+ + GI V          P   SS A S     ID+
Sbjct: 300 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 348

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
           G   T LP   Y+ L + V  A+K TP          C++  +     P +T  F GGA 
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 408

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + L   +  +        C A  P      I GN  Q    + YD  +  + F    C+
Sbjct: 409 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 111/361 (30%), Positives = 168/361 (46%), Gaps = 31/361 (8%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
           +T  G YV  + IGTPP   + G +D  SDL+W  C           P +NP  S++  +
Sbjct: 94  ATNAGMYVFSYGIGTPPQ-QVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVAD 144

Query: 78  LSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSL-TKGVLATERITFGNSNNFFDNVVFG 135
           + C  + C      +C +    C YTY Y   +  T G+L TE  TFG++    D VVFG
Sbjct: 145 VPCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR--IDGVVFG 202

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY--F 193
           CG  N G F+    G++GLGR  LSL    +SQL  ++FSY   P   D S+ ++ +  F
Sbjct: 203 CGLKNVGDFS-GVSGVIGLGRGNLSL----VSQLQVDRFSYHFAP---DDSVDTQSFILF 254

Query: 194 GNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGN 250
           G+ +       +ST L++ + + + Y+V L GI V   +L+  S      N  G+   G 
Sbjct: 255 GDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGS---GG 311

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPILTAH 309
           +F+      T+L +  Y  L + V + I L       LG  LCY   S+A    P +   
Sbjct: 312 VFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALV 371

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F GGA + L   + F      G+ C  + P   GD  + G+  Q    + YD +   + F
Sbjct: 372 FAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431

Query: 369 K 369
           +
Sbjct: 432 E 432


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 165/361 (45%), Gaps = 24/361 (6%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V + +GEY+++   GTP    +Y ++DTGSD+ W+ C  C  C+    PI++PA SSSYK
Sbjct: 108 VRSGSGEYIIQVDFGTPKQ-SMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYK 165

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
             +C S+ C  + + +C     C +   Y D +   G LA++ IT G  + +  N  FGC
Sbjct: 166 PFACDSQPCQEI-SGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLG--SQYLPNFSFGC 222

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
             + +   + +   +   G +   L     ++L    FSYCL    + S+ +  +  G  
Sbjct: 223 AESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL---PSSSTSSGSLVLGKE 279

Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           + VS   +  T+L+      T+YFVTL+ ISVGN   S   +P  N +   S G   ID+
Sbjct: 280 AAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRIS---VPGTNIA---SGGGTIIDS 333

Query: 256 GAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG 312
           G   T L    Y  L +  R   ++++ TP +D       CY   S +   P +T H D 
Sbjct: 334 GTTITHLVPSAYTALRDAFRQQLSSLQPTPVED----MDTCYDLSSSSVDVPTITLHLDR 389

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
              + L   +  I     G+ C A    D    I GN  Q +  I +D  +  V F    
Sbjct: 390 NVDLVLPKENILITQE-SGLACLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQ 447

Query: 373 C 373
           C
Sbjct: 448 C 448


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P       ++DTGSD+ WVQC PC QC+ Q  P+++P+SSS+Y   SC S
Sbjct: 127 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 185

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  +   CSS   C Y   Y D S T G  +++ +  G+S     +  FGC +  
Sbjct: 186 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VRSFQFGCSNVE 243

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G FN+   GL+GLG    SL SQ    LG   FSYCL P  + S     +  G      
Sbjct: 244 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 298

Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
             G V T ++ S +  T+Y V L+ I VG    S   IP      ++      +D+G   
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 350

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
           T LP   Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V L
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 410

Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +   +         FA    D  +GI GN  Q    + YD    +V F+   C
Sbjct: 411 DASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 114/364 (31%), Positives = 159/364 (43%), Gaps = 31/364 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV+   +GTP +  +  ++DTGSDL WVQC PC    CY Q  P+++P+ SS+Y  + C
Sbjct: 123 EYVVTVGLGTPSVSQVL-LIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPC 181

Query: 81  QSEQCHLLDT-------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
            ++ C  L          S      C +   Y D S T+GV + E +         D   
Sbjct: 182 NTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKD-FR 240

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCGH+  G  N+   GL+GLG    SL  Q  S  G   FSYCL   +      +    
Sbjct: 241 FGCGHDQDGA-NDKYDGLLGLGGAPESLVVQTASVYGG-AFSYCLPALNNQVGFLALGGG 298

Query: 194 GNGSE--VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
           G  S   V+  G V T ++ +E++T+Y V + GI+VG         P      A S G M
Sbjct: 299 GAPSGGVVNTSGFVFTPMI-REEETFYVVNMTGITVGGE-------PIDVPPSAFS-GGM 349

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAH 309
            ID+G   T L    YN L+   R A+   P    R G    CY     + +  P +   
Sbjct: 350 IIDSGTVVTELQHTAYNALQAAFRKAMAAYPLV--RNGELDTCYDFSGYSNVTLPKVALT 407

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           F GGA + L   +  +   ++    F     D   GI GN  Q  L + YD     V F+
Sbjct: 408 FSGGATIDLDVPNGIL---LDDCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFR 464

Query: 370 PTDC 373
              C
Sbjct: 465 AAVC 468


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 123/379 (32%), Positives = 161/379 (42%), Gaps = 36/379 (9%)

Query: 3   PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
           PA++ Y       ++ T N  YV+  S+GTP +      VDTGSDL WVQC PC     C
Sbjct: 128 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCSAAPSC 177

Query: 60  YKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLAT 117
           Y Q  P+++PA SSSY  + C    C  L     S  S   C Y   Y D S T GV ++
Sbjct: 178 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSS 237

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           + +T  ++++      FGCGH  +G+FN  + GL+GLGR + SL  Q     G   FSYC
Sbjct: 238 DTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYC 294

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L P    ++    +  G  S  + G   +  L S    TYY V L GISVG    S    
Sbjct: 295 L-PTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPAS 353

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK 295
            +         G   +DTG   T LP   Y  L    R+ +    Y        L  CY 
Sbjct: 354 AF--------AGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN 405

Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
                 +  P +   F  GA V L            G   FA    DG + I GN  Q  
Sbjct: 406 FAGYGTVTLPNVALTFGSGATVMLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRS 461

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             +    D   V FKP+ C
Sbjct: 462 FEV--RIDGTSVGFKPSSC 478


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 112/362 (30%), Positives = 167/362 (46%), Gaps = 53/362 (14%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
           IVDTGSDL WVQCLPC  CY Q +P++NP++SSS+  L C S  C  L   + SS     
Sbjct: 80  IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSN 139

Query: 96  --QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
                C+Y   Y D S ++G L  E++T G +    DN +FGCG NN G+F     GL+G
Sbjct: 140 KNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE--IDNFIFGCGRNNKGLFG-GASGLMG 196

Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM------YFGNGSEVSGGGVVST 207
           L R+ LSL SQ  S  G+  FSYCL      SS +  +       F N S +S   ++  
Sbjct: 197 LARSELSLVSQTSSLFGS-VFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQN 255

Query: 208 SLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
             +S     +YF+ L GIS+G ++ +   +P  +S+  +      +D+G   T L    Y
Sbjct: 256 PQMSN----FYFLNLTGISIGGVNLN---VPRLSSNEGVLS---LLDSGTVITRLSPSIY 305

Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APILTAHFDGGAKVPLIHTSTF 324
              + +     + + Y+     S L  C+       +  P +   F+G A++ +      
Sbjct: 306 KAFKAEFEK--QFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIV------ 357

Query: 325 IPPPVEGVF----------CFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
               VEGVF          C A   +  +    I GN+ Q +  + Y+     V F    
Sbjct: 358 ---DVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEP 414

Query: 373 CT 374
           C+
Sbjct: 415 CS 416


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 114/363 (31%), Positives = 169/363 (46%), Gaps = 55/363 (15%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
           IVDTGSDL WVQCLPC  CY Q +P++NP++SSS+  L C S  C  L   + SS     
Sbjct: 159 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSN 218

Query: 96  --QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
                C+Y   Y D S ++G L  E++T G +    DN +FGCG NN G+F     GL+G
Sbjct: 219 KNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE--IDNFIFGCGRNNKGLFG-GASGLMG 275

Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM------YFGNGSEVSGGGVVST 207
           L R+ LSL SQ  S  G+  FSYCL      SS +  +       F N S +S   ++  
Sbjct: 276 LARSELSLVSQTSSLFGS-VFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQN 334

Query: 208 SLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GAISKGNMFIDTGAPPTLLPKDF 266
             +S     +YF+ L GIS+G ++ +   +P  +S+ G +S     +D+G   T L    
Sbjct: 335 PQMSN----FYFLNLTGISIGGVNLN---VPRLSSNEGVLS----LLDSGTVITRLSPSI 383

Query: 267 YNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APILTAHFDGGAKVPLIHTST 323
           Y   + +     + + Y+     S L  C+       +  P +   F+G A++ +     
Sbjct: 384 YKAFKAEFEK--QFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIV----- 436

Query: 324 FIPPPVEGVF----------CFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
                VEGVF          C A   +  +    I GN+ Q +  + Y+     V F   
Sbjct: 437 ----DVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGE 492

Query: 372 DCT 374
            C+
Sbjct: 493 PCS 495


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P       ++DTGSD+ WVQC PC QC+ Q  P+++P+SSS+Y   SC S
Sbjct: 51  EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 109

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  +   CSS   C Y   Y D S T G  +++ +  G+S     +  FGC +  
Sbjct: 110 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VRSFQFGCSNVE 167

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G FN+   GL+GLG    SL SQ    LG   FSYCL P  + S     +  G      
Sbjct: 168 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 222

Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
             G V T ++ S +  T+Y V L+ I VG    S   IP      ++      +D+G   
Sbjct: 223 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 274

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
           T LP   Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V L
Sbjct: 275 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 334

Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +   +         FA    D  +GI GN  Q    + YD    +V F+   C
Sbjct: 335 DASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P       ++DTGSD+ WVQC PC QC+ Q  P+++P+SSS+Y   SC S
Sbjct: 197 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 255

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  +   CSS   C Y   Y D S T G  +++ +  G+S     +  FGC +  
Sbjct: 256 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VRSFQFGCSNVE 313

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G FN+   GL+GLG    SL SQ    LG   FSYCL P  + S     +  G      
Sbjct: 314 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 368

Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
             G V T ++ S +  T+Y V L+ I VG    S   IP      ++      +D+G   
Sbjct: 369 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 420

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
           T LP   Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V L
Sbjct: 421 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 480

Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +   +         FA    D  +GI GN  Q    + YD    +V F+   C
Sbjct: 481 DASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 177/380 (46%), Gaps = 37/380 (9%)

Query: 10  NNVVQSNVS-TANGEYVM-KFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
           NN  ++ VS +  G  +M   SIG PP+  +  ++DTGSD++WV C PC  C   +  ++
Sbjct: 85  NNEYKARVSPSLTGRTIMANISIGQPPIPQLV-VMDTGSDILWVMCTPCTNCDNHLGLLF 143

Query: 68  NPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN- 126
           +P+ SS++  L C++      D   CS      +T  YAD+S   G+   + + F  ++ 
Sbjct: 144 DPSMSSTFSPL-CKTP----CDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDE 198

Query: 127 --NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
             +   +V+FGCGHN     +    G++GL     SLA++I       KFSYC+      
Sbjct: 199 GTSRIPDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKI-----GQKFSYCIGDLADP 253

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
                ++  G G+++ G      S   +    +Y+VT+EGISVG       + P      
Sbjct: 254 YYNYHQLILGEGADLEG-----YSTPFEVHNGFYYVTMEGISVGE--KRLDIAPETFEMK 306

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKTPSM 299
               G + IDTG+  T L    +  L ++VRN +     + T  + P +  Q  Y + S 
Sbjct: 307 KNRTGGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWM--QCFYGSISR 364

Query: 300 AGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-----DGDVGIFGNFAQS 353
             +  P++T HF  GA + L  + +F     + VFC  + P+          + G  AQ 
Sbjct: 365 DLVGFPVVTFHFADGADLAL-DSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQ 423

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
              +GYD  +Q V F+  DC
Sbjct: 424 SYSVGYDLVNQFVYFQRIDC 443


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 112/338 (33%), Positives = 161/338 (47%), Gaps = 35/338 (10%)

Query: 58  QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVL 115
           +C  +  P + PASSS++ +L C S  C  L +  ++C++   C Y Y Y     T G L
Sbjct: 87  ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG-CVYYYPYG-MGFTAGYL 144

Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
           ATE +  G ++  F  V FGC   N GV N +  G+VGLGR+ LSL SQ+    G  +FS
Sbjct: 145 ATETLHVGGAS--FPGVAFGCSTEN-GVGNSSS-GIVGLGRSPLSLVSQV----GVGRFS 196

Query: 176 YCLVPFHTDSSI-TSKMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVG--NL 230
           YCL    +D+    S + FG+ ++V+GG      L + E    +YY+V L GI+VG  +L
Sbjct: 197 YCL---RSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDL 253

Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDP 286
             +S    +   +GA   G   +D+G   T L K+ Y  ++     Q+  A   T     
Sbjct: 254 PVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGT 313

Query: 287 RLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAM 337
           R G  LC+   +  G +    P L   F GGA+  +   S      V+      V C  +
Sbjct: 314 RFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLV 373

Query: 338 QPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            P      + I GN  Q DL + YD D  M SF P DC
Sbjct: 374 LPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 174/369 (47%), Gaps = 43/369 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           +++ FSIG PP+   Y ++DTGS L W+QC PC+ C++Q  P+YNP+SSS+Y      S+
Sbjct: 110 FLVNFSIGQPPVPQ-YAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVS---CSD 165

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
                 T + +    CNY+  YAD + T+G  A E++ F   ++      +V+FGCGHNN
Sbjct: 166 FDRTDTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNN 225

Query: 141 TGVFNEN--EMGLVGLGRTRLSLASQILSQLGANKFSYC-------LVPFHTDSSITSKM 191
           T +        G+ GLG +     S I+S+LG   FSYC       L  FH       ++
Sbjct: 226 TQLPGPTGYASGVFGLGDS----GSSIISKLGFG-FSYCIGNIGDPLYGFH-------RL 273

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
             GN  ++ G    ST LV    +  Y++TL GIS+G        I +           +
Sbjct: 274 TLGNKLKIEG---YSTPLVP---RGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRI 327

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCY---KTPSMAGIAPIL 306
            ID+GA  + +P+  YN + ++V + +   L+ Y+       LCY       + G  P  
Sbjct: 328 VIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGF-PDA 386

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQSDLFIGYDFDSQ 364
           T H   GA + +           + V C A+ P + D    + G  AQ    + YD   Q
Sbjct: 387 TFHLADGADL-VFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQ 445

Query: 365 MVSFKPTDC 373
            + F+  +C
Sbjct: 446 KLYFQRIEC 454


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 61/144 (42%), Positives = 88/144 (61%), Gaps = 4/144 (2%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S ++  +GEY  +  +GTPP   +Y ++DTGSD++W+QC PC +CY Q  P+++P  S
Sbjct: 163 VTSGLAQGSGEYFTRLGVGTPPKY-VYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKS 221

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            S+  +SC+S  C  LD+  C+S+Q C Y   Y D S T G  +TE +TF  +      V
Sbjct: 222 GSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR--VPKV 279

Query: 133 VFGCGHNNTGVFNENEMGLVGLGR 156
             GCGH+N G+F     GL+GLGR
Sbjct: 280 ALGCGHDNEGLF-VGAAGLLGLGR 302


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 169/371 (45%), Gaps = 42/371 (11%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           V  F+IGTPP      I+D   +L+W QC  C +C+KQ  P++ P +SS+++   C ++ 
Sbjct: 44  VANFTIGTPPQ-PASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDA 102

Query: 85  CHLLDTVSCSSQQLCNY---TYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
           C    T +CS   +C Y   T    D   T G++ TE    G +     ++ FGC   + 
Sbjct: 103 CKSTPTSNCSG-DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA---SLAFGCVVASD 158

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
               +   G +GLGRT  SL    ++Q+   KFSYCL P  T  S  S+++ G+ ++++G
Sbjct: 159 IDTMDGTSGFIGLGRTPRSL----VAQMKLTKFSYCLSPRGTGKS--SRLFLGSSAKLAG 212

Query: 202 GGVVSTS---LVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           G   ST+     S +D +  YY ++L+ I  GN + ++          A S G + + T 
Sbjct: 213 GESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTV 262

Query: 257 APPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFD 311
           +P +LL    Y   ++ V  A+      P   P     LC+K  +      AP L   F 
Sbjct: 263 SPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQ 322

Query: 312 GGAK--VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDS 363
           G A   VP    LI            +   A     G   V + G+  Q D+   YD   
Sbjct: 323 GAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKK 382

Query: 364 QMVSFKPTDCT 374
           + +SF+P DC+
Sbjct: 383 ETLSFEPADCS 393


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 168/365 (46%), Gaps = 35/365 (9%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
           +T  G YV  + IGTPP   + G +D  SDL+W  C           P +NP  S++  +
Sbjct: 94  ATNAGMYVFSYGIGTPPQ-QVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVAD 144

Query: 78  LSCQSEQCHLLDTVSCSS-----QQLCNYTYGYADSSL-TKGVLATERITFGNSNNFFDN 131
           + C  + C      +C +        C YTY Y   +  T G+L TE  TFG++    D 
Sbjct: 145 VPCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR--IDG 202

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           VVFGCG  N G F+    G++GLGR  LSL    +SQL  ++FSY   P   D S+ ++ 
Sbjct: 203 VVFGCGLQNVGDFS-GVSGVIGLGRGNLSL----VSQLQVDRFSYHFAP---DDSVDTQS 254

Query: 192 Y--FGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAI 246
           +  FG+ +       +ST L++ + + + Y+V L GI V   +L+  S      N  G+ 
Sbjct: 255 FILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGS- 313

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPI 305
             G +F+      T+L +  Y  L + V + I L       LG  LCY   S+A    P 
Sbjct: 314 --GGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPS 371

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGYDFDSQ 364
           +   F GGA + L   + F      G+ C  + P   GD  + G+  Q    + YD +  
Sbjct: 372 MALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGS 431

Query: 365 MVSFK 369
            + F+
Sbjct: 432 KLVFE 436


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 164/361 (45%), Gaps = 24/361 (6%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
           V + +GEY+++   GTP    +Y ++DTGSD+ W+ C  C  C+    PI++PA SSSYK
Sbjct: 108 VRSGSGEYIIQVDFGTPKQ-SMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYK 165

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
             +C S+ C  + + +C     C +   Y D +   G LA++ IT G  + +  N  FGC
Sbjct: 166 PFACDSQPCQEI-SGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLG--SQYLPNFSFGC 222

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
             + +     +   +   G +   L     ++L    FSYCL    + S+ +  +  G  
Sbjct: 223 AESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL---PSSSTSSGSLVLGKE 279

Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           + VS   +  T+L+      T+YFVTL+ ISVGN   S   +P  N +   S G   ID+
Sbjct: 280 AAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRIS---VPATNIA---SGGGTIIDS 333

Query: 256 GAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG 312
           G   T L    Y  L +  R   ++++ TP +D       CY   S +   P +T H D 
Sbjct: 334 GTTITYLVPSAYKDLRDAFRQQLSSLQPTPVED----MDTCYDLSSSSVDVPTITLHLDR 389

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
              + L   +  I     G+ C A    D    I GN  Q +  I +D  +  V F    
Sbjct: 390 NVDLVLPKENILITQE-SGLSCLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQ 447

Query: 373 C 373
           C
Sbjct: 448 C 448


>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
          Length = 739

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/246 (39%), Positives = 134/246 (54%), Gaps = 12/246 (4%)

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F  +  GCG NN G F+    G+VGLG   +SL S I   + + K+SYCLVP    +S T
Sbjct: 58  FPKIPIGCGLNNAGTFDSKCFGIVGLGGGVVSLISHIGLSIDS-KYSYCLVPLFEFNS-T 115

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS- 247
           SK+ FG  + V G G VST ++     T+Y++ LEG+SVG     SK I + ++S +   
Sbjct: 116 SKINFGENAVVEGLGTVSTPIIPGSFDTFYYLKLEGMSVG-----SKRIDFVDASTSNEL 170

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
           KGN+ ID+G   T+L ++FY +LE +V   I L           LCYK+P    I  PI+
Sbjct: 171 KGNIIIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILSLCYKSPPNNAIEVPII 230

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
           T HF  G  + L   +TF+    +    FA  P+     IFGN AQ +  +GYD   + V
Sbjct: 231 TTHF-AGVDIVLNSLNTFV-SVFDDAMWFAFAPV-ASGSIFGNLAQMNHLVGYDLLRKTV 287

Query: 367 SFKPTD 372
           SFKPTD
Sbjct: 288 SFKPTD 293


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 164/353 (46%), Gaps = 24/353 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EY++   +G+P +     ++DTGSD+ WVQC PC QC+ Q   +++P+SSS+Y   SC S
Sbjct: 126 EYLITVGMGSPAVAQTM-LIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTS 184

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
             C  L    CSS Q C YT  Y D S   G  +++ +  G+S    +N  FGC  + +G
Sbjct: 185 AACAQLRQRGCSSSQ-CQYTVKYGDGSTGSGTYSSDTLALGSST--VENFQFGCSQSESG 241

Query: 143 -VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
            +  +   GL+GLG    SLA+Q     G   FSYCL P     +  S  +   G+  SG
Sbjct: 242 NLLQDQTAGLMGLGGGAESLATQTAGTFG-KAFSYCLPP-----TPGSSGFLTLGASTSG 295

Query: 202 GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
             V +  L S +  +YY V L+ I VG    +   IP    + A S G++ +D+G   T 
Sbjct: 296 FVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLN---IP----ASAFSAGSI-MDSGTIITR 347

Query: 262 LPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIH 320
           LP+  Y+ L    +  +K  P   P      C+     + ++ P +   F GGA V L  
Sbjct: 348 LPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLAS 407

Query: 321 TSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
               +         FA    D  +GI GN  Q    + YD     V FK   C
Sbjct: 408 DGIIL----GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 109/363 (30%), Positives = 172/363 (47%), Gaps = 42/363 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G Y    ++G+PP  D   ++DTGSDL WV+C PC          ++  +S++YK L+C 
Sbjct: 1   GVYYSTITLGSPPK-DFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCA 56

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN----FFDNVVFGCG 137
            +                 Y+YGY D S T+G L+ + +    + +     F   VFGCG
Sbjct: 57  DD-----------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCG 99

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT-SKMYFGNG 196
               G+ +  E+G++ L    LS  SQI  + G NKFSYCL+     +S+  S M FG  
Sbjct: 100 SLLKGLIS-GEVGILALSPGSLSFPSQIGEKYG-NKFSYCLLRQTAQNSLKKSPMVFGEA 157

Query: 197 S---EVSGGGVVSTSLVSK--EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
           +   +  G G +     +   E   YY V L+GISVGN         + N      K  +
Sbjct: 158 AVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQ---DKPTI 214

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
           F D+G   T+LP    + +++ + + +    +   + G   C++ P  +G   P +T HF
Sbjct: 215 F-DSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIK-GLDACFRVPPSSGQGLPDITFHF 272

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           +GGA      ++  I   +  + C    P + +V IFGN  Q D F+ +D D++ + FK 
Sbjct: 273 NGGADFVTRPSNYVID--LGSLQCLIFVPTN-EVSIFGNLQQQDFFVLHDMDNRRIGFKE 329

Query: 371 TDC 373
           TDC
Sbjct: 330 TDC 332


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 88/222 (39%), Positives = 117/222 (52%), Gaps = 20/222 (9%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           N  T N  Y++   +G     D+  I+DTGSDL WVQC PC+ CY Q  P++ P++SSSY
Sbjct: 139 NFQTLN--YIVTMELGGQ---DMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSY 193

Query: 76  KELSCQSEQCHLLDTV-----SCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           + + C S  C  L        +C S    C+Y   Y D S T G L  E ++FG  +   
Sbjct: 194 QSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGIS--V 251

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
            N VFGCG NN G+F     GL+GLGR+ LSL SQ  S  G   FSYCL P  TD+  + 
Sbjct: 252 SNFVFGCGKNNKGLFG-GVSGLMGLGRSNLSLISQTNSTFGG-VFSYCLPP--TDAGASG 307

Query: 190 KMYFGNGSEVSGG--GVVSTSLV-SKEDKTYYFVTLEGISVG 228
            +  GN S V      +  T +V + +   +Y + L GI VG
Sbjct: 308 SLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 117/227 (51%), Gaps = 28/227 (12%)

Query: 24  YVMKFSIGTP---PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           YV   S+G     P  ++  IVDTGSDL WVQC PC  CY Q  P+++PA S++Y  + C
Sbjct: 92  YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRC 151

Query: 81  QSEQCHLLDTV--------SCSS----QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
            +  C   D++        SC S     + C Y   Y D S ++GVLAT+ +  G ++  
Sbjct: 152 NASACA--DSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS-- 207

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
               VFGCG +N G+F     GL+GLGRT LSL SQ  S+ G   FSYCL P  T    +
Sbjct: 208 LGGFVFGCGLSNRGLFG-GTAGLMGLGRTELSLVSQTASRYG-GVFSYCL-PAATSGDAS 264

Query: 189 SKMYFGNGSEVSGG-----GVVSTSLVSKEDK-TYYFVTLEGISVGN 229
             +  G G + +        V  T +++   +  +YF+ + G +VG 
Sbjct: 265 GSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG 311


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 157/370 (42%), Gaps = 37/370 (10%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMW--VQCLP-CVQCYKQVKPIYNPAS 71
           S +    GEY  +  +GTP    +  ++DTGSD++W  V+ LP  ++  +Q       A+
Sbjct: 113 SGLPQGTGEYFAQVGVGTPATTALM-VLDTGSDVVWAPVRALPPLLRAVRQGS--STGAA 169

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
            +     +C +  C  LD+  C  ++  C Y   Y D S+T G  A+E +TF        
Sbjct: 170 PAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV-Q 228

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            V  GCGH+N G+F      L    R RLS  SQI    G   FSYCLV   +       
Sbjct: 229 RVAIGCGHDNEGLFIAASGLLGLG-RGRLSFPSQIARSFG-RSFSYCLVDRTSSRRARPS 286

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
             +G    ++               T+Y+V L G SVG              +    +G 
Sbjct: 287 RRWGGTPRMA---------------TFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGG 331

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI-A 303
           + +D+G   T L +  Y  + +  R A   ++++P      G  L   CY       +  
Sbjct: 332 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVVKV 386

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
           P ++ H  GGA V L   +  IP    G FCFAM   DG V I GN  Q    + +D D+
Sbjct: 387 PTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDA 446

Query: 364 QMVSFKPTDC 373
           Q V F P  C
Sbjct: 447 QRVGFVPKSC 456


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 50/409 (12%)

Query: 5   TYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCY 60
           T F+  + ++S      G+Y++  + GTPP  ++  I DTGSDL+W+QC     P   C 
Sbjct: 35  TSFWAESPMESGAFLGLGQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPPAFCP 93

Query: 61  KQV---KPIYNPASSSSYKELSCQSEQCHLLDTV-----SCSSQQL--CNYTYGYADSSL 110
           K+    +P +  + S++   + C + QC L+        SCS      C Y Y YAD S 
Sbjct: 94  KKACSRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSS 153

Query: 111 TKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
           T G LA +  T  N  +       V FGCG  N G       G++GLG+ +LS  +Q  S
Sbjct: 154 TTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGS 213

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGIS 226
            L A  FSYCL+         S  +   G          T LVS     T+Y+V +  I 
Sbjct: 214 -LFAQTFSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIR 272

Query: 227 VGNLSNSSKLIPYYNSSGAI---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY 283
           VGN     +++P   S  AI     G   ID+G+  T L    Y  L      ++ L   
Sbjct: 273 VGN-----RVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHL--- 324

Query: 284 QDPRL--------GSQLCYKTPSMAGIAPI------LTAHFDGGAKVPLIHTSTFIPPPV 329
             PR+        G +LCY   S + +AP       LT  F  G  + L  T  ++    
Sbjct: 325 --PRIPSSATFFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLEL-PTGNYLVDVA 381

Query: 330 EGVFCFAMQPIDGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
           + V C A++P        + GN  Q    + +D  S  + F  T+C   
Sbjct: 382 DDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTECVAH 430


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 108/346 (31%), Positives = 162/346 (46%), Gaps = 29/346 (8%)

Query: 41  IVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSCQSEQCHLL------DTVSC 93
           I+DTGS L W+QC PC V C+ Q  P+Y+P+ S +YK+LSC S +C  L      D +  
Sbjct: 2   ILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCE 61

Query: 94  SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
           +    C YT  Y D+S + G L+ + +T  +S        +GCG +N G+F     G++G
Sbjct: 62  TDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT-LPQFTYGCGQDNQGLFGR-AAGIIG 119

Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE 213
           L R +LS+ +Q+ ++ G + FSYCL   ++ SS    +  G+ S  S     +  L   +
Sbjct: 120 LARDKLSMLAQLSTKYG-HAFSYCLPTANSGSSGGGFLSIGSISPTSYK--FTPMLTDSK 176

Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
           + + YF+ L  I+V             + + A+ +    ID+G   T LP   Y  L  Q
Sbjct: 177 NPSLYFLRLTAITVSGRP--------LDLAAAMYRVPTLIDSGTVITRLPMSMYAAL-RQ 227

Query: 274 VRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA--PILTAHFDGGAKVPLIHTSTFIPPPV 329
               I  T Y      S L  C+K  S+  I+  P +   F GGA + L   S  I    
Sbjct: 228 AFVKIMSTKYAKAPAYSILDTCFKG-SLKSISAVPEIKMIFQGGADLTLRAPSILIEAD- 285

Query: 330 EGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +G+ C A     G   + I GN  Q    I YD  +  + F P  C
Sbjct: 286 KGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 178/372 (47%), Gaps = 31/372 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
            QS ++   G YV+   +GTP   D   + DTGS + W QC PC+  CY Q +  ++P  
Sbjct: 124 AQSGIAIGTGNYVVTVGLGTPKE-DFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTK 182

Query: 72  SSSYKELSCQSEQCHLLDTVS--CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           S+SY  +SC S  C+LL T    CS S   C Y   Y D S ++G  ATE +T  +S++ 
Sbjct: 183 STSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTI-SSSDV 241

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F N +FGCG +N G+F +   GL+GL  + +SL SQ   +    +FSYCL      S+ +
Sbjct: 242 FTNFLFGCGQSNNGLFGQ-AAGLLGLSSSSVSLPSQTAEKY-QKQFSYCL-----PSTPS 294

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAIS 247
           S  Y   G +VS       + +S    ++Y + + GISV    +   + P  + +SGAI 
Sbjct: 295 STGYLNFGGKVS--QTAGFTPISPAFSSFYGIDIVGISVAG--SQLPIDPSIFTTSGAI- 349

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
                ID+G   T LP   Y  L+E     +   P  +       CY   +   ++ P +
Sbjct: 350 -----IDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKV 404

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
           +  F GG +V +   ++ I   V GV      FA    D + GIFGN  Q    + YD  
Sbjct: 405 SVSFKGGVEVDI--DASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGA 462

Query: 363 SQMVSFKPTDCT 374
             M+ F    C+
Sbjct: 463 KGMIGFAAGACS 474


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 158/357 (44%), Gaps = 36/357 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
           IVDTGSDL WVQC PC  CY Q  P+++P+ S+SY  + C +  C   L        SC+
Sbjct: 179 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238

Query: 95  S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
           +          + C Y+  Y D S ++GVLAT+ +  G ++   D  VFGCG +N G+F 
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRGLFG 296

Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
               GL+GLGRT LSL SQ   + G   FSYCL    +  +  S    G+ S       V
Sbjct: 297 -GTAGLMGLGRTELSLVSQTAPRFG-GVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV 354

Query: 206 S-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLP 263
           S T +++   +  +YF+ +           + +     ++  +   N+ +D+G   T L 
Sbjct: 355 SYTRMIADPAQPPFYFMNV---------TGASVGGAAVAAAGLGAANVLLDSGTVITRLA 405

Query: 264 KDFYN--RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIH 320
              Y   R E   +   +  P   P      CY       +  P+LT   +GGA + +  
Sbjct: 406 PSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDA 465

Query: 321 TSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                    +G   C AM  +  +    I GN+ Q +  + YD     + F   DC+
Sbjct: 466 AGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 158/357 (44%), Gaps = 36/357 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
           IVDTGSDL WVQC PC  CY Q  P+++P+ S+SY  + C +  C   L        SC+
Sbjct: 180 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239

Query: 95  S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
           +          + C Y+  Y D S ++GVLAT+ +  G ++   D  VFGCG +N G+F 
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRGLFG 297

Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
               GL+GLGRT LSL SQ   + G   FSYCL    +  +  S    G+ S       V
Sbjct: 298 -GTAGLMGLGRTELSLVSQTAPRFG-GVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV 355

Query: 206 S-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLP 263
           S T +++   +  +YF+ +           + +     ++  +   N+ +D+G   T L 
Sbjct: 356 SYTRMIADPAQPPFYFMNV---------TGASVGGAAVAAAGLGAANVLLDSGTVITRLA 406

Query: 264 KDFYN--RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIH 320
              Y   R E   +   +  P   P      CY       +  P+LT   +GGA + +  
Sbjct: 407 PSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDA 466

Query: 321 TSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                    +G   C AM  +  +    I GN+ Q +  + YD     + F   DC+
Sbjct: 467 AGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 161/367 (43%), Gaps = 29/367 (7%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
            S  +G Y +K  +G+P       IVDTGS L W+QC PCV  C+ Q  P+++P++S +Y
Sbjct: 6   ASIGSGNYYVKVGLGSPARYYSM-IVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 64

Query: 76  KELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           K LSC S QC  L      + +  +S  +C YT  Y DSS + G L+ + +T   S    
Sbjct: 65  KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQT-L 123

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
              V+GCG ++ G+F     G++GLGR +LS+  Q+ S+ G   FSYCL P        S
Sbjct: 124 PGFVYGCGQDSEGLFGR-AAGILGLGRNKLSMLGQVSSKFG-YAFSYCL-PTRGGGGFLS 180

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
               G  S        +       + + YF+ L  I+VG  +       Y        + 
Sbjct: 181 ---IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQY--------RV 229

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK--TPSMAGIAPIL 306
              ID+G   T LP   Y   ++     +     + P       C+K     M  + P +
Sbjct: 230 PTIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSV-PEV 288

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
              F GGA + L   +  +    EG+ C A    +G V I GN  Q    + +D  +  +
Sbjct: 289 RLIFQGGADLNLRPVNVLLQVD-EGLTCLAFAGNNG-VAIIGNHQQQTFKVAHDISTARI 346

Query: 367 SFKPTDC 373
            F    C
Sbjct: 347 GFATGGC 353


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 179/385 (46%), Gaps = 42/385 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
           +Y+ ++ IG PP      I+DTGS+L+W QC  C    C+ Q    Y+P+ S + K ++C
Sbjct: 83  QYIAEYLIGDPPQ-QAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVAC 141

Query: 81  QSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV--VFGC- 136
               C L     C+   + C     Y   ++  G L TE  TFG+  +  +NV   FGC 
Sbjct: 142 NDTACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNVSLAFGCI 200

Query: 137 -GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
                T    +   G++GLGR +LSL SQ    LG NKFSYCL P+ +D++ TS ++ G 
Sbjct: 201 TASRLTPGSLDGASGIIGLGRGKLSLPSQ----LGDNKFSYCLTPYFSDAANTSTLFVGA 256

Query: 196 GSEVSGGGVVSTS---LVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK- 248
            + +SGGG  +TS   L + +D    ++Y++ L GI+VG          +     A +K 
Sbjct: 257 SAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKW 316

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDPRLGSQLCYK--TPSMAG-IA 303
           G   ID+G+P T L    Y  L +++   +   + P      G  LC     P  AG + 
Sbjct: 317 GGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLV 376

Query: 304 PILTAHFDGGAKVPLIHTSTFIPP-----PV-EGVFCFAMQPIDG--------DVGIFGN 349
           P L  HF              +PP     PV +   C  +    G        +  I GN
Sbjct: 377 PPLVLHF---GSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGN 433

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
           + Q D+ + YD    ++SF+P DC+
Sbjct: 434 YMQQDMHLLYDLGQGVLSFQPADCS 458


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 168/366 (45%), Gaps = 38/366 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+   +IGTPP      I+    + +W QC PC +C+KQ  P++N ++SS+Y+   C + 
Sbjct: 28  YMANLTIGTPPQ-PASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTA 86

Query: 84  QCHLLDTVSCSSQQLCNYTYG--YADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
            C  +   +CS   +C+Y     + D+S   G+  T+    G +     ++ FGC  ++ 
Sbjct: 87  LCESVPASTCSGDGVCSYEVETMFGDTS---GIGGTDTFAIGTATA---SLAFGCAMDSN 140

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G+VGLGRT  SL    + Q+ A  FSYCL P H  +   S +  G  ++++G
Sbjct: 141 IKQLLGASGVVGLGRTPWSL----VGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLAG 195

Query: 202 G-GVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
           G    +T LV + +D + Y + LEGI  G++     + P  N S       + +DT    
Sbjct: 196 GKSAATTPLVNTSDDSSDYMIHLEGIKFGDV----IIAPPPNGS------VVLVDTIFGV 245

Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------PILTAHFDGG 313
           + L    +  +++ V  A+   P   P     LC+   + A  A      P +   F G 
Sbjct: 246 SFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGA 305

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           A +  +  S ++     G  C AM       +  ++ I G   Q ++   +D D + +SF
Sbjct: 306 AAL-TVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSF 364

Query: 369 KPTDCT 374
           +P DC+
Sbjct: 365 EPADCS 370


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/219 (38%), Positives = 121/219 (55%), Gaps = 9/219 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY  +  +GTP   + Y ++DTGSD+ W+QC PC +CY Q  PI+NP+ S
Sbjct: 146 VVSGMEQGSGEYFTRIGVGTP-TREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYS 204

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           +S+  + C S  C  LD   C S   C Y   Y D S + G  ATE +TFG ++    NV
Sbjct: 205 ASFSTVGCDSAVCSQLDAYDCHSGG-CLYEASYGDGSYSTGSFATETLTFGTTS--VANV 261

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
             GCGH N G+F      L+GLG   LS  +QI +Q G + FSYCLV   +DSS    + 
Sbjct: 262 AIGCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTG-HTFSYCLVDRESDSS--GPLQ 317

Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLS 231
           FG  S V  G + +    +    T+Y++++  IS+  ++
Sbjct: 318 FGPKS-VPVGSIFTPLEKNPHLPTFYYLSVTAISISAIA 355


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 105/341 (30%), Positives = 146/341 (42%), Gaps = 26/341 (7%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTV---SCS- 94
           +VDT SD+ WVQCLPC   QC+ Q  P+Y+PA SS++  + C S  C  L +     CS 
Sbjct: 172 VVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSP 231

Query: 95  SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
           +   C Y   Y D   T G   T+ +T  +      +  FGC H   G F+    G++ L
Sbjct: 232 TTDECKYIVNYGDGKATTGTYVTDTLTM-SPTIVVKDFRFGCSHAVRGSFSNQNAGILAL 290

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED 214
           G  R SL  Q     G N FSYC +P  + +   S    G   E S     +  + +K  
Sbjct: 291 GGGRGSLLEQTADAYG-NAFSYC-IPKPSSAGFLS---LGGPVEASLKFSYTPLIKNKHA 345

Query: 215 KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
            T+Y V LE I V        + P   ++GA+      +D+GA  T LP   Y  L    
Sbjct: 346 PTFYIVHLEAIIVAG--KQLAVPPTAFATGAV------MDSGAVVTQLPPQVYAALRAAF 397

Query: 275 RNAIKL-TPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGV 332
           R+A+    P   P      CY       +  P ++  F GGA + L   S  +    +G 
Sbjct: 398 RSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL----DGC 453

Query: 333 FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             FA  P +  VG  GN  Q    + YD     V F+   C
Sbjct: 454 LAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/369 (33%), Positives = 165/369 (44%), Gaps = 39/369 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
           +G Y +   +GTP   D   I DTGSDL W QC PCV+ CY Q + I+NP+ S+SY  +S
Sbjct: 150 SGNYFVTVGLGTPKK-DFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANIS 208

Query: 80  CQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           C S  C  L + +     C+S   C Y   Y DSS + G    E+++   + + F++  F
Sbjct: 209 CGSTLCDSLASATGNIFNCASST-CVYGIQYGDSSFSIGFFGKEKLSL-TATDVFNDFYF 266

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSIT 188
           GCG NN G+      GL+GLGR +LSL SQ  +Q     FSYCL        F T    T
Sbjct: 267 GCGQNNKGL-FGGAAGLLGLGRDKLSLVSQT-AQRYNKIFSYCLPSSSSSTGFLTFGGST 324

Query: 189 SK-MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           SK   F   + +SGG             ++Y + L GISVG      KL     S    S
Sbjct: 325 SKSASFTPLATISGG------------SSFYGLDLTGISVGG----RKLAI---SPSVFS 365

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
                ID+G   T LP   Y+ L    R  +   P          C+   +   I+ P +
Sbjct: 366 TAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKI 425

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
              F GG  V +  T  F    +  V   FA      DV IFGN  Q  L + YD  +  
Sbjct: 426 GLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGR 485

Query: 366 VSFKPTDCT 374
           V F P  C+
Sbjct: 486 VGFAPAGCS 494


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 177/382 (46%), Gaps = 57/382 (14%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKEL 78
           ++ G YV  F+IGTPP   +  +VD   +L+W QC PC  C++Q  P+++P  SS+++ L
Sbjct: 52  SSQGLYVANFTIGTPPQ-PVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110

Query: 79  SCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            C S  C  +   S  C+S  +C Y      +  T G+  T+    G +    + + FGC
Sbjct: 111 PCGSHLCESIPESSRNCTS-DVCIYE-APTKAGDTGGMAGTDTFAIGAAK---ETLGFGC 165

Query: 137 GHNNTGVFNENEM-------GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
                 V  +  +       G+VGLGRT  SL    ++Q+    FSYCL         + 
Sbjct: 166 -----VVMTDKRLKTIGGPSGIVGLGRTPWSL----VTQMNVTAFSYCLA-----GKSSG 211

Query: 190 KMYFGNGSEVSGGG-------VVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYY 240
            ++ G  ++   GG       V+ TS  S ++ +  YY V L GI  G        +   
Sbjct: 212 ALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAP-----LQAA 266

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           +SSG+     + +DT +  + L    Y  L++ +  A+ + P   P     LC+ + ++A
Sbjct: 267 SSSGS----TVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCF-SKAVA 321

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDV---GIFGNFAQ 352
           G AP L   FDGGA +  +  + ++     G  C  +       + G++    I G+  Q
Sbjct: 322 GDAPELVFTFDGGAAL-TVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQ 380

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            ++ + +D   + +SFKP DC+
Sbjct: 381 ENVHVLFDLKEETLSFKPADCS 402


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 117/413 (28%), Positives = 175/413 (42%), Gaps = 50/413 (12%)

Query: 1   MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPC 56
           ++  T F+  + ++S      G+Y++  + GTPP  ++  I DTGSDL+W+QC     P 
Sbjct: 30  LATTTSFWAESPMESGAFLGLGQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPP 88

Query: 57  VQCYKQV---KPIYNPASSSSYKELSCQSEQCHLLDT-------VSCSSQQLCNYTYGYA 106
             C K+    +P +  + S++   + C + QC L+          S ++   C Y Y YA
Sbjct: 89  AFCPKKACSRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYA 148

Query: 107 DSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLAS 163
           D S T G LA +  T  N  +       V FGCG  N G       G++GLG+ +LS  +
Sbjct: 149 DGSSTTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPA 208

Query: 164 QILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTL 222
           Q  S L A  FSYCL+         S  +   G          T LVS     T+Y+V +
Sbjct: 209 QSGS-LFAQTFSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGV 267

Query: 223 EGISVGNLSNSSKLIPYYNSSGAI---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
             I VGN     +++P   S  AI     G   ID+G+  T L    Y  L      ++ 
Sbjct: 268 VAIRVGN-----RVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH 322

Query: 280 LTPYQDPRL--------GSQLCYKTPSMAGIAPI------LTAHFDGGAKVPLIHTSTFI 325
           L     PR+        G +LCY   S +  AP       LT  F  G  + L  T  ++
Sbjct: 323 L-----PRIPSSATFFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLEL-PTGNYL 376

Query: 326 PPPVEGVFCFAMQPIDGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
               + V C A++P        + GN  Q    + +D  S  + F  T+C   
Sbjct: 377 VDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTECVAH 429


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 106/376 (28%), Positives = 158/376 (42%), Gaps = 68/376 (18%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC--LPCVQCYKQVKPIYNPASSSSYKELSC 80
           EY++  + GTPP  ++   +DTGSD+ W QC   P   C+ Q  P+++P++SSS+  L C
Sbjct: 87  EYLVHLAAGTPPQ-EVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPC 145

Query: 81  QSEQCHLLDTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGN-----SNNFFD 130
            S  C    T  C     ++ + CNY+  Y D S+++G +  E  TF +     S+    
Sbjct: 146 SSPACET--TPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVP 203

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
            +VFGCGH N GVF  NE G+ G GR  LSL     SQL    FS+C        S TS 
Sbjct: 204 GLVFGCGHANRGVFTSNETGIAGFGRGSLSLP----SQLKVGNFSHCFTTI--TGSKTSA 257

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           +  G    + G    S S + +   +Y        S    SNS   I             
Sbjct: 258 VLLG----LPGVAPPSASPLGRRRGSYRCR-----STPRSSNSGTSI------------- 295

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--APILTA 308
                    T LP   Y  + E+    +KL            C+  P        P +  
Sbjct: 296 ---------TSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMAL 346

Query: 309 HFDGGA-KVPLIHTSTFIPPPVEG--------VFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
           HF+G   ++P      ++   V+         + C A+  I+G   I GN  Q ++ + Y
Sbjct: 347 HFEGATMRLP---QENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNIQQQNMHVLY 401

Query: 360 DFDSQMVSFKPTDCTK 375
           D  +  +SF P  C +
Sbjct: 402 DLQNSKLSFVPAQCDQ 417


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/327 (31%), Positives = 147/327 (44%), Gaps = 30/327 (9%)

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATER 119
           P ++ ++SS+    SC S  C  L   SC +      Q C YTY Y D S+T G+L  ++
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDK 234

Query: 120 ITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            TFG   +    V FGCG  N GVF  NE G+ G GR  LSL     SQL    FS+C  
Sbjct: 235 FTFGAGASV-PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFT 289

Query: 180 PFHTDSSITSKM-----YFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNS 233
             +     T  +      + NG     G V ST L+ +  + T Y+++L+GI+VG     
Sbjct: 290 AVNGLKQSTVLLDLLADLYKNGR----GAVQSTPLIQNSANPTLYYLSLKGITVG----- 340

Query: 234 SKLIPYYNSSGAISKGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
           S  +P   S+ A++ G     ID+G   T LP   Y  + ++    IKL        G  
Sbjct: 341 STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY 400

Query: 292 LCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFG 348
            C+  PS A    P L  HF+G           F  P   G  + C A+  +  +    G
Sbjct: 401 TCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIG 460

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           NF Q ++ + YD  + M+SF    C K
Sbjct: 461 NFQQQNMHVLYDLQNNMLSFVAAQCDK 487



 Score = 41.6 bits (96), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 11/144 (7%)

Query: 224 GISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT 281
           GI+VG     S  +P   S+ A++ G     ID+G   T LP   Y  + ++    IKL 
Sbjct: 41  GITVG-----STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLP 95

Query: 282 PYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQ 338
                  G   C+  PS A    P L  HF+G           F  P   G  + C A+ 
Sbjct: 96  VVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN 155

Query: 339 PIDGDVGIFGNFAQSDLFIGYDFD 362
             D +  I GNF Q ++     FD
Sbjct: 156 KGD-ETTIIGNFQQQNMHALPYFD 178


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 66/157 (42%), Positives = 90/157 (57%), Gaps = 6/157 (3%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V ++ +  A GEY++K  IGTPP       +DT SDL+W QC PC  CY QV P++NP  
Sbjct: 77  VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135

Query: 72  SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           SS+Y  L C S+ C  LD   C     + C YTY Y+ ++ T+G LA +++  G   + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193

Query: 130 DNVVFGCGHNNT-GVFNENEMGLVGLGRTRLSLASQI 165
             V FGC  ++T G       G+VGLGR  LSL SQ+
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQL 230



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 51/214 (23%), Positives = 79/214 (36%), Gaps = 31/214 (14%)

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
           H D       Y  +G+  + G +    LV  ED         G++ G  ++S+   P   
Sbjct: 159 HDDDESCQYTYTYSGNATTEGTLAVDKLVIGED------AFRGVAFGCSTSSTGGAPPPQ 212

Query: 242 SSGAISKGN---------------MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
           +SG +  G                M ID  +  T L    Y+ L   +   I+L      
Sbjct: 213 ASGVVGLGRGPLSLVSQLSVRRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGS 272

Query: 287 RLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI 340
            LG  LC+  P   G+A      P +   FDG   + L     F      G+ C  +   
Sbjct: 273 SLGLDLCFILPD--GVAFDRVYVPAVALAFDG-RWLRLDKARLFAEDRESGMMCLMVGRA 329

Query: 341 D-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + G V I GNF Q ++ + Y+     V+F  + C
Sbjct: 330 EAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 363


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/377 (29%), Positives = 172/377 (45%), Gaps = 32/377 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
           ++S +S  +G Y +K  +GTP       IVDTGS L W+QC PCV  C+ QV PI+ P++
Sbjct: 102 LKSGLSIGSGNYYVKIGLGTPAKY-FSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPST 160

Query: 72  SSSYKELSCQSEQCHL-----LDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNS 125
           S +YK L C S QC       L+   CS+    C Y   Y D+S + G L+ + +T   S
Sbjct: 161 SKTYKALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPS 220

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL---VPFH 182
                  V+GCG +N G+F  +  G++GL   ++S+  Q+  + G N FSYCL       
Sbjct: 221 EAPSSGFVYGCGQDNQGLFGRSS-GIIGLANDKISMLGQLSKKYG-NAFSYCLPSSFSAP 278

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLSNSSKLIP 238
             SS++  +  G  S  S     +  + +++  + YF+ L  I+V      +S SS  +P
Sbjct: 279 NSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVP 338

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK-T 296
                         ID+G   T LP   YN L++     +     Q P       C+K +
Sbjct: 339 ------------TIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGS 386

Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
                  P +   F GGA + L   ++ +    +G  C A+      + I GN+ Q    
Sbjct: 387 VKEMSTVPEIQIIFRGGAGLELKAHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQTFK 445

Query: 357 IGYDFDSQMVSFKPTDC 373
           + YD  +  + F P  C
Sbjct: 446 VAYDVANFKIGFAPGGC 462


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 172/366 (46%), Gaps = 41/366 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC--LPCVQCYKQVKPIYNPASSSSYKELS 79
           G Y M+FS+GTPP   +  + DTGSDL+W +C       C  Q  P Y P +SS++ +L 
Sbjct: 89  GAYDMEFSMGTPPQ-KLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLP 147

Query: 80  CQSEQCHLL--DTVS--CSSQQLCNYTYGYA----DSSLTKGVLATERITFGNSNNFFDN 131
           C    C LL  D+V+   ++   C+Y Y Y     D   T+G LA E  T G   +   +
Sbjct: 148 CSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG--ADAVPS 205

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V FGC    T        G   +G  R  L+  ++SQL A+ F YCL    +D+S  S +
Sbjct: 206 VRFGC---TTASEGGYGSGSGLVGLGRGPLS--LVSQLNASTFMYCLT---SDASKASPL 257

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG+ + ++G  V ST L++    T+Y V L  IS+G+ +          + G      +
Sbjct: 258 LFGSLASLTGAQVQSTGLLAS--TTFYAVNLRSISIGSAT----------TPGVGEPEGV 305

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILT 307
             D+G   T L +  Y+  +    +   L   +D   G + C++ P+   ++    P + 
Sbjct: 306 VFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTD-GFEACFQKPANGRLSNAAVPTMV 364

Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            HFDG      +  + ++    +GV C+ +Q     + I GN  Q +  + +D    ++S
Sbjct: 365 LHFDGADMA--LPVANYVVEVEDGVVCWIVQR-SPSLSIIGNIMQVNYLVLHDVHRSVLS 421

Query: 368 FKPTDC 373
           F+P +C
Sbjct: 422 FQPANC 427


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 108/383 (28%), Positives = 174/383 (45%), Gaps = 44/383 (11%)

Query: 10  NNVVQSNVS-TANGEYVM-KFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
           NN  ++ VS +  G  +M   SIG PP+  +  ++DTGSD++WV C PC  C   +  ++
Sbjct: 85  NNDYKARVSPSLTGRTIMANISIGQPPIPQLV-VMDTGSDILWVMCTPCTNCDNDLGLLF 143

Query: 68  NPASSSSYKELS---CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
           +P+ SS++  L    C  E C   D +         +T  YAD+S   G    + + F  
Sbjct: 144 DPSKSSTFSPLCKTPCDFEGCR-CDPIP--------FTVTYADNSTASGTFGRDTVVFET 194

Query: 125 SN---NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
           ++   +   +V+FGCGHN     +    G++GL     SL +++       KFSYC+   
Sbjct: 195 TDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKL-----GQKFSYCIGNL 249

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
                   ++  G G+++ G      S   +    +Y+VT+EGISVG       + P   
Sbjct: 250 ADPYYNYHQLILGEGADLEG-----YSTPFEVYNGFYYVTMEGISVG--EKRLDIAPETF 302

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKT 296
                  G + IDTG+  T L    +  L ++VRN +     + T  + P +  Q  Y +
Sbjct: 303 EMKENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWM--QCFYGS 360

Query: 297 PSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNF 350
            S   +  P++T HF  GA + L  + +F     + VFC  + P     I     + G  
Sbjct: 361 ISRDLVGFPVVTFHFSDGADLAL-DSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLL 419

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
           AQ    +GYD  +Q V F+  DC
Sbjct: 420 AQQSYNVGYDLVNQFVYFQRIDC 442


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 168/370 (45%), Gaps = 40/370 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV  F+IGTPP      IVD   +L+W QC  C +C+KQ  P++ P +SS++K   C + 
Sbjct: 62  YVANFTIGTPPQ-PASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 120

Query: 84  QCHLLDTVSCSSQQLCNYTYGYAD-SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
            C  + T SCS   +C+Y          T G  AT+    G +      + FGC   +  
Sbjct: 121 VCESIPTRSCSG-DVCSYKGPPTQLRGNTSGFAATDTFAIGTATV---RLAFGCVVASDI 176

Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG 202
              +   G +GLGRT  SL    ++Q+   +FSYCL P +T  S  S+++ G+ ++++GG
Sbjct: 177 DTMDGPSGFIGLGRTPWSL----VAQMKLTRFSYCLSPRNTGKS--SRLFLGSSAKLAGG 230

Query: 203 GVVSTS---LVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
              ST+     S +D +  YY ++L+ I  GN + ++          A S G + + T +
Sbjct: 231 ESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTVS 280

Query: 258 PPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFDG 312
           P +LL    Y   ++ V  A+      P   P     LC+K  +      AP L   F G
Sbjct: 281 PFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 340

Query: 313 GAK--VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQ 364
            A   VP    LI            +   A     G   V + G+  Q D+   YD   +
Sbjct: 341 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 400

Query: 365 MVSFKPTDCT 374
            +SF+P DC+
Sbjct: 401 TLSFEPADCS 410


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 106/393 (26%), Positives = 165/393 (41%), Gaps = 47/393 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPAS 71
           V S  ST +G+Y +   +GTPP   +  + DTGSDL+WV+C  C  C +      +    
Sbjct: 78  VVSGASTGSGQYFVDLRLGTPPQ-KLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARH 136

Query: 72  SSSYKELSCQSEQCHLL---DTVSCSSQQL---CNYTYGYADSSLTKGVLATERITFGNS 125
           S+++    C    C L+       C+  +L   C Y Y Y D S T G  + E  T   S
Sbjct: 137 STTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTS 196

Query: 126 NNF---FDNVVFGC-----GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           +        + FGC     G + +G       G++GLGR  +SL+SQ+  + G NKFSYC
Sbjct: 197 SGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFG-NKFSYC 255

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNLSNS 233
           L+      S TS +  G+       G          ++    T+Y++ +E +SV  +   
Sbjct: 256 LMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIK-- 313

Query: 234 SKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
              +P   S  A+ +   G   +D+G   T LP+  Y ++   ++  ++L    +P  G 
Sbjct: 314 ---LPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGF 370

Query: 291 QLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPIDG 342
            LC     +     P L+    G         S F PPP        E V C A+Q +  
Sbjct: 371 DLCVNVSEIEHPRLPKLSFKLGG--------DSVFSPPPRNYFVDTDEDVKCLALQAVMT 422

Query: 343 DVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             G  + GN  Q    + +D D   + F    C
Sbjct: 423 PSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 99/325 (30%), Positives = 151/325 (46%), Gaps = 33/325 (10%)

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNN 127
           SS++K ++C    C     VS S+  +    C Y   Y D S+T G +  +  TF + N 
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61

Query: 128 F---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
                  + FGCG  NTG+F  NE G+ G GR   SL     SQL   +FSYCL      
Sbjct: 62  VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLP----SQLKVGRFSYCLT--LVT 115

Query: 185 SSITSKMYFGNGSEVSG------GGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLI 237
            S +S +  G   +  G      G   ST ++ +    T+Y+++LEGI+VG        +
Sbjct: 116 ESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTR-----L 170

Query: 238 PYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLC 293
           P+  S  A+ K   G   ID+G   T LP+  +  L+E++     L  Y + P +G +LC
Sbjct: 171 PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRLC 230

Query: 294 YKTPSMAGIAPI--LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNF 350
           ++ P      P+  L  H   GA + L   + F+  P  GV C  +    D  + + GNF
Sbjct: 231 FRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNF 289

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
            Q ++ + YD ++  + F P  C K
Sbjct: 290 QQQNMHVVYDVENNKLLFAPAQCDK 314


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 165/365 (45%), Gaps = 29/365 (7%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-----KPIYNPASS 72
           +T  G YV+ FS+GTPP + + G++D  SD +W+QC  C  C          P +    S
Sbjct: 91  ATNTGMYVLSFSVGTPPQV-VTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLS 149

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYAD--SSLTKGVLATERITFGNSNNFF 129
           S+ +E+ C +  C  L   +CS+    C Y+Y Y    ++ T G+LA +   F       
Sbjct: 150 STIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRA-- 207

Query: 130 DNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           D V+FGC      V  E ++ G++GLGR  LSL SQ+  Q+G  +FSY L P      + 
Sbjct: 208 DGVIFGC-----AVATEGDIGGVIGLGRGELSLVSQL--QIG--RFSYYLAP-DDAVDVG 257

Query: 189 SKMYFGNGSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAI 246
           S + F + ++      VST LV+ +  ++ Y+V L GI V         IP       A 
Sbjct: 258 SFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRV---DGEDLAIPRGTFDLQAD 314

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPI 305
             G + +    P T L    Y  + + + + I L       LG  LCY + S+A    P 
Sbjct: 315 GSGGVVLSITIPVTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPS 374

Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQ 364
           +   F GGA + L   + F      G+ C  + P   GD  + G+  Q    + YD    
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGS 434

Query: 365 MVSFK 369
            + F+
Sbjct: 435 RLVFE 439


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/374 (31%), Positives = 173/374 (46%), Gaps = 38/374 (10%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELSCQ 81
           EYV+   IGTP   +   + DTGSDL WVQC PC   CY+Q +P+++P+ SS+Y ++ C 
Sbjct: 125 EYVVTIGIGTPAR-NFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCG 183

Query: 82  SEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           + QC +     ++C     C Y+  Y D S+T+G LA E  T   S      VVFGC H 
Sbjct: 184 TPQCKIGGGQDLTCGGTT-CEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHE 242

Query: 140 -NTGVFN-ENEM---GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
            ++GV   E EM   GL+GLGR   S+ SQ       + FSYCL P       +S  Y  
Sbjct: 243 YSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPP-----RGSSAGYLT 297

Query: 195 NGSEVSGGGVVS-TSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            G+       +S T LV+   +  + Y V L GISV     S   +P   S+  I     
Sbjct: 298 IGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISV-----SGAALPIDASAFYI---GT 349

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAI-KLTPYQDPRLGS-QLCYKTPSMAGI-APILTA 308
            ID+G   T +P   Y  L ++ R  +   T   +  + S   CY       + AP +  
Sbjct: 350 VIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVAL 409

Query: 309 HFDGGAKVPLIHTSTFIPPPVEG------VFCFAMQPID--GDVGIFGNFAQSDLFIGYD 360
            F GGA++ +  +   +   V+       + C A  P +  G V I GN  Q    + +D
Sbjct: 410 EFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFV-IIGNMQQRAYNVVFD 468

Query: 361 FDSQMVSFKPTDCT 374
            + + + F    C+
Sbjct: 469 VEGRRIGFGANGCS 482


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 175/382 (45%), Gaps = 57/382 (14%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKEL 78
           ++ G YV  F+IGTPP   +  +VD   +L+W QC PC  C++Q  P+++P  SS+++ L
Sbjct: 52  SSQGLYVANFTIGTPPQ-PVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110

Query: 79  SCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            C S  C  +   S  C+S  +C Y      +  T G   T+    G +    + + FGC
Sbjct: 111 PCGSHLCESIPESSRNCTS-DVCIYE-APTKAGDTGGKAGTDTFAIGAAK---ETLGFGC 165

Query: 137 GHNNTGVFNENEM-------GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
                 V  +  +       G+VGLGRT  SL    ++Q+    FSYCL         + 
Sbjct: 166 -----VVMTDKRLKTIGGPSGIVGLGRTPWSL----VTQMNVTAFSYCLA-----GKSSG 211

Query: 190 KMYFGNGSEVSGGG-------VVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYY 240
            ++ G  ++   GG       V+ TS  S ++ +  YY V L GI  G        +   
Sbjct: 212 ALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAP-----LQAA 266

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           +SSG+     + +DT +  + L    Y  L++ +  A+ + P   P     LC+   ++A
Sbjct: 267 SSSGS----TVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPK-AVA 321

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDV---GIFGNFAQ 352
           G AP L   FDGGA +  +  + ++     G  C  +       + G++    I G+  Q
Sbjct: 322 GDAPELVFTFDGGAAL-TVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQ 380

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            ++ + +D   + +SFKP DC+
Sbjct: 381 ENVHVLFDLKEETLSFKPADCS 402


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 158/361 (43%), Gaps = 33/361 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK--QVKPIYNPASSSSYKELSC 80
           +YV+  S+GTP +      VDTGSD+ WVQC PC       Q   +++PA SSSY  + C
Sbjct: 499 QYVVTVSLGTPGVAQTV-EVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPC 557

Query: 81  QSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            ++ C  L T    C++   C Y   Y D S T GV  ++ +T  +++      +FGCGH
Sbjct: 558 AADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLTDADAV-TGFLFGCGH 616

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G+F   + GL+ LGR  +SL SQ     G   FSYCL P     S +S  +   G  
Sbjct: 617 AQAGLFAGID-GLLALGRKGMSLTSQTSGAYGGGVFSYCLPP-----SPSSTGFLTLGGP 670

Query: 199 VSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
            S  G  +T L++  D  T+Y V L GI VG    S   +P      +   G   +DTG 
Sbjct: 671 SSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSG--VP-----ASAFAGGTVVDTGT 723

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILTAHFDG 312
             T LP   Y     +      + PY  P   +      CY       +  P ++  F G
Sbjct: 724 VITRLPPTAYA--ALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFSG 781

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           GA + L     F+     G   FA    DGD  I GN  Q    +   FD   V F P  
Sbjct: 782 GATLKL-DAPGFL---SSGCLAFATNSGDGDPAILGNVQQRSFAV--RFDGSSVGFMPHS 835

Query: 373 C 373
           C
Sbjct: 836 C 836


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 162/364 (44%), Gaps = 27/364 (7%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-----KPIYNPASS 72
           +T  G YV+ FS+GTPP + + G++D  SD +W+QC  C  C          P +    S
Sbjct: 91  ATNTGMYVLSFSVGTPPQV-VTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLS 149

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYAD--SSLTKGVLATERITFGNSNNFF 129
           S+ +E+ C +  C  L   +CS+    C Y+Y Y    ++ T G+LA +   F       
Sbjct: 150 STIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRA-- 207

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           D V+FGC     G       G++GLGR  LS  SQ+  Q+G  +FSY L P      + S
Sbjct: 208 DGVIFGCAVATEGDIG----GVIGLGRGELSPVSQL--QIG--RFSYYLAP-DDAVDVGS 258

Query: 190 KMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAIS 247
            + F + ++      VST LV S+  ++ Y+V L GI V         IP       A  
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRV---DGEDLAIPRGTFDLQADG 315

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPIL 306
            G + +    P T L    Y  + + + + I+L       LG  LCY + S+A    P +
Sbjct: 316 SGGVVLSITIPVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSM 375

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQM 365
              F GGA + L   + F      G+ C  + P   GD  + G+  Q    + YD     
Sbjct: 376 ALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSR 435

Query: 366 VSFK 369
           + F+
Sbjct: 436 LVFE 439


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/165 (45%), Positives = 92/165 (55%), Gaps = 9/165 (5%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S  S  +GEY  +  IG PP    Y ++DTGSD+ WVQC PC  CY+Q  PI+ P +S+S
Sbjct: 123 SGTSQGSGEYFSRIGIGEPPS-QAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASAS 181

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y  LSC++ QC  LD   C +   C Y   Y D S T G   TE +T G   N   NV  
Sbjct: 182 YAPLSCEAAQCRYLDQSQCRNGN-CLYQVSYGDGSYTVGDFVTETVTIG--VNKVKNVAL 238

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
           GCGHNN G+F     GL+GLG   LS      +QL +  FSYCLV
Sbjct: 239 GCGHNNEGLF-VGAAGLIGLGGGPLSFP----AQLNSTSFSYCLV 278


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 127/365 (34%), Positives = 177/365 (48%), Gaps = 31/365 (8%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
           +G Y++   +GTP   D+  I DTGSD+ W QC PC + CYKQ + I++P+ S+SY  +S
Sbjct: 146 SGNYIVTVGLGTPKK-DLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNIS 204

Query: 80  CQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           C S  C+ L     +T  C+S   C Y   Y DSS + G   TE++T   S + F+N+ F
Sbjct: 205 CSSSICNSLTSATGNTPGCASSA-CVYGIQYGDSSFSVGFFGTEKLTL-TSTDAFNNIYF 262

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYF 193
           GCG NN         GL+GLGR +LS+ SQ   +   NK FSYCL    + SS T  + F
Sbjct: 263 GCGQNNQ-GLFGGSAGLLGLGRDKLSVVSQTAQKY--NKIFSYCL---PSSSSSTGFLTF 316

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNM 251
           G GS          S +S    ++Y +   GISVG   L+ S+ +   ++++GAI     
Sbjct: 317 G-GSASKNAKFTPLSTIS-AGPSFYGLDFTGISVGGKKLAISASV---FSTAGAI----- 366

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
            ID+G   T LP   Y+ L    RN +   P          CY   S   I+ P +   F
Sbjct: 367 -IDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSF 425

Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
             G +V +  T       +  V   FA      DV IFGN  Q  L + YD  +  V F 
Sbjct: 426 SSGIEVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFA 485

Query: 370 PTDCT 374
           P  C+
Sbjct: 486 PGGCS 490


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 174/370 (47%), Gaps = 50/370 (13%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           V  F+IGTPP       +D   +L+W QC  C+ C+KQ  P++ P +SS++K   C ++ 
Sbjct: 55  VANFTIGTPPQA-ASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDV 113

Query: 85  CHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVF 144
           C  + T  C+S  +C Y         T G++AT+    G +     ++ FGC   +    
Sbjct: 114 CKSIPTPKCAS-DVCAYDGVTGLGGHTVGIVATDTFAIGTAAP--ASLGFGCVVASDIDT 170

Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGV 204
                G +GLGRT  SL    ++Q+   +FSYCL P   D+   S+++ G  ++++GGG 
Sbjct: 171 MGGPSGFIGLGRTPWSL----VAQMKLTRFSYCLAPH--DTGKNSRLFLGASAKLAGGGA 224

Query: 205 VSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
            +  + +  +     YY + LE I  G   +++  +P   ++  +    + +      +L
Sbjct: 225 WTPFVKTSPNDGMSQYYPIELEEIKAG---DATITMPRGRNTVLVQTAVVRV------SL 275

Query: 262 LPKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYKTPSMAGIAPILTAHFDGGAKVPLI 319
           L    Y   ++ V  ++   P   P +G+  ++C+    ++G AP L   F  GA + + 
Sbjct: 276 LVDSVYQEFKKAVMASVGAAPTATP-VGAPFEVCFPKAGVSG-APDLVFTFQAGAALTV- 332

Query: 320 HTSTFIPPPVEGVF-------CFAMQPI--------DGDVGIFGNFAQSDLFIGYDFDSQ 364
                  PP   +F       C ++  I        DG + I G+F Q ++ + +D D  
Sbjct: 333 -------PPANYLFDVGNDTVCLSVMSIALLNITALDG-LNILGSFQQENVHLLFDLDKD 384

Query: 365 MVSFKPTDCT 374
           M+SF+P DC+
Sbjct: 385 MLSFEPADCS 394


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 112/400 (28%), Positives = 177/400 (44%), Gaps = 70/400 (17%)

Query: 8   YPNNVVQSNVSTANG-------------EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCL 54
           YPN   QSN ST  G             E  +   IGTP + ++  + DT SDL+W QC 
Sbjct: 60  YPNEGDQSN-STRRGLSSTPGGVQEKHVEPHVFLGIGTPAM-NVTLVFDTTSDLLWTQCQ 117

Query: 55  PCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGV 114
           PC+ C  Q   +Y+P  + +Y  L+  S                  Y Y Y+  S T G 
Sbjct: 118 PCLSCVAQAGDMYDPNKTETYANLTSSS------------------YNYTYSKQSFTSGY 159

Query: 115 LATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
            ATE    GN      N+ FGCG  N G +  + +  V            +L+QLG ++F
Sbjct: 160 FATETFALGNVT--VANITFGCGTRNQGYY--DNVAGVFGVGRGGRGGVSLLNQLGIDRF 215

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-----LVSKEDKTYYFVTLEGISVGN 229
           SYC     + +  +S ++ G   E++     + +     +     K+ YFV L G++VG 
Sbjct: 216 SYCFS--SSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVG- 272

Query: 230 LSNSSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDP 286
               + L+    +S A   G  + ID+ +P T+L +  Y      VR A+  +L P ++ 
Sbjct: 273 ----ATLVDVAGASSAEGGGRALVIDSTSPVTVLDEATYG----PVRRALVAQLAPLKEA 324

Query: 287 R------LGSQLCYKTPSMAGIAP-----ILTAHFDGGAKVPLIHTSTFIPP-PVEGVFC 334
                  +G  LC++  +  G  P      +T HFDGGA   ++  ++++      G+ C
Sbjct: 325 NANASAGVGLDLCFEL-AAGGATPTPPNVTMTLHFDGGAADLVLPPASYLAKDSAGGLIC 383

Query: 335 FAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             M P   + V + G++A  D  + YD    +VSF+P DC
Sbjct: 384 LTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQPLDC 423


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/372 (28%), Positives = 169/372 (45%), Gaps = 43/372 (11%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           V  F+IGTPP      I+D   +L+W QC  C +C+KQ  P++ P +SS+++   C ++ 
Sbjct: 44  VANFTIGTPPQ-PASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDA 102

Query: 85  CHLLDTVSCSSQQLCNY---TYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
           C    T +CS   +C Y   T    D   T G++ TE    G +     ++ FGC   + 
Sbjct: 103 CKSTPTSNCSG-DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA---SLAFGCVVASD 158

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
               +   G +GLGRT  SL    ++Q+   KFSYCL P  T  S  S+++ G+ ++++G
Sbjct: 159 IDTMDGTSGFIGLGRTPRSL----VAQMKLTKFSYCLSPRGTGKS--SRLFLGSSAKLAG 212

Query: 202 GGVVSTS---LVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           G   ST+     S +D +  YY ++L+ I  GN + ++          A S G + + T 
Sbjct: 213 GESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTV 262

Query: 257 APPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFD 311
           +P +LL    Y   ++ V  A+      P   P     LC+K  +      AP L   F 
Sbjct: 263 SPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQ 322

Query: 312 GGAK---VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFD 362
           GG     VP    LI            +   A     G   V + G+  Q ++   YD  
Sbjct: 323 GGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLK 382

Query: 363 SQMVSFKPTDCT 374
            + +SF+P DC+
Sbjct: 383 KETLSFEPADCS 394


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 165/363 (45%), Gaps = 33/363 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
           EYV    +GTP +     I+DTGS L WVQC PC   QCY Q  P+++P +SSSY  + C
Sbjct: 128 EYVATVGLGTPAVPQTL-ILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPC 186

Query: 81  QSEQCHLL----DTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S++C  L    D   C+S     C Y   Y   +   G  +T+ +T G          F
Sbjct: 187 DSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLG-PGAIVKRFHF 245

Query: 135 GCGHN-NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           GCGH+   G F+  + G++GLGR   SLA Q  ++ G   FS+CL P     +  S  + 
Sbjct: 246 GCGHHQQRGKFDMAD-GVLGLGRLPQSLAWQASARRGGGVFSHCLPP-----TGVSTGFL 299

Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTL-EGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
             G+       V T L++ +D+ +++  +   ISV G L +    IP      A+ +  +
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLD----IP-----PAVFREGV 350

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
             D+G   + L +  Y  L    R+A+   P   P      C+       +  P ++  F
Sbjct: 351 ITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTF 410

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
            GGA V L  +S  +   ++G   F     D   G+ G+ +Q  + + YD   + V F+ 
Sbjct: 411 RGGATVHLDASSGVL---MDGCLAF-WSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRT 466

Query: 371 TDC 373
             C
Sbjct: 467 GAC 469


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 150/366 (40%), Gaps = 31/366 (8%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELS 79
           G  +       P +L     +DT  D+ W+QCLPC+  QCY Q    ++P  SS+   + 
Sbjct: 143 GAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVR 202

Query: 80  CQSEQCHLLDTVS--CS---SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           C S  C  L   +  CS   S   C Y   Y+D  LT G   T+ +T   S  F  N  F
Sbjct: 203 CGSRACRTLGGYANGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFL-NFRF 261

Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           GC H   G F+    G + LG    SL SQ     G N FSYC VP  + +   S     
Sbjct: 262 GCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYG-NAFSYC-VPGPSAAGFLSIGGPV 319

Query: 195 NGSEVSGGGVVSTSLVSKE----DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           NG +  G G  +T+ + +     + T Y V L+GI V             N    +  G 
Sbjct: 320 NGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIEVAGRR--------LNVPPVVFSGG 371

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAH 309
             +D+ A  T LP   Y  L    RNA++    + P      C+    ++ +  P ++  
Sbjct: 372 TVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLV 431

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQSDLFIGYDFDSQMVS 367
           FDGGA + L   S  +        C A  P+  D  +G  GN  Q    + YD     V 
Sbjct: 432 FDGGAVIELGLLSVLLDS------CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVG 485

Query: 368 FKPTDC 373
           F+   C
Sbjct: 486 FRHGAC 491


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 158/361 (43%), Gaps = 38/361 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV+  SIGTP +     ++DTGSD+ WV C         +   ++P  SS+Y   SC S 
Sbjct: 125 YVITVSIGTPAMTQAV-MIDTGSDVSWVHCHARAGAGSSL--FFDPGKSSTYTPFSCSSA 181

Query: 84  QCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN- 140
            C  L+     CS    C YT  Y D S T G   ++ +   NS    +N  FGC   + 
Sbjct: 182 ACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLAL-NSTEKVENFQFGCSETSD 240

Query: 141 --TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G+  +   GL+GLG    SL SQ  +  G + FSYCL P  T SS     +   G+ 
Sbjct: 241 PGEGLDEDQTDGLMGLGGGAPSLVSQTAATYG-SAFSYCL-PATTRSS----GFLTLGAS 294

Query: 199 VSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
               G V+T +  S+   T+YFV L+GI+VG   +   + P   ++G+I      +D+G 
Sbjct: 295 TGTSGFVTTPMFRSRRAPTFYFVILQGINVGG--DPVAISPTVFAAGSI------MDSGT 346

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKV 316
             T LP   Y+ L    R  ++  P          C+       ++ P +   F GGA V
Sbjct: 347 IITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFSGGAVV 406

Query: 317 PLIHTSTFIPPPVEGVF---CFAMQPIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTD 372
            L           +G+    C A  P  G +G I GN  Q    + +D    ++ F+P  
Sbjct: 407 DL---------DADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRPGA 457

Query: 373 C 373
           C
Sbjct: 458 C 458


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 113/361 (31%), Positives = 156/361 (43%), Gaps = 36/361 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+  SIG+P +      +DTGSD+ W++C            +Y+P +SS+Y   SC +
Sbjct: 130 EYVITVSIGSPAVAXTM-FIDTGSDVSWLRC---------KSRLYDPGTSSTYAPFSCSA 179

Query: 83  EQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNFFDNVVFGCGHN 139
             C  L      CSS   C Y+  Y D S T G   ++ +T  G S        FGC   
Sbjct: 180 PACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAV 239

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
             G   +N  GL+GLG    S  SQ  +  G + FSYCL P    S     +  G  S  
Sbjct: 240 EHGFEEDNTDGLMGLGGDAQSFVSQTAATYG-SAFSYCLPPTWNSSGF---LTLGAPSSS 295

Query: 200 SGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           +     +T ++ SK+  T+Y + L GISVG     +  IP    S   S G++ +D+G  
Sbjct: 296 TSAAFSTTPMLRSKQAATFYGLLLRGISVGG---KTLEIP----SSVFSAGSI-VDSGTV 347

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYK-TPSMAG---IAPILTAHFDG 312
            T LP   Y  L    R+ +    YQ   PR     C+  T    G     P +    DG
Sbjct: 348 ITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDG 407

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           GA V L H +  +    +G   FA    DG  GI GN  Q    + YD    +  F+P  
Sbjct: 408 GAVVDL-HPNGIVQ---DGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGA 463

Query: 373 C 373
           C
Sbjct: 464 C 464


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 174/378 (46%), Gaps = 46/378 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  +  +GTPP    Y  VDTGSD+ WV C+PC  C +         I++P  S+S  
Sbjct: 46  GLYYTRIYLGTPPQ-QFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKT 104

Query: 77  ELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITF-----GNS--NNF 128
            +SC  E+C+L     CS   + C Y+  Y D S T G L  + ++F     GNS   + 
Sbjct: 105 SISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSG 164

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSI 187
              + FGCG N TG +  +  GLVG G+  +SL SQ+  Q +  N F++CL     D+  
Sbjct: 165 TARLTFGCGSNQTGTWLTD--GLVGFGQAEVSLPSQLSKQNVSVNIFAHCL---QGDNKG 219

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAI 246
           +  +  G+  E    G+V T +V K+  ++Y V L  I V G    +       NS G I
Sbjct: 220 SGTLVIGHIRE---PGLVYTPIVPKQ--SHYNVELLNIGVSGTNVTTPTAFDLSNSGGVI 274

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
                 +D+G   T L +  Y++ + +VR+ ++      P      C    ++ G  P +
Sbjct: 275 ------MDSGTTLTYLVQPAYDQFQAKVRDCMRSGVL--PVAFQFFC----TIEGYFPNV 322

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAMQPIDGDVG-----IFGNFAQSDLFIG 358
           T +F GGA + L  +S      +      +CF+        G     IFG+    D  + 
Sbjct: 323 TLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVV 382

Query: 359 YDFDSQMVSFKPTDCTKQ 376
           YD  +  + +K  DCTK+
Sbjct: 383 YDNVNNRIGWKNFDCTKE 400


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 165/370 (44%), Gaps = 40/370 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV  F+IGTPP      IVD   +L+W QC  C +C+KQ  P++ P +SS++K   C + 
Sbjct: 45  YVANFTIGTPPQ-PASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 103

Query: 84  QCHLLDTVSCSSQQLCNYTYGYAD-SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
            C  + T SCS   +C+Y          T G  AT+    G +      + FGC   +  
Sbjct: 104 VCESIPTRSCSG-DVCSYKGPPTQLRGNTSGFAATDTFAIGTATV---RLAFGCVVASDI 159

Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG 202
              +   G +GLGRT  SL    ++Q+   +FSYCL P +T  S  S+++ G+ ++++G 
Sbjct: 160 DTMDGPSGFIGLGRTPWSL----VAQMKLTRFSYCLSPRNTGKS--SRLFLGSSAKLAGS 213

Query: 203 GVVSTSLVSK-----EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
              ST+   K     +   YY ++L+ I  GN + ++          A S G + + T +
Sbjct: 214 ESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTVS 263

Query: 258 PPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFDG 312
           P +LL    Y   ++ V  A+      P   P     LC+K  +      AP L   F G
Sbjct: 264 PFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 323

Query: 313 GAK--VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQ 364
            A   VP    LI            +   A     G   V + G+  Q D+   YD   +
Sbjct: 324 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 383

Query: 365 MVSFKPTDCT 374
            +SF+P DC+
Sbjct: 384 TLSFEPADCS 393


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 169/360 (46%), Gaps = 27/360 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
           E+V+    G+P       + DTGSDL W+QC PC   CYKQ  P+++PA SSSY  + C 
Sbjct: 111 EFVVVVGFGSPAQTSAT-MFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCG 169

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
           + +C       C+    C Y   Y D S T GVLA E +TF +S+  F   +FGCG  N 
Sbjct: 170 TTECAAAGG-ECNGTT-CVYGVEYGDGSSTTGVLARETLTFSSSSE-FTGFIFGCGETNL 226

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
           G F E +  L+GLGR  LSL+SQ     G   FSYCL  ++     T+  Y   G+    
Sbjct: 227 GDFGEVDG-LLGLGRGSLSLSSQAAPAFG-GIFSYCLPSYN-----TTPGYLSIGATPVT 279

Query: 202 GG--VVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
           G   V  T++V+K D  ++YF+ L  I++G       ++P   S    +K    +D+G  
Sbjct: 280 GQIPVQYTAMVNKPDYPSFYFIELVSINIGGY-----VLPVPPSE--FTKTGTLLDSGTI 332

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVP 317
            T LP   Y  L ++ +  ++ +    P      CY     +GI  P ++ +F  GA   
Sbjct: 333 LTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGAVFN 392

Query: 318 L----IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           L    I T      P  G   F  +P D    + G+  Q    + YD  +Q + F P  C
Sbjct: 393 LNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 152/361 (42%), Gaps = 25/361 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELS 79
           EYV+   +G+P +     ++DTGSD+ WVQC PC     C+     +++PA+SS+Y   +
Sbjct: 134 EYVISVGLGSPAMTQRV-VIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFN 192

Query: 80  CQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           C +  C  L    +   C ++  C Y   Y D S T G  +++ +T   S +      FG
Sbjct: 193 CSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGS-DVVRGFQFG 251

Query: 136 CGHNNTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C H   G   ++   GL+GLG    SL SQ  ++ G   FSYCL      S   +     
Sbjct: 252 CSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYG-KSFSYCLPATPASSGFLTLGAPA 310

Query: 195 NGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           +G         +T ++ SK+  TYYF  LE I+VG       L P   ++G++      +
Sbjct: 311 SGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGG--KKLGLSPSVFAAGSL------V 362

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
           D+G   T LP   Y  L    R  +      +P      C+    +  ++ P +   F G
Sbjct: 363 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAG 422

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           GA V L            G   FA    D   G  GN  Q    + YD    +  F+   
Sbjct: 423 GAVVDLDAHGIV----SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVFGFRAGA 478

Query: 373 C 373
           C
Sbjct: 479 C 479


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 115/378 (30%), Positives = 174/378 (46%), Gaps = 34/378 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
           ++S +S  +G Y +K  +GTP       IVDTGS L W+QC PCV  C+ QV PI+ P+ 
Sbjct: 96  LKSGLSIGSGNYYVKIGVGTPAKY-FSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSV 154

Query: 72  SSSYKELSCQSEQCHL-----LDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNS 125
           S +YK LSC S QC       L+   CS+    C Y   Y D+S + G L+ + +T   S
Sbjct: 155 SKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPS 214

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL---VPFH 182
                  V+GCG +N G+F  +  G++GL   +LS+  Q+ ++ G N FSYCL       
Sbjct: 215 AAPSSGFVYGCGQDNQGLFGRSA-GIIGLANDKLSMLGQLSNKYG-NAFSYCLPSSFSAQ 272

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLSNSSKLIP 238
            +SS++  +  G  S  S     +  + + +  + YF+ L  I+V      +S SS  +P
Sbjct: 273 PNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVP 332

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK-T 296
                         ID+G   T LP   YN L++     +     Q P       C+K +
Sbjct: 333 ------------TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGS 380

Query: 297 PSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
                  P +   F GGA + L +H S  +    +G  C A+      + I GN+ Q   
Sbjct: 381 VKEMSTVPEIRIIFRGGAGLELKVHNS--LVEIEKGTTCLAIAASSNPISIIGNYQQQTF 438

Query: 356 FIGYDFDSQMVSFKPTDC 373
            + YD  +  + F P  C
Sbjct: 439 TVAYDVANSKIGFAPGGC 456


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 176/372 (47%), Gaps = 54/372 (14%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           V  F+IGTPP       +D   +L+W QC  C+ C+KQ  P++ P +SS++K   C ++ 
Sbjct: 25  VANFTIGTPPQA-ASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDV 83

Query: 85  CHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVF 144
           C  + T  C+S  +C +         T G++AT+    G +     ++ FGC   +    
Sbjct: 84  CKSIPTPKCAS-DVCAFDGVTGLGGHTVGIVATDTFAIGTAAP--ASLGFGCVVASDIDT 140

Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGV 204
                G +GLGRT  SL    ++Q+   +FSYCL P   D+   S+++ G  ++++GGG 
Sbjct: 141 MGGPSGFIGLGRTPWSL----VAQMKLTRFSYCLAPH--DTGKNSRLFLGASAKLAGGGA 194

Query: 205 VSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAISKGNMFIDTGA 257
            +  + +  +     YY + LE I  G   +++  +P   ++     A+ + ++ +D+  
Sbjct: 195 WTPFVKTSPNDGMSQYYPIELEEIKAG---DATITMPRGRNTVLVQTAVVRVSLLVDS-- 249

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVP 317
               + ++F   +   V  A   TP  +P    ++C+    ++G AP L   F  GA + 
Sbjct: 250 ----VYQEFKKAVMASVGAAPTATPVGEPF---EVCFPKAGVSG-APDLVFTFQAGAALT 301

Query: 318 LIHTSTFIPPPVEGVF-------CFAMQPI--------DGDVGIFGNFAQSDLFIGYDFD 362
           +        PP   +F       C ++  I        DG + I G+F Q ++ + +D D
Sbjct: 302 V--------PPANYLFDVGNDTVCLSVMSIALLNITALDG-LNILGSFQQENVHLLFDLD 352

Query: 363 SQMVSFKPTDCT 374
             M+SF+P DC+
Sbjct: 353 KDMLSFEPADCS 364


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 159/362 (43%), Gaps = 27/362 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
           E+V+    G+P     Y + +DTGSD+ W+QCLPC   CYKQ  P+++P  S++Y  + C
Sbjct: 160 EFVVTVGFGSP--AQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPC 217

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
              QC       CS+   C Y   Y D S T GVL+ E ++  ++ +      FGCG  N
Sbjct: 218 GHPQCAAAGG-KCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD-LPGFAFGCGQTN 275

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G F   +  LVGLGR  LSL SQ  +  GA  FSYCL  + T     +       +   
Sbjct: 276 LGEFGGVDG-LVGLGRGALSLPSQAAATFGAT-FSYCLPSYDTTHGYLTMGSTTPAASND 333

Query: 201 GGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
              V  T+++ KED  + YFV +  I +G       ++P   +    ++     D+G   
Sbjct: 334 DDDVQYTAMIQKEDYPSLYFVEVVSIDIGGY-----ILPVPPT--VFTRDGTLFDSGTIL 386

Query: 260 TLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAK 315
           T LP + Y  L ++ +  +   K  P  DP      CY  T   A   P +   F  GA 
Sbjct: 387 TYLPPEAYASLRDRFKFTMTQYKPAPAYDPF---DTCYDFTGHNAIFMPAVAFKFSDGAV 443

Query: 316 VPLIHTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
             L   +  I P    P  G   F  +P      I GN  Q    + YD  ++ + F   
Sbjct: 444 FDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQF 503

Query: 372 DC 373
            C
Sbjct: 504 TC 505


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 124/381 (32%), Positives = 160/381 (41%), Gaps = 40/381 (10%)

Query: 3   PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
           PA++ Y       ++ T N  YV+  S+GTP +      VDTGSDL WVQC PC     C
Sbjct: 36  PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCAAAPSC 85

Query: 60  YKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLAT 117
           Y Q  P+++PA SSSY  + C    C  L     S  S   C Y   Y D S T GV ++
Sbjct: 86  YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSS 145

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           + +T  ++++      FGCGH  +G+FN  + GL+GLGR + SL  Q     G   FSYC
Sbjct: 146 DTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYC 202

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSK 235
           L    T  S    +  G G         ST+  L S    TYY V L GISVG    S  
Sbjct: 203 L---PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-- 257

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--C 293
            +P    +G        + T  PPT      Y  L    R+ +    Y        L  C
Sbjct: 258 -VPASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTC 311

Query: 294 YKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           Y       +  P +   F  GA V L            G   FA    DG + I GN  Q
Sbjct: 312 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQ 367

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
               +    D   V FKP+ C
Sbjct: 368 RSFEV--RIDGTSVGFKPSSC 386


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 151/360 (41%), Gaps = 31/360 (8%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELSC 80
           YV+  S+GTP +      VDTGSDL WVQC PC     CY Q  P+++PA SSSY  + C
Sbjct: 140 YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 81  QSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
               C  L     S  S   C Y   Y D S T GV +++ +T  ++++      FGCGH
Sbjct: 199 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL-SASSAVQGFFFGCGH 257

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
             +G+FN  + GL+GLGR + SL  Q     G   FSYCL    T  S    +  G G  
Sbjct: 258 AQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYCL---PTKPSTAGYLTLGVGGP 312

Query: 199 VSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
                  ST+  L S    TYY V L GISVG    S   +P    +G        + T 
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS---VPASAFAGGTVVDTGTVVTR 369

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGG 313
            PPT      Y  L    R+ +    Y        L  CY       +  P +   F  G
Sbjct: 370 LPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 424

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           A V L            G   FA    DG + I GN  Q    +    D   V FKP+ C
Sbjct: 425 ATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 478


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 116/404 (28%), Positives = 174/404 (43%), Gaps = 66/404 (16%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC------------------- 53
           V S++   + EY+   ++GTPP+     + DTGSDL+W++C                   
Sbjct: 71  VSSDLFYGDFEYLAAVNVGTPPVR-FLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSN 129

Query: 54  LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT-VSCSSQ-QLCNYTYGYADSSLT 111
                   +    +NP  SSSY  + C    C  L T  SC+     C++ Y Y D +  
Sbjct: 130 SSPPPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASA 189

Query: 112 KGVLATERITFG----NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
            G+LA +  TFG    N      ++ FGC     G   + + G+VGLG   LSLASQ+  
Sbjct: 190 TGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQAD-GMVGLGAGPLSLASQL-- 246

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV--SKEDKTYYFVTLEGI 225
                KFS+CL  +  D + +S + FG  + VS  G  +T L+  S     YY      I
Sbjct: 247 ---GRKFSFCLTAYDIDDA-SSILNFGARAVVSDPGAATTPLIASSSNAAAYY-----AI 297

Query: 226 SVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK-DFYNRLEE---QVRNAIKLT 281
           S+ +L  + + +P     G  S   + +DTG   T L +      L E   +V +   L 
Sbjct: 298 SIDSLKVAGQPVP-----GTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLP 352

Query: 282 PYQDPRLGSQLCY---KTPSMAGIAP--ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA 336
               P    +LCY   +   + G+ P   L     GG +V L    TF+    EGV C A
Sbjct: 353 RAPPPDETLELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVK-EGVLCLA 411

Query: 337 -------MQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                  +QP+     + GN A  DL +G D D++  +F   +C
Sbjct: 412 VVTTSPELQPL----SVLGNVALQDLHVGIDLDARTATFATANC 451


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 124/381 (32%), Positives = 160/381 (41%), Gaps = 40/381 (10%)

Query: 3   PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
           PA++ Y       ++ T N  YV+  S+GTP +      VDTGSDL WVQC PC     C
Sbjct: 128 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCSAAPSC 177

Query: 60  YKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLAT 117
           Y Q  P+++PA SSSY  + C    C  L     S  S   C Y   Y D S T GV ++
Sbjct: 178 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSS 237

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           + +T  ++++      FGCGH  +G+FN  + GL+GLGR + SL  Q     G   FSYC
Sbjct: 238 DTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYC 294

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSK 235
           L    T  S    +  G G         ST+  L S    TYY V L GISVG    S  
Sbjct: 295 L---PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-- 349

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--C 293
            +P    +G        + T  PPT      Y  L    R+ +    Y        L  C
Sbjct: 350 -VPASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTC 403

Query: 294 YKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
           Y       +  P +   F  GA V L            G   FA    DG + I GN  Q
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQ 459

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
               +    D   V FKP+ C
Sbjct: 460 RSFEV--RIDGTSVGFKPSSC 478


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 114/388 (29%), Positives = 177/388 (45%), Gaps = 43/388 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYK---QVKPIYN 68
           + S   +   +Y +   IGTP       + DTGSDL W+ C   C  C K       ++ 
Sbjct: 108 IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFR 167

Query: 69  PASSSSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF 122
              SSS++ + C S+ C   L D  S +        C + Y Y +     GV A E +T 
Sbjct: 168 ANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTV 227

Query: 123 GNSNN----FFDNVVFGCGHNNTGVFNENE---MGLVGLGRTRLSLASQILSQLGANKFS 175
           G +++     FD V+ GC    T  FNE      G++GLG  + SLA + L+++  NKFS
Sbjct: 228 GLNDHKKIRLFD-VLIGC----TESFNETNGFPDGVMGLGYRKHSLALR-LAEIFGNKFS 281

Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNS 233
           YCLV   + S+  + + FG+  E+    +  T L+      +Y V + GISVG   LS S
Sbjct: 282 YCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSIS 341

Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR----NAIKLTPYQDPRLG 289
           S +   +N +G    G M +D+G   T+L  + Y+++ + ++       K+ P + P L 
Sbjct: 342 SDI---WNVTGV---GGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPEL- 394

Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGA--KVPLIHTSTFIPPPVEGVFCFAMQPID-GDVG 345
           +  C++       A P L  HF  GA  K P+    ++I    EG+ C  +   D     
Sbjct: 395 NNFCFEDKGFDRAAVPRLLIHFADGAIFKPPV---KSYIIDVAEGIKCLGIIKADFPGSS 451

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           I GN  Q +    YD     + F P+ C
Sbjct: 452 ILGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 164/358 (45%), Gaps = 23/358 (6%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
           E+V+    GTP       I+DTGSDL W+QC PC   CY+Q  P ++PA SSSY  + C 
Sbjct: 136 EFVVVVGFGTPAQTAAI-ILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCG 194

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
           +  C     + C+    C Y   Y D S T GVL+ + +TF NS++ F    FGCG  N 
Sbjct: 195 TPVCAAAGGM-CNGTT-CLYGVQYGDGSSTTGVLSRDTLTF-NSSSKFTGFTFGCGEKNI 251

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
           G F E +  L+GLGR +LSL SQ     G   FSYCL  ++T       +  G     S 
Sbjct: 252 GDFGEVDG-LLGLGRGKLSLPSQAAPSFGG-VFSYCLPSYNTTPGY---LNIGATKPTST 306

Query: 202 GGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
             V  T+++ K    ++YF+ L  I++G       ++P   S    +K    +D+G   T
Sbjct: 307 VPVQYTAMIKKPQYPSFYFIELVSINIGGY-----ILPVPPS--VFTKTGTLLDSGTILT 359

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLI 319
            LP   Y  L ++ +  ++      P      CY  T   A + P ++ +F  GA   L 
Sbjct: 360 YLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLD 419

Query: 320 HTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                I P    P+ G   F  +P      I GN  Q    + YD  SQ + F P  C
Sbjct: 420 FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 151/358 (42%), Gaps = 30/358 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
           EYV   S GTP +  +  ++DTGSDL W+QC PC   QC  Q  P+++P+ SS+Y  + C
Sbjct: 111 EYVATVSFGTPAVPQVV-VIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPC 169

Query: 81  QSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            S +C  L   +    CS+ Q C +   Y D + T GV   +++T         +  FGC
Sbjct: 170 ASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLA-PGAIVKDFYFGC 228

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           GH+ + +    +  L     +  SL +Q         FSYCL   ++       + FG G
Sbjct: 229 GHSKSSLPGLFDGLLGLGRLSE-SLGAQYGG---GGGFSYCLPAVNSKPGF---LAFGAG 281

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
              SG        V  +  T+  VTL GI+VG       L P   S      G M +D+G
Sbjct: 282 RNPSGFVFTPMGRVPGQ-PTFSTVTLAGITVGG--KKLDLRPSAFS------GGMIVDSG 332

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDGGAK 315
              T+L    Y  L    R A+K   Y+        CY        + P +   F GGA 
Sbjct: 333 TVVTVLQSTVYRALRAAFREAMKA--YRLVHGDLDTCYDLTGYKNVVVPKIALTFSGGAT 390

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + L   +  +   V G   FA    DG  G+ GN  Q    + +D  +    F+   C
Sbjct: 391 INLDVPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 179/378 (47%), Gaps = 44/378 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G+PP  + Y  VDTGSD++WV C PC +C  +        +Y+  +SS+ K
Sbjct: 75  GLYFTKIKLGSPPK-EYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSK 133

Query: 77  ELSCQSEQCH-LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNN--FF 129
            + C+   C  ++ + +C +++ C+Y   Y D S + G    + IT     GN       
Sbjct: 134 NVGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLA 193

Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
             VVFGCG N +G   + E    G++G G++  S+ SQ+ +     + FS+CL       
Sbjct: 194 QEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCL------D 247

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           ++     F  G EV    V +T LV   ++ +Y V L+G+ V        L P   S+  
Sbjct: 248 NMNGGGIFAIG-EVESPVVKTTPLV--PNQVHYNVILKGMDVDG--EPIDLPPSLASTNG 302

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
              G   ID+G     LP++ YN L E++  +  +KL   Q+    +  C+   S    A
Sbjct: 303 --DGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQE----TFACFSFTSNTDKA 356

Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVGIFGNFAQSDLF 356
            P++  HF+   K+  ++   ++    E ++CF  Q       DG DV + G+   S+  
Sbjct: 357 FPVVNLHFEDSLKLS-VYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKL 415

Query: 357 IGYDFDSQMVSFKPTDCT 374
           + YD +++++ +   +C+
Sbjct: 416 VVYDLENEVIGWADHNCS 433


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 105/379 (27%), Positives = 183/379 (48%), Gaps = 46/379 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G+PP  + Y  VDTGSD++WV C PC +C  +        +Y+  +SS+ K
Sbjct: 76  GLYFTKIKLGSPPK-EYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSK 134

Query: 77  ELSCQSEQCH-LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNN--FF 129
            + C+ + C  ++ + +C +++ C+Y   Y D S + G    + IT     GN       
Sbjct: 135 NVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA 194

Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVPFHTD 184
             VVFGCG N +G   + +    G++G G++  S+ SQ L+  G+ K  FS+CL      
Sbjct: 195 QEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ-LAAGGSTKRIFSHCL------ 247

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            ++     F  G EV    V +T +V   ++ +Y V L+G+ V    +   L P   S+ 
Sbjct: 248 DNMNGGGIFAVG-EVESPVVKTTPIV--PNQVHYNVILKGMDVDG--DPIDLPPSLASTN 302

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
               G   ID+G     LP++ YN L E++  +  +KL   Q+    +  C+   S    
Sbjct: 303 G--DGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQE----TFACFSFTSNTDK 356

Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVGIFGNFAQSDL 355
           A P++  HF+   K+  ++   ++    E ++CF  Q       DG DV + G+   S+ 
Sbjct: 357 AFPVVNLHFEDSLKLS-VYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 415

Query: 356 FIGYDFDSQMVSFKPTDCT 374
            + YD +++++ +   +C+
Sbjct: 416 LVVYDLENEVIGWADHNCS 434


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 105/379 (27%), Positives = 183/379 (48%), Gaps = 46/379 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G+PP  + Y  VDTGSD++WV C PC +C  +        +Y+  +SS+ K
Sbjct: 72  GLYFTKIKLGSPPK-EYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSK 130

Query: 77  ELSCQSEQCH-LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNN--FF 129
            + C+ + C  ++ + +C +++ C+Y   Y D S + G    + IT     GN       
Sbjct: 131 NVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA 190

Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVPFHTD 184
             VVFGCG N +G   + +    G++G G++  S+ SQ L+  G+ K  FS+CL      
Sbjct: 191 QEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ-LAAGGSTKRIFSHCL------ 243

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            ++     F  G EV    V +T +V   ++ +Y V L+G+ V    +   L P   S+ 
Sbjct: 244 DNMNGGGIFAVG-EVESPVVKTTPIV--PNQVHYNVILKGMDVDG--DPIDLPPSLASTN 298

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
               G   ID+G     LP++ YN L E++  +  +KL   Q+    +  C+   S    
Sbjct: 299 G--DGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQE----TFACFSFTSNTDK 352

Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVGIFGNFAQSDL 355
           A P++  HF+   K+  ++   ++    E ++CF  Q       DG DV + G+   S+ 
Sbjct: 353 AFPVVNLHFEDSLKLS-VYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 411

Query: 356 FIGYDFDSQMVSFKPTDCT 374
            + YD +++++ +   +C+
Sbjct: 412 LVVYDLENEVIGWADHNCS 430


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 106/390 (27%), Positives = 160/390 (41%), Gaps = 44/390 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           + EY++  SIGTP    +   +DTGSDL+W QC  C  C+ Q  P ++  +S +   + C
Sbjct: 97  DSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQC-ACHVCFAQPFPTFDALASQTTLAVPC 155

Query: 81  QSEQCH--LLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITF----GNSNN------ 127
               C         C+ +   C Y Y YAD S+T G +  +  TF    GN+ +      
Sbjct: 156 SDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGV 215

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
              NV FGCG  N G+F  NE G+ G  R  +SL     SQL   +FS+C        + 
Sbjct: 216 AVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLP----SQLKVARFSHCFTAIA--DAR 269

Query: 188 TSKMYFGN--GSEVSGG---GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
           TS ++ G   G +  G    G V ++  +  + + Y++TL+GI+VG        + +   
Sbjct: 270 TSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGK 329

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ--LCYKTPSMA 300
                 G   ID+G     LP   Y  L       +KL    +    ++  LC++  +  
Sbjct: 330 GTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFE--AAR 387

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV--------------FCFAMQPI-DGDVG 345
             +    A      KV L         P E                 C  M    D D+ 
Sbjct: 388 SASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLT 447

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           I GNF Q ++ + YD +   + F P  C K
Sbjct: 448 IIGNFQQQNMHVAYDLEKNKLVFVPARCDK 477


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 113/413 (27%), Positives = 178/413 (43%), Gaps = 81/413 (19%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS- 71
           + S  S+ +G+Y +   +G+PP   +  + DTGSDL WV+C  C    K    I+ P S 
Sbjct: 72  LMSGASSGSGQYFVSIRLGSPPQTLLL-VADTGSDLTWVRCSAC----KTNCSIHPPGST 126

Query: 72  -----SSSYKELSCQSEQCHLL---DTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
                S+++    C S  C L+   +   C+  +L   C Y Y Y+D S T G  + E  
Sbjct: 127 FLARHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETT 186

Query: 121 TFGNSNNF---FDNVVFGCGHNNTG------VFNENEMGLVGLGRTRLSLASQILSQLGA 171
           T   S+       ++ FGCG + +G       FN    G++GLGR  +S ASQ+  + G 
Sbjct: 187 TLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFN-GASGVMGLGRGPISFASQLGRRFG- 244

Query: 172 NKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS------------LVSKEDKTYYF 219
             FSYCL+ +      TS +  G+        VVST             L++ E  T+Y+
Sbjct: 245 RSFSYCLLDYTLSPPPTSYLMIGD--------VVSTKKDNKSMMSFTPLLINPEAPTFYY 296

Query: 220 VTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
           ++++G+ V  +     + P   S   +  G   ID+G   T L +  Y  +    +  +K
Sbjct: 297 ISIKGVFVDGV--KLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVK 354

Query: 280 LTPYQDP-----RLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPV- 329
           L P   P     R G  LC    ++ G++    P L+    G         S + PPP  
Sbjct: 355 L-PSPTPGGASTRSGFDLCV---NVTGVSRPRFPRLSLELGG--------ESLYSPPPRN 402

Query: 330 ------EGVFCFAMQPIDGDVGIF---GNFAQSDLFIGYDFDSQMVSFKPTDC 373
                 EG+ C A+QP++ + G F   GN  Q    + +D     + F    C
Sbjct: 403 YFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 114/414 (27%), Positives = 179/414 (43%), Gaps = 73/414 (17%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC----------VQCYKQVKPIYNPASS 72
           +Y+  + IG PP      +VDTGSDL+W QC  C            C+ Q  P YN + S
Sbjct: 77  QYIASYGIGDPPQ-PAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLS 135

Query: 73  SSYKELSCQSEQCHLL----DTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFG 123
            + + + C  +   L     +T  C     S    C     Y  + +  GVL T+  TF 
Sbjct: 136 RTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYG-AGVALGVLGTDAFTFP 194

Query: 124 NSNNFFDNVVFGCGHNNT---GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
           +S++    + FGC        G  N    G++GLGR  LSL    +SQL A +FSYCL P
Sbjct: 195 SSSSV--TLAFGCVSQTRISPGALN-GASGIIGLGRGALSL----VSQLNATEFSYCLTP 247

Query: 181 FHTDSSITSKMYFGNGSEVSGGGV----------VSTSLVSKEDK-----TYYFVTLEGI 225
           +  D+   S ++ G+G                  V+T   +K  K     T+Y++ L G+
Sbjct: 248 YFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 307

Query: 226 SVGNLSNS--SKLIPYYNSSGAISKGNMFIDTGAPPTLL----PKDFYNRLEEQVRNAIK 279
           + GN + +  +       ++  +  G   ID+G+P T L     +     L  Q+R +  
Sbjct: 308 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 367

Query: 280 LTPYQDPRLGS--QLCYKT----PSMAGIA-PILTAHFD---GGAKVPLIHTSTFIPPPV 329
           L P    +LG   +LC +      S+A  A P L   FD   GG +  +I    +     
Sbjct: 368 LVP-PPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVE 426

Query: 330 EGVFCFAM---------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
              +C A+          P + +  I GNF Q D+ + YD  + ++SF+P +C+
Sbjct: 427 ASTWCMAVVSSASGNATLPTN-ETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 154/369 (41%), Gaps = 24/369 (6%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
           VQS +    G Y++K ++GTP L  +   +DTGSD+ W QC PCV  CY+Q +  ++P  
Sbjct: 34  VQSGIPLGAGNYLVKMALGTPKL-SLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRK 92

Query: 72  SSSYKELSCQSEQCHLLD----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
           SSSYK +SC S  C ++        C S   C Y   Y D S + G  ATE++T   S +
Sbjct: 93  SSSYKNVSCSSSSCRIITDSGGARGCVSST-CIYKVQYGDGSYSVGFFATEKLTISPS-D 150

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
              N +FGCG  N G F    +  +          +   S+   N F+YCL  F + S+ 
Sbjct: 151 VISNFLFGCGQQNAGRF--GRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSST- 207

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
                   G    GG V  +   +     +      GI +  LS    ++P    +   S
Sbjct: 208 --------GHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPI--DASVFS 257

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
                ID+G   T L    Y+ L  + +  +K  P  D       CY       I+ P +
Sbjct: 258 NAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSILDTCYDFSGNESISVPRI 317

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFIGYDFDSQ 364
           +  F GG +V +               C A  P   DGD  +FGN  Q    + +D    
Sbjct: 318 SFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKG 377

Query: 365 MVSFKPTDC 373
            + F P+ C
Sbjct: 378 RIGFAPSGC 386


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 74/194 (38%), Positives = 104/194 (53%), Gaps = 12/194 (6%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           +S  +   G YV+   +GTP   D+  I DTGSDL W QC PC + CY Q +PI+NP+ S
Sbjct: 128 KSGSTIGTGNYVVTVGLGTPKR-DLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKS 186

Query: 73  SSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
           +SY  +SC S  C  L     ++ SCS+   C Y   Y D S + G  A +++    S +
Sbjct: 187 TSYTNISCSSPTCDELKSGTGNSPSCSAST-CVYGIQYGDQSYSVGFFAQDKLAL-TSTD 244

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
            F+N +FGCG NN G+F     GL+GLGR  LSL S+      A+    C      D+  
Sbjct: 245 VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLMSKYPKAAPASILDTCYDFSQYDTVD 303

Query: 188 TSK--MYFGNGSEV 199
             K  +YF +G+E+
Sbjct: 304 VPKINLYFSDGAEM 317


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 165/372 (44%), Gaps = 36/372 (9%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           ++   IGTPP      I+DTGS L W+QC   V        +++P+ SSS+  L C    
Sbjct: 83  LVSLPIGTPPQTQQM-ILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPL 141

Query: 85  CH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C           SC   +LC+Y+Y YAD +L +G L  E+ITF  S +    ++ GC   
Sbjct: 142 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQS-TPPLILGCAEE 200

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           ++     +  G++G+   RLS ASQ        KFSYC+         T    F  G   
Sbjct: 201 SS-----DAKGILGMNLGRLSFASQA----KLTKFSYCVPTRQVRPGFTPTGSFYLGENP 251

Query: 200 SGGGVVSTSLVS--------KEDKTYYFVTLEGISVGNLSNSSKLIPYY-NSSGAISKGN 250
           + GG    +L++          D   Y V ++GI +GN   +  +  +  + SGA   G 
Sbjct: 252 NSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGA---GQ 308

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCY--KTPSMAGIAPIL 306
             ID+G+  T L  + YN++ E+V   +     +    G  S +C+      +  +   +
Sbjct: 309 TMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNM 368

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDS 363
              FD G ++ ++     +     GV C  +   + +     I GNF Q ++++ +D  +
Sbjct: 369 VFEFDKGVEI-VVEKERVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLAN 427

Query: 364 QMVSFKPTDCTK 375
           + V F   DC++
Sbjct: 428 RRVGFGKADCSR 439


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 163/368 (44%), Gaps = 17/368 (4%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
            +S  +  Y++K  IG+P +  +Y + DTGS L W QC PC + ++Q+ PI+N  +S +Y
Sbjct: 83  RISQDDTCYLVKVIIGSPGV-PLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTY 141

Query: 76  KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           ++L CQ + C     V       C Y   YA  S T GV A + +    ++       FG
Sbjct: 142 RDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRI--PFYFG 199

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLAS----QILSQLGANKFSYCLVPFH--TDSSITS 189
           C  +N   F+  E    G G   L+++     Q ++ +  N+FSYCL  F   + S  TS
Sbjct: 200 CSRDNQN-FSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATS 258

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
            + FGN    S    +ST  VS      YF+ L  +SV    N  ++ P   +      G
Sbjct: 259 LLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVA--GNRMQIPPGTFALKPDGTG 316

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYKTPSMA-GIAPIL 306
              ID+G   T + +  Y  +    +N      +Q  + +L   +CYK         P +
Sbjct: 317 GTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSM 376

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGYDFDSQM 365
             HF  GA   +     ++     G FC A+QPI      I G   Q++    YD  ++ 
Sbjct: 377 AFHFQ-GADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQ 435

Query: 366 VSFKPTDC 373
           + F P +C
Sbjct: 436 LLFTPENC 443


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 174/387 (44%), Gaps = 47/387 (12%)

Query: 15  SNVSTANGE----YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
           S+   A+G+    YV++  +G+P    I   +DT +D  W  C PC  C      ++ PA
Sbjct: 64  SSAPVASGQSPPSYVVRAGLGSP-AQPILLALDTSADATWAHCSPCGTCPSSGS-LFAPA 121

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQ---------LCNYTYGYADSSLTKGVLATERIT 121
           +S+SY  L C S  C +L    C +Q          +C +T  +AD+S  +  LA++ + 
Sbjct: 122 NSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLH 180

Query: 122 FGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
            G   +   N  FGC    +G   N  + GL+GLGR  ++L SQ+   +    FSYCL  
Sbjct: 181 LGK--DAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQV-GNMYNGVFSYCLPS 237

Query: 181 FHTDSSITSKMYFGNGSEVSGG-----GVVSTSLVSKEDKT-YYFVTLEGISVGN--LSN 232
           +        K Y+ +GS   G      GV  T ++   +++  Y+V + G+SVG   +  
Sbjct: 238 Y--------KSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKV 289

Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-Q 291
            +    +  ++GA       +D+G   T      Y  L E+ R  +   P     LG+  
Sbjct: 290 PAGSFAFDPATGA----GTVVDSGTVITRWTPPVYAALREEFRRHVA-APSGYTSLGAFD 344

Query: 292 LCYKTPSMA-GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGI 346
            C+ T  +A G+AP +T H DGG  + L   +T I      + C AM    Q ++  V +
Sbjct: 345 TCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNV 404

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             N  Q +L + +D  +  V F    C
Sbjct: 405 LANLQQQNLRVVFDVANSRVGFARESC 431


>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
          Length = 330

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 157/359 (43%), Gaps = 58/359 (16%)

Query: 36  LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS 95
           +D+  + DT SDL+W QC PC+ C  Q   +Y+P  + +Y  L+                
Sbjct: 1   MDVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSS-------------- 46

Query: 96  QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLG 155
               NY Y Y+  S T G  ATE    GN      N+ FGCG  N G ++          
Sbjct: 47  ----NYNYTYSKQSFTSGYFATETFALGNVT--VANITFGCGTRNQGYYDNVAG-----V 95

Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-----LV 210
                    +L+QLG ++FSYC     + +  +S ++ G   E++     + +     + 
Sbjct: 96  FGVGRGGVSLLNQLGIDRFSYCFS--SSGAPGSSAVFLGGSPELATNATTTPAASTPMVA 153

Query: 211 SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNR 269
               K+ YFV L G++VG     +  +    +S A   G  + ID+ +P T+L +  Y  
Sbjct: 154 DPVLKSGYFVKLVGVTVG-----ATRVDVAGASSAEGGGRALVIDSTSPVTVLDEATYG- 207

Query: 270 LEEQVRNAI--KLTPYQDPR------LGSQLCYKTPSMAGIAP-----ILTAHFDGGAKV 316
               VR A+  +L P ++        +G  LC++  +  G  P      +T HFDGGA  
Sbjct: 208 ---PVRRALVAQLAPLKEANANASAGVGLDLCFEL-AAGGATPTPPNVTMTLHFDGGAAD 263

Query: 317 PLIHTSTFIPP-PVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            ++  + ++      G+ C  M P   + V + G+ A  D  + YD    +VSF+P DC
Sbjct: 264 LVLPPANYLAKDSAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDC 322


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 11/171 (6%)

Query: 14  QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
           +S  +  +G YV+   +G+P   D+  I DTGSDL W QC PCV  CY+Q + I++P++S
Sbjct: 79  KSASTLGSGNYVVTVGLGSPKR-DLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS 137

Query: 73  SSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
            SY  +SC S  C  L++ +     CSS   C Y   Y D S + G  A E+++   S +
Sbjct: 138 LSYSNVSCDSPSCEKLESATGNSPGCSSST-CLYGIRYGDGSYSIGFFAREKLSL-TSTD 195

Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
            F+N  FGCG NN G+F     GL+GL R  LSL SQ   + G   FSYCL
Sbjct: 196 VFNNFQFGCGQNNRGLFG-GTAGLLGLARNPLSLVSQTAQKYG-KVFSYCL 244


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 175/378 (46%), Gaps = 43/378 (11%)

Query: 10  NNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           +++ Q+ VS  NG  Y    ++G+PP  D   ++DTGSDL WV+C PC          ++
Sbjct: 109 HDLAQTPVSFTNGGVYYSSITLGSPPK-DFSLVMDTGSDLTWVRCDPC---SPDCSSTFD 164

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
             +S++YK L+C  +    L  +    ++L +      D+    G  + E          
Sbjct: 165 RLASNTYKALTCADDL--RLPVLLRLWRRLFHSGRSLRDTLKMAGAASDEL-------EE 215

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
           F   VFGCG    G+ +  E+G++ L    LS  SQI  + G NKFSYCL+     +S+ 
Sbjct: 216 FPGFVFGCGSLLKGLIS-GEVGILALSPGSLSFPSQIGEKYG-NKFSYCLLRQTAQNSLK 273

Query: 189 -SKMYFGNGS-EVSGGGVVSTSLVS----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            S M FG  + E+   G      +      E   YY V L+GISVGN      L P    
Sbjct: 274 KSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGN--QRLDLSPSTFL 331

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
           +G   K  +F D+G   T+LP    + +++ + + +    +   + G   C++ P  +G 
Sbjct: 332 NGQ-DKPTIF-DSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIK-GLDACFRVPPSSGQ 388

Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVF------CFAMQPIDGDVGIFGNFAQSDL 355
             P +T HF+GGA         F+  P   V       C    P + +V IFGN  Q D 
Sbjct: 389 GLPDITFHFNGGAD--------FVTRPSNYVIDLGSLQCLIFVPTN-EVSIFGNLQQQDF 439

Query: 356 FIGYDFDSQMVSFKPTDC 373
           F+ +D D++ + FK TDC
Sbjct: 440 FVLHDMDNRRIGFKETDC 457


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 81/130 (62%), Gaps = 4/130 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           VQ+ VS  NGE++M+ +IG P L     I+DTGSDL W QC+PC  CYKQ  PIY+P+ S
Sbjct: 10  VQAPVSAGNGEFLMQLAIGKPSLA-YSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLS 68

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC+S  C  L   +C S   C Y Y Y D S T+G+L+ E  TF  S+    ++
Sbjct: 69  STYGTVSCKSSLCLALPASACISAT-CEYLYTYGDYSSTQGILSYE--TFTLSSQSIPHI 125

Query: 133 VFGCGHNNTG 142
            FGCG +N G
Sbjct: 126 AFGCGQDNEG 135


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/342 (29%), Positives = 147/342 (42%), Gaps = 28/342 (8%)

Query: 42  VDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTV--SCSSQQ 97
           +DT  D+ W+QC PC   QCY Q  P+++P +SS+   + C+S  C  L      CS++ 
Sbjct: 152 IDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRS 211

Query: 98  L---CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
               C Y   Y+D   T G   T+ +T   +     N  FGC H   G F++   G + L
Sbjct: 212 ANAECRYLIEYSDDRATAGTYMTDTLTISGTTA-VRNFRFGCSHAVRGRFSDLTAGTMSL 270

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSK 212
           G    SL +Q    LG N FSYC VP    +S +  +  G  +  +   V +T+  + S 
Sbjct: 271 GGGAQSLLAQTARSLG-NAFSYC-VP---QASASGFLSIGGPATTNSTTVFATTPLVRSA 325

Query: 213 EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
            + + Y V L+GI V        + P   S+GA+      +D+ A  T LP   Y  L  
Sbjct: 326 INPSLYLVRLQGIVVAG--RRLGIPPVAFSAGAV------MDSSAVITQLPPTAYRALRR 377

Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG 331
             RNA++  P          CY    +  +  P ++  F GGA V L   +  I     G
Sbjct: 378 AFRNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMI----GG 433

Query: 332 VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              F     D  +G  GN  Q    + YD  +  V F+   C
Sbjct: 434 CLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 174/381 (45%), Gaps = 61/381 (16%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            ++S ++  +GEY M   +G+PP      I+DTGSDL W+QCLPC  C++Q         
Sbjct: 158 TLESGMTLGSGEYFMDVLVGSPPK-HFSLILDTGSDLNWIQCLPCYDCFQQ--------- 207

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF-----GNSN 126
                                 +  Q C Y Y Y DSS T G  A E  T      G S+
Sbjct: 208 ----------------------NDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSS 245

Query: 127 NFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
             +  +N++FGCGH N G+F+     L    R  LS +SQ+ S  G + FSYCLV  ++D
Sbjct: 246 ELYNVENMMFGCGHWNRGLFHGAAGLLGLG-RGPLSFSSQLQSLYG-HSFSYCLVDRNSD 303

Query: 185 SSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSSKLIPY 239
           ++++SK+ FG   + +S   +  TS V+ ++    T+Y+V ++ I V G + N  +    
Sbjct: 304 TNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWN 363

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQD-PRLGSQLCY 294
            +S GA   G   ID+G   +   +  Y    N++ E+ +   K   Y+D P L    C+
Sbjct: 364 ISSDGA---GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILDP--CF 416

Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQ 352
               +  +  P L   F  GA       ++FI    E + C AM         I GN+ Q
Sbjct: 417 NVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLN-EDLVCLAMLGTPKSAFSIIGNYQQ 475

Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
            +  I YD     + + PT C
Sbjct: 476 QNFHILYDTKRSRLGYAPTKC 496


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 81/130 (62%), Gaps = 4/130 (3%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           VQ+ VS  NGE++M+ +IG P L     I+DTGSDL W QC+PC  CYKQ  PIY+P+ S
Sbjct: 10  VQAPVSAGNGEFLMQLAIGKPSLA-YSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLS 68

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
           S+Y  +SC+S  C  L   +C S   C Y Y Y D S T+G+L+ E  TF  S+    ++
Sbjct: 69  STYGTVSCKSSLCLALPASACISAT-CEYLYTYGDYSSTQGILSYE--TFTLSSQSIPHI 125

Query: 133 VFGCGHNNTG 142
            FGCG +N G
Sbjct: 126 AFGCGQDNEG 135


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 105/378 (27%), Positives = 168/378 (44%), Gaps = 48/378 (12%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           ++   IGTPP      I+DTGS L W+QC   V        +++P+ SSS+  L C    
Sbjct: 78  LVSLPIGTPPQSQQM-ILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPL 136

Query: 85  CH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C           SC   +LC+Y+Y YAD +L +G L  E+ITF  S +    ++ GC  +
Sbjct: 137 CKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQS-TPPLILGCAED 195

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI--TSKMYFGNGS 197
            +     ++ G++G+   RLS ASQ        KFSYC+           T   Y G   
Sbjct: 196 AS-----DDKGILGMNLGRLSFASQA----KITKFSYCVPTRQVRPGFTPTGSFYLGENP 246

Query: 198 EVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYY-NSSGAISKGN 250
             +G   +S    S+       D   + V L+GI +GN   +  +  +  + SGA   G 
Sbjct: 247 NSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGA---GQ 303

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSM--A 300
             ID+G+  T L    YN++ E+V   ++L     PRL         S +C+   +M   
Sbjct: 304 SMIDSGSEFTYLVDVAYNKVREEV---VRLA---GPRLKKGYVYSGVSDMCFDGNAMEIG 357

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFI 357
            +   +   FD G ++ +I     +     GV C  +   + +     I GNF Q +L++
Sbjct: 358 RLIGNMVFEFDKGVEI-VIEKGRVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNLWV 416

Query: 358 GYDFDSQMVSFKPTDCTK 375
            +D  ++ V F   DC++
Sbjct: 417 EFDIANRRVGFGKADCSR 434


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 158/376 (42%), Gaps = 35/376 (9%)

Query: 24  YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
           Y+++  IGTP   +   Y + DTGSDL W QC PC  C      P ++P+ S +++ LSC
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 161

Query: 81  QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
               C L   V         C +   Y D     G L ++   FG + +        +V 
Sbjct: 162 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 221

Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL--------VPFHTD 184
           FGC H  ++        G++ LG  + S     ++QLG ++FSYC+             +
Sbjct: 222 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDE 277

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYYNS 242
               S + FG+ + ++G          K+D + Y V L+ +    G   N  + +P Y +
Sbjct: 278 ERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 332

Query: 243 -SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
              A +   M +D+G     LP   +  L+ ++   I LT   D    S  CY       
Sbjct: 333 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDV 392

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            A  +T  F GGA + L  TS F       E   C A+    G+  I G + Q ++ +GY
Sbjct: 393 EAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGY 450

Query: 360 DFDSQMVSFKPTDCTK 375
           D  +  ++F    C +
Sbjct: 451 DLSTMEIAFDRDQCDR 466


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 113/342 (33%), Positives = 142/342 (41%), Gaps = 30/342 (8%)

Query: 42  VDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
           VDTGSDL WVQC PC     CY Q  P+++PA SSSY  + C    C  L     S  S 
Sbjct: 3   VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGR 156
             C Y   Y D S T GV +++ +T  ++++      FGCGH  +G+FN  + GL+GLGR
Sbjct: 63  AQCGYVVSYGDGSNTTGVYSSDTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGR 120

Query: 157 TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKED 214
            + SL  Q     G   FSYCL    T  S    +  G G         ST+  L S   
Sbjct: 121 EQPSLVEQTAGTYG-GVFSYCL---PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNA 176

Query: 215 KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
            TYY V L GISVG    S   +P    +G        + T  PPT      Y  L    
Sbjct: 177 PTYYVVMLTGISVGGQQLS---VPASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAF 228

Query: 275 RNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
           R+ +    Y        L  CY       +  P +   F  GA V L            G
Sbjct: 229 RSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG 284

Query: 332 VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              FA    DG + I GN  Q    +    D   V FKP+ C
Sbjct: 285 CLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 324


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 158/376 (42%), Gaps = 35/376 (9%)

Query: 24  YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
           Y+++  IGTP   +   Y + DTGSDL W QC PC  C      P ++P+ S +++ LSC
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 182

Query: 81  QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
               C L   V         C +   Y D     G L ++   FG + +        +V 
Sbjct: 183 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 242

Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL--------VPFHTD 184
           FGC H  ++        G++ LG  + S     ++QLG ++FSYC+             +
Sbjct: 243 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDE 298

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYYNS 242
               S + FG+ + ++G          K+D + Y V L+ +    G   N  + +P Y +
Sbjct: 299 ERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 353

Query: 243 -SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
              A +   M +D+G     LP   +  L+ ++   I LT   D    S  CY       
Sbjct: 354 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDV 413

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
            A  +T  F GGA + L  TS F       E   C A+    G+  I G + Q ++ +GY
Sbjct: 414 EAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGY 471

Query: 360 DFDSQMVSFKPTDCTK 375
           D  +  ++F    C +
Sbjct: 472 DLSTMEIAFDRDQCDR 487


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 109/381 (28%), Positives = 173/381 (45%), Gaps = 46/381 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP+ +    +DTGSD++WV C  C  C +    Q++   ++P SSS+  
Sbjct: 76  GLYYTKVQLGTPPV-EFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSS 134

Query: 77  ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN- 131
            ++C  ++C+        +CSSQ   C+YT+ Y D S T G   ++ +     N  F+  
Sbjct: 135 MIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL---NTIFEGS 191

Query: 132 --------VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLV 179
                   VVFGC +  TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL 
Sbjct: 192 MTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL- 250

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLI 237
               DSS    +  G   E+    +V TSLV  +   +Y + L+ ISV    L   S + 
Sbjct: 251 --KGDSSGGGILVLG---EIVEPNIVYTSLVPAQ--PHYNLNLQSISVNGQTLQIDSSVF 303

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
              NS G I      +D+G     L ++ Y+     +  AI  +       G+Q    T 
Sbjct: 304 ATSNSRGTI------VDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQCYLITS 357

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQS 353
           S+  + P ++ +F GGA + L      I     G   V+C   Q I G  + I G+    
Sbjct: 358 SVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLK 417

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           D  + YD   Q + +   DC+
Sbjct: 418 DKIVVYDLAGQRIGWANYDCS 438


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 87/276 (31%), Positives = 129/276 (46%), Gaps = 28/276 (10%)

Query: 8   YPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYG-IVDTGSDLMWVQCLPCV-QCYKQ 62
           +P +V   +    S  +G Y +K   G+P     Y  IVDTGS L W+QC PCV  C+ Q
Sbjct: 99  FPKSVSVPLNPGASIGSGNYYVKVGFGSPA--RYYSMIVDTGSSLSWLQCKPCVVYCHVQ 156

Query: 63  VKPIYNPASSSSYKELSCQSEQCHLLDTVSC------SSQQLCNYTYGYADSSLTKGVLA 116
             P+++P++S +YK LSC S QC  L   +       +S  +C YT  Y DSS + G L+
Sbjct: 157 ADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLS 216

Query: 117 TERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY 176
            + +T   S       V+GCG ++ G+F     G++GLGR +LS+  Q+ S+ G   FSY
Sbjct: 217 QDLLTLAPSQT-LPGFVYGCGQDSDGLFGR-AAGILGLGRNKLSMLGQVSSKFG-YAFSY 273

Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
           CL P        S    G  S        +       + + YF+ L  I+VG  +     
Sbjct: 274 CL-PTRGGGGFLS---IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRA----- 324

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
                 + A  +    ID+G   T LP   Y   ++
Sbjct: 325 ---LGVAAAQYRVPTIIDSGTVITRLPMSVYTPFQQ 357


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/378 (26%), Positives = 158/378 (41%), Gaps = 37/378 (9%)

Query: 24  YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
           Y+++  IGTP   +   Y + DTGSDL W QC PC  C      P ++P+ S +++ LSC
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 160

Query: 81  QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
               C L   V         C +   Y D     G L ++   FG + +        +V 
Sbjct: 161 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 220

Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL----------VPFH 182
           FGC H  ++        G++ LG  + S     ++QLG ++FSYC+              
Sbjct: 221 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDD 276

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYY 240
            +    S + FG+ + ++G          K+D + Y V L+ +    G   N  + +P Y
Sbjct: 277 DEERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 331

Query: 241 NS-SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
            +   A +   M +D+G     LP   +  L+ ++   I LT   D    S  CY     
Sbjct: 332 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMT 391

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
              A  +T  F GGA + L  TS F       E   C A+    G+  I G + Q ++ +
Sbjct: 392 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINV 449

Query: 358 GYDFDSQMVSFKPTDCTK 375
           GYD  +  ++F    C +
Sbjct: 450 GYDLSTMEIAFDRDQCDR 467


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/378 (26%), Positives = 158/378 (41%), Gaps = 37/378 (9%)

Query: 24  YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
           Y+++  IGTP   +   Y + DTGSDL W QC PC  C      P ++P+ S +++ LSC
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 163

Query: 81  QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
               C L   V         C +   Y D     G L ++   FG + +        +V 
Sbjct: 164 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 223

Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL----------VPFH 182
           FGC H  ++        G++ LG  + S     ++QLG ++FSYC+              
Sbjct: 224 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDD 279

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYY 240
            +    S + FG+ + ++G          K+D + Y V L+ +    G   N  + +P Y
Sbjct: 280 DEERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 334

Query: 241 NS-SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
            +   A +   M +D+G     LP   +  L+ ++   I LT   D    S  CY     
Sbjct: 335 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMT 394

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
              A  +T  F GGA + L  TS F       E   C A+    G+  I G + Q ++ +
Sbjct: 395 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINV 452

Query: 358 GYDFDSQMVSFKPTDCTK 375
           GYD  +  ++F    C +
Sbjct: 453 GYDLSTMEIAFDRDQCDR 470


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/381 (28%), Positives = 174/381 (45%), Gaps = 46/381 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP+ +    +DTGSD++WV C  C  C +    Q++   ++P SSS+  
Sbjct: 73  GLYYTKVQLGTPPV-EFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSS 131

Query: 77  ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN- 131
            ++C  ++C+        +CSSQ   C+YT+ Y D S T G   ++ +     N  F+  
Sbjct: 132 MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL---NTIFEGS 188

Query: 132 --------VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLV 179
                   VVFGC +  TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL 
Sbjct: 189 VTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL- 247

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLI 237
               DSS    +  G   E+    +V TSLV  +   +Y + L+ I+V    L   S + 
Sbjct: 248 --KGDSSGGGILVLG---EIVEPNIVYTSLVPAQP--HYNLNLQSIAVNGQTLQIDSSVF 300

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
              NS G I      +D+G     L ++ Y+     +  +I  + +     G+Q    T 
Sbjct: 301 ATSNSRGTI------VDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQCYLITS 354

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQS 353
           S+  + P ++ +F GGA + L      I     G   V+C   Q I G  + I G+    
Sbjct: 355 SVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLK 414

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           D  + YD   Q + +   DC+
Sbjct: 415 DKIVVYDLAGQRIGWANYDCS 435


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/378 (26%), Positives = 158/378 (41%), Gaps = 37/378 (9%)

Query: 24  YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
           Y+++  IGTP   +   Y + DTGSDL W QC PC  C      P ++P+ S +++ LSC
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 181

Query: 81  QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
               C L   V         C +   Y D     G L ++   FG + +        +V 
Sbjct: 182 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 241

Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL----------VPFH 182
           FGC H  ++        G++ LG  + S     ++QLG ++FSYC+              
Sbjct: 242 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDD 297

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYY 240
            +    S + FG+ + ++G          K+D + Y V L+ +    G   N  + +P Y
Sbjct: 298 DEERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 352

Query: 241 NS-SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
            +   A +   M +D+G     LP   +  L+ ++   I LT   D    S  CY     
Sbjct: 353 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMT 412

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
              A  +T  F GGA + L  TS F       E   C A+    G+  I G + Q ++ +
Sbjct: 413 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINV 470

Query: 358 GYDFDSQMVSFKPTDCTK 375
           GYD  +  ++F    C +
Sbjct: 471 GYDLSTMEIAFDRDQCDR 488


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 31/315 (9%)

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATER 119
           P ++ ++SS+    SC S  C  L   SC +      Q C YTY Y D S+T G++  ++
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82

Query: 120 ITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            TFG   +    V FGCG  N GVF  NE G+ G GR  LSL     SQL    FS+C  
Sbjct: 83  FTFGAGASV-PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFT 137

Query: 180 PFHTDSSITSKM-----YFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNS 233
             +     T  +      + NG     G V ST L+ +  + T+Y+++L+GI+VG     
Sbjct: 138 AVNGLKQSTVLLDLPADLYKNGR----GAVQSTPLIQNSANPTFYYLSLKGITVG----- 188

Query: 234 SKLIPYYNSSGAISKGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
           S  +P   S+ A++ G     ID+G   T LP   Y  + ++    IKL        G  
Sbjct: 189 STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY 248

Query: 292 LCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFG 348
            C+  PS A    P L  HF+G           F  P   G  + C A+   D +  I G
Sbjct: 249 TCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIG 307

Query: 349 NFAQSDLFIGYDFDS 363
           NF Q ++ + YD  +
Sbjct: 308 NFQQQNMHVLYDLQN 322


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 94/338 (27%), Positives = 143/338 (42%), Gaps = 23/338 (6%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
           ++D+ SD+ WVQC+PC    C+ QV   Y+P+ S S    SC S  C  L   +  C++ 
Sbjct: 162 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANN 221

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGR 156
           Q C Y   Y D S T G    + +T  ++ N      FGC H   G F+    G++ LG 
Sbjct: 222 Q-CQYLVRYPDGSSTSGAYIADLLTL-DAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 279

Query: 157 TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKT 216
              SL SQ  S+ G N FSYC+    +DS        G     S   VV+  +  ++  T
Sbjct: 280 GPESLLSQTASRYG-NAFSYCIPATASDSGF---FTLGVPRRASSRYVVTPMVRFRQAAT 335

Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
           +Y V L  I+VG       + P   ++G++      +D+    T LP   Y  L    R+
Sbjct: 336 FYGVLLRTITVGG--QRLGVAPAVFAAGSV------LDSRTAITRLPPTAYQALRSAFRS 387

Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
           ++ +     P+     CY    +  I  P ++  FD  A +PL  +             F
Sbjct: 388 SMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAF 443

Query: 336 AMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                D   G+ G+  Q  + + YD     V F+   C
Sbjct: 444 TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 139/348 (39%), Gaps = 26/348 (7%)

Query: 34  PLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT- 90
           P+L     +DT  DL W+QC PC   +CY Q   +++P  S +   + C S  C  L   
Sbjct: 142 PILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRY 201

Query: 91  -VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
              CS+ Q C Y   Y D   T G    + +T  N +    N  FGC H   G F+ +  
Sbjct: 202 GAGCSNNQ-CQYFVDYGDGRATSGTYMVDALTL-NPSTVVMNFRFGCSHAVRGNFSASTS 259

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
           G + LG  R SL SQ  +  G N FSYC VP  + S   S     +G          T L
Sbjct: 260 GTMSLGGGRQSLLSQTAATFG-NAFSYC-VPDPSSSGFLSLGGPADGGGAG--RFARTPL 315

Query: 210 VSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
           V       T Y V L GI VG       + P   + GA+   ++ I      T LP   Y
Sbjct: 316 VRNPSIIPTLYLVRLRGIEVGG--RRLNVPPVVFAGGAVMDSSVII------TQLPPTAY 367

Query: 268 NRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFI 325
             L    R+A+   P     R G   CY       +  P ++  FDGGA V L       
Sbjct: 368 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM- 426

Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              VEG   F   P D  +G  GN  Q    + YD     V F+   C
Sbjct: 427 ---VEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 109/364 (29%), Positives = 157/364 (43%), Gaps = 33/364 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELS 79
           E+V+   +GTP       I DTGSDL WVQC PC     C+ Q  P+++P+ SS+Y  + 
Sbjct: 148 EFVVAVGLGTPAQPSAL-IFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVH 206

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C   QC     +       C Y   Y D S T GVL+ + +    S+       FGCG  
Sbjct: 207 CGEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLAL-TSSRALAGFPFGCGTR 265

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N G F   +  L+GLGR  LSL SQ  +  GA  FSYCL    + +S T  +  G     
Sbjct: 266 NLGDFGRVDG-LLGLGRGELSLPSQAAASFGA-VFSYCL---PSSNSTTGYLTIGATPAT 320

Query: 200 SGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
             G    T+++ K    ++YFV L  I +G       ++P        ++G   +D+G  
Sbjct: 321 DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGY-----ILPV--PPAVFTRGGTLLDSGTV 373

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----IAPILTAHFDGGA 314
            T LP   Y  L ++ R  ++      P      CY     AG    I P ++  F  GA
Sbjct: 374 LTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYD---FAGESEVIVPAVSFRFGDGA 430

Query: 315 --KVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQMVSFK 369
             ++       F+    E V C A   +D     + I GN  Q    + YD  ++ + F 
Sbjct: 431 VFELDFFGVMIFLD---ENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 487

Query: 370 PTDC 373
           P  C
Sbjct: 488 PASC 491


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 159/371 (42%), Gaps = 31/371 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            YV++  +G+P    +  + DT +D  W  C PC  C      ++ PA+SSSY  L C S
Sbjct: 78  SYVVRAGLGSPSQQLLLAL-DTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSS 134

Query: 83  EQCHLLDTVSCSSQQ-------------LCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
             C L    +C + Q              C ++  +AD+S  +  LA++ +  G   +  
Sbjct: 135 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLG--KDAI 191

Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            N  FGC  + TG   N    GL+GLGR  ++L SQ  S L    FSYCL P +     +
Sbjct: 192 PNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGS-LYNGVFSYCL-PSYRSYYFS 249

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             +  G G         +  L +    + Y+V + G+SVG+     K+     +  A + 
Sbjct: 250 GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHA--WVKVPAGSFAFDAATG 307

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGIAPIL 306
               +D+G   T      Y  L E+ R  +   P     LG+   C+ T  + AG AP +
Sbjct: 308 AGTVVDSGTVITRWTAPVYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGGAPAV 366

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFD 362
           T H DGG  + L   +T I      + C AM    Q ++  V +  N  Q ++ + +D  
Sbjct: 367 TVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVA 426

Query: 363 SQMVSFKPTDC 373
           +  V F    C
Sbjct: 427 NSRVGFAKESC 437


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 139/348 (39%), Gaps = 26/348 (7%)

Query: 34  PLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT- 90
           P+L     +DT  DL W+QC PC   +CY Q   +++P  S +   + C S  C  L   
Sbjct: 158 PILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRY 217

Query: 91  -VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
              CS+ Q C Y   Y D   T G    + +T  N +    N  FGC H   G F+ +  
Sbjct: 218 GAGCSNNQ-CQYFVDYGDGRATSGTYMVDALTL-NPSTVVMNFRFGCSHAVRGNFSASTS 275

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
           G + LG  R SL SQ  +  G N FSYC VP  + S   S     +G          T L
Sbjct: 276 GTMSLGGGRQSLLSQTAATFG-NAFSYC-VPDPSSSGFLSLGGPADGGGAG--RFARTPL 331

Query: 210 VSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
           V       T Y V L GI VG       + P   + GA+   ++ I      T LP   Y
Sbjct: 332 VRNPSIIPTLYLVRLRGIEVGG--RRLNVPPVVFAGGAVMDSSVII------TQLPPTAY 383

Query: 268 NRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFI 325
             L    R+A+   P     R G   CY       +  P ++  FDGGA V L       
Sbjct: 384 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM- 442

Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              VEG   F   P D  +G  GN  Q    + YD     V F+   C
Sbjct: 443 ---VEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 110/414 (26%), Positives = 172/414 (41%), Gaps = 59/414 (14%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQC---YKQV 63
           P+   +  +S    +Y + F++G+ P   I   +DTGSDL+W  C P  C+ C   +   
Sbjct: 4   PSPSRRQPISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNAT 63

Query: 64  KPI----------YNPASSSSYKELS----CQSEQCHL--LDTVSCSSQQLCNYTYGYAD 107
           KP+           +PA S+++  +S    C   +C L  ++T  CSS     + Y Y D
Sbjct: 64  KPLNITRSHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGD 123

Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-- 165
            S    +    R T   S  F  N  FGC H           G+ G GR  LSL +Q+  
Sbjct: 124 GSF---IAHLHRDTLSMSQLFLKNFTFGCAHTALA----EPTGVAGFGRGLLSLPAQLAT 176

Query: 166 LSQLGANKFSYCLVPFHTDSSITSK---MYFGNGSEVSGGGV--VSTSLVSKEDKTYYF- 219
           LS    N+FSYCLV    D     K   +  G+  + S   V  V TS++     +Y++ 
Sbjct: 177 LSPNLGNRFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYC 236

Query: 220 VTLEGISVGNLSN-SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQV 274
           V L GISVG  +  + +++   +  G    G + +D+G   T+LP   YN      + +V
Sbjct: 237 VGLTGISVGKRTILAPEMLRRVDRRG---DGGVVVDSGTTFTMLPASLYNSVVAEFDRRV 293

Query: 275 RNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--- 331
               K     + + G   CY    +  + P +T HF G     ++    +    ++G   
Sbjct: 294 GRVHKRASEVEEKTGLGPCYFLEGLVEV-PTVTWHFLGNNSNVMLPRMNYFYEFLDGEDE 352

Query: 332 ----VFCFAMQPIDGDV-------GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
               V C  +     D         I GN+ Q    + YD ++Q V F    C 
Sbjct: 353 ARRKVGCLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCA 406


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/405 (27%), Positives = 177/405 (43%), Gaps = 67/405 (16%)

Query: 24  YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQ-------------- 62
           Y++  ++GTPP ++ +Y  +DTGSDL WV C      C+ C  Y+               
Sbjct: 29  YLISLNLGTPPKVIQVY--MDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSS 86

Query: 63  ------VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVL 115
                 V P+ +   SS      C    C L   V  +  + C ++ Y Y    +  G L
Sbjct: 87  SLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTL 146

Query: 116 ATERIT-FGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLG 170
             + +T  G+S +F     N  FGC     G      +G+ G GR  LSL SQ+   Q G
Sbjct: 147 TRDTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG 202

Query: 171 ANKFSYCLV--PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISV 227
              FS+C +   F  + +I+S +  G+ +  S   +  TSL+       YY++ LE I+V
Sbjct: 203 ---FSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITV 259

Query: 228 GNLS--NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPY 283
           GN +       +  ++S G    G M ID+G   T LP  FY +L   +++ I       
Sbjct: 260 GNATAIQVPSSLREFDSHG---NGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQE 316

Query: 284 QDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLIHTSTF----IPPPVEGV 332
           Q+ R G  LCY+ P    +        P ++ HF     + L   + F     P     V
Sbjct: 317 QEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVV 376

Query: 333 FCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            C  +Q +D    G  G+FG+F Q ++ + YD + + + F+P DC
Sbjct: 377 KCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 177/406 (43%), Gaps = 67/406 (16%)

Query: 24  YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQ-------------- 62
           Y++  ++GTPP ++ +Y  +DTGSDL WV C      C+ C  Y+               
Sbjct: 12  YLISLNLGTPPKVIQVY--MDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSS 69

Query: 63  ------VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVL 115
                 V P+ +   SS      C    C L   V  +  + C ++ Y Y    +  G L
Sbjct: 70  SLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTL 129

Query: 116 ATERIT-FGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLG 170
             + +T  G+S +F     N  FGC     G      +G+ G GR  LSL SQ+   Q G
Sbjct: 130 TRDTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG 185

Query: 171 ANKFSYCLV--PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISV 227
              FS+C +   F  + +I+S +  G+ +  S   +  TSL+       YY++ LE I+V
Sbjct: 186 ---FSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITV 242

Query: 228 GNLS--NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPY 283
           GN +       +  ++S G    G M ID+G   T LP  FY +L   +++ I       
Sbjct: 243 GNATAIQVPSSLREFDSHG---NGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQE 299

Query: 284 QDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLIHTSTF----IPPPVEGV 332
           Q+ R G  LCY+ P    +        P ++ HF     + L   + F     P     V
Sbjct: 300 QEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVV 359

Query: 333 FCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            C  +Q +D    G  G+FG+F Q ++ + YD + + + F+P DC 
Sbjct: 360 KCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 405


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 143/338 (42%), Gaps = 23/338 (6%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
           ++D+ SD+ WVQC+PC    C+ QV   Y+P+ S +    SC S  C  L   +  C++ 
Sbjct: 32  VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANN 91

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGR 156
           Q C Y   Y D S T G    + +T  ++ N      FGC H   G F+    G++ LG 
Sbjct: 92  Q-CQYLVRYPDGSSTSGAYIADLLTL-DAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 149

Query: 157 TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKT 216
              SL SQ  S+ G N FSYC+    +DS        G     S   VV+  +  ++  T
Sbjct: 150 GPESLLSQTASRYG-NAFSYCIPATASDSGF---FTLGVPRRASSRYVVTPMVRFRQAAT 205

Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
           +Y V L  I+VG       + P   ++G++      +D+    T LP   Y  L    R+
Sbjct: 206 FYGVLLRTITVGG--QRLGVAPAVFAAGSV------LDSRTAITRLPPTAYQALRAAFRS 257

Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
           ++ +     P+     CY    +  I  P ++  FD  A +PL  +             F
Sbjct: 258 SMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAF 313

Query: 336 AMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                D   G+ G+  Q  + + YD     V F+   C
Sbjct: 314 TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/377 (29%), Positives = 166/377 (44%), Gaps = 41/377 (10%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N+   +N+   +G +++  + GTP   +I  I+DTGS + W QC  CV C +     ++ 
Sbjct: 114 NHAHNNNLFDEDGNFLVDVAFGTP-XTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDS 172

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           ++SS+Y   SC      +  TV        NY   Y D S + G    + +T    ++ F
Sbjct: 173 SASSTYSFGSC------IPSTVE------NNYNMTYGDDSTSVGNYGCDTMTL-EPSDVF 219

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSIT 188
               FGCG NN G F     G++GLG+ +LS  SQ  S+   NK FSYCL     + SI 
Sbjct: 220 QKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKF--NKVFSYCL---PEEDSIG 274

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSS 243
           S + FG  +      +  TSLV+     ++  YYFV L  ISVGN     +L IP    S
Sbjct: 275 S-LLFGEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGN----ERLNIP----S 325

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSM 299
              +     ID+    T LP+  Y+ L+   + A+   P  + R         CY     
Sbjct: 326 SVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGR 385

Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
             +  P +  HF GGA V L  T+           C A      ++ I GN  Q  L + 
Sbjct: 386 KDVLLPEIVLHFGGGADVRLNGTNIVWGSDAS-RLCLAFAGTS-ELTIIGNRQQLSLTVL 443

Query: 359 YDFDSQMVSFKPTDCTK 375
           YD   + + F    C+K
Sbjct: 444 YDIQGRRIGFGGNGCSK 460


>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 342

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 149/375 (39%), Gaps = 106/375 (28%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N + +S +   NGEY+M+  IGTPP+  +  I DTGSD +WVQC PC  C          
Sbjct: 64  NKLPESILIPNNGEYLMRLYIGTPPVERLV-IADTGSDFIWVQCSPCQNCQ--------- 113

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNS 125
                                        C Y   YA+ S T  V+ TE ++F    G  
Sbjct: 114 -----------------------------CVYLNIYANKSFTIEVVGTETLSFDSTGGAQ 144

Query: 126 NNFFDNVVFGCGHNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
              F N +FGCG NN   F  ++   GLVGL   +LSL SQ+ +Q+G  KFSY       
Sbjct: 145 TVSFPNSIFGCGANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGY-KFSY------- 196

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
                  + FG+ + ++  GVVST L+ K     YF+ LE +++G      K++P     
Sbjct: 197 -------LKFGSEAIITTNGVVSTPLIIKPSLPLYFLNLEVVTIGQ-----KVVP----- 239

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
                                             + +   QD     + C+       + 
Sbjct: 240 -------------------------------TETLGVESVQDLPFPFKFCFPYRDNMTV- 267

Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYD 360
           P +   F  GA V L   +  I      +   A+ P       + IFG  AQ D  + YD
Sbjct: 268 PAIAFQFT-GASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVISIFGIIAQFDFQVLYD 326

Query: 361 FDSQMVSFKPTDCTK 375
            D + VS  PTDCTK
Sbjct: 327 LDGKKVSVAPTDCTK 341


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 158/371 (42%), Gaps = 31/371 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            YV++  +G+P    +  + DT +D  W  C PC  C      ++ PA+SSSY  L C S
Sbjct: 80  SYVVRAGLGSPSQQLLLAL-DTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSS 136

Query: 83  EQCHLLDTVSCSSQQ-------------LCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
             C L    +C + Q              C ++  +AD+S  +  LA++ +  G   +  
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLG--KDAI 193

Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            N  FGC  + TG   N    GL+GLGR  ++L SQ  S L    FSYCL P +     +
Sbjct: 194 PNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGS-LYNGVFSYCL-PSYRSYYFS 251

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             +  G G         +  L +    + Y+V + G+SVG      K+     +  A + 
Sbjct: 252 GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRA--WVKVPAGSFAFDAATG 309

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGIAPIL 306
               +D+G   T      Y  L E+ R  +   P     LG+   C+ T  + AG AP +
Sbjct: 310 AGTVVDSGTVITRWTAPVYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGGAPAV 368

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFD 362
           T H DGG  + L   +T I      + C AM    Q ++  V +  N  Q ++ + +D  
Sbjct: 369 TVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVA 428

Query: 363 SQMVSFKPTDC 373
           +  + F    C
Sbjct: 429 NSRIGFAKESC 439


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 146/348 (41%), Gaps = 25/348 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELS 79
           EYV+   +G+P +     ++DTGSD+ WVQC PC     C+     +++PA+SS+Y   +
Sbjct: 107 EYVISVGLGSPAVTQRV-VIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFN 165

Query: 80  CQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           C +  C  L    +   C ++  C Y   Y D S T G  +++ +T   S +      FG
Sbjct: 166 CSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGS-DVVRGFQFG 224

Query: 136 CGHNNTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C H   G   ++   GL+GLG    S  SQ  ++ G   F YCL      S   +     
Sbjct: 225 CSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYG-KSFFYCLPATPASSGFLTLGAPA 283

Query: 195 NGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
           +G         +T ++ SK+  TYYF  LE I+VG       L P   ++G++      +
Sbjct: 284 SGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGG--KKLGLSPSVFAAGSL------V 335

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
           D+G   T LP   Y  L    R  +      +P      C+    +  ++ P +   F G
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAG 395

Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
           GA V L            G   FA    D   G  GN  Q    + YD
Sbjct: 396 GAVVDLDAHGIV----SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 165/375 (44%), Gaps = 25/375 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S +     +Y  +  +GTP       +VDTGS+L WV C    +  K  + ++    S S
Sbjct: 97  SGIDYGTAQYFTEIRVGTPAK-KFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKS 154

Query: 75  YKELSCQSEQC-----HLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNF 128
           +K + C ++ C     +L    +C +    C+Y Y YAD S  +GV A E IT G +N  
Sbjct: 155 FKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGR 214

Query: 129 FDNV---VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
              +   + GC  + TG   +   G++GL  +  S  S   S  GA KFSYCLV   ++ 
Sbjct: 215 MARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGA-KFSYCLVDHLSNK 273

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSS 243
           ++++ + FG+          +T L       +Y + + GIS+G   L   S++       
Sbjct: 274 NVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD----- 328

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
            A S G   +D+G   TLL    Y ++   + R  ++L   +   +  + C+   S   +
Sbjct: 329 -ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNV 387

Query: 303 A--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGY 359
           +  P LT H  GGA+    H  +++     GV C            + GN  Q +    +
Sbjct: 388 SKLPQLTFHLKGGARFE-PHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEF 446

Query: 360 DFDSQMVSFKPTDCT 374
           D  +  +SF P+ CT
Sbjct: 447 DLMASTLSFAPSACT 461


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 157/364 (43%), Gaps = 33/364 (9%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELS 79
           E+V+   +GTP       I DTGSDL WVQC PC     C+ Q  P+++P+ SS+Y  + 
Sbjct: 143 EFVVAVGLGTPAQPSAL-IFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVH 201

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C   QC     +       C Y   Y D S T GVL+ + +    S+       FGCG  
Sbjct: 202 CGEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLAL-TSSRALTGFPFGCGTR 260

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
           N G F   +  L+GLGR  LSL SQ  +  GA  FSYCL    + +S T  +  G     
Sbjct: 261 NLGDFGRVDG-LLGLGRGELSLPSQAAASFGA-VFSYCL---PSSNSTTGYLTIGATPAT 315

Query: 200 SGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
             G    T+++ K    ++YFV L  I +G       ++P        ++G   +D+G  
Sbjct: 316 DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGY-----VLPV--PPAVFTRGGTLLDSGTV 368

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----IAPILTAHFDGGA 314
            T LP   Y  L ++ R  ++      P      CY     AG    + P ++  F  GA
Sbjct: 369 LTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYD---FAGESEVVVPAVSFRFGDGA 425

Query: 315 --KVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFK 369
             ++       F+    E V C A   +D     + I GN  Q    + YD  ++ + F 
Sbjct: 426 VFELDFFGVMIFLD---ENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 482

Query: 370 PTDC 373
           P  C
Sbjct: 483 PASC 486


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 101/350 (28%), Positives = 151/350 (43%), Gaps = 50/350 (14%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN 100
           IVDTGSDL+W QC          +    P S ++       +  C               
Sbjct: 56  IVDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPARTGAFTRTC--------------- 100

Query: 101 YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLS 160
                  S+   GVLA+E  TFG        + FGCG  + G       G++GL    LS
Sbjct: 101 -----TASAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLI-GATGILGLSPESLS 154

Query: 161 LASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG----VVSTSLVSKEDKT 216
           L    ++QL   +FSYCL PF      TS + FG  +++S       + +T++VS   +T
Sbjct: 155 L----ITQLKIQRFSYCLTPFADKK--TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVET 208

Query: 217 -YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEE 272
            YY+V L GIS+G+     K +    +S A+     G   +D+G+    L +  +  ++E
Sbjct: 209 VYYYVPLVGISLGH-----KRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKE 263

Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLIHTSTFI 325
            V + ++L          +LC+  P     A       P L  HFDGGA + L   + F 
Sbjct: 264 AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYF- 322

Query: 326 PPPVEGVFCFAM-QPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             P  G+ C A+ +  DG  V I GN  Q ++ + +D      SF PT C
Sbjct: 323 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/379 (27%), Positives = 164/379 (43%), Gaps = 47/379 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI----YNPASSSSYKE 77
           G Y  K  +GTP   D +  VDTGSD++WV C  C++C ++   +    Y+  +SS+ K 
Sbjct: 83  GLYFAKIGLGTPSR-DFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKS 141

Query: 78  LSCQSEQCHLLDTVS-CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN- 131
           +SC    C  ++  S C S   C Y   Y D S T G L  + +      GN      N 
Sbjct: 142 VSCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNG 201

Query: 132 -VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSS 186
            ++FGCG   +G   E++    G++G G++  S  SQ+ SQ    + F++CL       +
Sbjct: 202 TIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL------DN 255

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSG 244
                 F  G EV    V +T ++SK    +Y V L  I VGN  L  SS      +  G
Sbjct: 256 NNGGGIFAIG-EVVSPKVKTTPMLSKS--AHYSVNLNAIEVGNSVLQLSSDAFDSGDDKG 312

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYKTPSMAG 301
            I      ID+G     LP   YN L  Q+      + L   QD    S  C+       
Sbjct: 313 VI------IDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQD----SFTCFHYIDRLD 362

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ------PIDGDVGIFGNFAQSDL 355
             P +T  FD    +  ++   ++    E  +CF  Q           + I G+ A S+ 
Sbjct: 363 RFPTVTFQFDKSVSLA-VYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421

Query: 356 FIGYDFDSQMVSFKPTDCT 374
            + YD ++Q++ +   +C+
Sbjct: 422 LVVYDIENQVIGWTNHNCS 440


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 165/375 (44%), Gaps = 25/375 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
           S +     +Y  +  +GTP       +VDTGS+L WV C    +  K  + ++    S S
Sbjct: 75  SGIDYGTAQYFTEIRVGTPAK-KFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKS 132

Query: 75  YKELSCQSEQC-----HLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNF 128
           +K + C ++ C     +L    +C +    C+Y Y YAD S  +GV A E IT G +N  
Sbjct: 133 FKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGR 192

Query: 129 FDNV---VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
              +   + GC  + TG   +   G++GL  +  S  S   S  GA KFSYCLV   ++ 
Sbjct: 193 MARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGA-KFSYCLVDHLSNK 251

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSS 243
           ++++ + FG+          +T L       +Y + + GIS+G   L   S++       
Sbjct: 252 NVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD----- 306

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
            A S G   +D+G   TLL    Y ++   + R  ++L   +   +  + C+   S   +
Sbjct: 307 -ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNV 365

Query: 303 A--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGY 359
           +  P LT H  GGA+    H  +++     GV C            + GN  Q +    +
Sbjct: 366 SKLPQLTFHLKGGARFE-PHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEF 424

Query: 360 DFDSQMVSFKPTDCT 374
           D  +  +SF P+ CT
Sbjct: 425 DLMASTLSFAPSACT 439


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 101/344 (29%), Positives = 146/344 (42%), Gaps = 30/344 (8%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
           I+D+GSD+ WVQC PC    C++Q  P+++PA S++Y  + C S  C  L      CS+ 
Sbjct: 171 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 230

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
             C +   Y D S   G  + + +T G   +      FGC H + G  F+ +  G + LG
Sbjct: 231 AQCQFGINYGDGSTATGTYSFDDLTLG-PYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 289

Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
               SL  Q  ++ G   FSYCL P  T SS+   +  G   E +      VST L+S  
Sbjct: 290 GGSQSLVQQTATRYG-RVFSYCLPP--TASSL-GFLVLGVPPERAQLIPSFVSTPLLSSS 345

Query: 214 -DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
              T+Y V L  I V   +     +P      A+   +  ID+    + LP   Y  L  
Sbjct: 346 MAPTFYRVLLRAIIV---AGRPLAVP-----PAVFSASSVIDSSTIISRLPPTAYQALRA 397

Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
             R+A+ +     P      CY    +  I  P +   FDGGA V L      +      
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------ 451

Query: 332 VFCFAMQPIDGDV--GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             C A  P   D   G  GN  Q  L + YD  ++ + F+   C
Sbjct: 452 GSCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 170/383 (44%), Gaps = 50/383 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY     +G+P    I  IVDTGS+L W+QCLPC  C   V  IY+ A S+SY+ ++C 
Sbjct: 98  GEYYTSIKLGSPGQEAIL-IVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCN 156

Query: 82  SEQCHLLDTVS------CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN 131
           + Q  L    S      C+    C +   Y D S + G L+T+ +      G       +
Sbjct: 157 NSQ--LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGC   +  +      G++GL   +++L  Q+  + G  KFS+C     +  + T  +
Sbjct: 215 FAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGW-KFSHCFPDRSSHLNSTGVV 273

Query: 192 YFGNGSEVSGGGVVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
           +FGN +E+    V  TS+    S+  + +Y V L+G+S+    NS +L+  +   G++  
Sbjct: 274 FFGN-AELPHEQVQYTSVALTNSELQRKFYHVALKGVSI----NSHELV--FLPRGSV-- 324

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ------LCYKTPS---- 298
             + +D+G+  +   + F+++L E     +K  P     L          C+K  +    
Sbjct: 325 --VILDSGSSFSSFVRPFHSQLREAF---LKHRPPSLKHLEGDSFGDLGTCFKVSNDDID 379

Query: 299 -MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDGD---VGIFGNFA 351
            +    P L+  F+ G  + +      +P          CFA +  DG    V + GN+ 
Sbjct: 380 ELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFE--DGGPNPVNVIGNYQ 437

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
           Q +L++ YD     V F    C 
Sbjct: 438 QQNLWVEYDIQRSRVGFARASCV 460


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 106/405 (26%), Positives = 168/405 (41%), Gaps = 57/405 (14%)

Query: 13  VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP----- 65
           + S   T  G+Y ++F +GTP  P L +    DTGSDL WV+C         + P     
Sbjct: 86  LTSGAYTGIGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCRRPASANSSLSPADSGP 142

Query: 66  ----IYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLAT 117
                + P  S ++  +SC S+ C         +C +    C Y Y Y D S  +G + T
Sbjct: 143 GPGRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGT 202

Query: 118 ERITFGNSNN-----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
           E  T   S           +V GC  + TG   E   G++ LG + +S AS   S+ G  
Sbjct: 203 ESATIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFG-G 261

Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-------------LVSKEDKTYYF 219
           +FSYCLV   +  + TS + FG    VS      +S             L+ +  + +Y 
Sbjct: 262 RFSYCLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYD 321

Query: 220 VTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI 278
           V+L+ ISV G      + +    + G +      +D+G   T+L K  Y  +   +   +
Sbjct: 322 VSLKAISVAGEFLKIPRAVWDVEAGGGV-----ILDSGTSLTVLAKPAYRAVVAALSKGL 376

Query: 279 KLTPY--QDPRLGSQLCYKTPSMAG-----IAPILTAHFDGGAKVPLIHTSTFIPPPVEG 331
              P    DP    + CY   S +G       P +  HF G A++     S ++     G
Sbjct: 377 AGLPRVTMDP---FEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKS-YVIDAAPG 432

Query: 332 VFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           V C  +Q  P  G + + GN  Q +    +D  ++ + F+ + CT
Sbjct: 433 VKCIGLQEGPWPG-ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 476


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 168/385 (43%), Gaps = 52/385 (13%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G+PP   +  ++DTGS+L W+ C    +    +  ++NP SSSSY  + C
Sbjct: 37  NVTLTVSLTVGSPPQ-QVTMVLDTGSELSWLHC----KKSPNLTSVFNPLSSSSYSPIPC 91

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C      L + V+C  ++LC+    YAD+S  +G LA++    G+S       +FG
Sbjct: 92  SSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSA--LPGTLFG 149

Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C   G ++    +    GL+G+ R  LS     ++QLG  KFSYC+     DSS    + 
Sbjct: 150 CMDSGFSSNSEEDAKTTGLMGMNRGSLSF----VTQLGLPKFSYCIS--GRDSS--GVLL 201

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA- 245
           FG+      G +  T LV         D+  Y V L+GI VGN     K++P   S  A 
Sbjct: 202 FGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGN-----KILPLPKSIFAP 256

Query: 246 --ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTP 297
                G   +D+G   T L    Y  L  +     K  L P  DP    Q    LCY+ P
Sbjct: 257 DHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVP 316

Query: 298 SMAGIA--PILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFG 348
           +   +   P ++  F G   V     L++    +    E V+C      D    +  + G
Sbjct: 317 AGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIG 376

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
           +  Q ++++ +D     V F  T C
Sbjct: 377 HHHQQNVWMEFDLVKSRVGFVETRC 401


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 171/386 (44%), Gaps = 71/386 (18%)

Query: 22  GEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQC--YKQVK-PI--YNPASSSSY 75
           G Y  +  +GTPP    Y + VDTGSDL+WV C PC+ C  +  +K PI  Y+  +S+S 
Sbjct: 34  GLYFTQVQLGTPP--RTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASS 91

Query: 76  KELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            ++ C    C L+  +S   C+ Q  C Y++ Y D S T G L  + + +    N    V
Sbjct: 92  SKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY--MVNATATV 149

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           +FGCG   +G  + +E    G++G G + LS  SQ+  Q    N F++CL          
Sbjct: 150 IFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL---------- 199

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTY---------YFVTLEGISV--GNLSNSSKLI 237
                 +G E  GGG++    V + D  Y         Y V L+ ISV   NL+   KL 
Sbjct: 200 ------DGGE-RGGGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLF 252

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
                S  + +G +F D+G     LP + Y    + V   +      D RL S+  YK  
Sbjct: 253 -----SNDVMQGTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRL-SRFIYK-- 303

Query: 298 SMAGIAPILTAHFDGGAKVP-----LIHTSTFIPPPVEGVFCFAMQPI-----DGDVGIF 347
               + P +  +F+G +        LI  ++    P   ++C   Q +     +    IF
Sbjct: 304 ----LFPNVVLYFEGASMTLTPAEYLIRQASAANAP---IWCMGWQSMGSAESELQYTIF 356

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G+    +  + YD +   + ++P DC
Sbjct: 357 GDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 173/385 (44%), Gaps = 69/385 (17%)

Query: 22  GEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQC--YKQVK-PI--YNPASSSSY 75
           G Y  +  +GTPP    Y + VDTGSDL+WV C PC+ C  +  +K PI  Y+  +S+S 
Sbjct: 34  GLYFTQVQLGTPP--RTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASS 91

Query: 76  KELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            ++ C    C L+  +S   C+ Q  C Y++ Y D S T G L  + + +    N    V
Sbjct: 92  SKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY--MVNATATV 149

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           +FGCG   +G  + +E    G++G G + LS  SQ+  Q    N F++CL          
Sbjct: 150 IFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL---------- 199

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK--------TYYFVTLEGISV--GNLSNSSKLIP 238
                 +G E  GG +V  +++  + +        ++Y V L+ ISV   NL+   KL  
Sbjct: 200 ------DGGERGGGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLF- 252

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
               S  + +G +F D+G     LP + Y    + V   +      D RL S+  YK   
Sbjct: 253 ----SNDVMQGTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRL-SRFIYK--- 303

Query: 299 MAGIAPILTAHFDGGAKVP-----LIHTSTFIPPPVEGVFCFAMQPI-----DGDVGIFG 348
              + P +  +F+G +        LI  ++    P   ++C   Q +     +    IFG
Sbjct: 304 ---LFPNVVLYFEGASMTLTPAEYLIRQASAANAP---IWCMGWQSMGSAESELQYTIFG 357

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
           +    +  + YD +   + ++P DC
Sbjct: 358 DLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 94/171 (54%), Gaps = 8/171 (4%)

Query: 12  VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
            V S +S  +GEY M+  +GTP   ++Y ++DTGSD++W+QC PC  CY Q   I++P  
Sbjct: 123 AVISGLSQGSGEYFMRLGVGTPAT-NVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKK 181

Query: 72  SSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           S ++  + C S  C  LD  S C +++   C Y   Y D S T+G  +TE +TF  +   
Sbjct: 182 SKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR-- 239

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            D+V  GCGH+N G+F      L       LS  SQ  ++    KFSYCLV
Sbjct: 240 VDHVPLGCGHDNEGLFVGAAGLLGLGR-GGLSFPSQTKNRYNG-KFSYCLV 288


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 95/169 (56%), Gaps = 11/169 (6%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
            S  +G Y +K   G+P       IVDTGS L W+QC PCV  C+ Q  P+++P++S +Y
Sbjct: 111 ASIGSGNYYVKVGFGSPARYYSM-IVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 169

Query: 76  KELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           K LSC S QC  L      + +  +S  +C YT  Y DSS + G L+ + +T   S    
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQT-L 228

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
              V+GCG ++ G+F     G++GLGR +LS+  Q+ S+ G   FSYCL
Sbjct: 229 PGFVYGCGQDSDGLFGR-AAGILGLGRNKLSMLGQVSSKFGY-AFSYCL 275


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 105/356 (29%), Positives = 155/356 (43%), Gaps = 32/356 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           EYV+   IG+P +     ++DTGSD+ WV+C            +++P+ S++Y   SC S
Sbjct: 128 EYVITVGIGSPAVTQTM-MIDTGSDVSWVRC-----NSTDGLTLFDPSKSTTYAPFSCSS 181

Query: 83  EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
             C  L  +   CS+   C Y   Y D S T G  +++ +    S+   D   FGC H+ 
Sbjct: 182 AACAQLGNNGDGCSNSG-CQYRVQYGDGSNTTGTYSSDTLALSASDTVTD-FHFGCSHHE 239

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
                E   GL+GLG    SL SQ  +  G   FSYCL P +  S     + FG  +  S
Sbjct: 240 EDFDGEKIDGLMGLGGDAQSLVSQTAATYG-KSFSYCLPPTNRTSGF---LTFGAPNGTS 295

Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
           GG V +  L   +  T Y V L+ ISVG         P       +S G++ +D+G   T
Sbjct: 296 GGFVTTPMLRWPKAPTLYGVLLQDISVGG-------TPLGIQPSVLSNGSV-MDSGTVIT 347

Query: 261 LLPKDFYNRLEEQVRNAI-KLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAHFDGGAKVP 317
            LP+  Y+ L    R+++ +L   +   LG    CY    +  ++ P ++   DGGA V 
Sbjct: 348 WLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVD 407

Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           L      I        C A     GD  I GN  Q    + +D    +  F+   C
Sbjct: 408 LDGNGIMIQD------CLAFAATSGD-SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 110/402 (27%), Positives = 170/402 (42%), Gaps = 62/402 (15%)

Query: 24  YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQVK------------ 64
           Y++  SIGTPP ++ +Y  +DTGSDL W  C      C++C  Y+  +            
Sbjct: 80  YLISLSIGTPPQVIQVY--MDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSS 137

Query: 65  --------PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC---NYTYG---YADSSL 110
                   P      SS      C    C L   V  +    C    YTYG       +L
Sbjct: 138 SHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTL 197

Query: 111 TKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
           T+  L       G +        FGC  ++        +G+ G GR  LSL SQ+     
Sbjct: 198 TRDTLRVHGRNLGVTQEI-PRFCFGCVASSY----REPIGIAGFGRGALSLPSQL--GFL 250

Query: 171 ANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISV 227
              FS+C + F    + +I+S +  G+ +  S   +  T ++ S     YY+V LE I+V
Sbjct: 251 RKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITV 310

Query: 228 GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP- 286
           GN+S ++++        ++  G M +D+G   T LP+ FY+++   +++ I      D  
Sbjct: 311 GNVS-ATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDME 369

Query: 287 -RLGSQLCYKTPSM------AGIAPILTAHFDGGAKVPLIHTSTF----IPPPVEGVFCF 335
            R G  LCYK P          + P +T HF   A + L   S F     P     V C 
Sbjct: 370 MRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCL 429

Query: 336 AMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             Q +D    G  G+ G+F Q D+ + YD + + + F+P DC
Sbjct: 430 LFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 173/379 (45%), Gaps = 30/379 (7%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P + V   +      ++   SIG PP  ++Y ++DTGSDL W+QC PC  CYKQ  PIYN
Sbjct: 91  PADFVPPPLIRDKSAFLANLSIGNPPT-NVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYN 149

Query: 69  PASSSSYKELSCQSEQCHLLDTV-SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
              S SY E+ C    C  L     CS    C Y   YAD S T G+L+ E++ F +  +
Sbjct: 150 RTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYS 209

Query: 128 FFDN---VVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLG--ANKFSYCLVPF 181
             D    V FGCG  N   V +  + G++GLG   +SL SQ LS +G  +  F+YC    
Sbjct: 210 DEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQ-LSAIGKVSKSFAYCFGNL 268

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
            ++ +    + FG+ + ++G     T +V  E   +Y+V L GI +G       +    N
Sbjct: 269 -SNPNAGGFLVFGDATYLNGD---MTPMVIAE---FYYVNLLGIGLGVEEPRLDI----N 317

Query: 242 SSGAISK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY--K 295
           SS    K    G + ID+G+  ++ P + Y  +   V + +K      P   S  C+  K
Sbjct: 318 SSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGK 377

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
                 + P L  + +    +     S F+    E +FC      +G + I G  AQ   
Sbjct: 378 IGRDLPLFPTLVLYLESTGILN-DRWSIFLQRYDE-LFCLGFTSGEG-LSIIGTLAQQSY 434

Query: 356 FIGYDFDSQMVSFKPT-DC 373
             GY+ +   +S +   DC
Sbjct: 435 KFGYNLELSTLSIESNPDC 453


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 110/404 (27%), Positives = 176/404 (43%), Gaps = 65/404 (16%)

Query: 9   PNNVVQSNVSTANGEYVMKFS--------IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY 60
           PNN  Q+   + N ++  K+S        IGTPP      ++DTGS L W+QC      +
Sbjct: 53  PNNP-QNKTPSYNYKFSFKYSMALIINLPIGTPPQTQPM-VLDTGSQLSWIQC------H 104

Query: 61  KQVKPI--YNPASSSSYKELSCQSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKG 113
           K+  P   ++P+ SS++  L C    C           SC   +LC+Y+Y YAD +  +G
Sbjct: 105 KKQPPTASFDPSLSSTFSILPCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEG 164

Query: 114 VLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
            L  E+ TF  S +    ++ GC   +T     +  G++G+   RLS A Q  S++   K
Sbjct: 165 NLVREKFTFSRSVS-TPPLILGCATEST-----DPRGILGMNLGRLSFAKQ--SKI--TK 214

Query: 174 FSYCLVPFHTDSSI--TSKMYFGNGSEVSGGGVVSTSLVSKE-----DKTYYFVTLEGIS 226
           FSYC+ P  T      T   Y GN     G   V     S++     D   Y + + GI 
Sbjct: 215 FSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIR 274

Query: 227 V-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD 285
           + G   N S  +   ++ G+   G   ID+G+  T L  + Y+++  QV  A+       
Sbjct: 275 IAGKKLNISPAVFRADAGGS---GQTMIDSGSEFTYLVSEAYDKVRAQVVRAV------G 325

Query: 286 PRLG--------SQLCYKTPSMAGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFC 334
           PRL         + +C+ +     I  +   +   F+ G +V +I     +     GV C
Sbjct: 326 PRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFERGVEV-VIPKERVLADVGGGVHC 384

Query: 335 FAMQPID---GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
             +   D       I GNF Q +L++ +D   + V F   DC++
Sbjct: 385 VGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKADCSR 428


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 109/396 (27%), Positives = 171/396 (43%), Gaps = 72/396 (18%)

Query: 41  IVDTGSDLMWVQCLPC----------VQCYKQVKPIYNPASSSSYKELSCQSEQCHLL-- 88
           +VDTGSDL+W QC  C            C+ Q  P YN + S + + + C  +   L   
Sbjct: 77  VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136

Query: 89  --DTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
             +T  C     S    C     Y  + +  GVL T+  TF +S++    + FGC     
Sbjct: 137 APETAGCARGGGSGDDACVVAASYG-AGVALGVLGTDAFTFPSSSSV--TLAFGCVSQTR 193

Query: 142 ---GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
              G  N    G++GLGR  LSL    +SQL A +FSYCL P+  D+   S ++ G+G  
Sbjct: 194 ISPGALN-GASGIIGLGRGALSL----VSQLNATEFSYCLTPYFRDTVSPSHLFVGDGEL 248

Query: 199 VSGGGV----------VSTSLVSKEDK-----TYYFVTLEGISVGNLSNS--SKLIPYYN 241
                           V+T   +K  K     T+Y++ L G++ GN + +  +       
Sbjct: 249 AGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLRE 308

Query: 242 SSGAISKGNMFIDTGAPPTLL----PKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYK 295
           ++  +  G   ID+G+P T L     +     L  Q+R +  L P    +LG   +LC +
Sbjct: 309 AAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVP-PPAKLGGALELCVE 367

Query: 296 T----PSMAGIA-PILTAHFD---GGAKVPLIHTSTFIPPPVEGVFCFAM---------Q 338
                 S+A  A P L   FD   GG +  +I    +        +C A+          
Sbjct: 368 AGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATL 427

Query: 339 PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           P + +  I GNF Q D+ + YD  + ++SF+P +C+
Sbjct: 428 PTN-ETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 171/380 (45%), Gaps = 32/380 (8%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
           P + V   +      ++   SIG PP  ++Y ++DTGSDL W+QC PC  CYKQ  PIYN
Sbjct: 78  PADFVPPPLIRDKSAFLANLSIGNPPT-NVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYN 136

Query: 69  PASSSSYKELSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
              S SY E+ C    C  L     CS    C Y   YAD + T G+L+ E++ F +  +
Sbjct: 137 RTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYS 196

Query: 128 FFDN---VVFGCGHNNTGVFNEN-EMGLVGLGRTRLSLASQILSQLG--ANKFSYCLVPF 181
             D    V FGCG  N      N + G++GLG   +SL SQ LS +G  +  F+YC    
Sbjct: 197 DEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQ-LSAIGKVSKSFAYCFGNI 255

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLS-NSSKL 236
            ++ +    + FG+ + ++G     T +V  E   +Y+V L GI +G     L  NSS  
Sbjct: 256 -SNPNAGGFLVFGDATYLNGD---MTPMVIAE---FYYVNLLGIGLGVGEPRLDINSSSF 308

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-- 294
               + SG +      ID+G+  ++ P + Y  +   V + +K      P   S  C+  
Sbjct: 309 ERKPDGSGGV-----IIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEG 363

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
           K      + P L  + +      L    +      + +FC      +G + I G  AQ  
Sbjct: 364 KIERDLPLFPTLVLYLESTGI--LNDRWSIFLQRYDELFCLGFTSGEG-LSIIGTLAQQS 420

Query: 355 LFIGYDFDSQMVSFKPT-DC 373
              GY+ +   +S +   DC
Sbjct: 421 YKFGYNLELSTLSIESNPDC 440


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/382 (27%), Positives = 160/382 (41%), Gaps = 46/382 (12%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
           M+  IGTPP  ++  +VDT S+L WVQ   C  C     P +NP  SSS+    C S  C
Sbjct: 1   MQTKIGTPPR-EVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVC 59

Query: 86  HLLDTVSCSSQQLCNYTYG-------YADSSLTKGVLATERI---TFGNSNNFFDNVVFG 135
             L       Q  CN + G       Y D S   GV+A E     ++  + +   +V+FG
Sbjct: 60  --LGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFG 117

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG---ANKFSYCLVPFHTDSSITSKMY 192
           C   +     +   G +GL R   S  +QI S+     +++FSYC        + +  + 
Sbjct: 118 CASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVII 177

Query: 193 FGNGSEVSGGGVVSTSLVSKEDK-------TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
           FG+    SG        +S E +        +Y+V L+GISVG      +L+    S+  
Sbjct: 178 FGD----SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGG-----ELLHIPRSAFK 228

Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           I +   G  + D+G   + L +  +  L E   R  + L          +LCY   +   
Sbjct: 229 IDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDA 288

Query: 302 ---IAPILTAHFDGGAKVPLIHTSTFIP----PPVEGV---FCFAMQPIDGDVGIFGNFA 351
               AP++T HF     + L   S ++P    P V  +   F  A     G V + GN+ 
Sbjct: 289 RLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQ 348

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
           Q D  I +D +   + F P +C
Sbjct: 349 QQDYLIEHDLERSRIGFAPANC 370


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/145 (37%), Positives = 82/145 (56%), Gaps = 7/145 (4%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           V S +   +GEY     +GTP    +  ++DTGSDL+W+QC PC +CY Q   +++P  S
Sbjct: 75  VFSGIPFESGEYFALVGVGTPSTKAML-VIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133

Query: 73  SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
           S+Y+ + C S QC  L    C S       C Y   Y D S + G LAT+++ F N + +
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN-DTY 192

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVG 153
            +NV  GCG +N G+F ++  GL+G
Sbjct: 193 VNNVTLGCGRDNEGLF-DSAAGLLG 216


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 162/367 (44%), Gaps = 41/367 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +G +++  + GTPP      I+DTGS + W QC PCV+C K  +  ++P++S +Y   SC
Sbjct: 159 DGNFLVDVAFGTPPQ-KFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSC 217

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
                 +  TV  +      Y   Y D S + G    + +T  +S + F    FGCG NN
Sbjct: 218 ------IPSTVGNT------YNMTYGDKSTSVGNYGCDTMTLEHS-DVFPKFQFGCGRNN 264

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
            G F     G++GLG+ +LS  SQ  S+     FSYCL     + SI S + FG  +   
Sbjct: 265 EGDFGSGADGMLGLGQGQLSTVSQTASKF-KKVFSYCL---PEEDSIGS-LLFGEKATSQ 319

Query: 201 GGGVVSTSLVSK------EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNMFI 253
              +  TSLV+       E+  YYFV L  ISVGN     +L IP    S   +     I
Sbjct: 320 SSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGN----KRLNIP----SSVFASPGTII 371

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILTA 308
           D+G   T LP+  Y+ L+   + A+   P  + R         CY       +  P +  
Sbjct: 372 DSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVL 431

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           HF  GA V L +    I        C A    + ++ I GN  Q  L + YD     + F
Sbjct: 432 HFGEGADVRL-NGKRVIWGNDASRLCLAFAG-NSELTIIGNRQQVSLTVLYDIQGGRIGF 489

Query: 369 KPTDCTK 375
               C+K
Sbjct: 490 GGNGCSK 496


>gi|358345193|ref|XP_003636666.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502601|gb|AES83804.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 161

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/126 (48%), Positives = 80/126 (63%), Gaps = 5/126 (3%)

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIAPILTAH 309
           M ID+G P ++LP+DF++RL EQVR  + L P   DP LG QLCY+TP+     P L AH
Sbjct: 1   MLIDSGTPISILPEDFFHRLLEQVRKKVALEPMPFDPSLGYQLCYRTPTNLK-GPTLVAH 59

Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
           F+G A V L  T  FIP    G+FCFA       + G +G++ QS+  IG+D + Q+VSF
Sbjct: 60  FEG-ADVLLTPTQIFIPVQY-GIFCFAFTSSFSNEYGTYGSYVQSNYLIGFDLEKQVVSF 117

Query: 369 KPTDCT 374
           K TDCT
Sbjct: 118 KATDCT 123


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 141/320 (44%), Gaps = 40/320 (12%)

Query: 78  LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV---- 133
           + C    C  +   SC     C Y Y Y D ++T GV ATER TF +S            
Sbjct: 1   MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60

Query: 134 -FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            FGCG  N G  N N  G+VG GR  LSL    +SQL   +FSYCL  +   S   S + 
Sbjct: 61  GFGCGSVNVGSLN-NGSGIVGFGRNPLSL----VSQLSIRRFSYCLTSYA--SRRQSTLL 113

Query: 193 FGNGSEVSGG---GVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           FG+ S+   G   G V T+  L S ++ T+Y+V   G++VG     ++ +    S+ A+ 
Sbjct: 114 FGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVG-----ARRLRIPESAFALR 168

Query: 248 ---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAG 301
               G + +D+G   TLLP      +    R  ++L P+    +P  G  +C+  P+   
Sbjct: 169 PDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRL-PFANGGNPEDG--VCFLVPAAWR 225

Query: 302 IA--------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQS 353
            +        P +  HF  GA + L   +  +     G  C  +     D    GN  Q 
Sbjct: 226 RSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQ 284

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
           D+ + YD +++ +S  P  C
Sbjct: 285 DMRVLYDLEAETLSIAPARC 304


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 164/379 (43%), Gaps = 33/379 (8%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           + S   +  G+Y +K  +GTP + +   + DTGSDL WV+C       +    ++ P +S
Sbjct: 105 MSSGAYSGTGQYFVKLRVGTP-VQEFTLVADTGSDLTWVKCAGASPPGR----VFRPKTS 159

Query: 73  SSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSL-TKGVLATERITF---GN 124
            S+  + C S+ C L       +CSS    C Y Y Y + S   +G++ TE  T    G 
Sbjct: 160 RSWAPIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGG 219

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
                 +VV GC  ++ G    +  G++ LG  ++S A+Q  ++ G + FSYCLV     
Sbjct: 220 KVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGS-FSYCLVDHLAP 278

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            + T  + FG G +V       T L    +  +Y V ++ I V     + K +       
Sbjct: 279 RNATGYLAFGPG-QVPRTPATQTKLFLDPEMPFYGVKVDAIHV-----AGKALDIPAEVW 332

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCY----KTPSM 299
               G + +D+G   T+L    Y  +   +   +   P    P    + CY    + P  
Sbjct: 333 DAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFPPF--EHCYNWTARRPGA 390

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLF 356
             I P L   F G A++     S ++     GV C  +Q  +G+   + + GN  Q +  
Sbjct: 391 PEIIPKLAVQFAGSARLEPPAKS-YVIDVKPGVKCIGVQ--EGEWPGLSVIGNIMQQEHL 447

Query: 357 IGYDFDSQMVSFKPTDCTK 375
             +D  +  V FK ++CT+
Sbjct: 448 WEFDLKNMQVRFKQSNCTR 466


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 169/389 (43%), Gaps = 41/389 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-----KPIY 67
           + S   T  G+Y ++  +GTP    +  + DTGSDL WV+C               + ++
Sbjct: 93  LTSGAYTGTGQYFVRLRVGTPAQPFVL-VADTGSDLTWVKCSSPSSSSSSPAASPPQRVF 151

Query: 68  NPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFG 123
            PA S S+  L C S+ C         +CSS    C+Y Y Y D+S  +GV+  +  T  
Sbjct: 152 RPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVS 211

Query: 124 NSNN------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
            S N          VV GC  +  G   ++  G++ LG + +S AS+  S+ G  +FSYC
Sbjct: 212 LSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFG-GRFSYC 270

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVV--STSLVSKED---KTYYFVTLEGISVG--NL 230
           LV      + TS + FGNG    G       T LV  ED   + +YFV+++ ++V    L
Sbjct: 271 LVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERL 330

Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRL 288
                +  +  + GAI      +D+G   T+L    Y+ + + +       P    DP  
Sbjct: 331 EILPDVWDFRKNGGAI------LDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDP-- 382

Query: 289 GSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVG 345
             + CY    ++   P +   F G A +     S ++     GV C  +  ++G    V 
Sbjct: 383 -FEYCYNWTGVSAEIPRMELRFAGAATLAPPGKS-YVIDTAPGVKCIGV--VEGAWPGVS 438

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + GN  Q +    +D  ++ + FK + C 
Sbjct: 439 VIGNILQQEHLWEFDLANRWLRFKQSRCA 467


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 169/384 (44%), Gaps = 52/384 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           GEY     +G+P    I  IVDTGS+L W++CLPC  C   V  IY+ A S SYK ++C 
Sbjct: 98  GEYYTSIKLGSPGQEAIL-IVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCN 156

Query: 82  SEQCHLLDTVS------CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN 131
           + Q  L    S      C+    C +   Y D S + G L+T+ +      G       +
Sbjct: 157 NSQ--LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
             FGC   +  +      G++GL   +++L  Q+  + G  KFS+C     +  + T  +
Sbjct: 215 FAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGW-KFSHCFPDRSSHLNSTGVV 273

Query: 192 YFGNGSEVSGGGVVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
           +FGN +E+    V  TS+    S+  + +Y V L+G+S+    NS +L+        + +
Sbjct: 274 FFGN-AELPHEQVQYTSVALTNSELQRKFYHVALKGVSI----NSHELV-------LLPR 321

Query: 249 GNMFI-DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ------LCYKTPS--- 298
           G++ I D+G+  +   + F+++L E     +K  P     L          C+K  +   
Sbjct: 322 GSVVILDSGSSFSSFVRPFHSQLREAF---LKHRPPSLKHLEGDSFGDLGTCFKVSNDDI 378

Query: 299 --MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDGD---VGIFGNF 350
             +    P L+  F+ G  + +      +P          CFA +  DG    V + GN+
Sbjct: 379 DELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFE--DGGPNPVNVIGNY 436

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
            Q +L++ YD     V F    C 
Sbjct: 437 QQQNLWVEYDIQRSRVGFARASCV 460


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 116/435 (26%), Positives = 186/435 (42%), Gaps = 82/435 (18%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC-----LPCVQCYKQVK 64
           ++++   +  +G Y++  ++GTPP +  +Y  +DTGSDL WV C       C+ C   VK
Sbjct: 13  DIIEPVTAYTDG-YLLSLNLGTPPQVFQVY--LDTGSDLTWVPCGSSSSYQCLDCGSSVK 69

Query: 65  PI--YNPASSSSYKELSCQSEQC-------HLLDTVSCSSQQLCNYT------------Y 103
           P   + P+ S+S     C S  C       +  D  + +   +  +T            Y
Sbjct: 70  PTPTFLPSESTSNTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSY 129

Query: 104 GYADSSLTKGVLATERITFGNSNN-----------FFDNVVFGCGHNNTGVFNENEMGLV 152
            Y   +L  G L+ + +T   S +            F    FGC     G      +G+ 
Sbjct: 130 TYGGGALVLGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGC----VGSSIREPLGIA 185

Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGN----GSEVSGGGVVS 206
           G GR  LSL SQ L  LG   FS+C + F    + + TS +  G+     +   GG V +
Sbjct: 186 GFGRGALSLPSQ-LGFLG-KGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFT 243

Query: 207 TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTGAPPTLLPK 264
             L S     +Y+V LEG+ +G+    S +    + SG  ++GN  + +DTG   T LP 
Sbjct: 244 PMLTSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPD 303

Query: 265 DFYNRLEEQVRNAIKLTPYQ-----DPRLGSQLCYKTP-SMAGIA----PILTAHFDGGA 314
            FY  +   + +A    PY+     + R G  LC+K P + A  A    P +T H  GGA
Sbjct: 304 PFYASVLASLISAAP--PYERSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGA 361

Query: 315 KVPLIHTSTFIPPPVEG----VFCFAMQPIDGD-----------VGIFGNFAQSDLFIGY 359
           ++ L   S++ P         V C   Q ++ +             + G+F   ++ + Y
Sbjct: 362 RLALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVY 421

Query: 360 DFDSQMVSFKPTDCT 374
           D  +  V F+P DC 
Sbjct: 422 DLAAGRVGFRPRDCA 436


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 107/402 (26%), Positives = 175/402 (43%), Gaps = 61/402 (15%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQC--YKQVK------------- 64
           Y++  +IGTPP + I  ++DTGSDL WV C      C++C  Y+  K             
Sbjct: 82  YLISLNIGTPPQV-IQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSS 140

Query: 65  -------PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVLA 116
                  P      SS     +C    C L   V  +  + C ++ Y Y    +  G+L 
Sbjct: 141 YRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILT 200

Query: 117 TERITFGNSN----NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLGA 171
            + +    S+           FGC     G      +G+ G GR  LS+ SQ+   Q G 
Sbjct: 201 RDTLRVNGSSPGVAKEIPKFCFGC----VGSAYREPIGIAGFGRGTLSMVSQLGFLQKG- 255

Query: 172 NKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVG 228
             FS+C + F    + +I+S +  G+ +  S   +  T ++ S     +Y+V LE I+VG
Sbjct: 256 --FSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVG 313

Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDP 286
           N+S ++++        ++  G M ID+G   T LP+ FY+++   +++ I        + 
Sbjct: 314 NVS-ATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEM 372

Query: 287 RLGSQLCYKTP-------SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG----VFCF 335
           + G  LCYK P       +   + P +T HF     + L   + F P    G    V C 
Sbjct: 373 QTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCL 432

Query: 336 AMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             Q  D    G  G+FG+F Q ++ + YD + + + F+P DC
Sbjct: 433 MFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 147/343 (42%), Gaps = 27/343 (7%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
           I+D+GSD+ WVQC PC  + C+ Q  P+++PA+S++Y  + C S  C  L      C + 
Sbjct: 84  IIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLAN 143

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
             C +   YA+ +   G  +++ +T G   +     +FGC H + G  F+ +  G + LG
Sbjct: 144 SQCQFGITYANGATATGTYSSDDLTLG-PYDVVRGFLFGCAHADQGSTFSYDVAGTLALG 202

Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
               S   Q  SQ  +  FSYC+ P    +S    + FG   + +      VST L+S  
Sbjct: 203 GGSQSFVQQTASQY-SRVFSYCVPP---STSSFGFIMFGVPPQRAALVPTFVSTPLLSSS 258

Query: 214 --DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
               T+Y V L  I V       + +P   +   +   +  ID+    + +P   Y  L 
Sbjct: 259 TMSPTFYRVLLRSIIVAG-----RPLPVPPT---VFSASSVIDSATVISRIPPTAYQALR 310

Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVE 330
              R+A+ +     P      CY    +  I  P +   FDGGA V L      +    +
Sbjct: 311 AAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL----Q 366

Query: 331 GVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G   FA    D   G  GN  Q  L + YD   + + F+   C
Sbjct: 367 GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/408 (26%), Positives = 170/408 (41%), Gaps = 60/408 (14%)

Query: 13  VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQV------- 63
           + S   T  G+Y ++F +GTP  P L +    DTGSDL WV+C P               
Sbjct: 84  LTSAAYTGIGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCRPAKAAAASTNSSSSAS 140

Query: 64  ----KPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----QQLCNYTYGYADSSLTKGVL 115
               +  + P  S ++  + C S+ C      S S+       C Y Y Y D S  +G +
Sbjct: 141 ASSPRRAFRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTV 200

Query: 116 ATERITFG-----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQ 164
            TE  T                     +V GC  + TG   E   G++ LG + +S AS 
Sbjct: 201 GTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASH 260

Query: 165 ILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS-------GGGVVSTSLV-SKEDKT 216
             S+ G  +FSYCLV   +  + TS + FG  S +S       G G   T LV     + 
Sbjct: 261 AASRFG-GRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRP 319

Query: 217 YYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR 275
           +Y V+++ ISV G L    + +  +   G    G + +D+G   T+L K  Y  +   + 
Sbjct: 320 FYDVSIKAISVDGELLKIPRDV--WEVDGG---GGVIVDSGTSLTVLAKPAYRAVVAALG 374

Query: 276 NAIKLTPY--QDPRLGSQLCYK--TPSMAGIA---PILTAHFDGGAKVPLIHTSTFIPPP 328
             +   P    DP    + CY   +PS        P L  HF G A++    + +++   
Sbjct: 375 KKLARFPRVAMDP---FEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLE-PPSKSYVIDA 430

Query: 329 VEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             GV C  +Q  P  G + + GN  Q +    +D  ++ + FK + CT
Sbjct: 431 APGVKCIGVQEGPWPG-ISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 105/378 (27%), Positives = 163/378 (43%), Gaps = 54/378 (14%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           +NG Y  +  IGTPP  +   IVDTGS + +V C  C QC K   P + P  SSSYK L 
Sbjct: 76  SNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALK 134

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
           C  + C+  D       +LC Y   YA+ S + GVL+ + I+FGN +       VFGC +
Sbjct: 135 CNPD-CNCDD-----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCEN 188

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
             TG +F++   G++GLGR +LS+  Q++ + +  + FS C                  G
Sbjct: 189 VETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY----------------GG 232

Query: 197 SEVSGGGVV-------STSLVSKED---KTYYFVTLEGISVGNLSNSSKLIP-YYNSSGA 245
            EV GG +V       +  + S  D     YY + L+ + V     S KL P  +N    
Sbjct: 233 MEVGGGAMVLGKISPPAGMVFSHSDPFRSPYYNIDLKQMHVAG--KSLKLNPKVFN---- 286

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTP 297
             K    +D+G      PK+ +  +++ +   I   K     DP     +C+        
Sbjct: 287 -GKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNY-DDVCFSGAGRDVA 344

Query: 298 SMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
            +    P +   F  G K+ L      F    V G +C  + P      + G     +  
Sbjct: 345 EIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTL 404

Query: 357 IGYDFDSQMVSFKPTDCT 374
           + YD ++  + F  T+C+
Sbjct: 405 VTYDRENDKLGFLKTNCS 422


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 111/419 (26%), Positives = 179/419 (42%), Gaps = 66/419 (15%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQCYK---- 61
           +VV   +      Y++  +IGTPP  + +Y  +DTGSDL WV C      C++CY     
Sbjct: 70  DVVMEPLREVRDGYLITLNIGTPPQAVQVY--LDTGSDLTWVPCGNLSFDCIECYDLKNN 127

Query: 62  --QVKPIYNPASSSSYKELSCQSEQC---HLLD-------TVSCSSQQLC---------N 100
             +   +++P  SS+    SC S  C   H  D          CS   L          +
Sbjct: 128 DLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPS 187

Query: 101 YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLS 160
           + Y Y +  L  G+L   R              FGC    T  + E  +G+ G GR  LS
Sbjct: 188 FAYTYGEGGLISGILT--RDILKARTRDVPRFSFGCV---TSTYRE-PIGIAGFGRGLLS 241

Query: 161 LASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGNGS---EVSGGGVVSTSLVSKEDK 215
           L SQ+        FS+C +PF    + +I+S +  G  +    ++     +  L +    
Sbjct: 242 LPSQL--GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYP 299

Query: 216 TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR 275
             Y++ LE I++G     +++        +   G M +D+G   T LP+ FY++L   ++
Sbjct: 300 NSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQ 359

Query: 276 NAIK--LTPYQDPRLGSQLCYKTP-----------SMAGIAPILTAHFDGGAKVPLIHTS 322
           + I        + R G  LCYK P            +  I P +T HF   A + L   +
Sbjct: 360 STITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGN 419

Query: 323 TF--IPPPVEG--VFCFAMQPI-DGD---VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +F  +  P +G  V C   Q + DGD    G+FG+F Q ++ + YD + + + F+  DC
Sbjct: 420 SFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 166/376 (44%), Gaps = 51/376 (13%)

Query: 34  PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQSEQCH----- 86
           P  +I  ++DTGS+L W++C           P+  ++P  SSSY  + C S  C      
Sbjct: 82  PPQNISMVIDTGSELSWLRC----NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 87  LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNE 146
            L   SC S +LC+ T  YAD+S ++G LA E   FGNS N   N++FGC  + +G   E
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN-DSNLIFGCMGSVSGSDPE 196

Query: 147 NE---MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
            +    GL+G+ R  LS     +SQ+G  KFSYC+    TD      +  G+ +      
Sbjct: 197 EDTKTTGLLGMNRGSLSF----ISQMGFPKFSYCIS--GTD-DFPGFLLLGDSNFTWLTP 249

Query: 204 VVSTSLVSKE------DKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           +  T L+         D+  Y V L GI V G L    K +   + +GA   G   +D+G
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA---GQTMVDSG 306

Query: 257 APPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM---AGI---AP 304
              T L    Y  L     N     LT Y+DP    Q    LCY+   +   +GI    P
Sbjct: 307 TQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLP 366

Query: 305 ILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDLFI 357
            ++  F+G        PL++    +    + V+CF     D    +  + G+  Q +++I
Sbjct: 367 TVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 426

Query: 358 GYDFDSQMVSFKPTDC 373
            +D     +   P +C
Sbjct: 427 EFDLQRSRIGLAPVEC 442


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 98/346 (28%), Positives = 144/346 (41%), Gaps = 36/346 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
           ++DT SD+ WVQC PC    CY Q   +Y+P  SSS    SC S  C  L   +  C++ 
Sbjct: 147 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 206

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN--ENEMGLVGL 154
             C Y   Y D + T G   ++ +T   +     +  FGC H   G F+   +  G++ L
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPATA-VRSFQFGCSHGVQGSFSFGSSAAGIMAL 265

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG-SEVSGGGVVSTSLVSKE 213
           G    SL SQ  +  G   FS+C  P       T + +F  G   V+    V T ++   
Sbjct: 266 GGGPESLVSQTAATYG-RVFSHCFPP------PTRRGFFTLGVPRVAAWRYVLTPMLKNP 318

Query: 214 --DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
               T+Y V LE I+V        + P   ++GA       +D+    T LP   Y  L 
Sbjct: 319 AIPPTFYMVRLEAIAVAG--QRIAVPPTVFAAGAA------LDSRTAITRLPPTAYQALR 370

Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPP 327
           +  R+ + +     P+     CY    MAG+     P +T  FD  A V L  +      
Sbjct: 371 QAFRDRMAMYQPAPPKGPLDTCYD---MAGVRSFALPRITLVFDKNAAVELDPSGVLF-- 425

Query: 328 PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +G   F   P D   GI GN     L + Y+  + +V F+   C
Sbjct: 426 --QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 165/379 (43%), Gaps = 47/379 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI----YNPASSSSYKE 77
           G Y  K  +GTP   D +  VDTGSD++WV C  C++C ++   +    Y+  +SS+ K 
Sbjct: 83  GLYFAKIGLGTPSR-DFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKS 141

Query: 78  LSCQSEQCHLLDTVS-CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN- 131
           +SC    C  ++  S C S   C Y   Y D S T G L  + +      GN      N 
Sbjct: 142 VSCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNG 201

Query: 132 -VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSS 186
            ++FGCG   +G   E++    G++G G++  S  SQ+ SQ    + F++CL       +
Sbjct: 202 TIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL------DN 255

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSG 244
                 F  G EV    V +T ++SK    +Y V L  I VGN  L  SS      +  G
Sbjct: 256 NNGGGIFAIG-EVVSPKVKTTPMLSKS--AHYSVNLNAIEVGNSVLELSSNAFDSGDDKG 312

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYKTPSMAG 301
            I      ID+G     LP   YN L  ++  +   + L   Q+    S  C+       
Sbjct: 313 VI------IDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQE----SFTCFHYTDKLD 362

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ------PIDGDVGIFGNFAQSDL 355
             P +T  FD    +  ++   ++    E  +CF  Q           + I G+ A S+ 
Sbjct: 363 RFPTVTFQFDKSVSLA-VYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421

Query: 356 FIGYDFDSQMVSFKPTDCT 374
            + YD ++Q++ +   +C+
Sbjct: 422 LVVYDIENQVIGWTNHNCS 440


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 98/346 (28%), Positives = 144/346 (41%), Gaps = 36/346 (10%)

Query: 41  IVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
           ++DT SD+ WVQC PC    CY Q   +Y+P  SSS    SC S  C  L   +  C++ 
Sbjct: 172 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 231

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN--ENEMGLVGL 154
             C Y   Y D + T G   ++ +T   +     +  FGC H   G F+   +  G++ L
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPATA-VRSFQFGCSHGVQGSFSFGSSAAGIMAL 290

Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG-SEVSGGGVVSTSLVSKE 213
           G    SL SQ  +  G   FS+C  P       T + +F  G   V+    V T ++   
Sbjct: 291 GGGPESLVSQTAATYG-RVFSHCFPP------PTRRGFFTLGVPRVAAWRYVLTPMLKNP 343

Query: 214 --DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
               T+Y V LE I+V        + P   ++GA       +D+    T LP   Y  L 
Sbjct: 344 AIPPTFYMVRLEAIAVAG--QRIAVPPTVFAAGAA------LDSRTAITRLPPTAYQALR 395

Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPP 327
           +  R+ + +     P+     CY    MAG+     P +T  FD  A V L  +      
Sbjct: 396 QAFRDRMAMYQPAPPKGPLDTCYD---MAGVRSFALPRITLVFDKNAAVELDPSGVLF-- 450

Query: 328 PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +G   F   P D   GI GN     L + Y+  + +V F+   C
Sbjct: 451 --QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 92/316 (29%), Positives = 146/316 (46%), Gaps = 48/316 (15%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKEL 78
           ++ G YV  F+IGTPP   +  +VD   +L+W QC PC  C++Q  P+++P  SS+++ L
Sbjct: 52  SSQGLYVANFTIGTPPQ-PVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110

Query: 79  SCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            C S  C  +   S  C+S  +C Y      +  T G   T+    G +    + + FGC
Sbjct: 111 PCGSHLCESIPESSRNCTS-DVCIYE-APTKAGDTGGKAGTDTFAIGAAK---ETLGFGC 165

Query: 137 GHNNTGVFNENEM-------GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
                 V  +  +       G+VGLGRT  SL    ++Q+    FSYCL         + 
Sbjct: 166 -----VVMTDKRLKTIGGPSGIVGLGRTPWSL----VTQMNVTAFSYCLA-----GKSSG 211

Query: 190 KMYFGNGSEVSGGG-------VVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYY 240
            ++ G  ++   GG       V+ TS  S ++ +  YY V L GI  G        +   
Sbjct: 212 ALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAP-----LQAA 266

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           +SSG+     + +DT +  + L    Y  L++ +  A+ + P   P     LC+   ++A
Sbjct: 267 SSSGS----TVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPK-AVA 321

Query: 301 GIAPILTAHFDGGAKV 316
           G AP L   FDGGA +
Sbjct: 322 GDAPELVFTFDGGAAL 337


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 171/379 (45%), Gaps = 44/379 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK------PIYNPASSSSY 75
           G Y  K  +GTPP+   Y  VDTGSD+ W+ C PC  C  + +        Y+P+ SS+ 
Sbjct: 35  GLYYTKIYLGTPPV-GYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93

Query: 76  KELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD-- 130
             LSC+   C      + VSC+S   C Y+  Y D S T+G    + +TF   +N     
Sbjct: 94  GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153

Query: 131 ---NVVFGCGHNNTG--VFNENEM-GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFH 182
              +V FGCG   +G  + +   + GL+G G+  +S+ SQ L+ +G   N+F++CL    
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQ-LASMGKVGNRFAHCL---Q 209

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            D+     +  G+ SE +   +  T +VS+    +Y V ++ I+V   + ++   P    
Sbjct: 210 GDNQGGGTIVIGSVSEPN---ISYTPIVSRN---HYAVGMQNIAVNGRNVTT---PASFD 260

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
           + + S G + +D+G     L    Y +    V +  + + +       QL +   S+   
Sbjct: 261 TTSTSAGGVIMDSGTTLAYLVDPAYTQFVNAV-STFESSMFSSHSQCLQLAW--CSLQAD 317

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDGDVG-----IFGNFAQSD 354
            P +   FD GA + L   +     P+   +  +C   Q      G     I G+    D
Sbjct: 318 FPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKD 377

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             + YD D+++V +K  DC
Sbjct: 378 HLVVYDNDNRVVGWKSFDC 396


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 162/378 (42%), Gaps = 54/378 (14%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           +NG Y  +  IGTPP  +   IVDTGS + +V C  C QC K   P + P  S+SY+ L 
Sbjct: 72  SNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALK 130

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
           C  + C+  D       +LC Y   YA+ S + GVL+ + I+FGN +       VFGC +
Sbjct: 131 CNPD-CNCDD-----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
             TG +F++   G++GLGR +LS+  Q++ + +  + FS C                  G
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY----------------GG 228

Query: 197 SEVSGGGVVSTSL-------VSKED---KTYYFVTLEGISVGNLSNSSKLIP-YYNSSGA 245
            EV GG +V   +        S  D     YY + L+ + V     S KL P  +N    
Sbjct: 229 MEVGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAG--KSLKLNPKVFN---- 282

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTP 297
             K    +D+G      PK+ +  +++ V   I   K     DP     +C+        
Sbjct: 283 -GKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNY-DDVCFSGAGRDVA 340

Query: 298 SMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
            +    P +   F  G K+ L      F    V G +C  + P      + G     +  
Sbjct: 341 EIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTL 400

Query: 357 IGYDFDSQMVSFKPTDCT 374
           + YD ++  + F  T+C+
Sbjct: 401 VTYDRENDKLGFLKTNCS 418


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 171/393 (43%), Gaps = 50/393 (12%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP-----IY 67
           + S   T  G+Y ++F +GTP    +  + DTGSDL WV+C           P     ++
Sbjct: 99  LTSGAYTGTGQYFVQFRVGTPAQPFVL-VADTGSDLTWVKCRGRRASSPDASPLASPRVF 157

Query: 68  NPASSSSYKELSCQSEQCHL---LDTVSCSSQQL----CNYTYGYADSSLTKGVLATERI 120
            PA+S S+  + C S+ C         +CS+       C Y Y Y D S  +GV+ T+  
Sbjct: 158 RPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAA 217

Query: 121 TFGNSNNFFDN------VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           T   S +  D       VV GC  +  G   ++  G++ LG + +S AS+  ++ G  +F
Sbjct: 218 TIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GRF 276

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGN--L 230
           SYCLV      + TS + FG    V      S +  L+  +   +Y VT++ +SV    L
Sbjct: 277 SYCLVDHLAPRNATSYLTFG---PVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKAL 333

Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRL 288
           +  +++     + GAI      +D+G   T+L    Y  +   +   +   P    DP  
Sbjct: 334 NIPAEVWDVKKNGGAI------LDSGTSLTILATPAYKAVVAALSKQLARVPRVTMDP-- 385

Query: 289 GSQLCY------KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-ID 341
             + CY      + P++    P L   F G A++    T +++     GV C  +Q  + 
Sbjct: 386 -FEYCYNWTATRRPPAV----PRLEVRFAGSARL-RPPTKSYVIDAAPGVKCIGLQEGVW 439

Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             V + GN  Q +    +D  ++ + F+ + C 
Sbjct: 440 PGVSVIGNILQQEHLWEFDLANRWLRFQESRCA 472


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 100/336 (29%), Positives = 157/336 (46%), Gaps = 44/336 (13%)

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL-------CNYTYGYADSS----LTKG 113
           P+  P SSSS   ++C    C  L    CS+          C+Y Y Y ++      T+G
Sbjct: 13  PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72

Query: 114 VLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
           +L TE  TFG+    F  + FGC   + G F     GLVGLGR +LSL    ++QL    
Sbjct: 73  ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEA 127

Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISV 227
           F Y L    +D S  S + FG+ ++V+GG     +ST L++    +D  +Y+V L GISV
Sbjct: 128 FGYRL---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISV 184

Query: 228 GN--LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD 285
           G   +   S    +  S+GA   G +  D+G   T+LP   Y  + +++ + +    +Q 
Sbjct: 185 GGKLVQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQK 238

Query: 286 PRLGSQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAM 337
           P   +     +C+   S     P +  HFDGGA + L  T  ++P       E   C+++
Sbjct: 239 PPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSV 297

Query: 338 QPIDGDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
                 + I GN  Q D  + +D   +++M+   PT
Sbjct: 298 VKSSQALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 333


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 167/378 (44%), Gaps = 40/378 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYK 76
           G Y  +  +GTPP  D Y  +DTGSD++WV C  C  C        P+  ++P SS +  
Sbjct: 50  GLYYTRLQLGTPPR-DFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTAS 108

Query: 77  ELSCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
            +SC  ++C L     D+V  +   LC Y + Y D S T G   ++ + F    G S  N
Sbjct: 109 LISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMN 168

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N    +VFGC    TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL    
Sbjct: 169 NSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCL---K 225

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            D S    +  G   E+    +V T LV  +   +Y + ++ ISV    N   L    + 
Sbjct: 226 GDDSGGGILVLG---EIVEPNIVYTPLVPSQ--PHYNLNMQSISV----NGQTLAIDPSV 276

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL--GSQLCYKTPSMA 300
            G  S     ID+G     L +  Y+     + + +  +P   P L  G+     + S+ 
Sbjct: 277 FGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIV--SPSVRPYLSKGNHCYLISSSIN 334

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDLF 356
            I P ++ +F GGA + LI     I     G   ++C   Q I G  + I G+    D  
Sbjct: 335 DIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKI 394

Query: 357 IGYDFDSQMVSFKPTDCT 374
             YD  +Q + +   DC+
Sbjct: 395 FVYDIANQRIGWANYDCS 412


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 162/378 (42%), Gaps = 54/378 (14%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           +NG Y  +  IGTPP  +   IVDTGS + +V C  C QC K   P + P  S+SY+ L 
Sbjct: 72  SNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALK 130

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
           C  + C+  D       +LC Y   YA+ S + GVL+ + I+FGN +       VFGC +
Sbjct: 131 CNPD-CNCDD-----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
             TG +F++   G++GLGR +LS+  Q++ + +  + FS C                  G
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY----------------GG 228

Query: 197 SEVSGGGVVSTSL-------VSKED---KTYYFVTLEGISVGNLSNSSKLIP-YYNSSGA 245
            EV GG +V   +        S  D     YY + L+ + V     S KL P  +N    
Sbjct: 229 MEVGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAG--KSLKLNPKVFN---- 282

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTP 297
             K    +D+G      PK+ +  +++ V   I   K     DP     +C+        
Sbjct: 283 -GKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNY-DDVCFSGAGRDVA 340

Query: 298 SMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
            +    P +   F  G K+ L      F    V G +C  + P      + G     +  
Sbjct: 341 EIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTL 400

Query: 357 IGYDFDSQMVSFKPTDCT 374
           + YD ++  + F  T+C+
Sbjct: 401 VTYDRENDKLGFLKTNCS 418


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 163/376 (43%), Gaps = 51/376 (13%)

Query: 34  PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQSEQCH----- 86
           P  +I  ++DTGS+L W++C           P+  ++P  SSSY  + C S  C      
Sbjct: 82  PPQNISMVIDTGSELSWLRC----NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 87  LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNE 146
            L   SC S +LC+ T  YAD+S ++G LA E   FGNS N   N++FGC  + +G   E
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN-DSNLIFGCMGSVSGSDPE 196

Query: 147 NE---MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
            +    GL+G+ R  LS     +SQ+G  KFSYC+    TD      +  G+ +      
Sbjct: 197 EDTKTTGLLGMNRGSLSF----ISQMGFPKFSYCIS--GTD-DFPGFLLLGDSNFTWLTP 249

Query: 204 VVSTSLVSKE------DKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           +  T L+         D+  Y V L GI V G L    K +   + +GA   G   +D+G
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGA---GQTMVDSG 306

Query: 257 APPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM---AGI---AP 304
              T L    Y  L     N     LT Y+DP    Q    LCY+        GI    P
Sbjct: 307 TQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLP 366

Query: 305 ILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDLFI 357
            ++  F+G        PL++    +    + V+CF     D    +  + G+  Q +++I
Sbjct: 367 TVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 426

Query: 358 GYDFDSQMVSFKPTDC 373
            +D     +   P  C
Sbjct: 427 EFDLQRSRIGLAPVQC 442


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/155 (45%), Positives = 90/155 (58%), Gaps = 8/155 (5%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQS 82
           Y++   IGTP   DI  + DTGSDL W QC PC+  CY Q +P +NP+SSSSY  +SC S
Sbjct: 134 YIVTIGIGTPKH-DISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192

Query: 83  EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
             C   +  SCS+   C Y  GY D S+T G LA E+ T  NS +  D++ FGCG NN G
Sbjct: 193 PMCG--NPESCSASN-CLYGIGYGDGSVTVGFLAKEKFTLTNS-DVLDDIYFGCGENNKG 248

Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           VF     G++GLG  + S   Q  +    N FSYC
Sbjct: 249 VF-IGSAGILGLGPGKFSFPLQTTTTYN-NIFSYC 281


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 170/383 (44%), Gaps = 42/383 (10%)

Query: 8   YPNNVVQSNVSTANGEYVM-KFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
           Y N+   S   +  G  ++   SIG P +  +  ++DTGSD++W+ C PC  C   +  +
Sbjct: 84  YNNDYTASVSPSLTGRTILVNLSIGQPSIPQLV-VMDTGSDILWIMCNPCTNCDNHLGLL 142

Query: 67  YNPASSSSYKELS---CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
           ++P+ SS++  L    C  + C   D +         +T  Y D+S   G    + + F 
Sbjct: 143 FDPSMSSTFSPLCKTPCGFKGCK-CDPIP--------FTISYVDNSSASGTFGRDILVFE 193

Query: 124 NSN---NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
            ++   +   +V+ GCGHN     +    G++GL     SLA+QI       KFSYC+  
Sbjct: 194 TTDEGTSQISDVIIGCGHNIGFNSDPGYNGILGLNNGPNSLATQI-----GRKFSYCIGN 248

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                   +++  G G+++ G      S   +    +Y+VT+EGISVG       L  + 
Sbjct: 249 LADPYYNYNQLRLGEGADLEG-----YSTPFEVYHGFYYVTMEGISVGEKRLDIALETFE 303

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYK--- 295
                   G + +D+G   T L    +  L  +VRN +K +  Q        +LCY    
Sbjct: 304 MKRNG--TGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGII 361

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP---IDGDV--GIFGNF 350
           +  + G  P++T HF  GA + L   S F     + +FC  + P   ++  +   + G  
Sbjct: 362 SRDLVGF-PVVTFHFVDGADLALDTGSFF--SQRDDIFCMTVSPASILNTTISPSVIGLL 418

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
           AQ    +GYD  +Q V F+  DC
Sbjct: 419 AQQSYNVGYDLVNQFVYFQRIDC 441


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 170/379 (44%), Gaps = 49/379 (12%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-YNPASSSSYKELSCQSE 83
           ++   IGTPP      ++DTGS L W+QC       K      ++P+ SSS+  L C   
Sbjct: 81  IVSLPIGTPPQTQQM-VLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHP 139

Query: 84  QCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
            C           +C   +LC+Y+Y YAD +  +G L  E+ITF +S +    ++ GC  
Sbjct: 140 LCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQS-TPPLILGCAE 198

Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
            +T     +E G++G+   R S ASQ       +KFSYC+      + ++S   F  G+ 
Sbjct: 199 AST-----DEKGILGMNLGRRSFASQA----KISKFSYCVPTRQARAGLSSTGSFYLGNN 249

Query: 199 VSGGGVVSTSLVS--------KEDKTYYFVTLEGISVGNLS-NSSKLIPYYNSSGAISKG 249
            + G     +L++          D   Y + ++GI +GN   N S  +   + SGA   G
Sbjct: 250 PNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGA---G 306

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSM-- 299
              ID+G+  T L  + YN++ E+V   ++L     P+L         S +C+    M  
Sbjct: 307 QTIIDSGSEFTYLVDEAYNKVREEV---VRLV---GPKLKKGYVYGGVSDMCFDGNPMEI 360

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLF 356
             +   +   F+ G ++ +I     +     GV C  +   + +     I GNF Q +L+
Sbjct: 361 GRLIGNMVFEFEKGVEI-VIDKWRVLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLW 419

Query: 357 IGYDFDSQMVSFKPTDCTK 375
           + YD  ++ +     DC++
Sbjct: 420 VEYDLANRRIGLGKADCSR 438


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 161/373 (43%), Gaps = 43/373 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C QC +   P ++P SSS+YK + C
Sbjct: 80  NGYYTTRLWIGTPPQ-QFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC 138

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D +  S    C Y   YA+ S + GVL  + I+FGN +       VFGC + 
Sbjct: 139 N------IDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENM 192

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F++   G++GLG   LSL  Q++ +   N  FS C             M  G G+
Sbjct: 193 ETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY----------GGMDIGGGA 242

Query: 198 EVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-KGNMF 252
            V GG    + ++          YY V L+ I V     + K +P   SSG    +    
Sbjct: 243 MVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHV-----AGKKLPL--SSGIFDGRYGAV 295

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
           +D+G     LP + ++  ++ + + I   K     DP     +C+         ++   P
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNF-KDICFSGAGSDAAELSNKFP 354

Query: 305 ILTAHFDGGAKVPLIHTSTFIP-PPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
            +   F+ G K+ L   + F     V G +C  +     D   + G     +  + YD  
Sbjct: 355 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRA 414

Query: 363 SQMVSFKPTDCTK 375
           +  + F  T+C++
Sbjct: 415 NSKIGFWKTNCSE 427


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 99/178 (55%), Gaps = 16/178 (8%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           N+ T N  Y++   +G+    ++  I+DT SDL WVQC PC+ CY Q  PI+ P++SSSY
Sbjct: 59  NLQTLN--YIVTMGLGSK---NMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSY 113

Query: 76  KELSCQSEQCHLL-----DTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
           + +SC S  C  L     +T +C S     CNY   Y D S T G L  E ++FG  +  
Sbjct: 114 QSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVS-- 171

Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
             + VFGCG NN G+F     GL+GLGR+ LSL SQ  +  G   FSYCL      SS
Sbjct: 172 VSDFVFGCGRNNKGLFG-GVSGLMGLGRSYLSLVSQTNATFGG-VFSYCLPTTEAGSS 227


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 161/373 (43%), Gaps = 43/373 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C QC +   P ++P SSS+YK + C
Sbjct: 80  NGYYTTRLWIGTPPQ-QFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC 138

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D +  S    C Y   YA+ S + GVL  + I+FGN +       VFGC + 
Sbjct: 139 N------IDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENM 192

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F++   G++GLG   LSL  Q++ +   N  FS C             M  G G+
Sbjct: 193 ETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY----------GGMDIGGGA 242

Query: 198 EVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-KGNMF 252
            V GG    + ++          YY V L+ I V     + K +P   SSG    +    
Sbjct: 243 MVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHV-----AGKKLPL--SSGIFDGRYGAV 295

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
           +D+G     LP + ++  ++ + + I   K     DP     +C+         ++   P
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNF-KDICFSGAGSDAAELSNKFP 354

Query: 305 ILTAHFDGGAKVPLIHTSTFIP-PPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
            +   F+ G K+ L   + F     V G +C  +     D   + G     +  + YD  
Sbjct: 355 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRA 414

Query: 363 SQMVSFKPTDCTK 375
           +  + F  T+C++
Sbjct: 415 NSKIGFWKTNCSE 427


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 171/391 (43%), Gaps = 55/391 (14%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LP-----CVQC-YKQVKP----IYNPA 70
           G Y + FS+GTPP   +  ++DTGS L+W  C +P     C  C +  V P    IY   
Sbjct: 72  GGYSVIFSLGTPPQ-KVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARN 130

Query: 71  SSSSYKELSCQSEQCHLL--DTVSCSSQQLCNY---TYGYADSSLTKGVLATERITFGNS 125
            SS+ + L C+S +C+ +    ++CS+ + C Y    YG      T G L ++ +     
Sbjct: 131 KSSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS---TTGQLVSDVLGLSKL 187

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
           N   D  +FGC    + V N    G+ G GR    LAS I +QLG  KFSYCLV    D 
Sbjct: 188 NRIPD-FLFGC----SLVSNRQPEGIAGFGR---GLAS-IPAQLGLTKFSYCLVSHRFDD 238

Query: 186 SITSK---MYFGNGSEVSGGGVVSTSLVSKED-----KTYYFVTLEGISVGNLSNSSKLI 237
           +  S    ++ G     +    V+ +  +K         YY+++L  I VG       + 
Sbjct: 239 TPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGG--KDVPIP 296

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL-----GSQL 292
           P Y        G M +D+G+  T + +  ++ +  ++     +T Y+  +      G   
Sbjct: 297 PRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEK--HMTKYKRAKEIEDSSGLGP 354

Query: 293 CYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QP--IDGDV 344
           CY     + +  P LT  F GGA + L  T  F     +GV C  +     +P    G  
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGANMDLPLTDYF-SLVTDGVVCMTVLTDPDEPGSTTGPA 413

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            I GN+ Q + +I YD   Q   FKP  C +
Sbjct: 414 IILGNYQQQNFYIEYDLKKQRFGFKPQQCDR 444


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 160/362 (44%), Gaps = 41/362 (11%)

Query: 39  YGIVDTGSDLMWVQCLPCVQ----CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS 94
           Y  +DTG++L W+QC  C      C+    P Y  + S SYK +SC   Q    +   C 
Sbjct: 102 YFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSC--NQHSFCEPNQC- 158

Query: 95  SQQLCNYTYGYADSSLTKGVLATERITF---GNSNNFFDNVVFGCGHNNTG-----VFNE 146
            + LC Y   Y   S T G LA E  TF      +    ++ FGC  ++       + ++
Sbjct: 159 KEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDK 218

Query: 147 NEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
           N + G++G+G    S  +Q L  +   KFSYC+   +T ++    + FG    V    + 
Sbjct: 219 NPVSGVLGMGWGPRSFLAQ-LGSISHGKFSYCITANNTHNTY---LRFGK-HVVKSKNLQ 273

Query: 206 STSLVSKEDKTYYFVTLEGISVG----NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
           +T ++  +    Y V L GISV     N++ +   +    S G I      ID G   TL
Sbjct: 274 TTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCI------IDAGTLATL 327

Query: 262 LPKDFYNRLEEQVRNAI----KLTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGAK 315
           L K  ++ L   + N +     L  +   +L   LCY+  S AG    P++T H +  A 
Sbjct: 328 LVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLE-NAD 386

Query: 316 VPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           + +   + F+    EG  VFC +M   D    I G + Q      YD  ++++SF P DC
Sbjct: 387 LEVKPEAIFLFREFEGKNVFCLSMLSDDSKT-IIGAYQQMKQKFVYDTKARVLSFGPEDC 445

Query: 374 TK 375
            K
Sbjct: 446 EK 447


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/381 (28%), Positives = 162/381 (42%), Gaps = 43/381 (11%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N+   +N+   +G +++  + GTPP      I+DTGS + W QC  CV C K     ++ 
Sbjct: 113 NHAHNNNLFDEDGNFLVDVAFGTPPQ-KFKLILDTGSSITWTQCKACVHCLKDSHRHFDS 171

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
            +SS+Y   SC      +  TV  +      Y   Y D S + G    + +T   S + F
Sbjct: 172 LASSTYSFGSC------IPSTVGNT------YNMTYGDKSTSVGNYGCDTMTLEPS-DVF 218

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
               FGCG NN G F     G++GLG+ +LS  SQ  S+     FSYCL     ++SI S
Sbjct: 219 QKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKF-KKVFSYCLPE---ENSIGS 274

Query: 190 KMYFGNGSEVSGGGVVSTSLVSK------EDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            + FG  +      +  TSLV+       E+  YYFV L  ISVGN   +   IP    S
Sbjct: 275 -LLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLN---IP----S 326

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSM 299
              +     ID+G   T LP+  Y+ L+   + A+   P  + R         CY     
Sbjct: 327 SVFASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGR 386

Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDGDVGIFGNFAQS 353
             +  P    HF  GA V L +    +        C A        ++ ++ I GN  Q 
Sbjct: 387 KDVLLPEXVLHFGDGADVRL-NGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQV 445

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
            L + YD   + + F    C+
Sbjct: 446 SLTVLYDIRGRRIGFGGNGCS 466


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 105/410 (25%), Positives = 166/410 (40%), Gaps = 61/410 (14%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-------------- 58
           + S   T  G+Y ++F +GTP    +  I DTGSDL WV+C                   
Sbjct: 99  LSSGAYTGTGQYFVRFRVGTPAQPFVL-IADTGSDLTWVKCRGAASPSHATATASPAAAP 157

Query: 59  -CYKQVKPIYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKG 113
                   ++ P  S ++  + C SE C         +CSS    C+Y Y Y D+S  +G
Sbjct: 158 SPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARG 217

Query: 114 VLATERITFG-----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLA 162
           V+ T+  T             +       VV GC   + G   E   G++ LG + +S A
Sbjct: 218 VVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFA 277

Query: 163 SQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG-----GVVSTSLVSKEDKTY 217
           S+  S+ G  +FSYCLV      + TS + FG G + +       G  +  L+    + +
Sbjct: 278 SRAASRFG-GRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPF 336

Query: 218 YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQ 273
           Y V ++ +SV  ++     IP        S G   ID+G   T+L    Y      L EQ
Sbjct: 337 YAVAVDSVSVDGVALD---IP-AEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQ 392

Query: 274 VRNAIKLTPYQDPRLGSQLCYKTPSMAG-----IAPILTAHFDGGAKVPLIHTSTFIPPP 328
           +    ++    DP      CY   +          P L   F G A++     S ++   
Sbjct: 393 LAGLPRVA--MDP---FDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKS-YVIDA 446

Query: 329 VEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
             GV C  +Q  +G    V + GN  Q +    +D +++ + F+ T CT+
Sbjct: 447 APGVKCIGVQ--EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCTQ 494


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/347 (29%), Positives = 155/347 (44%), Gaps = 35/347 (10%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS--- 74
           +T  G YV+ FS+GTPP + + G++D  SD +W+QC  C  C         PA++S+   
Sbjct: 91  ATNTGMYVLSFSVGTPPQV-VTGVLDITSDFVWMQCSACATCGADA-----PAATSAPPF 144

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
           Y  LS    +     T  C      +Y YG   ++ T G+LA +   F       D V+F
Sbjct: 145 YAFLSFHDTRAPT--TPPCG----YSYVYGGGAANTTAGLLAVDAFAFATVRA--DGVIF 196

Query: 135 GCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           GC      V  E ++ G++GLGR  LS  SQ+  Q+G  +FSY L P      + S + F
Sbjct: 197 GC-----AVATEGDIGGVIGLGRGELSPVSQL--QIG--RFSYYLAP-DDAVDVGSFILF 246

Query: 194 GNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAISKGNM 251
            + ++      VST LV S+  ++ Y+V L GI V         IP       A   G +
Sbjct: 247 LDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRV---DGEDLAIPRGTFDLQADGSGGV 303

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPILTAHF 310
            +    P T L    Y  + + + + I+L       LG  LCY + S+A    P +   F
Sbjct: 304 VLSITIPVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVF 363

Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLF 356
            GGA + L   + F      G+ C  + P   GD  + G+  Q  L 
Sbjct: 364 AGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVSLL 410


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/407 (26%), Positives = 164/407 (40%), Gaps = 65/407 (15%)

Query: 23  EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLP--CVQCYKQV---------------- 63
           +Y +  S+G P       + +DTGSDL+W  C P  C+ C  +                 
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146

Query: 64  ------KPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVL 115
                  P+ + A SS+     C + +C L  ++T SC+S       Y Y D SL    L
Sbjct: 147 RRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-L 205

Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
              R+    S    +N  F C H          +G+ G GR  LSL +Q+   L + +FS
Sbjct: 206 RRGRVGLAASMAV-ENFTFACAHTALA----EPVGVAGFGRGPLSLPAQLAPSL-SGRFS 259

Query: 176 YCLVP--FHTDSSI-TSKMYFGNGSEVSGGGVVSTSLV------SKEDKTYYFVTLEGIS 226
           YCLV   F  D  I +S +  G  ++ +  G   T  V      + +   +Y V LE +S
Sbjct: 260 YCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVS 319

Query: 227 VGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--- 283
           VG     ++  P          G M +D+G   T+LP D + R+ ++   A+    +   
Sbjct: 320 VGGKRIQAQ--PELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRA 377

Query: 284 --QDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAM 337
              + + G   CY  +PS   + P+   HF G A V L   + F+    E    V C  +
Sbjct: 378 EGAEAQTGLAPCYHYSPSDRAVPPV-ALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436

Query: 338 QPIDGD----------VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             + G+           G  GNF Q    + YD D+  V F    CT
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/395 (25%), Positives = 168/395 (42%), Gaps = 46/395 (11%)

Query: 1   MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQC 59
           + PA     ++ V  + S    +Y M  S+GTPP+ ++  I DTGS L WVQC  C ++C
Sbjct: 2   IQPANIPADSSTVIGDDSMRKNKYFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKC 60

Query: 60  YKQVKP---IYNPASSSSYKELSCQSEQC---HLLDTVS---CSSQQLCNYTYGYADSSL 110
           Y Q      I+NP +SS+Y ++ C +E C   H+   V          C Y+  Y     
Sbjct: 61  YDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY 120

Query: 111 TKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
           + G L  +R+T   SN   DN +FGCG +N  ++N    G++G G    S  +Q+  Q  
Sbjct: 121 SVGYLGKDRLTLA-SNRSIDNFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTD 177

Query: 171 ANKFSYCLVPFH-TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
              FSYC    H  + S+T       G       ++ T L+  + K  Y +    + V  
Sbjct: 178 YTAFSYCFPRDHENEGSLTI------GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNG 231

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           +    ++ PY      ISK  + +D+G   T +    ++ L++ +   ++   Y      
Sbjct: 232 I--RLEIDPYI----YISKMTI-VDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDE 284

Query: 290 SQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVF--------CFAMQPID 341
            ++C+ + S        +A+++    V +    + +  PVE  F        C    P D
Sbjct: 285 RRICFISNSG-------SANWNDFPTVEMKLIRSTLKLPVENAFYESSNNVICSTFLPDD 337

Query: 342 GDVG---IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             V    + GN A     + +D  +    FK   C
Sbjct: 338 AGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/407 (26%), Positives = 164/407 (40%), Gaps = 65/407 (15%)

Query: 23  EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLP--CVQCYKQV---------------- 63
           +Y +  S+G P       + +DTGSDL+W  C P  C+ C  +                 
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146

Query: 64  ------KPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVL 115
                  P+ + A SS+     C + +C L  ++T SC+S       Y Y D SL    L
Sbjct: 147 RRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-L 205

Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
              R+    S    +N  F C H          +G+ G GR  LSL +Q+   L + +FS
Sbjct: 206 RRGRVGLAASMAV-ENFTFACAHTALA----EPVGVAGFGRGPLSLPAQLAPSL-SGRFS 259

Query: 176 YCLVP--FHTDSSI-TSKMYFGNGSEVSGGGVVSTSLV------SKEDKTYYFVTLEGIS 226
           YCLV   F  D  I +S +  G  ++ +  G   T  V      + +   +Y V LE +S
Sbjct: 260 YCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVS 319

Query: 227 VGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--- 283
           VG     ++  P          G M +D+G   T+LP D + R+ ++   A+    +   
Sbjct: 320 VGGKRIQAQ--PELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRA 377

Query: 284 --QDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAM 337
              + + G   CY  +PS   + P+   HF G A V L   + F+    E    V C  +
Sbjct: 378 EGAEAQTGLAPCYHYSPSDRAVPPV-ALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436

Query: 338 QPIDGD----------VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             + G+           G  GNF Q    + YD D+  V F    CT
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 169/390 (43%), Gaps = 58/390 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           + M+  IG+    ++  I+DTGS+ + VQC        + +P+++PA+S SY+++ C S+
Sbjct: 100 FSMQLGIGSLQK-NLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQ 152

Query: 84  QCHLLD--TVSCSSQ------QLCNYTYGYADSSLTKGVLATERITFGNSNNF------F 129
            C  +   T + SSQ        C Y+  Y DS  + G  + + + F NS N       F
Sbjct: 153 LCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQD-VIFLNSTNSSGQAVQF 211

Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
            +V FGC H+  G + +   +G+VG  R  LSL SQ+  +LG +KFSYC          T
Sbjct: 212 RDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRAT 271

Query: 189 SKMYFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNLS-----NSSKLIPY 239
             ++ G+ S +S   V  T L    V+      Y+V L  ISV   +     ++ KL P 
Sbjct: 272 GVIFLGD-SGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 330

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR------LGSQLC 293
               G +      +D+G   T +  D Y       RNA   +     R       G   C
Sbjct: 331 TGDGGTV------LDSGTTFTRVVDDAYTAF----RNAFAASNRSGLRKKVGAAAGFDDC 380

Query: 294 YKTPSMAGI--APILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPID----GDV 344
           Y   + + +   P +        ++ L     F+P    G     C A+        G +
Sbjct: 381 YNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKI 440

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            + GN+ QS+  + YD +   V F+  DC+
Sbjct: 441 NVLGNYQQSNYLVEYDNERSRVGFERADCS 470


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 167/380 (43%), Gaps = 44/380 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP  D Y  VDTGSD++WV C  C  C +    Q++   ++P SS +  
Sbjct: 79  GLYYTKLRLGTPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAS 137

Query: 77  ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
            +SC  ++C      S   CS Q  LC YT+ Y D S T G   ++ + F    G+S   
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N    VVFGC  + TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL    
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---- 253

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
                  K   G G  +  G +V  ++V       + +Y V L  ISV     + + +P 
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301

Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
             S  + S G    IDTG     L +  Y    E + NA+  +       G+Q    T S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTS 361

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSD 354
           +  I P ++ +F GGA + L      I     G   V+C   Q I    + I G+    D
Sbjct: 362 VGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 421

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
               YD   Q + +   DC+
Sbjct: 422 KIFVYDLVGQRIGWANYDCS 441


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 116/412 (28%), Positives = 173/412 (41%), Gaps = 76/412 (18%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASSSSYKE--- 77
           +Y + F++G P    I   +DTGSDL+W  C P  C+ C  + K   +P+  ++      
Sbjct: 74  DYTLSFNLG-PHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTP 132

Query: 78  LSCQSEQCHL--------------------LDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
           +SC S  C +                    ++T  C S     + Y Y D SL   + + 
Sbjct: 133 ISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSL---IASL 189

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGANKF 174
            R T   S     N  FGC H     F+E   G+ G GR  LSL +Q+ +   QLG N+F
Sbjct: 190 YRDTLSLSTLQLTNFTFGCAHT---TFSE-PTGVAGFGRGLLSLPAQLATHSPQLG-NRF 244

Query: 175 SYCLVPFHTDSSITSK---MYFG--------NGSEVSGGGVVSTSLVSKEDKTYYF-VTL 222
           SYCLV     S    K   +  G        NG EV     V TS++     +Y++ V L
Sbjct: 245 SYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVV--EFVYTSMLENPKHSYFYTVGL 302

Query: 223 EGISVGNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNA 277
           +GISVG  +  + K++   N  G    G + +D+G   T+LP+ FYN + E    + R +
Sbjct: 303 KGISVGKKTVPAPKILRRVNKKG---DGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKS 359

Query: 278 IKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG-GAKVPLIHTSTFIP--------PP 328
            +  P  + + G   CY   + A I P +T  F G  + V L   + F            
Sbjct: 360 NRRAPEIEQKTGLSPCYYL-NTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRR 418

Query: 329 VEGVFCFAM-------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            E V C          +   G  G+ GN+ Q    + YD + + V F    C
Sbjct: 419 KERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 167/380 (43%), Gaps = 44/380 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP  D Y  VDTGSD++WV C  C  C +    Q++   ++P SS +  
Sbjct: 79  GLYYTKLRLGTPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAS 137

Query: 77  ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
            +SC  ++C      S   CS Q  LC YT+ Y D S T G   ++ + F    G+S   
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N    VVFGC  + TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL    
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---- 253

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
                  K   G G  +  G +V  ++V       + +Y V L  ISV     + + +P 
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301

Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
             S  + S G    IDTG     L +  Y    E + NA+  +       G+Q    T S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTS 361

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSD 354
           +  I P ++ +F GGA + L      I     G   V+C   Q I    + I G+    D
Sbjct: 362 VGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 421

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
               YD   Q + +   DC+
Sbjct: 422 KIFVYDLVGQRIGWANYDCS 441


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 99/345 (28%), Positives = 131/345 (37%), Gaps = 27/345 (7%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT-----VSC 93
           +VDT SD+ WVQC PC   QCY Q   +Y+P  S       C S QC  L          
Sbjct: 177 VVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGA 236

Query: 94  SSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNFFDNVVFGCGHN--NTGVFNENEMG 150
            +   C Y   Y D S T G   ++ +T   +         FGC H     G FN    G
Sbjct: 237 GNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKTAG 296

Query: 151 LVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
            + LGR   SL+SQ        N FSYCL P  +     S    G     +    V+  L
Sbjct: 297 FMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLS---LGVPQHAASRYAVTPML 353

Query: 210 VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNR 269
            SK     Y V L GI V     + + +P      A+   N  +D+    T LP   Y  
Sbjct: 354 KSKMAPMIYMVRLIGIDV-----AGQRLPV---PPAVFAANAAMDSRTIITRLPPTAYMA 405

Query: 270 LEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPP 328
           L    R  ++      P+     CY    +  +  P +T  FD  A V L  +   +   
Sbjct: 406 LRAAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML--- 462

Query: 329 VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            +    FA    D   GI GN  Q  L + Y+ D   V F+   C
Sbjct: 463 -DSCLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 149/358 (41%), Gaps = 40/358 (11%)

Query: 42  VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS------S 95
           +D G  L W+QCLPC  C  Q+ P+++P  S ++  +          +TV C       +
Sbjct: 115 LDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAH-------NTVWCRPPYQPLA 167

Query: 96  QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNNTGVFNENEM-GL 151
              C +   Y D++   G LA +  +F   N+ F     +VFGC H      N+  + G+
Sbjct: 168 NGACGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGI 227

Query: 152 VGL-----GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS 206
           +GL     G+   +   Q+L   G  +FSYC  PF    S+ S + FG+         V 
Sbjct: 228 LGLGMGPAGKPPTAFTKQVLPAHGG-RFSYC--PFVPGMSMYSYLRFGSDIPSHPPPNVH 284

Query: 207 TS----LVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
                 L    +   YFV L G+SVG   LS  +  +   N+ GA   G   +D G   T
Sbjct: 285 RQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGA---GGCVVDIGTRMT 341

Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-MAGIAPILTAHFDGGAKVPLI 319
                 Y  ++  VR  ++        +    C + P+    + P +T HF+ GA + ++
Sbjct: 342 AFIHSAYVHIDHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVLPSMTLHFENGAWLRVM 401

Query: 320 HTSTFIPPPVEGVF--CFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ--MVSFKPTDC 373
               F+P  V G    CF       D+ + G   Q +    +D      ++SF P DC
Sbjct: 402 PEHVFMPFVVGGHHYQCFGFVS-STDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 50/383 (13%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
           M+  IG+    ++  I+DTGS+ + VQC        + +P+++PA+S SY+++ C S+ C
Sbjct: 1   MQLGIGSLQK-NLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQLC 53

Query: 86  HLLD--TVSCSSQ------QLCNYTYGYADSSLTKGVLATERITFGNSNNF-----FDNV 132
             +   T + SSQ        C Y+  Y DS  + G  + + I   ++N+      F +V
Sbjct: 54  LAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDV 113

Query: 133 VFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
            FGC H+  G + +   +G+VG  R  LSL SQ+  +LG +KFSYC          T  +
Sbjct: 114 AFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVI 173

Query: 192 YFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNLS-----NSSKLIPYYNS 242
           + G+ S +S   V  T L    V+      Y+V L  ISV   +     ++ KL P    
Sbjct: 174 FLGD-SGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGD 232

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEE--QVRNAIKLTPYQDPRLGSQLCYKT---P 297
            G +      +D+G   T +  D Y          N   L        G   CY      
Sbjct: 233 GGTV------LDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGS 286

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPID----GDVGIFGNF 350
           S+ G+ P +        ++ L     F+P    G     C A+        G + + GN+
Sbjct: 287 SLPGV-PEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNY 345

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            QS+  + YD +   V F+  DC
Sbjct: 346 QQSNYLVEYDNERSRVGFERADC 368


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 170/384 (44%), Gaps = 46/384 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++GTPP  ++  ++DTGS+L W+ C           P +NP  SSSY  +SC
Sbjct: 63  NVSLTISITVGTPPQ-NMSMVIDTGSELSWLHCNTNTTATIPY-PFFNPNISSSYTPISC 120

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C           SC S  LC+ T  YAD+S ++G LA++  TFG  ++F   +VFG
Sbjct: 121 SSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASD--TFGFGSSFNPGIVFG 178

Query: 136 CGHNNTGVFNE---NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C +++    +E   N  GL+G+    LSL    +SQL   KFSYC+    + S  +  + 
Sbjct: 179 CMNSSYSTNSESDSNTTGLMGMNLGSLSL----VSQLKIPKFSYCI----SGSDFSGILL 230

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSGA 245
            G  +   GG +  T LV         D++ Y V LEGI + + L N S  +   + +GA
Sbjct: 231 LGESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGA 290

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDP----RLGSQLCYKTP-- 297
              G    D G   + L    YN L ++  N     L    DP    ++   LCY+ P  
Sbjct: 291 ---GQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVN 347

Query: 298 -SMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDVGIFGN 349
            S     P ++  F+G   +V        +P  V G   V+CF     D    +  I G+
Sbjct: 348 QSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIGH 407

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
             Q  +++ +D     V      C
Sbjct: 408 HHQQSMWMEFDLVEHRVGLAHARC 431


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 166/386 (43%), Gaps = 53/386 (13%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++GTPP  ++  ++DTGS+L W++C       +  +  ++P  SSSY  + C
Sbjct: 82  NVSLTVSLTVGTPPQ-NVSMVLDTGSELSWLRC----NKTQTFQTTFDPNRSSSYSPVPC 136

Query: 81  QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C           SC S QLC+    YAD+S ++G LA++    GNS+      +FG
Sbjct: 137 SSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSD--MPGTIFG 194

Query: 136 CGHNNTGVFNENE---MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C  ++     E +    GL+G+ R  LS     +SQ+   KFSYC+    +DS  +  + 
Sbjct: 195 CMDSSFSTNTEEDSKNTGLMGMNRGSLSF----VSQMDFPKFSYCI----SDSDFSGVLL 246

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS---S 243
            G+ +      +  T L+         D+  Y V LEGI V     SSKL+P   S    
Sbjct: 247 LGDANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKV-----SSKLLPLPKSVFVP 301

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTP 297
                G   +D+G   T L    Y+ L  +  N     L   +DP      G  LCY+ P
Sbjct: 302 DHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVP 361

Query: 298 ---SMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDVGIF 347
              +     P ++  F G   KV        +P  V G   V+CF     D    +  + 
Sbjct: 362 LSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVI 421

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G+  Q ++++ +D +   + F    C
Sbjct: 422 GHHHQQNVWMEFDLEKSRIGFAQVQC 447


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 161/373 (43%), Gaps = 46/373 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKP---IYNPASSSSYKEL 78
           +Y M  S+GTPP+ ++  I DTGS L WVQC  C ++CY Q      I+NP +SS+Y ++
Sbjct: 5   KYFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKV 63

Query: 79  SCQSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
            C +E C+     L     C  +   C Y+  Y     + G L  +R+T   SN   DN 
Sbjct: 64  GCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA-SNRSIDNF 122

Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH-TDSSITSKM 191
           +FGCG +N  ++N    G++G G    S  +Q+  Q     FSYC    H  + S+T   
Sbjct: 123 IFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI-- 178

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
               G       ++ T L+  + K  Y +    + V  +    ++ PY      ISK  +
Sbjct: 179 ----GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGI--RLEIDPYI----YISKMTI 228

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFD 311
            +D+G   T +    ++ L++ +   ++   Y       ++C+       I+   +A+++
Sbjct: 229 -VDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICF-------ISNSGSANWN 280

Query: 312 GGAKVPLIHTSTFIPPPVEGVF--------CFAMQPIDGDVG---IFGNFAQSDLFIGYD 360
               V +    + +  PVE  F        C    P D  V    + GN A     + +D
Sbjct: 281 DFPTVEMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFD 340

Query: 361 FDSQMVSFKPTDC 373
             +    FK   C
Sbjct: 341 IQAMNFGFKARAC 353


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 160/375 (42%), Gaps = 45/375 (12%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           +NG Y  +  IGTPP  +   IVDTGS + +V C  C QC K   P + P SSS+YK + 
Sbjct: 84  SNGYYTTRLFIGTPPQ-EFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQ 142

Query: 80  CQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCG 137
           C         + +C  + + C Y   YA+ S + G+LA + ++FGN +       +FGC 
Sbjct: 143 CNP-------SCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCE 195

Query: 138 HNNTG-VFNENEMGLVGLGRTRLSLASQ-ILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
              TG +F++   G++GLGR  LS+  Q ++ ++  N FS C                  
Sbjct: 196 TVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCY----------------G 239

Query: 196 GSEVSGGGVVSTSLVSKEDKTY-----YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           G +V GG +V  ++    D  +     Y      I +  L  + K +   N      K  
Sbjct: 240 GMDVVGGAMVLGNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLK-LNPRVFDGKHG 298

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLGSQLCY-----KTPSMAGI 302
             +D+G     LP++ +   ++ +   IK        DP   + +C+         ++ I
Sbjct: 299 TVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSY-NDICFSGAGRDVSQLSKI 357

Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYD 360
            P +   F  G K+ L      F    V G +C  + Q       + G     +  + YD
Sbjct: 358 FPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYD 417

Query: 361 FDSQMVSFKPTDCTK 375
            D+  + F  T+C++
Sbjct: 418 RDNDKIGFWKTNCSE 432


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 97/377 (25%), Positives = 162/377 (42%), Gaps = 46/377 (12%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           ++   IGTPP +    ++DTGS L W+QC             ++P+ SS++  L C    
Sbjct: 98  IVDLPIGTPPQVQPM-VLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPV 156

Query: 85  CH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
           C           SC   +LC+Y+Y YAD +  +G L  E+ TF  S  F   ++ GC   
Sbjct: 157 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS-LFTPPLILGCATE 215

Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS---SITSKMYFGNG 196
           +T     +  G++G+ R RLS ASQ  S++   KFSYC VP        + T   Y G+ 
Sbjct: 216 ST-----DPRGILGMNRGRLSFASQ--SKI--TKFSYC-VPTRVTRPGYTPTGSFYLGHN 265

Query: 197 SEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
              +    +     ++       D   Y V L+GI +G       + P    + A   G 
Sbjct: 266 PNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIG--GRKLNISPAVFRADAGGSGQ 323

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSMAGI 302
             +D+G+  T L  + Y+++  +V  A+       PR+         + +C+   ++   
Sbjct: 324 TMLDSGSEFTYLVNEAYDKVRAEVVRAV------GPRMKKGYVYGGVADMCFDGNAIEIG 377

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVE-GVFCFAMQPID---GDVGIFGNFAQSDLFIG 358
             I    F+    V ++     +   VE GV C  +   D       I GNF Q +L++ 
Sbjct: 378 RLIGDMVFEFEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGAASNIIGNFHQQNLWVE 437

Query: 359 YDFDSQMVSFKPTDCTK 375
           +D  ++ + F   DC++
Sbjct: 438 FDLVNRRMGFGTADCSR 454


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 167/380 (43%), Gaps = 44/380 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +G+PP  D Y  VDTGSD++WV C  C  C +    Q++   ++P SS +  
Sbjct: 79  GLYYTKIRLGSPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAT 137

Query: 77  ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
            +SC  ++C      S   CS Q  LC YT+ Y D S T G   ++ + F    G+S   
Sbjct: 138 PVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N    VVFGC  + TG   +++    G+ G G+  +S+ SQ+ SQ L    FS+CL    
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL---- 253

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
                  K   G G  +  G +V  ++V       + +Y V L  ISV     + + +P 
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301

Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
             S  + S G    IDTG     L +  Y    E + NA+  +       G+Q      S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVIATS 361

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSD 354
           +A I P ++ +F GGA + L      I     G   V+C   Q I    + I G+    D
Sbjct: 362 VADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 421

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
               YD   Q + +   DC+
Sbjct: 422 KIFVYDLVGQRIGWANYDCS 441


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 163/384 (42%), Gaps = 49/384 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G+PP   +  ++DTGS+L W+ C    +    +  +++P  SSSY  + C
Sbjct: 60  NVSLTVSLTVGSPPQ-TVTMVLDTGSELSWLHC----KKAPNLHSVFDPLRSSSYSPIPC 114

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C          VSC  ++LC+    YAD+S  +G LA++    GNS       +FG
Sbjct: 115 TSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSA--IPATIFG 172

Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C   G ++    +    GL+G+ R  LS     ++Q+G  KFSYC+     DSS    + 
Sbjct: 173 CMDSGFSSNSDEDSKTTGLIGMNRGSLSF----VTQMGLQKFSYCIS--GQDSS--GILL 224

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSGA 245
           FG  S      +  T LV         D+  Y V LEGI V N +    K +   + +GA
Sbjct: 225 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 284

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM 299
              G   +D+G   T L    Y  L+ +     K  L   +DP    Q    LCY+ P  
Sbjct: 285 ---GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLT 341

Query: 300 AGIAPIL---TAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAM---QPIDGDVGIFGN 349
               P L   T  F G    V        +P  + G   V+CF     + +  +  I G+
Sbjct: 342 RRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGH 401

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
             Q ++++ +D     V F    C
Sbjct: 402 HHQQNVWMEFDLAKSRVGFAEVRC 425


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 162/373 (43%), Gaps = 42/373 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP +    IVD+GS + +V C  C QC K   P + P  SS+Y+ + C
Sbjct: 90  NGYYTTRLWIGTPPQM-FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC 148

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
             + C+  D      ++ C Y   YA+ S +KGVL  + I+FGN +       VFGC   
Sbjct: 149 NMD-CNCDD-----DREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLG+  LSL  Q++ + L +N F  C             M  G GS
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCY----------GGMDVGGGS 252

Query: 198 EVSGGGVVSTSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            + GG    + +V   S  D++ YY + L GI V     S     +    GA+      +
Sbjct: 253 MILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAV------L 306

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS------MAGIAP 304
           D+G     LP   +   EE V   +   K     DP      C++  +      ++ I P
Sbjct: 307 DSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNF-KDTCFQVAASNYVSELSKIFP 365

Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
            +   F  G    L      F    V G +C  + P   D   + G     +  + YD +
Sbjct: 366 SVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRE 425

Query: 363 SQMVSFKPTDCTK 375
           +  V F  T+C++
Sbjct: 426 NSKVGFWRTNCSE 438


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 163/384 (42%), Gaps = 49/384 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G+PP   +  ++DTGS+L W+ C    +    +  +++P  SSSY  + C
Sbjct: 53  NVSLTVSLTVGSPPQ-TVTMVLDTGSELSWLHC----KKAPNLHSVFDPLRSSSYSPIPC 107

Query: 81  QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C          VSC  ++LC+    YAD+S  +G LA++    GNS       +FG
Sbjct: 108 TSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSA--IPATIFG 165

Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C   G ++    +    GL+G+ R  LS     ++Q+G  KFSYC+     DSS    + 
Sbjct: 166 CMDSGFSSNSDEDSKTTGLIGMNRGSLSF----VTQMGLQKFSYCIS--GQDSS--GILL 217

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSGA 245
           FG  S      +  T LV         D+  Y V LEGI V N +    K +   + +GA
Sbjct: 218 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 277

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM 299
              G   +D+G   T L    Y  L+ +     K  L   +DP    Q    LCY+ P  
Sbjct: 278 ---GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLT 334

Query: 300 AGIAPIL---TAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAM---QPIDGDVGIFGN 349
               P L   T  F G    V        +P  + G   V+CF     + +  +  I G+
Sbjct: 335 RRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGH 394

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
             Q ++++ +D     V F    C
Sbjct: 395 HHQQNVWMEFDLAKSRVGFAEVRC 418


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 156/371 (42%), Gaps = 32/371 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSCQ 81
            YV +  +GTPP   +  I D  +D  WV C  C+ C      P ++P  SS+Y+ + C 
Sbjct: 99  SYVARARLGTPPQTLLVAI-DPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCG 157

Query: 82  SEQCHLLD--TVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVF 134
           + QC  +   T SC +     C +   YA S+L   VL  + ++  +SN      D+  F
Sbjct: 158 APQCAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALSLSDSNGAAVPDDHYTF 216

Query: 135 GCGHNNTGVFNE-NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           GC    TG        GLVG GR  LS  SQ  +  G + FSYCL P +  S+ +  +  
Sbjct: 217 GCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYG-SIFSYCL-PSYKSSNFSGTLRL 274

Query: 194 GNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSS----GAISK 248
           G   +     + +T L+S   + + Y+V + G+ V     + K +P   S+     A  +
Sbjct: 275 GPAGQPR--RIKTTPLLSNPHRPSLYYVAMVGVRV-----NGKAVPIPASALALDAATGR 327

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTA 308
           G   +D G   T L    Y  L    R  +   P      G   CY       + P +  
Sbjct: 328 GGTIVDAGTMFTRLSPPAYAALRNAFRRGVS-APAAPALGGFDTCYYVNGTKSV-PAVAF 385

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDG---DVGIFGNFAQSDLFIGYDFDS 363
            F GGA+V L   +  I     GV C AM   P DG    + +  +  Q +  + +D  +
Sbjct: 386 VFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGN 445

Query: 364 QMVSFKPTDCT 374
             V F    CT
Sbjct: 446 GRVGFSRELCT 456


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 116/433 (26%), Positives = 166/433 (38%), Gaps = 73/433 (16%)

Query: 5   TYFYPNNVVQSNVS---TANGEYVMKFSIGTPPLLDIYGI---VDTGSDLMWVQCLP--C 56
           T+  P++     +S       +Y +  S+G  PL     +   +DTGSDL+W  C P  C
Sbjct: 61  THHLPSSRRHRQLSLPLAPGSDYTLSLSVG--PLSTANPVSLFLDTGSDLVWFPCAPFTC 118

Query: 57  VQCYKQVKPIYN------------------------PASSSSYKELSCQSEQCHL--LDT 90
           + C  +  P  N                         A SS+     C + +C L  ++T
Sbjct: 119 MLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAAARCPLDDIET 178

Query: 91  VSCSSQQLCN-YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
            SC++   C    Y Y D SL    L   R+    S    +N  F C H   G      +
Sbjct: 179 GSCAASHACPPLYYAYGDGSLV-ARLRRGRVGIAASVAV-ENFTFACAHTALG----EPV 232

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVP--FHTDSSIT-SKMYFGNG---SEVSGGG 203
           G+ G GR  LSL +Q+     + +FSYCLV   F  D  I  S +  G        S  G
Sbjct: 233 GVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRADRPIRPSPLILGRSPGEDPASETG 292

Query: 204 VVSTSLVSKEDKTYYF-VTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
           +V T L+      Y++ V LE +SVG     ++  P     G    G M +D+G   T+L
Sbjct: 293 IVYTPLLHNPKHPYFYSVALEAVSVGGTRIPAR--PELGRVGRAGDGGMVVDSGTTFTML 350

Query: 263 PKDFYNRLEEQVRNAIKLTPYQDP-----RLGSQLCY--------KTPSMAGIAPILTAH 309
           P + Y R+ E+   A+    ++       + G   CY             A   P L  H
Sbjct: 351 PNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDASAAEEGSARAVPPLAMH 410

Query: 310 FDGGAKVPLIHTSTFI---PPPVEGVFCFAM-----QPIDGDVGIFGNFAQSDLFIGYDF 361
           F G A V L   + F+         V C  +         G  G  GNF Q    + YD 
Sbjct: 411 FRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDV 470

Query: 362 DSQMVSFKPTDCT 374
           D+  V F    CT
Sbjct: 471 DAGRVGFARRRCT 483


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 161/392 (41%), Gaps = 46/392 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV----KPIYN 68
           + S   T  G+Y ++F +GTP    +  + DTGSDL WV+C                ++ 
Sbjct: 90  LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCRGAGAAAGTGAGSPARVFR 148

Query: 69  PASSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFG- 123
            A+S S+  ++C S+ C         +CSS    C Y Y Y D S  +GV+ T+  T   
Sbjct: 149 TAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIAL 208

Query: 124 -------------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
                                VV GC     G   ++  G++ LG + +S AS+  ++ G
Sbjct: 209 SSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFG 268

Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN- 229
             +FSYCLV      + TS + FG G+  +     +  L+ +    +Y VT++ + V   
Sbjct: 269 -GRFSYCLVDHLAPRNATSYLTFGPGA--TAPAAQTPLLLDRRMTPFYAVTVDAVYVAGE 325

Query: 230 -LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDP 286
            L   + +     + GAI      +D+G   T+L    Y  +   +   +   P    DP
Sbjct: 326 ALDIPADVWDVDRNGGAI------LDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMDP 379

Query: 287 RLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD-- 343
               + CY       +  P +  HF G A++     S ++     GV C  +Q  +G   
Sbjct: 380 ---FEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKS-YVIDAAPGVKCIGVQ--EGSWP 433

Query: 344 -VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            V + GN  Q +    +D   + + FK T C 
Sbjct: 434 GVSVIGNILQQEHLWEFDLRDRWLRFKHTRCA 465


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 166/393 (42%), Gaps = 53/393 (13%)

Query: 2   SPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK 61
           +PA    P   + S    A GEY  +  +G+P     + +VDTGS+  W+ C        
Sbjct: 94  TPAEVEMP---MHSGRDDALGEYFAEVKVGSPGQ-RFWLVVDTGSEFTWLNC-------- 141

Query: 62  QVKPIYNPASSSSYKELSCQSEQC-----HLLDTVSCSS-QQLCNYTYGYADSSLTKGVL 115
                     S S++ ++C S +C      L     C      C Y   YAD S  KG  
Sbjct: 142 ----------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFF 191

Query: 116 ATERITFGNSN---NFFDNVVFGCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
            T+ IT G +N      +N+  GC  +  N   FNE   G++GLG  + S   +  ++ G
Sbjct: 192 GTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYG 251

Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
           A KFSYCLV   +  S++S +  G        G +  T L+      +Y V + GIS+G 
Sbjct: 252 A-KFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILF--PPFYGVNVVGISIGG 308

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPT--LLP--KDFYNRLEEQVRNAIKLTPYQD 285
                K+ P      A  +G   ID+G   T  LLP  +  +  L + +    ++T    
Sbjct: 309 --QMLKIPPQVWDFNA--EGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDF 364

Query: 286 PRLGSQLCYKTPSM-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDGD 343
             L  + C+        + P L  HF GGA+  P + +      P+  V C  + PIDG 
Sbjct: 365 DAL--EFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL--VKCIGIVPIDGI 420

Query: 344 VG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            G  + GN  Q +    +D  +  V F P+ CT
Sbjct: 421 GGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 164/382 (42%), Gaps = 45/382 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-----YKQVKPIYNPASSSSYK 76
           G Y  +  +GTPP    Y  +DTGSD++WV C PC  C            ++P  SS+  
Sbjct: 39  GLYYTRIELGTPPR-PFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTAS 97

Query: 77  ELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGN------SNN 127
            LSC   +C   + +S   C++ + C Y++ Y D S T G   ++   +        +NN
Sbjct: 98  PLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNN 157

Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
               + FGC +N +G   + +    G+ G G+  LS+ SQ+ SQ L    FS+CL     
Sbjct: 158 ASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADP 217

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYN 241
              I   +  G   E++  G+V T +V  +   +Y + L+GI+V    LS   ++    N
Sbjct: 218 GGGI---LVLG---EITEPGMVYTPIVPSQ--PHYNLNLQGIAVNGQQLSIDPQVFATTN 269

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-PSMA 300
           + G I      ID G     L ++ Y      +  A+  +  Q   L    C+ T  S+ 
Sbjct: 270 TRGTI------IDCGTTLAYLAEEAYEPFVNTIIAAVSQST-QPFMLKGNPCFLTVHSID 322

Query: 301 GIAPILTAHFDGGAK--VPLIHTSTFIPPPVEGVFCFAMQPI------DGDVGIFGNFAQ 352
            I P +T +F+G      P  +    + P    V+C   Q           + I G+   
Sbjct: 323 EIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVL 382

Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
            D    YD ++Q + +   DC+
Sbjct: 383 KDKVFVYDLENQRIGWTSFDCS 404


>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
          Length = 304

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 165/371 (44%), Gaps = 89/371 (23%)

Query: 26  MKFSIGTPPLL--DIYGIVDTGSDLMWVQCLPCVQCYKQVKP-----IYNPASSSSYKEL 78
           M+ ++GTPP+    ++GI    SDL WV+C PC  C     P     +Y+ A+SSS+  L
Sbjct: 1   MELAVGTPPVTVQALFGI----SDLCWVECTPCSGCNNNAAPPAGARLYDRANSSSFSPL 56

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGYA----DSSLTKGVLATERITFG-NSNNFFDNVV 133
                           +   C Y Y Y     D +  KG+L TE I FG N      +  
Sbjct: 57  ----------------ADTECGYRYVYGATDTDRNYVKGILGTETIKFGSNDAATVQSFT 100

Query: 134 FGCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           FGC +      +F+ N  G+VGLGR++LSL    + QLG ++FSYCL    ++ ++ S +
Sbjct: 101 FGCTNTVYRNDLFDGNT-GVVGLGRSKLSL----VGQLGLDRFSYCLA---SNPNVASPV 152

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGN 250
            FG+ + + G GV ST L+   D   Y+V L GISV    + ++L IP  N +  +S+  
Sbjct: 153 LFGSTASMDGNGVSSTPLL--PDDANYYVNLLGISV----DGTRLAIP--NDTARMSRTY 204

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
             ++       L       +++  +N + +                       P +T HF
Sbjct: 205 EAVNGSGLLCFL-------VDDASKNVVTV-----------------------PTMTMHF 234

Query: 311 DGGAKVPLIHTSTFIPPPVEG------VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           D G  + L+  + F     +       V C  +        I GN+ Q D  + Y+  + 
Sbjct: 235 D-GMDMELLFGNYFAYTGKQSGGGGGDVLCLMIGKSSTGSRI-GNYLQMDFHVLYELKNS 292

Query: 365 MVSFKPTDCTK 375
           ++S +P DC K
Sbjct: 293 VLSVQPADCGK 303


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 167/374 (44%), Gaps = 45/374 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP  +   IVD+GS + +V C  C QC     P + P  SSSY  + C
Sbjct: 85  NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKC 143

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D    S ++ C Y   YA+ S + GVL  + ++FG  +     + +FGC ++
Sbjct: 144 N------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENS 197

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ + + ++ FS C             M  G G+
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY----------GGMDIGGGA 247

Query: 198 EVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNM 251
            V GG +    ++ S  D     YY + L+ I V    L   S++   +N     SK   
Sbjct: 248 MVLGGMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRI---FN-----SKHGT 299

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
            +D+G     LP+  +   +E V    +++K     DP     +C+         +  + 
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSY-KDICFAGAGRNVSKLHEVF 358

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
           P +   F  G K+ L      F    V+G +C  + Q       + G     +  + YD 
Sbjct: 359 PDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDR 418

Query: 362 DSQMVSFKPTDCTK 375
            ++ + F  T+C++
Sbjct: 419 HNEKIGFWKTNCSE 432


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 77/378 (20%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           V  F+IGTPP      I+D                     P   P +SS+++   C ++ 
Sbjct: 68  VANFTIGTPPQ-PASAIIDVAGP----------------APCSFPNASSTFRPEPCGTDA 110

Query: 85  CHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGC----G 137
           C  + T +CSS  +C Y  G  +S L   T G++AT+    G +     ++ FGC    G
Sbjct: 111 CKSIPTSNCSSN-MCTYE-GTINSKLGGHTLGIVATDTFAIGTATA---SLGFGCVVASG 165

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            +  G       GL+GLGR      S ++SQ+   KFSYCL P   DS   S++  G+ +
Sbjct: 166 IDTMG----GPSGLIGLGRA----PSSLVSQMNITKFSYCLTPH--DSGKNSRLLLGSSA 215

Query: 198 EVSGGGVVSTSLVSK-----EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           +++GGG  +T+   K     +   YY + L+GI  G+ + +  L P  N+        + 
Sbjct: 216 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIA--LPPSGNT--------VL 265

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
           + T AP + L    Y  L+++V  A+   P   P     LC+    ++   AP L   F 
Sbjct: 266 VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQ 325

Query: 312 GGAKVPLIHTSTFIPPPV--------EGVFCFAM--------QPIDGDVGIFGNFAQSDL 355
            GA       +  +PPP         +G  C A+          +D ++ I G+  Q + 
Sbjct: 326 QGAA------ALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENT 379

Query: 356 FIGYDFDSQMVSFKPTDC 373
               D + + +SF+P DC
Sbjct: 380 HFLLDLEKKTLSFEPADC 397


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 112/358 (31%), Positives = 137/358 (38%), Gaps = 53/358 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELSC 80
           YV+  S+GTP +      VDTGSDL WVQC PC     CY Q  P+++PA SSSY  + C
Sbjct: 140 YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
               C  L   +                       A      G    FF    FGCGH  
Sbjct: 199 GGPVCAGLGIYA---------------------ASACSAAQCGAVQGFF----FGCGHAQ 233

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G+FN  + GL+GLGR + SL  Q     G   FSYCL    T  S    +  G G    
Sbjct: 234 SGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYCL---PTKPSTAGYLTLGVGGPSG 288

Query: 201 GGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
                ST+  L S    TYY V L GISVG    S   +P    +G        + T  P
Sbjct: 289 AAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS---VPASAFAGGTVVDTGTVVTRLP 345

Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
           PT      Y  L    R+ +    Y        L  CY       +  P +   F  GA 
Sbjct: 346 PTA-----YAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGAT 400

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           V L            G   FA    DG + I GN  Q    +    D   V FKP+ C
Sbjct: 401 VTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 452


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 110/434 (25%), Positives = 164/434 (37%), Gaps = 73/434 (16%)

Query: 4   ATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYK 61
           AT F+  +   S   +   +Y + F++G+ P   I   +DTGSDL+W  C P  C+ C  
Sbjct: 53  ATRFHHRHRQISLPLSPGSDYTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEG 112

Query: 62  QVKPI----YNPASSSSYKELSCQSEQC--------------------HLLDTVSCSSQQ 97
           +         +P + +S   +SC+S  C                     L++T  CSS  
Sbjct: 113 KYDTAATGGLSPPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFS 172

Query: 98  LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRT 157
              + Y Y D SL   +         +S     N  FGC H   G      +G+ G GR 
Sbjct: 173 CPPFYYAYGDGSLVARLYRDSLSMPASSPLVLHNFTFGCAHTALG----EPVGVAGFGRG 228

Query: 158 RLSLASQILS---QLGANKFSYCLVPFHTDSSITSK---MYFGNGS----------EVSG 201
            LSL +Q+ S    LG N+FSYCLV    D+    +   +  G  S             G
Sbjct: 229 VLSLPAQLASFSPHLG-NQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRG 287

Query: 202 GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAP 258
             V +  L + +   +Y V LEGI+VGN     + IP       + +   G M +D+G  
Sbjct: 288 EFVYTAMLDNPKHPYFYCVGLEGITVGN-----RKIPVPEILKRVDRRGNGGMVVDSGTT 342

Query: 259 PTLLPKDFYNRLEEQVRNAI----KLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGA 314
            T+LP   Y  L  +  + +    K     + R G   CY +   A   P +  HF G +
Sbjct: 343 FTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLGPCYYSDDSAAKVPAVALHFVGNS 402

Query: 315 KVPLIHTSTFIP--------PPVEGVFCFAMQ------PIDGDVGIFGNFAQSDLFIGYD 360
            V L   + +               V C  +          G     GN+ Q    + YD
Sbjct: 403 TVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYD 462

Query: 361 FDSQMVSFKPTDCT 374
            +   V F    C 
Sbjct: 463 LEKHRVGFARRKCA 476


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/326 (29%), Positives = 137/326 (42%), Gaps = 30/326 (9%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
           I+D+GSD+ WVQC PC    C++Q  P+++PA S++Y  + C S  C  L      CS+ 
Sbjct: 171 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 230

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
             C +   Y D S   G  + + +T G   +      FGC H + G  F+ +  G + LG
Sbjct: 231 AQCQFGINYGDGSTATGTYSFDDLTLG-PYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 289

Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
               SL  Q  ++ G   FSYCL P  T SS+   +  G   E +      VST L+S  
Sbjct: 290 GGSQSLVQQTATRYG-RVFSYCLPP--TASSL-GFLVLGVPPERAQLIPSFVSTPLLSSS 345

Query: 214 -DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
              T+Y V L  I V   +     +P      A+   +  ID+    + LP   Y  L  
Sbjct: 346 MAPTFYRVLLRAIIV---AGRPLAVP-----PAVFSASSVIDSSTIISRLPPTAYQALRA 397

Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
             R+A+ +     P      CY    +  I  P +   FDGGA V L      +      
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------ 451

Query: 332 VFCFAMQPIDGDV--GIFGNFAQSDL 355
             C A  P   D   G  GN  Q  L
Sbjct: 452 GSCLAFAPTASDRMPGFIGNVQQKTL 477



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 62/247 (25%), Positives = 91/247 (36%), Gaps = 25/247 (10%)

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           +G G   TG ++ +++ L      R  L  +  +Q G   FSYC+ P     S +S  + 
Sbjct: 492 YGDGSTATGTYSFDDLTLGPYDVDRQGLPLRTATQYG-RVFSYCIPP-----SPSSLGFI 545

Query: 194 GNGSEVSGGGVV----STSLVSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
             G       +V    ST L+S      T+Y V L  I V        + P   S+ ++ 
Sbjct: 546 TLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVAG--RPLPVPPTVFSTSSVI 603

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
                I      + LP   Y  L    R A+ +     P      CY    +  I  P +
Sbjct: 604 ASTTVI------SRLPPTAYQALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSI 657

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
              FDGGA V L      +    +G   FA    D   G  GN  Q  L + YD   + +
Sbjct: 658 ALVFDGGATVNLDAAGILL----QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAI 713

Query: 367 SFKPTDC 373
            F+   C
Sbjct: 714 RFRSAAC 720


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 112/425 (26%), Positives = 158/425 (37%), Gaps = 91/425 (21%)

Query: 23  EYVMKFSIG-TPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQ----------------- 62
           +Y +  S+G       +   +DTGSDL+W  C P  C+ C  +                 
Sbjct: 89  DYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRR 148

Query: 63  ---VKPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCN-YTYGYADSSLTKGVLA 116
                P+ + A +S+     C + +C L  ++T SC +   C    Y Y D SL    L 
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH-LR 207

Query: 117 TERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA 171
             R+  G           DN  F C H   G      +G+ G GR  LSL  Q+  QL +
Sbjct: 208 RGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQLSPQL-S 262

Query: 172 NKFSYCLVP--FHTDSSIT-SKMYFGNGSEVSGG------GVVSTSLVSKEDKTYYF-VT 221
            +FSYCLV   F  D  I  S +  G   + +        G V T L+      Y++ V 
Sbjct: 263 GRFSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVA 322

Query: 222 LEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE---------- 271
           LE +SVG     ++  P          G M +D+G   T+LP + Y R+           
Sbjct: 323 LEAVSVGAARIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAA 380

Query: 272 -----EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIP 326
                E+      LTP          CY+  +     P L  HF G A V L   + F+ 
Sbjct: 381 GFARAERAEEQTGLTP----------CYRYAASDRGVPPLALHFRGNATVALPRRNYFMG 430

Query: 327 PPVEG---------VFCFAM--------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
              E          V C  +        +  DG  G  GNF Q    + YD D+  V F 
Sbjct: 431 FKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFA 490

Query: 370 PTDCT 374
              CT
Sbjct: 491 RRRCT 495


>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 76/137 (55%), Gaps = 4/137 (2%)

Query: 61  KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
           KQ  PIY+PA SS+Y ++SC+S  C+ L    C S   C Y Y Y D S+T G+L+ E +
Sbjct: 1   KQPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETL 60

Query: 121 TF---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           T      +     N  FGCG NN G   +   G+VGLGR  LSL SQ+ + +   KFSYC
Sbjct: 61  TLTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM-PKKFSYC 119

Query: 178 LVPFHTDSSITSKMYFG 194
           L+      S TS + FG
Sbjct: 120 LMTIDDSQSKTSPLMFG 136


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 48/342 (14%)

Query: 53  CLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTK 112
           C  C+ C+KQ  P++ P +SS++K   C ++ C  + T  C+S  +C Y         T 
Sbjct: 55  CSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCAS-DVCAYDGVTGLGGHTV 113

Query: 113 GVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
           G++AT+    G +         G     T        G +GLGRT  SL    ++Q+   
Sbjct: 114 GIVATDTFAIGTAAPARPPAS-GASWRATSTPWAGPSGFIGLGRTPWSL----VAQMKLT 168

Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGN 229
           +FSYCL P   D+   S+++ G  ++++GGG  +  + +  +     YY + LE I  G 
Sbjct: 169 RFSYCLAPH--DTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAG- 225

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
             +++  +P   ++  +    + +      +LL    Y   ++ V  ++   P   P +G
Sbjct: 226 --DATITMPRGRNTVLVQTAVVRV------SLLVDSVYQEFKKAVMASVGAAPTATP-VG 276

Query: 290 S--QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVF-------CFAMQPI 340
           +  ++C+    ++G AP L   F  GA + +        PP   +F       C ++  I
Sbjct: 277 APFEVCFPKAGVSG-APDLVFTFQAGAALTV--------PPANYLFDVGNDTVCLSVMSI 327

Query: 341 --------DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                   DG + I G+F Q ++ + +D D  M+SF+P DC+
Sbjct: 328 ALLNITALDG-LNILGSFQQENVHLLFDLDKDMLSFEPADCS 368


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 96/370 (25%), Positives = 159/370 (42%), Gaps = 46/370 (12%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKP---IYNPASSSSYKELSCQ 81
           M  S+GTPP+ ++  I DTGS L WVQC  C ++CY Q      I+NP +SS+Y ++ C 
Sbjct: 1   MGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCS 59

Query: 82  SEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           +E C+     L     C  +   C Y+  Y     + G L  +R+T   SN   DN +FG
Sbjct: 60  TEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA-SNRSIDNFIFG 118

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH-TDSSITSKMYFG 194
           CG +N  ++N    G++G G    S  +Q+  Q     FSYC    H  + S+T      
Sbjct: 119 CGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI----- 171

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
            G       ++ T L+  + K  Y +    + V  +    ++ PY      ISK  + +D
Sbjct: 172 -GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGI--RLEIDPYI----YISKMTI-VD 223

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGA 314
           +G   T +    ++ L++ +   ++   Y       ++C+ + S        +A+++   
Sbjct: 224 SGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSG-------SANWNDFP 276

Query: 315 KVPLIHTSTFIPPPVEGVF--------CFAMQPIDGDVG---IFGNFAQSDLFIGYDFDS 363
            V +    + +  PVE  F        C    P D  V    + GN A     + +D  +
Sbjct: 277 TVEMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQA 336

Query: 364 QMVSFKPTDC 373
               FK   C
Sbjct: 337 MNFGFKARAC 346


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 168/384 (43%), Gaps = 65/384 (16%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP +    IVDTGS + +V C  C QC +   P + P SSS+Y+ + C
Sbjct: 81  NGYYTTRLWIGTPPQM-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC 139

Query: 81  QSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGC 136
                    T+ C   S +  C Y   YA+ S + GVL  + I+FGN +       VFGC
Sbjct: 140 ---------TIDCNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGC 190

Query: 137 GHNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFG 194
            +  TG +++++  G++GLGR  LS+  Q++ + + ++ FS C               +G
Sbjct: 191 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC---------------YG 235

Query: 195 NGSEVSGGGVVSTSLVSKEDKT----------YYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            G +V GG +V   +    D            YY + L+ I V     + K +P  N++ 
Sbjct: 236 -GMDVGGGAMVLGGISPPSDMAFAYSDPVRSPYYNIDLKEIHV-----AGKRLP-LNANV 288

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYKTPSMAG 301
              K    +D+G     LP+  +   ++ +     ++K     DP   + +C+   S AG
Sbjct: 289 FDGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNY-NDICF---SGAG 344

Query: 302 IA--------PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFA 351
           I         P++   F+ G K  L      F    V G +C  + Q  +    + G   
Sbjct: 345 IDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGII 404

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
             +  + YD +   + F  T+C +
Sbjct: 405 VRNTLVVYDREQTKIGFWKTNCAE 428


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 117/379 (30%), Positives = 146/379 (38%), Gaps = 62/379 (16%)

Query: 3   PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
           PA++ Y       ++ T N  YV+  S+GTP +      VDTGSDL WVQC PC     C
Sbjct: 128 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCSAAPSC 177

Query: 60  YKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATER 119
           Y Q  P+++PA SSSY  + C    C  L   +                       A   
Sbjct: 178 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYA---------------------ASACSA 216

Query: 120 ITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
              G    FF    FGCGH  +G+FN  + GL+GLGR + SL  Q     G   FSYCL 
Sbjct: 217 AQCGAVQGFF----FGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYCL- 269

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
              T  S    +  G G         ST+  L S    TYY V L GISVG    S   +
Sbjct: 270 --PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS---V 324

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK 295
           P    +G        + T  PPT      Y  L    R+ +    Y        L  CY 
Sbjct: 325 PASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTCYN 379

Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
                 +  P +   F  GA V L            G   FA    DG + I GN  Q  
Sbjct: 380 FAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRS 435

Query: 355 LFIGYDFDSQMVSFKPTDC 373
             +    D   V FKP+ C
Sbjct: 436 FEV--RIDGTSVGFKPSSC 452


>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 76/137 (55%), Gaps = 4/137 (2%)

Query: 61  KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
           KQ  PIY+PA SS+Y ++SC+S  C+ L    C S   C Y Y Y D S+T G+L+ E +
Sbjct: 1   KQPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETL 60

Query: 121 TF---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           T      +     N  FGCG NN G   +   G+VGLGR  LSL SQ+ + +   KFSYC
Sbjct: 61  TLTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM-PKKFSYC 119

Query: 178 LVPFHTDSSITSKMYFG 194
           L+      S TS + FG
Sbjct: 120 LMTIDDSQSKTSPLMFG 136


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 107/407 (26%), Positives = 169/407 (41%), Gaps = 67/407 (16%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASSSSYKELSC 80
           +Y + F++G+ P   I   +DTGSDL+W  C P  C+ C  + +       +     +SC
Sbjct: 74  DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSC 133

Query: 81  QS------------------EQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
           QS                   +C L  ++T  CSS     + Y Y D S    +    + 
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVANLY---QQ 190

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI--LSQLGANKFSYCL 178
           T   S+    N  FGC H           G+ G GR  LSL +Q+  LS    N+FSYCL
Sbjct: 191 TLSLSSLHLQNFTFGCAHTALA----EPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCL 246

Query: 179 VPFHTDSSITSK---MYFGNGSE-VSGGG------VVSTSLVSKEDKTYYF-VTLEGISV 227
           V    D     +   +  G  ++ ++G G       V TS++S     YY+ V L GISV
Sbjct: 247 VSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISV 306

Query: 228 GNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTP 282
           G  +  + +++   +  G    G M +D+G   T+LP+ FY    N  +++V    K   
Sbjct: 307 GKRTVPAPEILKRVDEKG---NGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRAS 363

Query: 283 YQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---------VF 333
             + + G   CY    ++ I P+L  HF G     ++    +    ++G         V 
Sbjct: 364 EIETKTGLGPCYYLNGLSQI-PVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVG 422

Query: 334 CFAMQ------PIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           C  +        +DG  G   GN+ Q    + YD + + V F   +C
Sbjct: 423 CMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 165/375 (44%), Gaps = 46/375 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP +    IVD+GS + +V C  C QC K   P + P  SS+Y+ + C
Sbjct: 91  NGYYTTRLWIGTPPQM-FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKC 149

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
             + C+  D      ++ C Y   YA+ S +KGVL  + I+FGN +       VFGC   
Sbjct: 150 NMD-CNCDD-----DKEQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 203

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLG+  LSL  Q++ + L +N F  C             M  G GS
Sbjct: 204 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCY----------GGMDVGGGS 253

Query: 198 EVSGGGVVSTSLV---SKEDKT-YYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNM 251
            + GG    + ++   S  D++ YY + L GI V    LS +S++  +    GA+     
Sbjct: 254 MILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRV--FDGEHGAV----- 306

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY------KTPSMAGI 302
            +D+G     LP   +   EE V   +   K     DP      C+          ++ I
Sbjct: 307 -LDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNF-KDTCFLVAASNDVSELSKI 364

Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYD 360
            P +   F  G    L      F    V G +C  + P   D   + G     +  + YD
Sbjct: 365 FPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYD 424

Query: 361 FDSQMVSFKPTDCTK 375
            ++  V F  T+C++
Sbjct: 425 RENSKVGFWRTNCSE 439


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/273 (31%), Positives = 128/273 (46%), Gaps = 23/273 (8%)

Query: 113 GVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
           GVLATE  TFG   NF  N+ FGCG    G       G++G+    LS    +L QL   
Sbjct: 5   GVLATETFTFGAHQNFSANLTFGCGKLTNGTI-AGASGIMGVSPGPLS----VLKQLSIT 59

Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEV----SGGGVVSTSLVSKE-DKTYYFVTLEGISV 227
           KFSYCL PF TD   TS + FG  +++    + G V +  L+    +  YY+V + GIS+
Sbjct: 60  KFSYCLTPF-TDHK-TSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISI 117

Query: 228 GNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
           G+   +  + I      G    G   +D+      L +  +  L++ V   +KL      
Sbjct: 118 GSKRLDVPEAILALRPDGT---GGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRS 174

Query: 287 RLGSQLCYKTP---SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PI 340
                +C++ P   SM G+  P L  HF G A++ L   S F   P  G+ C A+   P 
Sbjct: 175 IDDYPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPF 233

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +G   + GN  Q ++ + YD  ++  S+ PT C
Sbjct: 234 EGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 97/326 (29%), Positives = 137/326 (42%), Gaps = 30/326 (9%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
           I+D+GSD+ WVQC PC    C++Q  P+++PA S++Y  + C S  C  L      CS+ 
Sbjct: 80  IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 139

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
             C +   Y D S   G  + + +T G   +      FGC H + G  F+ +  G + LG
Sbjct: 140 AQCQFGINYGDGSTATGTYSFDDLTLG-PYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 198

Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
               SL  Q  ++ G   FSYCL P  T SS+   +  G   E +      VST L+S  
Sbjct: 199 GGSQSLVQQTATRYG-RVFSYCLPP--TASSL-GFLVLGVPPERAQLIPSFVSTPLLSSS 254

Query: 214 -DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
              T+Y V L  I V   +     +P      A+   +  ID+    + LP   Y  L  
Sbjct: 255 MAPTFYRVLLRAIIV---AGRPLAVP-----PAVFSASSVIDSSTIISRLPPTAYQALRA 306

Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
             R+A+ +     P      CY    +  I  P +   FDGGA V L      +      
Sbjct: 307 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------ 360

Query: 332 VFCFAMQPIDGDV--GIFGNFAQSDL 355
             C A  P   D   G  GN  Q  L
Sbjct: 361 GSCLAFAPTASDRMPGFIGNVQQKTL 386



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 62/247 (25%), Positives = 91/247 (36%), Gaps = 25/247 (10%)

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           +G G   TG ++ +++ L      R  L  +  +Q G   FSYC+ P     S +S  + 
Sbjct: 401 YGDGSTATGTYSFDDLTLGPYDVDRQGLPLRTATQYG-RVFSYCIPP-----SPSSLGFI 454

Query: 194 GNGSEVSGGGVV----STSLVSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
             G       +V    ST L+S      T+Y V L  I V        + P   S+ ++ 
Sbjct: 455 TLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVAG--RPLPVPPTVFSTSSVI 512

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
                I      + LP   Y  L    R A+ +     P      CY    +  I  P +
Sbjct: 513 ASTTVI------SRLPPTAYQALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSI 566

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
              FDGGA V L      +    +G   FA    D   G  GN  Q  L + YD   + +
Sbjct: 567 ALVFDGGATVNLDAAGILL----QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAI 622

Query: 367 SFKPTDC 373
            F+   C
Sbjct: 623 RFRSAAC 629


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 100/381 (26%), Positives = 156/381 (40%), Gaps = 46/381 (12%)

Query: 24  YVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
           Y +   +GT    + Y + +D  +   W+QC PC  C  Q+ P+++PA S +++ +S   
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGH- 159

Query: 83  EQCHLLDTVSCS------SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN---VV 133
                 + V C           C +   Y + +   G LA +  +F   +N F +   +V
Sbjct: 160 ------NAVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIV 213

Query: 134 FGCGHNNTGVFNEN-------EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
           FGC  N    F+ +        MG+   G+       Q+    G  +FSYC  P    ++
Sbjct: 214 FGCA-NRIARFDTHGALAGVLGMGMGAEGKPLTGFMRQLYHN-GGGRFSYC--PIVPGTT 269

Query: 187 ITSKMYFGNG-SEVSGGGVVSTSLVSKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNS 242
             S + FGN        GV   S+      T    Y+V L GISVG L     + P    
Sbjct: 270 AYSFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGAL-RVPGVTPEMFE 328

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP---YQDPRLGSQLC-YKTPS 298
                +G   ID G   T + +  Y  +E  VR  ++       Q P  G  LC ++TP+
Sbjct: 329 RDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSP--GHHLCVHRTPA 386

Query: 299 MAGIAPILTAHFDGGA--KVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSD 354
           +    P +T HF GG   +V   H    +  P  G    C  + P D ++ + G   Q D
Sbjct: 387 IEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVP-DAEMTVIGAMQQID 445

Query: 355 LFIGYDFDSQ--MVSFKPTDC 373
               +D  +   +VSF P DC
Sbjct: 446 TRFIFDLHNNIPIVSFNPEDC 466


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 108/397 (27%), Positives = 178/397 (44%), Gaps = 68/397 (17%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPAS 71
           + T  G Y  +  IGTP     Y  VDTGSD++WV C+ C  C ++        +Y+P +
Sbjct: 82  IPTDTGLYFTQIGIGTPSK-GYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTA 140

Query: 72  SSSYKELSCQSEQCHLLDT----VSCSSQQLCNYTYGYADSSLTKGVLATERITF----- 122
           S+S K ++C  E C          SC++   C Y+  Y D S T G    + + +     
Sbjct: 141 SASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSG 200

Query: 123 -GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYC 177
            G +N    +V FGCG    G    + +   G++G G+   S+ SQ+ S     K FS+C
Sbjct: 201 DGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHC 260

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVST-SLVSKEDKT--------YYFVTLEGISVG 228
           L                    V+GGG+ +  ++V  + KT        +Y V L+ I VG
Sbjct: 261 L------------------DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVG 302

Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQD 285
              ++ +L       G  S+G + ID+G     LP+  Y  +   V      + L   QD
Sbjct: 303 G--STLQLPTNIFDIGGGSRGTI-IDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD 359

Query: 286 PRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQ 338
                 LC++ + S+    P +T HFDG   +PL ++   ++    E V+C       +Q
Sbjct: 360 -----FLCFQYSGSVDNGFPEVTFHFDG--DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQ 412

Query: 339 PIDG-DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             DG D+ + G+ A S+  + YD ++Q++ +   +C+
Sbjct: 413 SKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCS 449


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 94/374 (25%), Positives = 163/374 (43%), Gaps = 43/374 (11%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
           +NG Y  +  IGTPP  +   IVDTGS + +V C  C QC K   P + P  SS+Y+ + 
Sbjct: 73  SNGYYTTRLFIGTPPQ-EFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVK 131

Query: 80  CQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCG 137
           C         + +C  + + C Y   YA+ S + GV+A + ++FGN +       VFGC 
Sbjct: 132 CNP-------SCNCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCE 184

Query: 138 HNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGN 195
           +  TG ++++   G++GLGR RLS+  Q++ + +  + FS C             M  G 
Sbjct: 185 NVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCY----------GGMDVGG 234

Query: 196 GSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
           G+ V G      ++V          YY + L+ + V       K   +    G +     
Sbjct: 235 GAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTV----- 289

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTPSMAGIA 303
            +D+G      P+  ++ L++ +   I   K  P  DP     +C+     +   ++ + 
Sbjct: 290 -LDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNY-HDICFSGAGREVSHLSKVF 347

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSDLFIGYDF 361
           P +   F  G K+ L      F    V G +C  +     D+  + G     +  + YD 
Sbjct: 348 PEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDR 407

Query: 362 DSQMVSFKPTDCTK 375
           ++  + F  T+C++
Sbjct: 408 ENDKIGFWKTNCSE 421


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 166/374 (44%), Gaps = 45/374 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP  +   IVD+GS + +V C  C QC     P + P  SSSY  + C
Sbjct: 86  NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC 144

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D    S ++ C Y   YA+ S + GVL  + ++FG  +       VFGC ++
Sbjct: 145 N------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENS 198

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ + + ++ FS C             M  G G+
Sbjct: 199 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY----------GGMDIGGGA 248

Query: 198 EVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNM 251
            V GG    + +V S  D     YY + L+ I V    L   S++   +N     SK   
Sbjct: 249 MVLGGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRV---FN-----SKHGT 300

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
            +D+G     LP+  +   ++ V    +++K     DP     +C+         +  + 
Sbjct: 301 VLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNY-KDICFAGAGRNVSKLHEVF 359

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
           P +   F  G K+ L      F    V+G +C  + Q       + G     +  + YD 
Sbjct: 360 PDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDR 419

Query: 362 DSQMVSFKPTDCTK 375
            ++ + F  T+C++
Sbjct: 420 HNEKIGFWKTNCSE 433


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 112/424 (26%), Positives = 156/424 (36%), Gaps = 90/424 (21%)

Query: 23  EYVMKFSIG-TPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQ----------------- 62
           +Y +  S+G       +   +DTGSDL+W  C P  C+ C  +                 
Sbjct: 89  DYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRR 148

Query: 63  ---VKPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCN-YTYGYADSSLTKGVLA 116
                P+ + A +S+     C   +C L  ++T SC +   C    Y Y D SL    L 
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH-LR 207

Query: 117 TERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA 171
             R+  G           DN  F C H   G      +G+ G GR  LSL  Q+  QL +
Sbjct: 208 RGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQLSPQL-S 262

Query: 172 NKFSYCLVP--FHTDSSIT-SKMYFGNG-----SEVSGGGVVSTSLVSKEDKTYYF-VTL 222
            +FSYCLV   F  D  I  S +  G       +     G V T L+      Y++ V L
Sbjct: 263 GRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVAL 322

Query: 223 EGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE----------- 271
           E +SVG     ++  P          G M +D+G   T+LP + Y R+            
Sbjct: 323 EAVSVGAARIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAG 380

Query: 272 ----EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPP 327
               E+      LTP          CY+  +     P L  HF G A V L   + F+  
Sbjct: 381 FARAERAEEQTGLTP----------CYRYAASDRGVPPLALHFRGNATVALPRRNYFMGF 430

Query: 328 PVEG---------VFCFAM--------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
             E          V C  +        +  DG  G  GNF Q    + YD D+  V F  
Sbjct: 431 KSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFAR 490

Query: 371 TDCT 374
             CT
Sbjct: 491 RRCT 494


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 161/374 (43%), Gaps = 45/374 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP  +   IVD+GS + +V C  C QC     P + P  SS+Y  + C
Sbjct: 82  NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC 140

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
            +      D    S +  C Y   YA+ S + GVL  + ++FG  +       VFGC ++
Sbjct: 141 SA------DCTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENS 194

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ + +  + FS C             M  G G+
Sbjct: 195 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----------GGMDIGGGA 244

Query: 198 EVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            V G       +V S+ D     YY + L+ I V     + +L P        SK    +
Sbjct: 245 MVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAG--KALRLDPRIFD----SKHGTVL 298

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA---------- 303
           D+G     LP+  +   ++ V +  K+ P +  R G    YK    AG            
Sbjct: 299 DSGTTYAYLPEQAFVAFKDAVTS--KVRPLKKIR-GPDPNYKDICFAGAGRNVSQLSQAF 355

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
           P +   F  G K+ L      F    VEG +C  + Q       + G     +  + YD 
Sbjct: 356 PDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 415

Query: 362 DSQMVSFKPTDCTK 375
            ++ + F  T+C++
Sbjct: 416 HNEKIGFWKTNCSE 429


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 171/381 (44%), Gaps = 42/381 (11%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  +GTPP    Y  VDTGSD++WV C+ C +C ++         Y+P +SS
Sbjct: 79  TDTGLYFTEIKLGTPPK-RYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASS 137

Query: 74  SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           S   +SC    C          C++   C Y+  Y D S T G   T+ + F      G 
Sbjct: 138 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQ 197

Query: 125 SNNFFDNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           +      V FGCG    G     N+   G++G G+   S+ SQ+ +     K F++CL  
Sbjct: 198 TQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL-- 255

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                +I     F  G+ V    V +T LV+  D  +Y V L+ I VG    ++  +P +
Sbjct: 256 ----DTIKGGGIFAIGNVVQ-PKVKTTPLVA--DMPHYNVNLKSIDVG---GTTLQLPAH 305

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP-SM 299
                  KG + ID+G   T LP+  +  +   + N  +   + + +    +C++ P S+
Sbjct: 306 VFETGERKGTI-IDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQ--DFMCFQYPGSV 362

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQS 353
               P +T HF+    + +     F P   + ++C      A+Q  DG D+ + G+   S
Sbjct: 363 DDGFPTITFHFEDDLALHVYPHEYFFPNGND-MYCVGFQNGALQSKDGKDIVLMGDLVLS 421

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           +  + YD ++Q++ +   +C+
Sbjct: 422 NKLVIYDLENQVIGWTDYNCS 442


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 108/383 (28%), Positives = 168/383 (43%), Gaps = 61/383 (15%)

Query: 28  FSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCH- 86
            +IGTPP  +I  ++DTGS+L W++C    +       I+NP +S +Y ++ C S+ C  
Sbjct: 71  LTIGTPPQ-NITMVLDTGSELSWLRC----KKEPNFTSIFNPLASKTYTKIPCSSQTCKT 125

Query: 87  ----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
               L   V+C   +LC++   YAD+S  +G LA E   FG+        VFGC  + + 
Sbjct: 126 RTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTR--PATVFGCMDSGSS 183

Query: 143 VFNENEM---GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
              E +    GL+G+ R  LS     ++Q+G  KFSYC+    +    T  +  G     
Sbjct: 184 SNTEEDAKTTGLMGMNRGSLSF----VNQMGFRKFSYCISGLDS----TGFLLLGEARYS 235

Query: 200 SGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS------SGAIS 247
               +  T LV         D+  Y V LEGI V N     K++P   S      +GA  
Sbjct: 236 WLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNN-----KVLPLPKSVFVPDHTGA-- 288

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQ--VRNAIKLTPYQDPRLGSQ----LCY---KTPS 298
            G   +D+G   T L    Y+ L ++  ++ A  L    +P+   Q    LCY    T S
Sbjct: 289 -GQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSS 347

Query: 299 MAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGI----FGNF 350
                P++   F G    V        +P  V G   V+CF     D ++GI     G+ 
Sbjct: 348 TLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSD-ELGISSFLIGHH 406

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
            Q ++++ YD ++  + F    C
Sbjct: 407 QQQNVWMEYDLENSRIGFAELRC 429


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 173/380 (45%), Gaps = 55/380 (14%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-YNPASSSSYKELSCQSE 83
           ++   IGTPP      ++DTGS L W+QC    +   +  P  ++P  SSS+  L C   
Sbjct: 79  IVSLPIGTPPQTQQM-VLDTGSQLSWIQC----KVPPKTPPTAFDPLLSSSFSVLPCNHS 133

Query: 84  QC------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
            C      + L T SC   +LC+Y+Y YAD +  +G L  E+ TF +S      ++ GC 
Sbjct: 134 LCKPRVPDYTLPT-SCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQT-TPPLILGCA 191

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD--SSITSKMYFGN 195
            +++     +  G++G+   RLS +S  L+++  +KFSYC+ P  +   SS T   Y G 
Sbjct: 192 TDSS-----DTQGILGMNLGRLSFSS--LAKI--SKFSYCVPPRRSQSGSSPTGSFYLGP 242

Query: 196 GSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISK 248
               +G   V+      +  +   D   Y + + GI + G   N S      + SGA   
Sbjct: 243 NPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGA--- 299

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS--------QLCYKTPSMA 300
           G   ID+G   T L  + Y++++E++   +KL     P+L           +C+   +M 
Sbjct: 300 GQTLIDSGTWFTFLVDEAYSKVKEEI---VKLA---GPKLKKGYVYGGSLDMCFDGDAMV 353

Query: 301 GIAPI--LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDL 355
               I  +   F+ G ++ ++     +     GV C  +   D       I GNF Q DL
Sbjct: 354 IGRMIGNMAFEFENGVEI-VVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFHQQDL 412

Query: 356 FIGYDFDSQMVSFKPTDCTK 375
           ++ +D   + V F  TDC++
Sbjct: 413 WVEFDLVGRRVGFGRTDCSR 432


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 168/386 (43%), Gaps = 56/386 (14%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSS--S 74
           +G Y     +G PP    LDI    DTGSDL WVQC  PC  C K   P+Y P   +  S
Sbjct: 196 DGLYYTYIMVGEPPRPYFLDI----DTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVS 251

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--NV 132
           +K+  C   Q +  D   C++ Q CNY   YAD S + GVL  +  T   SN      N 
Sbjct: 252 FKDSLCMEVQRNY-DGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSNGSLTKLNA 310

Query: 133 VFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           +FGC ++  G+         G++GL R ++SL SQ+ S+ +  N   +CL     D +  
Sbjct: 311 IFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLT---GDPAGG 367

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             ++ G+   V   G+   +++      +Y   +  I  G+       IP    +   S+
Sbjct: 368 GYLFLGD-DFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGS-------IPLSLDTWGSSR 419

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY----QDPRLGSQLCYKTP-SMAGIA 303
             +  D+G+  T   K+ Y +L   V N  +++ +    QD      +C+KT  S+  + 
Sbjct: 420 EQVVFDSGSSYTYFTKEAYYQL---VANLEEVSAFGLILQDS--SDTICWKTEQSIRSVK 474

Query: 304 PI------LTAHFDGGAKVPLIHTSTFIPPP------VEGVFCFAM----QPIDGDVGIF 347
            +      LT  F  G++  L+ T   I P        EG  C  +    Q  DG   I 
Sbjct: 475 DVKHFFKPLTLQF--GSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIIL 532

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G+ A     + YD  +Q + +  +DC
Sbjct: 533 GDNALRGKLVVYDNVNQRIGWTSSDC 558


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 164/384 (42%), Gaps = 43/384 (11%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
           + G Y  +  +G P    I   VDTGSD++WV C PC  C ++        +Y+P  SS+
Sbjct: 25  SGGLYFTQVGLGNPVKHYIVQ-VDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESST 83

Query: 75  YKELSCQSEQC---HLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFG--NSN-- 126
              +SC    C          CS +   C Y + Y D S ++G    + + +   +SN  
Sbjct: 84  TSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGL 143

Query: 127 -NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPF 181
            N    V+FGC    TG  + ++    G++G G+  LS+ +Q+ +Q    + FS+CL   
Sbjct: 144 ANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--- 200

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
             +      +  G  +E    G+  T LV   D  +Y V L GISV    NS++L     
Sbjct: 201 EGEKRGGGILVIGGIAEP---GMTYTPLV--PDSVHYNVVLRGISV----NSNRLPIDAE 251

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
              + +   + +D+G      P   YN   + +R A   TP +   + +Q    +  ++ 
Sbjct: 252 DFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSD 311

Query: 302 IAPILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQ-------PIDG-DVGIFGN 349
           + P +T +F+GGA        +      P     V+C   Q       P DG  + I G+
Sbjct: 312 LFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGD 371

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
               D  + YD D+  + +   +C
Sbjct: 372 IVLKDKLVVYDLDNSRIGWMSYNC 395


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 165/374 (44%), Gaps = 45/374 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP +    IVDTGS + +V C  C QC +   P + P SSS+Y+ + C
Sbjct: 109 NGYYTTRLWIGTPPQM-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC 167

Query: 81  QSEQCHLLDTVSCS---SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGC 136
                    T+ C+    +  C Y   YA+ S + GVL  + I+FGN +       VFGC
Sbjct: 168 ---------TIDCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGC 218

Query: 137 GHNNTG-VFNENEMGLVGLGRTRLSLASQIL-SQLGANKFSYCLVPFHTDSSITSKMYFG 194
            +  TG +++++  G++GLGR  LS+  Q++  ++ ++ FS C             M  G
Sbjct: 219 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY----------GGMDVG 268

Query: 195 NGSEVSGGGVVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            G+ V GG    + +    S  D++ Y+     I +  +  + K +P  N++    K   
Sbjct: 269 GGAMVLGGISPPSDMTFAYSDPDRSPYY----NIDLKEMHVAGKRLP-LNANVFDGKHGT 323

Query: 252 FIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
            +D+G     LP+  +   ++ +     ++K     DP   + +C+         ++   
Sbjct: 324 VLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNY-NDICFSGAGNDVSQLSKSF 382

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
           P++   F  G K  L      F    V G +C  + Q  +    + G     +  + YD 
Sbjct: 383 PVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDR 442

Query: 362 DSQMVSFKPTDCTK 375
           +   + F  T+C +
Sbjct: 443 EQTKIGFWKTNCAE 456


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 173/384 (45%), Gaps = 49/384 (12%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  IGTP     Y  VDTGSD++WV C+ C  C ++        +Y+P+ SS
Sbjct: 76  TETGLYFTQIGIGTPAK-SYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSS 134

Query: 74  SYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN 126
           S   ++C  + C   H     SC     C Y+  Y D S T G   T+ + +    GNS 
Sbjct: 135 SGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQ 194

Query: 127 NFFDN--VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
               N  + FGCG    G    +     G++G G++  S+ SQ+ +     K F++CL  
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCL-- 252

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIP 238
                +I     F  G +V    V +T LV      +Y V LE I VG   L   + +  
Sbjct: 253 ----DTINGGGIFAIG-DVVQPKVSTTPLV--PGMPHYNVNLEAIDVGGVKLQLPTNIFD 305

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TP 297
              S G I      ID+G     LP   YN +  +V       P ++ +     C++ + 
Sbjct: 306 IGESKGTI------IDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQ--DFQCFRYSG 357

Query: 298 SMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNF 350
           S+    PI+T HF+GG  +PL IH   ++    E ++C       +Q  DG D+ + G+ 
Sbjct: 358 SVDDGFPIITFHFEGG--LPLNIHPHDYLFQNGE-LYCMGFQTGGLQTKDGKDMVLLGDL 414

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
           A S+  + YD ++Q++ +   +C+
Sbjct: 415 AFSNRLVLYDLENQVIGWTDYNCS 438


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 94/364 (25%), Positives = 158/364 (43%), Gaps = 46/364 (12%)

Query: 42  VDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYKELSCQSEQC---HLLDTVSC 93
           VDTGSD++WV C PC  C ++        +Y+P  SS+   +SC    C          C
Sbjct: 19  VDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQC 78

Query: 94  S-SQQLCNYTYGYADSSLTKGVLATERITFG--NSN---NFFDNVVFGCGHNNTGVFNEN 147
           S +   C Y + Y D S ++G    + + +   +SN   N    V+FGC    TG  + +
Sbjct: 79  SQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFGCSIRQTGDLSTS 138

Query: 148 EM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
           +    G++G G+  LS+ +Q+ +Q    + FS+CL     +      +  G  +E    G
Sbjct: 139 QQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL---EGEKRGGGILVIGGIAE---PG 192

Query: 204 VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLP 263
           +  T LV   D  +Y V L GISV    NS++L        + +   + +D+G      P
Sbjct: 193 MTYTPLV--PDSVHYNVVLRGISV----NSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFP 246

Query: 264 KDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTST 323
              YN   + +R A   TP +   + +Q    +  ++ + P +T +F+GGA    +    
Sbjct: 247 SGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAME--LQPDN 304

Query: 324 FI------PPPVEGVFCFAMQ-------PIDG-DVGIFGNFAQSDLFIGYDFDSQMVSFK 369
           ++      P     V+C   Q       P DG  + I G+    D  + YD D+  + + 
Sbjct: 305 YLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWM 364

Query: 370 PTDC 373
             +C
Sbjct: 365 SYNC 368


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 159/393 (40%), Gaps = 50/393 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYK------------------- 61
           G Y++    GTP L   Y +V DT +DL W+ C    +  K                   
Sbjct: 125 GMYLVSVRFGTPAL--PYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKE 182

Query: 62  -QVKPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLAT 117
            + K  Y PA SSS++ + C  ++C LL   +C   S  + C+Y     D +LT G+   
Sbjct: 183 ARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGK 242

Query: 118 ERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           E+ T   S+        ++ GC     G   +   G++ LG   +S A     + G  +F
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFG-QRF 301

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNS 233
           S+CL+  ++    +S + FG    V G G + T +V   D K  Y   + GI VG     
Sbjct: 302 SFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGG---- 357

Query: 234 SKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
            +L IP   +++   +  G + +DT    T L  + Y  +   +   +   P      G 
Sbjct: 358 ERLDIPQEIWDAEKVVG-GGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGF 416

Query: 291 QLCYKTPSMAG---------IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI- 340
           + CY+  + AG           P LT    GGA++     S  +P  V GV C A + + 
Sbjct: 417 EYCYRW-TFAGDGVDLTHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLP 475

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            G  GI GN    +     D     + F+   C
Sbjct: 476 RGGPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 103/409 (25%), Positives = 175/409 (42%), Gaps = 66/409 (16%)

Query: 24  YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQCYK------QVKPIYNPASS 72
           Y++  +IGTPP  + +Y  +DTGSDL WV C      C+ C        +   I++P  S
Sbjct: 11  YLITLNIGTPPQAVQVY--MDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHS 68

Query: 73  SSYKELSCQSEQCHLLDT----------VSCSSQQLC---------NYTYGYADSSLTKG 113
           SS    SC S  C  + +            CS   L          ++ Y Y +  L  G
Sbjct: 69  SSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSG 128

Query: 114 VLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
           +L  + +     +       FGC    T  ++E  +G+ G GR  LSL SQ+        
Sbjct: 129 ILTRDILKARTRD--VPRFSFGCV---TSTYHE-PIGIAGFGRGLLSLPSQL--GFLEKG 180

Query: 174 FSYCLVPFH--TDSSITSKMYFGNGS---EVSGGGVVSTSLVSKEDKTYYFVTLEGISVG 228
           FS+C +PF    + +I+S +  G  +    ++     +  L +      Y++ LE I++G
Sbjct: 181 FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIG 240

Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPYQDP 286
                +++        +   G M +D+G   T LP  FY++L   +++ I        + 
Sbjct: 241 TNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRATETES 300

Query: 287 RLGSQLCYKTP-----------SMAGIAPILTAHFDGGAKVPLIHTSTF--IPPPVEG-- 331
           R G  LCYK P            +  + P +T +F   A + L   ++F  +  P +G  
Sbjct: 301 RTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGSV 360

Query: 332 VFCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
           V C   Q ++    G  G+FG+F Q ++ + YD + + + F+  DC  +
Sbjct: 361 VQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLE 409


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 105/385 (27%), Positives = 170/385 (44%), Gaps = 50/385 (12%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  +GTPP    Y  VDTGSD++WV C+ C QC  +        +Y+P +SS
Sbjct: 83  TDTGLYYTEVRLGTPPKR-FYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASS 141

Query: 74  SYKELSCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITF------ 122
           +   + C    C   DT       CS+   C Y+  Y D S T G    + + F      
Sbjct: 142 TGSTVMCDQGFC--ADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGD 199

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCL 178
           G +     +V+FGCG    G    +     G++G G    S+ SQ+ +     K F++CL
Sbjct: 200 GQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL 259

Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKL 236
                  +I     F  G +V    V +T LV+  DK +Y V L+ I VG   L   + +
Sbjct: 260 ------DTIKGGGIFAIG-DVVQPKVKTTPLVA--DKPHYNVNLKTIDVGGTTLELPADI 310

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK- 295
                  G I      ID+G   T LP+  + ++   V N  +   + D +    LC++ 
Sbjct: 311 FKPGEKRGTI------IDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQ--DFLCFEY 362

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGN 349
           + S+    P LT HF+    + +     F P   + V+C      A+Q  DG D+ + G+
Sbjct: 363 SGSVDDGFPTLTFHFEDDLALHVYPHEYFFPNGND-VYCVGFQNGALQSKDGKDIVLMGD 421

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
              S+  + YD +++++ +   +C+
Sbjct: 422 LVLSNKLVVYDLENRVIGWTDYNCS 446


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 165/379 (43%), Gaps = 36/379 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK---------QVKPIYNPASS 72
           G+Y + F +GTP       + DTGSDL W+ C    +            + K +++   S
Sbjct: 81  GQYFVAFKVGTPSQ-KFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLS 139

Query: 73  SSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF---G 123
           SS+K + C ++ C   L+D  S ++       C Y Y Y+D S   G  A E +T     
Sbjct: 140 SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKE 199

Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
                  NV+ GC  +  G   +   G++GLG ++ S A +   + G  KFSYCLV   +
Sbjct: 200 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG-KFSYCLVDHLS 258

Query: 184 DSSITSKMYFG--NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--Y 239
             ++++ + FG     E     +  T LV     ++Y V + GIS+G    +   IP   
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG---GAMLKIPSEV 315

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
           ++  GA   G   +D+G+  T L +  Y  +   +R ++      +  +G  + C+ +  
Sbjct: 316 WDVKGA---GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372

Query: 299 M-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDL 355
               + P L  HF  GA+  P +   +++    +GV C     +      + GN  Q + 
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPV--KSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430

Query: 356 FIGYDFDSQMVSFKPTDCT 374
              +D   + + F P+ CT
Sbjct: 431 LWEFDLGLKKLGFAPSSCT 449


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 159/393 (40%), Gaps = 50/393 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYK------------------- 61
           G Y++    GTP L   Y +V DT +DL W+ C    +  K                   
Sbjct: 125 GMYLVSVRFGTPAL--PYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKE 182

Query: 62  -QVKPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLAT 117
            + K  Y PA SSS++ + C  ++C LL   +C   S  + C+Y     D +LT G+   
Sbjct: 183 ARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGK 242

Query: 118 ERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           E+ T   S+        ++ GC     G   +   G++ LG   +S A     + G  +F
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFG-QRF 301

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNS 233
           S+CL+  ++    +S + FG    V G G + T +V   D K  Y   + GI VG     
Sbjct: 302 SFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGG---- 357

Query: 234 SKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
            +L IP   +++   +  G + +DT    T L  + Y  +   +   +   P      G 
Sbjct: 358 ERLDIPQEIWDAEKVVG-GGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGF 416

Query: 291 QLCYKTPSMAG---------IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI- 340
           + CY+  + AG           P LT    GGA++     S  +P  V GV C A + + 
Sbjct: 417 EYCYRW-TFAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLP 475

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            G  GI GN    +     D     + F+   C
Sbjct: 476 RGGPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/427 (26%), Positives = 157/427 (36%), Gaps = 93/427 (21%)

Query: 23  EYVMKFSIG-TPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQ----------------- 62
           +Y +  S+G       +   +DTGSDL+W  C P  C+ C  +                 
Sbjct: 93  DYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPP 152

Query: 63  ---------VKPIYNPASSSSYKELSCQSEQCHLLD--TVSC--SSQQLCNYTYGYADSS 109
                      P+ + A +S+     C +  C L D  T SC  +S       Y Y D S
Sbjct: 153 PPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGS 212

Query: 110 LTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
           L    L   R+  G S    DN  F C H   G      +G+ G GR  LSL  Q+  QL
Sbjct: 213 LVA-HLRRGRVGLGASVA-VDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQLAPQL 266

Query: 170 GANKFSYCLV--PFHTDSSIT-SKMYFGNGSEVSG--GGVVSTSLVSKEDKTYYF-VTLE 223
            + +FSYCLV   F  D  I  S +  G   + +   GG V T L+      Y++ V LE
Sbjct: 267 -SGRFSYCLVSHSFRADRLIRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALE 325

Query: 224 GISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRL------------- 270
            +SVG     ++  P          G M +D+G   T+LP + Y R+             
Sbjct: 326 AVSVGATRIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGF 383

Query: 271 --EEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPP 328
              E+      LTP          CY   +     P L  HF G A V L   + F+   
Sbjct: 384 ARAERAEEQTGLTP----------CYHYAASDRGVPPLALHFRGNATVALPRRNYFMGFK 433

Query: 329 VE----------GVFCFAMQ----------PIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
            E           V C  +             DG  G  GNF Q    + YD D+  V F
Sbjct: 434 SEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGF 493

Query: 369 KPTDCTK 375
               CT+
Sbjct: 494 ARRRCTE 500


>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 75/137 (54%), Gaps = 4/137 (2%)

Query: 61  KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
           KQ  PIY+PA SS+Y ++SC+S  C+ L    C S   C Y Y Y D S+T G+L+ E +
Sbjct: 1   KQPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETL 60

Query: 121 TF---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
           T      +        FGCG NN G   +   G+VGLGR  LSL SQ+ + +   KFSYC
Sbjct: 61  TLTSKSGAEQLIPKFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM-PKKFSYC 119

Query: 178 LVPFHTDSSITSKMYFG 194
           L+      S TS + FG
Sbjct: 120 LMTIDDSQSKTSPLMFG 136


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 143/335 (42%), Gaps = 42/335 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            YV++  +GTP +  +   +DT +D  W  C PC  C    +  + PASSSSY  L C S
Sbjct: 78  SYVVRAGLGTP-VQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCAS 134

Query: 83  EQCHLLDTVSCSSQQ-------LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           + C L +   C + Q        C ++  +AD+S  +  L ++ +  G   +      FG
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLG--KDAIAGYAFG 191

Query: 136 C-GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C G       N  + GL+GLGR  +SL SQ  S+     FSYCL  + +        YF 
Sbjct: 192 CVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYN-GVFSYCLPSYRS-------YYFS 243

Query: 195 NGSEVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNL-----SNSSKLIPYYNSSG 244
               +   G    V  T L++   + + Y+V + G+SVG       + S    P   +  
Sbjct: 244 GSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGT 303

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGI 302
            I  G +     AP        Y  L E+ R  +   P     LG+   C+ T  + AG 
Sbjct: 304 VIDSGTVITRWTAP-------VYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGG 355

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
           AP +T H DGG  + L   +T I      + C AM
Sbjct: 356 APPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM 390


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 108/422 (25%), Positives = 169/422 (40%), Gaps = 76/422 (18%)

Query: 13  VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQC----------------L 54
           + S   T  G+Y ++F +GTP  P L +    DTGSDL WV+C                L
Sbjct: 76  LSSGAYTGTGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCHRAAAAASASPRNASSL 132

Query: 55  PCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSL 110
           P        +  + P  S ++  + C S  C         +C++    C Y Y Y D S 
Sbjct: 133 P-APAPASPRRTFRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSA 191

Query: 111 TKGVLATERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI 165
            +G +  +  T   S           VV GC  +  G       G++ LG + +S AS+ 
Sbjct: 192 ARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRA 251

Query: 166 LSQLGANKFSYCLVPFHTDSSITSKMYFGN----GSEVSGGGVVS--------------- 206
            S+ G  +FSYCLV      + TS + FG      S     G+ S               
Sbjct: 252 ASRFG-GRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAP 310

Query: 207 ----TSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-GNMFIDTGAPPT 260
               T LV     + +Y VT++G+SV       +L+    +   + + G   +D+G   T
Sbjct: 311 GARQTPLVLDHRTRPFYAVTVKGVSVAG-----ELLKIPRAVWDVEQGGGAILDSGTSLT 365

Query: 261 LLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQLCYK--TPSMAGIA---PILTAHFDGG 313
           +L K  Y  +   +   +   P    DP      CY   +PS + +A   P+L  HF G 
Sbjct: 366 MLAKPAYRAVVAALSKRLAGLPRVTMDP---FDYCYNWTSPSGSDVAAPLPMLAVHFAGS 422

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           A++     S ++     GV C  +Q  P  G + + GN  Q +    YD  ++ + FK +
Sbjct: 423 ARLEPPAKS-YVIDAAPGVKCIGLQEGPWPG-LSVIGNILQQEHLWEYDLKNRRLRFKRS 480

Query: 372 DC 373
            C
Sbjct: 481 RC 482


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 143/335 (42%), Gaps = 42/335 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            YV++  +GTP +  +   +DT +D  W  C PC  C    +  + PASSSSY  L C S
Sbjct: 78  SYVVRAGLGTP-VQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCAS 134

Query: 83  EQCHLLDTVSCSSQQ-------LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           + C L +   C + Q        C ++  +AD+S  +  L ++ +  G   +      FG
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLG--KDAIAGYAFG 191

Query: 136 C-GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C G       N  + GL+GLGR  +SL SQ  S+     FSYCL  + +        YF 
Sbjct: 192 CVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYN-GVFSYCLPSYRS-------YYFS 243

Query: 195 NGSEVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNL-----SNSSKLIPYYNSSG 244
               +   G    V  T L++   + + Y+V + G+SVG       + S    P   +  
Sbjct: 244 GSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGT 303

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGI 302
            I  G +     AP        Y  L E+ R  +   P     LG+   C+ T  + AG 
Sbjct: 304 VIDSGTVITRWTAP-------VYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGG 355

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
           AP +T H DGG  + L   +T I      + C AM
Sbjct: 356 APPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM 390


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 165/379 (43%), Gaps = 36/379 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK---------QVKPIYNPASS 72
           G+Y + F +GTP       + DTGSDL W+ C    +            + K +++   S
Sbjct: 10  GQYSVAFKVGTPSQ-KFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLS 68

Query: 73  SSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF---G 123
           SS+K + C ++ C   L+D  S ++       C Y Y Y+D S   G  A E +T     
Sbjct: 69  SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKE 128

Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
                  NV+ GC  +  G   +   G++GLG ++ S A +   + G  KFSYCLV   +
Sbjct: 129 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG-KFSYCLVDHLS 187

Query: 184 DSSITSKMYFG--NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--Y 239
             ++++ + FG     E     +  T LV     ++Y V + GIS+G    +   IP   
Sbjct: 188 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG---GAMLKIPSEV 244

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
           ++  GA   G   +D+G+  T L +  Y  +   +R ++      +  +G  + C+ +  
Sbjct: 245 WDVKGA---GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 301

Query: 299 M-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDL 355
               + P L  HF  GA+  P +   +++    +GV C     +      + GN  Q + 
Sbjct: 302 FEESLVPRLVFHFADGAEFEPPV--KSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 359

Query: 356 FIGYDFDSQMVSFKPTDCT 374
              +D   + + F P+ CT
Sbjct: 360 LWEFDLGLKKLGFAPSSCT 378


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 165/379 (43%), Gaps = 36/379 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK---------QVKPIYNPASS 72
           G+Y + F +GTP       + DTGSDL W+ C    +            + K +++   S
Sbjct: 81  GQYSVAFKVGTPSQ-KFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLS 139

Query: 73  SSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF---G 123
           SS+K + C ++ C   L+D  S ++       C Y Y Y+D S   G  A E +T     
Sbjct: 140 SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKE 199

Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
                  NV+ GC  +  G   +   G++GLG ++ S A +   + G  KFSYCLV   +
Sbjct: 200 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG-KFSYCLVDHLS 258

Query: 184 DSSITSKMYFG--NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--Y 239
             ++++ + FG     E     +  T LV     ++Y V + GIS+G    +   IP   
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG---GAMLKIPSEV 315

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
           ++  GA   G   +D+G+  T L +  Y  +   +R ++      +  +G  + C+ +  
Sbjct: 316 WDVKGA---GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372

Query: 299 M-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDL 355
               + P L  HF  GA+  P +   +++    +GV C     +      + GN  Q + 
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPV--KSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430

Query: 356 FIGYDFDSQMVSFKPTDCT 374
              +D   + + F P+ CT
Sbjct: 431 LWEFDLGLKKLGFAPSSCT 449


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 165/378 (43%), Gaps = 25/378 (6%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK--PIYNPASS 72
           S +     +Y  +  +GTP       +VDTGS+L WV C    +   +VK   ++    S
Sbjct: 79  SGIDYGTAQYFTEVRVGTPAK-KFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEES 137

Query: 73  SSYKELSCQSEQCH--LLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITFGNSN 126
            S+K + C ++ C   L++  S S+       C+Y Y YAD S  +GV A E IT G +N
Sbjct: 138 KSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTN 197

Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
                   ++ GC  + +G   +   G++GL  +  S  S   S  GA K SYCLV   +
Sbjct: 198 GRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGA-KLSYCLVDHLS 256

Query: 184 DSSITSKMYFGNGSEVSGGGVV---STSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
           + +I++ + FG  S  +        +T L       +Y + + GIS+G+      L    
Sbjct: 257 NKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGD----DMLDIPT 312

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPS- 298
               A + G   +D+G   TLL +  Y  +   + R  ++L   +   +  + C+ + S 
Sbjct: 313 QVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSG 372

Query: 299 -MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIFGNFAQSDLF 356
                 P LT H  GGA+    H  +++     GV C   M        + GN  Q +  
Sbjct: 373 FNESKLPQLTFHLKGGARFE-PHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYL 431

Query: 357 IGYDFDSQMVSFKPTDCT 374
             +D  +  +SF P+ CT
Sbjct: 432 WEFDLMASTLSFAPSTCT 449


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 167/387 (43%), Gaps = 41/387 (10%)

Query: 6   YFYPNNVVQ-SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
           Y +PN  ++  +   +NG Y  +  IGTPP  +   IVDTGS + +V C  C  C K   
Sbjct: 69  YHHPNARMRLYDDLLSNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSDCEHCGKHQD 127

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
           P + P  SS+Y  + C  +     D V+C       Y   YA+ S + GVL  + I+FGN
Sbjct: 128 PRFQPDESSTYHPVKCNMDCNCDHDGVNCV------YERRYAEMSSSSGVLGEDIISFGN 181

Query: 125 SNNFF-DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGAN-KFSYCLVPF 181
            +       VFGC +  TG ++++   G++GLGR +LS+  Q++ +   N  FS C    
Sbjct: 182 QSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCY--- 238

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGNLSNSSKLI 237
                    M+ G G+ V GG      +V S+ D     YY + L+ I V       KL 
Sbjct: 239 -------GGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAG--KPLKLS 289

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCY 294
           P    S    K    +D+G     LP++ +    + +    + +K     DP   + +C+
Sbjct: 290 P----STFDRKHGTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNY-NDICF 344

Query: 295 K-----TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFG 348
                    ++   P +   F  G K+ L      F    V G +C  +        + G
Sbjct: 345 SGAGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLG 404

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
                +  + YD +++ + F  T+C++
Sbjct: 405 GIIVRNTLVTYDRENEKIGFWKTNCSE 431


>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
 gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 104/402 (25%), Positives = 164/402 (40%), Gaps = 60/402 (14%)

Query: 8   YPNNVVQSNVSTANGEYVMKFSIGTP-PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
           Y +   +  VS  +  Y++ F +G   P  +I  +VDTGSD+ W     C          
Sbjct: 94  YSDGRHEGRVSIPDASYIITFYLGNQRPEDNISAVVDTGSDIFWTTEKEC---------- 143

Query: 67  YNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---------CNYT--YGYADSSLTKGVL 115
              + S +   L C S +C    +  C   +L         C Y   YG   +  T GV+
Sbjct: 144 ---SRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAGVM 200

Query: 116 ATERITFGN-------SNNFFDNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILS 167
             +++T          S+  F  V  GC  + T  F +  + G+ GLGR+    A+ +  
Sbjct: 201 YEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRS----ATSLPR 256

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV-------SKEDKTYYFV 220
           QL  +KFSYCL  +  +  + S +      +++ G V   + V       + + KT YFV
Sbjct: 257 QLNFSKFSYCLSSYQ-EPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTLYFV 315

Query: 221 TLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL 280
            L+ IS+G           + +    S GNMF+DTGA  T L    + +L  ++   +K 
Sbjct: 316 HLQNISIGGTR--------FPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKE 367

Query: 281 TPY---QDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVF 333
             Y   Q  R   Q+CY  PS A       P +  HF   A + L   S       +   
Sbjct: 368 RKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCL 427

Query: 334 CFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
                 I G + + GNF   +  +  D  ++ +SF   DC+K
Sbjct: 428 AIYKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSK 469


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 165/375 (44%), Gaps = 47/375 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP +    IVDTGS + +V C  C QC +   P + P  SS+Y+ + C
Sbjct: 78  NGYYTTRLWIGTPPQM-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC 136

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  LD    + +  C Y   YA+ S + GVL  + ++FGN +       VFGC + 
Sbjct: 137 T------LDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENV 190

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +++++  G++GLGR  LS+  Q++ + + ++ FS C             M  G G+
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY----------GGMDVGGGA 240

Query: 198 EVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            V GG    + +V ++ D     YY + L+ I V     + K +P  N S    K    +
Sbjct: 241 MVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHV-----AGKRLP-LNPSVFDGKHGSVL 294

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIK---LTPYQDPRLGSQLCYKTPSMAGIA------- 303
           D+G     LP++ +   +E +   ++        DP   + LC+   S AGI        
Sbjct: 295 DSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNY-NDLCF---SGAGIDVSQLSKT 350

Query: 304 -PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYD 360
            P++   F  G K  L      F    V G +C  + Q       + G     +  + YD
Sbjct: 351 FPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYD 410

Query: 361 FDSQMVSFKPTDCTK 375
            +   + F  T+C +
Sbjct: 411 REQTKIGFWKTNCAE 425


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 113/407 (27%), Positives = 171/407 (42%), Gaps = 71/407 (17%)

Query: 24  YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQVK------------ 64
           Y++  +IGTPP ++ +Y  +DTGSDL WV C      C+ C  Y+  K            
Sbjct: 12  YLISLNIGTPPQVIQVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSS 69

Query: 65  --------PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVL 115
                   P      SS      C    C L   +  +  + C ++ Y Y    +  G L
Sbjct: 70  SYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTL 129

Query: 116 A--TERITFGNSNNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA 171
              T R+  G +    D     FGC     G      +G+ G  R  LS  SQ+   L  
Sbjct: 130 TRDTLRVHEGPARVTKDIPKFCFGC----VGSTYHEPIGIAGFVRGTLSFPSQL--GLLK 183

Query: 172 NKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVG 228
             FS+C + F    + +I+S +  G+ +  S   +  T ++ S     YY++ LE I+VG
Sbjct: 184 KGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVG 243

Query: 229 NLSNSSKLIPY----YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
           N+S ++  +P     ++S G    G M ID+G   T LP+ FY++L   +  AI   P  
Sbjct: 244 NVSATT--VPLNLREFDSQG---NGGMLIDSGTTYTHLPEPFYSQLLS-IFKAIITYPRA 297

Query: 285 ---DPRLGSQLCYKTP-------SMAGIAPILTAHFDGGAKVPLIHTSTF----IPPPVE 330
              + R G  LCYK P           + P +T HF       L   + F     P    
Sbjct: 298 TEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNST 357

Query: 331 GVFCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            V C   Q +     G  G+FG+F Q ++ I YD + + + F+P DC
Sbjct: 358 VVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 115/413 (27%), Positives = 171/413 (41%), Gaps = 73/413 (17%)

Query: 23  EYVMKFSIGTPPLLD-IYGIVDTGSDLMWVQCLP--CVQCYKQ-----VKPIYN------ 68
           +Y + F++G       I   +DTGSDL+W  C P  C+ C  +       P  N      
Sbjct: 69  DYTLSFNLGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVA 128

Query: 69  -----PASSSSYKELS----CQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
                PA S+++        C + +C L  ++T  C++ +   + Y Y D SL   +   
Sbjct: 129 VSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL---IARL 185

Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGANKF 174
            R T   S+ F  N  FGC H           G+ G GR  LSL +Q+ +   QLG N+F
Sbjct: 186 YRDTLSLSSLFLRNFTFGCAHTTLA----EPTGVAGFGRGLLSLPAQLATLSPQLG-NRF 240

Query: 175 SYCLVPFHTDSSITSK-------MYFGNGSEVSGGGV---VSTSLVSKEDKTYYF-VTLE 223
           SYCLV    DS    K        Y     E  GGGV   V TS++      Y++ V+L 
Sbjct: 241 SYCLVSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLI 300

Query: 224 GISVGNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQVRNAI 278
           GI+VG  +  + +++   N+ G    G + +D+G   T+LP  FYN      + +V    
Sbjct: 301 GIAVGKRTIPAPEMLRRVNNRG---DGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDN 357

Query: 279 KLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG--AKVPLIHTSTFI--------PPP 328
           K     + + G   CY   S+A + P LT  F GG  + V L   + F            
Sbjct: 358 KRARKIEEKTGLAPCYYLNSVADV-PALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKG 416

Query: 329 VEGVFCFAMQ------PIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
              V C  +        + G  G   GN+ Q    + YD + + V F    C 
Sbjct: 417 KRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 168/377 (44%), Gaps = 39/377 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYK 76
           G Y  +  +G+PP  + Y  +DTGSD++WV C  C  C +      P+  ++P SSS+  
Sbjct: 66  GLYFTRVLLGSPPK-EFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTAS 124

Query: 77  ELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS-NN 127
            +SC  ++C L        CSSQ   C YT+ Y D S T G   ++ + F    G+S  N
Sbjct: 125 LISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 184

Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
              ++VFGC  + TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL     
Sbjct: 185 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 244

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYN 241
              I          E+    +V + LV  +   +Y + L+ ISV   +L+   ++     
Sbjct: 245 GGGILVLG------EIVEEDIVYSPLVPSQ--PHYNLNLQSISVNGKSLAIDPEVFATST 296

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           + G I      +D+G     L ++ Y+     +  A+  +       G+Q    T S+ G
Sbjct: 297 NRGTI------VDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKG 350

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDLFI 357
           I P ++ +F GG  + L      +     G   V+C   Q I G  + I G+    D   
Sbjct: 351 IFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIF 410

Query: 358 GYDFDSQMVSFKPTDCT 374
            YD   Q + +   DC+
Sbjct: 411 VYDLAGQRIGWANYDCS 427


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/390 (27%), Positives = 162/390 (41%), Gaps = 58/390 (14%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++GTPP  ++  ++DTGS+L W+ C    Q        +NP  SSSY  + C
Sbjct: 70  NISLTVSLTVGTPPQ-NVTMVIDTGSELSWLHC-NTSQNSSSSSSTFNPVWSSSYSPIPC 127

Query: 81  QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C           SC S Q C+ T  YAD+S ++G LAT+    G+S     NVVFG
Sbjct: 128 SSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG--IPNVVFG 185

Query: 136 CGHNNTGVFNENE------MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           C  +   +F+ N        GL+G+ R  LS     +SQ+G  KFSYC+    ++   + 
Sbjct: 186 CMDS---IFSSNSEEDSKNTGLMGMNRGSLSF----VSQMGFPKFSYCI----SEYDFSG 234

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS- 242
            +  G+ +      +  T L+         D+  Y V LEGI V +     KL+P   S 
Sbjct: 235 LLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAH-----KLLPIPESV 289

Query: 243 --SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN--AIKLTPYQDPRLGSQ----LCY 294
                   G   +D+G   T L    Y  L +   N  A  L  Y+D     Q    LCY
Sbjct: 290 FEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCY 349

Query: 295 KTPSMAGIAPIL---TAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAMQPID---GD 343
           + P+     P L   T  F  GA++ +         P E      + CF     D    +
Sbjct: 350 RVPTNQTRLPPLPSVTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVE 408

Query: 344 VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             + G+  Q ++++ +D     +      C
Sbjct: 409 AFVIGHLHQQNVWMEFDLKKSRIGLAEIRC 438


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 168/377 (44%), Gaps = 39/377 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYK 76
           G Y  +  +G+PP  + Y  +DTGSD++WV C  C  C +      P+  ++P SSS+  
Sbjct: 81  GLYFTRVLLGSPPK-EFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTAS 139

Query: 77  ELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS-NN 127
            +SC  ++C L        CSSQ   C YT+ Y D S T G   ++ + F    G+S  N
Sbjct: 140 LISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 199

Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
              ++VFGC  + TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL     
Sbjct: 200 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 259

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYN 241
              I          E+    +V + LV  +   +Y + L+ ISV   +L+   ++     
Sbjct: 260 GGGILVLG------EIVEEDIVYSPLVPSQ--PHYNLNLQSISVNGKSLAIDPEVFATST 311

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           + G I      +D+G     L ++ Y+     +  A+  +       G+Q    T S+ G
Sbjct: 312 NRGTI------VDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKG 365

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDLFI 357
           I P ++ +F GG  + L      +     G   V+C   Q I G  + I G+    D   
Sbjct: 366 IFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIF 425

Query: 358 GYDFDSQMVSFKPTDCT 374
            YD   Q + +   DC+
Sbjct: 426 VYDLAGQRIGWANYDCS 442


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 163/383 (42%), Gaps = 46/383 (12%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
           N+    G Y     IGTP +   Y  +DTGS   WV  + C QC  +   +     Y+P 
Sbjct: 75  NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 133

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           SS S KE+ C    C       C+    C Y  GYAD  LT G+L T+ + +      G 
Sbjct: 134 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLV 179
           +     +V FGCG   +G  N + +   G++G G +  +  SQ L+  G  K  FS+CL 
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQ-LAAAGKTKKIFSHCL- 249

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
                 S      F  G  V     V T+ + K ++ Y+ V L+ I   N++ ++  +P 
Sbjct: 250 -----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP- 298

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
            N  G       FID+G+    LP+  Y+ L   V        + D  +G+   ++    
Sbjct: 299 ANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHF 353

Query: 300 AGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFA 351
            G      P +T HF+    +  ++   ++       +CF  Q   I G  D+ I G+  
Sbjct: 354 LGSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMV 412

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
            S+  + YD + Q + +   +C+
Sbjct: 413 ISNKVVVYDMEKQAIGWTEHNCS 435


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/407 (25%), Positives = 168/407 (41%), Gaps = 59/407 (14%)

Query: 13  VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKP---I 66
           + S   T  G+Y ++F +GTP  P L +    DTGSDL WV+C  P     +        
Sbjct: 83  LTSGAYTGIGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCRRPAANSSESGSGSGRA 139

Query: 67  YNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF 122
           + P  S ++  +SC S+ C         +C +    C Y Y Y D S  +G + TE  T 
Sbjct: 140 FRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATI 199

Query: 123 GNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
             S             +V GC  + TG   E   G++ LG + +S AS   S+  A +FS
Sbjct: 200 ALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRF-AGRFS 258

Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS---------------------LVSKED 214
           YCLV   +  + TS + FG    V+     S+                      L+ +  
Sbjct: 259 YCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRM 318

Query: 215 KTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
           + +Y V ++ +SV G      + +   ++ G +      +D+G   T+L K  Y  +   
Sbjct: 319 RPFYDVAVKAVSVAGQFLKIPRAVWDVDAGGGV-----ILDSGTSLTVLAKPAYRAVVAA 373

Query: 274 VRNAIKLTPY--QDPRLGSQLCYKTPSMAG--IAPILTAHFDGGAKVPLIHTSTFIPPPV 329
           +   +   P    DP    + CY   S +G    P +  HF G A++     S ++    
Sbjct: 374 LSEGLAGLPRVTMDP---FEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKS-YVIDAA 429

Query: 330 EGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            GV C  +Q  P  G + + GN  Q +    +D  ++ + F+ + CT
Sbjct: 430 PGVKCIGLQEGPWPG-ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 475


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 169/381 (44%), Gaps = 42/381 (11%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  +GTPP    Y  VDTGSD++WV C+ C QC  +        +Y+P +SS
Sbjct: 81  TDTGLYYTEIKLGTPPK-HYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASS 139

Query: 74  SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           +   + C    C          C +   C Y+  Y D S T G   T+ + F      G 
Sbjct: 140 TGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQ 199

Query: 125 SNNFFDNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           +     +V+FGCG    G     N+   G++G G    S+ SQ+ +     K F++CL  
Sbjct: 200 TQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCL-- 257

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                +I     F  G +V    V +T LV+  DK +Y V L+ I VG    ++  +P +
Sbjct: 258 ----DTIKGGGIFSIG-DVVQPKVKTTPLVA--DKPHYNVNLKTIDVG---GTTLQLPAH 307

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP-SM 299
                  KG + ID+G   T LP+  +  +   V N  +   + D +    LC++ P S+
Sbjct: 308 IFEPGEKKGTI-IDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDVQ--GFLCFQYPGSV 364

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQS 353
               P +T HF+    + +     F     + V+C      A Q  DG D+ + G+   S
Sbjct: 365 DDGFPTITFHFEDDLALHVYPHEYFFANGND-VYCVGFQNGASQSKDGKDIVLMGDLVLS 423

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           +  + YD +++++ +   +C+
Sbjct: 424 NKLVIYDLENRVIGWTDYNCS 444


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 173/398 (43%), Gaps = 58/398 (14%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----P 65
           N+  + + T  G Y  K  +G+PP  D Y  VDTGSD++WV C+ C +C ++        
Sbjct: 57  NLGGNGLPTETGLYFTKLGLGSPPK-DYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLT 115

Query: 66  IYNPASSSSYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           +Y+P  S + + +SC  E C          C S+  C Y+  Y D S T G    + +T+
Sbjct: 116 LYDPKGSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTY 175

Query: 123 GNSNNFF------DNVVFGCGHNNTGVFN----ENEMGLVGLGRTRLSLASQILSQLGAN 172
            + N+         +++FGCG   +G  +    E   G++G G++  S+ SQ+ +     
Sbjct: 176 NHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVK 235

Query: 173 K-FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN-- 229
           K FS+CL       +I     F  G EV    V +T LV +    +Y V L+ I V    
Sbjct: 236 KIFSHCL------DNIRGGGIFAIG-EVVEPKVSTTPLVPR--MAHYNVVLKSIEVDTDI 286

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           L   S +    N  G I      ID+G     LP   Y+ L  +V         + PRL 
Sbjct: 287 LQLPSDIFDSGNGKGTI------IDSGTTLAYLPAIVYDELIPKVMA-------RQPRLK 333

Query: 290 SQL------CYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG 342
             L      C++ T ++    P++  HF+    +  ++   ++    +G++C   Q    
Sbjct: 334 LYLVEQQFSCFQYTGNVDRGFPVVKLHFEDSLSLT-VYPHDYLFQFKDGIWCIGWQKSVA 392

Query: 343 ------DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                 D+ + G+   S+  + YD ++  + +   +C+
Sbjct: 393 QTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNCS 430


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 142/335 (42%), Gaps = 42/335 (12%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            YV++  +GTP +  +   +DT +D  W  C PC  C    +  + PASSSSY  L C S
Sbjct: 78  SYVVRAGLGTP-VQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCAS 134

Query: 83  EQCHLLDTVSCSSQQ-------LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           + C L +   C + Q        C ++  +AD+S  +  L ++ +  G   +      FG
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLG--KDAIAGYAFG 191

Query: 136 C-GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C G       N  + GL+GLGR  +SL SQ  S      FSYCL  + +        YF 
Sbjct: 192 CVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYN-GVFSYCLPSYRS-------YYFS 243

Query: 195 NGSEVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNL-----SNSSKLIPYYNSSG 244
               +   G    V  T L++   + + Y+V + G+SVG       + S    P   +  
Sbjct: 244 GSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGT 303

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGI 302
            I  G +     AP        Y  L E+ R  +   P     LG+   C+ T  + AG 
Sbjct: 304 VIDSGTVITRWTAP-------VYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGG 355

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
           AP +T H DGG  + L   +T I      + C AM
Sbjct: 356 APPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM 390


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 168/386 (43%), Gaps = 58/386 (15%)

Query: 21   NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
            N    +  ++G+PP   +  ++DTGS+L W+ C    +    +  ++NP SSSSY  + C
Sbjct: 997  NVTLTVSLTVGSPPQ-QVTMVLDTGSELSWLHC----KKSPNLTSVFNPLSSSSYSPIPC 1051

Query: 81   QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
             S  C      L + V+C  ++LC+    YAD+S  +G LA++    G+S       +FG
Sbjct: 1052 SSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSA--LPGTLFG 1109

Query: 136  C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
            C   G ++    +    GL+G+ R  LS     ++QLG  KFSYC+     DSS    + 
Sbjct: 1110 CMDSGFSSNSEEDAKTTGLMGMNRGSLSF----VTQLGLPKFSYCIS--GRDSS--GVLL 1161

Query: 193  FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA- 245
            FG+      G +  T LV         D+  Y V L+GI VGN     K++P   S  A 
Sbjct: 1162 FGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGN-----KILPLPKSIFAP 1216

Query: 246  --ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCY--- 294
                 G   +D+G   T L    Y  L  +     K  L P  DP    Q    LCY   
Sbjct: 1217 DHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVA 1276

Query: 295  ---KTPSMAGIAPILT-AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIF 347
               K P++  ++ +   A    G +V L      +    E V+C      D    +  + 
Sbjct: 1277 AGGKLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGN-EWVYCLTFGNSDLLGIEAFVI 1335

Query: 348  GNFAQSDLFIGYDFDSQMVSFKPTDC 373
            G+  Q ++++ +D    +V+F    C
Sbjct: 1336 GHHHQQNVWMEFD----LVAFAADLC 1357


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/321 (28%), Positives = 152/321 (47%), Gaps = 42/321 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP+ +    +DTGSD++WV C  C  C +    Q++   ++P SSS+  
Sbjct: 23  GLYYTKVQLGTPPV-EFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSS 81

Query: 77  ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN- 131
            ++C  ++C+        +CSSQ   C+YT+ Y D S T G   ++ +     N  F+  
Sbjct: 82  MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL---NTIFEGS 138

Query: 132 --------VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLV 179
                   VVFGC +  TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL 
Sbjct: 139 VTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL- 197

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLI 237
               DSS    +  G   E+    +V TSLV  +   +Y + L+ I+V    L   S + 
Sbjct: 198 --KGDSSGGGILVLG---EIVEPNIVYTSLVPAQ--PHYNLNLQSIAVNGQTLQIDSSVF 250

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
              NS G I      +D+G     L ++ Y+     +  +I  + +     G+Q    T 
Sbjct: 251 ATSNSRGTI------VDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGNQCYLITS 304

Query: 298 SMAGIAPILTAHFDGGAKVPL 318
           S+  + P ++ +F GGA + L
Sbjct: 305 SVTEVFPQVSLNFAGGASMIL 325


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 108/409 (26%), Positives = 163/409 (39%), Gaps = 70/409 (17%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKP-IYNPASSSSYKELS 79
           +Y + FSI +   L +Y  +DTGSD++W  C P  C+ C  + +P    P + S    +S
Sbjct: 93  DYTLTFSINSQ-TLSVY--MDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLIS 149

Query: 80  CQSEQCHL--------------------LDTVSCSSQQLCNYTYGYADSSLTKGVLATER 119
           C+S  C                      ++T  CS+    ++ Y Y D SL   +     
Sbjct: 150 CKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNL 209

Query: 120 ITFGNSNNFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI--LSQLGANKFS 175
           I    SN  F   +  FGC H+  G      +G+ G G   LSL +Q+  LS    N+FS
Sbjct: 210 IMPSTSNKPFSLKDFTFGCAHSALG----EPIGVAGFGFGSLSLPAQLANLSPDLGNQFS 265

Query: 176 YCLVPFHTDSSIT---SKMYFGNGSEVSGGGV---VSTSLVSKEDKTYYF-VTLEGISVG 228
           YCLV    DS+     S +  G   E     +   V T ++      Y++ V++E ISVG
Sbjct: 266 YCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVG 325

Query: 229 NLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKLT 281
                S  +   N+   I +   G + +D+G   T+LP  FYN     L+ +V    K  
Sbjct: 326 -----SSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRA 380

Query: 282 PYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGAKVPLIHTSTFI-------PPPV 329
              + + G   CY         +  + P L  HF G   V L   + F            
Sbjct: 381 SETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKG 440

Query: 330 EGVFCFAM-----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             V C  +     +   G     GN+ Q    + YD + + V F P  C
Sbjct: 441 RKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 164/384 (42%), Gaps = 42/384 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYK-- 76
           +G Y  +  +G P     Y + +DTGSDL W+QC  PC  C K    +Y P   +  +  
Sbjct: 195 DGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNLVRSS 254

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATER--ITFGNSNNFFDNVVF 134
           E  C   Q + L T  C S   C+Y   YAD S + GVL  ++  +   N +    ++VF
Sbjct: 255 EPFCVEVQRNQL-TEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 313

Query: 135 GCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
           GCG++  G+     +   G++GL R ++SL SQ+ S+ + +N   +CL      S +  +
Sbjct: 314 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA-----SDLNGE 368

Query: 191 MYFGNGSE-VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
            Y   GS+ V   G+    ++       Y + +  +S GN      ++     +G +  G
Sbjct: 369 GYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGN-----AMLSLDGENGRV--G 421

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTA 308
            +  DTG+  T  P   Y++L   ++    L   +D    +  +C++  + + I+ +   
Sbjct: 422 KVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDV 481

Query: 309 H-------FDGGAKVPLIHTSTFIPPPV------EGVFCFAM----QPIDGDVGIFGNFA 351
                      G+K  +I     I P        +G  C  +       DG   I G+ +
Sbjct: 482 KKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDIS 541

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
                I YD   Q + +  +DC +
Sbjct: 542 MRGRLIVYDNVKQRIGWMKSDCVR 565


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 99/390 (25%), Positives = 169/390 (43%), Gaps = 57/390 (14%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  K  +G+P   D Y  VDTGSD++WV C+ C +C ++        +Y+P  S 
Sbjct: 64  TVTGLYFTKIGLGSPSK-DYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSK 122

Query: 74  SYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN 126
           + + +SC+   C   +    + C ++  C Y+  Y D S T G    + +TF    GN +
Sbjct: 123 TSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPH 182

Query: 127 NFFDN--VVFGCGHNNTGVF----NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLV 179
               N  ++FGCG   +G F     E   G++G G+   S+ SQ+ +     K FS+CL 
Sbjct: 183 TATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL- 241

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLI 237
               D+++   ++  +  EV    V +T LV   +  +Y V L+ I V    L   S   
Sbjct: 242 ----DTNVGGGIF--SIGEVVEPKVKTTPLV--PNMAHYNVILKNIEVDGDILQLPSDTF 293

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL----- 292
              N  G +      ID+G     LP+  Y++L  +V         + PRL   L     
Sbjct: 294 DSENGKGTV------IDSGTTLAYLPRIVYDQLMSKVLA-------KQPRLKVYLVEEQY 340

Query: 293 -CYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG------DV 344
            C++ T ++    PI+  HF+    + +           +  +C   Q          D+
Sbjct: 341 SCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDM 400

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            + G+F  S+  + YD ++  + +   +C+
Sbjct: 401 TLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430


>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
          Length = 449

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 155/369 (42%), Gaps = 48/369 (13%)

Query: 39  YGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTV------- 91
           Y ++DTGS L+W QC  C  C+    P Y  + S +++E+SC  +  +  +         
Sbjct: 96  YLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSCGDDDDNDKEEAIASYCPA 155

Query: 92  ------------SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-----FDNVVF 134
                        C  + L N T         +G ++ +   F +   F     F  +VF
Sbjct: 156 KPPGYITLCVNGRCMFKALYNLT---GQGETVQGYMSMDTFHFIDDRRFDYQAKF-RMVF 211

Query: 135 GCGHNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT--SK 190
           GC H    V    +   G++GLG    S     L Q G  KFSYC+ P     S    S 
Sbjct: 212 GCAHQENIVLTAVKECTGILGLGMGDASF----LRQTGITKFSYCVPPRMPGYSYRRHSW 267

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG+ +++SG  V    LV +  K Y  +T    +   L +   +I Y +    +   +
Sbjct: 268 LRFGSHAQISGKKV---PLVMRWGKYYLPLTAITYTYNELMSPVPIIAYKSQEDYL---H 321

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCYKTPSMAGIAPI-LTA 308
           M +DTG     LP   ++ L +++   IK     +      + CYK  +M  +  I +T 
Sbjct: 322 MMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKHCYKR-TMDEVKDITVTL 380

Query: 309 HFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQM 365
            FDGG  + L  ++ FI          C A+  + D    I G FAQ+++ +GYD  S+ 
Sbjct: 381 SFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMFAQTNINVGYDLLSRE 440

Query: 366 VSFKPTDCT 374
           ++  P  C 
Sbjct: 441 IAMDPIRCA 449


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 97/384 (25%), Positives = 163/384 (42%), Gaps = 61/384 (15%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYNPASSSSYKELSC 80
           ++   IGTPP      ++DTGS L W+QC    LP      + K  ++P+ SSS+  L C
Sbjct: 73  IISLPIGTPPQAQQM-VLDTGSQLSWIQCHRKKLP-----PKPKTSFDPSLSSSFSTLPC 126

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
               C           SC S +LC+Y+Y YAD +  +G L  E+ITF N+      ++ G
Sbjct: 127 SHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE-ITPPLILG 185

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT--SKMYF 193
           C   ++     ++ G++G+ R RLS     +SQ   +KFSYC+ P       T     Y 
Sbjct: 186 CATESS-----DDRGILGMNRGRLSF----VSQAKISKFSYCIPPKSNRPGFTPTGSFYL 236

Query: 194 GNGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           G+     G   VS      +  +   D   Y V + GI  G        +   N SG++ 
Sbjct: 237 GDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFG--------LKKLNISGSVF 288

Query: 248 K------GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYK---- 295
           +      G   +D+G+  T L    Y+++  ++   +     +    G  + +C+     
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVA 348

Query: 296 -TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFA 351
             P + G    L   F  G ++ L+     +     G+ C  +     +     I GN  
Sbjct: 349 MIPRLIGD---LVFVFTRGVEI-LVPKERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVH 404

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
           Q +L++ +D  ++ V F   DC++
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADCSR 428


>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 556

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 94/387 (24%), Positives = 166/387 (42%), Gaps = 44/387 (11%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKP-- 65
           P +++Q N    N  ++M   +GTPP+ ++   VDTG+ L +VQC PC ++C+KQ     
Sbjct: 192 PIDLIQ-NGDINNFLFLMPIKLGTPPVWNLVA-VDTGATLSFVQCEPCTLRCHKQTDAGE 249

Query: 66  IYNPASSSSYKELSCQSEQC-------HLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLAT 117
           I++P+ S S+  + C   +C       HL        +  C Y+  +   SS + G L  
Sbjct: 250 IFDPSKSESFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVR 309

Query: 118 ERITFGN--SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
           +R+  G       F + +FGC  +    +++ E GLVG      S   Q+   +    FS
Sbjct: 310 DRLAIGKYAKGYSFPDFLFGCSLDTE--YHQYEAGLVGFADEPFSFFEQVAPLVNYKAFS 367

Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSK 235
           YC   F +D   T  +  G+ + V+      T L     ++ Y + L+ +    L N   
Sbjct: 368 YC---FPSDRRKTGYLSIGDYTRVNS---TYTPLFLARQQSRYALKLDEV----LVNGMA 417

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLC 293
           L+         +   M +D+G+  T+L  D + +L+  +  A++   Y     R    +C
Sbjct: 418 LV--------TTPSEMIVDSGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSDYIC 469

Query: 294 YKTPSMAGIA-----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGI 346
           ++       +     P++   FD G K+ L   S+F      G+  + M+   +   V +
Sbjct: 470 FEDAHFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQL 529

Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            GN     + I +D       F+  DC
Sbjct: 530 LGNTMTRSVGITFDIQGGQFGFRKGDC 556


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 166/386 (43%), Gaps = 53/386 (13%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  + GTP L +I  ++DTGS+L W+ C    +       I+NP +S +Y ++ C
Sbjct: 64  NVTLTVSLTAGTP-LQNITMVLDTGSELSWLHC----KKEPNFNSIFNPLASKTYTKIPC 118

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C      L   VSC   +LC++   YAD+S  +G LA E    G+        VFG
Sbjct: 119 SSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTG--PATVFG 176

Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C   G ++    +    GL+G+ R  LS     ++Q+G  KFSYC+    +D   +  + 
Sbjct: 177 CMDSGFSSNSEEDAKTTGLMGMNRGSLSF----VNQMGFRKFSYCI----SDRDSSGVLL 228

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNS---SKLIPYYNSS 243
            G  S      +  T LV         D+  Y V LEGI V +   S   S  +P  + +
Sbjct: 229 LGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVP--DHT 286

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKT- 296
           GA   G   +D+G   T L    Y+ L+++     K  L    +PR   Q    LCY   
Sbjct: 287 GA---GQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIE 343

Query: 297 PSMAGI--APILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPIDG---DVGIF 347
           P+ A +   P++   F G    V        +P  V G   V+CF     D    +  + 
Sbjct: 344 PTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVI 403

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G+  Q ++++ YD +   + F    C
Sbjct: 404 GHHQQQNVWMEYDLEKSRIGFAEVRC 429


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 123/435 (28%), Positives = 181/435 (41%), Gaps = 80/435 (18%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQCYKQVK 64
           +NV++      +G Y+M  SIGTPP ++ +Y  +DTGSDL WV C      C  C +   
Sbjct: 8   DNVIEPLREIRDG-YLMSLSIGTPPQVVQVY--MDTGSDLTWVPCGNLSFDCQDCEEYQN 64

Query: 65  PIYNPA--------SSSSYKELS-----------------CQSEQCHLLDTVSCSSQQLC 99
            I  P         SS+S ++                   C    C L   V  +  + C
Sbjct: 65  NISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPC 124

Query: 100 ---NYTYGYA---DSSLTKGVLATE--RITFGNSNNFFDNVVFGCGHNNTGVFNENEMGL 151
               YTYG +     SLT+ VL T        N+N       FGC     G      +G+
Sbjct: 125 PSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGC----VGATYREPIGI 180

Query: 152 VGLGRTRLSLASQI-LSQLGANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTS 208
            G GR  LSL  Q+  S  G   FS+C +PF    + + +S +  GN +  S    +  +
Sbjct: 181 AGFGRGLLSLPFQLGFSHKG---FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFT 237

Query: 209 --LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTGAPPTLLPK 264
             L S     YY++ LE I++GN  N+ +    +      +KGN  M ID+G   T LP+
Sbjct: 238 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 297

Query: 265 DFYNRLEEQVRNAIKLTPYQDPRL--GSQLCYKTPSMAGIA--------PILTAHFDGGA 314
             Y++L   +   I     +   L  G  LCYK P     +        P +T HF    
Sbjct: 298 PLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNV 357

Query: 315 KVPLIHTSTF--IPPPVEG--VFCFAMQPIDGD-----------VGIFGNFAQSDLFIGY 359
            V L   + F  +  P+    V C   Q +DG             GIFG+F Q ++ + Y
Sbjct: 358 SVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVY 417

Query: 360 DFDSQMVSFKPTDCT 374
           D + + + F+P DC 
Sbjct: 418 DLEKERLGFQPMDCV 432


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 110/397 (27%), Positives = 173/397 (43%), Gaps = 64/397 (16%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK----PIYNPASSS 73
           G Y +  + GTPP    + ++DTGS L+W  C     C +C +  +K    P + P  SS
Sbjct: 81  GGYSISLNFGTPPQTTKF-VMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSS 139

Query: 74  SYKELSCQSEQCHLL------------DTVSCSSQQLCN-YTYGYADSSLTKGVLATERI 120
           S K + C++ +C ++            D+ + +  Q C  Y   Y   S T G+L +E +
Sbjct: 140 SSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETL 198

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFN-ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            F N     D +V GC      +F+ +   G+ G GR+  SL SQ    LG  KFSYCLV
Sbjct: 199 DFPNKKTIPDFLV-GCS-----IFSIKQPEGIAGFGRSPESLPSQ----LGLKKFSYCLV 248

Query: 180 PFHTDSSITSK---MYFGNGSEVSGGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSN 232
               D + TS    +  G+GS V+    +S +   K   T    YY+V L  I +G   +
Sbjct: 249 SHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIG---D 305

Query: 233 SSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPR 287
           +   +PY +   G    G   +D+G   T +    Y       E+Q+ +    T  Q+  
Sbjct: 306 THVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQN-L 364

Query: 288 LGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG- 345
            G + CY       ++ P L   F GGAK+ L   S +      GV C  +  +  +V  
Sbjct: 365 TGLRPCYNISGEKSLSVPDLIFQFKGGAKMAL-PLSNYFSIVDSGVICLTI--VSDNVAG 421

Query: 346 ---------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                    I GN+ Q + ++ +D +++   FK   C
Sbjct: 422 PGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 96/405 (23%), Positives = 170/405 (41%), Gaps = 59/405 (14%)

Query: 4   ATYFYPNNV------VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC- 56
           ++  +P NV      V  N     G++ M  S+GTPP+ ++   VDTGS L WV C  C 
Sbjct: 49  SSLIHPTNVPAEPSPVVGNHEIHEGKFFMDISLGTPPVANLV-TVDTGSTLSWVVCQRCQ 107

Query: 57  VQCYK---QVKPIYNPASSSSYKELSCQSEQC-----HLLDTVSCSSQ-QLCNYTYGYAD 107
           + C+    +   +++P  S++Y+ + C S  C      L+    C  +   C Y+  Y  
Sbjct: 108 ISCHTTAPEAGSVFDPDKSTTYELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGS 167

Query: 108 S---SLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQ 164
                 + G L T+++T  +S++  D  +FGC  +++  F   E G++G G    S  +Q
Sbjct: 168 GPSGQYSAGRLGTDKLTLASSSSIIDGFIFGCSGDDS--FKGYESGVIGFGGANFSFFNQ 225

Query: 165 ILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLE 223
           +  Q     FSYC    HT     S   +          +V T+L+    D++ Y  +L+
Sbjct: 226 VARQTNYRAFSYCFPGDHTAEGFLSIGAYPKDE------LVYTNLIPHFGDRSVY--SLQ 277

Query: 224 GISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY 283
            I +    N  ++          +K  M +D+G   T L    ++   + + +A++   +
Sbjct: 278 QIDMMVDGNRLQV-----DQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGF 332

Query: 284 QDPRLGSQLCYK----TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPP--------PVEG 331
               +G++ C++        +G  P +   F        I T+  +PP        P   
Sbjct: 333 LSDTVGTETCFRPNGGDSVDSGDLPTVEMRF--------IGTTLKLPPENVFHDLLPSHD 384

Query: 332 VFCFAMQP-IDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             C A +P + G  +V I GN A     + YD  +    F+   C
Sbjct: 385 KICLAFKPDVAGVRNVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 82/288 (28%), Positives = 127/288 (44%), Gaps = 22/288 (7%)

Query: 94  SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
           S+  +CNY   Y D S T+G L  E++ FG       + +FGCG NN G+F     GL+G
Sbjct: 128 SAAPICNYAINYGDGSFTRGELGHEKLKFGTI--LVKDFIFGCGRNNKGLFG-GVSGLMG 184

Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE 213
           LGR+ LSL SQ     G   FSYCL P        S +  GN S       +S + + + 
Sbjct: 185 LGRSDLSLISQTSGIFGG-VFSYCL-PSTERKGSGSLILGGNSSVYRNSSPISYAKMIEN 242

Query: 214 DKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
            + Y  YF+ L GIS+G ++  +          ++    + +D+G   T LP   Y  L+
Sbjct: 243 PQLYNFYFINLTGISIGGVALQAP---------SVGPSRILVDSGTVITRLPPTIYKALK 293

Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTST--FIPPP 328
            +        P          C+   +   +  P +  HF+G A++ +  T    F+   
Sbjct: 294 AEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD 353

Query: 329 VEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
              V C A+  ++   +V I GN+ Q +L + YD     V F    C+
Sbjct: 354 ASQV-CLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 160/379 (42%), Gaps = 44/379 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  +IG+PP  ++  ++DTGS+L W+ C    +    +   +NP  SSSY    C
Sbjct: 56  NVTLTISLTIGSPPQ-NVTMVLDTGSELSWLHC----KKLPNLNSTFNPLLSSSYTPTPC 110

Query: 81  QSEQC-----HLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S  C      L    SC  + +LC+    YAD+S  +G LA E  TF  +       +F
Sbjct: 111 NSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAE--TFSLAGAAQPGTLF 168

Query: 135 GCGHNN--TGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           GC  +   T   NE+    GL+G+ R  LSL +Q++      KFSYC+    +       
Sbjct: 169 GCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMV----LPKFSYCI----SGEDAFGV 220

Query: 191 MYFGNGSEVSG-----GGVVSTSLVSKEDKTYYFVTLEGISVG-NLSNSSKLIPYYNSSG 244
           +  G+G            V +T+     D+  Y V LEGI V   L    K +   + +G
Sbjct: 221 LLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 280

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTPS 298
           A   G   +D+G   T L    YN L+++     K  LT  +DP         LCY  P+
Sbjct: 281 A---GQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA 337

Query: 299 MAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSD 354
                P +T  F G   +V        +    + V+CF     D    +  + G+  Q +
Sbjct: 338 SLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQN 397

Query: 355 LFIGYDFDSQMVSFKPTDC 373
           +++ +D     V F  T C
Sbjct: 398 VWMEFDLVKSRVGFTETTC 416


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/373 (25%), Positives = 161/373 (43%), Gaps = 43/373 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTP   +   IVD+GS + +V C  C QC     P + P  SS+Y  + C
Sbjct: 88  NGYYTTRLYIGTPSQ-EFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKC 146

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D    + +  C Y   YA+ S + GVL  + ++FG  +       VFGC + 
Sbjct: 147 N------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 200

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ + + ++ FS C             M  G G+
Sbjct: 201 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY----------GGMDVGGGT 250

Query: 198 EVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISKGNMF 252
            V GG      +V          YY + L+ I V     + +L P  +N     SK    
Sbjct: 251 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVA--GKALRLDPKIFN-----SKHGTV 303

Query: 253 IDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
           +D+G     LP+  +   ++ V    N++K     DP     +C+         ++ + P
Sbjct: 304 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNY-KDICFAGAGRNVSQLSEVFP 362

Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
            +   F  G K+ L      F    VEG +C  + Q       + G     +  + YD  
Sbjct: 363 DVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRH 422

Query: 363 SQMVSFKPTDCTK 375
           ++ + F  T+C++
Sbjct: 423 NEKIGFWKTNCSE 435


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 111/410 (27%), Positives = 174/410 (42%), Gaps = 70/410 (17%)

Query: 23  EYVMKFSIGTPPLLD-IYGIVDTGSDLMWVQCLP--CVQCYKQ--VKPIYN--------- 68
           +Y + F++G       I   +DTGSDL+W  C P  C+ C  +    P  N         
Sbjct: 47  DYTLSFNLGPRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNASPPVNTTRSVAVSC 106

Query: 69  --PASSSSYKELS----CQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
             PA S+++   S    C + +C L  ++T  C++ +   + Y Y D SL   +    R 
Sbjct: 107 KSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL---IARLYRD 163

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGANKFSYC 177
           T   S+ F  N  FGC +           G+ G GR  LSL +Q+ +   QLG N+FSYC
Sbjct: 164 TLSLSSLFLRNFTFGCAYTTLA----EPTGVAGFGRGLLSLPAQLATLSPQLG-NRFSYC 218

Query: 178 LVPFHTDSSITSK---MYFGNGSEVS-----GGGV---VSTSLVSKEDKTYYF-VTLEGI 225
           LV    DS    K   +  G   E       GGGV   V T ++      Y++ V L GI
Sbjct: 219 LVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGI 278

Query: 226 SVG-NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
           SVG  +  + +++   N+ G    G + +D+G   T+LP  FYN + ++    +     +
Sbjct: 279 SVGKRIVPAPEMLRRVNNRG---DGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNER 335

Query: 285 ----DPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--------- 331
               + + G   CY   S+A + P+LT  F GG    ++    +    ++G         
Sbjct: 336 ARKIEEKTGLAPCYYLNSVAEV-PVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394

Query: 332 VFCFAMQ------PIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           V C  +        + G  G   GN+ Q    + YD + + V F    C 
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 444


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 133/348 (38%), Gaps = 30/348 (8%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CS-S 95
           ++DT SD+ WVQC PC    C+ Q   +Y+P+ SSS     C S  C  L   +  C+ +
Sbjct: 159 VIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218

Query: 96  QQLCNYTYGYADSSLTKGVLATERITFGNSN--NFFDNVVFGCGHN--NTGVFNENEMGL 151
              C Y   Y D S + G   ++ +T   +   +      FGC H     G F+    G+
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSNKTSGI 278

Query: 152 VGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVS 211
           + LGR   SL +Q  +  G + FSYCL P    S        G     +    V+  L S
Sbjct: 279 MALGRGAQSLPTQTKATYG-DVFSYCLPPTPVHSGF---FILGVPRVAASRYAVTPMLRS 334

Query: 212 KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
           K     Y V L  I V        + P   ++GA+      +      T LP   Y  L 
Sbjct: 335 KAAPMLYLVRLIAIEVAG--KRLPVPPAVFAAGAVMDSRTIV------TRLPPTAYMALR 386

Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFI 325
                 ++      P+     CY     A         P +T  FDG      +  S  +
Sbjct: 387 AAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVL 446

Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              ++G   FA    D   GI GN  Q  L + Y+ D   V F+   C
Sbjct: 447 ---LDGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 171/381 (44%), Gaps = 42/381 (11%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  IGTPP    +  VDTGSD++WV C+ C +C ++        +Y+P  SS
Sbjct: 78  TDTGLYYTEIEIGTPPK-QYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSS 136

Query: 74  SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           S   +SC  + C          C+    C Y+  Y D S T G   ++ + +      G 
Sbjct: 137 SGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQ 196

Query: 125 SNNFFDNVVFGCGHN---NTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           + +   +V+FGCG     + G  N+   G++G G++  S+ SQ+ +     K FS+CL  
Sbjct: 197 TRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL-- 254

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                +I     F  G +V    V ST LV   D  +Y V LE I+VG    ++  +P +
Sbjct: 255 ----DTIKGGGIFAIG-DVVQPKVKSTPLV--PDMPHYNVNLESINVG---GTTLQLPSH 304

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-PSM 299
                  KG + ID+G   T LP+  Y  +   V      T +   +    LC +   S+
Sbjct: 305 MFETGEKKGTI-IDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQ--DFLCIQYFQSV 361

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQS 353
               P +T HF+    + +     F     + ++CF      +Q  DG D+ + G+   S
Sbjct: 362 DDGFPKITFHFEDDLGLNVYPHDYFFQNG-DNLYCFGFQNGGLQSKDGKDMVLLGDLVLS 420

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           +  + YD ++Q+V +   +C+
Sbjct: 421 NKVVVYDLENQVVGWTDYNCS 441


>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
          Length = 392

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 66/209 (31%), Positives = 107/209 (51%), Gaps = 20/209 (9%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV  F+IGTPP      ++D   +L+W QC  C +C++Q  P+++P +S++Y+   C + 
Sbjct: 51  YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 84  QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
            C  +  D+ +CS   +C Y     ++  T G + T+    G +     ++ FGC   + 
Sbjct: 110 LCESIPSDSRNCSG-NVCAY-QASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
                   G+VGLGRT  SL    ++Q G   FSYCL P   D+   S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGRNSALFLGSSAKLAG 218

Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEG 224
           GG   ST  V+      +   YY V LEG
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEG 247


>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
          Length = 530

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 169/371 (45%), Gaps = 54/371 (14%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPASSSSYKELS 79
           IGTP +  +  + D GSDL+W+ C  C+QC            +    Y+P+ SS+ K LS
Sbjct: 106 IGTPNISFLVAL-DAGSDLLWIPC-DCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 163

Query: 80  CQSEQCHLLDTVSCSS-QQLCNYTYGY-ADSSLTKGVLA------TERITFGNSNNFFDN 131
           C  + C    + +C S +QLC YT  Y ++++ + G+L       T  I   ++++    
Sbjct: 164 CSHQLCE--SSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAP 221

Query: 132 VVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSI 187
           V+ GCG   TG + +     GL+GLG   +S+ S  LS+ G   N FS C   F+ D S 
Sbjct: 222 VIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPS-FLSKAGLVKNSFSLC---FNDDDS- 276

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             +++FG+     G     T+L    D  Y  Y V +E   +G+            S   
Sbjct: 277 -GRIFFGD----QGLATQQTTLFLPSDGKYETYIVGVEACCIGS------------SCIK 319

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
            +     +D+GA  T LP + Y  + ++    +  T +       + CYK+ S   +  P
Sbjct: 320 QTSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNP 379

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            +   F       ++H   F+    +GV  FC A+QP DGD+GI G    +   + +D +
Sbjct: 380 SVILKFALNNSF-VVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRE 438

Query: 363 SQMVSFKPTDC 373
           +  + +  ++C
Sbjct: 439 NLKLGWSRSNC 449


>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 511

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 170/371 (45%), Gaps = 54/371 (14%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPASSSSYKELS 79
           IGTP +  +  + D GSDL+W+ C  C+QC            +    Y+P+ SS+ K LS
Sbjct: 87  IGTPNISFLVAL-DAGSDLLWIPC-DCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 144

Query: 80  CQSEQCHLLDTVSCSS-QQLCNYTYGY-ADSSLTKGVLA------TERITFGNSNNFFDN 131
           C  + C    + +C S +QLC YT  Y ++++ + G+L       T  I   ++++    
Sbjct: 145 CSHQLCE--SSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAP 202

Query: 132 VVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSI 187
           V+ GCG   TG + +     GL+GLG   +S+ S  LS+ G   N FS C   F+ D S 
Sbjct: 203 VIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPS-FLSKAGLVKNSFSLC---FNDDDS- 257

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGA 245
             +++FG+     G     T+L    D  Y  Y V +E   +G  S+  K   +      
Sbjct: 258 -GRIFFGD----QGLATQQTTLFLPSDGKYETYIVGVEACCIG--SSCIKQTSF------ 304

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
                  +D+GA  T LP + Y  + ++    +  T +       + CYK+ S   +  P
Sbjct: 305 ----RALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNP 360

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
            +   F       ++H   F+    +GV  FC A+QP DGD+GI G    +   + +D +
Sbjct: 361 SVILKFALNNSF-VVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRE 419

Query: 363 SQMVSFKPTDC 373
           +  + +  ++C
Sbjct: 420 NLKLGWSRSNC 430


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 102/394 (25%), Positives = 164/394 (41%), Gaps = 81/394 (20%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYNPASSSSYKELSC 80
           ++   IGTPP      ++DTGS L W+QC    LP      + K  ++P+ SSS+  L C
Sbjct: 73  IISLPIGTPPQAQQM-VLDTGSQLSWIQCHRKKLP-----PKPKTSFDPSLSSSFSTLPC 126

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
               C           SC S +LC+Y+Y YAD +  +G L  E+ITF N+      ++ G
Sbjct: 127 SHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNT-EITPPLILG 185

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI--TSKMYF 193
           C   ++     ++ G++G+ R RLS     +SQ   +KFSYC+ P         T   Y 
Sbjct: 186 CATESS-----DDRGILGMNRGRLSF----VSQAKISKFSYCIPPKSNRPGFTPTGSFYL 236

Query: 194 GNGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           G+     G   VS      +  +   D   Y V + GI  G        +   N SG++ 
Sbjct: 237 GDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFG--------LKKLNISGSVF 288

Query: 248 K------GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           +      G   +D+G+  T L    Y+++  ++            R+G +L  K     G
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMT----------RVGRRL-KKGYVYGG 337

Query: 302 IAPILTAHFDGG-AKVPLI----------HTSTFIPPPV------EGVFCFAM---QPID 341
            A +    FDG  A +P +              F+P          G+ C  +     + 
Sbjct: 338 TADMC---FDGNVAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLG 394

Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
               I GN  Q +L++ +D  ++ V F   DC++
Sbjct: 395 AASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSR 428


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 99/373 (26%), Positives = 160/373 (42%), Gaps = 45/373 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G Y  +  IGTPP      IVDTGS L +V C  C QC K   P + P  SS+Y+ L C 
Sbjct: 90  GYYTTRIWIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCS 148

Query: 82  SEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
            E        +C S+ + C Y   YA+ S + GVL  + ++FG  +       VFGC + 
Sbjct: 149 ME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENV 201

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLGR  LS+  Q++ + +  N FS C             M  G G+
Sbjct: 202 ETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY----------GGMDVGGGA 251

Query: 198 EVSGG-----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
            V GG     G+V T         YY + L+ I +     + K +P  N      K    
Sbjct: 252 MVLGGISPPAGMVFTH-SDPARSAYYNIDLKEIHI-----AGKQLP-INPMVFDGKYGTI 304

Query: 253 IDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYK-----TPSMAGIAP 304
           +D+G     LP+  +   ++ +    N++KL    D R  + +C+         ++   P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFP 363

Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
            +   F  G ++ L      F      G +C  + Q  +    + G     +  + YD +
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDRE 423

Query: 363 SQMVSFKPTDCTK 375
              + F  T+C++
Sbjct: 424 HLKIGFWKTNCSE 436


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 99/373 (26%), Positives = 160/373 (42%), Gaps = 45/373 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G Y  +  IGTPP      IVDTGS L +V C  C QC K   P + P  SS+Y+ L C 
Sbjct: 90  GYYTTRIWIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCS 148

Query: 82  SEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
            E        +C S+ + C Y   YA+ S + GVL  + ++FG  +       VFGC + 
Sbjct: 149 ME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENV 201

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLGR  LS+  Q++ + +  N FS C             M  G G+
Sbjct: 202 ETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY----------GGMDVGGGA 251

Query: 198 EVSGG-----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
            V GG     G+V T         YY + L+ I +     + K +P  N      K    
Sbjct: 252 MVLGGISPPAGMVFTH-SDPARSAYYNIDLKEIHI-----AGKQLP-INPMVFDGKYGTI 304

Query: 253 IDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYK-----TPSMAGIAP 304
           +D+G     LP+  +   ++ +    N++KL    D R  + +C+         ++   P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFP 363

Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
            +   F  G ++ L      F      G +C  + Q  +    + G     +  + YD +
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDRE 423

Query: 363 SQMVSFKPTDCTK 375
              + F  T+C++
Sbjct: 424 HLKIGFWKTNCSE 436


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 160/401 (39%), Gaps = 61/401 (15%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-KPIYNPAS 71
           + S   T  G+Y ++F +GTP    +  + DTGSDL WV+C           + ++  A+
Sbjct: 101 LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCSGAGDGTGDAPRRVFRAAA 159

Query: 72  SSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITF----- 122
           S S+  ++C S+ C         +CSS    C Y Y Y D S  +GV+ T+  T      
Sbjct: 160 SRSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGS 219

Query: 123 -----GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
                G        VV GC  +  G   ++  G++ LG + +S AS+  ++ G  +FSYC
Sbjct: 220 ESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GRFSYC 278

Query: 178 LVPFHTDSSITSKMYFG-NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
           LV      + TS + FG  G E  GG   S+S  S   +T   +              ++
Sbjct: 279 LVDHLAPRNATSYLTFGPPGPE--GGAAASSSSSSAAARTPLLL------------DRRM 324

Query: 237 IPYYNSSGAISK------------------GNMFIDTGAPPTLLPKDFYNRLEEQVRNAI 278
            P+Y  +                       G   +D+G   T+L    Y  +   +   +
Sbjct: 325 SPFYAVAVDAVHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERL 384

Query: 279 KLTPY--QDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA 336
              P    DP    + CY   + A   P L   F G A++      +++     GV C  
Sbjct: 385 AGLPRVSMDP---FEYCYNWTAAALEIPGLEVRFAGSARL-QPPAKSYVVDAAPGVKCIG 440

Query: 337 MQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           +Q  +G    V + GN  Q D    +D   + + FK T C 
Sbjct: 441 VQ--EGAWPGVSVIGNILQQDHLWEFDLRDRWLRFKHTRCA 479


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 159/390 (40%), Gaps = 57/390 (14%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
           S++ G    +F +   P  +I  +VDTGS++ W     C             + S +   
Sbjct: 49  SSSGGGCHYRFELTHRPKDNISAVVDTGSNIFWTTEKEC-------------SRSKTRSM 95

Query: 78  LSCQSEQCHLLDTVSCSSQQL---------CNYT--YGYADSSLTKGVLATERITFGN-- 124
           L C S +C    +  C   +L         C Y   YG   +  T GVL  +++T     
Sbjct: 96  LPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVA 155

Query: 125 -----SNNFFDNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCL 178
                 +  F+ V  GC  + T  F +  + G+ GLGR+    A+ +  QL  +KFSYCL
Sbjct: 156 SKAVPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRS----ATSLPRQLNFSKFSYCL 211

Query: 179 VPFHTDSS-----ITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSN 232
             +          +T+      G+      V +T+L    D KT YFV L+GIS+G    
Sbjct: 212 SSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGG--- 268

Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLG 289
            ++L      SG    GNMF+DTG   T L    + +L  ++   +K   Y   Q  R  
Sbjct: 269 -TRLPAVSTKSG----GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNN 323

Query: 290 SQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG 345
            Q+CY  PS A       P +  HF   A + L   S       +         I G + 
Sbjct: 324 GQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGIS 383

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           + GNF   +  +  D  ++ +SF   DC+K
Sbjct: 384 VLGNFQMQNTHMLLDTGNEKLSFVRADCSK 413


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 52/130 (40%), Positives = 72/130 (55%), Gaps = 18/130 (13%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
           IVDTGSDL WVQC PC  CY Q  P+++P+ S+SY  + C +  C   L        SC+
Sbjct: 179 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238

Query: 95  S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
           +          + C Y+  Y D S ++GVLAT+ +  G ++   D  VFGCG +N G+F 
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRGLFG 296

Query: 146 ENEMGLVGLG 155
               GL+GLG
Sbjct: 297 -GTAGLMGLG 305


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 161/365 (44%), Gaps = 30/365 (8%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNPASSSSYKELSCQ 81
           V+  ++GTP    + G+VD  S  +W QC PC      + P    + P  S+++  L C 
Sbjct: 89  VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148

Query: 82  SEQCHLLDTVSCSSQQL---------CN-YTYGYADSSL-TKGVLATERITFGNSNNFFD 130
           S+ C  +   +C              C+ Y+  Y  S+  T G LAT+  TFG +     
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA--VP 206

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY-CLVPFHTDS-SIT 188
            VVFGC   + G F     G++G+GR  LSL SQ+  Q G  KFSY  L P  TD  S  
Sbjct: 207 GVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQL--QFG--KFSYQLLAPEATDDGSAD 261

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAI 246
           S + FG+ +        ST L+S      +Y+V L G+ V    N    IP       A 
Sbjct: 262 SVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDG--NRLDAIPAGTFDLRAN 319

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGI-AP 304
             G + + +  P T L +  Y+ +   V + I L        L   LCY   SMA +  P
Sbjct: 320 GTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVP 379

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            LT  FDGGA + L   + F      G+ C  M P  G   + G   Q+   + YD D+ 
Sbjct: 380 KLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAG 438

Query: 365 MVSFK 369
            ++F+
Sbjct: 439 RLTFE 443


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 172/383 (44%), Gaps = 49/383 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVKPIY-NPASSSSYK 76
           G Y  K  +GTPP  ++Y  +DTGSD++WV C  C  C +    Q++  Y +P SSS+  
Sbjct: 75  GLYYTKVKLGTPPR-ELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSS 133

Query: 77  ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGN------SN 126
            +SC   +C         SCS +   C YT+ Y D S T G   ++ + F +      + 
Sbjct: 134 LISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTT 193

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N   +VVFGC    TG   ++E    G+ G G+  +S+ SQ+ SQ +    FS+CL    
Sbjct: 194 NSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCL---K 250

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
            D+S    +  G   E+    +V + LV  +   +Y + L+ ISV    +  +  +    
Sbjct: 251 GDNSGGGVLVLG---EIVEPNIVYSPLVPSQ--PHYNLNLQSISVNGQIVRIAPSVFATS 305

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY--KTPS 298
           N+ G I      +D+G     L ++ YN     +   I  +       G+Q CY   T S
Sbjct: 306 NNRGTI------VDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ-CYLITTSS 358

Query: 299 MAGIAPILTAHFDGGAKVPL-----IHTSTFIPPPVEG-VFCFAMQPIDGD-VGIFGNFA 351
              I P ++ +F GGA + L     +    FI    EG V+C   Q I G  + I G+  
Sbjct: 359 NVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIG---EGSVWCIGFQKISGQSITILGDLV 415

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
             D    YD   Q + +   DC+
Sbjct: 416 LKDKIFVYDLAGQRIGWANYDCS 438


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 139/350 (39%), Gaps = 46/350 (13%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
           ++DT  D+ W++C+PC   QC       Y+P  SS+Y    C S  C  L   +  C + 
Sbjct: 166 VLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANGCDAN 220

Query: 97  QLCNYTYGYA-DSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLG 155
             C Y    A DS  T G  +++ +T  NS +  +   FGC  N  G F     G++ LG
Sbjct: 221 GQCQYMVVTAGDSFTTSGTYSSDVLTI-NSGDRVEGFRFGCSQNEQGSFENQADGIMALG 279

Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED- 214
           R   SL +Q  S  G + FSYCL P       T+K +F  G  +       T+ + KE  
Sbjct: 280 RGVQSLMAQTSSTYG-DAFSYCLPPTE-----TTKGFFQIGVPIGASYRFVTTPMLKERG 333

Query: 215 ------KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
                  T Y   L  I+V    +  +L    N    +      +D+    T LP   Y 
Sbjct: 334 GASAAAATLYRALLLAITV----DGKEL----NVPAEVFAAGTVMDSRTIITRLPVTAYG 385

Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGAKVPLIHTST 323
            L    RN ++      P+     CY     + P +  IA +    FDG A V +  +  
Sbjct: 386 ALRAAFRNRMRYR-VAPPQEELDTCYDLTGVRYPRLPRIALV----FDGNAVVEMDRSGI 440

Query: 324 FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            +     G   FA    D    I GN  Q  + + +D     + F+   C
Sbjct: 441 LL----NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 159/376 (42%), Gaps = 44/376 (11%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
           N+    G Y     IGTP +   Y  +DTGS   WV  + C QC  +   +     Y+P 
Sbjct: 51  NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 109

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           SS S KE+ C    C       C+    C Y  GYAD  LT G+L T+ + +      G 
Sbjct: 110 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 167

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           +     +V FGCG   +G  N + +   G++G G +  +  SQ+ +     K FS+CL  
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 225

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                S      F  G  V     V T+ + K ++ Y+ V L+ I   N++ ++  +P  
Sbjct: 226 ----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP-A 275

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           N  G       FID+G+    LP+  Y+ L   V        + D  +G+   ++     
Sbjct: 276 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHFL 330

Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFAQ 352
           G      P +T HF+    +  ++   ++       +CF  Q   I G  D+ I G+   
Sbjct: 331 GSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVI 389

Query: 353 SDLFIGYDFDSQMVSF 368
           S+  + YD + Q + +
Sbjct: 390 SNKVVVYDMEKQAIGW 405


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 89/387 (22%), Positives = 170/387 (43%), Gaps = 65/387 (16%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G+PP  + +  VDTGSD++WV C PC +C  +        +++  +SS+ K
Sbjct: 72  GLYFTKIKLGSPPK-EYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSK 130

Query: 77  ELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
           ++ C  + C  +  + SC     C+Y   YAD S ++G    +++T         +    
Sbjct: 131 KVGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLG 190

Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
             VVFGCG + +G   +++    G++G G++  S+ SQ+ +   A + FS+CL       
Sbjct: 191 QEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL------- 243

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSKL 236
                        V GGG+ +  +V            ++ +Y V L G+ V     +  L
Sbjct: 244 -----------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDG--TALDL 290

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCY 294
            P       +  G   +D+G      PK  Y+ L E +  R  +KL   +D    +  C+
Sbjct: 291 PP-----SIMRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVED----TFQCF 341

Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP------IDGDVGIF 347
                  +A P ++  F+   K+  ++   ++    + ++CF  Q          +V + 
Sbjct: 342 SFSENVDVAFPPVSFEFEDSVKLT-VYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILL 400

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           G+   S+  + YD +++++ +   +C+
Sbjct: 401 GDLVLSNKLVVYDLENEVIGWADHNCS 427


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 162/384 (42%), Gaps = 44/384 (11%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
           N+    G Y     IGTP +   Y  +DTGS   WV  + C QC  +   +     Y+P 
Sbjct: 51  NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 109

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           SS S KE+ C    C       C+    C Y  GYAD  LT G+L T+ + +      G 
Sbjct: 110 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 167

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           +     +V FGCG   +G  N + +   G++G G +  +  SQ+ +     K FS+CL  
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 225

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                S      F  G  V     V T+ + K ++ Y+ V L+ I   N++ ++  +P  
Sbjct: 226 ----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP-A 275

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           N  G       FID+G+    LP+  Y+ L   V        + D  +G+   ++     
Sbjct: 276 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHFL 330

Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFAQ 352
           G      P +T HF+    +  ++   ++       +CF  Q   I G  D+ I G+   
Sbjct: 331 GSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVI 389

Query: 353 SDLFIGYDFDSQMVSFKPTDCTKQ 376
           S+  + YD + Q + +   +  ++
Sbjct: 390 SNKVVVYDMEKQAIGWTEHNSVEE 413


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 159/376 (42%), Gaps = 44/376 (11%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
           N+    G Y     IGTP +   Y  +DTGS   WV  + C QC  +   +     Y+P 
Sbjct: 75  NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 133

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           SS S KE+ C    C       C+    C Y  GYAD  LT G+L T+ + +      G 
Sbjct: 134 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           +     +V FGCG   +G  N + +   G++G G +  +  SQ+ +     K FS+CL  
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 249

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
                S      F  G  V     V T+ + K ++ Y+ V L+ I   N++ ++  +P  
Sbjct: 250 ----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP-A 299

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           N  G       FID+G+    LP+  Y+ L   V        + D  +G+   ++     
Sbjct: 300 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHFL 354

Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFAQ 352
           G      P +T HF+    +  ++   ++       +CF  Q   I G  D+ I G+   
Sbjct: 355 GSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVI 413

Query: 353 SDLFIGYDFDSQMVSF 368
           S+  + YD + Q + +
Sbjct: 414 SNKVVVYDMEKQAIGW 429


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 161/365 (44%), Gaps = 30/365 (8%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNPASSSSYKELSCQ 81
           V+  ++GTP    + G+VD  S  +W QC PC      + P    + P  S+++  L C 
Sbjct: 89  VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148

Query: 82  SEQCHLLDTVSCSSQQL---------CN-YTYGYADSSL-TKGVLATERITFGNSNNFFD 130
           S+ C  +   +C              C+ Y+  Y  S+  T G LAT+  TFG +     
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA--VP 206

Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY-CLVPFHTDS-SIT 188
            VVFGC   + G F     G++G+GR  LSL SQ+  Q G  KFSY  L P  TD  S  
Sbjct: 207 GVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQL--QFG--KFSYQLLAPEATDDGSAD 261

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAI 246
           S + FG+ +        ST L+S      +Y+V L G+ V    N    IP       A 
Sbjct: 262 SVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDG--NRLDAIPAGTFDLRAN 319

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGI-AP 304
             G + + +  P T L +  Y+ +   V + I L        L   LCY   SMA +  P
Sbjct: 320 GTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVP 379

Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
            LT  FDGGA + L   + F      G+ C  M P  G   + G   Q+   + YD D+ 
Sbjct: 380 KLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAG 438

Query: 365 MVSFK 369
            ++F+
Sbjct: 439 RLTFE 443


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/382 (25%), Positives = 160/382 (41%), Gaps = 39/382 (10%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNP 69
           + S      G+Y +K  +GTP   +   + DTGS+L WV+C           P   ++ P
Sbjct: 80  MSSGAYAGTGQYFVKVLVGTP-AQEFTLVADTGSELTWVKCA------GGASPPGLVFRP 132

Query: 70  ASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTK-GVLATERITF-- 122
            +S S+  + C S+ C L       +CSS    C+Y Y Y + S    GV+ T+  T   
Sbjct: 133 EASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIAL 192

Query: 123 -GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
            G       +VV GC   + G   ++  G++ LG  ++S AS+  ++ G + FSYCLV  
Sbjct: 193 PGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGS-FSYCLVDH 251

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
               + T  + FG G +V       T L       +Y V ++ + V   +       +  
Sbjct: 252 LAPRNATGYLAFGPG-QVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP 310

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCYK----T 296
            SG +      +D+G   T+L    Y  +   +   +   P  D P    + CY      
Sbjct: 311 KSGGV-----ILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFPPF--EHCYNWTAPR 363

Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQS 353
           P    I P L   F G A++     S ++     GV C  +Q  +G+   V + GN  Q 
Sbjct: 364 PGAPEI-PKLAVQFTGCARLEPPAKS-YVIDVKPGVKCIGLQ--EGEWPGVSVIGNIMQQ 419

Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
           +    +D  +  V F P+ CT+
Sbjct: 420 EHLWEFDLKNMEVRFMPSTCTR 441


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 153/363 (42%), Gaps = 30/363 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            Y+ +  +GTP    +  I D  +D  WV C  C  C     P ++P  SS+Y+ + C S
Sbjct: 82  NYIARAGLGTPAQTLLVAI-DPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTVPCGS 139

Query: 83  EQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
            QC  + + SC +     C +   YA S+  + VL  + +     NN   +  FGC    
Sbjct: 140 PQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLAL--ENNVVVSYTFGCLRVV 196

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G  +    GL+G GR  LS  SQ     G + FSYCL P +  S+ +  +  G   +  
Sbjct: 197 SG-NSVPPQGLIGFGRGPLSFLSQTKDTYG-SVFSYCL-PNYRSSNFSGTLKLGPIGQPK 253

Query: 201 GGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA---ISKGNMFIDTG 256
              + +T L+    + + Y+V + GI VG     SK++    S+ A   ++     ID G
Sbjct: 254 --RIKTTPLLYNPHRPSLYYVNMIGIRVG-----SKVVQVPQSALAFNPVTGSGTIIDAG 306

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
              T L    Y  + +  R  ++ TP   P  G   CY    +    P +T  F G   V
Sbjct: 307 TMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYN---VTVSVPTVTFMFAGAVAV 362

Query: 317 PLIHTSTFIPPPVEGVFCFAMQ--PIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
            L   +  I     GV C AM   P DG    + +  +  Q +  + +D  +  V F   
Sbjct: 363 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 422

Query: 372 DCT 374
            CT
Sbjct: 423 LCT 425


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 153/363 (42%), Gaps = 30/363 (8%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            Y+ +  +GTP    +  I D  +D  WV C  C  C     P ++P  SS+Y+ + C S
Sbjct: 101 NYIARAGLGTPAQTLLVAI-DPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTVPCGS 158

Query: 83  EQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
            QC  + + SC +     C +   YA S+  + VL  + +     NN   +  FGC    
Sbjct: 159 PQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLAL--ENNVVVSYTFGCLRVV 215

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           +G  +    GL+G GR  LS  SQ     G + FSYCL P +  S+ +  +  G   +  
Sbjct: 216 SG-NSVPPQGLIGFGRGPLSFLSQTKDTYG-SVFSYCL-PNYRSSNFSGTLKLGPIGQPK 272

Query: 201 GGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA---ISKGNMFIDTG 256
              + +T L+    + + Y+V + GI VG     SK++    S+ A   ++     ID G
Sbjct: 273 --RIKTTPLLYNPHRPSLYYVNMIGIRVG-----SKVVQVPQSALAFNPVTGSGTIIDAG 325

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
              T L    Y  + +  R  ++ TP   P  G   CY    +    P +T  F G   V
Sbjct: 326 TMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYN---VTVSVPTVTFMFAGAVAV 381

Query: 317 PLIHTSTFIPPPVEGVFCFAMQ--PIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
            L   +  I     GV C AM   P DG    + +  +  Q +  + +D  +  V F   
Sbjct: 382 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 441

Query: 372 DCT 374
            CT
Sbjct: 442 LCT 444


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/335 (27%), Positives = 147/335 (43%), Gaps = 44/335 (13%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP  +   IVD+GS + +V C  C QC     P + P  SSSY  + C
Sbjct: 86  NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC 144

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-FDNVVFGCGHN 139
                  +D    S ++ C Y   YA+ S + GVL  + ++FG  +       VFGC ++
Sbjct: 145 N------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKAQRAVFGCENS 198

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ +   N  FS C             M  G G+
Sbjct: 199 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCY----------GGMDIGGGA 248

Query: 198 EVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNM 251
            V GG    + +V S+ D     YY + L+ I V    L   S++          SK   
Sbjct: 249 MVLGGVPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFD--------SKHGT 300

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
            +D+G     LP+  +   ++ V    +++K     DP     +C+         +  + 
Sbjct: 301 VLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSY-KDICFAGARRNVSKLHEVF 359

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM 337
           P +   F  G K+ L      F    V+G +C  +
Sbjct: 360 PDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 394


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 178/384 (46%), Gaps = 51/384 (13%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
           A G Y  K  IGTP   D Y  VDTGSD+MWV C+ C +C K+        +Y+   S +
Sbjct: 94  AVGLYYAKIGIGTPAR-DYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLT 152

Query: 75  YKELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD- 130
            K +SC  + C+ ++      C +   C+YT  YAD S + G    + + +   +   + 
Sbjct: 153 GKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLET 212

Query: 131 -----NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
                +V+FGC    +G  +  E   G++G G++  S+ SQ+ S     K F++CL    
Sbjct: 213 TSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL---- 268

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
               +     F  G  V    V +T LV   ++T+Y V ++ + VG   L+  + +    
Sbjct: 269 --DGLNGGGIFAIGHIVQ-PKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFDVG 323

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYK-T 296
           +  G I      ID+G     LP+  Y++L  ++   ++ +K+    D       C++ +
Sbjct: 324 DKKGTI------IDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF----TCFQYS 373

Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPID-GDVGIFGNF 350
            S+    P +T HF+    +  +H   ++    +G++C       MQ  D  ++ + G+ 
Sbjct: 374 ESLDDGFPAVTFHFENSLYLK-VHPHEYL-FSYDGLWCIGWQNSGMQSRDRRNITLLGDL 431

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
           A S+  + YD ++Q++ +   +C+
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNCS 455


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/374 (24%), Positives = 156/374 (41%), Gaps = 45/374 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP  +   IVD+GS + +V C  C QC     P + P  SS+Y  + C
Sbjct: 85  NGYYTTRLHIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC 143

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D    S +  C Y   YA+ S + GVL  + ++FG  +       VFGC ++
Sbjct: 144 N------VDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENS 197

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ + +  + FS C             M  G G+
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----------GGMDIGGGA 247

Query: 198 EVSGGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            V G       ++          YY + L+ + V   +       +    G +      +
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTV------L 301

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----------IA 303
           D+G     LP+  +   ++ V + +   P +  R G    YK    AG          + 
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVH--PLKKIR-GPDSNYKDICFAGAGRNVSQLSEVF 358

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
           P +   F  G K+ L      F    VEG +C  + Q       + G     +  + YD 
Sbjct: 359 PKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 418

Query: 362 DSQMVSFKPTDCTK 375
            ++ + F  T+C++
Sbjct: 419 HNEKIGFWKTNCSE 432


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 164/379 (43%), Gaps = 42/379 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-----YKQVKPIYNPASSSSYK 76
           G Y  +  +G PP  D Y  +DTGSD++WV C  C  C      +     ++P SS++  
Sbjct: 81  GLYYTRVQLGNPPK-DFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTAS 139

Query: 77  ELSCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSN 126
            +SC  + C L     D+        C Y + Y D S T G    + I          ++
Sbjct: 140 LVSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTS 199

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
           N   +VVFGC  + TG   +++    G+ G G+  LS+ SQ+ S+  A K FS+CL    
Sbjct: 200 NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCL---K 256

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
            D S    +  G   E+    VV T LV  +   +Y + L+ ISV    L  S  +    
Sbjct: 257 GDDSGGGILVLG---EIVEPNVVYTPLVPSQ--PHYNLNLQSISVNGQVLPISPAVFATS 311

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-PSM 299
           +S G I      ID+G     L ++ YN     V N +  +  Q   L    CY T  S+
Sbjct: 312 SSQGTI------IDSGTTLAYLAEEAYNAFVVAVTNIVSQST-QSVVLKGNRCYVTSSSV 364

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDL 355
           + I P ++ +F GGA + L      I     G   V+C   Q I G  + I G+    D 
Sbjct: 365 SDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDK 424

Query: 356 FIGYDFDSQMVSFKPTDCT 374
              YD  +Q + +   DC+
Sbjct: 425 IFIYDLANQRIGWTNYDCS 443


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/180 (31%), Positives = 91/180 (50%), Gaps = 14/180 (7%)

Query: 9   PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVK--- 64
           P++ V  + S    ++ M  S+GTP + ++  I DTGS + WVQC  C V CY Q +   
Sbjct: 8   PDSAVIGDDSIRKNQFFMGISLGTPAVFNLVTI-DTGSTISWVQCQYCIVHCYTQDQRAG 66

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVS------CSSQQLCNYTYGYADSSLTKGVLATE 118
           P +N +SSS+Y+ + C ++ CH +             +  C Y+  YA    + G L+ +
Sbjct: 67  PTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQD 126

Query: 119 RITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
           R+T  NS +     +FGCG +N   +N +  G++G G    S  +QI      + FSYC 
Sbjct: 127 RLTLANSYS-IQKFIFGCGSDNR--YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCF 183


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 93/374 (24%), Positives = 156/374 (41%), Gaps = 45/374 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP  +   IVD+GS + +V C  C QC     P + P  SS+Y  + C
Sbjct: 85  NGYYTTRLHIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC 143

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D    S +  C Y   YA+ S + GVL  + ++FG  +       VFGC ++
Sbjct: 144 N------VDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENS 197

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG +F+++  G++GLGR +LS+  Q++ + +  + FS C             M  G G+
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----------GGMDIGGGA 247

Query: 198 EVSGGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            V G       ++          YY + L+ + V   +       +    G +      +
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTV------L 301

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----------IA 303
           D+G     LP+  +   ++ V + +   P +  R G    YK    AG          + 
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVH--PLKKIR-GPDPNYKDICFAGAGRNVSQLSEVF 358

Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
           P +   F  G K+ L      F    VEG +C  + Q       + G     +  + YD 
Sbjct: 359 PKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 418

Query: 362 DSQMVSFKPTDCTK 375
            ++ + F  T+C++
Sbjct: 419 HNEKIGFWKTNCSE 432


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 108/440 (24%), Positives = 165/440 (37%), Gaps = 99/440 (22%)

Query: 13  VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQC----------- 59
           + S   T  G+Y ++F +GTP  P L +    DTGSDL WV+C                 
Sbjct: 44  LSSGAYTGTGQYFVRFRVGTPARPFLLV---ADTGSDLTWVKCRRHAAPAPAPAPAPGYN 100

Query: 60  YKQVKP-----------------IYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QL 98
           Y    P                 ++ P  S ++  + C S+ C         +C +    
Sbjct: 101 YGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSP 160

Query: 99  CNYTYGYADSSLTKGVLATERITFGNSNNF---------FDNVVFGCGHNNTGVFNENEM 149
           C Y Y Y D S  +G + T+  T   S               VV GC  + TG       
Sbjct: 161 CAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASD 220

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG--------------- 194
           G++ LG + +S AS+  ++ G  +FSYCLV      + TS + FG               
Sbjct: 221 GVLSLGYSNVSFASRAAARFG-GRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTAC 279

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFI 253
            GS  + G   +  L+    + +Y V + G+SV G L    +L+           G   +
Sbjct: 280 AGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKG-----GGAIL 334

Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQLCYKTPS------MAGIAPI 305
           D+G   T+L    Y  +   +   +   P    DP      CY   S      +A   P 
Sbjct: 335 DSGTSLTVLVSPAYRAVVAALGKKLVGLPRVAMDP---FDYCYNWTSPLTGEDLAVAVPA 391

Query: 306 LTAHFDGGAKVPLIHTSTFIPPP-------VEGVFCFAMQPIDGD---VGIFGNFAQSDL 355
           L  HF G A++         PPP         GV C  +Q  +GD   V + GN  Q + 
Sbjct: 392 LAVHFAGSARLQ--------PPPKSYVIDAAPGVKCIGLQ--EGDWPGVSVIGNILQQEH 441

Query: 356 FIGYDFDSQMVSFKPTDCTK 375
              +D  ++ + FK + C +
Sbjct: 442 LWEFDLKNRRLRFKRSRCMQ 461


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 164/380 (43%), Gaps = 46/380 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G+PP  ++  ++DTGS+L W+ C    +    +   +NP  SSSY    C
Sbjct: 57  NVTLTVSLTVGSPPQ-NVTMVLDTGSELSWLHC----KKLPNLNSTFNPLLSSSYTPTPC 111

Query: 81  QSEQC-----HLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S  C      L    SC  + +LC+    YAD+S  +G LA E  TF  +       +F
Sbjct: 112 NSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAE--TFSLAGAAQPGTLF 169

Query: 135 GCGHNN--TGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
           GC  +   T   NE+    GL+G+ R  LSL    ++Q+   KFSYC+    +       
Sbjct: 170 GCMDSAGYTSDINEDSKTTGLMGMNRGSLSL----VTQMSLPKFSYCI----SGEDALGV 221

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYF------VTLEGISVG-NLSNSSKLIPYYNSS 243
           +  G+G++ +   +  T LV+    + YF      V LEGI V   L    K +   + +
Sbjct: 222 LLLGDGTD-APSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHT 280

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTP 297
           GA   G   +D+G   T L    Y+ L+++     K  LT  +DP         LCY  P
Sbjct: 281 GA---GQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAP 337

Query: 298 SMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQS 353
           +     P +T  F G   +V        +    + V+CF     D    +  + G+  Q 
Sbjct: 338 ASFAAVPAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQ 397

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
           ++++ +D     V F  T C
Sbjct: 398 NVWMEFDLLKSRVGFTQTTC 417


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 165/357 (46%), Gaps = 46/357 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G YV++  +GTP  L ++ ++DT  D  WV C  C  C     P ++P +SS+Y  L C 
Sbjct: 97  GNYVVRVKLGTPGQL-MFMVLDTSRDAAWVPCADCAGCSS---PTFSPNTSSTYASLQCS 152

Query: 82  SEQCHLLDTVSC----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             QC  +  +SC    ++    N TYG  DSS +  +    + + G + +   +  FGC 
Sbjct: 153 VPQCTQVRGLSCPTTGTAACFFNQTYG-GDSSFSAML---SQDSLGLAVDTLPSYSFGCV 208

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           +  +G       GL+GLGR  +SL SQ  S L +  FSYC   F        K Y+ +GS
Sbjct: 209 NAVSGS-TLPPQGLLGLGRGPMSLLSQSGS-LYSGVFSYCFPSF--------KSYYFSGS 258

Query: 198 EVSG-----GGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKG 249
              G       + +T L+    + T Y+V L G+SVG   +  + +L+ +  ++GA    
Sbjct: 259 LRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGA---- 314

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTA 308
              ID+G   T   +  Y  + ++ R  +K  P+    +G+   C+   +   IAP +T 
Sbjct: 315 GTIIDSGTVITRFVEPVYAAIRDEFRKQVK-GPFAT--IGAFDTCFAA-TNEDIAPPVTF 370

Query: 309 HFDG-GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV----GIFGNFAQSDLFIGYD 360
           HF G   K+PL   +T I      + C AM     +V     +  N  Q +L I +D
Sbjct: 371 HFTGMDLKLPL--ENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFD 425


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 100/391 (25%), Positives = 176/391 (45%), Gaps = 46/391 (11%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  +GTPP    Y  VDTGSD++WV C+ C +C ++         Y+P +SS
Sbjct: 82  TDTGLYFTEIKLGTPPK-RYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASS 140

Query: 74  SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           S   +SC    C          C++   C Y+  Y D S T G   T+ + F      G 
Sbjct: 141 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQ 200

Query: 125 SNNFFDNVVFGCGHNNTG-VFNENEM--GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           +      + FGCG    G + N N+   G++G G+   S+ SQ+ +   A K F++CL  
Sbjct: 201 TQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDT 260

Query: 181 ------FHTDSSITSKMYFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNL 230
                 F   + +  K YF         G+++  L    +    + +Y V L+ I VG  
Sbjct: 261 IKGGGIFAIGNVVQPKCYF---VFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVG-- 315

Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
             ++  +P +       KG + ID+G   T LP+  + ++ + V +  +   + +  L  
Sbjct: 316 -GTTLQLPAHVFETGEKKGTI-IDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHN--LQD 371

Query: 291 QLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-D 343
            LC++ + S+    P +T HF+    + +     F P   + ++C      A+Q  DG D
Sbjct: 372 FLCFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGND-IYCVGFQNGALQSKDGKD 430

Query: 344 VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + + G+   S+  + YD ++Q++ +   +C+
Sbjct: 431 IVLMGDLVLSNKLVVYDLENQVIGWTDYNCS 461


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 161/373 (43%), Gaps = 39/373 (10%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQS 82
           ++   IGTP       ++DTGS L W+QC P         P   ++P+ SSS+ +L C  
Sbjct: 81  ILSLPIGTPSQSQEL-VLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSH 139

Query: 83  EQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C           SC S +LC+Y+Y YAD +  +G L  E+ TF NS      ++ GC 
Sbjct: 140 PLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT-TPPLILGCA 198

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS---SITSKMYFG 194
             +T     +E G++G+   RLS     +SQ   +KFSYC +P  ++    + T   Y G
Sbjct: 199 KEST-----DEKGILGMNLGRLSF----ISQAKISKFSYC-IPTRSNRPGLASTGSFYLG 248

Query: 195 NGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
           +     G   VS      +  +   D   Y V L+GI +G    +     +   +G    
Sbjct: 249 DNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGG--S 306

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYKTPSMAGIAPI- 305
           G   +D+G+  T L    Y++++E++   +     +    GS   +C+       I  + 
Sbjct: 307 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 366

Query: 306 --LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYD 360
             L   F  G ++ L+   + +     G+ C  +     +     I GN  Q +L++ +D
Sbjct: 367 GDLVFEFGRGVEI-LVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 425

Query: 361 FDSQMVSFKPTDC 373
             ++ V F   +C
Sbjct: 426 VTNRRVGFSKAEC 438


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 177/383 (46%), Gaps = 51/383 (13%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
           A G Y  K  IGTP   D Y  VDTGSD+MWV C+ C +C K+        +Y+   S +
Sbjct: 94  AVGLYYAKIGIGTPAR-DYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLT 152

Query: 75  YKELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD- 130
            K +SC  + C+ ++      C +   C+YT  YAD S + G    + + +   +   + 
Sbjct: 153 GKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLET 212

Query: 131 -----NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
                +V+FGC    +G  +  E   G++G G++  S+ SQ+ S     K F++CL    
Sbjct: 213 TSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL---- 268

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
               +     F  G  V    V +T LV   ++T+Y V ++ + VG   L+  + +    
Sbjct: 269 --DGLNGGGIFAIGHIVQ-PKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFDVG 323

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYK-T 296
           +  G I      ID+G     LP+  Y++L  ++   ++ +K+    D       C++ +
Sbjct: 324 DKKGTI------IDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF----TCFQYS 373

Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPID-GDVGIFGNF 350
            S+    P +T HF+    +  +H   ++    +G++C       MQ  D  ++ + G+ 
Sbjct: 374 ESLDDGFPAVTFHFENSLYLK-VHPHEYL-FSYDGLWCIGWQNSGMQSRDRRNITLLGDL 431

Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
           A S+  + YD ++Q++ +   +C
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNC 454


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 100/344 (29%), Positives = 143/344 (41%), Gaps = 43/344 (12%)

Query: 48  LMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYAD 107
           + W QC PCV+C K     ++P++S +Y   SC      +  TV  +      Y   Y D
Sbjct: 98  ITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSC------IPSTVGNT------YNMTYGD 145

Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
            S + G    + +T    ++ F    FGCG NN G F     G++GLG+ +LS  SQ  S
Sbjct: 146 KSTSVGNYGCDTMTL-EPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTAS 204

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK------EDKTYYFVT 221
           +     FSYCL     + SI S + FG     S   +  TSLV+       E+  YYFV 
Sbjct: 205 KF-KKVFSYCL---PEEDSIGS-LLFGE-KATSQSSLKFTSLVNGPGTSGLEESGYYFVK 258

Query: 222 LEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT 281
           L  ISVGN   +   +P    S   +     ID+G   T LP+  Y+ L    + A+   
Sbjct: 259 LLDISVGNKRLN---VP----SSVFASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKY 311

Query: 282 PYQDPRLGS----QLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA 336
           P  + R         CY       +  P +  HF  GA V L +    I        C A
Sbjct: 312 PLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRL-NGKRVIWGNDASRLCLA 370

Query: 337 M-----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
                   ++ ++ I GN  Q  L + YD     + F    C+K
Sbjct: 371 FAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 147/320 (45%), Gaps = 40/320 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP  D Y  VDTGSD++WV C  C  C +    Q++   ++P SS +  
Sbjct: 79  GLYYTKLRLGTPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAS 137

Query: 77  ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
            +SC  ++C      S   CS Q  LC YT+ Y D S T G   ++ + F    G+S   
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N    VVFGC  + TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+CL    
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---- 253

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
                  K   G G  +  G +V  ++V       + +Y V L  ISV     + + +P 
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301

Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
             S  + S G    IDTG     L +  Y    E + NA+  +       G+Q    T S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTS 361

Query: 299 MAGIAPILTAHFDGGAKVPL 318
           +  I P ++ +F GGA + L
Sbjct: 362 VGDIFPPVSLNFAGGASMFL 381


>gi|168051774|ref|XP_001778328.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670305|gb|EDQ56876.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 165

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/148 (37%), Positives = 76/148 (51%), Gaps = 17/148 (11%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
           ++S  S  +GEY +   I TPP   I  I+DTGSDL WVQC PC+ CY Q   ++NP SS
Sbjct: 1   MESGASLGSGEYFIDIFIDTPPR-HILVIIDTGSDLTWVQCTPCLHCYLQKGLVFNPHSS 59

Query: 73  SSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITF----- 122
            SY  ++C   +   ++     +   +  Q C+Y Y Y DSS T    ATE  T      
Sbjct: 60  ESYDPVACGEPKRAFVESSNNRSTCVTDSQGCSYFYWYGDSSNTTSDFATETFTVNKTIK 119

Query: 123 -----GNSNNF-FDNVVFGCGHNNTGVF 144
                G  +      ++FGCGHNN G+F
Sbjct: 120 NDEGGGEDDTLQISKIMFGCGHNNQGLF 147


>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
          Length = 342

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 157/364 (43%), Gaps = 65/364 (17%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           Y+   +IGTPP      I+    + +W QC PC +C+KQ  P++N      Y+       
Sbjct: 28  YMANLTIGTPPQ-PASAIIHLAGEFVWTQCSPCRRCFKQDLPLFN-----RYE------- 74

Query: 84  QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGV 143
               ++T+             + D+S   G+  T+    G +     ++ FGC  ++   
Sbjct: 75  ----VETM-------------FGDTS---GIGGTDTFAIGTATA---SLAFGCAMDSNIK 111

Query: 144 FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG- 202
                 G+VGLGRT  SL    + Q+ A  FSYCL P H  +   S +  G  ++++GG 
Sbjct: 112 QLLGASGVVGLGRTPWSL----VGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLAGGK 166

Query: 203 GVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
              +T LV + +D + Y + LEGI  G++     + P  N S       + +DT    + 
Sbjct: 167 SAATTPLVNTSDDSSDYMIHLEGIKFGDV----IIEPPPNGS------VVLVDTIFGVSF 216

Query: 262 LPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------PILTAHFDGGAK 315
           L    ++ +++ V  A+   P   P     LC+   + A  A      P +   F G A 
Sbjct: 217 LVDAAFHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAA 276

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
           +  +  S ++     G  C AM       +  ++ I G   Q ++   +D D + +SF+P
Sbjct: 277 L-TVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 335

Query: 371 TDCT 374
            DC+
Sbjct: 336 ADCS 339


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 111/412 (26%), Positives = 176/412 (42%), Gaps = 83/412 (20%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQCYK-------QVKPIY 67
           S + G Y +  S GTPP    + I+DTGSD++W  C     C  C         +++P +
Sbjct: 61  SHSYGGYSVSLSFGTPPQTLSF-IMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQP-F 118

Query: 68  NPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN-----------YTYGYADSSLTKGVLA 116
            P  SSS K L C++ +C  +   + +  Q C+           Y   Y  S  T GV  
Sbjct: 119 IPKESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYG-SGTTGGVAL 177

Query: 117 TERITFGNSNNFFDNVVFGCGHNNTGVFNENE-MGLVGLGRTRLSLASQILSQLGANKFS 175
           +E +   + +    N + GC      VF+ ++  G+ G GR   SL SQ    LG  KFS
Sbjct: 178 SETLHLHSLSK--PNFLVGCS-----VFSSHQPAGIAGFGRGLSSLPSQ----LGLGKFS 226

Query: 176 YCLVP--FHTDSSITSKMYFGN---GSEVSGGGVVSTSLVSK---EDKT----YYFVTLE 223
           YCL+   F  D+  +S +        S+     +V T  V     ++K+    YY++ L 
Sbjct: 227 YCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLR 286

Query: 224 GISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAI 278
            I+VG        +PY Y S G    G + ID+G   T + ++ +  L +    Q+++  
Sbjct: 287 RITVG---GHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYR 343

Query: 279 KLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
           ++   +D  +G + C+       ++ P L  +F GGA V L         PVE  F F  
Sbjct: 344 RVKEIEDA-IGLRPCFNVSDAKTVSFPELRLYFKGGADVAL---------PVENYFAFVG 393

Query: 338 QPI-------DGDVG---------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             +       DG  G         I GNF   + ++ YD  ++ + FK   C
Sbjct: 394 GEVACLTVVTDGVAGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 97/367 (26%), Positives = 166/367 (45%), Gaps = 35/367 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G YV++  +GTPP L ++ ++DT +D +W+ C  C  C       +N  SSS+Y  +SC 
Sbjct: 102 GNYVVRAKLGTPPQL-MFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCS 159

Query: 82  SEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           + QC     ++C S      +C++   Y   S     L  + +T   + +   N  FGC 
Sbjct: 160 TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL--APDVIPNFSFGCI 217

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           ++ +G  +    GL+GLGR  +SL SQ  S L +  FSYCL  F +        YF    
Sbjct: 218 NSASG-NSLPPQGLMGLGRGPMSLVSQTTS-LYSGVFSYCLPSFRS-------FYFSGSL 268

Query: 198 EVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           ++   G    +  T L+    + + Y+V L G+SVG++     + P Y +  A S     
Sbjct: 269 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV--QVPVDPVYLTFDANSGAGTI 326

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTAHFD 311
           ID+G   T   +  Y  + ++ R  + ++ +    LG+   C+   +   +AP +T H  
Sbjct: 327 IDSGTVITRFAQPVYEAIRDEFRKQVNVSSFST--LGAFDTCFSADN-ENVAPKITLHMT 383

Query: 312 G-GAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
               K+P+   +T I      + C +M    Q  +  + +  N  Q +L I +D  +  +
Sbjct: 384 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441

Query: 367 SFKPTDC 373
              P  C
Sbjct: 442 GIAPEPC 448


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 92/381 (24%), Positives = 160/381 (41%), Gaps = 57/381 (14%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQS 82
           V+   IGTPP      ++DTGS L W+QC      + +  P   ++P+ SSS+  L C  
Sbjct: 89  VVTLPIGTPPQPQQM-VLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLPCTH 141

Query: 83  EQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C           +C   +LC+Y+Y YAD +  +G L  E++ F  S      ++ GC 
Sbjct: 142 PLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQT-TPPLILGCS 200

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL---VPFHTDSSITSKMYFG 194
                  + +  G++G+   RLS   Q        KFSYC+    P + ++  T   Y G
Sbjct: 201 SE-----SRDARGILGMNLGRLSFPFQA----KVTKFSYCVPTRQPANNNNFPTGSFYLG 251

Query: 195 NGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
           N    +    VS      +  +   D   Y V ++GI +G       + P      A   
Sbjct: 252 NNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGG--RKLNIPPSVFRPNAGGS 309

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSMA 300
           G   +D+G+  T L    Y+R+ E++   +       PR+         + +C+   +M 
Sbjct: 310 GQTMVDSGSEFTFLVDVAYDRVREEIIRVL------GPRVKKGYVYGGVADMCFDGNAME 363

Query: 301 GIAPIL---TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSD 354
            I  +L      F+ G ++ ++     +     GV C  +   + +     I GNF Q +
Sbjct: 364 -IGRLLGDVAFEFEKGVEI-VVPKERVLADVGGGVHCVGIGRSERLGAASNIIGNFHQQN 421

Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
           L++ +D  ++ + F   DC++
Sbjct: 422 LWVEFDLANRRIGFGVADCSR 442


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 170/387 (43%), Gaps = 65/387 (16%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G+PP  + +  VDTGSD++W+ C PC +C  +        +++  +SS+ K
Sbjct: 72  GLYFTKIKLGSPPK-EYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSK 130

Query: 77  ELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKG-----VLATERITFG-NSNNFF 129
           ++ C  + C  +  + SC     C+Y   YAD S + G     +L  E++T    +    
Sbjct: 131 KVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLG 190

Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
             VVFGCG + +G     +    G++G G++  S+ SQ+ +   A + FS+CL       
Sbjct: 191 QEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL------- 243

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSKL 236
                        V GGG+ +  +V            ++ +Y V L G+ V     +S  
Sbjct: 244 -----------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV---DGTSLD 289

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCY 294
           +P       +  G   +D+G      PK  Y+ L E +  R  +KL   ++    +  C+
Sbjct: 290 LP----RSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEE----TFQCF 341

Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP------IDGDVGIF 347
              +    A P ++  F+   K+  ++   ++    E ++CF  Q          +V + 
Sbjct: 342 SFSTNVDEAFPPVSFEFEDSVKLT-VYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILL 400

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           G+   S+  + YD D++++ +   +C+
Sbjct: 401 GDLVLSNKLVVYDLDNEVIGWADHNCS 427


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 174/390 (44%), Gaps = 52/390 (13%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNP 69
           S ++T  G Y  +  IGTP     Y  VDTGSD++WV C+ C  C ++        +Y+P
Sbjct: 81  SGLATETGLYFTRIGIGTPAKR-YYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDP 139

Query: 70  ASSSSYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF---- 122
             S S + ++C  + C   +     SC+S   C Y+  Y D S T G   T+ + +    
Sbjct: 140 RGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199

Query: 123 --GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSY 176
             G +     +V FGCG    G    + +   G++G G++  S+ SQ+ +     K F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259

Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSS 234
           CL       ++     F  G+ V    V +T LVS  D  +Y V L+GI VG   L   +
Sbjct: 260 CL------DTVNGGGIFAIGNVVQ-PKVKTTPLVS--DMPHYNVILKGIDVGGTALGLPT 310

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQ 291
            +    NS G I      ID+G     +P+  Y  L   V      I +   QD      
Sbjct: 311 NIFDSGNSKGTI------IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---- 360

Query: 292 LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DV 344
            C++ + S+    P +T HF+G   + ++    ++    + ++C       +Q  DG D+
Sbjct: 361 -CFQYSGSVDDGFPEVTFHFEGDVSL-IVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDM 418

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            + G+   S+  + YD ++Q + +   +C+
Sbjct: 419 VLLGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 98/375 (26%), Positives = 159/375 (42%), Gaps = 47/375 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IG+PP  +   IVDTGS + +V C  CVQC     P + P  SS+Y+ + C
Sbjct: 86  NGYYTTRLWIGSPPQ-EFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC 144

Query: 81  QSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
            ++        +C    + C Y   YA+ S + GVLA + ++FG  +       VFGC  
Sbjct: 145 NAD-------CNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCET 197

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
             +G ++ +   G++GLGR  LS+  Q++ + + +N FS C             M  G G
Sbjct: 198 MESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY----------GGMDVGGG 247

Query: 197 SEVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIP--YYNSSGAISKGN 250
           + V GG      +V S  D +   YY + L+ I V       KL P  +    GAI    
Sbjct: 248 AMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVA--GKPLKLNPRTFDGKYGAI---- 301

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYK-----TPSMAGI 302
             +D+G      P+  Y   ++ +   I   K     DP     +C+         +  +
Sbjct: 302 --LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNF-KDICFSGAGRDVTELPKV 358

Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYD 360
            P +   F  G K+ L      F    V G +C  +     D   + G     +  + Y+
Sbjct: 359 FPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYN 418

Query: 361 FDSQMVSFKPTDCTK 375
            ++  + F  T+C++
Sbjct: 419 RENSTIGFWKTNCSE 433


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 160/375 (42%), Gaps = 40/375 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYKEL 78
           Y  +  +G+PP  D Y  +DTGSD++WV C  C  C        P+  ++P SS +   +
Sbjct: 90  YYTRLQLGSPPR-DFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148

Query: 79  SCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNS--NNF 128
           SC  ++C L     D+V  +    C YT+ Y D S T G   ++ + F    G S   N 
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNS 208

Query: 129 FDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTD 184
              +VFGC    TG   + +    G+ G G+  +S+ SQ+ SQ +    FS+CL     D
Sbjct: 209 SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCL---KGD 265

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
            S    +  G   E+    +V T LV  +   +Y + L+ I V    N   L    +   
Sbjct: 266 DSGGGILVLG---EIVEPNIVYTPLVPSQ--PHYNLNLQSIYV----NGQTLAIDPSVFA 316

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL--GSQLCYKTPSMAGI 302
             S     ID+G     L +  Y+     + + +  +P   P L  G+Q    + S+  +
Sbjct: 317 TSSNQGTIIDSGTTLAYLTEAAYDPFISAITSTV--SPSVSPYLSKGNQCYLTSSSINDV 374

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSDLFIG 358
            P ++ +F GG  + LI     I         ++C   Q I G ++ I G+    D    
Sbjct: 375 FPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFV 434

Query: 359 YDFDSQMVSFKPTDC 373
           YD   Q + +   DC
Sbjct: 435 YDIAGQRIGWANYDC 449


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 162/389 (41%), Gaps = 59/389 (15%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N   ++  ++GTPP  ++  ++DTGS+L W+ C   +         ++P  S+SY+ + C
Sbjct: 28  NVSLIVSLTVGTPPQ-NVSMVIDTGSELSWLHCNKTLS----YPTTFDPTRSTSYQTIPC 82

Query: 81  QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C           SC S  LC+ T  YAD+S + G LA++    G+S+     +VFG
Sbjct: 83  SSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD--ISGLVFG 140

Query: 136 CGHNNTGVFNEN------EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           C  +   VF+ N        GL+G+ R  LS     +SQLG  KFSYC+    + +  + 
Sbjct: 141 CMDS---VFSSNSDEDSKSTGLMGMNRGSLSF----VSQLGFPKFSYCI----SGTDFSG 189

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
            +  G  +      +  T L+         D+  Y V LEGI V +     KL+P   S+
Sbjct: 190 LLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLD-----KLLPIPKST 244

Query: 244 ---GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCY 294
                   G   +D+G   T L    YN L     N     L   +DP    Q    LCY
Sbjct: 245 FEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCY 304

Query: 295 KTPSMAGIAPIL---TAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDV 344
             P    + P+L   T  F G    V        +P  + G   V C +    D    + 
Sbjct: 305 LVPLSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEA 364

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            + G+  Q ++++ +D +   +      C
Sbjct: 365 YVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 152/374 (40%), Gaps = 57/374 (15%)

Query: 34  PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSC 93
           P  +I  +VDTGS++ W     C             + S +   L C S +C    +  C
Sbjct: 42  PKDNISAVVDTGSNIFWTTEKEC-------------SRSKTRSMLPCCSPKCEQRASCGC 88

Query: 94  SSQQL---------CNYT--YGYADSSLTKGVLATERITFGN-------SNNFFDNVVFG 135
              +L         C Y   YG   +  T GVL  +++T           +  F+ V  G
Sbjct: 89  RRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIG 148

Query: 136 CGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-----ITS 189
           C  + T  F +  + G+ GLGR+    A+ +  QL  +KFSYCL  +          +T+
Sbjct: 149 CSTSATLKFKDPSIKGVFGLGRS----ATSLPRQLNFSKFSYCLSSYQKPDLPSYLLLTA 204

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
                 G+      V +T+L    D KT YFV L+GIS+G     ++L      SG    
Sbjct: 205 APDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGG----TRLPAVSTKSG---- 256

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLGSQLCYKTPSMAGIA-- 303
           GNMF+DTG   T L    + +L  ++   +K   Y   Q  R   Q+CY  PS A     
Sbjct: 257 GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTAADESS 316

Query: 304 --PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
             P +  HF   A + L   S       +         I G + + GNF   +  +  D 
Sbjct: 317 KLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGISVLGNFQMQNTHMLLDT 376

Query: 362 DSQMVSFKPTDCTK 375
            ++ +SF   DC+K
Sbjct: 377 GNEKLSFVRADCSK 390


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 166/382 (43%), Gaps = 51/382 (13%)

Query: 25  VMKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS---- 79
           V+   IGTPP   D+  ++DTGS L W+QC    +  K++ P+  P ++S    LS    
Sbjct: 67  VVSLPIGTPPQPTDL--VLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFS 123

Query: 80  --------CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
                   C+          SC   +LC+Y+Y YAD +L +G L  E+ TF  S +    
Sbjct: 124 LLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS-TPP 182

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V+ GC   +T    EN  G++G+ R RLS     +SQ   +KFSYC VP  T S+ T   
Sbjct: 183 VILGCAQAST----ENR-GILGMNRGRLSF----ISQAKISKFSYC-VPSRTGSNPTGLF 232

Query: 192 YFGNGSEVSGGGVVSTSLVSKE-------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
           Y G+    S    V T L   E       D   Y + ++ I +        + P      
Sbjct: 233 YLGDNPNSSKFKYV-TMLTFPESQSSPNLDPLAYTLPMKAIKIAG--KRLNVPPAAFKPD 289

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKTPSM 299
           A   G   ID+G+  T L  + Y +++E+V   +     K   Y D    + +C+     
Sbjct: 290 AGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADV---ADMCFDAGVT 346

Query: 300 AGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQS 353
           A +      ++  FD G ++ +      +    +GV C  +   + +     I G   Q 
Sbjct: 347 AEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQ 406

Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
           ++++ YD  ++ V F   +C++
Sbjct: 407 NMWVEYDLANKRVGFGGAECSR 428


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 114/402 (28%), Positives = 164/402 (40%), Gaps = 73/402 (18%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVKP---IYNPASSSS 74
           G Y +  S GTPP   +  I+DTGSDL+W  C     C  C +    P   I+ P SSSS
Sbjct: 88  GAYSIPLSFGTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSS 146

Query: 75  YKELSCQSEQCHLLDTVSCSSQ------------QLCNYTYGYADSSLTKGVLATERITF 122
            K L C + +C  +      S+            Q+C     +  S +T G++ +E +  
Sbjct: 147 SKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDL 206

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
                   N + GC   +T        G+ G GR   SL SQ    LG  KFSYCL+   
Sbjct: 207 PGKG--VPNFIVGCSVLST----SQPAGISGFGRGPPSLPSQ----LGLKKFSYCLLSRR 256

Query: 183 TDSSITSKMYFGNGSEVSG---GGVVSTSLVSKED-------KTYYFVTLEGISVGNLSN 232
            D +  S     +G   SG    G+  T  V             YY++ L  I+VG    
Sbjct: 257 YDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVG---G 313

Query: 233 SSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKLTPYQDPR 287
               IPY Y   GA   G   ID+G   T +  + +       E+QV++  K     +  
Sbjct: 314 KHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQS--KRATEVEGI 371

Query: 288 LGSQLCY-----KTPSMAGIAPILTAHFDGGA--KVPLIHTSTFIPPPVEGVFCFAMQPI 340
            G + C+      TPS     P LT  F GGA  ++PL +   F+    + V C  +   
Sbjct: 372 TGLRPCFNISGLNTPSF----PELTLKFRGGAEMELPLANYVAFL--GGDDVVCLTIV-T 424

Query: 341 DGDVG---------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           DG  G         I GNF Q + ++ YD  ++ + F+   C
Sbjct: 425 DGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 97/367 (26%), Positives = 166/367 (45%), Gaps = 35/367 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G YV++  +GTPP L ++ ++DT +D +W+ C  C  C       +N  SSS+Y  +SC 
Sbjct: 28  GNYVVRAKLGTPPQL-MFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCS 85

Query: 82  SEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           + QC     ++C S      +C++   Y   S     L  + +T   + +   N  FGC 
Sbjct: 86  TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL--APDVIPNFSFGCI 143

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           ++ +G  +    GL+GLGR  +SL SQ  S L +  FSYCL  F +        YF    
Sbjct: 144 NSASG-NSLPPQGLMGLGRGPMSLVSQTTS-LYSGVFSYCLPSFRS-------FYFSGSL 194

Query: 198 EVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           ++   G    +  T L+    + + Y+V L G+SVG++     + P Y +  A S     
Sbjct: 195 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV--QVPVDPVYLTFDANSGAGTI 252

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTAHFD 311
           ID+G   T   +  Y  + ++ R  + ++ +    LG+   C+   +   +AP +T H  
Sbjct: 253 IDSGTVITRFAQPVYEAIRDEFRKQVNVSSFST--LGAFDTCFSADN-ENVAPKITLHMT 309

Query: 312 G-GAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
               K+P+   +T I      + C +M    Q  +  + +  N  Q +L I +D  +  +
Sbjct: 310 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 367

Query: 367 SFKPTDC 373
              P  C
Sbjct: 368 GIAPEPC 374


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 80/257 (31%), Positives = 124/257 (48%), Gaps = 30/257 (11%)

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
           FGCG  + G       GL+GL    +SL    +SQL   +FSYCL PF      TS M F
Sbjct: 96  FGCGALSAGSL-VGASGLMGLSPGTMSL----ISQLSVPRFSYCLTPFAERK--TSPMLF 148

Query: 194 GNGSEV----SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
           G  +++    + G + +T+++     D  YY+V L G+S+G     +K +    +S AI+
Sbjct: 149 GAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLG-----TKRLRVPAASLAIN 203

Query: 248 K---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA- 303
               G   +D+G+    L    ++ +++ V  A+KL  +       +LC+  PS   +A 
Sbjct: 204 PDGTGGTIVDSGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAA 263

Query: 304 ---PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG----IFGNFAQSDLF 356
              P L  HFDGGA + L   + F  P   G+ C A+     D+G    I GN  Q ++ 
Sbjct: 264 VKTPPLVLHFDGGAAMALPRDNYFQEP-RAGLMCLAVARSPEDLGAPISIIGNVQQQNMH 322

Query: 357 IGYDFDSQMVSFKPTDC 373
           + +D  +Q  SF PT C
Sbjct: 323 VLFDVHNQKFSFAPTKC 339


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 146/349 (41%), Gaps = 40/349 (11%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD------TVS 92
           ++DT SD+ WVQC PC   QCY Q   +Y+P+ S S +  +C S  C  L       + S
Sbjct: 185 LLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCSSS 244

Query: 93  CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENE-MGL 151
            +S   C Y   Y D S T G L  ++++   ++       FGC H   G F+ ++  G+
Sbjct: 245 SNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQ-VPKFEFGCSHAARGSFSRSKTAGI 303

Query: 152 VGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVS 211
           + LGR   SL SQ  ++ G   FSYC  P     + + K +F  G         + + + 
Sbjct: 304 MALGRGVQSLVSQTSTKYG-QVFSYCFPP-----TASHKGFFVLGVPRRSSSRYAVTPML 357

Query: 212 KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
           K     Y V LE I+V        + P   ++GA       +D+    T LP   Y  L 
Sbjct: 358 KT-PMLYQVRLEAIAVAG--QRLDVPPTVFAAGAA------LDSRTVITRLPPTAYQALR 408

Query: 272 EQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDG-GAKVPLIHTSTFIPP 327
              R+  K++ Y+      QL  CY    ++ I  P ++  FD  GA V L       P 
Sbjct: 409 SAFRD--KMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQL------DPS 460

Query: 328 PVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            V    C A     GD    GI G      + + Y+     V F+   C
Sbjct: 461 GVLFGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 169/379 (44%), Gaps = 41/379 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVKPIY-NPASSSSYK 76
           G Y  K  +GTPP  + Y  +DTGSD++WV C  C  C +    Q++  Y +P SSS+  
Sbjct: 75  GLYYTKVKLGTPPR-EFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSS 133

Query: 77  ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGN------SN 126
            +SC   +C         SCSSQ   C YT+ Y D S T G   ++ + F        + 
Sbjct: 134 LISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTT 193

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFH 182
           N   +VVFGC    TG   ++E    G+ G G+  +S+ SQ+ L  +    FS+CL    
Sbjct: 194 NSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCL---K 250

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
            D+S    +  G   E+    +V + LV  + + +Y + L+ ISV     + +++P   +
Sbjct: 251 GDNSGGGVLVLG---EIVEPNIVYSPLV--QSQPHYNLNLQSISV-----NGQIVPIAPA 300

Query: 243 SGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY--KTPSM 299
             A S      +D+G     L ++ YN     +   +  +       G+Q CY   T S 
Sbjct: 301 VFATSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQ-CYLITTSSN 359

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDL 355
             I P ++ +F GGA + L      +     G   V+C   Q I G  + I G+    D 
Sbjct: 360 VDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDK 419

Query: 356 FIGYDFDSQMVSFKPTDCT 374
              YD   Q + +   DC+
Sbjct: 420 IFVYDLAGQRIGWANYDCS 438


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 98/375 (26%), Positives = 159/375 (42%), Gaps = 47/375 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IG+PP  +   IVDTGS + +V C  CVQC     P + P  SS+Y+ + C
Sbjct: 86  NGYYTTRLWIGSPPQ-EFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC 144

Query: 81  QSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
            ++        +C    + C Y   YA+ S + GVLA + ++FG  +       VFGC  
Sbjct: 145 NAD-------CNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCET 197

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
             +G ++ +   G++GLGR  LS+  Q++ + + +N FS C             M  G G
Sbjct: 198 MESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY----------GGMDVGGG 247

Query: 197 SEVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIP--YYNSSGAISKGN 250
           + V GG      +V S  D +   YY + L+ I V       KL P  +    GAI    
Sbjct: 248 AMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVA--GKPLKLNPRTFDGKYGAI---- 301

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYK-----TPSMAGI 302
             +D+G      P+  Y   ++ +   I   K     DP     +C+         +  +
Sbjct: 302 --LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNF-KDICFSGAGRDVTELPKV 358

Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYD 360
            P +   F  G K+ L      F    V G +C  +     D   + G     +  + Y+
Sbjct: 359 FPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYN 418

Query: 361 FDSQMVSFKPTDCTK 375
            ++  + F  T+C++
Sbjct: 419 RENSTIGFWKTNCSE 433


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 96/392 (24%), Positives = 156/392 (39%), Gaps = 48/392 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQV----------------- 63
           G Y++   IGTP L   Y +V DT +DL W+ C    +  K                   
Sbjct: 123 GMYLVSVRIGTPAL--PYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAK 180

Query: 64  ---KPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLAT 117
              K  Y PA SSS++ + C  ++C +L   +C   S  + C+Y     D ++T G+   
Sbjct: 181 EASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGK 240

Query: 118 ERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
           E+ T   S+        ++ GC     G   +   G++ LG   +S A     + G  +F
Sbjct: 241 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFG-QRF 299

Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNS 233
           S+CL+  ++    +S + FG    V G G + T ++   D K  Y   + G+ VG     
Sbjct: 300 SFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGG---- 355

Query: 234 SKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
            +L IP   +++   +  G + +DT    T L  + Y  +   +   +   P      G 
Sbjct: 356 ERLDIPDEVWDAERFVG-GGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGF 414

Query: 291 QLCYK--------TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-ID 341
           + CYK         P+     P  T    GGA++     S  +P    GV C A +  + 
Sbjct: 415 EYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLR 474

Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           G  GI GN    +     D     + F+   C
Sbjct: 475 GGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 506


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 163/384 (42%), Gaps = 50/384 (13%)

Query: 7   FYPNNVVQSNVSTANGE----YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
           FY + VV   +S+ + +    ++ +   G+P       + DTGS L W QC PC  CY Q
Sbjct: 37  FYDSKVVSLPLSSPHSQRGLAFMAEIHFGSPQKKQFLHM-DTGSSLTWTQCFPCSDCYAQ 95

Query: 63  -VKPIYNPASSSSYKELSCQ-----SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLA 116
            + P Y PA+S +Y++  C+     S      D ++    ++C Y   Y D +  KG LA
Sbjct: 96  KIYPKYRPAASITYRDAMCEDSHPKSNPHFAFDPLT----RICTYQQHYLDETNIKGTLA 151

Query: 117 TERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
            E IT    +  F     V FGC   + G +     G++GLG  + S    I+ + G+ K
Sbjct: 152 QEMITVDTHDGGFKRVHGVYFGCNTLSDGSYFTG-TGILGLGVGKYS----IIGEFGS-K 205

Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS 233
           FS+CL    ++   +  +  G+G+ V G   V   +   E  T +   LE I VG     
Sbjct: 206 FSFCLGEI-SEPKASHNLILGDGANVQGHPTV---INITEGHTIF--QLESIIVGEEITL 259

Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQL 292
              +             +F+DTG+  + L  + Y +  +   + I   P   +P     L
Sbjct: 260 DDPV------------QVFVDTGSTLSHLSTNLYYKFVDAFDDLIGSRPLSYEP----TL 303

Query: 293 CYKTPSMAGIAPI-LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG--IFGN 349
           CYK  ++  +  + +   FD GA++ +   + FI      + C A+Q         I G 
Sbjct: 304 CYKADTIERLEKMDVGFKFDVGAELSVNIHNIFIQQGPPEIRCLAIQNNKESFSHVIIGV 363

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
            A     +GYD  ++       DC
Sbjct: 364 IAMQGYNVGYDLSAKTAYINKQDC 387


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 95/372 (25%), Positives = 153/372 (41%), Gaps = 36/372 (9%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE---LSCQ 81
           V+   IGTPP L    ++DTGS L W+QC       K+  P  +    S       L C 
Sbjct: 83  VVTLPIGTPPQLQQM-VLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCN 141

Query: 82  SEQC--HLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
              C   + D      C +  LC+Y+Y YAD +  +G L  E+I F  S      ++ GC
Sbjct: 142 HPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQT-TPPIILGC 200

Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
                   +++  G++G+   RL   SQ        KFSYC VP       +   Y GN 
Sbjct: 201 ATQ-----SDDARGILGMNLGRLGFPSQA----KITKFSYC-VPTKQAQPASGSFYLGNN 250

Query: 197 SEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
              S    V+      +  +   D   Y + L+GIS+G       + P      A   G 
Sbjct: 251 PASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIG--GKKLNIPPSVFKPNAGGSGQ 308

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPS--MAGIAPIL 306
             ID+G+  T L  + YN + E++   +     +    G  + +C+   +  +  +   +
Sbjct: 309 TMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDM 368

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDS 363
              F+ G ++ +I     +     GV C  M   + +     I GNF Q +L++ +D  +
Sbjct: 369 VFEFEKGVQI-VIPKERVLATVDGGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLAN 427

Query: 364 QMVSFKPTDCTK 375
           + V F   DC+K
Sbjct: 428 RRVGFGEADCSK 439


>gi|357128280|ref|XP_003565802.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 530

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 101/421 (23%), Positives = 166/421 (39%), Gaps = 66/421 (15%)

Query: 13  VQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCL----------------- 54
           VQS +   N G Y++   IGTPP+     ++DT +DL W+ C                  
Sbjct: 95  VQSGMGVVNVGMYLVTVRIGTPPVA-FSMVLDTANDLTWLNCRLRRRKGKHHGRPSSTAT 153

Query: 55  ---------PCVQCYKQVKPIYNPASSSSYKELSC-QSEQCHLLDTVSCSS---QQLCNY 101
                    P +      K  Y P+ SSS++   C Q + C      +C S    + C+Y
Sbjct: 154 TTTMSAAMEPEMDAPVVKKTWYRPSLSSSWRRYRCSQKDACGSFPHNTCRSPNHNESCSY 213

Query: 102 TYGYADSSLTKGVLATERITF---------GNSNNFFDNVVFGCGHNNTGVFNENEMGLV 152
              Y D ++T+G+   E  T          G +      +V GC     G   +   G++
Sbjct: 214 EQMYEDGTVTRGIYGRETATVPVSVSGAGEGQTAVLLPGLVLGCSTFEAGATVDAHDGVL 273

Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK 212
            LG   +S  +   ++ G  +FS+CL+   +     S + FG    ++GG +  T+LV  
Sbjct: 274 TLGNHAVSFGTVAAARFGG-RFSFCLLHTMSGRDTFSYLTFGPNPALNGGAMEETNLVYS 332

Query: 213 EDKTYYFVTLEGISVGNLSNSSKL--IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRL 270
            D    F    G++ G   +  +L  IP      A+  G + +DTG   T L +  +   
Sbjct: 333 PDGEPAFGA--GVT-GVFVDGERLAGIPPEVWDPAVLGGALNLDTGTSLTGLVEPAF--- 386

Query: 271 EEQVRNAI--KLTPYQDPRL-GSQLCYKTPSMAGIA------------PILTAHFDGGAK 315
            E VR A+  +L   Q   + G  +CYK    AG              P +   F+GGA+
Sbjct: 387 -EAVRAAVDRRLGHLQKEDVAGFDICYKWAFGAGAGDEGVDPAHNVTVPKVAFEFEGGAR 445

Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           +  +     +P  V GV C   +  +    + GN    +    +D  +  + F+   CT 
Sbjct: 446 LEPVARGIVLPEVVPGVACLGFRRREVGPSVLGNVHMQEHVWEFDHMAGKLRFRKDKCTN 505

Query: 376 Q 376
            
Sbjct: 506 H 506


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 153/370 (41%), Gaps = 42/370 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
            G Y  +  IGTPP  +   IVDTGS + +V C  C  C     P ++PA SSSYK L C
Sbjct: 32  KGYYTSRVKIGTPPH-EFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLEC 90

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-FDNVVFGCGHN 139
            SE      T  C   +   Y   YA+ S + GVL  + I F NS++     +VFGC   
Sbjct: 91  GSE----CSTGFCDGSR--KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCETA 144

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLGR  LS+  Q++ +    + FS C             M  G G+
Sbjct: 145 ETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCY----------GGMDEGGGA 194

Query: 198 EVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
            + GG      +V          YY + L+GI VG      K   +    G +      +
Sbjct: 195 MILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTV------L 248

Query: 254 DTGAPPTLLP----KDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
           D+G      P    + F + ++EQV  ++K  P  D +    +CY        +++   P
Sbjct: 249 DSGTTYAYFPGAAFQAFKSAVKEQV-GSLKEVPGPDEKF-KDICYAGAGTNVSNLSQFFP 306

Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
            +   F  G  V L      F    + G +C  +        + G     ++ + Y+   
Sbjct: 307 SVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGK 366

Query: 364 QMVSFKPTDC 373
             + F  T C
Sbjct: 367 ASIGFLKTKC 376


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 105/215 (48%), Gaps = 24/215 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C QC +   P + P  SS+Y+ +SC
Sbjct: 87  NGYYTTRIWIGTPPQT-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSC 145

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D    + ++ C Y   YA+ S + GVL  + I+FGN +       +FGC + 
Sbjct: 146 N------IDCTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQ 199

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLGR  LS+  Q++ + + ++ FS C             M  G G+
Sbjct: 200 ETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCY----------GGMDIGGGA 249

Query: 198 EVSGGGVVSTSLVSKED----KTYYFVTLEGISVG 228
            + GG    + +V  E       YY + L+ I V 
Sbjct: 250 MILGGISPPSGMVFAESDPVRSQYYNIDLKAIHVA 284


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 167/378 (44%), Gaps = 44/378 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYKEL 78
           Y  K  IGTPP    +  VDTGSD++WV C+ C +C  +        +Y+P  SSS   +
Sbjct: 87  YYTKIEIGTPPK-PFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAV 145

Query: 79  SCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
           SC ++ C            C++ + C Y   Y D S T G   ++ + +    GN+    
Sbjct: 146 SCDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRH 205

Query: 130 --DNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
              NV+FGCG    G     N+   G++G G++  S  SQ+ S     K FS+CL     
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL----- 260

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
             +I     F  G EV    V ST L+   + ++Y V L+ I V    N+ +L P+   +
Sbjct: 261 -DTIKGGGIFAIG-EVVQPKVKSTPLL--PNMSHYNVNLQSIDVAG--NALQLPPHIFET 314

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGI 302
               K    ID+G   T LP+  Y  +   V    +   ++   +   LC++ + S+   
Sbjct: 315 S--EKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFR--TIQGFLCFEYSESVDDG 370

Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQSDLF 356
            P +T HF+    + +     F     + ++C        QP D  D+ + G+   S+  
Sbjct: 371 FPKITFHFEDDLGLNVYPHDYFFQNG-DNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKV 429

Query: 357 IGYDFDSQMVSFKPTDCT 374
           + YD + Q++ +   +C+
Sbjct: 430 VVYDLEKQVIGWTDYNCS 447


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 96/373 (25%), Positives = 160/373 (42%), Gaps = 43/373 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C  C +   P + P  S +Y+ + C
Sbjct: 86  NGYYTTRLWIGTPPQ-RFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC 144

Query: 81  QSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
             +        +C      C Y   YA+ S + GVL  + ++FGN +       VFGC +
Sbjct: 145 TPD-------CNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCEN 197

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQIL-SQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           + TG ++++   G++GLGR  LS+  Q++  ++ ++ FS C             M  G G
Sbjct: 198 DETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY----------GGMDVGGG 247

Query: 197 SEVSGGGVVSTSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           + + GG      +V   S  D++ YY + L+ + V       KL    N      K    
Sbjct: 248 AMILGGISPPEDMVFTHSDPDRSPYYNINLKEMHVA----GKKL--QLNPKVFDGKHGTV 301

Query: 253 IDTGAPPTLLPKD---FYNRLEEQVRNAIKLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
           +D+G     LP+     + R   + RN++K     DP     +C+         +A   P
Sbjct: 302 LDSGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNY-KDICFTGAGIDVSQLAKSFP 360

Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
           ++   F+ G K+ L      F    V G +C  +     D   + G     +  + YD +
Sbjct: 361 VVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRE 420

Query: 363 SQMVSFKPTDCTK 375
           +  + F  T+C++
Sbjct: 421 NSKIGFWKTNCSE 433


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/277 (28%), Positives = 124/277 (44%), Gaps = 22/277 (7%)

Query: 91  VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
           V  S+  +CNY   Y D S T+G L  E++ FG       + +FGCG NN G+F     G
Sbjct: 68  VCGSAAPICNYAINYGDGSFTRGELGHEKLKFGTI--LVKDFIFGCGRNNKGLFG-GVSG 124

Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV 210
           L+GLGR+ LSL SQ     G   FSYCL P        S +  GN S       +S + +
Sbjct: 125 LMGLGRSDLSLISQTSGIFGG-VFSYCL-PSTERKGSGSLILGGNSSVYRNSSPISYAKM 182

Query: 211 SKEDKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
            +  + Y  YF+ L GIS+G ++  +          ++    + +D+G   T LP   Y 
Sbjct: 183 IENPQLYNFYFINLTGISIGGVALQAP---------SVGPSRILVDSGTVITRLPPTIYK 233

Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTST--FI 325
            L+ +        P          C+   +   +  P +  HF+G A++ +  T    F+
Sbjct: 234 ALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFV 293

Query: 326 PPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYD 360
                 V C A+  ++   +V I GN+ Q +L + YD
Sbjct: 294 KSDASQV-CLALASLEYQDEVAILGNYQQKNLRVIYD 329


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 158/360 (43%), Gaps = 28/360 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV+   +GTP    I  I DTGS   WV C  C  C+   +      S++  K +SC + 
Sbjct: 82  YVISVGLGTPAKTQIVEI-DTGSSTSWVFC-ECDGCHTNPRTFLQSRSTTCAK-VSCGTS 138

Query: 84  QCHLLDTV-SCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            C L  +   C   +    C +   Y D S + G+L  + +TF +      +  FGC  +
Sbjct: 139 MCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-IPSFTFGCNLD 197

Query: 140 NTGVFNE--NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM--YFGN 195
           + G  NE  N  GL+G+G   +S+  Q  S    + FSYCL    ++    SK   YF  
Sbjct: 198 SFGA-NEFGNVDGLLGMGAGPMSVLKQ--SSPRFDGFSYCLPLQKSERGFFSKTTGYFSL 254

Query: 196 GSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
           G   +   V  T +V++   T  +FV L  ISV    +  +L     S    S+  +  D
Sbjct: 255 GKVATRTDVRYTKMVARRKNTELFFVDLAAISV----DGERL---GLSPSIFSRKGVVFD 307

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
           +G+  + +P    + L +++R  + L          + CY   S+  G  P ++ HFD G
Sbjct: 308 SGSELSYIPDRALSVLSQRIRELL-LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDG 366

Query: 314 AKVPLIHTSTFIPPPV--EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           A+  L     F+   V  + V+C A  P +  V I G+  Q+   + YD   Q++   P+
Sbjct: 367 ARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/209 (32%), Positives = 104/209 (49%), Gaps = 23/209 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP +    IVD+GS + +V C  C QC K   P + P  SS+Y+ + C
Sbjct: 90  NGYYTTRLWIGTPPQM-FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC 148

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
             + C+  D      ++ C Y   YA+ S +KGVL  + I+FGN +       VFGC   
Sbjct: 149 NMD-CNCDD-----DREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++++   G++GLG+  LSL  Q++ + L +N F  C             M  G GS
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCY----------GGMDVGGGS 252

Query: 198 EVSGGGVVSTSLV---SKEDKTYYFVTLE 223
            + GG    + +V   S  D+++   T+ 
Sbjct: 253 MILGGFDYPSDMVFTDSDPDRSFGMATVH 281


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 103/397 (25%), Positives = 170/397 (42%), Gaps = 79/397 (19%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP     Y  VDTGSD+MWV C+ C QC ++        +YN   S S K
Sbjct: 78  GLYYAKIGIGTPAK-SYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGK 136

Query: 77  ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN------SNN 127
            +SC  + C+ +       C +   C Y   Y D S T G    + + + +      +  
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196

Query: 128 FFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
              +V+FGCG   +G     NE  + G++G G+   S+ SQ+ S     K F++CL    
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL---- 252

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
                       +G   +GGG+ +   V +          ++ +Y V +  + VG   L+
Sbjct: 253 ------------DGR--NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLN 298

Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
             + L    +  GAI      ID+G     LP+  Y  L +++ +       Q+P L   
Sbjct: 299 IPADLFQPGDRKGAI------IDSGTTLAYLPEIIYEPLVKKITS-------QEPALKVH 345

Query: 292 LC---YKTPSMAGIA----PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCF-----AMQ 338
           +    YK    +G      P +T HF+    + +  H   F   P EG++C      AMQ
Sbjct: 346 IVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLF---PYEGMWCIGWQNSAMQ 402

Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             D  ++ + G+   S+  + YD ++Q++ +   +C+
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439


>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
          Length = 299

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/140 (38%), Positives = 74/140 (52%), Gaps = 27/140 (19%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V++ V   NGE++M  +IGTP   + Y  I+DTGSDL+W QC PC  C+ Q  PI++P  
Sbjct: 86  VEAPVHAGNGEFLMNLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSS+ +L C S+  H                      S T+GVLATE  TFG+++     
Sbjct: 144 SSSFSKLPCSSDLYH----------------------SSTQGVLATETFTFGDAS--VSK 179

Query: 132 VVFGCGHNNTGVFNENEMGL 151
           + FGCG +N G       GL
Sbjct: 180 IGFGCGEDNRGRAYSQGAGL 199


>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 102/381 (26%), Positives = 164/381 (43%), Gaps = 68/381 (17%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKP---IYNPASSSSYKELS 79
           IGTP +  +  + D GSDL+WV C  C+QC       Y  +      Y+P+ SS+ K LS
Sbjct: 119 IGTPHVSFLVAL-DAGSDLLWVPC-DCLQCAPLSASYYSSLDRDLNEYSPSHSSTSKHLS 176

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-------- 131
           C  + C L    + S +Q C Y+  Y   + +   L  E I    SN   DN        
Sbjct: 177 CSHQLCELGPNCN-SPKQPCPYSMDYYTENTSSSGLLVEDILHLASNG--DNALSYSVRA 233

Query: 132 -VVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSS 186
            VV GCG   +G + +     GL+GLG   +S+ S  L++ G   N FS C      D  
Sbjct: 234 PVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPS-FLAKAGLIRNSFSMCF-----DED 287

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
            + +++FG+    +     ST  ++ + + T Y V +EG  VG+            S   
Sbjct: 288 DSGRIFFGDQGPTTQQ---STPFLTLDGNYTTYVVGVEGFCVGS------------SCLK 332

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT--------P 297
            +     +DTG   T LP   Y R+ E+    +  T         + CYK+        P
Sbjct: 333 QTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLTKVP 392

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGDVGIFGNFAQSDL 355
           S+  I P+  +         +IH   F+   ++G+  FC A+QP +GD+G  G    +  
Sbjct: 393 SVKLIFPLNNSF--------VIHNPVFMIYGIQGITGFCLAIQPTEGDIGTIGQNFMAGY 444

Query: 356 FIGYDFDSQMVSFKPTDCTKQ 376
            + +D ++  + +  + C  +
Sbjct: 445 RVVFDRENMKLGWSHSSCEDR 465


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 168/397 (42%), Gaps = 79/397 (19%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP     Y  VDTGSD+MWV C+ C QC ++        +YN   S S K
Sbjct: 78  GLYYAKIGIGTPAK-SYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGK 136

Query: 77  ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN------SNN 127
            +SC  + C+ +       C +   C Y   Y D S T G    + + + +      +  
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196

Query: 128 FFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
              +V+FGCG   +G     NE  + G++G G+   S+ SQ+ S     K F++CL    
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL---- 252

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
                            +GGG+ +   V +          ++ +Y V +  + VG   L+
Sbjct: 253 --------------DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLT 298

Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
             + L    +  GAI      ID+G     LP+  Y  L +++ +       Q+P L   
Sbjct: 299 IPADLFQPGDRKGAI------IDSGTTLAYLPEIIYEPLVKKITS-------QEPALKVH 345

Query: 292 LC---YKTPSMAGIA----PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCF-----AMQ 338
           +    YK    +G      P +T HF+    + +  H   F   P EG++C      AMQ
Sbjct: 346 IVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLF---PHEGMWCIGWQNSAMQ 402

Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             D  ++ + G+   S+  + YD ++Q++ +   +C+
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 160/372 (43%), Gaps = 38/372 (10%)

Query: 26  MKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
           M  S+GTPP  L+    VD+G    WV C            ++ P  S+S+ +L C S  
Sbjct: 1   MDLSLGTPPQPLNFTLAVDSG--FSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPS 58

Query: 85  CHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCGHNN 140
           C     V  SC     C+Y   Y  +  + G L ++  T  +  N     N+  GCG ++
Sbjct: 59  CSAFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDS 118

Query: 141 TGVFN-ENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGN--- 195
            G+    +  G VG  +  +S   Q LS LG  +KF YCL P  T      K+  GN   
Sbjct: 119 GGLLELLDTSGFVGFDKGNVSFMGQ-LSALGYRSKFIYCL-PSDT---FRGKLVIGNYKL 173

Query: 196 -GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNMFI 253
             + +S     +  + + +    YF+ L  IS+    N  ++ I  + S+G    G   I
Sbjct: 174 RNASISSSMAYTPMITNPQAAELYFINLSTISIDK--NKFQVPIQGFLSNGT---GGTVI 228

Query: 254 DTGAPPTLLPKDFYNRLEEQVR----NAIKLTPYQDPRLGSQLCYKTPSMAGIAP--ILT 307
           DT    + L  DFY +L + ++    N ++++      LG +LCY   + +   P   LT
Sbjct: 229 DTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLT 288

Query: 308 AHFDGGAKVPLIHTSTFI---PPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDF 361
            HF GGA V +  ++ F+      V    C A+   + +  ++ + G + Q DL + YD 
Sbjct: 289 YHFLGGAGVEV--STWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDL 346

Query: 362 DSQMVSFKPTDC 373
           +     F    C
Sbjct: 347 EQMRYGFGAQGC 358


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 96/396 (24%), Positives = 156/396 (39%), Gaps = 52/396 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQV----------------- 63
           G Y++   IGTP L   Y +V DT +DL W+ C    +  K                   
Sbjct: 122 GMYLVSVRIGTPAL--PYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGAT 179

Query: 64  -------KPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKG 113
                  K  Y PA SSS++ + C  ++C +L   +C   S  + C+Y     D ++T G
Sbjct: 180 AAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIG 239

Query: 114 VLATERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
           +   E+ T   S+        ++ GC     G   +   G++ LG   +S A     + G
Sbjct: 240 IYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFG 299

Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGN 229
             +FS+CL+  ++    +S + FG    V G G + T ++   D K  Y   + G+ VG 
Sbjct: 300 -QRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGG 358

Query: 230 LSNSSKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
                +L IP   +++   +  G + +DT    T L  + Y  +   +   +   P    
Sbjct: 359 ----ERLDIPDEVWDAERFVG-GGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYE 413

Query: 287 RLGSQLCYK--------TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
             G + CYK         P+     P  T    GGA++     S  +P    GV C A +
Sbjct: 414 LEGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFR 473

Query: 339 P-IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
             + G  GI GN    +     D     + F+   C
Sbjct: 474 KLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 509


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 168/391 (42%), Gaps = 65/391 (16%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP  +    +DTGSD++WV C  C  C K    Q++   ++P  SSS  
Sbjct: 82  GLYYTKVKLGTPPR-EFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSAS 140

Query: 77  ELSCQSEQCH--LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG---------NS 125
            +SC   +C+        CS   LC+Y++ Y D S T G   ++ ++F          NS
Sbjct: 141 LVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200

Query: 126 NNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPF 181
           +  F   VFGC +  TG          G+ GLG+  LS+ SQ+  Q L    FS+CL   
Sbjct: 201 SAPF---VFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL--- 254

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---------YFVTLEGISVGNLSN 232
                           + SGGG++    + + D  Y         Y V L+ I+V     
Sbjct: 255 --------------KGDKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAV----- 295

Query: 233 SSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP-RLGS 290
           + +++P   S   I+ G+   IDTG     LP + Y+   + + NA+  + Y  P    S
Sbjct: 296 NGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAV--SQYGRPITYES 353

Query: 291 QLCYK-TPSMAGIAPILTAHFDGGAKV---PLIHTSTFIPPPVEGVFCFAMQPID-GDVG 345
             C++ T     + P ++  F GGA +   P  +   F       ++C   Q +    + 
Sbjct: 354 YQCFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIF-SSSGSSIWCIGFQRMSHRRIT 412

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
           I G+    D  + YD   Q + +   DC+ +
Sbjct: 413 ILGDLVLKDKVVVYDLVRQRIGWAEYDCSLE 443


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 158/360 (43%), Gaps = 28/360 (7%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV+   +GTP    I  I DTGS   WV C  C  C+   +      S++  K +SC + 
Sbjct: 82  YVISVGLGTPAKTQIVEI-DTGSSTSWVFC-ECDGCHTNPRTFLQSRSTTCAK-VSCGTS 138

Query: 84  QCHLLDTV-SCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
            C L  +   C   +    C +   Y D S + G+L  + +TF +         FGC  +
Sbjct: 139 MCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-IPGFSFGCNMD 197

Query: 140 NTGVFNE--NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM--YFGN 195
           + G  NE  N  GL+G+G   +S+  Q  S    + FSYCL    ++    SK   YF  
Sbjct: 198 SFGA-NEFGNVDGLLGMGAGPMSVLKQ--SSPTFDCFSYCLPLQKSERGFFSKTTGYFSL 254

Query: 196 GSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
           G   +   V  T +V+++  T  +FV L  ISV    +  +L     S    S+  +  D
Sbjct: 255 GKVATRTDVRYTKMVARKKNTELFFVDLTAISV----DGERL---GLSPSVFSRKGVVFD 307

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
           +G+  + +P    + L +++R  + L          + CY   S+  G  P ++ HFD G
Sbjct: 308 SGSELSYIPDRALSVLSQRIRELL-LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDG 366

Query: 314 AKVPLIHTSTFIPPPV--EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
           A+  L     F+   V  + V+C A  P +  V I G+  Q+   + YD   Q++   P+
Sbjct: 367 ARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 173/390 (44%), Gaps = 52/390 (13%)

Query: 15  SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNP 69
           S ++T  G Y  +  IGTP     Y  VDTGSD++WV C+ C  C ++        +Y+P
Sbjct: 81  SGLATETGLYFTRIGIGTPAKR-YYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDP 139

Query: 70  ASSSSYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF---- 122
             S S + ++C  + C   +     SC+S   C Y+  Y D S T G   T+ + +    
Sbjct: 140 RGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199

Query: 123 --GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSY 176
             G +     +V FGCG    G    + +   G++G G++  S+ SQ+ +     K F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259

Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSS 234
           CL       ++     F  G+ V    V +T LV   D  +Y V L+GI VG   L   +
Sbjct: 260 CL------DTVNGGGIFAIGNVVQ-PKVKTTPLV--PDMPHYNVILKGIDVGGTALGLPT 310

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQ 291
            +    NS G I      ID+G     +P+  Y  L   V      I +   QD      
Sbjct: 311 NIFDSGNSKGTI------IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---- 360

Query: 292 LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DV 344
            C++ + S+    P +T HF+G   + ++    ++    + ++C       +Q  DG D+
Sbjct: 361 -CFQYSGSVDDGFPEVTFHFEGDVSL-IVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDM 418

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            + G+   S+  + YD ++Q + +   +C+
Sbjct: 419 VLLGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 324

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 112/247 (45%), Gaps = 33/247 (13%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYNPASSSSYKELSC 80
           ++   IGTPP      ++DTGS L W+QC    LP      + K  ++P+ SSS+  L C
Sbjct: 75  IISLPIGTPPQAQQM-VLDTGSQLSWIQCHRKKLP-----PKPKTSFDPSLSSSFSTLPC 128

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
               C           SC S +LC+Y+Y YAD +  +G L  E+ITF N+      ++ G
Sbjct: 129 SHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNT-EITPPLILG 187

Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
           C   ++     ++ G++G+ R RLS     +SQ    KFSYC+ P       T    F  
Sbjct: 188 CATESS-----DDRGILGMNRGRLSF----VSQAKITKFSYCIPPKSNRPGFTPTGSFYL 238

Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLE--------GISVGNLSNSSKLIPYYNSSGAIS 247
           G   +  G    SL++  ++    V  E        GI    +  SS L    N  G + 
Sbjct: 239 GDNPNSKGFKYVSLLTFPERVEILVPKERVLVNVGDGIHCVGIGRSSMLGAASNIIGNVH 298

Query: 248 KGNMFID 254
           + N++++
Sbjct: 299 QQNLWVE 305


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 159/375 (42%), Gaps = 39/375 (10%)

Query: 25  VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQS 82
           ++   IGTP       ++DTGS L W+QC P         P   ++P+ SSS+ +L C  
Sbjct: 82  ILSLPIGTPSQSQEL-VLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSH 140

Query: 83  EQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
             C           SC S +LC+Y+Y YAD +  +G L  E+ TF NS      ++ GC 
Sbjct: 141 PLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT-TPPLILGCA 199

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS---SITSKMYFG 194
             +T V      G++G+   RLS     +SQ   +KFSYC +P  ++    + T   Y G
Sbjct: 200 KESTDV-----KGILGMNLGRLSF----ISQAKISKFSYC-IPTRSNRPGLASTGSFYLG 249

Query: 195 NGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
                 G   VS      +  +   D   Y V L GI +G    +     +   +G    
Sbjct: 250 ENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGG--S 307

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYKTPSMAGIAPI- 305
           G   +D+G+  T L    Y++++E++   +     +    GS   +C+       I  + 
Sbjct: 308 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLI 367

Query: 306 --LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYD 360
             L   F  G ++ L+     +     G+ C  +     +     I GN  Q +L++ +D
Sbjct: 368 GDLVFEFGRGVEI-LVEKQRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 426

Query: 361 FDSQMVSFKPTDCTK 375
             ++ V F   +C++
Sbjct: 427 VANRRVGFSKAECSR 441


>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
          Length = 278

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/131 (38%), Positives = 72/131 (54%), Gaps = 27/131 (20%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
           V++ V   NGE++MK +IGTP   + Y  I+DTGSDL+W QC PC  C+ Q  PI++P  
Sbjct: 79  VEAPVHAGNGEFLMKLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKK 136

Query: 72  SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
           SSS+ +L C S+  +                      S T+GVLATE   FG+++     
Sbjct: 137 SSSFSKLPCSSDLYY----------------------SSTQGVLATETFAFGDAS--VSK 172

Query: 132 VVFGCGHNNTG 142
           + FGCG +N G
Sbjct: 173 IGFGCGEDNDG 183


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 157/368 (42%), Gaps = 33/368 (8%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C  C     P + P +S +Y+ + C
Sbjct: 90  NGYYTTRLWIGTPPQ-RFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC 148

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
            + QC+  D      ++ C Y   YA+ S + GVL  + ++FGN +       +FGC ++
Sbjct: 149 -TWQCNCDD-----DRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCEND 202

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILS-QLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
            TG ++N+   G++GLGR  LS+  Q++  ++ ++ FS C   +         M  G  S
Sbjct: 203 ETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLC---YGGMGVGGGAMVLGGIS 259

Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
             +      +  V      YY + L+ I V       +L  + N      K    +D+G 
Sbjct: 260 PPADMVFTHSDPVR---SPYYNIDLKEIHVAG----KRL--HLNPKVFDGKHGTVLDSGT 310

Query: 258 PPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAH 309
               LP+  +   +  +    +++K     DP   + +C+         ++   P++   
Sbjct: 311 TYAYLPESAFLAFKHAIMKETHSLKRISGPDPHY-NDICFSGAEINVSQLSKSFPVVEMV 369

Query: 310 FDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVS 367
           F  G K+ L      F    V G +C  +     D   + G     +  + YD +   + 
Sbjct: 370 FGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIG 429

Query: 368 FKPTDCTK 375
           F  T+C++
Sbjct: 430 FWKTNCSE 437


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/443 (23%), Positives = 167/443 (37%), Gaps = 99/443 (22%)

Query: 13  VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCL-----PCVQCYKQVKP 65
           + S   T  G+Y ++F +GTP  P L +    DTGSDL WV+C           Y    P
Sbjct: 96  LSSGAYTGTGQYFVRFRVGTPARPFLLV---ADTGSDLTWVKCHRHDHDAPAPGYGYAAP 152

Query: 66  ----------------------IYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLC 99
                                 ++ P  S ++  + C S+ C         +C +    C
Sbjct: 153 ASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPC 212

Query: 100 NYTYGYADSSLTKGVLATERITFGNSNN---------FFDNVVFGCGHNNTGVFNENEMG 150
            Y Y Y D S  +G + T+  T   S               VV GC  + TG       G
Sbjct: 213 AYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDG 272

Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-- 208
           ++ LG + +S AS+  ++ G  +FSYCLV      + TS + FG    VS      T+  
Sbjct: 273 VLSLGYSNISFASRAAARFG-GRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACA 331

Query: 209 --------------------LVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAIS 247
                               L+    + +Y VT+ GISV G L    +L+ +  + G   
Sbjct: 332 GGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLV-WDVAKG--- 387

Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP------------YQDPRLGSQLCYK 295
            G   +D+G   T+L    Y  +   +   +   P            +  P  G  L   
Sbjct: 388 -GGAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRVTMDPFDYCYNWTSPSTGEDLTVA 446

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQ 352
            P +A        HF G A++      +++     GV C  +Q  +G+   V + GN  Q
Sbjct: 447 MPELA-------VHFAGSARL-QPPAKSYVIDAAPGVKCIGLQ--EGEWPGVSVIGNILQ 496

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
            +    +D  ++ + FK + CT+
Sbjct: 497 QEHLWEFDLKNRRLRFKRSRCTQ 519


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 117/410 (28%), Positives = 169/410 (41%), Gaps = 74/410 (18%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC------YKQVKPIYN 68
           S + G Y M  S+GTP    +  I+DTGS L+W  C     C  C        ++ P + 
Sbjct: 78  SRSYGGYSMSLSLGTPSQ-TVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKI-PKFM 135

Query: 69  PASSSSYKELSCQSEQCHLLDTVSCSSQ-QLCN------------YTYGYADSSLTKGVL 115
           P  SSS K + C++ +C  +   S  S+   CN            Y   Y   S T G+L
Sbjct: 136 PRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLL 194

Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
            +E I F N      + + GC   +T        G+ G GR++ SL  Q    LG  KFS
Sbjct: 195 LSETINFPNKT--ISDFLAGCSLLST----RQPEGIAGFGRSQESLPLQ----LGLKKFS 244

Query: 176 YCLVPFH-TDSSITSKMYFGNGSEVSGGGVVSTS-------LVSKED---KTYYFVTLEG 224
           YCLV     DS ++S +    G   S       S       L S+ +   + YY+V L  
Sbjct: 245 YCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRK 304

Query: 225 ISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRL----EEQVRNAIK 279
           I VG    +   +PY +   G+   G   +D+G+  T +    +  L    E+Q+ N   
Sbjct: 305 IIVGK---THVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTV 361

Query: 280 LTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAK--VPLIHTSTFIPPPVEGVFCFA 336
            T  Q    G + C+       +  P LT  F GGAK  +PL +   F+     GV C  
Sbjct: 362 ATNVQK-LTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVD---MGVVCLT 417

Query: 337 M-----QPIDGDVG--------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
           +       + GD G        I GNF Q + +I YD ++    FK   C
Sbjct: 418 IVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 165/382 (43%), Gaps = 51/382 (13%)

Query: 25  VMKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS---- 79
           V+   IGTPP   D+  ++DTGS L W+QC    +  K++ P+  P ++S    LS    
Sbjct: 67  VVSLPIGTPPQPTDL--VLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFS 123

Query: 80  --------CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
                   C+          SC   +LC+Y+Y YAD +L +G L  E+ TF  S +    
Sbjct: 124 LLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS-TPP 182

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           V+ GC   +T    EN  G++G+   RLS     +SQ   +KFSYC VP  T S+ T   
Sbjct: 183 VILGCAQAST----ENR-GILGMNHGRLSF----ISQAKISKFSYC-VPSRTGSNPTGLF 232

Query: 192 YFGNGSEVSGGGVVSTSLVSKE-------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
           Y G+    S    V T L   E       D   Y + ++ I +        + P      
Sbjct: 233 YLGDNPNSSKFKYV-TMLTFPESQSSPNLDPLAYTLPMKAIKIAG--KRLNIPPAAFKPD 289

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKTPSM 299
           A   G   ID+G+  T L  + Y +++E+V   +     K   Y D    + +C+     
Sbjct: 290 AGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADV---ADMCFDAGVT 346

Query: 300 AGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQS 353
           A +      ++  FD G ++ +      +    +GV C  +   + +     I G   Q 
Sbjct: 347 AEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQ 406

Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
           ++++ YD  ++ V F   +C++
Sbjct: 407 NMWVEYDLANKRVGFGGAECSR 428


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 174/386 (45%), Gaps = 60/386 (15%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G PP  +I  ++DTGS+L W+ C    +    +  ++NP SSS+Y  + C
Sbjct: 62  NVTLTVTLAVGDPPQ-NISMVLDTGSELSWLHC----KKSPNLGSVFNPVSSSTYSPVPC 116

Query: 81  QSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S  C      L    SC  +  LC+    YAD++  +G LA E    G+        +F
Sbjct: 117 SSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR--PGTLF 174

Query: 135 GC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           GC   G ++    +    GL+G+ R  LS     ++QLG +KFSYC+    +DSS+   +
Sbjct: 175 GCMDSGLSSNSEEDAKSTGLMGMNRGSLSF----VNQLGFSKFSYCIS--GSDSSVF--L 226

Query: 192 YFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSG 244
             G+ S    G +  T LV +       D+  Y V LEGI VG+ + +  K +   + +G
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQ----LCYKT 296
           A   G   +D+G   T L    Y  L+     Q ++ ++L    DP    Q    LCYK 
Sbjct: 287 A---GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLV--DDPDFVFQGTMDLCYKV 341

Query: 297 -----PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--- 342
                P+ +G+ P+++  F G      G K+ L   +       E V+CF     D    
Sbjct: 342 GSTTRPNFSGL-PMVSLMFRGAEMSVSGQKL-LYRVNGAGSEGKEEVYCFTFGNSDLLGI 399

Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSF 368
           +  + G+  Q ++++ +D     V F
Sbjct: 400 EAFVIGHHHQQNVWMEFDLAKSRVGF 425


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 89/347 (25%), Positives = 127/347 (36%), Gaps = 80/347 (23%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELS 79
           EYV+   +G+P +     ++DTGSD+ WVQC PC     C+     +++PA+SS+Y   +
Sbjct: 105 EYVISVGLGSPAVTQRV-VIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFN 163

Query: 80  CQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           C +  C  L    +   C ++  C Y   Y D S T G                    FG
Sbjct: 164 CSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT----------------GFQFG 207

Query: 136 CGHNNTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C H   G   ++   GL+GLG    SL SQ                              
Sbjct: 208 CSHAELGAGMDDKTDGLIGLGGDAQSLVSQ------------------------------ 237

Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
                       T+  SK+  TYYF  LE I+VG       L P   ++G++      +D
Sbjct: 238 ------------TAARSKKVPTYYFAALEDIAVGG--KKLGLSPSVFAAGSL------VD 277

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGG 313
           +G   T LP   Y  L    R  +      +P      C+    +  ++ P +   F GG
Sbjct: 278 SGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGG 337

Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
           A V L            G   FA    D   G  GN  Q    + YD
Sbjct: 338 AVVDLDAHGIV----SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 168/391 (42%), Gaps = 65/391 (16%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
           G Y  K  +GTPP  +    +DTGSD++WV C  C  C K    Q++   ++P  SSS  
Sbjct: 82  GLYYTKVKLGTPPR-EFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSAS 140

Query: 77  ELSCQSEQCH--LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG---------NS 125
            +SC   +C+        CS   LC+Y++ Y D S T G   ++ ++F          NS
Sbjct: 141 LVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200

Query: 126 NNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPF 181
           +  F   VFGC +  +G          G+ GLG+  LS+ SQ+  Q L    FS+CL   
Sbjct: 201 SAPF---VFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL--- 254

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---------YFVTLEGISVGNLSN 232
                           + SGGG++    + + D  Y         Y V L+ I+V     
Sbjct: 255 --------------KGDKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAV----- 295

Query: 233 SSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP-RLGS 290
           + +++P   S   I+ G+   IDTG     LP + Y+   + V NA+  + Y  P    S
Sbjct: 296 NGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAV--SQYGRPITYES 353

Query: 291 QLCYK-TPSMAGIAPILTAHFDGGAKV---PLIHTSTFIPPPVEGVFCFAMQPID-GDVG 345
             C++ T     + P ++  F GGA +   P  +   F       ++C   Q +    + 
Sbjct: 354 YQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWCIGFQRMSHRRIT 412

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
           I G+    D  + YD   Q + +   DC+ +
Sbjct: 413 ILGDLVLKDKVVVYDLVRQRIGWAEYDCSLE 443


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 169/389 (43%), Gaps = 59/389 (15%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G+PP  ++  ++DTGS+L W+ C    +  + +  ++NP SS +Y ++ C
Sbjct: 66  NVSLTVSLTVGSPPQ-NVTMVLDTGSELSWLHC----KKTQFLNSVFNPLSSKTYSKVPC 120

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C      L   VSC + +LC+    YAD++  +G LA E    G+        +FG
Sbjct: 121 LSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK--PATIFG 178

Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           C   G ++    +    GL+G+ R  LS     ++Q+G  KFSYC+  F  DS+    + 
Sbjct: 179 CMDSGFSSNSEEDSKTTGLIGMNRGSLSF----VNQMGYPKFSYCISGF--DSA--GVLL 230

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNS---SKLIPYYNSS 243
            GN S      +  T LV         D+  Y V LEGI V N   S   S  +P  + +
Sbjct: 231 LGNASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVP--DHT 288

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLE----EQVRNAIKLTPYQDPRLGSQ----LCY- 294
           GA   G   +D+G   T L    Y  L+     Q R  +K+    D     Q    LCY 
Sbjct: 289 GA---GQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKV--LNDDNFVFQGAMDLCYL 343

Query: 295 ---KTPSMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDV 344
                P++  + P+++  F G    V        +P  V G   V+CF     D    + 
Sbjct: 344 LDSSRPNLQNL-PVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEA 402

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            + G+  Q ++++ +D +   +      C
Sbjct: 403 FVIGHHHQQNVWMEFDLEKSRIGLADVRC 431


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 104/399 (26%), Positives = 167/399 (41%), Gaps = 81/399 (20%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP   D Y  VDTG+D+MWV C+ C +C  +        +YN   SSS K
Sbjct: 71  GLYYAKIGIGTPSK-DYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGK 129

Query: 77  ELSCQSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD- 130
            + C  E C      LL   +  +   C Y   Y D S T G    + + F   +     
Sbjct: 130 LVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKT 189

Query: 131 -----NVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
                +V+FGCG   +G     NE  + G++G G+   S+ SQ+ S     K F++CL  
Sbjct: 190 ASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL-- 247

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN-- 229
                         NG  V+GGG+ +   V +          D+ +Y V +  I VG+  
Sbjct: 248 --------------NG--VNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTF 291

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           L+ S+      +S G I      ID+G     LP   Y  L  ++ +       Q P L 
Sbjct: 292 LNLSTDASEQRDSKGTI------IDSGTTLAYLPDGIYQPLVYKILS-------QQPNLK 338

Query: 290 SQ------LCYK-TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPID 341
            Q       C++ + S+    P +T +F+ G  + +  H   F+    E ++C   Q   
Sbjct: 339 VQTLHDEYTCFQYSGSVDDGFPNVTFYFENGLSLKVYPHDYLFLS---ENLWCIGWQNSG 395

Query: 342 G------DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                  ++ + G+   S+  + YD ++Q++ +   +C+
Sbjct: 396 AQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 434


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 171/383 (44%), Gaps = 51/383 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ----VK-PIYNPASSSSYK 76
           G Y  K  +G PP  D Y  VDTGSD++WV C  C +C  +    VK  +Y+P SS+S  
Sbjct: 80  GLYFAKIGLGNPPK-DYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138

Query: 77  ELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
            + C  + C          C+    C Y+  Y D S T G    + + F    GN     
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198

Query: 130 DN--VVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
            N  V+FGCG   +G     +E   G++G G+   S+ SQ+ +     + F++CL     
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL----- 253

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYN 241
             ++     F  G EV    V +T +V   ++ +Y V ++ I VG   L   + +    +
Sbjct: 254 -DNVKGGGIFAIG-EVVSPKVNTTPMVP--NQPHYNVVMKEIEVGGNVLELPTDIFDTGD 309

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYK-TP 297
             G I      ID+G     LP+  Y  +  ++   +  +KL   ++       C++ T 
Sbjct: 310 RRGTI------IDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF----TCFQYTG 359

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFA 351
           ++    P++  HF+G   +  ++   ++    E V+CF      MQ  DG D+ + G+  
Sbjct: 360 NVNEGFPVVKFHFNGSLSLT-VNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLV 418

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
            S+  + YD ++Q + +   +C+
Sbjct: 419 LSNKLVLYDLENQAIGWTDYNCS 441


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 164/381 (43%), Gaps = 65/381 (17%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G+PP  + +  VDTGSD++W+ C PC +C  +        +++  +SS+ K
Sbjct: 72  GLYFTKIKLGSPPK-EYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSK 130

Query: 77  ELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
           ++ C  + C  +  + SC     C+Y   YAD S + G    + +T         +    
Sbjct: 131 KVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLG 190

Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
             VVFGCG + +G     +    G++G G++  S+ SQ+ +   A + FS+CL       
Sbjct: 191 QEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL------- 243

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSKL 236
                        V GGG+ +  +V            ++ +Y V L G+ V     +S  
Sbjct: 244 -----------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV---DGTSLD 289

Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCY 294
           +P       +  G   +D+G      PK  Y+ L E +  R  +KL   ++    +  C+
Sbjct: 290 LP----RSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEE----TFQCF 341

Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP------IDGDVGIF 347
              +    A P ++  F+   K+  ++   ++    E ++CF  Q          +V + 
Sbjct: 342 SFSTNVDEAFPPVSFEFEDSVKLT-VYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILL 400

Query: 348 GNFAQSDLFIGYDFDSQMVSF 368
           G+   S+  + YD D++++ +
Sbjct: 401 GDLVLSNKLVVYDLDNEVIGW 421


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/396 (24%), Positives = 163/396 (41%), Gaps = 71/396 (17%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           ++ G Y  K  +G+P   + Y  VDTGSD++WV C  C  C K+        +Y+P  S 
Sbjct: 67  SSTGLYYTKVGLGSPAK-EFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSK 125

Query: 74  SYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
           +   + C    C   DT S     C     C Y+  Y D S T G    + +TF    GN
Sbjct: 126 TSNAVPCGDGFC--TDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGN 183

Query: 125 SNNFFDN--VVFGCGHNNTGVFNENE----MGLVGLGRTRLSLASQILSQLGANK-FSYC 177
            +   DN  V+FGCG   +G  + N      G++G G+   S+ SQ+ +     + FS+C
Sbjct: 184 LHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHC 243

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED---------KTYYFVTLEGISVG 228
           L   H                  GGG+ S   V +             +Y V L+ + V 
Sbjct: 244 LDSHH------------------GGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDV- 284

Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQD 285
                  L+P Y       +G + ID+G     LP   YN+L  +V   +  +KL   +D
Sbjct: 285 --DGEPILLPLYLFDSGSGRGTI-IDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVED 341

Query: 286 PRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--- 341
                  C+  +  +    P++  HF+G +    +H   ++    E ++C   Q      
Sbjct: 342 ----QFTCFHYSDKLDEGFPVVKFHFEGLSLT--VHPHDYLFLYKEDIYCIGWQKSSTQT 395

Query: 342 ---GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
               D+ + G+   S+  + YD ++ ++ +   +C+
Sbjct: 396 KEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCS 431


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 113/235 (48%), Gaps = 17/235 (7%)

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
           GL+GLGR RLSL    +SQ GA KFSYCL P+  ++  T  ++ G  + + G G V T+ 
Sbjct: 153 GLMGLGRGRLSL----VSQTGATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQ 208

Query: 210 VSKEDK--TYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKD 265
             K  K   +Y++ L G++VG   L   + +      +  +  G + ID+G+P T L  D
Sbjct: 209 FVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHD 268

Query: 266 FYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTS 322
            Y+ L  ++    N   + P  D   G+ LC     +  + P +  HF GGA + +   S
Sbjct: 269 AYDALASELAARLNGSLVAPPPDADDGA-LCVARRDVGRVVPAVVFHFRGGADMAVPAES 327

Query: 323 TFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            +   PV+           G      + GN+ Q ++ + YD  +   SF+P DC+
Sbjct: 328 YWA--PVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCS 380



 Score = 41.6 bits (96), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 17  VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKP 65
           V  A  +YV ++ IG PP      ++DTGSDL+W QC  C+ Q + Q  P
Sbjct: 83  VRWATLQYVAEYLIGDPPQ-RAEALIDTGSDLVWTQCSTCLRQGFSQAGP 131


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 162/367 (44%), Gaps = 48/367 (13%)

Query: 29  SIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKEL 78
           ++GTP   D + + +DTGSDL W+ C  C  C +++K          IY+P +SS+  ++
Sbjct: 109 TVGTPS--DWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 165

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVV 133
            C S  C   D  + S +  C Y   Y ++ + + GVL  + +       +S      V 
Sbjct: 166 PCNSTLCTRGDRCA-SPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVT 224

Query: 134 FGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
           FGCG   TGVF++     GL GLG   +S+ S +  + + AN FS C   F  D +   +
Sbjct: 225 FGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGNDGA--GR 279

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG+   V       T L  ++    Y +T+  ISVG             ++G +    
Sbjct: 280 ISFGDKGSVDQR---ETPLNIRQPHPTYNITVTKISVGG------------NTGDLEFDA 324

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYK-TPSMAGIA-PIL 306
           +F D+G   T L    Y  + E   +      YQ  D  L  + CY  +P+      P +
Sbjct: 325 VF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
                GG+  P+ H    IP     V+C A+  I+ D+ I G    +   + +D +  ++
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLIL 442

Query: 367 SFKPTDC 373
            +K +DC
Sbjct: 443 GWKESDC 449


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 99/412 (24%), Positives = 164/412 (39%), Gaps = 57/412 (13%)

Query: 13  VQSNVSTAN-GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQC----------------- 53
           ++S ++TA+ G Y++    GTP L   Y +V DT +DL W+ C                 
Sbjct: 128 MRSALNTAHVGMYLVSVRFGTPAL--PYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKT 185

Query: 54  ---------LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQ---QLCNY 101
                    +  +   +  K  Y PA SSS++ + C  +QC  L   +C S    + C+Y
Sbjct: 186 MSVGGDDDVVAALAKKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSY 245

Query: 102 TYGYADSSLTKGVLATERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTR 158
                D ++T G+   E+ T   S+        +V GC     G   +   G++ LG   
Sbjct: 246 YQKTQDGTVTIGIYGNEKATVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGH 305

Query: 159 LSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTY 217
           +S A   + + G  +FS+CL+  ++    +S + FG    V G G + T ++   D K  
Sbjct: 306 MSFAIHAVLRFGG-RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAA 364

Query: 218 YFVTLEGISVGNLSNSSKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
           Y   +  + VG      +L IP   +N    +  G + +DT    T L  + Y  L   +
Sbjct: 365 YGPRVTAVLVGG----ERLDIPDDVWNIDKGLGSG-VILDTSTSVTSLVPEAYEPLVAAL 419

Query: 275 RNAIKLTPYQDPRLGSQLCYK--------TPSMAGIAPILTAHFDGGAKVPLIHTSTFIP 326
              +   P ++   G + CY+         P+     P +T    GGA++     S  +P
Sbjct: 420 DRHLAHLP-RESFAGFEYCYRWTFTGDGVDPAHNVTIPKVTVEMTGGARLEPEAKSVVMP 478

Query: 327 PPVEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
               GV C A +  P  G   I GN    +     D       F+   C  +
Sbjct: 479 EVGHGVACLAFRKLPWGGGPCIIGNVLMQEYIWEIDHSKATFRFRKDKCNTR 530


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 94/370 (25%), Positives = 157/370 (42%), Gaps = 37/370 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C QC +   P + P  SS+Y+ + C
Sbjct: 10  NGYYTTRLWIGTPPQ-RFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKC 68

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
                  +D      +Q C Y   YA+ S + GVL  + I+FGN +       VFGC + 
Sbjct: 69  N------IDCNCDDEKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENM 122

Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
            TG +++++  G++G+GR  LS+   ++ +   N  FS C   +         M  G   
Sbjct: 123 ETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLC---YGGMGIGGGAMVLG--- 176

Query: 198 EVSGGGVVSTSLVSKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
              G    S  + S+ D     YY + L+ I V     + K +P  N +    K    +D
Sbjct: 177 ---GISPPSNMVFSQSDPVRSPYYNIDLKEIHV-----AGKPLP-LNPTVFDGKHGTILD 227

Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIK-LTPYQDPRLG-SQLCY-----KTPSMAGIAPILT 307
           +G     LP+  +   ++ +   +  L P + P    + +C+         ++   P + 
Sbjct: 228 SGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVE 287

Query: 308 AHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
             F  G K+ L      F    V G +C  + Q       + G     +  + YD ++  
Sbjct: 288 MVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSK 347

Query: 366 VSFKPTDCTK 375
           + F  T+C++
Sbjct: 348 IGFWKTNCSE 357


>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
          Length = 429

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 114/419 (27%), Positives = 177/419 (42%), Gaps = 71/419 (16%)

Query: 17  VSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC-----LPCVQC------YKQVK 64
           V+T    Y++  ++G PP +  +Y  +DTGSDL WV C       C++C       K + 
Sbjct: 18  VTTYTDGYLLSLNLGMPPQVFQVY--LDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIP 75

Query: 65  PIYNPASSSSYKELSCQSEQC---HLLD-------TVSCS----SQQLCN-----YTYGY 105
                 SSS+ KEL C S  C   H  D        V C+       LC      ++Y Y
Sbjct: 76  SFSPSQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFSYTY 134

Query: 106 ADSSLTKGVLATERITFGNS----NNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
              +L  G LA + +T   S        D     FGC     G      +G+ G G+  L
Sbjct: 135 GGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGC----VGSSIREPIGIAGFGKGIL 190

Query: 160 SLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKT 216
           SL SQ+        FS+C + F    + + TS +  G+ +  +    + T ++ S  +  
Sbjct: 191 SLPSQL--GFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPN 248

Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
           +Y++ LEG+S+G+   +    P  +S  +   G M +DTG   T LP  FY  +   + +
Sbjct: 249 FYYIGLEGVSIGD-GAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLAS 307

Query: 277 AIKLTPYQD--PRLGSQLCYK-----TPSMAGIAPILTAHFDGGAKVPLIHTSTF--IPP 327
            I      D   R G  LC+K     TP      P++  HF G  K+ L   S +  +  
Sbjct: 308 VILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTA 367

Query: 328 PVEGVF--CFAMQPIDG--DVG--------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           P   V   C   Q +D   DVG        + G+F   ++ + YD ++  + F+P DC 
Sbjct: 368 PKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDCA 426


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 162/382 (42%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +  
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   TSL    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 285 GYMILGRYDRAAMDGGY-TSLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 97/394 (24%), Positives = 167/394 (42%), Gaps = 50/394 (12%)

Query: 11  NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----P 65
           N+  + + T  G Y  K  +G+PP  D Y  VDTGSD++WV C+ C +C ++        
Sbjct: 57  NLGGNGLPTETGLYFTKLGLGSPPR-DYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLT 115

Query: 66  IYNPASSSSYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           +Y+P  S +   +SC  + C          C S+  C Y+  Y D S T G    + +T+
Sbjct: 116 LYDPKGSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTY 175

Query: 123 GNSNNFF------DNVVFGCGHNNTGVF----NENEMGLVGLGRTRLSLASQILSQLGAN 172
              N          +++FGCG   +G       E   G++G G+   S+ SQ+ +     
Sbjct: 176 NRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVK 235

Query: 173 K-FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN-- 229
           K FS+CL       ++     F  G EV    V +T LV +    +Y V L+ I V    
Sbjct: 236 KIFSHCL------DNVRGGGIFAIG-EVVEPKVSTTPLVPR--MAHYNVVLKSIEVDTDI 286

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           L   S +    N  G +      ID+G     LP   Y+ L ++V   +   P     L 
Sbjct: 287 LQLPSDIFDSVNGKGTV------IDSGTTLAYLPDIVYDELIQKV---LARQPGLKLYLV 337

Query: 290 SQ--LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---- 342
            Q   C+  T ++    P++  HF     +  ++   ++    +G++C   Q        
Sbjct: 338 EQQFRCFLYTGNVDRGFPVVKLHFKDSLSLT-VYPHDYLFQFKDGIWCIGWQRSVAQTKN 396

Query: 343 --DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             D+ + G+   S+  + YD ++ ++ +   +C+
Sbjct: 397 GKDMTLLGDLVLSNKLVIYDLENMVIGWTDYNCS 430


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 99/396 (25%), Positives = 160/396 (40%), Gaps = 57/396 (14%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS--------SS 73
           G Y +  + GTPP  ++  I DTGS L+W  C    +C +   P  +PA+        SS
Sbjct: 130 GAYSVSLAFGTPPQ-NLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSS 188

Query: 74  SYKELSCQSEQCHLL---------DTVSCSSQQLCNYTYGYA---DSSLTKGVLATERIT 121
           S K + C++ +C  +            +  S++  +   GY     S  T G+L +E + 
Sbjct: 189 SVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSETLD 248

Query: 122 FGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
             N      + + GC   +         G+ G GR   SL     SQ+   +FS+CLV  
Sbjct: 249 LENKR--VPDFLVGCSVMSV----HQPAGIAGFGRGPESLP----SQMRLKRFSHCLVSR 298

Query: 182 -HTDSSITSKMYFGNGSEVSGGGVVS---------TSLVSKEDKTYYFVTLEGISVGNLS 231
              DS ++S +   +GSE       S          S+ +   + YY+++L  I +G   
Sbjct: 299 GFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIG--- 355

Query: 232 NSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPR 287
                 PY Y    +   G   ID+G+  T L K  +  + +++   +   P     + +
Sbjct: 356 GKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQ 415

Query: 288 LGSQLCYKTPSMAGIA--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG 345
            G + C+  P     A  P +   F GG K+ L   +       EGV C  M   +  VG
Sbjct: 416 SGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVG 475

Query: 346 -------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                  I G F Q ++ + YD   Q + F+   CT
Sbjct: 476 GGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 102/415 (24%), Positives = 166/415 (40%), Gaps = 74/415 (17%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYN----PASSSSYK 76
           +Y + F++ + P   +   +DTGSDL+W  C P  C+ C  + +        P  SS+ +
Sbjct: 81  DYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTAR 140

Query: 77  ELSCQSEQCHL--------------------LDTVSCSSQQLCNYTYGYADSSLTKGVLA 116
            + C+S  C                      ++T  C S    ++ Y Y D SL   +  
Sbjct: 141 SVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYH 200

Query: 117 TE-RITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGAN 172
              ++     +    N  FGC H          +G+ G GR  LSL +Q+ S   QLG N
Sbjct: 201 DSIKLPLATPSLSLHNFTFGCAHTALA----EPVGVAGFGRGVLSLPAQLASFAPQLG-N 255

Query: 173 KFSYCLV--PFHTDS-SITSKMYFGNGSE----VSGGGV--VSTSLVSKEDKTYYF-VTL 222
           +FSYCLV   F++D   + S +  G+  +    V+   V  V TS++      Y++ V L
Sbjct: 256 RFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGL 315

Query: 223 EGISVGNLSNSSKLIP---YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI- 278
           EGIS+G      K IP   +         G + +D+G   T+LP   YN +  +  N + 
Sbjct: 316 EGISIGK-----KKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVG 370

Query: 279 ---KLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---- 331
              +     + + G   CY   ++  I P L  HF G     ++    +    ++G    
Sbjct: 371 RVYERAKEVEDKTGLGPCYYYDTVVNI-PSLVLHFVGNESSVVLPKKNYFYDFLDGGDGV 429

Query: 332 -----VFCFAM-------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                V C  +       +   G     GN+ Q    + YD + + V F    C 
Sbjct: 430 RRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 172/386 (44%), Gaps = 60/386 (15%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G PP  +I  ++DTGS+L W+ C    +    +  ++NP SSS+Y  + C
Sbjct: 62  NVTLTVTLAVGDPPQ-NISMVLDTGSELSWLHC----KKSPNLGSVFNPVSSSTYSPVPC 116

Query: 81  QSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S  C      L    SC  +  LC+    YAD++  +G LA E    G+        +F
Sbjct: 117 SSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR--PGTLF 174

Query: 135 GC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           GC   G ++    +    GL+G+ R  LS     ++QLG +KFSYC+    + S  +  +
Sbjct: 175 GCMDSGLSSNSEEDAKSTGLMGMNRGSLSF----VNQLGFSKFSYCI----SGSDSSGFL 226

Query: 192 YFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSG 244
             G+ S    G +  T LV +       D+  Y V LEGI VG+ + +  K +   + +G
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286

Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQ----LCYKT 296
           A   G   +D+G   T L    Y  L+     Q ++ ++L    DP    Q    LCYK 
Sbjct: 287 A---GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLV--DDPDFVFQGTMDLCYKV 341

Query: 297 -----PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--- 342
                P+ +G+ P+++  F G      G K+ L   +       E V+CF     D    
Sbjct: 342 GSTTRPNFSGL-PMVSLMFRGAEMSVSGQKL-LYRVNGAGSEGKEEVYCFTFGNSDLLGI 399

Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSF 368
           +  + G+  Q ++++ +D     V F
Sbjct: 400 EAFVIGHHHQQNVWMEFDLAKSRVGF 425


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 158/384 (41%), Gaps = 57/384 (14%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G P   +    +DTGSD++WV C PC  C           +++   SSS +
Sbjct: 82  GLYFTKVKLGNPAR-EFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSAR 140

Query: 77  ELSCQSEQCHLLDTVS--CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNSN--N 127
            L C    C  + T +  C +Q   C+Y++ Y D S T G   T+ + F    G S   N
Sbjct: 141 VLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN 200

Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
               +VFGC     G          G+ G G+   S+ SQ+ S+ +    FS+CL     
Sbjct: 201 SSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL----- 255

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKE--------DKTYYFVTLEGISV-GNLSNSS 234
                       G E  GG +V   ++            + +Y + L+ I++ G L  + 
Sbjct: 256 -----------KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNP 304

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
            + P  N+      G   ID+G     L ++ Y+ +   + +A+  +       GSQ   
Sbjct: 305 TMFPISNA------GETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFR 358

Query: 295 KTPSMAGIAPILTAHFDGGAKVPL-----IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGN 349
            + S+A I P+L  +F+G A + +     +   + +  P   ++C   Q  +  + I G+
Sbjct: 359 VSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVREP--ALWCIGFQKAEDGLNILGD 416

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
               D  I YD   Q + +   DC
Sbjct: 417 LVLKDKIIVYDLARQRIGWANYDC 440


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 138/309 (44%), Gaps = 40/309 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++GTPP   +  ++DTGS+L W+ C    +  + +  ++NP  SSSY  + C
Sbjct: 67  NVTLTVSLTVGTPPQ-SVTMVLDTGSELSWLHC----KKQQNINSVFNPHLSSSYTPIPC 121

Query: 81  QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C       L  VSC S  LC+ T  YAD +  +G LA++  TF  S +    ++FG
Sbjct: 122 MSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASD--TFAISGSGQPGIIFG 179

Query: 136 ---CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
               G ++    +    GL+G+ R  LS     ++Q+G  KFSYC+    +    +  + 
Sbjct: 180 SMDSGFSSNANEDSKTTGLMGMNRGSLSF----VTQMGFPKFSYCI----SGKDASGVLL 231

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS-NSSKLIPYYNSSGA 245
           FG+ +    G +  T LV         D+  Y V L GI VG+      K I   + +GA
Sbjct: 232 FGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGA 291

Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTPSM 299
              G   +D+G   T L    Y  L  +     +  LT  +DP         LC++    
Sbjct: 292 ---GQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRV-RR 347

Query: 300 AGIAPILTA 308
            G+ P + A
Sbjct: 348 GGVVPAVPA 356


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 170/397 (42%), Gaps = 64/397 (16%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK----PIYNPASSS 73
           G Y +  + GTPP    + ++DTGS L+W  C     C +C +  ++    P + P  SS
Sbjct: 90  GGYSISLNFGTPPQTTKF-VMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSS 148

Query: 74  SYKELSCQSEQCHLL------------DTVSCSSQQLCN-YTYGYADSSLTKGVLATERI 120
           S   + C++ +C  L            D  + +  Q C  Y   Y   S T G+L +E +
Sbjct: 149 SSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETL 207

Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFN-ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
            F +        + GC      +F+     G+ G GR+  SL SQ    LG  KFSYCLV
Sbjct: 208 DFPHKKTI-PGFLVGCS-----LFSIRQPEGIAGFGRSPESLPSQ----LGLKKFSYCLV 257

Query: 180 PFHTDSSITSK---MYFGNGSEVSGGGVVSTSLVSKED----KTYYFVTLEGISVGNLSN 232
               D +  S    +  G+GS+ +    +S +   K      + YY+V L  I +G   +
Sbjct: 258 SHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIG---D 314

Query: 233 SSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPR 287
           +   +PY +   G+   G   +D+G   T + K  Y       E+QV +    T  Q+ +
Sbjct: 315 THVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQN-Q 373

Query: 288 LGSQLCYKTPSMAGIA-PILTAHFDGGAK--VPLIHTSTFIPPPVEGVFCFAMQPID--- 341
            G + C+       ++ P    HF GGAK  +PL +  +F+     GV C  +   +   
Sbjct: 374 TGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVD---SGVICLTIVSDNMSG 430

Query: 342 -----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
                G   I GN+ Q +  + +D  ++   FK  +C
Sbjct: 431 SGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|125552158|gb|EAY97867.1| hypothetical protein OsI_19787 [Oryza sativa Indica Group]
          Length = 477

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 157/368 (42%), Gaps = 43/368 (11%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSS 74
           +T  G Y++   +GTPP   +YG  D  S  +WV C  CV    C      +Y       
Sbjct: 81  ATTGGTYLITVGVGTPPQY-VYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPREL 139

Query: 75  YKELSCQSEQCH-LLDTVSCSS--QQLCNYTYGYADSSLTKGV--LATERITFGNSNNFF 129
           Y   SC  ++C  ++    C +     C YT  Y  +  T+    L  +  T G+ N   
Sbjct: 140 Y---SCGEQRCRTIVGQPDCGAPYNGPCKYTCRYGGAGGTETEGHLGLQPFTLGD-NTMP 195

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI-- 187
            N++FGCG        E   G++GL R RLSL SQ+  QLG  +FSY   P + D++   
Sbjct: 196 VNMIFGCGLE-----PETNFGVIGLNRGRLSLISQL--QLG--RFSYYFAPEYDDTAAGN 246

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---YFVTLEGISVGNLSNSSKLIPYYNSSG 244
            S + FG  +         T   S E+  Y   Y V L G+ VG  SN+  ++      G
Sbjct: 247 ASFILFGEYAVPQTSNPRYTQFWSYENGAYSYLYLVGLSGMRVG--SNNLNML------G 298

Query: 245 AISKGN----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           A S G      ++ T  P T L K+ Y+ L  ++ + +         LG  LCY +  +A
Sbjct: 299 AGSGGRDPLVAYLSTSVPVTFLEKNAYDLLRRELVSTVGSDTVDGSALGLDLCYTSQYLA 358

Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFI 357
               P +   F  GA + L   +        G+ C  + P  + G + + G+  Q+   +
Sbjct: 359 KAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVAGGLSLLGSLIQTGTHM 418

Query: 358 GYDFDSQM 365
            Y +D Q+
Sbjct: 419 MY-YDIQI 425


>gi|115463625|ref|NP_001055412.1| Os05g0384300 [Oryza sativa Japonica Group]
 gi|50511407|gb|AAT77330.1| unknown protein [Oryza sativa Japonica Group]
 gi|113578963|dbj|BAF17326.1| Os05g0384300 [Oryza sativa Japonica Group]
 gi|222631434|gb|EEE63566.1| hypothetical protein OsJ_18383 [Oryza sativa Japonica Group]
          Length = 477

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 157/368 (42%), Gaps = 43/368 (11%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSS 74
           +T  G Y++   +GTPP   +YG  D  S  +WV C  CV    C      +Y       
Sbjct: 81  ATTGGTYLITVGVGTPPQY-VYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPREL 139

Query: 75  YKELSCQSEQCH-LLDTVSCSS--QQLCNYTYGYADSSLTKGV--LATERITFGNSNNFF 129
           Y   SC  ++C  ++    C +     C YT  Y  +  T+    L  +  T G+ N   
Sbjct: 140 Y---SCGEQRCRTIVGQPDCGAPYNGPCKYTCRYGGAGGTETEGHLGLQPFTLGD-NTMP 195

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI-- 187
            N++FGCG        E   G++GL R RLSL SQ+  QLG  +FSY   P + D++   
Sbjct: 196 VNMIFGCGLE-----PETNFGVIGLNRGRLSLISQL--QLG--RFSYYFAPEYDDTAAGN 246

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---YFVTLEGISVGNLSNSSKLIPYYNSSG 244
            S + FG  +         T   S E+  Y   Y V L G+ VG  SN+  ++      G
Sbjct: 247 ASFILFGEYAVPQTSNPRYTQFWSYENGAYSYLYLVGLSGMRVG--SNNLNML------G 298

Query: 245 AISKGN----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           A S G      ++ T  P T L K+ Y+ L  ++ + +         LG  LCY +  +A
Sbjct: 299 AGSGGRDPLVAYLSTSVPITFLEKNAYDLLRRELVSTVGSDTVDGSALGLDLCYTSQYLA 358

Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFI 357
               P +   F  GA + L   +        G+ C  + P  + G + + G+  Q+   +
Sbjct: 359 KAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVAGGLSLLGSLIQTGTHM 418

Query: 358 GYDFDSQM 365
            Y +D Q+
Sbjct: 419 MY-YDIQI 425


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 90/369 (24%), Positives = 156/369 (42%), Gaps = 35/369 (9%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           NG Y  +  IGTPP      IVDTGS + +V C  C  C     P + P  S +Y+ + C
Sbjct: 90  NGYYTARLWIGTPPQ-RFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC 148

Query: 81  QSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
            + QC      +C + ++ C Y   YA+ S + G L  + ++FGN         +FGC +
Sbjct: 149 -TWQC------NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCEN 201

Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILS-QLGANKFSYCLVPFHTDSSITSKMYFGNG 196
           + TG ++N+   G++GLGR  LS+  Q++  ++ ++ FS C   +         M  G  
Sbjct: 202 DETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLC---YGGMGVGGGAMVLGGI 258

Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
           S  +      +  V      YY + L+ I V       +L  + N      K    +D+G
Sbjct: 259 SPPADMVFTRSDPVR---SPYYNIDLKEIHVAG----KRL--HLNPKVFDGKHGTVLDSG 309

Query: 257 APPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYK-----TPSMAGIAPILTA 308
                LP+  +   +  +    +++K     DPR  + +C+         ++   P++  
Sbjct: 310 TTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRY-NDICFSGAEIDVSQISKSFPVVEM 368

Query: 309 HFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMV 366
            F  G K+ L      F    V G +C  +     D   + G     +  + YD +   +
Sbjct: 369 VFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKI 428

Query: 367 SFKPTDCTK 375
            F  T+C++
Sbjct: 429 GFWKTNCSE 437


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 166/383 (43%), Gaps = 47/383 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
           G Y  +  +G P   + +  +DTGSD++WV C PC  C        Q++  +NP SSS+ 
Sbjct: 89  GLYFTRVKLGNPAK-EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLES-FNPDSSSTA 146

Query: 76  KELSCQSEQC-------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
             ++C  ++C         +   S S    C YT+ Y D S T G   ++ + F    GN
Sbjct: 147 SRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 206

Query: 125 SN--NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYC 177
               N   ++VFGC ++ +G   + +    G+ G G+ +LS+ SQ L+ LG +   FS+C
Sbjct: 207 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQ-LNSLGVSPKVFSHC 265

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSK 235
           L        I   +  G   E+   G+V T LV  +   +Y + LE I+V    L   S 
Sbjct: 266 LKGSDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIAVNGQKLPIDSS 317

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
           L    N+ G I      +D+G     L    Y+     +  A+  +       GSQ    
Sbjct: 318 LFTTSNTQGTI------VDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFIT 371

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFI-PPPVEG--VFCFAMQPIDG-DVGIFGNFA 351
           + S+    P +T +F GG  + +   +  +    V+   ++C   Q   G ++ I G+  
Sbjct: 372 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 431

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
             D    YD  +  + +   DC+
Sbjct: 432 LKDKIFVYDLANMRMGWADYDCS 454


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 166/383 (43%), Gaps = 47/383 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
           G Y  +  +G P   + +  +DTGSD++WV C PC  C        Q++  +NP SSS+ 
Sbjct: 87  GLYFTRVKLGNPAK-EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLES-FNPDSSSTA 144

Query: 76  KELSCQSEQC-------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
             ++C  ++C         +   S S    C YT+ Y D S T G   ++ + F    GN
Sbjct: 145 SRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 204

Query: 125 SN--NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYC 177
               N   ++VFGC ++ +G   + +    G+ G G+ +LS+ SQ L+ LG +   FS+C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQ-LNSLGVSPKVFSHC 263

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSK 235
           L        I   +  G   E+   G+V T LV  +   +Y + LE I+V    L   S 
Sbjct: 264 LKGSDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIAVNGQKLPIDSS 315

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
           L    N+ G I      +D+G     L    Y+     +  A+  +       GSQ    
Sbjct: 316 LFTTSNTQGTI------VDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFIT 369

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFI-PPPVEG--VFCFAMQPIDG-DVGIFGNFA 351
           + S+    P +T +F GG  + +   +  +    V+   ++C   Q   G ++ I G+  
Sbjct: 370 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 429

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
             D    YD  +  + +   DC+
Sbjct: 430 LKDKIFVYDLANMRMGWADYDCS 452


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 113/408 (27%), Positives = 168/408 (41%), Gaps = 59/408 (14%)

Query: 10  NNVVQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK 64
           N+V +S +S  + G Y    S GTP    ++ I DTGS L+W  C     C +C + ++ 
Sbjct: 66  NSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKID 124

Query: 65  PI----YNPASSSSYKELSCQSEQCHLLDTVSCSSQ-QLCN------------YTYGYAD 107
           P     + P  SSS K + CQ+ +C  +      SQ + CN            Y   Y  
Sbjct: 125 PTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS 184

Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
            S T G+L +E + F +      N V GC   +         G+ G GR   SL SQ   
Sbjct: 185 GS-TAGLLLSETLDFPDKK--IPNFVVGCSFLSI----HQPSGIAGFGRGSESLPSQ--- 234

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVST------SLVSKEDKTYYFVT 221
            +G  KF+YCL     D S  S     + + V   G+  T      S+ +   K YY++ 
Sbjct: 235 -MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLN 293

Query: 222 LEGISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRN 276
           +  I VG   N +  +PY +   G    G   ID+G+  T + K          E+Q+ N
Sbjct: 294 IRKIIVG---NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 350

Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
             + T  +    G + C+       +  P L   F GGAK  L   + F      GV C 
Sbjct: 351 WTRATDVET-LTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACL 409

Query: 336 AM---QPIDGDVG------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            +   Q  DG  G      I G F Q + ++ YD  +Q + F+   C+
Sbjct: 410 TVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 166/383 (43%), Gaps = 47/383 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
           G Y  +  +G P   + +  +DTGSD++WV C PC  C        Q++  +NP SSS+ 
Sbjct: 3   GLYFTRVKLGNPAK-EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLES-FNPDSSSTA 60

Query: 76  KELSCQSEQC-------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
             ++C  ++C         +   S S    C YT+ Y D S T G   ++ + F    GN
Sbjct: 61  SRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 120

Query: 125 SN--NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYC 177
               N   ++VFGC ++ +G   + +    G+ G G+ +LS+ SQ L+ LG +   FS+C
Sbjct: 121 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQ-LNSLGVSPKVFSHC 179

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSK 235
           L        I   +  G   E+   G+V T LV  +   +Y + LE I+V    L   S 
Sbjct: 180 LKGSDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIAVNGQKLPIDSS 231

Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
           L    N+ G I      +D+G     L    Y+     +  A+  +       GSQ    
Sbjct: 232 LFTTSNTQGTI------VDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFIT 285

Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFI-PPPVEG--VFCFAMQPIDG-DVGIFGNFA 351
           + S+    P +T +F GG  + +   +  +    V+   ++C   Q   G ++ I G+  
Sbjct: 286 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 345

Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
             D    YD  +  + +   DC+
Sbjct: 346 LKDKIFVYDLANMRMGWADYDCS 368


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 164/381 (43%), Gaps = 47/381 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  +  IG+PP  D +  VDTGSD++WV C+ C  C K+        +YNP SSS+  
Sbjct: 71  GLYYARIGIGSPPN-DFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129

Query: 77  ELSCQSEQCHLLDTV---SCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
            ++C    C          C    LC Y   Y D S T G    + I      GN     
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189

Query: 130 DN--VVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
            N  +VFGCG   +G     +E   G++G G+   S+ SQ+ +     K F++CL     
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL----- 244

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
             SI+    F  G EV    + +T +V   ++ +Y V L G+ VG+ +    L  +  S 
Sbjct: 245 -DSISGGGIFAIG-EVVEPKLXNTPVVP--NQAHYNVVLNGVKVGDTALDLPLGLFETS- 299

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYK-TPSM 299
               K    ID+G     LP+  Y  L E++  A   +KL    D       C+    ++
Sbjct: 300 ---YKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDD----QFTCFVFDKNV 352

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG-DVGIFGNFAQS 353
               P +T  F+  + +  I+   ++    + V+C        Q  DG +V + G+    
Sbjct: 353 DDGFPTVTFKFE-ESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           +  + Y+ ++Q + +   +C+
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 101/385 (26%), Positives = 153/385 (39%), Gaps = 52/385 (13%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
           +G+Y     IG PP    LD    VDTGSDL W+QC  PC  C K   P+Y PA      
Sbjct: 184 DGQYYTSIFIGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 239

Query: 76  -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
            ++L CQ  Q    +   C + + C+Y   YAD S + GVLA + +    +N   + +  
Sbjct: 240 PRDLLCQELQG---NQNYCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDF 296

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           VFGC ++  G    +     G++GL    +S  SQ+ S  + AN F +C+     +    
Sbjct: 297 VFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCIT---REQGGG 353

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   GV  TS+ S  D  Y+      +  G+               A S 
Sbjct: 354 GYMFLGD-DYVPRWGVTWTSIRSGPDNLYH-TQAHHVKYGDQQ-------LRRPEQAGST 404

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
             +  D+G+  T LP + Y  L   ++ A              LC+K          +  
Sbjct: 405 VQVIFDSGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWKADFPVRYLEDVKQ 464

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAM----QPIDGDVGIFGNF 350
               L  HF    K  L  + TF   P +       G  C  +    +   G   I G+ 
Sbjct: 465 FFEPLNLHF---GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
           +     + YD   + + +  +DCTK
Sbjct: 522 SLRGKLVVYDNQRKQIGWADSDCTK 546


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 161/381 (42%), Gaps = 41/381 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-----YKQVKPIYNPASSSSYK 76
           G Y  +  +G+PP  D Y  +DTGSD++WV C  C  C      +     ++P SS++  
Sbjct: 82  GLYFTRVQLGSPPK-DFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAA 140

Query: 77  ELSCQSEQC----HLLDTVSCSSQQLCNYTYGYADSSLTKGV-------LATERITFGN- 124
            +SC  ++C       D++  S    C YT+ Y D S T G        L T  ++ G  
Sbjct: 141 LVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGEL 200

Query: 125 ---SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYC 177
                 +  +V F C    TG   +++    G+ G G+  +S+ SQ+ SQ +    FS+C
Sbjct: 201 SQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHC 260

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L     D S    +  G   E+    +V T LV  +   +Y + L+ ISV     +  + 
Sbjct: 261 L---KGDDSGGGVLVLG---EIVEPNIVYTPLVPSQP--HYNLYLQSISVAG--QTLAID 310

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
           P  +  GA S     +D+G     L +  Y+     + + + L        G+Q    T 
Sbjct: 311 P--SVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQCYLVTS 368

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDG-DVGIFGNFAQS 353
           S+  + P ++ +F GGA + L      +     G   V+C   Q   G  + I G+    
Sbjct: 369 SVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLK 428

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           D    YD  +Q V +   DC+
Sbjct: 429 DKIFVYDIANQRVGWTNYDCS 449


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 113/408 (27%), Positives = 168/408 (41%), Gaps = 59/408 (14%)

Query: 10  NNVVQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK 64
           N+V +S +S  + G Y    S GTP    ++ I DTGS L+W  C     C +C + ++ 
Sbjct: 66  NSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKID 124

Query: 65  PI----YNPASSSSYKELSCQSEQCHLLDTVSCSSQ-QLCN------------YTYGYAD 107
           P     + P  SSS K + CQ+ +C  +      SQ + CN            Y   Y  
Sbjct: 125 PTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS 184

Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
            S T G+L +E + F   +    N V GC   +         G+ G GR   SL SQ   
Sbjct: 185 GS-TAGLLLSETLDF--PDKXIPNFVVGCSFLSI----HQPSGIAGFGRGSESLPSQ--- 234

Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVST------SLVSKEDKTYYFVT 221
            +G  KF+YCL     D S  S     + + V   G+  T      S+ +   K YY++ 
Sbjct: 235 -MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLN 293

Query: 222 LEGISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRN 276
           +  I VG   N +  +PY +   G    G   ID+G+  T + K          E+Q+ N
Sbjct: 294 IRKIIVG---NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 350

Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
             + T  +    G + C+       +  P L   F GGAK  L   + F      GV C 
Sbjct: 351 WTRATDVET-LTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACL 409

Query: 336 AM---QPIDGDVG------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            +   Q  DG  G      I G F Q + ++ YD  +Q + F+   C+
Sbjct: 410 TVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 162/383 (42%), Gaps = 48/383 (12%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
           +G+Y     +G PP    LD    VDTGSDL W+QC  PC  C K   P+Y PA      
Sbjct: 200 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 255

Query: 76  -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
            K+L CQ  Q +      C + + C+Y   YAD S + GVLA + +    +N   + +  
Sbjct: 256 PKDLLCQELQGN---QNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDF 312

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           VFGC ++  G    +     G++GL    +SL SQ+ +Q + +N F +C+     D +  
Sbjct: 313 VFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCIT---RDPNGG 369

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   G+ ST + S  D  ++    + +  G+   S +      +SG  + 
Sbjct: 370 GYMFLGD-DYVPRWGMTSTPIRSAPDNLFH-TEAQKVYYGDQQLSMR-----GASG--NS 420

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
             +  D+G+  T LP + Y  L   ++ A              LC  T         +  
Sbjct: 421 VQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQ 480

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPV-----EGVFCFAM---QPID-GDVGIFGNFAQ 352
           +   L  HF G     +  T T +P        +G  C      + ID G   I G+ A 
Sbjct: 481 LFKPLNLHF-GKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNAL 539

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
               + YD   + + +  +DCTK
Sbjct: 540 RGKLVVYDNQQRQIGWTNSDCTK 562


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 162/383 (42%), Gaps = 48/383 (12%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
           +G+Y     +G PP    LD    VDTGSDL W+QC  PC  C K   P+Y PA      
Sbjct: 201 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 256

Query: 76  -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
            K+L CQ  Q +      C + + C+Y   YAD S + GVLA + +    +N   + +  
Sbjct: 257 PKDLLCQELQGN---QNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDF 313

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           VFGC ++  G    +     G++GL    +SL SQ+ +Q + +N F +C+     D +  
Sbjct: 314 VFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCIT---RDPNGG 370

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   G+ ST + S  D  ++    + +  G+   S +      +SG  + 
Sbjct: 371 GYMFLGD-DYVPRWGMTSTPIRSAPDNLFH-TEAQKVYYGDQQLSMR-----GASG--NS 421

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
             +  D+G+  T LP + Y  L   ++ A              LC  T         +  
Sbjct: 422 VQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQ 481

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPV-----EGVFCFAM---QPID-GDVGIFGNFAQ 352
           +   L  HF G     +  T T +P        +G  C      + ID G   I G+ A 
Sbjct: 482 LFKPLNLHF-GKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNAL 540

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
               + YD   + + +  +DCTK
Sbjct: 541 RGKLVVYDNQQRQIGWTNSDCTK 563


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/387 (25%), Positives = 169/387 (43%), Gaps = 53/387 (13%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSS--S 74
           NG Y     +G+PP    LD+    DTGSDL W+QC  PC  C K   P+Y P   +   
Sbjct: 98  NGLYFTHIFVGSPPRRYFLDM----DTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVP 153

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN--V 132
            K+  C   Q +L  T  C + + C+Y   YAD S + GVLA++ +    +N       +
Sbjct: 154 LKDSLCVEVQRNL-KTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGI 212

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           +FGC ++  G+   +     G++GL + ++SL SQ+ SQ +  N   +CL    +D++  
Sbjct: 213 MFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLT---SDATGG 269

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   G+    +++     Y+   ++      +S+ S+ +      G   +
Sbjct: 270 GYMFLGD-DFVPYWGMAWVPMLNSHSPNYHSQIMK------ISHGSRQLSLGRQDGRTER 322

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTP----SMAG 301
             +  DTG+  T  PK+ Y  L   +++       Q   DP L   +C++      S+  
Sbjct: 323 --VVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTL--PVCWRAKFPIRSVID 378

Query: 302 IAPI---LTAHFDGGAKVPLIHTSTFIPPP------VEGVFCFAM----QPIDGDVGIFG 348
           +      LT  F   +K  ++ T   IPP        +G  C  +       DG   I G
Sbjct: 379 VKQFFQPLTLQFR--SKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILG 436

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           + +     + YD  +Q + +  + C K
Sbjct: 437 DISLRGKLVVYDNVNQKIGWAQSTCVK 463


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 81/271 (29%), Positives = 120/271 (44%), Gaps = 32/271 (11%)

Query: 16  NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
           N+    G Y     IGTP +   Y  +DTGS   WV  + C QC  +   +     Y+P 
Sbjct: 75  NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 133

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
           SS S KE+ C    C       C+    C Y  GYAD  LT G+L T+ + +      G 
Sbjct: 134 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLV 179
           +     +V FGCG   +G  N + +   G++G G +  +  SQ L+  G  K  FS+CL 
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQ-LAAAGKTKKIFSHCL- 249

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
                 S      F  G  V     V T+ + K ++ Y+ V L+ I   N++ ++  +P 
Sbjct: 250 -----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP- 298

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRL 270
            N  G       FID+G+    LP+  Y+ L
Sbjct: 299 ANIFGTTKTKGTFIDSGSTLVYLPEIIYSEL 329


>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
          Length = 378

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 136/370 (36%), Gaps = 73/370 (19%)

Query: 54  LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCN-YTYGYADSSL 110
           +PC        P+ + A +S+     C   +C L D  T SC +   C    Y Y D SL
Sbjct: 24  IPCAS------PLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSL 77

Query: 111 TKGVLATERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI 165
               L   R+  G           DN  F C H   G      +G+ G GR  LSL  Q+
Sbjct: 78  VAH-LRRGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQL 132

Query: 166 LSQLGANKFSYCLVP--FHTDSSIT-SKMYFGNG-----SEVSGGGVVSTSLVSKEDKTY 217
             QL + +FSYCLV   F  D  I  S +  G       +     G V T L+      Y
Sbjct: 133 SPQL-SGRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPY 191

Query: 218 YF-VTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE----- 271
           ++ V LE +SVG     ++  P          G M +D+G   T+LP + Y R+      
Sbjct: 192 FYSVALEAVSVGAARIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFAR 249

Query: 272 ----------EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHT 321
                     E+      LTP          CY+  +     P L  HF G A V L   
Sbjct: 250 AMAAAGFARAERAEEQTGLTP----------CYRYAASDRGVPPLALHFRGNATVALPRR 299

Query: 322 STFIPPPVEG---------VFCFAM--------QPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
           + F+    E          V C  +        +  DG  G  GNF Q    + YD D+ 
Sbjct: 300 NYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAG 359

Query: 365 MVSFKPTDCT 374
            V F    CT
Sbjct: 360 RVGFARRRCT 369


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/289 (29%), Positives = 125/289 (43%), Gaps = 19/289 (6%)

Query: 90  TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
           T  CS    C Y   Y D S T G  A + +T  +S++      FGCG  N G+F E   
Sbjct: 13  TRGCSGGH-CLYGVQYGDGSYTIGFFAMDTLTL-SSHDAIKGFRFGCGERNEGLFGE-AA 69

Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS- 208
           GL+GLGR + SL  Q   + G   F++C   F   SS T  + FG GS  +    +ST+ 
Sbjct: 70  GLLGLGRGKTSLPVQTYDKYG-GVFAHC---FPARSSGTGYLEFGPGSSPAVSAKLSTTP 125

Query: 209 LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
           ++     T+Y+V + GI VG      KL+P   S    +     +D+G   T LP   Y+
Sbjct: 126 MLIDTGPTFYYVGMTGIRVGG-----KLLPIPQS--VFAAAGTIVDSGTVITRLPPAAYS 178

Query: 269 RLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFI 325
            L      ++    Y+     S L  CY     + +A P ++  F GG  + +  +    
Sbjct: 179 SLRSAFAASMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIY 238

Query: 326 PPPV-EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              V +    FA      DV I GN       + YD  S++V F P  C
Sbjct: 239 AASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 441

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 117/436 (26%), Positives = 174/436 (39%), Gaps = 95/436 (21%)

Query: 17  VSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWV--------QCLPCVQCYKQVKPIY 67
           ++T    Y++  ++GTPP +  +Y  +DTGSDL WV        QCL C   +   KP  
Sbjct: 18  IATYTDGYLLSLNLGTPPQVFQVY--LDTGSDLTWVPCGTNTSYQCLECGNEHSISKP-- 73

Query: 68  NPA-----SSSSYKEL-----------------SCQSEQCHLLDTVSCSSQQLCN-YTYG 104
            PA     S SS ++L                 +C +  C +   +S    +LC  + Y 
Sbjct: 74  TPAFSLSQSYSSTRDLCGSRFCVDVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPPFAYT 133

Query: 105 YADSSLTKGVLATERITFGNS------NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTR 158
           Y   +L  G LA + I    S         F    FGC     G      +G+ G G+ +
Sbjct: 134 YGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGC----VGSSIREPIGIAGFGKGK 189

Query: 159 LSLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGN-GSEVSGGGVVSTSLVSKEDK 215
           LSL SQ+        FS+C + F    + +ITS M  G+    V  G + +  L S    
Sbjct: 190 LSLPSQL--GFLDKGFSHCFLGFWFARNPNITSPMVIGDLALSVKDGFLFTPMLKSLTYP 247

Query: 216 TYYFVTLEGISVGNLSNSSKLIPYYNS-SGAISKGN--MFIDTGAPPTLLPKDFYNRLEE 272
            +Y++ LEG+++G+    +  IP   S SG  S+GN  + +DTG   T L   FY     
Sbjct: 248 NFYYIGLEGVTIGD----NAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDPFY---AS 300

Query: 273 QVRNAIKLTPYQ-----DPRLGSQLCYKTPSMAGIA-----PILTAHFDGGAKVPLIHTS 322
            + +     PY      + R G  LC K P M         P +T H  G   + L   S
Sbjct: 301 VLSSLSSTVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKES 360

Query: 323 TF--IPPPVEGVF--CFAMQPIDGD--------------------VGIFGNFAQSDLFIG 358
            +  +  P   V   C   Q  D D                      + G+F   ++ + 
Sbjct: 361 CYYAVTAPRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVV 420

Query: 359 YDFDSQMVSFKPTDCT 374
           YD +S  V F+P DC 
Sbjct: 421 YDLESGRVGFQPRDCA 436


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 162/383 (42%), Gaps = 53/383 (13%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPA 70
           NG Y  +  IGTP   +   IVD+GS + +V C  C QC           +   P + P 
Sbjct: 89  NGYYTTRLYIGTPSQ-EFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPD 147

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF- 129
            SS+Y  + C       +D    + +  C Y   YA+ S + GVL  + ++FG  +    
Sbjct: 148 LSSTYSPVKCN------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 201

Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSI 187
              VFGC +  TG +F+++  G++GLGR +LS+  Q++ + + ++ FS C          
Sbjct: 202 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY--------- 252

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIP-YYNS 242
              M  G G+ V GG      +V          YY + L+ I V     + +L P  +N 
Sbjct: 253 -GGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVA--GKALRLDPKIFN- 308

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY----- 294
               SK    +D+G     LP+  +   ++ V    N++K     DP     +C+     
Sbjct: 309 ----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNY-KDICFAGAGR 363

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQ 352
               ++ + P +   F  G K+ L      F    VEG +C  + Q       + G    
Sbjct: 364 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVV 423

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
            +  + YD  ++ + F  T+C++
Sbjct: 424 RNTLVTYDRHNEKIGFWKTNCSE 446


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/261 (30%), Positives = 129/261 (49%), Gaps = 22/261 (8%)

Query: 24  YVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           Y+++ +IGTP  P+L     +DT +D  WV C  CV C   V  +++P+ SSS + L C 
Sbjct: 91  YIVRANIGTPAQPMLVA---LDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSSRNLQCD 145

Query: 82  SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
           + QC      +C++ + C +   Y  S++ +  L  + +T   +N+   +  FGC    T
Sbjct: 146 APQCKQAPNPTCTAGKSCGFNMTYGGSTI-EASLTQDTLTL--ANDVIKSYTFGCISKAT 202

Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
           G  +    GL+GLGR  LSL SQ    L  + FSYCL P    S+ +  +    G +   
Sbjct: 203 GT-SLPAQGLMGLGRGPLSLISQT-QNLYMSTFSYCL-PNSKSSNFSGSLRL--GPKYQP 257

Query: 202 GGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
             + +T L+    + + Y+V L GI VGN  +   +  + +  S+GA   G +F D+G  
Sbjct: 258 VRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGA---GTIF-DSGTV 313

Query: 259 PTLLPKDFYNRLEEQVRNAIK 279
            T L +  Y  +  + R  IK
Sbjct: 314 FTRLVEPAYVAVRNEFRRRIK 334


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 102/396 (25%), Positives = 170/396 (42%), Gaps = 53/396 (13%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVKPI----YNPASSS 73
           G Y +  ++GTPP    + ++DTGS L+W  C     C  C +  + P     + P +SS
Sbjct: 86  GGYSIDLNLGTPPQTSPF-VLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSS 144

Query: 74  SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLT-------KGVLATERITFGNSN 126
           + K L C++ +C  L      S+       G  + SLT        G+ AT      ++ 
Sbjct: 145 TAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNL 204

Query: 127 NFFDNVV--FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
           NF    V  F  G +   +      G+ G GR + SL SQ    +   +FSYCLV    D
Sbjct: 205 NFPGKTVPQFLVGCSILSI--RQPSGIAGFGRGQESLPSQ----MNLKRFSYCLVSHRFD 258

Query: 185 SSITSK---MYFGNGSEVSGGGVVSTSLVSKED-----KTYYFVTLEGISVGNLSNSSKL 236
            +  S    +   +  +    G+  T   S        + YY+VTL  + VG +      
Sbjct: 259 DTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVK--- 315

Query: 237 IPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--- 292
           IPY +   G+   G   +D+G+  T + +  YN + ++    +     ++  + +Q    
Sbjct: 316 IPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLS 375

Query: 293 -CYKTPSMAGIA-PILTAHFDGGAKV--PLIHTSTFIPPPVEGVFCF-------AMQP-I 340
            C+    +  I+ P  T  F GGAK+  PL++  +F+      V CF       A QP  
Sbjct: 376 PCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGD--AEVLCFTVVSDGGAGQPKT 433

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
            G   I GN+ Q + ++ YD +++   F P +C ++
Sbjct: 434 AGPAIILGNYQQQNFYVEYDLENERFGFGPRNCKRK 469


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 57/376 (15%)

Query: 29  SIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKEL 78
           ++GTP   D + + +DTGSDL W+ C  C  C +++K          IY+P +SS+  ++
Sbjct: 60  TVGTPS--DWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 116

Query: 79  SCQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVV 133
            C S  C   D  + S +  C Y   Y ++ + + GVL  + +       +S      V 
Sbjct: 117 PCNSTLCTRGDRCA-SPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVT 175

Query: 134 FGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
           FGCG   TGVF++     GL GLG   +S+ S +  + + AN FS C   F  D +   +
Sbjct: 176 FGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGNDGA--GR 230

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           + FG+   V       T L  ++    Y +T+  ISVG             ++G +    
Sbjct: 231 ISFGDKGSVDQR---ETPLNIRQPHPTYNITVTKISVGG------------NTGDLEFDA 275

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCY--KTPSMAGIA--- 303
           +F D+G   T L    Y  + E   +      YQ  D  L  + CY  + P  +G     
Sbjct: 276 VF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPN 334

Query: 304 ------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
                 P +     GG+  P+ H    IP     V+C A+  I+ D+ I G    +   +
Sbjct: 335 KDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIE-DISIIGQNFMTGYRV 393

Query: 358 GYDFDSQMVSFKPTDC 373
            +D +  ++ +K +DC
Sbjct: 394 VFDREKLILGWKESDC 409


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 162/383 (42%), Gaps = 53/383 (13%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPA 70
           NG Y  +  IGTP   +   IVD+GS + +V C  C QC           +   P + P 
Sbjct: 88  NGYYTTRLYIGTPSQ-EFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPD 146

Query: 71  SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF- 129
            SS+Y  + C       +D    + +  C Y   YA+ S + GVL  + ++FG  +    
Sbjct: 147 LSSTYSPVKCN------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 200

Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSI 187
              VFGC +  TG +F+++  G++GLGR +LS+  Q++ + + ++ FS C          
Sbjct: 201 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY--------- 251

Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIP-YYNS 242
              M  G G+ V GG      +V          YY + L+ I V     + +L P  +N 
Sbjct: 252 -GGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVA--GKALRLDPKIFN- 307

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY----- 294
               SK    +D+G     LP+  +   ++ V    N++K     DP     +C+     
Sbjct: 308 ----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNY-KDICFAGAGR 362

Query: 295 KTPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQ 352
               ++ + P +   F  G K+ L      F    VEG +C  + Q       + G    
Sbjct: 363 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVV 422

Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
            +  + YD  ++ + F  T+C++
Sbjct: 423 RNTLVTYDRHNEKIGFWKTNCSE 445


>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
 gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
          Length = 410

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/404 (23%), Positives = 170/404 (42%), Gaps = 45/404 (11%)

Query: 1   MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQC-LPCVQ 58
           ++P   F+P+ V+   +      Y  +  +G P     Y + +DTGS+L W+QC  PC  
Sbjct: 10  LTPPLRFFPSVVMCIQMGML---YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTS 66

Query: 59  CYKQVKPIYNPASSSSYK--ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLA 116
           C K    +Y P   +  +  E  C   Q + L T  C +   C+Y   YAD S + GVL 
Sbjct: 67  CAKGANQLYKPRKDNLVRSSEAFCVEVQRNQL-TEHCENCHQCDYEIEYADHSYSMGVLT 125

Query: 117 TER--ITFGNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LG 170
            ++  +   N +    ++VFGCG++  G+     +   G++GL R ++SL SQ+ S+ + 
Sbjct: 126 KDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGII 185

Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
           +N   +CL      S +  + Y   GS+ V   G+    ++       Y + +  +S G 
Sbjct: 186 SNVVGHCLA-----SDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQ 240

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
                 ++     +G +  G +  DTG+  T  P   Y++L   ++    L   +D    
Sbjct: 241 -----GMLSLDGENGRV--GKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDE 293

Query: 290 SQ-LCYKTP------SMAGIAPILT-AHFDGGAKVPLIHTSTFIPPPV------EGVFCF 335
           +  +C++        S++ +           G+K  +I     I P        +G  C 
Sbjct: 294 TLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCL 353

Query: 336 AM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            +       DG   I G+ +     I YD   + + +  +DC +
Sbjct: 354 GILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 397


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 84/272 (30%), Positives = 128/272 (47%), Gaps = 35/272 (12%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPI----YNPASSSS 74
           A G Y  + S+GTPP    Y  VDTGS++ WV+C PC  C +    P+    ++P  S++
Sbjct: 37  AMGLYYTRISLGTPPQ-QFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTT 95

Query: 75  YKELSCQSEQCHLLD-TVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGN-------S 125
              +SC   +C +L+  + CS ++L C Y+  Y D S T G    +  TF         +
Sbjct: 96  KISISCTDAECGVLNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTA 155

Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTD 184
            +    +VFGCG   TG ++ +  GL+G G T +SL +Q+  Q +  N F++CL     D
Sbjct: 156 KSGTARLVFGCGGTQTGSWSVD--GLLGFGPTTVSLPNQLAQQNISVNIFAHCL---QGD 210

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLE--GISVGNLSNSSKLIPYYNS 242
            S    +  G   E     +V T +V  ED  +Y V L   GIS  N++  +     Y  
Sbjct: 211 VSGRGSLVIGTIREPD---LVYTPMVFGED--HYNVQLLNIGISGRNVTTPASFDLEYT- 264

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
                 G + ID+G   T L +  Y+     V
Sbjct: 265 ------GGVIIDSGTTLTYLVQPAYDEFRRGV 290


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 161/382 (42%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +  
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 163/381 (42%), Gaps = 47/381 (12%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  +  IG+PP  D +  VDTGSD++WV C+ C  C K+        +YNP SSS+  
Sbjct: 71  GLYYARIGIGSPPN-DFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129

Query: 77  ELSCQSEQCHLLDTV---SCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
            ++C    C          C    LC Y   Y D S T G    + I      GN     
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189

Query: 130 DN--VVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
            N  +VFGCG   +G     +E   G++G G+   S+ SQ+ +     K F++CL     
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL----- 244

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
             SI+    F  G EV    + +T +V   ++ +Y V L G+ VG+ +    L  +  S 
Sbjct: 245 -DSISGGGIFAIG-EVVEPKLKTTPVVP--NQAHYNVVLNGVKVGDTALDLPLGLFETS- 299

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYK-TPSM 299
               K    ID+G     LP   Y  L E++  A   +KL    D       C+    ++
Sbjct: 300 ---YKRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDD----QFTCFVFDKNV 352

Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG-DVGIFGNFAQS 353
               P +T  F+  + +  I+   ++    + V+C        Q  DG +V + G+    
Sbjct: 353 DDGFPTVTFKFE-ESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411

Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
           +  + Y+ ++Q + +   +C+
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 161/382 (42%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +  
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 166/385 (43%), Gaps = 49/385 (12%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSS--S 74
           NG Y     +G+PP    LD+    DTGSDL W+QC  PC  C K   P+Y P   +   
Sbjct: 311 NGLYFTHIFVGSPPRRYFLDM----DTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVP 366

Query: 75  YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN--V 132
            K+  C   Q +L  T  C + + C+Y   YAD S + GVLA++ +    +N       +
Sbjct: 367 LKDSLCVEVQRNL-KTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGI 425

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           +FGC ++  G+   +     G++GL + ++SL SQ+ SQ +  N   +CL    +D++  
Sbjct: 426 MFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLT---SDATGG 482

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   G+    +++     Y+   ++      +S+ S+ +      G   +
Sbjct: 483 GYMFLGD-DFVPYWGMAWVPMLNSHSPNYHSQIMK------ISHGSRQLSLGRQDGRTER 535

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAGIA-- 303
             +  DTG+  T  PK+ Y  L   +++       Q   DP L      K P  + I   
Sbjct: 536 --VVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVK 593

Query: 304 ---PILTAHFDGGAKVPLIHTSTFIPPP------VEGVFCFAM----QPIDGDVGIFGNF 350
                LT  F   +K  ++ T   IPP        +G  C  +       DG   I G+ 
Sbjct: 594 QFFQPLTLQFR--SKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDI 651

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
           +     + YD  +Q + +  + C K
Sbjct: 652 SLRGKLVVYDNVNQKIGWAQSTCVK 676


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 93/330 (28%), Positives = 144/330 (43%), Gaps = 29/330 (8%)

Query: 64  KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN-------YTYGYADSSLTKGVLA 116
           K ++ P  S S++ ++C S++C + D     S  LC        Y   YAD S  KG   
Sbjct: 188 KGVFCPHRSKSFQAVTCASQKCKI-DLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFG 246

Query: 117 TERITFGNSN---NFFDNVVFGCGHN-NTGV-FNENEMGLVGLGRTRLSLASQILSQLGA 171
           T+ IT    N      +N+  GC  +   GV FNE+  G++GLG  + S   +   + GA
Sbjct: 247 TDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGA 306

Query: 172 NKFSYCLVPFHTDSSITSKMYFGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGNL 230
            KFSYCLV   +  +++S +  G        G +  T L+      +Y V + GIS+G  
Sbjct: 307 -KFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFP--PFYGVNVVGISIG-- 361

Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPT-LLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
               K+ P        S+G   ID+G   T LL   +    E  +++  K+        G
Sbjct: 362 GQMLKIPPQVWDFN--SQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFG 419

Query: 290 S-QLCYKTPSM-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDGDVG- 345
           +   C+        + P L  HF GGA+  P + +      P+  V C  + PIDG  G 
Sbjct: 420 ALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL--VKCIGIVPIDGIGGA 477

Query: 346 -IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            + GN  Q +    +D  +  + F P+ CT
Sbjct: 478 SVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/398 (25%), Positives = 171/398 (42%), Gaps = 77/398 (19%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
           A G Y  K  IGTPP  + Y  VDTGSD+MWV C+ C +C  +        +Y+   SSS
Sbjct: 79  AVGLYYAKIGIGTPPK-NYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSS 137

Query: 75  YKELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NS 125
            K + C  E C  ++      C++   C Y   Y D S T G    + + +        +
Sbjct: 138 GKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKT 197

Query: 126 NNFFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           ++   ++VFGCG   +G     NE  + G++G G+   S+ SQ+ S     K F++CL  
Sbjct: 198 DSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL-- 255

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN-- 229
                         NG  V+GGG+ +   V +          D+ +Y V +  + VG+  
Sbjct: 256 --------------NG--VNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTF 299

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
           LS S+      ++S    +    ID+G     LP+  Y  L  ++ +       Q P L 
Sbjct: 300 LSLST------DTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMIS-------QHPDLK 346

Query: 290 SQ------LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ---- 338
            Q       C++ + S+    P +T  F+ G  +  ++   ++ P V   +C   Q    
Sbjct: 347 VQTLHDEYTCFQYSESVDDGFPAVTFFFENGLSLK-VYPHDYLFPSVN-FWCIGWQNSGT 404

Query: 339 --PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                 ++ + G+   S+  + YD ++Q + +   +C+
Sbjct: 405 QSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEYNCS 442


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 161/382 (42%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +  
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 157/385 (40%), Gaps = 60/385 (15%)

Query: 32  TPPLLDIYGI----------------VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
           TPPL   YG+                +DT S L W++C  C+   +Q  P+++P+ SSSY
Sbjct: 67  TPPLEYTYGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSY 126

Query: 76  KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
           + L   S  C   + V  +  +   +  G A      G + T+ I  GN      +V FG
Sbjct: 127 RPLHPTSPLCRAPNPVLPAGDKCSFHLPGEA-----HGYVGTDTIILGNPTLPIHSVAFG 181

Query: 136 CGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C  +  G   +    G +G+G+   SL  QI  ++G ++FSYCL+           + F 
Sbjct: 182 CAQSTEGFDTKGTFAGTLGMGKLPTSLIMQIKDRVG-SRFSYCLIGLGHSPGRNGFIRF- 239

Query: 195 NGSEVSGGGVVSTSLVSKEDK--------------TYYFVTLEGISVGN--LSNSSKLIP 238
            G+++       T LV    K              + Y+V L GIS+    +    + + 
Sbjct: 240 -GADIPD----PTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMF 294

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLGSQLCYK 295
              S G+   G  F+D G   T L    Y  +EE V + ++   Y   +DP     LC++
Sbjct: 295 ERRSDGS---GGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDPNF--SLCFR 349

Query: 296 T-PSMAGIAPILTAHFDGGAKVPLIH-----TSTFIPPPVEGVFCFAM-QPIDGDVGIFG 348
             P +    P LT  F+G A   + H      + F+    + + CF + +   G   + G
Sbjct: 350 EHPGIWSHIPKLTLDFEGPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVG 409

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
              Q D    +D  +  ++F    C
Sbjct: 410 AMQQVDTRFIFDLHANTITFHRESC 434


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 145/355 (40%), Gaps = 52/355 (14%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
           IVDTGSDL WVQC PC  CY Q  P+++P+ S+SY  + C +  C   L        SC+
Sbjct: 125 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 184

Query: 95  S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
           +          + C Y+  Y D S ++GVLAT+ +  G ++   D  VFGCG +N G   
Sbjct: 185 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRG--- 239

Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
                   L R   + +S   S  G +  +   +    D+S      + N + VS     
Sbjct: 240 --------LRRPGSAASSPTASPPGTSGDAAGSLSLGGDTS-----SYRNATPVS----Y 282

Query: 206 STSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKD 265
           +  +       +YF+ + G SVG  + ++           +   N+ +D+G   T L   
Sbjct: 283 TRMIADPAQPPFYFMNVTGASVGGAAVAAA---------GLGAANVLLDSGTVITRLAPS 333

Query: 266 FYN--RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTS 322
            Y   R E   +   +  P   P      CY       +  P+LT   + GA + +    
Sbjct: 334 VYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAG 393

Query: 323 TFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                  +G   C AM  +  +    I GN+ Q +  + YD     + F   DC+
Sbjct: 394 MLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 159/355 (44%), Gaps = 46/355 (12%)

Query: 41  IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS-SQQLC 99
           I+DTGS + ++ C  C  C K     ++P  S++ K+L+C    C+   T SC+ +   C
Sbjct: 29  IIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPLCN-CGTPSCTCNNDRC 87

Query: 100 NYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLGRTR 158
            Y+  YA+ S ++G +  +   F +S++    +VFGC +  TG ++ +   G++G+G   
Sbjct: 88  YYSRTYAERSSSEGWMIEDTFGFPDSDSPV-RLVFGCENGETGEIYRQMADGIMGMGNNH 146

Query: 159 LSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY 217
            +  SQ++ + +  + FS C    +    I   +  G+ +   G   V T L++     Y
Sbjct: 147 NAFQSQLVQRKVIEDVFSLCFG--YPKDGI---LLLGDVTLPEGANTVYTPLLTHLHLHY 201

Query: 218 YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--- 274
           Y V ++GI+V   + +     +    G +      +D+G   T LP D +  + + V   
Sbjct: 202 YNVKMDGITVNGQTLAFDASVFDRGYGTV------LDSGTTFTYLPTDAFKAMAKAVGDY 255

Query: 275 --RNAIKLTPYQDPRLGSQLCYKTP-----SMAGIAPILTAHFDGGAKVPLIHTSTFIPP 327
             +  ++ TP  DP+  + +C+K        +    P     F GGAK+ L        P
Sbjct: 256 VEKKGLQSTPGADPQY-NDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTL--------P 306

Query: 328 PVEGVFCFAMQPIDGDVGIF---------GNFAQSDLFIGYDFDSQMVSFKPTDC 373
           P+   + F  +P +  +GIF         G  +  D+ + YD  +  V F    C
Sbjct: 307 PLR--YLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTMAC 359


>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
 gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
          Length = 453

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 83/294 (28%), Positives = 136/294 (46%), Gaps = 30/294 (10%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           +++   +GTP +  +   +DTGS L WVQC PC ++C+ Q   V PI++P++SS+++ + 
Sbjct: 53  FLIPVKLGTPAVQYLV-TMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVG 111

Query: 80  CQSEQCHLL------DTVSCSS-QQLCNYTYGYADS-SLTKGVLATERITFGNSNNF--- 128
           C +  C  L       + +C   + +C YT  Y    + + G   T+R+  G        
Sbjct: 112 CSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTT 171

Query: 129 --FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
               N VFGC   +T      E G+ GLG +  S   QI   L    FSYCL      S 
Sbjct: 172 LSLANFVFGCSM-DTQYSTHKEAGIFGLGTSNYSF-EQIAPLLSYKAFSYCL-----PSD 224

Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
              + Y   G + SGG  V TS+     +  Y + + G++V  ++   + +   + S   
Sbjct: 225 EAHQGYLSIGPDSSGG--VPTSMFPGTPRPVYSIGMTGLTV-TVNGEVRSLVSGSGSSPS 281

Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPS 298
               M +D+GA  TLL    + +LE+ +  A++   Y        +QLC+ T S
Sbjct: 282 PSSLMVVDSGAKLTLLLASTFGQLEDAIIPAMESLGYSLNTAAGQNQLCFLTES 335


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/386 (25%), Positives = 154/386 (39%), Gaps = 35/386 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPA 70
           + S   T  G+Y ++F +GTP    +  + DTGSDL WV+C           P   +  +
Sbjct: 3   LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCRGAAGPPASDPPAREFRAS 61

Query: 71  SSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFG--- 123
            S S+  L+C S+ C         +CSS    C Y Y Y D S  +GV+ T+  T     
Sbjct: 62  ESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSG 121

Query: 124 ----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
                             VV GC     G   ++  G++ LG + +S AS+  ++ G  +
Sbjct: 122 SGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GR 180

Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS 233
           FSYCLV      + +S + FG G E  G     T LV   D+         +    ++  
Sbjct: 181 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLV--LDRRVSPFYAVAVDAVYVAGE 238

Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQ 291
           +  IP  +       G   +D+G   T+L    Y  +   +   +   P    DP    +
Sbjct: 239 ALDIP-ADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FE 294

Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFG 348
            CY   + A   P L   F G A++     S ++     GV C  +Q  +G    V + G
Sbjct: 295 YCYNWTAGAPEIPKLEVSFAGSARLEPPAKS-YVIDAAPGVKCIGVQ--EGAWPGVSVIG 351

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCT 374
           N  Q +    +D   + + FK T C 
Sbjct: 352 NILQQEHLWEFDLRDRWLRFKHTRCA 377


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/386 (25%), Positives = 154/386 (39%), Gaps = 35/386 (9%)

Query: 13  VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPA 70
           + S   T  G+Y ++F +GTP    +  + DTGSDL WV+C           P   +  +
Sbjct: 94  LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCRGAAGPPASDPPAREFRAS 152

Query: 71  SSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFG--- 123
            S S+  L+C S+ C         +CSS    C Y Y Y D S  +GV+ T+  T     
Sbjct: 153 ESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSG 212

Query: 124 ----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
                             VV GC     G   ++  G++ LG + +S AS+  ++ G  +
Sbjct: 213 SGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GR 271

Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS 233
           FSYCLV      + +S + FG G E  G     T LV   D+         +    ++  
Sbjct: 272 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLV--LDRRVSPFYAVAVDAVYVAGE 329

Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQ 291
           +  IP  +       G   +D+G   T+L    Y  +   +   +   P    DP    +
Sbjct: 330 ALDIP-ADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FE 385

Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFG 348
            CY   + A   P L   F G A++     S ++     GV C  +Q  +G    V + G
Sbjct: 386 YCYNWTAGAPEIPKLEVSFAGSARLEPPAKS-YVIDAAPGVKCIGVQ--EGAWPGVSVIG 442

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCT 374
           N  Q +    +D   + + FK T C 
Sbjct: 443 NILQQEHLWEFDLRDRWLRFKHTRCA 468


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 78/225 (34%), Positives = 111/225 (49%), Gaps = 25/225 (11%)

Query: 10  NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
           N+   +N+   +G +++  + GTPP  +   I+DTGS + W QC  CV C +     +N 
Sbjct: 114 NHAHNNNLFDEDGNFLVDVAFGTPPQ-NFMLILDTGSSITWTQCKACVNCLQDSHRYFNW 172

Query: 70  ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
           ++SS+Y   SC      +  TV        NY   Y D S + G    + +T   S + F
Sbjct: 173 SASSTYSSGSC------IPGTVE------NNYNMTYGDDSTSVGNYGCDTMTLEPS-DVF 219

Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSIT 188
               FGCG NN G F     G++GLG+ +LS  SQ  S+   NK FSYCL     + SI 
Sbjct: 220 QKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKF--NKVFSYCLP---EEDSIG 274

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGN 229
           S + FG  +      +  TSLV+     ++  YYFV L  ISVGN
Sbjct: 275 S-LLFGEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGN 318


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 170/396 (42%), Gaps = 73/396 (18%)

Query: 20  ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
           A G Y  K  IGTPP  + Y  VDTGSD+MWV C+ C +C  +        +Y+   SSS
Sbjct: 81  AVGLYYAKIGIGTPPK-NYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSS 139

Query: 75  YKELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NS 125
            K + C  E C  ++      C++   C Y   Y D S T G    + + +        +
Sbjct: 140 GKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKT 199

Query: 126 NNFFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
           ++   ++VFGCG   +G     NE  + G++G G+   S+ SQ+ S     K F++CL  
Sbjct: 200 DSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL-- 257

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN-- 229
                         NG  V+GGG+ +   V +          D+ +Y V +  + VG+  
Sbjct: 258 --------------NG--VNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAF 301

Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDP 286
           LS S+      +  G I      ID+G     LP+  Y  L  ++ +    +K+    D 
Sbjct: 302 LSLSTDTSTQGDRKGTI------IDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHD- 354

Query: 287 RLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQ------ 338
                 C++ + S+    P +T +F+ G  + +  H   F   P    +C   Q      
Sbjct: 355 ---EYTCFQYSESVDDGFPAVTFYFENGLSLKVYPHDYLF---PSGDFWCIGWQNSGTQS 408

Query: 339 PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
               ++ + G+   S+  + YD ++Q++ +   +C+
Sbjct: 409 RDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 444


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 173/379 (45%), Gaps = 44/379 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
           G Y  K  +G+PP  +    +DTGSD++WV C  C  C +      Q+   ++ +SSS+ 
Sbjct: 64  GLYFTKVKLGSPPR-EFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLN-FFDSSSSSTA 121

Query: 76  KELSCQSEQC-HLLDTVS--CSSQ-QLCNYTYGYADSSLTKGVLATERITFGN--SNNFF 129
            ++ C    C   + T +  CSSQ   C+YT+ Y D S T G   ++ + F      +  
Sbjct: 122 GQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLI 181

Query: 130 DN----VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPF 181
           DN    +VFGC    +G   + +    G+ G G+  LS+ SQ+ ++ +    FS+CL   
Sbjct: 182 DNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL--- 238

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
             D S    +  G   E+   G+V + LV  +   +Y + L  I+V     + +L+P   
Sbjct: 239 KGDGSGGGILVLG---EILEPGIVYSPLVPSQ--PHYNLNLLSIAV-----NGQLLPIDP 288

Query: 242 SSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYKTPS 298
           ++ A S      +D+G     L  + Y+     V NAI ++P   P    G+Q    + S
Sbjct: 289 AAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAV-NAI-VSPSVTPITSKGNQCYLVSTS 346

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDL 355
           ++ + P+ + +F GGA + L      IP    G   ++C   Q + G V I G+    D 
Sbjct: 347 VSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-VTILGDLVLKDK 405

Query: 356 FIGYDFDSQMVSFKPTDCT 374
              YD   Q + +   DC+
Sbjct: 406 IFVYDLVRQRIGWANYDCS 424


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 100/389 (25%), Positives = 159/389 (40%), Gaps = 56/389 (14%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++GTPP  ++  ++DTGS+L W+ C             +N   S SY+ + C
Sbjct: 28  NISLTVSLTVGTPPQ-NVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPC 85

Query: 81  QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
            S  C           SC S  LC+ T  YAD+S ++G LA++    G S+     +VFG
Sbjct: 86  SSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD--IPGMVFG 143

Query: 136 CGHNNTGVFNENE------MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           C  +   VF+ N        GL+G+ R  LS     +SQ+G  KFSYC+    + +  + 
Sbjct: 144 CMDS---VFSSNSDEDSKNTGLMGMNRGSLSF----VSQMGFPKFSYCI----SGTDFSG 192

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS- 242
            +  G  +      +  T LV         D+  Y V LEGI V     S +L+P   S 
Sbjct: 193 MLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKV-----SDRLLPIPKSV 247

Query: 243 --SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCY 294
                   G   +D+G   T L    Y  L  +  N     L   +DP    Q    LCY
Sbjct: 248 FEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCY 307

Query: 295 KTPSMAGIAPIL--TAHFDGGAKVPLIHTSTF--IPPPVEG---VFCFAMQPID---GDV 344
           + P    + P L   +    GA++ +        +P  + G   V C +    D    + 
Sbjct: 308 RVPISQRVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEA 367

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            + G+  Q ++++ +D +   +      C
Sbjct: 368 YVIGHHHQQNVWMEFDLERSRIGLAQVRC 396


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 136/278 (48%), Gaps = 25/278 (8%)

Query: 12  VVQSNVSTANGE-------YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
           V +S+V  A+G        Y+++ +IGTP    +  + DT +D  W+ C  CV C   V 
Sbjct: 69  VTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVAL-DTSNDAAWIPCSGCVGCSSSV- 126

Query: 65  PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
            +++P+ SSS + L C++ QC      SC+  + C +   Y  S++ +  L  + +T   
Sbjct: 127 -LFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSAI-EAYLTQDTLTL-- 182

Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
           + +   N  FGC +  +G  +    GL+GLGR  LSL SQ    L  + FSYCL P    
Sbjct: 183 ATDVIPNYTFGCINKASGT-SLPAQGLMGLGRGPLSLISQS-QNLYQSTFSYCL-PNSKS 239

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPYYN 241
           S+ +  +  G  ++     + +T L+    + + Y+V L GI VGN  +   +  + +  
Sbjct: 240 SNFSGSLRLGPKNQPI--RIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDP 297

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
           ++GA   G +F D+G   T L +  Y  +  + R  +K
Sbjct: 298 ATGA---GTIF-DSGTVYTRLVEPAYVAMRNEFRRRVK 331


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 151/366 (41%), Gaps = 46/366 (12%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           +G +++    GTP       I+DTGSD  W+QC  C       K  +NP+ SSSY   SC
Sbjct: 126 DGLFLVNVGFGTPQQ-KFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSC 184

Query: 81  QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
                   DT         NYT  Y D+S +KGV   + +T     + F    FGCG + 
Sbjct: 185 IPST----DT---------NYTMKYEDNSYSKGVFVCDEVTL--KPDVFPKFQFGCGDSG 229

Query: 141 TGVFNENEMGLVGLGR-TRLSLASQILSQLGANKFSYCLVPF-HTDSSITSKMYFGNGSE 198
            G F     G++GL +  + SL SQ  S+    KFSYC  P  HT  S    + FG  + 
Sbjct: 230 GGEFG-TASGVLGLAKGEQYSLISQTASKF-KKKFSYCFPPKEHTLGS----LLFGEKAI 283

Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTG 256
            +   +  T L++      YFV L GISV    L+ SS L   + S G I      ID+G
Sbjct: 284 SASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSL---FASPGTI------IDSG 334

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI---APILTAHF 310
              T LP   Y  L    +  +   P   P    +L   CY      G     P +  HF
Sbjct: 335 TVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHF 394

Query: 311 DGGAKVPLIHTSTFIPPP---VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
            G   V L H S  +       +    FA +     V I GN  Q  L + YD +   + 
Sbjct: 395 VGEVDVSL-HPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLG 453

Query: 368 FKPTDC 373
           F   DC
Sbjct: 454 FG-NDC 458


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 177/387 (45%), Gaps = 54/387 (13%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  IG+PP    Y  VDTGSD++WV C+ C  C  +         Y+PA S 
Sbjct: 79  TDTGLYYTRIEIGSPPK-GYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSG 137

Query: 74  SYKELSCQSEQCHLLDTV-----SC-SSQQLCNYTYGYADSSLTKGVLATERITF----G 123
           +   + C+ E C + ++      +C S+   C +   Y D S T G   T+ + +    G
Sbjct: 138 T--TVGCEQEFC-VANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSG 194

Query: 124 NSNNFFDN--VVFGCGHN---NTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYC 177
           N      N  + FGCG     + G  N+   G++G G++  S+ SQ+ +     K F++C
Sbjct: 195 NGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L       ++     F  G+ V    V +T LV   + T+Y V L+GISVG    ++  +
Sbjct: 255 L------DTVRGGGIFAIGNVVQ-PKVKTTPLV--PNVTHYNVNLQGISVG---GATLQL 302

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP---YQDPRLGSQLCY 294
           P        SKG + ID+G     LP++ Y  L   V +  +  P   YQD      +C+
Sbjct: 303 PTSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD-----FVCF 356

Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
           + + S+    P++T  F+G   +  ++   ++      ++C       +Q  DG D+ + 
Sbjct: 357 QFSGSIDDGFPVITFSFEGDLTLN-VYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLL 415

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           G+   S+  + YD + +++ +   +C+
Sbjct: 416 GDLVLSNKLVVYDLEKEVIGWTDYNCS 442


>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
 gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
 gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
 gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
 gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
          Length = 583

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 162/384 (42%), Gaps = 42/384 (10%)

Query: 21  NGEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYK-- 76
           +G Y  +  +G P     Y + +DTGS+L W+QC  PC  C K    +Y P   +  +  
Sbjct: 200 DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSS 259

Query: 77  ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATER--ITFGNSNNFFDNVVF 134
           E  C   Q + L T  C +   C+Y   YAD S + GVL  ++  +   N +    ++VF
Sbjct: 260 EAFCVEVQRNQL-TEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 318

Query: 135 GCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
           GCG++  G+     +   G++GL R ++SL SQ+ S+ + +N   +CL      S +  +
Sbjct: 319 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA-----SDLNGE 373

Query: 191 MYFGNGSE-VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
            Y   GS+ V   G+    ++       Y + +  +S G       ++     +G +  G
Sbjct: 374 GYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQ-----GMLSLDGENGRV--G 426

Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTP------SMAGI 302
            +  DTG+  T  P   Y++L   ++    L   +D    +  +C++        S++ +
Sbjct: 427 KVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDV 486

Query: 303 APILT-AHFDGGAKVPLIHTSTFIPPPV------EGVFCFAM----QPIDGDVGIFGNFA 351
                      G+K  +I     I P        +G  C  +       DG   I G+ +
Sbjct: 487 KKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDIS 546

Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
                I YD   + + +  +DC +
Sbjct: 547 MRGHLIVYDNVKRRIGWMKSDCVR 570


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 160/393 (40%), Gaps = 54/393 (13%)

Query: 18  STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC--YKQVKPIYNPASS 72
           S + G Y +  S GTPP    + ++DTGS  +W  C     C  C    ++ P + P  S
Sbjct: 71  SHSYGGYSISLSFGTPPQTLSF-VMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHS 128

Query: 73  SSYKELSCQSEQCHLL----------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           SS K + C++ +C  +          D  S +  Q+C        S  T GV  +E  T 
Sbjct: 129 SSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSE--TL 186

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENE-MGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
                   N + GC      VF+  +  G+ G GR   SL SQ    LG  KFSYCL+  
Sbjct: 187 HLHGLIVPNFLVGCS-----VFSSRQPAGIAGFGRGPSSLPSQ----LGLTKFSYCLLSH 237

Query: 182 HTDSSITSKMYFGNG---SEVSGGGVVSTSLVSK---EDK----TYYFVTLEGISVGNLS 231
             D +  S     +    S+     ++ T LV     +DK     YY+V+L  IS+G  S
Sbjct: 238 KFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRS 297

Query: 232 NSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDP 286
                IPY Y S      G   ID+G   T +  + +    N    QV+N  +     + 
Sbjct: 298 VK---IPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERAL-MVEA 353

Query: 287 RLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPI 340
             G + C+       +  P L  HF GGA V L   + F       V CF +     +  
Sbjct: 354 LSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKA 413

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
            G   I GNF   + ++ YD  ++ + FK   C
Sbjct: 414 SGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446


>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
 gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
 gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
 gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
 gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
 gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
 gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
 gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
 gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
          Length = 357

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 55/380 (14%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
           M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + C 
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 82  SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
           S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +++
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
           FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +    
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S   
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
           M +D+GA  T L    +  L++ +  A+    Y      R  S +CY             
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
           TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN    
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
                +D   +   FK   C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 55/380 (14%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
           M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + C 
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 82  SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
           S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +++
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
           FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +    
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S   
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
           M +D+GA  T L    +  L++ +  A+    Y      R  S +CY             
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
           TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN    
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
                +D   +   FK   C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 31/267 (11%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
           +G+Y     +G PP    LD    VDTGSDL W+QC  PC  C K   P+Y PA      
Sbjct: 191 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 246

Query: 76  -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
            ++L CQ  Q    D   C++ + C+Y   YAD S + GVLA + +    +N   + +  
Sbjct: 247 PRDLLCQELQG---DQNYCATCKQCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLDF 303

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           VFGC ++  G    +     G++GL    +SL SQ+ SQ + +N F +C+     + +  
Sbjct: 304 VFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCIT---KEPNGG 360

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   G+    +    D  Y+    + ++ G+     + +  +  +G  S 
Sbjct: 361 GYMFLGD-DYVPRWGMTWAPIRGGPDNLYH-TEAQKVNYGD-----QQLRMHGQAG--SS 411

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVR 275
             +  D+G+  T LP + Y +L   ++
Sbjct: 412 IQVIFDSGSSYTYLPDEIYKKLVTAIK 438


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 96/380 (25%), Positives = 160/380 (42%), Gaps = 43/380 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  +  +G+PP  + +  +DTGSD++WV C PC  C            +NP +SS+  
Sbjct: 89  GLYFTRVKLGSPPK-EYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147

Query: 77  ELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN- 126
           ++ C  ++C      S      S    C YT+ Y D S T G   ++ + F    GN   
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQT 207

Query: 127 -NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVP 180
            N   ++VFGC ++ +G   + +    G+ G G+ +LS+ SQ L+ LG +   FS+CL  
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQ-LNSLGVSPKVFSHCLKG 266

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIP 238
                 I   +  G   E+   G+V T LV  +   +Y + LE I V    L   S L  
Sbjct: 267 SDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIVVNGQKLPIDSSLFT 318

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
             N+ G I      +D+G     L    Y+     +  A+  +       G+Q    + S
Sbjct: 319 TSNTQGTI------VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSS 372

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSD 354
           +    P ++ +F GG  + +   +  +         ++C   Q   G  + I G+    D
Sbjct: 373 VDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKD 432

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
               YD  +  + +   DC+
Sbjct: 433 KIFVYDLANMRMGWTDYDCS 452


>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
 gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
 gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
          Length = 432

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 111/422 (26%), Positives = 175/422 (41%), Gaps = 74/422 (17%)

Query: 17  VSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC-----LPCVQC------YKQVK 64
           V+T    Y++  ++G PP +  +Y  +DTGSDL WV C       C++C       K + 
Sbjct: 18  VTTYTDGYLLSLNLGMPPQVFQVY--LDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIP 75

Query: 65  PIYNPASSSSYKELSCQSEQC---HLLD-------TVSCS----SQQLCN-----YTYGY 105
                 SSS+ KEL C S  C   H  D        V C+       LC      ++Y Y
Sbjct: 76  SFSPSQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTY 134

Query: 106 ADSSLTKGVLATERITFGNS----NNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
              +L  G LA + +T   S        D     FGC     G      +G+ G G+  L
Sbjct: 135 GGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGC----VGSSIREPIGIAGFGKGIL 190

Query: 160 SLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKT 216
           SL SQ+        FS+C + F    + + TS +  G+ +  +    + T ++ S  +  
Sbjct: 191 SLPSQL--GFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPN 248

Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
           +Y++ LEG+S+G+   +    P  +S  +   G M +DTG   T LP  FY  +   + +
Sbjct: 249 FYYIGLEGVSIGD-GAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLAS 307

Query: 277 AIKLTPYQD--PRLGSQLCYK-----TPSMAGIAPILTAHFDGGAKVPLIHTSTF--IPP 327
            I      D   R G  LC+K     TP      P++  HF G  K+ L   S +  +  
Sbjct: 308 VILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTA 367

Query: 328 PVEGVF--CFAMQPI-------------DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           P   V   C   Q +             +G   + G+F   ++ + YD ++  + F+P D
Sbjct: 368 PKNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKD 427

Query: 373 CT 374
           C 
Sbjct: 428 CA 429


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 96/380 (25%), Positives = 160/380 (42%), Gaps = 43/380 (11%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  +  +G+PP  + +  +DTGSD++WV C PC  C            +NP +SS+  
Sbjct: 89  GLYFTRVKLGSPPK-EYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147

Query: 77  ELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN- 126
           ++ C  ++C      S      S    C YT+ Y D S T G   ++ + F    GN   
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQT 207

Query: 127 -NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVP 180
            N   ++VFGC ++ +G   + +    G+ G G+ +LS+ SQ L+ LG +   FS+CL  
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQ-LNSLGVSPKVFSHCLKG 266

Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIP 238
                 I   +  G   E+   G+V T LV  +   +Y + LE I V    L   S L  
Sbjct: 267 SDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIVVNGQKLPIDSSLFT 318

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
             N+ G I      +D+G     L    Y+     +  A+  +       G+Q    + S
Sbjct: 319 TSNTQGTI------VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSS 372

Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSD 354
           +    P ++ +F GG  + +   +  +         ++C   Q   G  + I G+    D
Sbjct: 373 VDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKD 432

Query: 355 LFIGYDFDSQMVSFKPTDCT 374
               YD  +  + +   DC+
Sbjct: 433 KIFVYDLANMRMGWTDYDCS 452


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 55/380 (14%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
           M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + C 
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 82  SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
           S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +++
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
           FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +    
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S   
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
           M +D+GA  T L    +  L++ +  A+    Y      R  S +CY             
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
           TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN    
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
                +D   +   FK   C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 103/386 (26%), Positives = 172/386 (44%), Gaps = 60/386 (15%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++G+PP  +I  ++DTGS+L W+ C    +    +  ++NP SSS+Y  + C
Sbjct: 58  NVTLTVTLAVGSPPQ-NISMVLDTGSELSWLHC----KKSPNLGSVFNPVSSSTYSPVPC 112

Query: 81  QSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            S  C      L    SC  +   C+    YAD++  +G LA +    G+        +F
Sbjct: 113 SSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTR--PGTLF 170

Query: 135 GCGHNNTGVFNENE-----MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
           GC   ++G+ +++E      GL+G+ R  LS     ++QLG +KFSYC+    + S  + 
Sbjct: 171 GC--MDSGLSSDSEEDAKSTGLMGMNRGSLSF----VNQLGFSKFSYCI----SGSDSSG 220

Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNS 242
            +  G+ S    G +  T LV +       D+  Y V LEGI VG+ + +  K +   + 
Sbjct: 221 ILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDH 280

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKT 296
           +GA   G   +D+G   T L    Y  L+ +     K  L    DP    Q    LCY+ 
Sbjct: 281 TGA---GQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRV 337

Query: 297 -----PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--- 342
                P+  G+ P+++  F G      G K+ L   +       E V+CF     D    
Sbjct: 338 GSSTRPNFTGL-PVISLMFRGAEMSVSGQKL-LYRVNGAGSEGKEEVYCFTFGNSDLLGI 395

Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSF 368
           +  + G+  Q ++++ +D     V F
Sbjct: 396 EAFVIGHHHQQNVWMEFDLAKSRVGF 421


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 172/387 (44%), Gaps = 52/387 (13%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           TA G Y  +  IG+PP    Y  VDTGSD++WV  + C  C  +         Y+PA S 
Sbjct: 80  TATGLYYTRIEIGSPPK-GYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSG 138

Query: 74  SYKELSCQSEQCHLLDTVS-----C-SSQQLCNYTYGYADSSLTKGVLATERITF----G 123
           +   + C+ E C      S     C S+   C +   Y D S T G   T+ + +    G
Sbjct: 139 T--TVGCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSG 196

Query: 124 NSNNFFDNV--VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYC 177
           N      NV   FGCG    G    +     G++G G++  S+ SQ+ +     K F++C
Sbjct: 197 NGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHC 256

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L       ++     F  G+ V    V +T LV   + T+Y V L+GISVG    ++  +
Sbjct: 257 L------DTVRGGGIFAIGNVVQPPIVKTTPLV--PNATHYNVNLQGISVG---GATLQL 305

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCY 294
           P        SKG + ID+G     LP++ Y  L   V +    + +  Y+D      +C+
Sbjct: 306 PTSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYED-----FICF 359

Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
           + + S+    P++T  F+G   +  ++   ++      ++C       +Q  DG D+ + 
Sbjct: 360 QFSGSLDEEFPVITFSFEGDLTLN-VYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLL 418

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           G+   S+  + YD + Q++ +   +C+
Sbjct: 419 GDLVLSNKLVVYDLEKQVIGWTDYNCS 445


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 105/393 (26%), Positives = 160/393 (40%), Gaps = 64/393 (16%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
           N    +  ++GTPP  ++  ++DTGS+L W+ C P     K     + P +SS++  + C
Sbjct: 82  NVSLTVSLAVGTPPQ-NVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPC 140

Query: 81  QSEQCHLLDTVS---CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
            S QC   D  S   C  +   C+ +  YAD S + G LAT+    G+         FGC
Sbjct: 141 ASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPL--RAAFGC 198

Query: 137 GHNNTGVFNEN-----EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
               +  F+ +       GL+G+ R  LS     +SQ    +FSYC+    +D      +
Sbjct: 199 ---MSSAFDSSPDGVASAGLLGMNRGALSF----VSQASTRRFSYCI----SDRDDAGVL 247

Query: 192 YFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS---NSSKLIPYYNS 242
             G+    +   +  T +          D+  Y V L GI VG       +S L P +  
Sbjct: 248 LLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTG 307

Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLE-EQVRNAIKLTP-YQDPRLGSQ----LCYKT 296
           +     G   +D+G   T L  D Y+ L+ E  R A  L P   DP    Q     C++ 
Sbjct: 308 A-----GQTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRV 362

Query: 297 PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVE-----GVFCFA-----MQPI 340
           P   G +P  TA   G      GA++ +         P E     GV+C       M PI
Sbjct: 363 PQ--GRSPP-TARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPI 419

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              V   G+  Q ++++ YD +   V   P  C
Sbjct: 420 MAYV--IGHHHQMNVWVEYDLERGRVGLAPVRC 450


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 99/399 (24%), Positives = 151/399 (37%), Gaps = 75/399 (18%)

Query: 23  EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLP--CVQCYKQV---------------- 63
           +Y +  S+G P       + +DTGSDL+W  C P  C+ C  +                 
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146

Query: 64  ------KPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVL 115
                  P+ + A SS+     C + +C L  ++T SC+S       Y Y D SL    L
Sbjct: 147 RRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-L 205

Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
              R+    S    +N  F C H          +G+ G GR  LSL +Q+   L  +  +
Sbjct: 206 RRGRVGLAASMAV-ENFTFACAHTALA----EPVGVAGFGRGPLSLPAQLAPSLSGSTDA 260

Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYF-VTLEGISVGNLSNSS 234
             +    TD                    V T L+      Y++ V LE +SVG     +
Sbjct: 261 AAIGASETD-------------------FVYTPLLHNPKHPYFYSVALEAVSVGGKRIQA 301

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-----QDPRLG 289
           +  P          G M +D+G   T+LP D + R+ ++   A+    +      + + G
Sbjct: 302 Q--PELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTG 359

Query: 290 SQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAMQPIDGD-- 343
              CY  +PS   + P+   HF G A V L   + F+    E    V C  +  + G+  
Sbjct: 360 LAPCYHYSPSDRAVPPV-ALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 418

Query: 344 --------VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
                    G  GNF Q    + YD D+  V F    CT
Sbjct: 419 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 457


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/280 (29%), Positives = 139/280 (49%), Gaps = 29/280 (10%)

Query: 12  VVQSNVSTANGE-------YVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
           V +S+V  A+G        Y+++ +IGTP  P+L     +DT +D  W+ C  CV C   
Sbjct: 69  VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVA---LDTSNDAAWIPCSGCVGCSSS 125

Query: 63  VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           V  +++P+ SSS + L C++ QC      SC+  + C +   Y  S++ +  L  + +T 
Sbjct: 126 V--LFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTL 182

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
             +++   N  FGC +  +G  +    GL+GLGR  LSL SQ    L  + FSYCL P  
Sbjct: 183 --ASDVIPNYTFGCINKASGT-SLPAQGLMGLGRGPLSLISQS-QNLYQSTFSYCL-PNS 237

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPY 239
             S+ +  +  G  ++     + +T L+    + + Y+V L GI VGN  +   +  + +
Sbjct: 238 KSSNFSGSLRLGPKNQPI--RIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAF 295

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
             ++GA   G +F D+G   T L +  Y  +  + R  +K
Sbjct: 296 DPATGA---GTIF-DSGTVYTRLVEPAYVAVRNEFRRRVK 331


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 96/396 (24%), Positives = 166/396 (41%), Gaps = 77/396 (19%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP   D Y  VDTGSD+MWV C+ C +C +         +YN   S S K
Sbjct: 84  GLYYAKVGIGTPSK-DYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGK 142

Query: 77  ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--- 130
            + C  E C+ ++      C++   C Y   Y D S T G    + + +   +       
Sbjct: 143 LVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTS 202

Query: 131 ---NVVFGCGHNNTGVF----NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
              +V+FGCG   +G       E   G++G G++  S+ SQ+ +     K F++CL    
Sbjct: 203 SNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL---- 258

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
                           ++GGG+ +   V +          ++ +Y V +  + VG   L 
Sbjct: 259 --------------DGINGGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLH 304

Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
             ++     +  GAI      ID+G     LP+  Y  L  ++ +       Q P L   
Sbjct: 305 LPTEEFEAGDRKGAI------IDSGTTLAYLPEIVYEPLVSKIIS-------QQPDLKVH 351

Query: 292 L------CYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQP 339
           +      C++ + S+    P +T HF+    +  +H   ++  P EG++C       MQ 
Sbjct: 352 IVRDEYTCFQYSGSVDDGFPNVTFHFENSVFLK-VHPHEYL-FPFEGLWCIGWQNSGMQS 409

Query: 340 ID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            D  ++ + G+   S+  + YD ++Q + +   +C+
Sbjct: 410 RDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCS 445


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 176/387 (45%), Gaps = 54/387 (13%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  IG+PP    Y  VDTGSD++WV C+ C  C  +         Y+PA S 
Sbjct: 79  TDTGLYYTRIEIGSPPK-GYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSG 137

Query: 74  SYKELSCQSEQCHLLDTV-----SC-SSQQLCNYTYGYADSSLTKGVLATERITF----G 123
           +   + C+ E C + ++      +C S+   C +   Y D S T G   T+ + +    G
Sbjct: 138 T--TVGCEQEFC-VANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSG 194

Query: 124 NSNNFFDN--VVFGCGHN---NTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYC 177
           N      N  + FGCG     + G  N+   G++G G++  S+ SQ+ +     K F++C
Sbjct: 195 NGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L       ++     F  G+ V    V +T LV   + T+Y V L+GISVG    ++  +
Sbjct: 255 L------DTVRGGGIFAIGNVVQ-PKVKTTPLV--PNVTHYNVNLQGISVG---GATLQL 302

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP---YQDPRLGSQLCY 294
           P        SKG + ID+G     LP++ Y  L   V +  +  P   YQD      +C+
Sbjct: 303 PTSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD-----FVCF 356

Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
           + + S+    P++T  F G   +  ++   ++      ++C       +Q  DG D+ + 
Sbjct: 357 QFSGSIDDGFPVITFSFKGDLTLN-VYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLL 415

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           G+   S+  + YD + +++ +   +C+
Sbjct: 416 GDLVLSNKLVVYDLEKEVIGWTDYNCS 442


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 82/280 (29%), Positives = 139/280 (49%), Gaps = 29/280 (10%)

Query: 12  VVQSNVSTANGE-------YVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
           V +S+V  A+G        Y+++ +IGTP  P+L     +DT +D  W+ C  CV C   
Sbjct: 69  VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVA---LDTSNDAAWIPCSGCVGCSSS 125

Query: 63  VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
           V  +++P+ SSS + L C++ QC      SC+  + C +   Y  S++ +  L  + +T 
Sbjct: 126 V--LFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTL 182

Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
             +++   N  FGC +  +G  +    GL+GLGR  LSL SQ    L  + FSYCL P  
Sbjct: 183 --ASDVIPNYTFGCINKASGT-SLPAQGLMGLGRGPLSLISQS-QNLYQSTFSYCL-PNS 237

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPY 239
             S+ +  +  G  ++     + +T L+    + + Y+V L GI VGN  +   +  + +
Sbjct: 238 KSSNFSGSLRLGPKNQPI--RIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAF 295

Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
             ++GA   G +F D+G   T L +  Y  +  + R  +K
Sbjct: 296 DPATGA---GTIF-DSGTVYTRLVEPAYVAVRNEFRRRVK 331


>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 13/101 (12%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
           MK  IGTPP  +I  ++DTGS+L+W QCLPC+ CY Q  PI++P+ SS++KE  C     
Sbjct: 1   MKLQIGTPPF-EIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCN---- 55

Query: 86  HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
                   +    C+Y   Y D S T+G LATE +T  +++
Sbjct: 56  --------TPDHSCSYKIVYDDKSYTQGTLATETVTIHSTS 88


>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
          Length = 376

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 76/145 (52%), Gaps = 7/145 (4%)

Query: 41  IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
           I+D+GSD+ WVQC PC  + C+ Q  P+++PA+S++Y  + C S  C  L      CS+ 
Sbjct: 164 IIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSAN 223

Query: 97  QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
             C + + Y D +   G  +++ +T G   +     +FGC H + G  F+ +  G + LG
Sbjct: 224 VQCQFGFTYTDGATATGTYSSDDLTLG-PYDVVRGFLFGCAHADRGSTFSFDVSGTLALG 282

Query: 156 RTRLSLASQILSQLGANKFSYCLVP 180
               S   Q  +Q G   FSYC+ P
Sbjct: 283 GGAQSFVQQTATQYG-RVFSYCIPP 306


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 155/385 (40%), Gaps = 52/385 (13%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
           +G+Y     +G PP    LD    VDTGSDL W+QC  PC  C K   P+Y P       
Sbjct: 184 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIVP 239

Query: 76  -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
            ++L CQ  Q    +   C + + C+Y   YAD S + GVLA + +    +N   + +  
Sbjct: 240 PRDLLCQELQG---NQNYCETCKQCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDF 296

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           VFGC ++  G    +     G++GL    +SL SQ+ S  + +N F +C+     +    
Sbjct: 297 VFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCIT---REQGGG 353

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   G+  TS+ S  D  Y+          ++    + +     +G   +
Sbjct: 354 GYMFLGD-DYVPRWGITWTSIRSGPDNLYH------TEAHHVKYGDQQLRMREQAGNTVQ 406

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
             +  D+G+  T LP + Y  L   ++ A              LC+K          +  
Sbjct: 407 --VIFDSGSSYTYLPDEIYENLVAAIKYASPGFVQDSSDRTLPLCWKADFPVRYLEDVKQ 464

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAM----QPIDGDVGIFGNF 350
               L  HF    K  L  + TF   P +       G  C  +    +   G   I G+ 
Sbjct: 465 FFKPLNLHF---GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
           +     + YD   + + +  +DCTK
Sbjct: 522 SLRGKLVVYDNQRRQIGWTNSDCTK 546


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 155/385 (40%), Gaps = 56/385 (14%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +G P   +    +DTGSD++WV C PC  C           +++   SSS +
Sbjct: 82  GLYFTKVKLGNPAR-EFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSAR 140

Query: 77  ELSCQSEQCHLLDTVS--CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNSN--N 127
            L C    C  + T +  C +Q   C+Y++ Y D S T G   T+ + F    G S   N
Sbjct: 141 VLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN 200

Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
               +VFGC     G          G+ G G+   S+ SQ+ S+ +    FS+CL     
Sbjct: 201 SSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL----- 255

Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKE--------DKTYYFVTLEGISV-GNLSNSS 234
                       G E  GG +V   ++            + +Y + L+ I++ G L  + 
Sbjct: 256 -----------KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNP 304

Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
            + P  N+      G   ID+G     L ++ Y+ +   + +A+  +       GSQ   
Sbjct: 305 TMFPISNA------GETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFR 358

Query: 295 KTPSMAGIAPILTAHFDGGAKVP------LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFG 348
            + S+A I P+L  +F+G A +       L   S         ++C   Q  +  + I G
Sbjct: 359 VSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILG 418

Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
           +    D  I YD   Q + +   DC
Sbjct: 419 DLVLKDKIIVYDLAQQRIGWANYDC 443


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 96/394 (24%), Positives = 165/394 (41%), Gaps = 73/394 (18%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP   D Y  VDTGSD+MWV C+ C +C K         +YN   S + K
Sbjct: 76  GLYYAKIGIGTPTK-DYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGK 134

Query: 77  ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--- 130
            + C  E C+ ++      C++   C Y   Y D S T G    + + +   +       
Sbjct: 135 LVPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTA 194

Query: 131 ---NVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
              +V+FGCG   +G     NE  + G++G G++  S+ SQ+       K F++CL    
Sbjct: 195 ANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL---- 250

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
                            +GGG+     V +          ++ +Y V +  + VG+  LS
Sbjct: 251 --------------DGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLS 296

Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRL 288
             + +    +  GAI      ID+G     LP+  Y  L  ++ +    +K+   +D   
Sbjct: 297 LPTDVFEAGDRKGAI------IDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRD--- 347

Query: 289 GSQLCYK-TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQ------PI 340
               C++ + S+    P +T HF+    + +  H   F   P EG++C   Q        
Sbjct: 348 -EYTCFQYSDSLDDGFPNVTFHFENSVILKVYPHEYLF---PFEGLWCIGWQNSGVQSRD 403

Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
             ++ + G+   S+  + YD ++Q + +   +C+
Sbjct: 404 RRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCS 437


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 170/385 (44%), Gaps = 50/385 (12%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T  G Y  +  IGTP     Y  VDTGSD++WV C+ C +C ++        +Y+P  SS
Sbjct: 84  TDTGLYYTEIGIGTPTK-RYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSS 142

Query: 74  SYKELSCQSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------ 122
           +  ++SC    C      LL    C++   C Y+  Y D S T G   ++ + F      
Sbjct: 143 TGSKVSCDQGFCAATYGGLLP--GCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGD 200

Query: 123 GNSNNFFDNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK--FSYC 177
           G +      V FGCG    G     N+   G++G G++  S+ SQ LS  G  K  F++C
Sbjct: 201 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQ-LSAAGKVKKIFAHC 259

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L       +I     F  G+ V    V +T LV   +  +Y V L+ I VG    + KL 
Sbjct: 260 L------DTINGGGIFAIGNVVQ-PKVKTTPLV--PNMPHYNVNLKSIDVG--GTALKLP 308

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-T 296
            +   +G   K    ID+G   T LP+  Y  +   V    K   + + +    LC++  
Sbjct: 309 SHMFDTG--EKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQ--EFLCFQYV 364

Query: 297 PSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQPIDGD-VGIFGN 349
             +    P +T HF+    +PL ++   +     + ++C       +Q  DG  + + G+
Sbjct: 365 GRVDDDFPKITFHFEN--DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGD 422

Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
              S+  + YD ++Q++ +   +C+
Sbjct: 423 LVLSNKLVVYDLENQVIGWTEYNCS 447


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 116 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 174

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 231

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +  
Sbjct: 232 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 286

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 287 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 333

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 334 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 393

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 394 TITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 452

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 453 TRSFGTTFDIQGKQFGFKYAAC 474


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 94/361 (26%), Positives = 155/361 (42%), Gaps = 35/361 (9%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
           YV +  +GTP       +VDT S L WV C PC+     + P +NP +SS+YK + C S 
Sbjct: 126 YVTQVQLGTPAKTHNV-LVDTASSLSWVGCEPCIN--ACLIPTFNPNASSTYKVVGCGSA 182

Query: 84  QCHLLDTVSCSSQQL------CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
            C+ + + + + +        C+Y   Y D SL+ GV++++ +T+G  +  F   +FGC 
Sbjct: 183 LCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDTLTYGLGSQKF---IFGCC 239

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           +   GV      G++G+   + SL SQ+         SYC  P   +      + FG   
Sbjct: 240 NLFRGVGGRYS-GILGMSVNKFSLFSQMTVGHRYRAMSYCF-PHPRNQGF---LQFGRYD 294

Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
           E       +   +   D   YFV +  + V  +S   +      SSG  +    F DTG 
Sbjct: 295 EHKSLLRFTPLYI---DGNNYFVHVSNVMVETMSLDVQ------SSGNQTM-RCFFDTGT 344

Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS--MAG--IAPILTAHFDGG 313
           P T+LP+  +  L + V N ++   Y+      Q C++     + G    P +   F  G
Sbjct: 345 PYTMLPQSLFVSLSDTVGNLVE-GYYRVGASTGQTCFQADGNWIEGDLYMPTVKIEFQNG 403

Query: 314 AKVPLIHTS-TFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           A++ L      F+  P   VFC A +  DG   + G+     +    D +   +  +   
Sbjct: 404 ARITLNSEDLMFMEEP--NVFCLAFKMNDGGDIVLGSRHLMGVHTVVDLEMMTMGLRGQG 461

Query: 373 C 373
           C
Sbjct: 462 C 462


>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 59/101 (58%), Gaps = 13/101 (12%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
           MK  IGTPP  +I  ++DTGS+L+W QCLPC+ CY Q  PI++P+ SS++KE  C     
Sbjct: 1   MKLQIGTPPF-EIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCN---- 55

Query: 86  HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
                   +    C Y   Y D S T+G LATE +T  +++
Sbjct: 56  --------TPDHSCXYKIVYDDKSYTQGTLATETVTIHSTS 88


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 147/363 (40%), Gaps = 29/363 (7%)

Query: 23  EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
            YV +  +GTP    +  I D  +D  WV C           P ++P  SS+Y+ + C +
Sbjct: 106 SYVARARLGTPAQALLVAI-DPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGA 162

Query: 83  EQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
            QC      SC       C +   YA S+  + +L  + +   +  +      FGC H  
Sbjct: 163 PQCSQAPAPSCPGGLGSSCAFNLSYAASTF-QALLGQDALALHDDVDAVAAYTFGCLHVV 221

Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
           TG  +    GLVG GR  LS  SQ     G + FSYCL P +  S+ +  +  G   +  
Sbjct: 222 TG-GSVPPQGLVGFGRGPLSFPSQTKDVYG-SVFSYCL-PSYKSSNFSGTLRLGPAGQPK 278

Query: 201 GGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI---SKGNMFIDTG 256
              + +T L+S   + + Y+V + GI VG      + +P   S+ A    S     +D G
Sbjct: 279 --RIKTTPLLSNPHRPSLYYVNMVGIRVGG-----RPVPVPASALAFDPTSGRGTIVDAG 331

Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
              T L    Y  + +  R+ ++  P   P  G   CY    +    P +T  FDG   V
Sbjct: 332 TMFTRLSAPVYAAVRDVFRSRVR-APVAGPLGGFDTCYN---VTISVPTVTFSFDGRVSV 387

Query: 317 PLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
            L   +  I     G+ C AM       +D  + +  +  Q +  + +D  +  V F   
Sbjct: 388 TLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRE 447

Query: 372 DCT 374
            CT
Sbjct: 448 LCT 450


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 100/389 (25%), Positives = 167/389 (42%), Gaps = 56/389 (14%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           T+NG Y  K  +G     D Y  VDTGSD +WV C+ C  C K+        +Y+P  S 
Sbjct: 71  TSNGLYYTKIGLGPK---DYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSK 127

Query: 74  SYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN 126
           + K + C  E C   +      C+    C Y+  Y D S T G    + +TF    G+  
Sbjct: 128 TSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLR 187

Query: 127 NFFDN--VVFGCGHNNTGVFNENE----MGLVGLGRTRLSLASQILSQLGANK-FSYCLV 179
              DN  V+FGCG   +G  +        G++G G+   S+ SQ+ +     + FS+CL 
Sbjct: 188 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL- 246

Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLI 237
                 SI+    F  G EV    V +T L+  +   +Y V L+ I V    +   S ++
Sbjct: 247 -----DSISGGGIFAIG-EVVQPKVKTTPLL--QGMAHYNVVLKDIEVAGDPIQLPSDIL 298

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCY 294
              +  G I      ID+G     LP   Y++L E++   R+ +KL   +D       C+
Sbjct: 299 DSSSGRGTI------IDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQF----TCF 348

Query: 295 ---KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVG 345
                 S+  + P +   F+ G  +   +   ++    E ++C   Q       DG ++ 
Sbjct: 349 HYSDEESVDDLFPTVKFTFEEGLTLT-TYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELI 407

Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + G+   ++  + YD D+  + +   +C+
Sbjct: 408 LLGDLVLANKLVVYDLDNMAIGWADYNCS 436


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 96/367 (26%), Positives = 163/367 (44%), Gaps = 36/367 (9%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G YV++  +GTPP L ++ ++DT +D +W+ C  C  C       +N  SSS+Y  +SC 
Sbjct: 103 GNYVVRARLGTPPQL-MFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCS 160

Query: 82  SEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           + QC     ++C S      +C++   Y   S     L  + +T   S +   N  FGC 
Sbjct: 161 TTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL--SPDVIPNFSFGCI 218

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
           ++ +G  +    GL+GLGR  +SL SQ  S L +  FSYCL  F +        YF    
Sbjct: 219 NSASG-NSLPPQGLMGLGRGPMSLVSQTTS-LYSGVFSYCLPSFRS-------FYFSGSL 269

Query: 198 EVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
           ++   G    +  T L+    + + Y+V L G+SVG++     + P Y +  + S     
Sbjct: 270 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV--QVPVDPVYLTFDSNSGAGTI 327

Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTAHFD 311
           ID+G   T   +  Y  + ++ R  +  +      LG+   C+   +   + P +T H  
Sbjct: 328 IDSGTVITRFAQPVYEAIRDEFRKQVNGS---FSTLGAFDTCFSADN-ENVTPKITLHMT 383

Query: 312 G-GAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
               K+P+   +T I      + C +M    Q  +  + +  N  Q +L I +D  +  +
Sbjct: 384 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441

Query: 367 SFKPTDC 373
              P  C
Sbjct: 442 GIAPEPC 448


>gi|326490700|dbj|BAJ90017.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326493830|dbj|BAJ85377.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 459

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 145/357 (40%), Gaps = 51/357 (14%)

Query: 37  DIYGIVDTGSDLMWV-QCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLL---DTVS 92
           D+ G+VD   D +W  QC+          P+           + C S+ C  L   DT  
Sbjct: 102 DVSGVVDVLDDFVWTTQCV--------AAPV----------RVQCASQTCRSLLANDTTD 143

Query: 93  C-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNEN 147
                 S    C+Y   YA  S T G LA E +  G+   F    + GC   N+      
Sbjct: 144 ACGGNPSGDDTCSYVNVYAPGSNTTGFLANETVAVGS---FVGAAILGCSAANSTGPLVG 200

Query: 148 EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT-SKMYFGNGS--EVSGGGV 204
           E+G  G  R  LSL    +SQL  +KFSY L P    SS + S +  G+ +  +  GGG 
Sbjct: 201 EVGSFGFNRGALSL----VSQLSVSKFSYYLAPDEAGSSDSESVVLLGDAAVPQTRGGGR 256

Query: 205 VSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK 264
            +  L S      Y+V L  I V   + S      ++ +   S G + + T  P T L +
Sbjct: 257 STPLLRSTAFPDVYYVKLSAIQVDGQALSGIPAGAFDLAADGSSGGVVMGTLYPITRLQE 316

Query: 265 DFYNRLEEQVRNAIKLTPYQDPRLGS---QLCYKTPSMAGIA-PILTAHFDGG---AKVP 317
           D YN + + + + I                LCY   S+A +  P +T  FDGG   A + 
Sbjct: 317 DAYNAVRQALVSKINAQEVNGSAFAGGVFDLCYDAQSVATLTFPKITLVFDGGNAPATLE 376

Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVG-----IFGNFAQSDLFIGYDFDSQMVSFK 369
           L     F    V G+ CF M P+   VG     + G+  Q+   + YD   + ++ +
Sbjct: 377 LTTVHYFFKDNVTGLQCFTMLPM--PVGTPFGSVLGSMVQAGTNMIYDVGGETLTLE 431


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 159/389 (40%), Gaps = 56/389 (14%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKEL 78
           N    +  ++GTPP  ++  ++DTGS+L W+ C P        +    + P +S ++  +
Sbjct: 63  NVSLTVSLAVGTPPQ-NVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 121

Query: 79  SCQSEQCHLLDTVS---CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            C S QC   D  S   C  + + C  +  YAD S + G LATE  T G          F
Sbjct: 122 PCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPL--RAAF 179

Query: 135 GCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           GC     +T        GL+G+ R  LS     +SQ    +FSYC+    +D      + 
Sbjct: 180 GCMATAFDTSPDGVATAGLLGMNRGALSF----VSQASTRRFSYCI----SDRDDAGVLL 231

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS---NSSKLIPYYNSS 243
            G+ S++    +  T L          D+  Y V L GI VG       +S L P  + +
Sbjct: 232 LGH-SDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP--DHT 288

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTP 297
           GA   G   +D+G   T L  D Y+ L+ +     K  L    DP    Q     C++ P
Sbjct: 289 GA---GQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVP 345

Query: 298 ---SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFA-----MQPIDGDV 344
              +     P +T  F+ GA++ +         P E     GV+C       M PI   V
Sbjct: 346 QGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYV 404

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              G+  Q ++++ YD +   V   P  C
Sbjct: 405 --IGHHHQMNVWVEYDLERGRVGLAPIRC 431


>gi|222632756|gb|EEE64888.1| hypothetical protein OsJ_19747 [Oryza sativa Japonica Group]
          Length = 384

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 80/276 (28%), Positives = 127/276 (46%), Gaps = 29/276 (10%)

Query: 42  VDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQSEQCHLL------DTV 91
           +DTGS L WVQC PC ++C+ Q   V PI++P++SS+++ + C +  C  L       + 
Sbjct: 1   MDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVGCSTSICSYLGRTLRIQSK 60

Query: 92  SCSS-QQLCNYTYGYADS-SLTKGVLATERITFGNSNNF-----FDNVVFGCGHNNTGVF 144
           +C   + +C YT  Y    + + G   T+R+  G            N VFGC   +T   
Sbjct: 61  ACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTTLSLANFVFGCSM-DTQYS 119

Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGV 204
              E G+ GLG +  S   QI   L    FSYCL      S    + Y   G + SGG  
Sbjct: 120 THKEAGIFGLGTSNYSF-EQIAPLLSYKAFSYCL-----PSDEAHQGYLSIGPDSSGG-- 171

Query: 205 VSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK 264
           V TS+     +  Y + + G++V  ++   + +   + S       M +D+GA  TLL  
Sbjct: 172 VPTSMFPGTPRPVYSIGMTGLTV-TVNGEVRSLVSGSGSSPSPSSLMVVDSGAKLTLLLA 230

Query: 265 DFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPS 298
             + +LE+ +  A++   Y        +QLC+ T S
Sbjct: 231 STFGQLEDAIIPAMESLGYSLNTAAGQNQLCFLTES 266


>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
          Length = 426

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 152/365 (41%), Gaps = 40/365 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
           G  V K S+G    +   G+VD  +D +W QC           P+     SS + E+ C 
Sbjct: 74  GLVVYKISVGVAEEV-FSGVVDVATDFIWAQC-----------PV-----SSDFTEVFCF 116

Query: 82  SEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
           S+ C L     D    S+   C Y Y Y     T G ++ E +T     +     +FGC 
Sbjct: 117 SQTCQLALDEEDACGNSTSFTCPYAYQYGPGISTTGYISAEEVT-AVGTHITGRALFGCS 175

Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT-SKMYFGNG 196
             +T V  + E G++G  R   SL    LSQL  ++FSY ++P   D   + S +  G+ 
Sbjct: 176 LAST-VPLDGESGVLGFSRGPYSL----LSQLKISRFSYFMLPDDADKPDSESVLLLGDD 230

Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
           +        ST L+  E     Y+V L GI V + S S      ++ +     G + + T
Sbjct: 231 AVPQTNSSRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGTFDLAANGCSGGVVMST 290

Query: 256 GAPPTLLPKDFYNRLEEQVRNAIK---LTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFD 311
            +P T L    YN L   + + IK   + P  D     +LCY   S+A +  P +T  F 
Sbjct: 291 LSPITYLQPAAYNALTRALASKIKSQPVRPKADDVADLRLCYNIQSVANLTFPKITLVFH 350

Query: 312 G--GAKVPLIHTST--FIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQ 364
           G  G   P+  T+   FI     G+ C  M P         + G+  Q+   + YD    
Sbjct: 351 GVDGRPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLGSLLQTGTHMIYDLRGG 410

Query: 365 MVSFK 369
            ++F+
Sbjct: 411 SLTFE 415


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 99/393 (25%), Positives = 160/393 (40%), Gaps = 72/393 (18%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP   D Y  VDTGSD++WV C  C +C  +        +Y+  +S++  
Sbjct: 153 GLYFAKIGIGTPSK-DYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 211

Query: 77  ELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD---- 130
            + C    C L D     C     C Y+  Y D S T G    + + +   +  F     
Sbjct: 212 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 271

Query: 131 --NVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD 184
              VVFGCG+  +G     +E   G++G G+   S+ SQ+ S     K FS+CL      
Sbjct: 272 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL------ 325

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSK 235
                         V GGG+ +   V +         +++ +Y V ++ I VG       
Sbjct: 326 ------------DNVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGG------ 367

Query: 236 LIPYYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ- 291
             P    S A   G+     ID+G      P++ Y  L E++     L+   D RL +  
Sbjct: 368 -DPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI-----LSQQPDLRLHTVE 421

Query: 292 ---LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG 342
               C+  T ++    P +T HFD    +  ++   ++    E  +C        Q  DG
Sbjct: 422 QAFTCFDYTGNVDDGFPTVTLHFDKSISLT-VYPHEYLFQVKEFEWCIGWQNSGAQTKDG 480

Query: 343 -DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            D+ + G+   S+  + YD + Q + +   +C+
Sbjct: 481 KDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 513


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 93/374 (24%), Positives = 160/374 (42%), Gaps = 47/374 (12%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI----------YNPASSS 73
           Y    S+GTPP   +  + DTGSDL W+ C     C + ++ I          Y P +S+
Sbjct: 102 YYANVSVGTPPSSFLVAL-DTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160

Query: 74  SYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD-- 130
           +   + C  ++C    +  CSS   +C Y   Y++S+ TKG L  + +     +      
Sbjct: 161 TSSSIRCSDKRC--FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPV 218

Query: 131 --NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQIL-SQLGANKFSYCLVPFHTDS 185
             NV  GCG   TG+F  N    G++GLG    S+ S +  + + AN FS C   F    
Sbjct: 219 KANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMC---FGRVI 275

Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
               ++ FG+           T  +S    T Y V + G+SV       +L   +++   
Sbjct: 276 GNVGRISFGDRGYTD---QEETPFISVAPSTAYGVNISGVSVAGDPVDIRLFAKFDT--- 329

Query: 246 ISKGNMFIDTGAPP-TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIA 303
              G+ F     P   +L K F   +E++ R         DP L  + CY  +P+   I 
Sbjct: 330 ---GSSFTHLREPAYGVLTKSFDELVEDRRRPV-------DPELPFEFCYDLSPNATTIQ 379

Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFA-MQPIDGDVGIFGNFAQSDLFIGY 359
            P++   F GG+K+ +++   F     EG  ++C   ++ +   + + G    +   I +
Sbjct: 380 FPLVEMTFIGGSKI-ILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVF 438

Query: 360 DFDSQMVSFKPTDC 373
           D +  ++ +K + C
Sbjct: 439 DRERMILGWKQSLC 452


>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
          Length = 357

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 55/380 (14%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
           M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + C 
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 82  SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
           S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +++
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
           FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +    
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G     +  G   T L    ++  Y +T E +    ++N  +L+         S   
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTTEML----IANGQRLV--------TSSSE 218

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
           M +D+GA  T L    +  L++ +  A+    Y      R  S +CY             
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
           TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN    
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
                +D   +   FK   C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357


>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
 gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
 gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
 gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
 gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 55/380 (14%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
           M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + C 
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 82  SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
           S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +++
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
           FGC  +    ++E E G+ G G +  S   Q+      L    FSYCL    TD +    
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G     +  G   T L    ++  Y +T E +    ++N  +L+         S   
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTTEML----IANGQRLV--------TSSSE 218

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
           M +D+GA  T L    +  L++ +  A+    Y      R  S +CY             
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
           TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN    
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
                +D   +   FK   C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 160/375 (42%), Gaps = 43/375 (11%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQV--KPIYNPASSSSYKE 77
            G Y  +  IGTP   +   IVDTGS + +V C  C  C + Q    P + P +SSSY+ 
Sbjct: 96  KGYYTSRVFIGTPAQ-EFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQT 154

Query: 78  LSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-VVFG 135
           +SC S  C    T  C ++   C Y   YA+ S +KGVL  + + FGN +    + ++FG
Sbjct: 155 VSCNSPDC---ITKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLLFG 211

Query: 136 CGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
           C    TG ++ ++  G++GLGR  LS+  Q++   GA + S+ L     D         G
Sbjct: 212 CETAETGDLYLQHADGIMGLGRGPLSIVDQLVGT-GAMEDSFSLCYGGMDE--------G 262

Query: 195 NGSEVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
            GS V G      ++V +K D     YY + L  I V  +S         N    +  G 
Sbjct: 263 GGSMVLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVS--------LNVPSEVFNGR 314

Query: 251 M--FIDTGAPPTLLPKDFYNRLEE---QVRNAIKLTPYQDPRLGSQLCY-----KTPSMA 300
           +   +D+G     LP   ++  ++   Q   +++  P  DP     +C+      + ++ 
Sbjct: 315 LGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSY-PDVCFAGAGSDSKALG 373

Query: 301 GIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
              P +   F G  KV L      F    V G +C           + G     +  + Y
Sbjct: 374 KHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTY 433

Query: 360 DFDSQMVSFKPTDCT 374
           D  +  + F  T+CT
Sbjct: 434 DRANHQIGFFKTNCT 448


>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 42/101 (41%), Positives = 59/101 (58%), Gaps = 13/101 (12%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
           MK  IGTPP  +I  ++DTGS+L+W QCLPC+ CY Q  PI++P+ SS++KE  C     
Sbjct: 1   MKLQIGTPPF-EIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCN---- 55

Query: 86  HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
                   +    C Y   Y D S T+G LATE +T  +++
Sbjct: 56  --------TPDHSCPYKIVYDDKSYTQGTLATETVTIHSTS 88


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 159/378 (42%), Gaps = 43/378 (11%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYKEL 78
           Y  +  +G+PP  + +  +DTGSD++WV C PC  C            +NP +SS+  ++
Sbjct: 117 YFTRVKLGSPPK-EYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 175

Query: 79  SCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN--N 127
            C  ++C      S      S    C YT+ Y D S T G   ++ + F    GN    N
Sbjct: 176 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 235

Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVPFH 182
              ++VFGC ++ +G   + +    G+ G G+ +LS+ SQ L+ LG +   FS+CL    
Sbjct: 236 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQ-LNSLGVSPKVFSHCLKGSD 294

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYY 240
               I   +  G   E+   G+V T LV  +   +Y + LE I V    L   S L    
Sbjct: 295 NGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIVVNGQKLPIDSSLFTTS 346

Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
           N+ G I      +D+G     L    Y+     +  A+  +       G+Q    + S+ 
Sbjct: 347 NTQGTI------VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVD 400

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSDLF 356
              P ++ +F GG  + +   +  +         ++C   Q   G  + I G+    D  
Sbjct: 401 SSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKI 460

Query: 357 IGYDFDSQMVSFKPTDCT 374
             YD  +  + +   DC+
Sbjct: 461 FVYDLANMRMGWTDYDCS 478


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 41/362 (11%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
           IGTPP  +   IVDTGS + +V C  C QC     P + P  S +Y  + C        D
Sbjct: 2   IGTPPQ-EFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNP------D 54

Query: 90  TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHNNTG-VFNEN 147
               +    C Y   YA+ S + G+L  + ++FGN +       VFGC +  TG +F+++
Sbjct: 55  CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFSQH 114

Query: 148 EMGLVGLGRTRLSLASQILSQLGAN-KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS 206
             G++GLGR  LS+  Q++ +   N  FS C             M  G G+ V G     
Sbjct: 115 ADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY----------GGMEVGGGAMVLGQISPP 164

Query: 207 TSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
           + +V   S  D++ YY + L G+ V       KL    N      K    +D+G     L
Sbjct: 165 SDMVFSHSDPDRSPYYNIELRGLHVAG----KKLD--INPQVFDGKHGTILDSGTTYAYL 218

Query: 263 PKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGA 314
           P+  +    + +    + +K     DP   + +C+     + P +    P +   FD G 
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNY-NDVCFSGAGSEIPELYKTFPSVDMVFDNGE 277

Query: 315 KVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           K  L      F    V G +C  + Q       + G     +  + YD +   V F  T+
Sbjct: 278 KYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTN 337

Query: 373 CT 374
           C+
Sbjct: 338 CS 339


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 41/362 (11%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
           IGTPP  +   IVDTGS + +V C  C QC     P + P  S +Y  + C        D
Sbjct: 2   IGTPPQ-EFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNP------D 54

Query: 90  TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHNNTG-VFNEN 147
               +    C Y   YA+ S + G+L  + ++FGN +       VFGC +  TG +F+++
Sbjct: 55  CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFSQH 114

Query: 148 EMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS 206
             G++GLGR  LS+  Q++ +   N  FS C             M  G G+ V G     
Sbjct: 115 ADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY----------GGMEVGGGAMVLGQISPP 164

Query: 207 TSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
           + +V   S  D++ YY + L G+ V       KL    N      K    +D+G     L
Sbjct: 165 SDMVFSHSDPDRSPYYNIELRGLHVAG----KKLD--INPQVFDGKHGTILDSGTTYAYL 218

Query: 263 PKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGA 314
           P+  +    + +    + +K     DP   + +C+     + P +    P +   FD G 
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNY-NDVCFSGAGSEIPELYKTFPSVDMVFDNGE 277

Query: 315 KVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
           K  L      F    V G +C  + Q       + G     +  + YD +   V F  T+
Sbjct: 278 KYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTN 337

Query: 373 CT 374
           C+
Sbjct: 338 CS 339


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 153/378 (40%), Gaps = 49/378 (12%)

Query: 22  GEYVMKFSIGTPPL---LDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYKE 77
           G Y +  +IG PP    LDI    DTGSDL WVQC  PC  C K +  +Y P ++     
Sbjct: 66  GHYSVILNIGNPPKAFDLDI----DTGSDLTWVQCDAPCKGCTKPLDKLYKPKNN----R 117

Query: 78  LSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATER--ITFGNSNNFFDNVVF 134
           + C S  C  +   +C    + C+Y   YAD   + GVL ++   +   N +     + F
Sbjct: 118 VPCASSLCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAF 177

Query: 135 GCGHNNTGVFNE---NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
           GCG++   +      +  G++GLGR + S+ SQ L  LG  +     V  H  S +T   
Sbjct: 178 GCGYDQKYLGPHSPPDTAGILGLGRGKASILSQ-LRTLGITQN----VVGHCFSRVTGGF 232

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            F     +   G+  T ++     T Y            S+    + +      I    +
Sbjct: 233 LFFGDHLLPPSGITWTPMLRSSSDTLY------------SSGPAELLFGGKPTGIKGLQL 280

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPS-MAGIAPI--- 305
             D+G+  T      Y  +   VR  +   P +D      L  C+KT   +  I  I   
Sbjct: 281 IFDSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSF 340

Query: 306 ---LTAHFDGGAKVPL-IHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFI 357
              LT +F     V L +    ++    +G  C  +    +   G++ + G+    D  +
Sbjct: 341 FKPLTINFIKAKNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLGNLNVIGDIFMQDRVV 400

Query: 358 GYDFDSQMVSFKPTDCTK 375
            YD + Q + + PT+C +
Sbjct: 401 VYDNERQQIGWFPTNCNR 418


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 99/393 (25%), Positives = 160/393 (40%), Gaps = 72/393 (18%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP   D Y  VDTGSD++WV C  C +C  +        +Y+  +S++  
Sbjct: 72  GLYFAKIGIGTPSK-DYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 130

Query: 77  ELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD---- 130
            + C    C L D     C     C Y+  Y D S T G    + + +   +  F     
Sbjct: 131 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 190

Query: 131 --NVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD 184
              VVFGCG+  +G     +E   G++G G+   S+ SQ+ S     K FS+CL      
Sbjct: 191 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL------ 244

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSK 235
                         V GGG+ +   V +         +++ +Y V ++ I VG       
Sbjct: 245 ------------DNVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGG------ 286

Query: 236 LIPYYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ- 291
             P    S A   G+     ID+G      P++ Y  L E++     L+   D RL +  
Sbjct: 287 -DPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI-----LSQQPDLRLHTVE 340

Query: 292 ---LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG 342
               C+  T ++    P +T HFD    +  ++   ++    E  +C        Q  DG
Sbjct: 341 QAFTCFDYTGNVDDGFPTVTLHFDKSISLT-VYPHEYLFQVKEFEWCIGWQNSGAQTKDG 399

Query: 343 -DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
            D+ + G+   S+  + YD + Q + +   +C+
Sbjct: 400 KDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 432


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L     SYCL    TD +  
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL---PTDETKP 284

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAVC 472


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)

Query: 24  YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
           ++M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + 
Sbjct: 116 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 174

Query: 80  CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
           C S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 231

Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
           ++FGC  +    ++E E G+ G G +  S   Q+      L     SYCL    TD +  
Sbjct: 232 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL---PTDETKP 286

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S 
Sbjct: 287 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 333

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
             M +D+GA  T L    +  L++ +  A+    Y      R  S +CY           
Sbjct: 334 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 393

Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
             TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN  
Sbjct: 394 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 452

Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
                  +D   +   FK   C
Sbjct: 453 TRSFGTTFDIQGKQFGFKYAVC 474


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 159/389 (40%), Gaps = 56/389 (14%)

Query: 21  NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKEL 78
           N    +  ++GTPP  ++  ++DTGS+L W+ C P        +    + P +S ++  +
Sbjct: 62  NVSLTVSLAVGTPPQ-NVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 120

Query: 79  SCQSEQCHLLDTVS---CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
            C S QC   D  S   C  + + C  +  YAD S + G LATE  T G          F
Sbjct: 121 PCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPL--RAAF 178

Query: 135 GCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
           GC     +T        GL+G+ R  LS     +SQ    +FSYC+    +D      + 
Sbjct: 179 GCMATAFDTSPDGVATAGLLGMNRGALSF----VSQASTRRFSYCI----SDRDDAGVLL 230

Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS---NSSKLIPYYNSS 243
            G+ S++    +  T L          D+  Y V L GI VG       +S L P  + +
Sbjct: 231 LGH-SDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP--DHT 287

Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTP 297
           GA   G   +D+G   T L  D Y+ L+ +     K  L    DP    Q     C++ P
Sbjct: 288 GA---GQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVP 344

Query: 298 ---SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFA-----MQPIDGDV 344
              +     P +T  F+ GA++ +         P E     GV+C       M PI   V
Sbjct: 345 QGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYV 403

Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
              G+  Q ++++ YD +   V   P  C
Sbjct: 404 --IGHHHQMNVWVEYDLERGRVGLAPIRC 430


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 159/367 (43%), Gaps = 48/367 (13%)

Query: 29  SIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKELS 79
           ++GTP    +  + DTGSDL W+ C  C  C +++K          IY+P +SS+  ++ 
Sbjct: 109 TVGTPSDWFLVAL-DTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVP 166

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVVF 134
           C S  C   D  + S +  C Y   Y ++ + + GVL  + +       +S      V  
Sbjct: 167 CNSTLCTRGDRCA-SPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTL 225

Query: 135 GCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKM 191
           GCG   TGVF++     GL GLG   +S+ S +  + + AN FS C   F  D +   ++
Sbjct: 226 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGNDGA--GRI 280

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGN 250
            FG+   V       T L  ++    Y +T+  ISV GN             +G +    
Sbjct: 281 SFGDKGSVDQR---ETPLNIRQPHPTYNITVTKISVEGN-------------TGDLEFDA 324

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYK-TPSMAGIA-PIL 306
           +F D+G   T L    Y  + E   +      YQ  D  L  + CY  +P+      P +
Sbjct: 325 VF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383

Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
                GG+  P+ H    IP     V+C A+  I+ D+ I G    +   + +D +  ++
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAILKIE-DISIIGQNFMTGYRVVFDREKLIL 442

Query: 367 SFKPTDC 373
            +K +DC
Sbjct: 443 GWKESDC 449


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 91/377 (24%), Positives = 159/377 (42%), Gaps = 38/377 (10%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  +GTPP  +    +DTGSD++WV C  C  C +  +       ++   SS+  
Sbjct: 76  GLYYTKVKMGTPPK-EFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAA 134

Query: 77  ELSCQSEQCHLL---DTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFG------NSN 126
            + C    C          CS +   C+YT+ Y D S T G   ++ + F        + 
Sbjct: 135 LIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAV 194

Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
           N    +VFGC  + +G   + +    G+ G G   LS+ SQ+ S+ +    FS+CL    
Sbjct: 195 NSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL---- 250

Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
                          E+    +V + LV  +   +Y + L+ I+V     + +L+P   +
Sbjct: 251 --KGDGDGGGVLVLGEILEPSIVYSPLVPSQ--PHYNLNLQSIAV-----NGQLLPINPA 301

Query: 243 SGAIS--KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
             +IS  +G   +D G     L ++ Y+ L   +  A+  +  Q    G+Q    + S+ 
Sbjct: 302 VFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQCYLVSTSIG 361

Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPP-VEG--VFCFAMQPIDGDVGIFGNFAQSDLFI 357
            I P ++ +F+GGA + L      +    ++G  ++C   Q       I G+    D  +
Sbjct: 362 DIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIV 421

Query: 358 GYDFDSQMVSFKPTDCT 374
            YD   Q + +   DC+
Sbjct: 422 VYDIAQQRIGWANYDCS 438


>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
          Length = 506

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/378 (26%), Positives = 163/378 (43%), Gaps = 60/378 (15%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKPI----YNPASSSSYKEL 78
           IGTP +  +  + DTGSDL+W+ C  CVQC       Y  +       YNP+SSS+ K  
Sbjct: 106 IGTPSVSFLVAL-DTGSDLLWIPC-NCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163

Query: 79  SCQSEQCHLLDTVS-CSS-QQLCNYTYGYADSSLTKGVLATERI---TFGNSNNFFD--- 130
            C  + C   D+ S C S ++ C YT  Y   + +   L  E I   T+  +N   +   
Sbjct: 164 LCSHKLC---DSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSS 220

Query: 131 ----NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFH 182
                VV GCG   +G + +     GL+GLG   +S+ S  LS+ G   N FS C     
Sbjct: 221 SVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPS-FLSKAGLMRNSFSLCF---- 275

Query: 183 TDSSITSKMYFGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
            D   + ++YFG+ G  +      ST  +  E+ + Y V +E   +GN            
Sbjct: 276 -DEEDSGRIYFGDMGPSIQQ----STPFLQLENNSGYIVGVEACCIGN------------ 318

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
           S    +    FID+G   T LP++ Y ++  ++   I  T      +  + CY++ S+  
Sbjct: 319 SCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKSFEGVSWEYCYES-SVEP 377

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGD-VGIFGNFAQSDLFIG 358
             P +   F       +IH   F+    +G+  FC  + P   + +G  G        + 
Sbjct: 378 KVPAIKLKFSHNNTF-VIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMV 436

Query: 359 YDFDSQMVSFKPTDCTKQ 376
           +D ++  + +  + C ++
Sbjct: 437 FDRENMKLRWSASKCQEE 454


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/366 (26%), Positives = 158/366 (43%), Gaps = 44/366 (12%)

Query: 29  SIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKELS 79
           ++GTP    +  + DTGSDL W+ C     C +++K          IY+P +SS+  ++ 
Sbjct: 109 TVGTPSDWFLVAL-DTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVP 167

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVVF 134
           C S  C  +D  + S    C Y   Y ++ + + GVL  + +       NS      +  
Sbjct: 168 CNSTLCTRVDRCA-SPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITL 226

Query: 135 GCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKM 191
           GCG   TGVF++     GL GLG   +S+ S +  + + AN FS C   F  D +   ++
Sbjct: 227 GCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGDDGA--GRI 281

Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
            FG+   V       T L  ++    Y VT+  ISVG             ++G +    +
Sbjct: 282 SFGDKGSVDQR---ETPLNIRQPHPTYNVTVTQISVG------------GNTGDLEFDAV 326

Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYK-TPSMAGIA-PILTA 308
           F DTG   T L    Y  + E   +      YQ D  L  + CY  +P+      P +  
Sbjct: 327 F-DTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAVSPNKKSFEYPDVNL 385

Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
              GG+  P+ H    +P     V+C A+   + D+ I G    +   + +D +  ++ +
Sbjct: 386 TMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSE-DISIIGQNFMTGYRVVFDREKLILGW 444

Query: 369 KPTDCT 374
           K +DC+
Sbjct: 445 KESDCS 450


>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 395

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 161/391 (41%), Gaps = 50/391 (12%)

Query: 14  QSNVSTANGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNP 69
           +S       +Y    +IG PP    LDI    DTGSD  W+ C  PC  C K   P+Y P
Sbjct: 6   KSTAVVPERQYYTSINIGNPPRPYFLDI----DTGSDFTWIHCDAPCTNCTKGPHPVYKP 61

Query: 70  ASSSSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
                 K +  +   C  L  +   C + + C+Y   YAD S +KGVLA + +    ++ 
Sbjct: 62  TEG---KIVHPRDPLCEELQGNQNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTADG 118

Query: 128 FFDNV--VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQIL-SQLGANKFSYCLVPF 181
              NV  VFGC HN  G   ++     G++GL    +SL++Q+  S + +N F +C+   
Sbjct: 119 EMKNVDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMA-- 176

Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
            TD S    M+ G+   V   G+    + +     Y         V  ++  ++ +    
Sbjct: 177 -TDPSSGGYMFLGD-DYVPRWGMTWVPIRNGPGNVY------STEVPKVNYGAQELNLRG 228

Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK----TP 297
            +G +++  +  D+G+  T  P + Y  L   + +A       +       C K      
Sbjct: 229 QAGKLTQ--VIFDSGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPFCMKPNVPVR 286

Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPIDG-DVG---- 345
           S+  +  +         K   +  +TF   P        +G  C  +  +DG ++G    
Sbjct: 287 SVGDVEQLFNPLILQLRKRWFVIPTTFAISPENYLIISDKGNVCLGV--LDGTEIGHSST 344

Query: 346 -IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
            I G+ +    F+ YD D   + +  +DCT+
Sbjct: 345 IIIGDASLRGKFVVYDNDENRIGWVQSDCTR 375


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 152/385 (39%), Gaps = 52/385 (13%)

Query: 21  NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
           +G+Y     IG PP    LD    VDTGSDL W+QC  PC    K   P+Y PA      
Sbjct: 184 DGQYYTSIFIGNPPRPYFLD----VDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIVP 239

Query: 76  -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
            ++L CQ  Q    +   C + + C+Y   YAD S + GVLA + +    +N   + +  
Sbjct: 240 PRDLLCQELQG---NQNYCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDF 296

Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
           VFGC ++  G    +     G++GL    +S  SQ+ S  + AN F +C+     +    
Sbjct: 297 VFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCIT---REQGGG 353

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
             M+ G+   V   GV  TS+ S  D  Y+      +  G+               A S 
Sbjct: 354 GYMFLGD-DYVPRWGVTWTSIRSGPDNLYH-TQAHHVKYGDQQ-------LRRPEQAGST 404

Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
             +  D+G+  T LP + Y  L   ++ A              LC+K          +  
Sbjct: 405 VQVIFDSGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWKADFPVRYLEDVKQ 464

Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAM----QPIDGDVGIFGNF 350
               L  HF    K  L  + TF   P +       G  C  +    +   G   I G+ 
Sbjct: 465 FFEPLNLHF---GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521

Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
           +     + YD   + + +  +DCTK
Sbjct: 522 SLRGKLVVYDNQRKQIGWADSDCTK 546


>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
          Length = 274

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/274 (28%), Positives = 110/274 (40%), Gaps = 64/274 (23%)

Query: 114 VLATERITFGNSNNF----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
           +LAT+  TFG  +N        V FGCGH N G+F  NE G+ G GR R SL     SQL
Sbjct: 48  ILATDSFTFGGDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLP----SQL 103

Query: 170 GANKFSYCLVP-FHTDSS------ITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVT 221
               FSYC    F T SS        +            G V +T L+    + + YFV 
Sbjct: 104 NVTSFSYCFTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVP 163

Query: 222 LEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT 281
           L GISVG    +   +P         + +  ID+GA  T LP+D Y  ++ +  +     
Sbjct: 164 LRGISVG---GARVAVPESR-----LRSSTIIDSGASITTLPEDVYEAVKAEFVS----- 210

Query: 282 PYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID 341
             Q PR      ++               D  A+                V C  +    
Sbjct: 211 --QLPR--GNYVFE---------------DYAAR----------------VLCVVLDAAA 235

Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
           G+  + GN+ Q +  + YD ++ ++SF P  C K
Sbjct: 236 GEQVVIGNYQQQNTHVVYDLENDVLSFAPARCDK 269


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 156/391 (39%), Gaps = 69/391 (17%)

Query: 22  GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
           G Y  K  IGTP   D Y  VDTGSD++WV C  C +C  +        +Y+  +S++  
Sbjct: 153 GLYFAKIGIGTPSK-DYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 211

Query: 77  ELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD---- 130
            + C    C L D     C     C Y+  Y D S T G    + + +   +  F     
Sbjct: 212 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 271

Query: 131 --NVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD 184
              VVFGCG+  +G     +E   G++G G+   S+ SQ+ S     K FS+CL      
Sbjct: 272 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL------ 325

Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSK 235
                         V GGG+ +   V +         +++ +Y V ++ I VG       
Sbjct: 326 ------------DNVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGG------ 367

Query: 236 LIPYYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ- 291
             P    S A   G+     ID+G      P++ Y  L E++     L+   D RL +  
Sbjct: 368 -DPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI-----LSQQPDLRLHTVE 421

Query: 292 ---LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF---AMQPIDG-D 343
               C+  T ++    P +T HFD    + +           E    +     Q  DG D
Sbjct: 422 QAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKD 481

Query: 344 VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           + + G+   S+  + YD + Q + +   +C+
Sbjct: 482 LTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 512


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 172/377 (45%), Gaps = 65/377 (17%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKP---IYNPASSSSYKELS 79
           IGTP +  +  + D GSDL+WV C  C+QC       Y ++      Y+P+ SS+ K LS
Sbjct: 99  IGTPNVSFLVAL-DAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLS 156

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYG-YADSSLTKGVLATERITF------GNSNNFFDNV 132
           C  + C L      SS+  C Y    Y++++ + G+L  +R+         + ++ + +V
Sbjct: 157 CNDQLCELGSDCK-SSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASV 215

Query: 133 VFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSIT 188
           + GCG   +G F++     GL+GLG   LS+ S +L++ G   N FS C      D + +
Sbjct: 216 IIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS-LLAKAGLVRNTFSICF-----DDNHS 269

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
             + FG+   V+     STS V  E K   Y + +EG  VG            +SS   +
Sbjct: 270 GTILFGDQGLVTQK---STSFVPLEGKFVTYLIEVEGYLVG------------SSSLKTA 314

Query: 248 KGNMFIDTGAPPTLLPKDFYNRL----EEQV---RNAIKLTPYQDPRLGSQLCYKTPSMA 300
                +D+G   T LP + Y ++    ++QV   R++ K +P+       + CY + S  
Sbjct: 315 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPW-------KYCYNSSSQE 367

Query: 301 GI-APILTAHFDGGAKVPLIHTST--FIPPPVE-GVFCFAMQPIDGDVGIFGNFAQSDLF 356
            +  P +T  F       ++H      I    E  VFC  +QPI  + GI G        
Sbjct: 368 LLNIPTVTLVFAMNQSF-IVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYR 426

Query: 357 IGYDFDSQMVSFKPTDC 373
           + +D ++  + +  ++C
Sbjct: 427 MVFDRENLKLGWSTSNC 443


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 171/387 (44%), Gaps = 54/387 (13%)

Query: 19  TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
           TA G Y  +  IG+P     Y  VDTGSD++WV C+ C  C            Y+PA S 
Sbjct: 80  TATGLYYTQIEIGSPSK-GYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSG 138

Query: 74  SYKELSCQSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
           +   + C  E C     + L     S+   C +   Y D S T G   ++ + +    GN
Sbjct: 139 T--TVGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGN 196

Query: 125 SNNFFDN--VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCL 178
                 N  + FGCG    G    +     G++G G+   S+ SQ+ +     K F++CL
Sbjct: 197 GQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256

Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
              H          F  G+ V    V +T LV  ++ T+Y V L+GISVG    ++  +P
Sbjct: 257 DTVHGGG------IFAIGNVVQ-PKVKTTPLV--QNVTHYNVNLQGISVG---GATLQLP 304

Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYK 295
                   SKG + ID+G     LP++ Y  L   V +    + L  YQD      +C++
Sbjct: 305 SSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQD-----FVCFQ 358

Query: 296 -TPSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
            + S+    P++T  F+G  ++ L ++   ++      ++C       +Q  DG D+ + 
Sbjct: 359 FSGSIDDGFPVVTFSFEG--EITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLL 416

Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
           G+   S+  + YD + Q++ +   +C+
Sbjct: 417 GDLVLSNKLVVYDLEKQVIGWADYNCS 443


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 172/377 (45%), Gaps = 65/377 (17%)

Query: 30  IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKP---IYNPASSSSYKELS 79
           IGTP +  +  + D GSDL+WV C  C+QC       Y ++      Y+P+ SS+ K LS
Sbjct: 109 IGTPNVSFLVAL-DAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLS 166

Query: 80  CQSEQCHLLDTVSCSSQQLCNYTYG-YADSSLTKGVLATERITF------GNSNNFFDNV 132
           C  + C L      SS+  C Y    Y++++ + G+L  +R+         + ++ + +V
Sbjct: 167 CNDQLCELGSDCK-SSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASV 225

Query: 133 VFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSIT 188
           + GCG   +G F++     GL+GLG   LS+ S +L++ G   N FS C      D + +
Sbjct: 226 IIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS-LLAKAGLVRNTFSICF-----DDNHS 279

Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
             + FG+   V+     STS V  E K   Y + +EG  VG            +SS   +
Sbjct: 280 GTILFGDQGLVTQK---STSFVPLEGKFVTYLIEVEGYLVG------------SSSLKTA 324

Query: 248 KGNMFIDTGAPPTLLPKDFYNRL----EEQV---RNAIKLTPYQDPRLGSQLCYKTPSMA 300
                +D+G   T LP + Y ++    ++QV   R++ K +P+       + CY + S  
Sbjct: 325 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPW-------KYCYNSSSQE 377

Query: 301 GI-APILTAHFDGGAKVPLIHTST--FIPPPVE-GVFCFAMQPIDGDVGIFGNFAQSDLF 356
            +  P +T  F       ++H      I    E  VFC  +QPI  + GI G        
Sbjct: 378 LLNIPTVTLVFAMNQSF-IVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYR 436

Query: 357 IGYDFDSQMVSFKPTDC 373
           + +D ++  + +  ++C
Sbjct: 437 MVFDRENLKLGWSTSNC 453


>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 158/380 (41%), Gaps = 55/380 (14%)

Query: 26  MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
           M  S+G PP++++  I DTGS L WVQC PC V C+ Q     PI++P  S + + + C 
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 82  SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
           S +C        L   +C  ++  C Y+  Y +  + + G + T+ +  G+S   F +++
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116

Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
           FGC  +    ++E E G+ G G +  S   Q+      L     SYCL    TD +    
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL---PTDETKPGY 171

Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
           M  G     +  G   T L    ++  Y +T+E +    ++N  +L+         S   
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218

Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
           M +D+GA  T L    +  L++ +  A+    Y      R  S +CY             
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278

Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
           TP S     P+L   F GGA + L   + F   P  G+   FA  P      I GN    
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337

Query: 354 DLFIGYDFDSQMVSFKPTDC 373
                +D   +   FK   C
Sbjct: 338 SFGTTFDIQGKQFGFKYAVC 357


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 131/302 (43%), Gaps = 46/302 (15%)

Query: 13  VQSNVSTANGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYN 68
           +Q NV    G Y +  +IG P     LD    VDTGSDL W+QC  PC  C K   P+Y 
Sbjct: 44  LQGNV-YPTGHYYVTMNIGNPAKPYFLD----VDTGSDLTWLQCDAPCRSCNKVPHPLYR 98

Query: 69  PASSSSYKELSCQSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
           P ++S    + C +  C  L +       C S + C+Y   Y DS+ ++GVL  +  +  
Sbjct: 99  PTANS---LVPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLP 155

Query: 124 -NSNNFFDNVVFGCGHN----NTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYC 177
             S+N    + FGCG++      G       G++GLGR  +SL SQ+  Q +  N   +C
Sbjct: 156 MRSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHC 215

Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
           L    T+         G G    G  +V TS V+       +V +  IS    S  S  +
Sbjct: 216 L---STN---------GGGFLFFGDDIVPTSRVT-------WVPMAKISGNYYSPGSGTL 256

Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDPRLGSQLCYK 295
            +   S  +    +  D+G+  T      Y  +   +++ +   L    DP L   LC+K
Sbjct: 257 YFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSL--PLCWK 314

Query: 296 TP 297
            P
Sbjct: 315 GP 316


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 86/173 (49%), Gaps = 23/173 (13%)

Query: 22  GEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYKE 77
           G Y +  +IG P     LD    VDTGSDL W+QC  PC  C K   P+Y P  +   K 
Sbjct: 55  GHYYVTMNIGDPAKPYFLD----VDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KL 107

Query: 78  LSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERIT--FGNSNNFFD 130
           + C +  C  L + S     C++QQ C+Y   Y D + + GVL T+  +    N +N   
Sbjct: 108 VPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSNVRP 167

Query: 131 NVVFGCGHN----NTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCL 178
           ++ FGCG++      G       GL+GLGR  +SL SQ+  Q +  N   +CL
Sbjct: 168 SLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL 220


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/273 (30%), Positives = 130/273 (47%), Gaps = 33/273 (12%)

Query: 117 TERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY 176
           TE  TFG+    F  + FGC   + G F     GLVGLGR +LSL    ++QL    F Y
Sbjct: 2   TETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEAFGY 56

Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISVGN- 229
            L    +D S  S + FG+ ++V+GG     +ST L++    +D  +Y+V L GISVG  
Sbjct: 57  RL---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGK 113

Query: 230 -LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
            +   S    +  S+GA   G +  D+G   T+LP   Y  + +++ + +    +Q P  
Sbjct: 114 LVQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQKPPP 167

Query: 289 GSQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAMQPI 340
            +     +C+   S     P +  HFDGGA + L  T  ++P       E   C+++   
Sbjct: 168 AANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSVVKS 226

Query: 341 DGDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
              + I GN  Q D  + +D   +++M+   PT
Sbjct: 227 SQALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 259


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.135    0.409 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,284,721,262
Number of Sequences: 23463169
Number of extensions: 273047639
Number of successful extensions: 555895
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 666
Number of HSP's successfully gapped in prelim test: 1705
Number of HSP's that attempted gapping in prelim test: 548786
Number of HSP's gapped (non-prelim): 2642
length of query: 376
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 232
effective length of database: 8,980,499,031
effective search space: 2083475775192
effective search space used: 2083475775192
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)